Sample records for final multivariate model

  1. Critical elements on fitting the Bayesian multivariate Poisson Lognormal model

    NASA Astrophysics Data System (ADS)

    Zamzuri, Zamira Hasanah binti

    2015-10-01

    Motivated by a problem on fitting multivariate models to traffic accident data, a detailed discussion of the Multivariate Poisson Lognormal (MPL) model is presented. This paper reveals three critical elements on fitting the MPL model: the setting of initial estimates, hyperparameters and tuning parameters. These issues have not been highlighted in the literature. Based on simulation studies conducted, we have shown that to use the Univariate Poisson Model (UPM) estimates as starting values, at least 20,000 iterations are needed to obtain reliable final estimates. We also illustrated the sensitivity of the specific hyperparameter, which if it is not given extra attention, may affect the final estimates. The last issue is regarding the tuning parameters where they depend on the acceptance rate. Finally, a heuristic algorithm to fit the MPL model is presented. This acts as a guide to ensure that the model works satisfactorily given any data set.

  2. Multivariate Boosting for Integrative Analysis of High-Dimensional Cancer Genomic Data

    PubMed Central

    Xiong, Lie; Kuan, Pei-Fen; Tian, Jianan; Keles, Sunduz; Wang, Sijian

    2015-01-01

    In this paper, we propose a novel multivariate component-wise boosting method for fitting multivariate response regression models under the high-dimension, low sample size setting. Our method is motivated by modeling the association among different biological molecules based on multiple types of high-dimensional genomic data. Particularly, we are interested in two applications: studying the influence of DNA copy number alterations on RNA transcript levels and investigating the association between DNA methylation and gene expression. For this purpose, we model the dependence of the RNA expression levels on DNA copy number alterations and the dependence of gene expression on DNA methylation through multivariate regression models and utilize boosting-type method to handle the high dimensionality as well as model the possible nonlinear associations. The performance of the proposed method is demonstrated through simulation studies. Finally, our multivariate boosting method is applied to two breast cancer studies. PMID:26609213

  3. A simple prognostic model for overall survival in metastatic renal cell carcinoma.

    PubMed

    Assi, Hazem I; Patenaude, Francois; Toumishey, Ethan; Ross, Laura; Abdelsalam, Mahmoud; Reiman, Tony

    2016-01-01

    The primary purpose of this study was to develop a simpler prognostic model to predict overall survival for patients treated for metastatic renal cell carcinoma (mRCC) by examining variables shown in the literature to be associated with survival. We conducted a retrospective analysis of patients treated for mRCC at two Canadian centres. All patients who started first-line treatment were included in the analysis. A multivariate Cox proportional hazards regression model was constructed using a stepwise procedure. Patients were assigned to risk groups depending on how many of the three risk factors from the final multivariate model they had. There were three risk factors in the final multivariate model: hemoglobin, prior nephrectomy, and time from diagnosis to treatment. Patients in the high-risk group (two or three risk factors) had a median survival of 5.9 months, while those in the intermediate-risk group (one risk factor) had a median survival of 16.2 months, and those in the low-risk group (no risk factors) had a median survival of 50.6 months. In multivariate analysis, shorter survival times were associated with hemoglobin below the lower limit of normal, absence of prior nephrectomy, and initiation of treatment within one year of diagnosis.

  4. A simple prognostic model for overall survival in metastatic renal cell carcinoma

    PubMed Central

    Assi, Hazem I.; Patenaude, Francois; Toumishey, Ethan; Ross, Laura; Abdelsalam, Mahmoud; Reiman, Tony

    2016-01-01

    Introduction: The primary purpose of this study was to develop a simpler prognostic model to predict overall survival for patients treated for metastatic renal cell carcinoma (mRCC) by examining variables shown in the literature to be associated with survival. Methods: We conducted a retrospective analysis of patients treated for mRCC at two Canadian centres. All patients who started first-line treatment were included in the analysis. A multivariate Cox proportional hazards regression model was constructed using a stepwise procedure. Patients were assigned to risk groups depending on how many of the three risk factors from the final multivariate model they had. Results: There were three risk factors in the final multivariate model: hemoglobin, prior nephrectomy, and time from diagnosis to treatment. Patients in the high-risk group (two or three risk factors) had a median survival of 5.9 months, while those in the intermediate-risk group (one risk factor) had a median survival of 16.2 months, and those in the low-risk group (no risk factors) had a median survival of 50.6 months. Conclusions: In multivariate analysis, shorter survival times were associated with hemoglobin below the lower limit of normal, absence of prior nephrectomy, and initiation of treatment within one year of diagnosis. PMID:27217858

  5. A Review of Multivariate Distributions for Count Data Derived from the Poisson Distribution.

    PubMed

    Inouye, David; Yang, Eunho; Allen, Genevera; Ravikumar, Pradeep

    2017-01-01

    The Poisson distribution has been widely studied and used for modeling univariate count-valued data. Multivariate generalizations of the Poisson distribution that permit dependencies, however, have been far less popular. Yet, real-world high-dimensional count-valued data found in word counts, genomics, and crime statistics, for example, exhibit rich dependencies, and motivate the need for multivariate distributions that can appropriately model this data. We review multivariate distributions derived from the univariate Poisson, categorizing these models into three main classes: 1) where the marginal distributions are Poisson, 2) where the joint distribution is a mixture of independent multivariate Poisson distributions, and 3) where the node-conditional distributions are derived from the Poisson. We discuss the development of multiple instances of these classes and compare the models in terms of interpretability and theory. Then, we empirically compare multiple models from each class on three real-world datasets that have varying data characteristics from different domains, namely traffic accident data, biological next generation sequencing data, and text data. These empirical experiments develop intuition about the comparative advantages and disadvantages of each class of multivariate distribution that was derived from the Poisson. Finally, we suggest new research directions as explored in the subsequent discussion section.

  6. Classical least squares multivariate spectral analysis

    DOEpatents

    Haaland, David M.

    2002-01-01

    An improved classical least squares multivariate spectral analysis method that adds spectral shapes describing non-calibrated components and system effects (other than baseline corrections) present in the analyzed mixture to the prediction phase of the method. These improvements decrease or eliminate many of the restrictions to the CLS-type methods and greatly extend their capabilities, accuracy, and precision. One new application of PACLS includes the ability to accurately predict unknown sample concentrations when new unmodeled spectral components are present in the unknown samples. Other applications of PACLS include the incorporation of spectrometer drift into the quantitative multivariate model and the maintenance of a calibration on a drifting spectrometer. Finally, the ability of PACLS to transfer a multivariate model between spectrometers is demonstrated.

  7. Data driven discrete-time parsimonious identification of a nonlinear state-space model for a weakly nonlinear system with short data record

    NASA Astrophysics Data System (ADS)

    Relan, Rishi; Tiels, Koen; Marconato, Anna; Dreesen, Philippe; Schoukens, Johan

    2018-05-01

    Many real world systems exhibit a quasi linear or weakly nonlinear behavior during normal operation, and a hard saturation effect for high peaks of the input signal. In this paper, a methodology to identify a parsimonious discrete-time nonlinear state space model (NLSS) for the nonlinear dynamical system with relatively short data record is proposed. The capability of the NLSS model structure is demonstrated by introducing two different initialisation schemes, one of them using multivariate polynomials. In addition, a method using first-order information of the multivariate polynomials and tensor decomposition is employed to obtain the parsimonious decoupled representation of the set of multivariate real polynomials estimated during the identification of NLSS model. Finally, the experimental verification of the model structure is done on the cascaded water-benchmark identification problem.

  8. A Review of Multivariate Distributions for Count Data Derived from the Poisson Distribution

    PubMed Central

    Inouye, David; Yang, Eunho; Allen, Genevera; Ravikumar, Pradeep

    2017-01-01

    The Poisson distribution has been widely studied and used for modeling univariate count-valued data. Multivariate generalizations of the Poisson distribution that permit dependencies, however, have been far less popular. Yet, real-world high-dimensional count-valued data found in word counts, genomics, and crime statistics, for example, exhibit rich dependencies, and motivate the need for multivariate distributions that can appropriately model this data. We review multivariate distributions derived from the univariate Poisson, categorizing these models into three main classes: 1) where the marginal distributions are Poisson, 2) where the joint distribution is a mixture of independent multivariate Poisson distributions, and 3) where the node-conditional distributions are derived from the Poisson. We discuss the development of multiple instances of these classes and compare the models in terms of interpretability and theory. Then, we empirically compare multiple models from each class on three real-world datasets that have varying data characteristics from different domains, namely traffic accident data, biological next generation sequencing data, and text data. These empirical experiments develop intuition about the comparative advantages and disadvantages of each class of multivariate distribution that was derived from the Poisson. Finally, we suggest new research directions as explored in the subsequent discussion section. PMID:28983398

  9. Local polynomial estimation of heteroscedasticity in a multivariate linear regression model and its applications in economics.

    PubMed

    Su, Liyun; Zhao, Yanyong; Yan, Tianshun; Li, Fenglan

    2012-01-01

    Multivariate local polynomial fitting is applied to the multivariate linear heteroscedastic regression model. Firstly, the local polynomial fitting is applied to estimate heteroscedastic function, then the coefficients of regression model are obtained by using generalized least squares method. One noteworthy feature of our approach is that we avoid the testing for heteroscedasticity by improving the traditional two-stage method. Due to non-parametric technique of local polynomial estimation, it is unnecessary to know the form of heteroscedastic function. Therefore, we can improve the estimation precision, when the heteroscedastic function is unknown. Furthermore, we verify that the regression coefficients is asymptotic normal based on numerical simulations and normal Q-Q plots of residuals. Finally, the simulation results and the local polynomial estimation of real data indicate that our approach is surely effective in finite-sample situations.

  10. Predictive model for falling in Parkinson disease patients.

    PubMed

    Custodio, Nilton; Lira, David; Herrera-Perez, Eder; Montesinos, Rosa; Castro-Suarez, Sheila; Cuenca-Alfaro, Jose; Cortijo, Patricia

    2016-12-01

    Falls are a common complication of advancing Parkinson's disease (PD). Although numerous risk factors are known, reliable predictors of future falls are still lacking. The aim of this study was to develop a multivariate model to predict falling in PD patients. Prospective cohort with forty-nine PD patients. The area under the receiver-operating characteristic curve (AUC) was calculated to evaluate predictive performance of the purposed multivariate model. The median of PD duration and UPDRS-III score in the cohort was 6 years and 24 points, respectively. Falls occurred in 18 PD patients (30%). Predictive factors for falling identified by univariate analysis were age, PD duration, physical activity, and scores of UPDRS motor, FOG, ACE, IFS, PFAQ and GDS ( p -value < 0.001), as well as fear of falling score ( p -value = 0.04). The final multivariate model (PD duration, FOG, ACE, and physical activity) showed an AUC = 0.9282 (correctly classified = 89.83%; sensitivity = 92.68%; specificity = 83.33%). This study showed that our multivariate model have a high performance to predict falling in a sample of PD patients.

  11. Multivariate Bayesian analysis of Gaussian, right censored Gaussian, ordered categorical and binary traits using Gibbs sampling

    PubMed Central

    Korsgaard, Inge Riis; Lund, Mogens Sandø; Sorensen, Daniel; Gianola, Daniel; Madsen, Per; Jensen, Just

    2003-01-01

    A fully Bayesian analysis using Gibbs sampling and data augmentation in a multivariate model of Gaussian, right censored, and grouped Gaussian traits is described. The grouped Gaussian traits are either ordered categorical traits (with more than two categories) or binary traits, where the grouping is determined via thresholds on the underlying Gaussian scale, the liability scale. Allowances are made for unequal models, unknown covariance matrices and missing data. Having outlined the theory, strategies for implementation are reviewed. These include joint sampling of location parameters; efficient sampling from the fully conditional posterior distribution of augmented data, a multivariate truncated normal distribution; and sampling from the conditional inverse Wishart distribution, the fully conditional posterior distribution of the residual covariance matrix. Finally, a simulated dataset was analysed to illustrate the methodology. This paper concentrates on a model where residuals associated with liabilities of the binary traits are assumed to be independent. A Bayesian analysis using Gibbs sampling is outlined for the model where this assumption is relaxed. PMID:12633531

  12. Square Root Graphical Models: Multivariate Generalizations of Univariate Exponential Families that Permit Positive Dependencies

    PubMed Central

    Inouye, David I.; Ravikumar, Pradeep; Dhillon, Inderjit S.

    2016-01-01

    We develop Square Root Graphical Models (SQR), a novel class of parametric graphical models that provides multivariate generalizations of univariate exponential family distributions. Previous multivariate graphical models (Yang et al., 2015) did not allow positive dependencies for the exponential and Poisson generalizations. However, in many real-world datasets, variables clearly have positive dependencies. For example, the airport delay time in New York—modeled as an exponential distribution—is positively related to the delay time in Boston. With this motivation, we give an example of our model class derived from the univariate exponential distribution that allows for almost arbitrary positive and negative dependencies with only a mild condition on the parameter matrix—a condition akin to the positive definiteness of the Gaussian covariance matrix. Our Poisson generalization allows for both positive and negative dependencies without any constraints on the parameter values. We also develop parameter estimation methods using node-wise regressions with ℓ1 regularization and likelihood approximation methods using sampling. Finally, we demonstrate our exponential generalization on a synthetic dataset and a real-world dataset of airport delay times. PMID:27563373

  13. A land use regression model for ambient ultrafine particles in Montreal, Canada: A comparison of linear regression and a machine learning approach.

    PubMed

    Weichenthal, Scott; Ryswyk, Keith Van; Goldstein, Alon; Bagg, Scott; Shekkarizfard, Maryam; Hatzopoulou, Marianne

    2016-04-01

    Existing evidence suggests that ambient ultrafine particles (UFPs) (<0.1µm) may contribute to acute cardiorespiratory morbidity. However, few studies have examined the long-term health effects of these pollutants owing in part to a need for exposure surfaces that can be applied in large population-based studies. To address this need, we developed a land use regression model for UFPs in Montreal, Canada using mobile monitoring data collected from 414 road segments during the summer and winter months between 2011 and 2012. Two different approaches were examined for model development including standard multivariable linear regression and a machine learning approach (kernel-based regularized least squares (KRLS)) that learns the functional form of covariate impacts on ambient UFP concentrations from the data. The final models included parameters for population density, ambient temperature and wind speed, land use parameters (park space and open space), length of local roads and rail, and estimated annual average NOx emissions from traffic. The final multivariable linear regression model explained 62% of the spatial variation in ambient UFP concentrations whereas the KRLS model explained 79% of the variance. The KRLS model performed slightly better than the linear regression model when evaluated using an external dataset (R(2)=0.58 vs. 0.55) or a cross-validation procedure (R(2)=0.67 vs. 0.60). In general, our findings suggest that the KRLS approach may offer modest improvements in predictive performance compared to standard multivariable linear regression models used to estimate spatial variations in ambient UFPs. However, differences in predictive performance were not statistically significant when evaluated using the cross-validation procedure. Crown Copyright © 2015. Published by Elsevier Inc. All rights reserved.

  14. MULTIVARIATE RECEPTOR MODELING BY N-DIMENSIONAL EDGE DETECTION. (R826238)

    EPA Science Inventory

    The perspectives, information and conclusions conveyed in research project abstracts, progress reports, final reports, journal abstracts and journal publications convey the viewpoints of the principal investigator and may not represent the views and policies of ORD and EPA. Concl...

  15. PSEUDOLIKELIHOOD MODELING OF MULTIVARIATE OUTCOMES IN DEVELOPMENTAL TOXICOLOGY. (R824757)

    EPA Science Inventory

    The perspectives, information and conclusions conveyed in research project abstracts, progress reports, final reports, journal abstracts and journal publications convey the viewpoints of the principal investigator and may not represent the views and policies of ORD and EPA. Concl...

  16. The Recoverability of P-Technique Factor Analysis

    ERIC Educational Resources Information Center

    Molenaar, Peter C. M.; Nesselroade, John R.

    2009-01-01

    It seems that just when we are about to lay P-technique factor analysis finally to rest as obsolete because of newer, more sophisticated multivariate time-series models using latent variables--dynamic factor models--it rears its head to inform us that an obituary may be premature. We present the results of some simulations demonstrating that even…

  17. MULTIVARIATE RECEPTOR MODELING FOR TEMPORALLY CORRELATED DATA BY USING MCMC. (R826238)

    EPA Science Inventory

    The perspectives, information and conclusions conveyed in research project abstracts, progress reports, final reports, journal abstracts and journal publications convey the viewpoints of the principal investigator and may not represent the views and policies of ORD and EPA. Concl...

  18. What matters? Assessing and developing inquiry and multivariable reasoning skills in high school chemistry

    NASA Astrophysics Data System (ADS)

    Daftedar Abdelhadi, Raghda Mohamed

    Although the Next Generation Science Standards (NGSS) present a detailed set of Science and Engineering Practices, a finer grained representation of the underlying skills is lacking in the standards document. Therefore, it has been reported that teachers are facing challenges deciphering and effectively implementing the standards, especially with regards to the Practices. This analytical study assessed the development of high school chemistry students' (N = 41) inquiry, multivariable causal reasoning skills, and metacognition as a mediator for their development. Inquiry tasks based on concepts of element properties of the periodic table as well as reaction kinetics required students to conduct controlled thought experiments, make inferences, and declare predictions of the level of the outcome variable by coordinating the effects of multiple variables. An embedded mixed methods design was utilized for depth and breadth of understanding. Various sources of data were collected including students' written artifacts, audio recordings of in-depth observational groups and interviews. Data analysis was informed by a conceptual framework formulated around the concepts of coordinating theory and evidence, metacognition, and mental models of multivariable causal reasoning. Results of the study indicated positive change towards conducting controlled experimentation, making valid inferences and justifications. Additionally, significant positive correlation between metastrategic and metacognitive competencies, and sophistication of experimental strategies, signified the central role metacognition played. Finally, lack of consistency in indicating effective variables during the multivariable prediction task pointed towards the fragile mental models of multivariable causal reasoning the students had. Implications for teacher education, science education policy as well as classroom research methods are discussed. Finally, recommendations for developing reform-based chemistry curricula based on the Practices are presented.

  19. Utility of an Abbreviated Dizziness Questionnaire to Differentiate between Causes of Vertigo and Guide Appropriate Referral: A Multicenter Prospective Blinded Study

    PubMed Central

    Roland, Lauren T.; Kallogjeri, Dorina; Sinks, Belinda C.; Rauch, Steven D.; Shepard, Neil T.; White, Judith A.; Goebel, Joel A.

    2015-01-01

    Objective Test performance of a focused dizziness questionnaire’s ability to discriminate between peripheral and non-peripheral causes of vertigo. Study Design Prospective multi-center Setting Four academic centers with experienced balance specialists Patients New dizzy patients Interventions A 32-question survey was given to participants. Balance specialists were blinded and a diagnosis was established for all participating patients within 6 months. Main outcomes Multinomial logistic regression was used to evaluate questionnaire performance in predicting final diagnosis and differentiating between peripheral and non-peripheral vertigo. Univariate and multivariable stepwise logistic regression were used to identify questions as significant predictors of the ultimate diagnosis. C-index was used to evaluate performance and discriminative power of the multivariable models. Results 437 patients participated in the study. Eight participants without confirmed diagnoses were excluded and 429 were included in the analysis. Multinomial regression revealed that the model had good overall predictive accuracy of 78.5% for the final diagnosis and 75.5% for differentiating between peripheral and non-peripheral vertigo. Univariate logistic regression identified significant predictors of three main categories of vertigo: peripheral, central and other. Predictors were entered into forward stepwise multivariable logistic regression. The discriminative power of the final models for peripheral, central and other causes were considered good as measured by c-indices of 0.75, 0.7 and 0.78, respectively. Conclusions This multicenter study demonstrates a focused dizziness questionnaire can accurately predict diagnosis for patients with chronic/relapsing dizziness referred to outpatient clinics. Additionally, this survey has significant capability to differentiate peripheral from non-peripheral causes of vertigo and may, in the future, serve as a screening tool for specialty referral. Clinical utility of this questionnaire to guide specialty referral is discussed. PMID:26485598

  20. Utility of an Abbreviated Dizziness Questionnaire to Differentiate Between Causes of Vertigo and Guide Appropriate Referral: A Multicenter Prospective Blinded Study.

    PubMed

    Roland, Lauren T; Kallogjeri, Dorina; Sinks, Belinda C; Rauch, Steven D; Shepard, Neil T; White, Judith A; Goebel, Joel A

    2015-12-01

    Test performance of a focused dizziness questionnaire's ability to discriminate between peripheral and nonperipheral causes of vertigo. Prospective multicenter. Four academic centers with experienced balance specialists. New dizzy patients. A 32-question survey was given to participants. Balance specialists were blinded and a diagnosis was established for all participating patients within 6 months. Multinomial logistic regression was used to evaluate questionnaire performance in predicting final diagnosis and differentiating between peripheral and nonperipheral vertigo. Univariate and multivariable stepwise logistic regression were used to identify questions as significant predictors of the ultimate diagnosis. C-index was used to evaluate performance and discriminative power of the multivariable models. In total, 437 patients participated in the study. Eight participants without confirmed diagnoses were excluded and 429 were included in the analysis. Multinomial regression revealed that the model had good overall predictive accuracy of 78.5% for the final diagnosis and 75.5% for differentiating between peripheral and nonperipheral vertigo. Univariate logistic regression identified significant predictors of three main categories of vertigo: peripheral, central, and other. Predictors were entered into forward stepwise multivariable logistic regression. The discriminative power of the final models for peripheral, central, and other causes was considered good as measured by c-indices of 0.75, 0.7, and 0.78, respectively. This multicenter study demonstrates a focused dizziness questionnaire can accurately predict diagnosis for patients with chronic/relapsing dizziness referred to outpatient clinics. Additionally, this survey has significant capability to differentiate peripheral from nonperipheral causes of vertigo and may, in the future, serve as a screening tool for specialty referral. Clinical utility of this questionnaire to guide specialty referral is discussed.

  1. Analyzing developmental processes on an individual level using nonstationary time series modeling.

    PubMed

    Molenaar, Peter C M; Sinclair, Katerina O; Rovine, Michael J; Ram, Nilam; Corneal, Sherry E

    2009-01-01

    Individuals change over time, often in complex ways. Generally, studies of change over time have combined individuals into groups for analysis, which is inappropriate in most, if not all, studies of development. The authors explain how to identify appropriate levels of analysis (individual vs. group) and demonstrate how to estimate changes in developmental processes over time using a multivariate nonstationary time series model. They apply this model to describe the changing relationships between a biological son and father and a stepson and stepfather at the individual level. The authors also explain how to use an extended Kalman filter with iteration and smoothing estimator to capture how dynamics change over time. Finally, they suggest further applications of the multivariate nonstationary time series model and detail the next steps in the development of statistical models used to analyze individual-level data.

  2. Multivariate Phylogenetic Comparative Methods: Evaluations, Comparisons, and Recommendations.

    PubMed

    Adams, Dean C; Collyer, Michael L

    2018-01-01

    Recent years have seen increased interest in phylogenetic comparative analyses of multivariate data sets, but to date the varied proposed approaches have not been extensively examined. Here we review the mathematical properties required of any multivariate method, and specifically evaluate existing multivariate phylogenetic comparative methods in this context. Phylogenetic comparative methods based on the full multivariate likelihood are robust to levels of covariation among trait dimensions and are insensitive to the orientation of the data set, but display increasing model misspecification as the number of trait dimensions increases. This is because the expected evolutionary covariance matrix (V) used in the likelihood calculations becomes more ill-conditioned as trait dimensionality increases, and as evolutionary models become more complex. Thus, these approaches are only appropriate for data sets with few traits and many species. Methods that summarize patterns across trait dimensions treated separately (e.g., SURFACE) incorrectly assume independence among trait dimensions, resulting in nearly a 100% model misspecification rate. Methods using pairwise composite likelihood are highly sensitive to levels of trait covariation, the orientation of the data set, and the number of trait dimensions. The consequences of these debilitating deficiencies are that a user can arrive at differing statistical conclusions, and therefore biological inferences, simply from a dataspace rotation, like principal component analysis. By contrast, algebraic generalizations of the standard phylogenetic comparative toolkit that use the trace of covariance matrices are insensitive to levels of trait covariation, the number of trait dimensions, and the orientation of the data set. Further, when appropriate permutation tests are used, these approaches display acceptable Type I error and statistical power. We conclude that methods summarizing information across trait dimensions, as well as pairwise composite likelihood methods should be avoided, whereas algebraic generalizations of the phylogenetic comparative toolkit provide a useful means of assessing macroevolutionary patterns in multivariate data. Finally, we discuss areas in which multivariate phylogenetic comparative methods are still in need of future development; namely highly multivariate Ornstein-Uhlenbeck models and approaches for multivariate evolutionary model comparisons. © The Author(s) 2017. Published by Oxford University Press on behalf of the Systematic Biology. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  3. From point process observations to collective neural dynamics: Nonlinear Hawkes process GLMs, low-dimensional dynamics and coarse graining

    PubMed Central

    Truccolo, Wilson

    2017-01-01

    This review presents a perspective on capturing collective dynamics in recorded neuronal ensembles based on multivariate point process models, inference of low-dimensional dynamics and coarse graining of spatiotemporal measurements. A general probabilistic framework for continuous time point processes reviewed, with an emphasis on multivariate nonlinear Hawkes processes with exogenous inputs. A point process generalized linear model (PP-GLM) framework for the estimation of discrete time multivariate nonlinear Hawkes processes is described. The approach is illustrated with the modeling of collective dynamics in neocortical neuronal ensembles recorded in human and non-human primates, and prediction of single-neuron spiking. A complementary approach to capture collective dynamics based on low-dimensional dynamics (“order parameters”) inferred via latent state-space models with point process observations is presented. The approach is illustrated by inferring and decoding low-dimensional dynamics in primate motor cortex during naturalistic reach and grasp movements. Finally, we briefly review hypothesis tests based on conditional inference and spatiotemporal coarse graining for assessing collective dynamics in recorded neuronal ensembles. PMID:28336305

  4. From point process observations to collective neural dynamics: Nonlinear Hawkes process GLMs, low-dimensional dynamics and coarse graining.

    PubMed

    Truccolo, Wilson

    2016-11-01

    This review presents a perspective on capturing collective dynamics in recorded neuronal ensembles based on multivariate point process models, inference of low-dimensional dynamics and coarse graining of spatiotemporal measurements. A general probabilistic framework for continuous time point processes reviewed, with an emphasis on multivariate nonlinear Hawkes processes with exogenous inputs. A point process generalized linear model (PP-GLM) framework for the estimation of discrete time multivariate nonlinear Hawkes processes is described. The approach is illustrated with the modeling of collective dynamics in neocortical neuronal ensembles recorded in human and non-human primates, and prediction of single-neuron spiking. A complementary approach to capture collective dynamics based on low-dimensional dynamics ("order parameters") inferred via latent state-space models with point process observations is presented. The approach is illustrated by inferring and decoding low-dimensional dynamics in primate motor cortex during naturalistic reach and grasp movements. Finally, we briefly review hypothesis tests based on conditional inference and spatiotemporal coarse graining for assessing collective dynamics in recorded neuronal ensembles. Published by Elsevier Ltd.

  5. Analysis and control of the METC fluid bed gasifier. Final report (includes technical progress report for October 1994--January 1995), September 1994--September 1996

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    NONE

    1996-09-01

    This document presents a modeling and control study of the Fluid Bed Gasification (FBG) unit at the Morgantown Energy Technology Center (METC). The work is performed under contract no. DE-FG21-94MC31384. The purpose of this study is to generate a simple FBG model from process data, and then use the model to suggest an improved control scheme which will improve operation of the gasifier. The work first developes a simple linear model of the gasifier, then suggests an improved gasifier pressure and MGCR control configuration, and finally suggests the use of a multivariable control strategy for the gasifier.

  6. Optimal moment determination in POME-copula based hydrometeorological dependence modelling

    NASA Astrophysics Data System (ADS)

    Liu, Dengfeng; Wang, Dong; Singh, Vijay P.; Wang, Yuankun; Wu, Jichun; Wang, Lachun; Zou, Xinqing; Chen, Yuanfang; Chen, Xi

    2017-07-01

    Copula has been commonly applied in multivariate modelling in various fields where marginal distribution inference is a key element. To develop a flexible, unbiased mathematical inference framework in hydrometeorological multivariate applications, the principle of maximum entropy (POME) is being increasingly coupled with copula. However, in previous POME-based studies, determination of optimal moment constraints has generally not been considered. The main contribution of this study is the determination of optimal moments for POME for developing a coupled optimal moment-POME-copula framework to model hydrometeorological multivariate events. In this framework, margins (marginals, or marginal distributions) are derived with the use of POME, subject to optimal moment constraints. Then, various candidate copulas are constructed according to the derived margins, and finally the most probable one is determined, based on goodness-of-fit statistics. This optimal moment-POME-copula framework is applied to model the dependence patterns of three types of hydrometeorological events: (i) single-site streamflow-water level; (ii) multi-site streamflow; and (iii) multi-site precipitation, with data collected from Yichang and Hankou in the Yangtze River basin, China. Results indicate that the optimal-moment POME is more accurate in margin fitting and the corresponding copulas reflect a good statistical performance in correlation simulation. Also, the derived copulas, capturing more patterns which traditional correlation coefficients cannot reflect, provide an efficient way in other applied scenarios concerning hydrometeorological multivariate modelling.

  7. Three-Dimensional Visualization of Ozone Process Data.

    DTIC Science & Technology

    1997-06-18

    Scattered Multivariate Data. IEEE Computer Graphics & Applications. 11 (May), 47-55. Odman, M.T. and Ingram, C.L. (1996) Multiscale Air Quality Simulation...the Multiscale Air Quality Simulation Platform (MAQSIP) modeling system. MAQSIP is a modular comprehensive air quality modeling system which MCNC...photolyzed back again to nitric oxide. Finally, oxides of 6 nitrogen are terminated through loss or combination into nitric acid, organic nitrates

  8. Use of antimüllerian hormone to predict the menopausal transition in HIV-infected women

    PubMed Central

    Scherzer, Rebecca; Greenblatt, Ruth M.; Merhi, Zaher O.; Kassaye, Seble; Lambert-Messerlian, Geralyn; Maki, Pauline M.; Murphy, Kerry; Karim, Roksana; Bacchetti, Peter

    2016-01-01

    BACKGROUND HIV infection has been associated with early menopausal onset, which may have adverse long-term health consequences. Antimüllerian hormone, a biomarker of ovarian reserve and gonadal aging, is reduced in HIV-infected women. OBJECTIVE We sought to assess the relationship of antimüllerian hormone to age of menopause onset in HIV-infected women. STUDY DESIGN We used antimüllerian hormone levels measured in plasma in 2461 HIV-infected participants from the Women’s Interagency HIV Study to model the age at final menstrual period. Multivariable normal mixture models for censored data were used to identify factors associated with age at final menstrual period. RESULTS Higher antimüllerian hormone at age 40 years was associated with later age at final menstrual period, even after multivariable adjustment for smoking, CD4 cell count, plasma HIV RNA, hepatitis C infection, and history of clinical AIDS. Each doubling of antimüllerian hormone was associated with a 1.5-year increase in the age at final menstrual period. Median age at final menstrual period ranged from 45 years for those in the 10th percentile of antimüllerian hormone to 52 years for those in the 90th percentile. Other factors independently associated with earlier age at final menstrual period included smoking, hepatitis C infection, higher HIV RNA levels, and history of clinical AIDS. CONCLUSION Antimüllerian hormone is highly predictive of age at final menstrual period in HIV-infected women. Measuring antimüllerian hormone in HIV-infected women may enable clinicians to predict risk of early menopause, and potentially implement individualized treatment plans to prevent menopause-related comorbidities and to aid in interpretation of symptoms. PMID:27473002

  9. Using Time Series Analysis to Predict Cardiac Arrest in a PICU.

    PubMed

    Kennedy, Curtis E; Aoki, Noriaki; Mariscalco, Michele; Turley, James P

    2015-11-01

    To build and test cardiac arrest prediction models in a PICU, using time series analysis as input, and to measure changes in prediction accuracy attributable to different classes of time series data. Retrospective cohort study. Thirty-one bed academic PICU that provides care for medical and general surgical (not congenital heart surgery) patients. Patients experiencing a cardiac arrest in the PICU and requiring external cardiac massage for at least 2 minutes. None. One hundred three cases of cardiac arrest and 109 control cases were used to prepare a baseline dataset that consisted of 1,025 variables in four data classes: multivariate, raw time series, clinical calculations, and time series trend analysis. We trained 20 arrest prediction models using a matrix of five feature sets (combinations of data classes) with four modeling algorithms: linear regression, decision tree, neural network, and support vector machine. The reference model (multivariate data with regression algorithm) had an accuracy of 78% and 87% area under the receiver operating characteristic curve. The best model (multivariate + trend analysis data with support vector machine algorithm) had an accuracy of 94% and 98% area under the receiver operating characteristic curve. Cardiac arrest predictions based on a traditional model built with multivariate data and a regression algorithm misclassified cases 3.7 times more frequently than predictions that included time series trend analysis and built with a support vector machine algorithm. Although the final model lacks the specificity necessary for clinical application, we have demonstrated how information from time series data can be used to increase the accuracy of clinical prediction models.

  10. Bayesian meta-analytical methods to incorporate multiple surrogate endpoints in drug development process.

    PubMed

    Bujkiewicz, Sylwia; Thompson, John R; Riley, Richard D; Abrams, Keith R

    2016-03-30

    A number of meta-analytical methods have been proposed that aim to evaluate surrogate endpoints. Bivariate meta-analytical methods can be used to predict the treatment effect for the final outcome from the treatment effect estimate measured on the surrogate endpoint while taking into account the uncertainty around the effect estimate for the surrogate endpoint. In this paper, extensions to multivariate models are developed aiming to include multiple surrogate endpoints with the potential benefit of reducing the uncertainty when making predictions. In this Bayesian multivariate meta-analytic framework, the between-study variability is modelled in a formulation of a product of normal univariate distributions. This formulation is particularly convenient for including multiple surrogate endpoints and flexible for modelling the outcomes which can be surrogate endpoints to the final outcome and potentially to one another. Two models are proposed, first, using an unstructured between-study covariance matrix by assuming the treatment effects on all outcomes are correlated and second, using a structured between-study covariance matrix by assuming treatment effects on some of the outcomes are conditionally independent. While the two models are developed for the summary data on a study level, the individual-level association is taken into account by the use of the Prentice's criteria (obtained from individual patient data) to inform the within study correlations in the models. The modelling techniques are investigated using an example in relapsing remitting multiple sclerosis where the disability worsening is the final outcome, while relapse rate and MRI lesions are potential surrogates to the disability progression. © 2015 The Authors. Statistics in Medicine Published by John Wiley & Sons Ltd.

  11. Assessing Principal Component Regression Prediction of Neurochemicals Detected with Fast-Scan Cyclic Voltammetry

    PubMed Central

    2011-01-01

    Principal component regression is a multivariate data analysis approach routinely used to predict neurochemical concentrations from in vivo fast-scan cyclic voltammetry measurements. This mathematical procedure can rapidly be employed with present day computer programming languages. Here, we evaluate several methods that can be used to evaluate and improve multivariate concentration determination. The cyclic voltammetric representation of the calculated regression vector is shown to be a valuable tool in determining whether the calculated multivariate model is chemically appropriate. The use of Cook’s distance successfully identified outliers contained within in vivo fast-scan cyclic voltammetry training sets. This work also presents the first direct interpretation of a residual color plot and demonstrated the effect of peak shifts on predicted dopamine concentrations. Finally, separate analyses of smaller increments of a single continuous measurement could not be concatenated without substantial error in the predicted neurochemical concentrations due to electrode drift. Taken together, these tools allow for the construction of more robust multivariate calibration models and provide the first approach to assess the predictive ability of a procedure that is inherently impossible to validate because of the lack of in vivo standards. PMID:21966586

  12. Assessing principal component regression prediction of neurochemicals detected with fast-scan cyclic voltammetry.

    PubMed

    Keithley, Richard B; Wightman, R Mark

    2011-06-07

    Principal component regression is a multivariate data analysis approach routinely used to predict neurochemical concentrations from in vivo fast-scan cyclic voltammetry measurements. This mathematical procedure can rapidly be employed with present day computer programming languages. Here, we evaluate several methods that can be used to evaluate and improve multivariate concentration determination. The cyclic voltammetric representation of the calculated regression vector is shown to be a valuable tool in determining whether the calculated multivariate model is chemically appropriate. The use of Cook's distance successfully identified outliers contained within in vivo fast-scan cyclic voltammetry training sets. This work also presents the first direct interpretation of a residual color plot and demonstrated the effect of peak shifts on predicted dopamine concentrations. Finally, separate analyses of smaller increments of a single continuous measurement could not be concatenated without substantial error in the predicted neurochemical concentrations due to electrode drift. Taken together, these tools allow for the construction of more robust multivariate calibration models and provide the first approach to assess the predictive ability of a procedure that is inherently impossible to validate because of the lack of in vivo standards.

  13. When Does Neoadjuvant Chemotherapy Really Avoid Radiotherapy? Clinical Predictors of Adjuvant Radiotherapy in Cervical Cancer.

    PubMed

    Papadia, Andrea; Bellati, Filippo; Bogani, Giorgio; Ditto, Antonino; Martinelli, Fabio; Lorusso, Domenica; Donfrancesco, Cristina; Gasparri, Maria Luisa; Raspagliesi, Francesco

    2015-12-01

    The aim of this study was to identify clinical variables that may predict the need for adjuvant radiotherapy after neoadjuvant chemotherapy (NACT) and radical surgery in locally advanced cervical cancer patients. A retrospective series of cervical cancer patients with International Federation of Gynecology and Obstetrics (FIGO) stages IB2-IIB treated with NACT followed by radical surgery was analyzed. Clinical predictors of persistence of intermediate- and/or high-risk factors at final pathological analysis were investigated. Statistical analysis was performed using univariate and multivariate analysis and using a model based on artificial intelligence known as artificial neuronal network (ANN) analysis. Overall, 101 patients were available for the analyses. Fifty-two (51 %) patients were considered at high risk secondary to parametrial, resection margin and/or lymph node involvement. When disease was confined to the cervix, four (4 %) patients were considered at intermediate risk. At univariate analysis, FIGO grade 3, stage IIB disease at diagnosis and the presence of enlarged nodes before NACT predicted the presence of intermediate- and/or high-risk factors at final pathological analysis. At multivariate analysis, only FIGO grade 3 and tumor diameter maintained statistical significance. The specificity of ANN models in evaluating predictive variables was slightly superior to conventional multivariable models. FIGO grade, stage, tumor diameter, and histology are associated with persistence of pathological intermediate- and/or high-risk factors after NACT and radical surgery. This information is useful in counseling patients at the time of treatment planning with regard to the probability of being subjected to pelvic radiotherapy after completion of the initially planned treatment.

  14. Quality Reporting of Multivariable Regression Models in Observational Studies: Review of a Representative Sample of Articles Published in Biomedical Journals.

    PubMed

    Real, Jordi; Forné, Carles; Roso-Llorach, Albert; Martínez-Sánchez, Jose M

    2016-05-01

    Controlling for confounders is a crucial step in analytical observational studies, and multivariable models are widely used as statistical adjustment techniques. However, the validation of the assumptions of the multivariable regression models (MRMs) should be made clear in scientific reporting. The objective of this study is to review the quality of statistical reporting of the most commonly used MRMs (logistic, linear, and Cox regression) that were applied in analytical observational studies published between 2003 and 2014 by journals indexed in MEDLINE.Review of a representative sample of articles indexed in MEDLINE (n = 428) with observational design and use of MRMs (logistic, linear, and Cox regression). We assessed the quality of reporting about: model assumptions and goodness-of-fit, interactions, sensitivity analysis, crude and adjusted effect estimate, and specification of more than 1 adjusted model.The tests of underlying assumptions or goodness-of-fit of the MRMs used were described in 26.2% (95% CI: 22.0-30.3) of the articles and 18.5% (95% CI: 14.8-22.1) reported the interaction analysis. Reporting of all items assessed was higher in articles published in journals with a higher impact factor.A low percentage of articles indexed in MEDLINE that used multivariable techniques provided information demonstrating rigorous application of the model selected as an adjustment method. Given the importance of these methods to the final results and conclusions of observational studies, greater rigor is required in reporting the use of MRMs in the scientific literature.

  15. Estimating Risk of Natural Gas Portfolios by Using GARCH-EVT-Copula Model.

    PubMed

    Tang, Jiechen; Zhou, Chao; Yuan, Xinyu; Sriboonchitta, Songsak

    2015-01-01

    This paper concentrates on estimating the risk of Title Transfer Facility (TTF) Hub natural gas portfolios by using the GARCH-EVT-copula model. We first use the univariate ARMA-GARCH model to model each natural gas return series. Second, the extreme value distribution (EVT) is fitted to the tails of the residuals to model marginal residual distributions. Third, multivariate Gaussian copula and Student t-copula are employed to describe the natural gas portfolio risk dependence structure. Finally, we simulate N portfolios and estimate value at risk (VaR) and conditional value at risk (CVaR). Our empirical results show that, for an equally weighted portfolio of five natural gases, the VaR and CVaR values obtained from the Student t-copula are larger than those obtained from the Gaussian copula. Moreover, when minimizing the portfolio risk, the optimal natural gas portfolio weights are found to be similar across the multivariate Gaussian copula and Student t-copula and different confidence levels.

  16. Estimating Risk of Natural Gas Portfolios by Using GARCH-EVT-Copula Model

    PubMed Central

    Tang, Jiechen; Zhou, Chao; Yuan, Xinyu; Sriboonchitta, Songsak

    2015-01-01

    This paper concentrates on estimating the risk of Title Transfer Facility (TTF) Hub natural gas portfolios by using the GARCH-EVT-copula model. We first use the univariate ARMA-GARCH model to model each natural gas return series. Second, the extreme value distribution (EVT) is fitted to the tails of the residuals to model marginal residual distributions. Third, multivariate Gaussian copula and Student t-copula are employed to describe the natural gas portfolio risk dependence structure. Finally, we simulate N portfolios and estimate value at risk (VaR) and conditional value at risk (CVaR). Our empirical results show that, for an equally weighted portfolio of five natural gases, the VaR and CVaR values obtained from the Student t-copula are larger than those obtained from the Gaussian copula. Moreover, when minimizing the portfolio risk, the optimal natural gas portfolio weights are found to be similar across the multivariate Gaussian copula and Student t-copula and different confidence levels. PMID:26351652

  17. Modelling lifetime data with multivariate Tweedie distribution

    NASA Astrophysics Data System (ADS)

    Nor, Siti Rohani Mohd; Yusof, Fadhilah; Bahar, Arifah

    2017-05-01

    This study aims to measure the dependence between individual lifetimes by applying multivariate Tweedie distribution to the lifetime data. Dependence between lifetimes incorporated in the mortality model is a new form of idea that gives significant impact on the risk of the annuity portfolio which is actually against the idea of standard actuarial methods that assumes independent between lifetimes. Hence, this paper applies Tweedie family distribution to the portfolio of lifetimes to induce the dependence between lives. Tweedie distribution is chosen since it contains symmetric and non-symmetric, as well as light-tailed and heavy-tailed distributions. Parameter estimation is modified in order to fit the Tweedie distribution to the data. This procedure is developed by using method of moments. In addition, the comparison stage is made to check for the adequacy between the observed mortality and expected mortality. Finally, the importance of including systematic mortality risk in the model is justified by the Pearson's chi-squared test.

  18. Simulation Modeling to Compare High-Throughput, Low-Iteration Optimization Strategies for Metabolic Engineering

    PubMed Central

    Heinsch, Stephen C.; Das, Siba R.; Smanski, Michael J.

    2018-01-01

    Increasing the final titer of a multi-gene metabolic pathway can be viewed as a multivariate optimization problem. While numerous multivariate optimization algorithms exist, few are specifically designed to accommodate the constraints posed by genetic engineering workflows. We present a strategy for optimizing expression levels across an arbitrary number of genes that requires few design-build-test iterations. We compare the performance of several optimization algorithms on a series of simulated expression landscapes. We show that optimal experimental design parameters depend on the degree of landscape ruggedness. This work provides a theoretical framework for designing and executing numerical optimization on multi-gene systems. PMID:29535690

  19. Study for Updated Gout Classification Criteria (SUGAR): identification of features to classify gout

    PubMed Central

    Taylor, William J.; Fransen, Jaap; Jansen, Tim L.; Dalbeth, Nicola; Schumacher, H. Ralph; Brown, Melanie; Louthrenoo, Worawit; Vazquez-Mellado, Janitzia; Eliseev, Maxim; McCarthy, Geraldine; Stamp, Lisa K.; Perez-Ruiz, Fernando; Sivera, Francisca; Ea, Hang-Korng; Gerritsen, Martijn; Scire, Carlo; Cavagna, Lorenzo; Lin, Chingtsai; Chou, Yin-Yi; Tausche, Anne-Kathrin; Vargas-Santos, Ana Beatriz; Janssen, Matthijs; Chen, Jiunn-Horng; Slot, Ole; Cimmino, Marco A.; Uhlig, Till; Neogi, Tuhina

    2015-01-01

    Objective To determine which clinical, laboratory and imaging features most accurately distinguished gout from non-gout. Methods A cross-sectional study of consecutive rheumatology clinic patients with at least one swollen joint or subcutaneous tophus. Gout was defined by synovial fluid or tophus aspirate microscopy by certified examiners in all patients. The sample was randomly divided into a model development (2/3) and test sample (1/3). Univariate and multivariate association between clinical features and MSU-defined gout was determined using logistic regression modelling. Shrinkage of regression weights was performed to prevent over-fitting of the final model. Latent class analysis was conducted to identify patterns of joint involvement. Results In total, 983 patients were included. Gout was present in 509 (52%). In the development sample (n=653), these features were selected for the final model (multivariate OR) joint erythema (2.13), difficulty walking (7.34), time to maximal pain < 24 hours (1.32), resolution by 2 weeks (3.58), tophus (7.29), MTP1 ever involved (2.30), location of currently tender joints: Other foot/ankle (2.28), MTP1 (2.82), serum urate level > 6 mg/dl (0.36 mmol/l) (3.35), ultrasound double contour sign (7.23), Xray erosion or cyst (2.49). The final model performed adequately in the test set with no evidence of misfit, high discrimination and predictive ability. MTP1 involvement was the most common joint pattern (39.4%) in gout cases. Conclusion Ten key discriminating features have been identified for further evaluation for new gout classification criteria. Ultrasound findings and degree of uricemia add discriminating value, and will significantly contribute to more accurate classification criteria. PMID:25777045

  20. Identification and quantification of ciprofloxacin in urine through excitation-emission fluorescence and three-way PARAFAC calibration.

    PubMed

    Ortiz, M C; Sarabia, L A; Sánchez, M S; Giménez, D

    2009-05-29

    Due to the second-order advantage, calibration models based on parallel factor analysis (PARAFAC) decomposition of three-way data are becoming important in routine analysis. This work studies the possibility of fitting PARAFAC models with excitation-emission fluorescence data for the determination of ciprofloxacin in human urine. The finally chosen PARAFAC decomposition is built with calibration samples spiked with ciprofloxacin, and with other series of urine samples that were also spiked. One of the series of samples has also another drug because the patient was taking mesalazine. The mesalazine is a fluorescent substance that interferes with the ciprofloxacin. Finally, the procedure is applied to samples of a patient who was being treated with ciprofloxacin. The trueness has been established by the regression "predicted concentration versus added concentration". The recovery factor is 88.3% for ciprofloxacin in urine, and the mean of the absolute value of the relative errors is 4.2% for 46 test samples. The multivariate sensitivity of the fit calibration model is evaluated by a regression between the loadings of PARAFAC linked to ciprofloxacin versus the true concentration in spiked samples. The multivariate capability of discrimination is near 8 microg L(-1) when the probabilities of false non-compliance and false compliance are fixed at 5%.

  1. The lexical development of children with hearing impairment and associated factors.

    PubMed

    Penna, Leticia Macedo; Lemos, Stela Maris Aguiar; Alves, Cláudia Regina Lindgren

    2014-01-01

    This study aimed at analyzing the association between the lexical development of children with hearing impairment and their psychosocial and socioeconomic characteristics and medical history. An analytic transversal study was conducted in an Auditive Health Attention Service. One hundred and ten children from 6 to 10 years old using hearing aids and presenting hearing loss that ranged from light to deep levels were evaluated. All children were subjected to oral, written language and auditory perception tests. Parents answered a structured questionnaire to collect data from their medical history and socioeconomic status, and questionnaires about the features of the family environment and psychosocial characteristics. Multivariate analysis was performed by logistic regression, being the initial model composed by variables with p<0,20 in the univariate analysis. In the final model, we adopted a significance level of 5%. The final model of the multivariate analysis showed an association between the performance on the vocabulary test and the results of phonemic discrimination test (OR=0.81; 95%CI 0.73-0.89). The results show the importance of stimulating the auditory processing, particularly the phonemic discrimination skill, throughout the rehabilitation process of children with hearing impairment. This stimulation can enhance lexical development and minimize the metalanguage and learning difficulties often observed in these children.

  2. Some Recent Developments on Complex Multivariate Distributions

    ERIC Educational Resources Information Center

    Krishnaiah, P. R.

    1976-01-01

    In this paper, the author gives a review of the literature on complex multivariate distributions. Some new results on these distributions are also given. Finally, the author discusses the applications of the complex multivariate distributions in the area of the inference on multiple time series. (Author)

  3. The Effect of Alcohol Abuse and Dependence and School Experiences on Depression: A National Study of Adolescents

    ERIC Educational Resources Information Center

    Merianos, Ashley L.; King, Keith A.; Vidourek, Rebecca A.; Hardee, Angelica M.

    2016-01-01

    The study purpose was to examine the effect alcohol abuse/dependence and school experiences have on depression among a nationwide sample of adolescents. A secondary analysis of the 2013 National Survey on Drug Use and Health was conducted. The results of the final multivariable logistic regression model revealed that adolescents who reported…

  4. Vibration Attenuation of the NASA Langley Evolutionary Structure Experiment Using H(infinity) and Structured Singular Value (mu) Robust Multivariable Control Techniques

    NASA Technical Reports Server (NTRS)

    Balas, Gary J.

    1996-01-01

    This final report summarizes the research results under NASA Contract NAG-1-1254 from May, 1991 - April, 1995. The main contribution of this research are in the areas of control of flexible structures, model validation, optimal control analysis and synthesis techniques, and use of shape memory alloys for structural damping.

  5. Disproportionation of rosin on an industrial Pd/C catalyst: reaction pathway and kinetic model discrimination.

    PubMed

    Souto, Juan Carlos; Yustos, Pedro; Ladero, Miguel; Garcia-Ochoa, Felix

    2011-02-01

    In this work, a phenomenological study of the isomerisation and disproportionation of rosin acids using an industrial 5% Pd on charcoal catalyst from 200 to 240°C is carried out. Medium composition is determined by elemental microanalysis, GC-MS and GC-FID. Dehydrogenated and hydrogenated acid species molar amounts in the final product show that dehydrogenation is the main reaction. Moreover, both hydrogen and non-hydrogen concentration considering kinetic models are fitted to experimental data using a multivariable non-linear technique. Statistical discrimination among the proposed kinetic models lead to the conclusion hydrogen considering models fit much better to experimental results. The final kinetic model involves first-order isomerisation reactions of neoabietic and palustric acids to abietic acid, first-order dehydrogenation and hydrogenation of this latter acid, and hydrogenation of pimaric acids. Hydrogenation reactions are partial first-order regarding the acid and hydrogen. Copyright © 2010 Elsevier Ltd. All rights reserved.

  6. Combined Prediction Model of Death Toll for Road Traffic Accidents Based on Independent and Dependent Variables

    PubMed Central

    Zhong-xiang, Feng; Shi-sheng, Lu; Wei-hua, Zhang; Nan-nan, Zhang

    2014-01-01

    In order to build a combined model which can meet the variation rule of death toll data for road traffic accidents and can reflect the influence of multiple factors on traffic accidents and improve prediction accuracy for accidents, the Verhulst model was built based on the number of death tolls for road traffic accidents in China from 2002 to 2011; and car ownership, population, GDP, highway freight volume, highway passenger transportation volume, and highway mileage were chosen as the factors to build the death toll multivariate linear regression model. Then the two models were combined to be a combined prediction model which has weight coefficient. Shapley value method was applied to calculate the weight coefficient by assessing contributions. Finally, the combined model was used to recalculate the number of death tolls from 2002 to 2011, and the combined model was compared with the Verhulst and multivariate linear regression models. The results showed that the new model could not only characterize the death toll data characteristics but also quantify the degree of influence to the death toll by each influencing factor and had high accuracy as well as strong practicability. PMID:25610454

  7. Combined prediction model of death toll for road traffic accidents based on independent and dependent variables.

    PubMed

    Feng, Zhong-xiang; Lu, Shi-sheng; Zhang, Wei-hua; Zhang, Nan-nan

    2014-01-01

    In order to build a combined model which can meet the variation rule of death toll data for road traffic accidents and can reflect the influence of multiple factors on traffic accidents and improve prediction accuracy for accidents, the Verhulst model was built based on the number of death tolls for road traffic accidents in China from 2002 to 2011; and car ownership, population, GDP, highway freight volume, highway passenger transportation volume, and highway mileage were chosen as the factors to build the death toll multivariate linear regression model. Then the two models were combined to be a combined prediction model which has weight coefficient. Shapley value method was applied to calculate the weight coefficient by assessing contributions. Finally, the combined model was used to recalculate the number of death tolls from 2002 to 2011, and the combined model was compared with the Verhulst and multivariate linear regression models. The results showed that the new model could not only characterize the death toll data characteristics but also quantify the degree of influence to the death toll by each influencing factor and had high accuracy as well as strong practicability.

  8. Multivariate statistical process control (MSPC) using Raman spectroscopy for in-line culture cell monitoring considering time-varying batches synchronized with correlation optimized warping (COW).

    PubMed

    Liu, Ya-Juan; André, Silvère; Saint Cristau, Lydia; Lagresle, Sylvain; Hannas, Zahia; Calvosa, Éric; Devos, Olivier; Duponchel, Ludovic

    2017-02-01

    Multivariate statistical process control (MSPC) is increasingly popular as the challenge provided by large multivariate datasets from analytical instruments such as Raman spectroscopy for the monitoring of complex cell cultures in the biopharmaceutical industry. However, Raman spectroscopy for in-line monitoring often produces unsynchronized data sets, resulting in time-varying batches. Moreover, unsynchronized data sets are common for cell culture monitoring because spectroscopic measurements are generally recorded in an alternate way, with more than one optical probe parallelly connecting to the same spectrometer. Synchronized batches are prerequisite for the application of multivariate analysis such as multi-way principal component analysis (MPCA) for the MSPC monitoring. Correlation optimized warping (COW) is a popular method for data alignment with satisfactory performance; however, it has never been applied to synchronize acquisition time of spectroscopic datasets in MSPC application before. In this paper we propose, for the first time, to use the method of COW to synchronize batches with varying durations analyzed with Raman spectroscopy. In a second step, we developed MPCA models at different time intervals based on the normal operation condition (NOC) batches synchronized by COW. New batches are finally projected considering the corresponding MPCA model. We monitored the evolution of the batches using two multivariate control charts based on Hotelling's T 2 and Q. As illustrated with results, the MSPC model was able to identify abnormal operation condition including contaminated batches which is of prime importance in cell culture monitoring We proved that Raman-based MSPC monitoring can be used to diagnose batches deviating from the normal condition, with higher efficacy than traditional diagnosis, which would save time and money in the biopharmaceutical industry. Copyright © 2016 Elsevier B.V. All rights reserved.

  9. Empirical study of the dependence of the results of multivariable flexible survival analyses on model selection strategy.

    PubMed

    Binquet, C; Abrahamowicz, M; Mahboubi, A; Jooste, V; Faivre, J; Bonithon-Kopp, C; Quantin, C

    2008-12-30

    Flexible survival models, which avoid assumptions about hazards proportionality (PH) or linearity of continuous covariates effects, bring the issues of model selection to a new level of complexity. Each 'candidate covariate' requires inter-dependent decisions regarding (i) its inclusion in the model, and representation of its effects on the log hazard as (ii) either constant over time or time-dependent (TD) and, for continuous covariates, (iii) either loglinear or non-loglinear (NL). Moreover, 'optimal' decisions for one covariate depend on the decisions regarding others. Thus, some efficient model-building strategy is necessary.We carried out an empirical study of the impact of the model selection strategy on the estimates obtained in flexible multivariable survival analyses of prognostic factors for mortality in 273 gastric cancer patients. We used 10 different strategies to select alternative multivariable parametric as well as spline-based models, allowing flexible modeling of non-parametric (TD and/or NL) effects. We employed 5-fold cross-validation to compare the predictive ability of alternative models.All flexible models indicated significant non-linearity and changes over time in the effect of age at diagnosis. Conventional 'parametric' models suggested the lack of period effect, whereas more flexible strategies indicated a significant NL effect. Cross-validation confirmed that flexible models predicted better mortality. The resulting differences in the 'final model' selected by various strategies had also impact on the risk prediction for individual subjects.Overall, our analyses underline (a) the importance of accounting for significant non-parametric effects of covariates and (b) the need for developing accurate model selection strategies for flexible survival analyses. Copyright 2008 John Wiley & Sons, Ltd.

  10. Distant stereoacuity in children with anisometropic amblyopia.

    PubMed

    Chung, Yeon Woong; Park, Shin Hae; Shin, Sun Young

    2017-09-01

    To characterize changes in distant stereoacuity using Frisby-Davis Distance test (FD2) and Distant Randot test (DR) during treatment for anisometropic amblyopia, to determine factors that influence posttreatment stereoacuity and to compare the two distant stereotests. Fifty-eight anisometropic amblyopic patients with an interocular difference of ≥1.00 diopter who achieved the visual acuity 20/20 following amblyopia treatment were retrospectively included. Stereoacuity using FD2 and DR for distant and Titmus test for near measurement were assessed and compared at the initial, intermediate, and final visit. Multivariate regression models were used to identify factors associated with initial and final stereoacuity. The two distant stereotests revealed a significant improvement in distant stereoacuity after successful amblyopia treatment. Distant stereoacuity using FD2 showed the greatest improvement during the follow up period. The number of nil scores was higher in DR than FD2 at each period. In multivariate analysis, better final stereoacuity was associated with better initial amblyopic eye acuity in both distant stereotests, but not in the Titmus test. Comparing the two distant stereotests, final stereoacuity using FD2 was associated with initial stereoacuity and was moderately related with the Titmus test at each period, but final stereoacuity using DR was not. Distant stereoacuity measured with both FD2 and DR showed significant improvement when the visual acuity of the amblyopic eye achieved 20/20. Changes in distant stereoacuity by FD2 and DR during the amblyopia treatment were somewhat different.

  11. Determination of main fruits in adulterated nectars by ATR-FTIR spectroscopy combined with multivariate calibration and variable selection methods.

    PubMed

    Miaw, Carolina Sheng Whei; Assis, Camila; Silva, Alessandro Rangel Carolino Sales; Cunha, Maria Luísa; Sena, Marcelo Martins; de Souza, Scheilla Vitorino Carvalho

    2018-07-15

    Grape, orange, peach and passion fruit nectars were formulated and adulterated by dilution with syrup, apple and cashew juices at 10 levels for each adulterant. Attenuated total reflectance Fourier transform mid infrared (ATR-FTIR) spectra were obtained. Partial least squares (PLS) multivariate calibration models allied to different variable selection methods, such as interval partial least squares (iPLS), ordered predictors selection (OPS) and genetic algorithm (GA), were used to quantify the main fruits. PLS improved by iPLS-OPS variable selection showed the highest predictive capacity to quantify the main fruit contents. The selected variables in the final models varied from 72 to 100; the root mean square errors of prediction were estimated from 0.5 to 2.6%; the correlation coefficients of prediction ranged from 0.948 to 0.990; and, the mean relative errors of prediction varied from 3.0 to 6.7%. All of the developed models were validated. Copyright © 2018 Elsevier Ltd. All rights reserved.

  12. [A novel method of multi-channel feature extraction combining multivariate autoregression and multiple-linear principal component analysis].

    PubMed

    Wang, Jinjia; Zhang, Yanna

    2015-02-01

    Brain-computer interface (BCI) systems identify brain signals through extracting features from them. In view of the limitations of the autoregressive model feature extraction method and the traditional principal component analysis to deal with the multichannel signals, this paper presents a multichannel feature extraction method that multivariate autoregressive (MVAR) model combined with the multiple-linear principal component analysis (MPCA), and used for magnetoencephalography (MEG) signals and electroencephalograph (EEG) signals recognition. Firstly, we calculated the MVAR model coefficient matrix of the MEG/EEG signals using this method, and then reduced the dimensions to a lower one, using MPCA. Finally, we recognized brain signals by Bayes Classifier. The key innovation we introduced in our investigation showed that we extended the traditional single-channel feature extraction method to the case of multi-channel one. We then carried out the experiments using the data groups of IV-III and IV - I. The experimental results proved that the method proposed in this paper was feasible.

  13. Linear models of coregionalization for multivariate lattice data: Order-dependent and order-free cMCARs.

    PubMed

    MacNab, Ying C

    2016-08-01

    This paper concerns with multivariate conditional autoregressive models defined by linear combination of independent or correlated underlying spatial processes. Known as linear models of coregionalization, the method offers a systematic and unified approach for formulating multivariate extensions to a broad range of univariate conditional autoregressive models. The resulting multivariate spatial models represent classes of coregionalized multivariate conditional autoregressive models that enable flexible modelling of multivariate spatial interactions, yielding coregionalization models with symmetric or asymmetric cross-covariances of different spatial variation and smoothness. In the context of multivariate disease mapping, for example, they facilitate borrowing strength both over space and cross variables, allowing for more flexible multivariate spatial smoothing. Specifically, we present a broadened coregionalization framework to include order-dependent, order-free, and order-robust multivariate models; a new class of order-free coregionalized multivariate conditional autoregressives is introduced. We tackle computational challenges and present solutions that are integral for Bayesian analysis of these models. We also discuss two ways of computing deviance information criterion for comparison among competing hierarchical models with or without unidentifiable prior parameters. The models and related methodology are developed in the broad context of modelling multivariate data on spatial lattice and illustrated in the context of multivariate disease mapping. The coregionalization framework and related methods also present a general approach for building spatially structured cross-covariance functions for multivariate geostatistics. © The Author(s) 2016.

  14. Multivariate methods for evaluating the efficiency of electrodialytic removal of heavy metals from polluted harbour sediments.

    PubMed

    Pedersen, Kristine Bondo; Kirkelund, Gunvor M; Ottosen, Lisbeth M; Jensen, Pernille E; Lejon, Tore

    2015-01-01

    Chemometrics was used to develop a multivariate model based on 46 previously reported electrodialytic remediation experiments (EDR) of five different harbour sediments. The model predicted final concentrations of Cd, Cu, Pb and Zn as a function of current density, remediation time, stirring rate, dry/wet sediment, cell set-up as well as sediment properties. Evaluation of the model showed that remediation time and current density had the highest comparative influence on the clean-up levels. Individual models for each heavy metal showed variance in the variable importance, indicating that the targeted heavy metals were bound to different sediment fractions. Based on the results, a PLS model was used to design five new EDR experiments of a sixth sediment to achieve specified clean-up levels of Cu and Pb. The removal efficiencies were up to 82% for Cu and 87% for Pb and the targeted clean-up levels were met in four out of five experiments. The clean-up levels were better than predicted by the model, which could hence be used for predicting an approximate remediation strategy; the modelling power will however improve with more data included. Copyright © 2014 Elsevier B.V. All rights reserved.

  15. Clustering Multivariate Time Series Using Hidden Markov Models

    PubMed Central

    Ghassempour, Shima; Girosi, Federico; Maeder, Anthony

    2014-01-01

    In this paper we describe an algorithm for clustering multivariate time series with variables taking both categorical and continuous values. Time series of this type are frequent in health care, where they represent the health trajectories of individuals. The problem is challenging because categorical variables make it difficult to define a meaningful distance between trajectories. We propose an approach based on Hidden Markov Models (HMMs), where we first map each trajectory into an HMM, then define a suitable distance between HMMs and finally proceed to cluster the HMMs with a method based on a distance matrix. We test our approach on a simulated, but realistic, data set of 1,255 trajectories of individuals of age 45 and over, on a synthetic validation set with known clustering structure, and on a smaller set of 268 trajectories extracted from the longitudinal Health and Retirement Survey. The proposed method can be implemented quite simply using standard packages in R and Matlab and may be a good candidate for solving the difficult problem of clustering multivariate time series with categorical variables using tools that do not require advanced statistic knowledge, and therefore are accessible to a wide range of researchers. PMID:24662996

  16. Modeling in the quality by design environment: Regulatory requirements and recommendations for design space and control strategy appointment.

    PubMed

    Djuris, Jelena; Djuric, Zorica

    2017-11-30

    Mathematical models can be used as an integral part of the quality by design (QbD) concept throughout the product lifecycle for variety of purposes, including appointment of the design space and control strategy, continual improvement and risk assessment. Examples of different mathematical modeling techniques (mechanistic, empirical and hybrid) in the pharmaceutical development and process monitoring or control are provided in the presented review. In the QbD context, mathematical models are predominantly used to support design space and/or control strategies. Considering their impact to the final product quality, models can be divided into the following categories: high, medium and low impact models. Although there are regulatory guidelines on the topic of modeling applications, review of QbD-based submission containing modeling elements revealed concerns regarding the scale-dependency of design spaces and verification of models predictions at commercial scale of manufacturing, especially regarding real-time release (RTR) models. Authors provide critical overview on the good modeling practices and introduce concepts of multiple-unit, adaptive and dynamic design space, multivariate specifications and methods for process uncertainty analysis. RTR specification with mathematical model and different approaches to multivariate statistical process control supporting process analytical technologies are also presented. Copyright © 2017 Elsevier B.V. All rights reserved.

  17. Analyzing Multiple Outcomes in Clinical Research Using Multivariate Multilevel Models

    PubMed Central

    Baldwin, Scott A.; Imel, Zac E.; Braithwaite, Scott R.; Atkins, David C.

    2014-01-01

    Objective Multilevel models have become a standard data analysis approach in intervention research. Although the vast majority of intervention studies involve multiple outcome measures, few studies use multivariate analysis methods. The authors discuss multivariate extensions to the multilevel model that can be used by psychotherapy researchers. Method and Results Using simulated longitudinal treatment data, the authors show how multivariate models extend common univariate growth models and how the multivariate model can be used to examine multivariate hypotheses involving fixed effects (e.g., does the size of the treatment effect differ across outcomes?) and random effects (e.g., is change in one outcome related to change in the other?). An online supplemental appendix provides annotated computer code and simulated example data for implementing a multivariate model. Conclusions Multivariate multilevel models are flexible, powerful models that can enhance clinical research. PMID:24491071

  18. On the Effectiveness of Security Countermeasures for Critical Infrastructures.

    PubMed

    Hausken, Kjell; He, Fei

    2016-04-01

    A game-theoretic model is developed where an infrastructure of N targets is protected against terrorism threats. An original threat score is determined by the terrorist's threat against each target and the government's inherent protection level and original protection. The final threat score is impacted by the government's additional protection. We investigate and verify the effectiveness of countermeasures using empirical data and two methods. The first is to estimate the model's parameter values to minimize the sum of the squared differences between the government's additional resource investment predicted by the model and the empirical data. The second is to develop a multivariate regression model where the final threat score varies approximately linearly relative to the original threat score, sectors, and threat scenarios, and depends nonlinearly on the additional resource investment. The model and method are offered as tools, and as a way of thinking, to determine optimal resource investments across vulnerable targets subject to terrorism threats. © 2014 Society for Risk Analysis.

  19. Quality by design case study: an integrated multivariate approach to drug product and process development.

    PubMed

    Huang, Jun; Kaul, Goldi; Cai, Chunsheng; Chatlapalli, Ramarao; Hernandez-Abad, Pedro; Ghosh, Krishnendu; Nagi, Arwinder

    2009-12-01

    To facilitate an in-depth process understanding, and offer opportunities for developing control strategies to ensure product quality, a combination of experimental design, optimization and multivariate techniques was integrated into the process development of a drug product. A process DOE was used to evaluate effects of the design factors on manufacturability and final product CQAs, and establish design space to ensure desired CQAs. Two types of analyses were performed to extract maximal information, DOE effect & response surface analysis and multivariate analysis (PCA and PLS). The DOE effect analysis was used to evaluate the interactions and effects of three design factors (water amount, wet massing time and lubrication time), on response variables (blend flow, compressibility and tablet dissolution). The design space was established by the combined use of DOE, optimization and multivariate analysis to ensure desired CQAs. Multivariate analysis of all variables from the DOE batches was conducted to study relationships between the variables and to evaluate the impact of material attributes/process parameters on manufacturability and final product CQAs. The integrated multivariate approach exemplifies application of QbD principles and tools to drug product and process development.

  20. Molecular Classification Substitutes for the Prognostic Variables Stage, Age, and MYCN Status in Neuroblastoma Risk Assessment.

    PubMed

    Rosswog, Carolina; Schmidt, Rene; Oberthuer, André; Juraeva, Dilafruz; Brors, Benedikt; Engesser, Anne; Kahlert, Yvonne; Volland, Ruth; Bartenhagen, Christoph; Simon, Thorsten; Berthold, Frank; Hero, Barbara; Faldum, Andreas; Fischer, Matthias

    2017-12-01

    Current risk stratification systems for neuroblastoma patients consider clinical, histopathological, and genetic variables, and additional prognostic markers have been proposed in recent years. We here sought to select highly informative covariates in a multistep strategy based on consecutive Cox regression models, resulting in a risk score that integrates hazard ratios of prognostic variables. A cohort of 695 neuroblastoma patients was divided into a discovery set (n=75) for multigene predictor generation, a training set (n=411) for risk score development, and a validation set (n=209). Relevant prognostic variables were identified by stepwise multivariable L1-penalized least absolute shrinkage and selection operator (LASSO) Cox regression, followed by backward selection in multivariable Cox regression, and then integrated into a novel risk score. The variables stage, age, MYCN status, and two multigene predictors, NB-th24 and NB-th44, were selected as independent prognostic markers by LASSO Cox regression analysis. Following backward selection, only the multigene predictors were retained in the final model. Integration of these classifiers in a risk scoring system distinguished three patient subgroups that differed substantially in their outcome. The scoring system discriminated patients with diverging outcome in the validation cohort (5-year event-free survival, 84.9±3.4 vs 63.6±14.5 vs 31.0±5.4; P<.001), and its prognostic value was validated by multivariable analysis. We here propose a translational strategy for developing risk assessment systems based on hazard ratios of relevant prognostic variables. Our final neuroblastoma risk score comprised two multigene predictors only, supporting the notion that molecular properties of the tumor cells strongly impact clinical courses of neuroblastoma patients. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.

  1. The influence of API concentration on the roller compaction process: modeling and prediction of the post compacted ribbon, granule and tablet properties using multivariate data analysis.

    PubMed

    Boersen, Nathan; Carvajal, M Teresa; Morris, Kenneth R; Peck, Garnet E; Pinal, Rodolfo

    2015-01-01

    While previous research has demonstrated roller compaction operating parameters strongly influence the properties of the final product, a greater emphasis might be placed on the raw material attributes of the formulation. There were two main objectives to this study. First, to assess the effects of different process variables on the properties of the obtained ribbons and downstream granules produced from the rolled compacted ribbons. Second, was to establish if models obtained with formulations of one active pharmaceutical ingredient (API) could predict the properties of similar formulations in terms of the excipients used, but with a different API. Tolmetin and acetaminophen, chosen for their different compaction properties, were roller compacted on Fitzpatrick roller compactor using the same formulation. Models created using tolmetin and tested using acetaminophen. The physical properties of the blends, ribbon, granule and tablet were characterized. Multivariate analysis using partial least squares was used to analyze all data. Multivariate models showed that the operating parameters and raw material attributes were essential in the prediction of ribbon porosity and post-milled particle size. The post compacted ribbon and granule attributes also significantly contributed to the prediction of the tablet tensile strength. Models derived using tolmetin could reasonably predict the ribbon porosity of a second API. After further processing, the post-milled ribbon and granules properties, rather than the physical attributes of the formulation were needed to predict downstream tablet properties. An understanding of the percolation threshold of the formulation significantly improved the predictive ability of the models.

  2. Health-related quality-of-life parameters as independent prognostic factors in advanced or metastatic bladder cancer.

    PubMed

    Roychowdhury, D F; Hayden, A; Liepa, A M

    2003-02-15

    This retrospective analysis examined prognostic significance of health-related quality-of-life (HRQoL) parameters combined with baseline clinical factors on outcomes (overall survival, time to progressive disease, and time to treatment failure) in bladder cancer. Outcome and HRQoL (European Organization for Research and Treatment of Cancer Quality of Life Questionnaire C30) data were collected prospectively in a phase III study assessing gemcitabine and cisplatin versus methotrexate, vinblastine, doxorubicin, and cisplatin in locally advanced or metastatic bladder cancer. Prespecified baseline clinical factors (performance status, tumor-node-metastasis staging, visceral metastases [VM], alkaline phosphatase [AP] level, number of metastatic sites, prior radiotherapy, disease measurability, sex, time from diagnosis, and sites of disease) and selected HRQoL parameters (global QoL; all functional scales; symptoms: pain, fatigue, insomnia, dyspnea, anorexia) were evaluated using Cox's proportional hazards model. Factors with individual prognostic value (P <.05) on outcomes in univariate models were assessed for joint prognostic value in a multivariate model. A final model was developed using a backward selection strategy. Patients with baseline HRQoL were included (364 of 405, 90%). The final model predicted longer survival with low/normal AP levels, no VM, high physical functioning, low role functioning, and no anorexia. Positive prognostic factors for time to progressive disease were good performance status, low/normal AP levels, no VM, and minimal fatigue; for time to treatment failure, they were low/normal AP levels, minimal fatigue, and no anorexia. Global QoL was a significant predictor of outcome in univariate analyses but was not retained in the multivariate model. HRQoL parameters are independent prognostic factors for outcome in advanced bladder cancer; their prognostic importance needs further evaluation.

  3. Risk prediction for myocardial infarction via generalized functional regression models.

    PubMed

    Ieva, Francesca; Paganoni, Anna M

    2016-08-01

    In this paper, we propose a generalized functional linear regression model for a binary outcome indicating the presence/absence of a cardiac disease with multivariate functional data among the relevant predictors. In particular, the motivating aim is the analysis of electrocardiographic traces of patients whose pre-hospital electrocardiogram (ECG) has been sent to 118 Dispatch Center of Milan (the Italian free-toll number for emergencies) by life support personnel of the basic rescue units. The statistical analysis starts with a preprocessing of ECGs treated as multivariate functional data. The signals are reconstructed from noisy observations. The biological variability is then removed by a nonlinear registration procedure based on landmarks. Thus, in order to perform a data-driven dimensional reduction, a multivariate functional principal component analysis is carried out on the variance-covariance matrix of the reconstructed and registered ECGs and their first derivatives. We use the scores of the Principal Components decomposition as covariates in a generalized linear model to predict the presence of the disease in a new patient. Hence, a new semi-automatic diagnostic procedure is proposed to estimate the risk of infarction (in the case of interest, the probability of being affected by Left Bundle Brunch Block). The performance of this classification method is evaluated and compared with other methods proposed in literature. Finally, the robustness of the procedure is checked via leave-j-out techniques. © The Author(s) 2013.

  4. A Web-based nomogram predicting para-aortic nodal metastasis in incompletely staged patients with endometrial cancer: a Korean Multicenter Study.

    PubMed

    Kang, Sokbom; Lee, Jong-Min; Lee, Jae-Kwan; Kim, Jae-Weon; Cho, Chi-Heum; Kim, Seok-Mo; Park, Sang-Yoon; Park, Chan-Yong; Kim, Ki-Tae

    2014-03-01

    The purpose of this study is to develop a Web-based nomogram for predicting the individualized risk of para-aortic nodal metastasis in incompletely staged patients with endometrial cancer. From 8 institutions, the medical records of 397 patients who underwent pelvic and para-aortic lymphadenectomy as a surgical staging procedure were retrospectively reviewed. A multivariate logistic regression model was created and internally validated by rigorous bootstrap resampling methods. Finally, the model was transformed into a user-friendly Web-based nomogram (http://http://www.kgog.org/nomogram/empa001.html). The rate of para-aortic nodal metastasis was 14.4% (57/397 patients). Using a stepwise variable selection, 4 variables including deep myometrial invasion, non-endometrioid subtype, lymphovascular space invasion, and log-transformed CA-125 levels were finally adopted. After 1000 repetitions of bootstrapping, all of these 4 variables retained a significant association with para-aortic nodal metastasis in the multivariate analysis-deep myometrial invasion (P = 0.001), non-endometrioid histologic subtype (P = 0.034), lymphovascular space invasion (P = 0.003), and log-transformed serum CA-125 levels (P = 0.004). The model showed good discrimination (C statistics = 0.87; 95% confidence interval, 0.82-0.92) and accurate calibration (Hosmer-Lemeshow P = 0.74). This nomogram showed good performance in predicting para-aortic metastasis in patients with endometrial cancer. The tool may be useful in determining the extent of lymphadenectomy after incomplete surgery.

  5. Herd factors associated with dairy cow mortality.

    PubMed

    McConnel, C; Lombard, J; Wagner, B; Kopral, C; Garry, F

    2015-08-01

    Summary studies of dairy cow removal indicate increasing levels of mortality over the past several decades. This poses a serious problem for the US dairy industry. The objective of this project was to evaluate associations between facilities, herd management practices, disease occurrence and death rates on US dairy operations through an analysis of the National Animal Health Monitoring System's Dairy 2007 survey. The survey included farms in 17 states that represented 79.5% of US dairy operations and 82.5% of the US dairy cow population. During the first phase of the study operations were randomly selected from a sampling list maintained by the National Agricultural Statistics Service. Only farms that participated in phase I and had 30 or more dairy cows were eligible to participate in phase II. In total, 459 farms had complete data for all selected variables and were included in this analysis. Univariable associations between dairy cow mortality and 162 a priori identified operation-level management practices or characteristics were evaluated. Sixty of the 162 management factors explored in the univariate analysis met initial screening criteria and were further evaluated in a multivariable model exploring more complex relationships. The final weighted, negative binomial regression model included six variables. Based on the incidence rate ratio, this model predicted 32.0% less mortality for operations that vaccinated heifers for at least one of the following: bovine viral diarrhea, infectious bovine rhinotracheitis, parainfluenza 3, bovine respiratory syncytial virus, Haemophilus somnus, leptospirosis, Salmonella, Escherichia coli or clostridia. The final multivariable model also predicted a 27.0% increase in mortality for operations from which a bulk tank milk sample tested ELISA positive for bovine leukosis virus. Additionally, an 18.0% higher mortality was predicted for operations that used necropsies to determine the cause of death for some proportion of dead dairy cows. The final model also predicted that increased proportions of dairy cows with clinical mastitis and infertility problems were associated with increased mortality. Finally, an increase in mortality was predicted to be associated with an increase in the proportion of lame or injured permanently removed dairy cows. In general terms, this model identified that mortality was associated with reproductive problems, non-infectious postpartum disease, infectious disease and infectious disease prevention, and information derived from postmortem evaluations. Ultimately, addressing excessive mortality levels requires a concerted effort that recognizes and appropriately manages the numerous and diverse underlying risks.

  6. Real-Time Onboard Global Nonlinear Aerodynamic Modeling from Flight Data

    NASA Technical Reports Server (NTRS)

    Brandon, Jay M.; Morelli, Eugene A.

    2014-01-01

    Flight test and modeling techniques were developed to accurately identify global nonlinear aerodynamic models onboard an aircraft. The techniques were developed and demonstrated during piloted flight testing of an Aermacchi MB-326M Impala jet aircraft. Advanced piloting techniques and nonlinear modeling techniques based on fuzzy logic and multivariate orthogonal function methods were implemented with efficient onboard calculations and flight operations to achieve real-time maneuver monitoring and analysis, and near-real-time global nonlinear aerodynamic modeling and prediction validation testing in flight. Results demonstrated that global nonlinear aerodynamic models for a large portion of the flight envelope were identified rapidly and accurately using piloted flight test maneuvers during a single flight, with the final identified and validated models available before the aircraft landed.

  7. Bayesian inference on risk differences: an application to multivariate meta-analysis of adverse events in clinical trials.

    PubMed

    Chen, Yong; Luo, Sheng; Chu, Haitao; Wei, Peng

    2013-05-01

    Multivariate meta-analysis is useful in combining evidence from independent studies which involve several comparisons among groups based on a single outcome. For binary outcomes, the commonly used statistical models for multivariate meta-analysis are multivariate generalized linear mixed effects models which assume risks, after some transformation, follow a multivariate normal distribution with possible correlations. In this article, we consider an alternative model for multivariate meta-analysis where the risks are modeled by the multivariate beta distribution proposed by Sarmanov (1966). This model have several attractive features compared to the conventional multivariate generalized linear mixed effects models, including simplicity of likelihood function, no need to specify a link function, and has a closed-form expression of distribution functions for study-specific risk differences. We investigate the finite sample performance of this model by simulation studies and illustrate its use with an application to multivariate meta-analysis of adverse events of tricyclic antidepressants treatment in clinical trials.

  8. A Penalized Likelihood Framework For High-Dimensional Phylogenetic Comparative Methods And An Application To New-World Monkeys Brain Evolution.

    PubMed

    Julien, Clavel; Leandro, Aristide; Hélène, Morlon

    2018-06-19

    Working with high-dimensional phylogenetic comparative datasets is challenging because likelihood-based multivariate methods suffer from low statistical performances as the number of traits p approaches the number of species n and because some computational complications occur when p exceeds n. Alternative phylogenetic comparative methods have recently been proposed to deal with the large p small n scenario but their use and performances are limited. Here we develop a penalized likelihood framework to deal with high-dimensional comparative datasets. We propose various penalizations and methods for selecting the intensity of the penalties. We apply this general framework to the estimation of parameters (the evolutionary trait covariance matrix and parameters of the evolutionary model) and model comparison for the high-dimensional multivariate Brownian (BM), Early-burst (EB), Ornstein-Uhlenbeck (OU) and Pagel's lambda models. We show using simulations that our penalized likelihood approach dramatically improves the estimation of evolutionary trait covariance matrices and model parameters when p approaches n, and allows for their accurate estimation when p equals or exceeds n. In addition, we show that penalized likelihood models can be efficiently compared using Generalized Information Criterion (GIC). We implement these methods, as well as the related estimation of ancestral states and the computation of phylogenetic PCA in the R package RPANDA and mvMORPH. Finally, we illustrate the utility of the new proposed framework by evaluating evolutionary models fit, analyzing integration patterns, and reconstructing evolutionary trajectories for a high-dimensional 3-D dataset of brain shape in the New World monkeys. We find a clear support for an Early-burst model suggesting an early diversification of brain morphology during the ecological radiation of the clade. Penalized likelihood offers an efficient way to deal with high-dimensional multivariate comparative data.

  9. Finding the multipath propagation of multivariable crude oil prices using a wavelet-based network approach

    NASA Astrophysics Data System (ADS)

    Jia, Xiaoliang; An, Haizhong; Sun, Xiaoqi; Huang, Xuan; Gao, Xiangyun

    2016-04-01

    The globalization and regionalization of crude oil trade inevitably give rise to the difference of crude oil prices. The understanding of the pattern of the crude oil prices' mutual propagation is essential for analyzing the development of global oil trade. Previous research has focused mainly on the fuzzy long- or short-term one-to-one propagation of bivariate oil prices, generally ignoring various patterns of periodical multivariate propagation. This study presents a wavelet-based network approach to help uncover the multipath propagation of multivariable crude oil prices in a joint time-frequency period. The weekly oil spot prices of the OPEC member states from June 1999 to March 2011 are adopted as the sample data. First, we used wavelet analysis to find different subseries based on an optimal decomposing scale to describe the periodical feature of the original oil price time series. Second, a complex network model was constructed based on an optimal threshold selection to describe the structural feature of multivariable oil prices. Third, Bayesian network analysis (BNA) was conducted to find the probability causal relationship based on periodical structural features to describe the various patterns of periodical multivariable propagation. Finally, the significance of the leading and intermediary oil prices is discussed. These findings are beneficial for the implementation of periodical target-oriented pricing policies and investment strategies.

  10. Biological and Behavioral Risks for Incident Chlamydia trachomatis Infection in a Prospective Cohort

    PubMed Central

    Hwang, Loris Y.; Ma, Yifei; Moscicki, Anna-Barbara

    2014-01-01

    Objective To identify biological and behavioral risks for incident Chlamydia trachomatis among a prospective cohort of young women followed frequently. Methods Our cohort of 629 women from two outpatient sites was seen every 4 months (October 2000 through April 2012) for behavioral interviews and infection testing. C trachomatis was tested annually, and anytime patients reported symptoms or possible exposure using commercial nucleic acid amplification tests. Analyses excluded baseline prevalent C trachomatis infections. Risk factors for incident C trachomatis were assessed using Cox proportional hazards models. Significant risks (p<0.10) from bivariate models were entered in a multivariate model, adjusted for four covariates chosen a priori (age, race or ethnicity, condom use, study site). Backwards step-wise elimination produced a final parsimonious model retaining significant variables (p<0.05) and the four adjustment variables. Results The 629 women attended 9,594 total visits. Median follow-up time was 6.9 years (interquartile range 3.2-9.8), during which 97 (15%) women had incident C trachomatis . In the final multivariate model, incident C trachomatis was independently associated with HPV at the preceding visit (p<0.01), smoking (p=0.02), and weekly use of substances besides alcohol and marijuana (p<0.01) since prior visit. Among 207 women with available colpophotographs (1,742 visits), cervical ectopy was not a significant risk factor (p range=0.16-0.39 for ectopy as continuous and ordinal variables). Conclusion Novel risks for C trachomatis include preceding HPV, smoking, and substance use, which may reflect both biological and behavioral mechanisms of risk, such as immune modulation, higher-risk sexual networks, or both. Improved understanding of the biological bases for C trachomatis risk would inform our strategies for C trachomatis control. PMID:25437724

  11. Climate and Leishmaniasis in French Guiana

    PubMed Central

    Roger, Amaury; Nacher, Mathieu; Hanf, Matthieu; Drogoul, Anne Sophie; Adenis, Antoine; Basurko, Celia; Dufour, Julie; Sainte Marie, Dominique; Blanchet, Denis; Simon, Stephane; Carme, Bernard; Couppié, Pierre

    2013-01-01

    To study the link between climatic variables and the incidence of leishmaniasis a study was conducted in Cayenne, French Guiana. Patients infected between January 1994 and December 2010. Meteorological data were studied in relation to the incidence of leishmaniasis using an ARIMA model. In the final model, the infections were negatively correlated with rainfall (with a 2-month lag) and with the number of days with rainfall > 50 mm (lags of 4 and 7 months). The variables that were positively correlated were temperature and the Multivariate El Niño Southern Oscillation Index with lags of 8 and 4 months, respectively. Significantly greater correlations were observed in March for rainfall and in November for the Multivariate El Niño/Southern Oscillation Index. Climate thus seems to be a non-negligible explanatory variable for the fluctuations of leishmaniasis. A decrease in rainfall is linked to increased cases 2 months later. This easily perceptible point could lead to an interesting prevention message. PMID:23939706

  12. Novel active contour model based on multi-variate local Gaussian distribution for local segmentation of MR brain images

    NASA Astrophysics Data System (ADS)

    Zheng, Qiang; Li, Honglun; Fan, Baode; Wu, Shuanhu; Xu, Jindong

    2017-12-01

    Active contour model (ACM) has been one of the most widely utilized methods in magnetic resonance (MR) brain image segmentation because of its ability of capturing topology changes. However, most of the existing ACMs only consider single-slice information in MR brain image data, i.e., the information used in ACMs based segmentation method is extracted only from one slice of MR brain image, which cannot take full advantage of the adjacent slice images' information, and cannot satisfy the local segmentation of MR brain images. In this paper, a novel ACM is proposed to solve the problem discussed above, which is based on multi-variate local Gaussian distribution and combines the adjacent slice images' information in MR brain image data to satisfy segmentation. The segmentation is finally achieved through maximizing the likelihood estimation. Experiments demonstrate the advantages of the proposed ACM over the single-slice ACM in local segmentation of MR brain image series.

  13. Evolutionary grinding model for nanometric control of surface roughness for aspheric optical surfaces.

    PubMed

    Han, Jeong-Yeol; Kim, Sug-Whan; Han, Inwoo; Kim, Geon-Hee

    2008-03-17

    A new evolutionary grinding process model has been developed for nanometric control of material removal from an aspheric surface of Zerodur substrate. The model incorporates novel control features such as i) a growing database; ii) an evolving, multi-variable regression equation; and iii) an adaptive correction factor for target surface roughness (Ra) for the next machine run. This process model demonstrated a unique evolutionary controllability of machining performance resulting in the final grinding accuracy (i.e. averaged difference between target and measured surface roughness) of -0.2+/-2.3(sigma) nm Ra over seven trial machine runs for the target surface roughness ranging from 115 nm to 64 nm Ra.

  14. A new multivariate zero-adjusted Poisson model with applications to biomedicine.

    PubMed

    Liu, Yin; Tian, Guo-Liang; Tang, Man-Lai; Yuen, Kam Chuen

    2018-05-25

    Recently, although advances were made on modeling multivariate count data, existing models really has several limitations: (i) The multivariate Poisson log-normal model (Aitchison and Ho, ) cannot be used to fit multivariate count data with excess zero-vectors; (ii) The multivariate zero-inflated Poisson (ZIP) distribution (Li et al., 1999) cannot be used to model zero-truncated/deflated count data and it is difficult to apply to high-dimensional cases; (iii) The Type I multivariate zero-adjusted Poisson (ZAP) distribution (Tian et al., 2017) could only model multivariate count data with a special correlation structure for random components that are all positive or negative. In this paper, we first introduce a new multivariate ZAP distribution, based on a multivariate Poisson distribution, which allows the correlations between components with a more flexible dependency structure, that is some of the correlation coefficients could be positive while others could be negative. We then develop its important distributional properties, and provide efficient statistical inference methods for multivariate ZAP model with or without covariates. Two real data examples in biomedicine are used to illustrate the proposed methods. © 2018 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  15. Growth status of Korean orphans raised in the affluent West: anthropometric trend, multivariate determinants, and descriptive comparison with their North and South Korean peers.

    PubMed

    Schwekendiek, Daniel J

    2017-04-01

    This paper investigates the trend in height among adult Korean orphans who were adopted in early life into affluent Western nations. Final heights of 148 females were analyzed based on a Korean government survey conducted in 2008. Height of the orphans was descriptively compared against final heights of South and North Koreans. Furthermore, statistical determinants of orphan height were investigated in multivariate regressions. Mean height of Korean orphans was 160.44 cm (SD 5.89), which was higher than that of South Koreans at 158.83 cm (SD 5.01). Both Korean orphans and South Koreans were taller than North Koreans at 155.30 cm (SD 4.94). However, height of Korean orphans stagnated at around 160-161 cm while those of North and South Koreans improved over time. In the regression analysis, the socioeconomic status of the adoptive family was statistically significant in all models, while dummies for the adoptive nations and age at adoption were insignificant. This study shows that the mean final height of women experiencing extreme environmental improvements in early-life is capped at 160-161 cm, tentatively suggesting that social stress factors in the host nation or early-life factors in the birth nation might have offset some of the environmental enrichment effects achieved through intercountry adoption.

  16. Modeling and managing risk early in software development

    NASA Technical Reports Server (NTRS)

    Briand, Lionel C.; Thomas, William M.; Hetmanski, Christopher J.

    1993-01-01

    In order to improve the quality of the software development process, we need to be able to build empirical multivariate models based on data collectable early in the software process. These models need to be both useful for prediction and easy to interpret, so that remedial actions may be taken in order to control and optimize the development process. We present an automated modeling technique which can be used as an alternative to regression techniques. We show how it can be used to facilitate the identification and aid the interpretation of the significant trends which characterize 'high risk' components in several Ada systems. Finally, we evaluate the effectiveness of our technique based on a comparison with logistic regression based models.

  17. About the dark and bright sides of self-efficacy: workaholism and work engagement.

    PubMed

    Del Líbano, Mario; Llorens, Susana; Salanoval, Marisa; Schaufeli, Wilmar B

    2012-07-01

    Taking the Resources-Experiences-Demands Model (RED Model) by Salanova and colleagues as our starting point, we tested how work self-efficacy relates positively to negative (i.e., work overload and work-family conflict) and positive outcomes (i.e., job satisfaction and organizational commitment), through the mediating role of workaholism (health impairment process) and work engagement (motivational process). In a sample of 386 administrative staff from a Spanish University (65% women), Structural Equation Modeling provided full evidence for the research model. In addition, Multivariate Analyses of Variance showed that self-efficacy was only related positively to one of the two dimensions of workaholism, namely, working excessively. Finally, we discuss the theoretical and practical contributions in terms of the RED Model.

  18. Nonlinear multivariate and time series analysis by neural network methods

    NASA Astrophysics Data System (ADS)

    Hsieh, William W.

    2004-03-01

    Methods in multivariate statistical analysis are essential for working with large amounts of geophysical data, data from observational arrays, from satellites, or from numerical model output. In classical multivariate statistical analysis, there is a hierarchy of methods, starting with linear regression at the base, followed by principal component analysis (PCA) and finally canonical correlation analysis (CCA). A multivariate time series method, the singular spectrum analysis (SSA), has been a fruitful extension of the PCA technique. The common drawback of these classical methods is that only linear structures can be correctly extracted from the data. Since the late 1980s, neural network methods have become popular for performing nonlinear regression and classification. More recently, neural network methods have been extended to perform nonlinear PCA (NLPCA), nonlinear CCA (NLCCA), and nonlinear SSA (NLSSA). This paper presents a unified view of the NLPCA, NLCCA, and NLSSA techniques and their applications to various data sets of the atmosphere and the ocean (especially for the El Niño-Southern Oscillation and the stratospheric quasi-biennial oscillation). These data sets reveal that the linear methods are often too simplistic to describe real-world systems, with a tendency to scatter a single oscillatory phenomenon into numerous unphysical modes or higher harmonics, which can be largely alleviated in the new nonlinear paradigm.

  19. Relationship between water quality and macro-scale parameters (land use, erosion, geology, and population density) in the Siminehrood River Basin.

    PubMed

    Bostanmaneshrad, Farshid; Partani, Sadegh; Noori, Roohollah; Nachtnebel, Hans-Peter; Berndtsson, Ronny; Adamowski, Jan Franklin

    2018-10-15

    To date, few studies have investigated the simultaneous effects of macro-scale parameters (MSPs) such as land use, population density, geology, and erosion layers on micro-scale water quality variables (MSWQVs). This research focused on an evaluation of the relationship between MSPs and MSWQVs in the Siminehrood River Basin, Iran. In addition, we investigated the importance of water particle travel time (hydrological distance) on this relationship. The MSWQVs included 13 physicochemical and biochemical parameters observed at 15 stations during three seasons. Primary screening was performed by utilizing three multivariate statistical analyses (Pearson's correlation, cluster and discriminant analyses) in seven series of observed data. These series included three separate seasonal data, three two-season data, and aggregated three-season data for investigation of relationships between MSPs and MSWQVs. Coupled data (pairs of MSWQVs and MSPs) repeated in at least two out of three statistical analyses were selected for final screening. The primary screening results demonstrated significant relationships between land use and phosphorus, total solids and turbidity, erosion levels and electrical conductivity, and erosion and total solids. Furthermore, water particle travel time effects were considered through three geographical pattern definitions of distance for each MSP by using two weighting methods. To find effective MSP factors on MSWQVs, a multivariate linear regression analysis was employed. Then, preliminary equations that estimated MSWQVs were developed. The preliminary equations were modified to adaptive equations to obtain the final models. The final models indicated that a new metric, referred to as hydrological distance, provided better MSWQV estimation and water quality prediction compared to the National Sanitation Foundation Water Quality Index. Crown Copyright © 2018. Published by Elsevier B.V. All rights reserved.

  20. Estimation of Particulate Mass and Manganese Exposure Levels among Welders

    PubMed Central

    Hobson, Angela; Seixas, Noah; Sterling, David; Racette, Brad A.

    2011-01-01

    Background: Welders are frequently exposed to Manganese (Mn), which may increase the risk of neurological impairment. Historical exposure estimates for welding-exposed workers are needed for epidemiological studies evaluating the relationship between welding and neurological or other health outcomes. The objective of this study was to develop and validate a multivariate model to estimate quantitative levels of welding fume exposures based on welding particulate mass and Mn concentrations reported in the published literature. Methods: Articles that described welding particulate and Mn exposures during field welding activities were identified through a comprehensive literature search. Summary measures of exposure and related determinants such as year of sampling, welding process performed, type of ventilation used, degree of enclosure, base metal, and location of sampling filter were extracted from each article. The natural log of the reported arithmetic mean exposure level was used as the dependent variable in model building, while the independent variables included the exposure determinants. Cross-validation was performed to aid in model selection and to evaluate the generalizability of the models. Results: A total of 33 particulate and 27 Mn means were included in the regression analysis. The final model explained 76% of the variability in the mean exposures and included welding process and degree of enclosure as predictors. There was very little change in the explained variability and root mean squared error between the final model and its cross-validation model indicating the final model is robust given the available data. Conclusions: This model may be improved with more detailed exposure determinants; however, the relatively large amount of variance explained by the final model along with the positive generalizability results of the cross-validation increases the confidence that the estimates derived from this model can be used for estimating welder exposures in absence of individual measurement data. PMID:20870928

  1. A retrospective review of fall risk factors in the bone marrow transplant inpatient service.

    PubMed

    Vela, Cory M; Grate, Lisa M; McBride, Ali; Devine, Steven; Andritsos, Leslie A

    2018-06-01

    Purpose The purpose of this study was to compare medications and potential risk factors between patients who experienced a fall during hospitalization compared to those who did not fall while admitted to the Blood and Marrow Transplant inpatient setting at The James Cancer Hospital. Secondary objectives included evaluation of transplant-related disease states and medications in the post-transplant setting that may lead to an increased risk of falls, post-fall variables, and number of tests ordered after a fall. Methods This retrospective, case-control study matched patients in a 2:1 ratio of nonfallers to fallers. Data from The Ohio State University Wexner Medical Center (OSUWMC) reported fall events and patient electronic medical records were utilized. A total of 168 adult Blood and Marrow Transplant inpatients with a hematological malignancy diagnosis were evaluated from 1 January 2010 to 30 September 2012. Results Univariable and multivariable conditional logistic regression models were used to assess the relationship between potential predictor variables of interest and falls. Variables that were found to be significant predictors of falls from the univariable models include age group, incontinence, benzodiazepines, corticosteroids, anticonvulsants and antidepressants, and number of days status-post transplant. When considered for a multivariable model age group, corticosteroids, and a cancer diagnosis of leukemia were significant in the final model. Conclusion Recent medication utilization such as benzodiazepines, anticonvulsants, corticosteroids, and antidepressants placed patients at a higher risk of experiencing a fall. Other significant factors identified from a multivariable analysis found were patients older than age 65, patients with recent corticosteroid administration and a cancer diagnosis of leukemia.

  2. Novel high-resolution computed tomography-based radiomic classifier for screen-identified pulmonary nodules in the National Lung Screening Trial.

    PubMed

    Peikert, Tobias; Duan, Fenghai; Rajagopalan, Srinivasan; Karwoski, Ronald A; Clay, Ryan; Robb, Richard A; Qin, Ziling; Sicks, JoRean; Bartholmai, Brian J; Maldonado, Fabien

    2018-01-01

    Optimization of the clinical management of screen-detected lung nodules is needed to avoid unnecessary diagnostic interventions. Herein we demonstrate the potential value of a novel radiomics-based approach for the classification of screen-detected indeterminate nodules. Independent quantitative variables assessing various radiologic nodule features such as sphericity, flatness, elongation, spiculation, lobulation and curvature were developed from the NLST dataset using 726 indeterminate nodules (all ≥ 7 mm, benign, n = 318 and malignant, n = 408). Multivariate analysis was performed using least absolute shrinkage and selection operator (LASSO) method for variable selection and regularization in order to enhance the prediction accuracy and interpretability of the multivariate model. The bootstrapping method was then applied for the internal validation and the optimism-corrected AUC was reported for the final model. Eight of the originally considered 57 quantitative radiologic features were selected by LASSO multivariate modeling. These 8 features include variables capturing Location: vertical location (Offset carina centroid z), Size: volume estimate (Minimum enclosing brick), Shape: flatness, Density: texture analysis (Score Indicative of Lesion/Lung Aggression/Abnormality (SILA) texture), and surface characteristics: surface complexity (Maximum shape index and Average shape index), and estimates of surface curvature (Average positive mean curvature and Minimum mean curvature), all with P<0.01. The optimism-corrected AUC for these 8 features is 0.939. Our novel radiomic LDCT-based approach for indeterminate screen-detected nodule characterization appears extremely promising however independent external validation is needed.

  3. Multivariate Strategies in Functional Magnetic Resonance Imaging

    ERIC Educational Resources Information Center

    Hansen, Lars Kai

    2007-01-01

    We discuss aspects of multivariate fMRI modeling, including the statistical evaluation of multivariate models and means for dimensional reduction. In a case study we analyze linear and non-linear dimensional reduction tools in the context of a "mind reading" predictive multivariate fMRI model.

  4. Investigating College and Graduate Students' Multivariable Reasoning in Computational Modeling

    ERIC Educational Resources Information Center

    Wu, Hsin-Kai; Wu, Pai-Hsing; Zhang, Wen-Xin; Hsu, Ying-Shao

    2013-01-01

    Drawing upon the literature in computational modeling, multivariable reasoning, and causal attribution, this study aims at characterizing multivariable reasoning practices in computational modeling and revealing the nature of understanding about multivariable causality. We recruited two freshmen, two sophomores, two juniors, two seniors, four…

  5. Secular trends and factors associated with overweight among Brazilian preschool children: PNSN-1989, PNDS-1996, and 2006/07.

    PubMed

    Silveira, Jonas Augusto C; Colugnati, Fernando Antônio B; Cocetti, Monize; Taddei, José Augusto A C

    2014-01-01

    to describe the secular trends in overweight among preschool children in the years 1989, 1996, and 2006, and to identify risk factors associated with this condition in 2006. anthropometric data from three surveys (1989, 1996, and 2006) with a representative sample of the population were analyzed. Overweight was defined as the weight-for-height Z-score. The multivariable models of overweight association with risk factors were generated by Poisson regression, and the estimates were shown as prevalence ratios with their respective 95% confidence intervals (PR [95% CI]). throughout the 17-year period studied, the relative prevalence of overweight in preschoolers increased by 160% in Brazil, representing an increase of 9.4% per year. Based on data from the National Survey on Demography and Health of Women and Children - 2006/07, four multivariable models were created (macro-environmental, maternal, individual, and final model) assuming hierarchy among the risk factors. In the final model, only the following remained associated with overweight: regions South/Southeast (1.55 [1.17 to 2.06]), middle-class (1.35 [1.02 to 1.77]), maternal obesity (1.66 [1.22 to 2.27]), birth weight ≥ 3.9kg (1.87 [1.31 to 2.67]), and being an only child or having only one sibling (1.81 [1.31 to 2.49]). the prevalence of overweight among preschool children in Brazil has increased dramatically over the past 17 years, and it was higher in the 1996-2006 period. Future strategies for prevention and control of overweight in public health should focus or intensify actions in communities that are characterized by the presence of the risks identified in the present study. Copyright © 2013 Sociedade Brasileira de Pediatria. Published by Elsevier Editora Ltda. All rights reserved.

  6. Fruit and vegetable consumption - the influence of aspects associated with trust in food and safety and quality of food.

    PubMed

    Taylor, Anne W; Coveney, John; Ward, Paul R; Henderson, Julie; Meyer, Samantha B; Pilkington, Rhiannon; Gill, Tiffany K

    2012-02-01

    To profile adults who eat less than the recommended servings of fruit and vegetables per day. Australia-wide population telephone survey on a random sample of the Australian population, with results analysed by univariate and multivariate models. Australia. One thousand one hundred and eight interviews, respondents' (49·3 % males) mean age was 45·12 (sd 17·63) years. Overall 54·8 % and 10·7 % were eating the recommended number of servings of fruit and vegetables. Variables included in the multivariate model indicating low fruit consumption included gender, age, employment, education and those who were less likely to consider the safety and quality of food as important. In regard to low vegetable consumption, people who were more likely to do the food shopping only 'some of the time' and have a high level of trust in groups of people such as immediate family, neighbours, doctors and different levels of government were included in the final model. They were also less likely to neither consider the safety and quality of food as important nor trust organisations/institutions such as the press, television and politicians. In the final model depicting both low fruit and low vegetable servings, sex, age and a low level of importance with regard to safety and quality of food were included. To increase fruit and vegetable consumption, research into a broad range of determinants associated with behaviours should be coupled with a deeper understanding of the process associated with changing behaviours. While levels of trust are related to behaviour change, knowledge and attitudes about aspects associated with safety and quality of food are also of importance.

  7. A Multivariate Model for the Study of Parental Acceptance-Rejection and Child Abuse.

    ERIC Educational Resources Information Center

    Rohner, Ronald P.; Rohner, Evelyn C.

    This paper proposes a multivariate strategy for the study of parental acceptance-rejection and child abuse and describes a research study on parental rejection and child abuse which illustrates the advantages of using a multivariate, (rather than a simple-model) approach. The multivariate model is a combination of three simple models used to study…

  8. Characterizing the relationship between in-hospital measures and workers' compensation outcomes among severely injured construction workers using a data linkage strategy.

    PubMed

    Ruestow, Peter S; Friedman, Lee S

    2013-10-01

    To characterize the relationship between acute measures of severity and three important workers' compensation outcomes associated with a worker's ability to return to work and the cost of a work-related injury. Probabilistic data linkage of workers' compensation claims made by injured construction workers from 2000 to 2005 with two Illinois medical record registries. Multivariable robust regression models were built to assess the relationship between three in-hospital measures and three outcomes captured in the Workers' Compensation data. In the final multivariable models, a categorical increase in injury severity was associated with an extra $7,830 (95% CI: $4,729-$10,930) of monetary compensation awarded, though not with temporary total disability (TTD) or permanent partial disability (PPD). Our models also predicted that every extra day spent in the hospital results in an increase of 0.51 (95% CI: 0.23-0.80) weeks of TTD and an extra $1,248 (95% CI: $810-$1,686) in monetary compensation. Discharge to an intermediate care facility following the initial hospitalization was associated with an increase of 8.15 (95% CI: 4.03-12.28) weeks of TTD and an increase of $23,440 (95% CI: $17,033-$29,847) in monetary compensation. We were able to link data from the initial hospitalization for an injured worker with the final workers' compensation claims decision or settlement. The in-hospital measures of injury severity were associated with total monetary compensation as captured in the workers' compensation process. Copyright © 2013 Wiley Periodicals, Inc.

  9. Does the Presence of Blood in the Catheter or the Degree of Difficulty of Embryo Transfer Affect Live Birth?

    PubMed

    Plowden, Torie C; Hill, Micah J; Miles, Shana M; Hoyt, Benjamin; Yauger, Belinda; Segars, James H; Csokmay, John M; Chason, Rebecca J

    2017-05-01

    The technique used for embryo transfer (ET) can affect implantation. Prior research that evaluated the effect of postprocedural blood of the transfer catheter tip have yielded mixed results, and it is unclear whether this is actually a marker of difficulty of the transfer. Our objective was to estimate the effect of blood at the time of ET and the difficulty of ET on live birth rates (LBR). This retrospective cohort study utilized generalized estimating equations (GEEs) with nesting for repeated cycles for all analyses. Univariate modeling was performed and a final multivariate (adjusted) GEE model accounted for all significant confounders. Embryo transfers were subjectively graded (easy, medium, or hard) by a physician at the time of transfer. The presence of blood at ET was associated with more difficult ETs, retained embryos, and presence of mucous in the catheter. In the univariate analysis, ET with blood was not associated with live birth, while the degree of difficulty for ET had a negative impact on LBR. In the final multivariate GEE model, which accounts for repeated cycles from a patient, the only factors associated with an increased LBR were the degree of difficulty of the ET, female age, and blastocyst transfer. After controlling for confounding variables, the presence of blood in the transfer catheter was not associated with the likelihood of pregnancy and thus was not an independent predictor of cycle outcome. This indicates that the difficulty of the transfer itself was a strong negative predictor of pregnancy.

  10. SEM-PLS Analysis of Inhibiting Factors of Cost Performance for Large Construction Projects in Malaysia: Perspective of Clients and Consultants

    PubMed Central

    Memon, Aftab Hameed; Rahman, Ismail Abdul

    2014-01-01

    This study uncovered inhibiting factors to cost performance in large construction projects of Malaysia. Questionnaire survey was conducted among clients and consultants involved in large construction projects. In the questionnaire, a total of 35 inhibiting factors grouped in 7 categories were presented to the respondents for rating significant level of each factor. A total of 300 questionnaire forms were distributed. Only 144 completed sets were received and analysed using advanced multivariate statistical software of Structural Equation Modelling (SmartPLS v2). The analysis involved three iteration processes where several of the factors were deleted in order to make the model acceptable. The result of the analysis found that R 2 value of the model is 0.422 which indicates that the developed model has a substantial impact on cost performance. Based on the final form of the model, contractor's site management category is the most prominent in exhibiting effect on cost performance of large construction projects. This finding is validated using advanced techniques of power analysis. This vigorous multivariate analysis has explicitly found the significant category which consists of several causative factors to poor cost performance in large construction projects. This will benefit all parties involved in construction projects for controlling cost overrun. PMID:24693227

  11. SEM-PLS analysis of inhibiting factors of cost performance for large construction projects in Malaysia: perspective of clients and consultants.

    PubMed

    Memon, Aftab Hameed; Rahman, Ismail Abdul

    2014-01-01

    This study uncovered inhibiting factors to cost performance in large construction projects of Malaysia. Questionnaire survey was conducted among clients and consultants involved in large construction projects. In the questionnaire, a total of 35 inhibiting factors grouped in 7 categories were presented to the respondents for rating significant level of each factor. A total of 300 questionnaire forms were distributed. Only 144 completed sets were received and analysed using advanced multivariate statistical software of Structural Equation Modelling (SmartPLS v2). The analysis involved three iteration processes where several of the factors were deleted in order to make the model acceptable. The result of the analysis found that R(2) value of the model is 0.422 which indicates that the developed model has a substantial impact on cost performance. Based on the final form of the model, contractor's site management category is the most prominent in exhibiting effect on cost performance of large construction projects. This finding is validated using advanced techniques of power analysis. This vigorous multivariate analysis has explicitly found the significant category which consists of several causative factors to poor cost performance in large construction projects. This will benefit all parties involved in construction projects for controlling cost overrun.

  12. Predicting introductory programming performance: A multi-institutional multivariate study

    NASA Astrophysics Data System (ADS)

    Bergin, Susan; Reilly, Ronan

    2006-12-01

    A model for predicting student performance on introductory programming modules is presented. The model uses attributes identified in a study carried out at four third-level institutions in the Republic of Ireland. Four instruments were used to collect the data and over 25 attributes were examined. A data reduction technique was applied and a logistic regression model using 10-fold stratified cross validation was developed. The model used three attributes: Leaving Certificate Mathematics result (final mathematics examination at second level), number of hours playing computer games while taking the module and programming self-esteem. Prediction success was significant with 80% of students correctly classified. The model also works well on a per-institution level. A discussion on the implications of the model is provided and future work is outlined.

  13. Predicting surgical site infection after spine surgery: a validated model using a prospective surgical registry.

    PubMed

    Lee, Michael J; Cizik, Amy M; Hamilton, Deven; Chapman, Jens R

    2014-09-01

    The impact of surgical site infection (SSI) is substantial. Although previous study has determined relative risk and odds ratio (OR) values to quantify risk factors, these values may be difficult to translate to the patient during counseling of surgical options. Ideally, a model that predicts absolute risk of SSI, rather than relative risk or OR values, would greatly enhance the discussion of safety of spine surgery. To date, there is no risk stratification model that specifically predicts the risk of medical complication. The purpose of this study was to create and validate a predictive model for the risk of SSI after spine surgery. This study performs a multivariate analysis of SSI after spine surgery using a large prospective surgical registry. Using the results of this analysis, this study will then create and validate a predictive model for SSI after spine surgery. The patient sample is from a high-quality surgical registry from our two institutions with prospectively collected, detailed demographic, comorbidity, and complication data. An SSI that required return to the operating room for surgical debridement. Using a prospectively collected surgical registry of more than 1,532 patients with extensive demographic, comorbidity, surgical, and complication details recorded for 2 years after the surgery, we identified several risk factors for SSI after multivariate analysis. Using the beta coefficients from those regression analyses, we created a model to predict the occurrence of SSI after spine surgery. We split our data into two subsets for internal and cross-validation of our model. We created a predictive model based on our beta coefficients from our multivariate analysis. The final predictive model for SSI had a receiver-operator curve characteristic of 0.72, considered to be a fair measure. The final model has been uploaded for use on SpineSage.com. We present a validated model for predicting SSI after spine surgery. The value in this model is that it gives the user an absolute percent likelihood of SSI after spine surgery based on the patient's comorbidity profile and invasiveness of surgery. Patients are far more likely to understand an absolute percentage, rather than relative risk and confidence interval values. A model such as this is of paramount importance in counseling patients and enhancing the safety of spine surgery. In addition, a tool such as this can be of great use particularly as health care trends toward pay for performance, quality metrics (such as SSI), and risk adjustment. To facilitate the use of this model, we have created a Web site (SpineSage.com) where users can enter patient data to determine likelihood for SSI. Copyright © 2014 Elsevier Inc. All rights reserved.

  14. Basin Characterisation by Means of Joint Inversion of Electromagnetic Geophysical Data, Borehole Data and Multivariate Statistical Methods: The Loop Head Peninsula, Western Ireland, Case Study

    NASA Astrophysics Data System (ADS)

    Campanya, J. L.; Ogaya, X.; Jones, A. G.; Rath, V.; McConnell, B.; Haughton, P.; Prada, M.

    2016-12-01

    The Science Foundation Ireland funded project IRECCSEM project (www.ireccsem.ie) aims to evaluate Ireland's potential for onshore carbon sequestration in saline aquifers by integrating new electromagnetic geophysical data with existing geophysical and geological data. One of the objectives of this component of IRECCSEM is to characterise the subsurface beneath the Loop Head Peninsula (part of Clare Basin, Co. Clare, Ireland), and identify major electrical resistivity structures that can guide an interpretation of the carbon sequestration potential of this area. During the summer of 2014, a magnetotelluric (MT) survey was carried out on the Loop Head Peninsula, and data from a total of 140 sites were acquired, including audio-magnetotelluric (AMT), and broadband magnetotelluric (BBMT). The dataset was used to generate shallow three-dimensional (3-D) electrical resistivity models constraining the subsurface to depths of up to 3.5 km. The three-dimensional (3-D) joint inversions were performed using three different types of electromagnetic data: MT impedance tensor (Z), geomagnetic transfer functions (T), and inter-station horizontal magnetic transfer-functions (H). The interpretation of the results was complemented with second-derivative models of the resulting electrical resistivity models, and a quantitative comparison with borehole data using multivariate statistical methods. Second-derivative models were used to define the main interfaces between the geoelectrical structures, facilitating superior comparison with geological and seismic results, and also reducing the influence of the colour scale when interpreting the results. Specific analysis was performed to compare the extant borehole data with the electrical resistivity model, identifying those structures that are better characterised by the resistivity model. Finally, the electrical resistivity model was also used to propagate some of the physical properties measured in the borehole, when a good relation was possible between the different types of data. The final results were compared with independent geological and geophysical data for a high-quality interpretation.

  15. How do Indonesian youth perceive cigarette advertising? A cross-sectional study among Indonesian high school students

    PubMed Central

    Prabandari, Yayi Suryo; Dewi, Arika

    2016-01-01

    Background Previous studies have reported an association between cigarette advertising and smoking behavior. Although this has been reported extensively in the West, it has been reported less in Southeast Asian countries that have not completely banned tobacco advertising promotion and sponsorship (TAPS). Indonesia is the only ASEAN country that has not ratified the Framework Convention on Tobacco Control, so TAPS regulation is limited. This study aimed to assess the association between youths’ perceptions of cigarette ads and smoking initiation. Design We conducted a cross-sectional survey among 2,115 high school students aged 13–18 years in Yogyakarta, Indonesia. A self-administered questionnaire was distributed to gauge the perception of cigarette ads and initiation to smoking. We calculated the odds ratio (OR) between the perception of cigarette ads and smoking initiation, adjusting for sociodemographic and psychosocial variables. The sociodemographic variables included in the final model were age and sex. Results The final multivariate model showed an association between perception of tobacco ads encouraging youths to smoke and smoking initiation (OR 2.70) and current smoking (OR 7.63). Attitude toward TAPS was associated with smoking initiation (OR 1.51) and current smoking (OR 3.32). Exposure to cigarette ads had an association with smoking initiation only (OR 1.27) and did not have an association with current smoking. Having friends and family who smoked was associated with smoking initiation and current smoking in the final multivariate model. Smoking initiation and current smoking were also related to the susceptibility to smoke. Conclusions This study revealed that cigarette ads were perceived as encouraging youths to smoke and that smoking status was consistently associated with perception of cigarette ads targeted at youths, attitude toward TAPS, and susceptibility as well as smoking friends and family. Regulations to ban TAPS, particularly cigarette ads for preventing youths from smoking, should be adopted rapidly in Indonesia, where tobacco control remains limited. PMID:27578600

  16. PM10 modeling in the Oviedo urban area (Northern Spain) by using multivariate adaptive regression splines

    NASA Astrophysics Data System (ADS)

    Nieto, Paulino José García; Antón, Juan Carlos Álvarez; Vilán, José Antonio Vilán; García-Gonzalo, Esperanza

    2014-10-01

    The aim of this research work is to build a regression model of the particulate matter up to 10 micrometers in size (PM10) by using the multivariate adaptive regression splines (MARS) technique in the Oviedo urban area (Northern Spain) at local scale. This research work explores the use of a nonparametric regression algorithm known as multivariate adaptive regression splines (MARS) which has the ability to approximate the relationship between the inputs and outputs, and express the relationship mathematically. In this sense, hazardous air pollutants or toxic air contaminants refer to any substance that may cause or contribute to an increase in mortality or serious illness, or that may pose a present or potential hazard to human health. To accomplish the objective of this study, the experimental dataset of nitrogen oxides (NOx), carbon monoxide (CO), sulfur dioxide (SO2), ozone (O3) and dust (PM10) were collected over 3 years (2006-2008) and they are used to create a highly nonlinear model of the PM10 in the Oviedo urban nucleus (Northern Spain) based on the MARS technique. One main objective of this model is to obtain a preliminary estimate of the dependence between PM10 pollutant in the Oviedo urban area at local scale. A second aim is to determine the factors with the greatest bearing on air quality with a view to proposing health and lifestyle improvements. The United States National Ambient Air Quality Standards (NAAQS) establishes the limit values of the main pollutants in the atmosphere in order to ensure the health of healthy people. Firstly, this MARS regression model captures the main perception of statistical learning theory in order to obtain a good prediction of the dependence among the main pollutants in the Oviedo urban area. Secondly, the main advantages of MARS are its capacity to produce simple, easy-to-interpret models, its ability to estimate the contributions of the input variables, and its computational efficiency. Finally, on the basis of these numerical calculations, using the multivariate adaptive regression splines (MARS) technique, conclusions of this research work are exposed.

  17. Examining the impacts of increased corn production on groundwater quality using a coupled modeling system.

    PubMed

    Garcia, Valerie; Cooter, Ellen; Crooks, James; Hinckley, Brian; Murphy, Mark; Xing, Xiangnan

    2017-05-15

    This study demonstrates the value of a coupled chemical transport modeling system for investigating groundwater nitrate contamination responses associated with nitrogen (N) fertilizer application and increased corn production. The coupled Community Multiscale Air Quality Bidirectional and Environmental Policy Integrated Climate modeling system incorporates agricultural management practices and N exchange processes between the soil and atmosphere to estimate levels of N that may volatilize into the atmosphere, re-deposit, and seep or flow into surface and groundwater. Simulated values from this modeling system were used in a land-use regression model to examine associations between groundwater nitrate-N measurements and a suite of factors related to N fertilizer and groundwater nitrate contamination. Multi-variable modeling analysis revealed that the N-fertilizer rate (versus total) applied to irrigated (versus rainfed) grain corn (versus other crops) was the strongest N-related predictor variable of groundwater nitrate-N concentrations. Application of this multi-variable model considered groundwater nitrate-N concentration responses under two corn production scenarios. Findings suggest that increased corn production between 2002 and 2022 could result in 56% to 79% increase in areas vulnerable to groundwater nitrate-N concentrations ≥5mg/L. These above-threshold areas occur on soils with a hydraulic conductivity 13% higher than the rest of the domain. Additionally, the average number of animal feeding operations (AFOs) for these areas was nearly 5 times higher, and the mean N-fertilizer rate was 4 times higher. Finally, we found that areas prone to high groundwater nitrate-N concentrations attributable to the expansion scenario did not occur in new grid cells of irrigated grain-corn croplands, but were clustered around areas of existing corn crops. This application demonstrates the value of the coupled modeling system in developing spatially refined multi-variable models to provide information for geographic locations lacking complete observational data; and in projecting possible groundwater nitrate-N concentration outcomes under alternative future crop production scenarios. Published by Elsevier B.V.

  18. Extensions to Multivariate Space Time Mixture Modeling of Small Area Cancer Data.

    PubMed

    Carroll, Rachel; Lawson, Andrew B; Faes, Christel; Kirby, Russell S; Aregay, Mehreteab; Watjou, Kevin

    2017-05-09

    Oral cavity and pharynx cancer, even when considered together, is a fairly rare disease. Implementation of multivariate modeling with lung and bronchus cancer, as well as melanoma cancer of the skin, could lead to better inference for oral cavity and pharynx cancer. The multivariate structure of these models is accomplished via the use of shared random effects, as well as other multivariate prior distributions. The results in this paper indicate that care should be taken when executing these types of models, and that multivariate mixture models may not always be the ideal option, depending on the data of interest.

  19. Learning multivariate distributions by competitive assembly of marginals.

    PubMed

    Sánchez-Vega, Francisco; Younes, Laurent; Geman, Donald

    2013-02-01

    We present a new framework for learning high-dimensional multivariate probability distributions from estimated marginals. The approach is motivated by compositional models and Bayesian networks, and designed to adapt to small sample sizes. We start with a large, overlapping set of elementary statistical building blocks, or "primitives," which are low-dimensional marginal distributions learned from data. Each variable may appear in many primitives. Subsets of primitives are combined in a Lego-like fashion to construct a probabilistic graphical model; only a small fraction of the primitives will participate in any valid construction. Since primitives can be precomputed, parameter estimation and structure search are separated. Model complexity is controlled by strong biases; we adapt the primitives to the amount of training data and impose rules which restrict the merging of them into allowable compositions. The likelihood of the data decomposes into a sum of local gains, one for each primitive in the final structure. We focus on a specific subclass of networks which are binary forests. Structure optimization corresponds to an integer linear program and the maximizing composition can be computed for reasonably large numbers of variables. Performance is evaluated using both synthetic data and real datasets from natural language processing and computational biology.

  20. Multitrophic functional diversity predicts ecosystem functioning in experimental assemblages of estuarine consumers.

    PubMed

    Lefcheck, Jonathan S; Duffy, J Emmett

    2015-11-01

    The use of functional traits to explain how biodiversity affects ecosystem functioning has attracted intense interest, yet few studies have a priori altered functional diversity, especially in multitrophic communities. Here, we manipulated multivariate functional diversity of estuarine grazers and predators within multiple levels of species richness to test how species richness and functional diversity predicted ecosystem functioning in a multitrophic food web. Community functional diversity was a better predictor than species richness for the majority of ecosystem properties, based on generalized linear mixed-effects models. Combining inferences from eight traits into a single multivariate index increased prediction accuracy of these models relative to any individual trait. Structural equation modeling revealed that functional diversity of both grazers and predators was important in driving final biomass within trophic levels, with stronger effects observed for predators. We also show that different species drove different ecosystem responses, with evidence for both sampling effects and complementarity. Our study extends experimental investigations of functional trait diversity to a multilevel food web, and demonstrates that functional diversity can be more accurate and effective than species richness in predicting community biomass in a food web context.

  1. Nondestructive Handheld Fourier Transform Infrared (FT-IR) Analysis of Spectroscopic Changes and Multivariate Modeling of Thermally Degraded Plain Portland Cement Concrete and its Slag and Fly Ash-Based Analogs.

    PubMed

    Leung Tang, Pik; Alqassim, Mohammad; Nic Daéid, Niamh; Berlouis, Leonard; Seelenbinder, John

    2016-05-01

    Concrete is by far the world's most common construction material. Modern concrete is a mixture of industrial pozzolanic cement formulations and aggregate fillers. The former acts as the glue or binder in the final inorganic composite; however, when exposed to a fire the degree of concrete damage is often difficult to evaluate nondestructively. Fourier transform infrared (FT-IR) spectroscopy through techniques such as transmission, attenuated total reflectance, and diffuse reflectance have been rarely used to evaluate thermally damaged concrete. In this paper, we report on a study assessing the thermal damage of concrete via the use of a nondestructive handheld FT-IR with a diffuse reflectance sample interface. In situ measurements can be made on actual damaged areas, without the need for sample preparation. Separate multivariate models were developed to determine the equivalent maximal temperature endured for three common industrial concrete formulations. The concrete mixtures were successfully modeled displaying high predictive power as well as good specificity. This has potential uses in forensic investigation and remediation services particularly for fires in buildings. © The Author(s) 2016.

  2. Study of cyanotoxins presence from experimental cyanobacteria concentrations using a new data mining methodology based on multivariate adaptive regression splines in Trasona reservoir (Northern Spain).

    PubMed

    Garcia Nieto, P J; Sánchez Lasheras, F; de Cos Juez, F J; Alonso Fernández, J R

    2011-11-15

    There is an increasing need to describe cyanobacteria blooms since some cyanobacteria produce toxins, termed cyanotoxins. These latter can be toxic and dangerous to humans as well as other animals and life in general. It must be remarked that the cyanobacteria are reproduced explosively under certain conditions. This results in algae blooms, which can become harmful to other species if the cyanobacteria involved produce cyanotoxins. In this research work, the evolution of cyanotoxins in Trasona reservoir (Principality of Asturias, Northern Spain) was studied with success using the data mining methodology based on multivariate adaptive regression splines (MARS) technique. The results of the present study are two-fold. On one hand, the importance of the different kind of cyanobacteria over the presence of cyanotoxins in the reservoir is presented through the MARS model and on the other hand a predictive model able to forecast the possible presence of cyanotoxins in a short term was obtained. The agreement of the MARS model with experimental data confirmed the good performance of the same one. Finally, conclusions of this innovative research are exposed. Copyright © 2011 Elsevier B.V. All rights reserved.

  3. The role of traumatic event history in non-medical use of prescription drugs among a nationally representative sample of US adolescents.

    PubMed

    McCauley, Jenna L; Danielson, Carla Kmett; Amstadter, Ananda B; Ruggiero, Kenneth J; Resnick, Heidi S; Hanson, Rochelle F; Smith, Daniel W; Saunders, Benjamin E; Kilpatrick, Dean G

    2010-01-01

    Building on previous research with adolescents that examined demographic variables and other forms of substance abuse in relation to non-medical use of prescription drugs (NMUPD), the current study examined potentially traumatic events, depression, posttraumatic stress disorder (PTSD), other substance use, and delinquent behavior as potential correlates of past-year non-medical use of prescription drugs. A nationally representative sample of 3,614 non-institutionalized, civilian, English-speaking adolescents (aged 12-17 years) residing in households with a telephone was selected. Demographic characteristics, traumatic event history, mental health, and substance abuse variables were assessed. NMUPD was assessed by asking if, in the past year, participants had used a prescription drug in a non-medical manner. Multivariable logistic regressions were conducted for each theoretically derived predictor set. Significant predictors from each set were then entered into a final multivariable logistic regression to determine significant predictors of past-year NMUPD. NMUPD was endorsed by 6.7% of the sample (n = 242). The final multivariable model showed that lifetime history of delinquent behavior, other forms of substance use/abuse, history of witnessed violence, and lifetime history of PTSD were significantly associated with increased likelihood of NMUPD. Risk reduction efforts targeting NMUPD among adolescents who have witnessed significant violence, endorsed abuse of other substances and delinquent behavior, and/or endorsed PTSD are warranted. Interventions for adolescents with history of violence exposure or PTSD, or those adjudicated for delinquent behavior, should include treatment or prevention modules that specifically address NMUPD.

  4. Factors Impacting Growth in Infants with Single Ventricle Physiology: A Report from Pediatric Heart Network Infant Single Ventricle Trial

    PubMed Central

    Williams, Richard V.; Zak, Victor; Ravishankar, Chitra; Altmann, Karen; Anderson, Jeffrey; Atz, Andrew M.; Dunbar-Masterson, Carolyn; Ghanayem, Nancy; Lambert, Linda; Lurito, Karen; Medoff-Cooper, Barbara; Margossian, Renee; Pemberton, Victoria L.; Russell, Jennifer; Stylianou, Mario; Hsu, Daphne

    2011-01-01

    Objectives To describe growth patterns in infants with single ventricle physiology and determine factors influencing growth. Study design Data from 230 subjects enrolled in the Pediatric Heart Network Infant Single Ventricle Enalapril Trial were used to assess factors influencing change in weight-for-age z-score (Δz) from study enrollment (0.7 ± 0.4 months) to pre-superior cavopulmonary connection (SCPC) (5.1 ± 1.8 months, period 1), and pre-SCPC to final study visit (14.1 ± 0.9 months, period 2). Predictor variables included patient characteristics, feeding regimen, clinical center, and medical factors during neonatal (period 1) and SCPC hospitalizations (period 2). Univariate regression analysis was performed, followed by backward stepwise regression and bootstrapping reliability to inform a final multivariable model. Results Weights were available for 197/230 subjects for period 1 and 173/197 for period 2. For period 1, greater gestational age, younger age at study enrollment, tube feeding at neonatal discharge, and clinical center were associated with a greater negative Δz (poorer growth) in multivariable modeling (adjusted R2 = 0.39, p < 0.001). For period 2, younger age at SCPC and greater daily caloric intake were associated with greater positive Δz (better growth) (R2 = 0.10, p = 0.002). Conclusions Aggressive nutritional support and earlier SCPC are modifiable factors associated with a favorable change in weight-for-age z-score. PMID:21784436

  5. Inferring Weighted Directed Association Network from Multivariate Time Series with a Synthetic Method of Partial Symbolic Transfer Entropy Spectrum and Granger Causality

    PubMed Central

    Hu, Yanzhu; Ai, Xinbo

    2016-01-01

    Complex network methodology is very useful for complex system explorer. However, the relationships among variables in complex system are usually not clear. Therefore, inferring association networks among variables from their observed data has been a popular research topic. We propose a synthetic method, named small-shuffle partial symbolic transfer entropy spectrum (SSPSTES), for inferring association network from multivariate time series. The method synthesizes surrogate data, partial symbolic transfer entropy (PSTE) and Granger causality. A proper threshold selection is crucial for common correlation identification methods and it is not easy for users. The proposed method can not only identify the strong correlation without selecting a threshold but also has the ability of correlation quantification, direction identification and temporal relation identification. The method can be divided into three layers, i.e. data layer, model layer and network layer. In the model layer, the method identifies all the possible pair-wise correlation. In the network layer, we introduce a filter algorithm to remove the indirect weak correlation and retain strong correlation. Finally, we build a weighted adjacency matrix, the value of each entry representing the correlation level between pair-wise variables, and then get the weighted directed association network. Two numerical simulated data from linear system and nonlinear system are illustrated to show the steps and performance of the proposed approach. The ability of the proposed method is approved by an application finally. PMID:27832153

  6. The upper respiratory pyramid: early factors and later treatment utilization in World Trade Center exposed firefighters.

    PubMed

    Niles, Justin K; Webber, Mayris P; Liu, Xiaoxue; Zeig-Owens, Rachel; Hall, Charles B; Cohen, Hillel W; Glaser, Michelle S; Weakley, Jessica; Schwartz, Theresa M; Weiden, Michael D; Nolan, Anna; Aldrich, Thomas K; Glass, Lara; Kelly, Kerry J; Prezant, David J

    2014-08-01

    We investigated early post 9/11 factors that could predict rhinosinusitis healthcare utilization costs up to 11 years later in 8,079 World Trade Center-exposed rescue/recovery workers. We used bivariate and multivariate analytic techniques to investigate utilization outcomes; we also used a pyramid framework to describe rhinosinusitis healthcare groups at early (by 9/11/2005) and late (by 9/11/2012) time points. Multivariate models showed that pre-9/11/2005 chronic rhinosinusitis diagnoses and nasal symptoms predicted final year healthcare utilization outcomes more than a decade after WTC exposure. The relative proportion of workers on each pyramid level changed significantly during the study period. Diagnoses of chronic rhinosinusitis within 4 years of a major inhalation event only partially explain future healthcare utilization. Exposure intensity, early symptoms and other factors must also be considered when anticipating future healthcare needs. © 2014 Wiley Periodicals, Inc.

  7. The Upper Respiratory Pyramid: Early Factors and Later Treatment Utilization in World Trade Center Exposed Firefighters

    PubMed Central

    Niles, Justin K.; Webber, Mayris P.; Liu, Xiaoxue; Zeig-Owens, Rachel; Hall, Charles B.; Cohen, Hillel W.; Glaser, Michelle S.; Weakley, Jessica; Schwartz, Theresa M.; Weiden, Michael D.; Nolan, Anna; Aldrich, Thomas K.; Glass, Lara; Kelly, Kerry J.; Prezant, David J.

    2015-01-01

    Background We investigated early post 9/11 factors that could predict rhinosinusitis healthcare utilization costs up to 11 years later in 8,079 World Trade Center-exposed rescue/recovery workers. Methods We used bivariate and multivariate analytic techniques to investigate utilization outcomes; we also used a pyramid framework to describe rhinosinusitis healthcare groups at early (by 9/11/2005) and late (by 9/11/2012) time points. Results Multivariate models showed that pre-9/11/2005 chronic rhinosinusitis diagnoses and nasal symptoms predicted final year healthcare utilization outcomes more than a decade after WTC exposure. The relative proportion of workers on each pyramid level changed significantly during the study period. Conclusions Diagnoses of chronic rhinosinusitis within 4 years of a major inhalation event only partially explain future healthcare utilization. Exposure intensity, early symptoms and other factors must also be considered when anticipating future healthcare needs. PMID:24898816

  8. Simultaneous quantitative analysis of olmesartan, amlodipine and hydrochlorothiazide in their combined dosage form utilizing classical and alternating least squares based chemometric methods.

    PubMed

    Darwish, Hany W; Bakheit, Ahmed H; Abdelhameed, Ali S

    2016-03-01

    Simultaneous spectrophotometric analysis of a multi-component dosage form of olmesartan, amlodipine and hydrochlorothiazide used for the treatment of hypertension has been carried out using various chemometric methods. Multivariate calibration methods include classical least squares (CLS) executed by net analyte processing (NAP-CLS), orthogonal signal correction (OSC-CLS) and direct orthogonal signal correction (DOSC-CLS) in addition to multivariate curve resolution-alternating least squares (MCR-ALS). Results demonstrated the efficiency of the proposed methods as quantitative tools of analysis as well as their qualitative capability. The three analytes were determined precisely using the aforementioned methods in an external data set and in a dosage form after optimization of experimental conditions. Finally, the efficiency of the models was validated via comparison with the partial least squares (PLS) method in terms of accuracy and precision.

  9. Combinations of NIR, Raman spectroscopy and physicochemical measurements for improved monitoring of solvent extraction processes using hierarchical multivariate analysis models

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Nee, K.; Bryan, S.; Levitskaia, T.

    The reliability of chemical processes can be greatly improved by implementing inline monitoring systems. Combining multivariate analysis with non-destructive sensors can enhance the process without interfering with the operation. Here, we present here hierarchical models using both principal component analysis and partial least square analysis developed for different chemical components representative of solvent extraction process streams. A training set of 380 samples and an external validation set of 95 samples were prepared and Near infrared and Raman spectral data as well as conductivity under variable temperature conditions were collected. The results from the models indicate that careful selection of themore » spectral range is important. By compressing the data through Principal Component Analysis (PCA), we lower the rank of the data set to its most dominant features while maintaining the key principal components to be used in the regression analysis. Within the studied data set, concentration of five chemical components were modeled; total nitrate (NO 3 -), total acid (H +), neodymium (Nd 3+), sodium (Na +), and ionic strength (I.S.). The best overall model prediction for each of the species studied used a combined data set comprised of complementary techniques including NIR, Raman, and conductivity. Finally, our study shows that chemometric models are powerful but requires significant amount of carefully analyzed data to capture variations in the chemistry.« less

  10. Combinations of NIR, Raman spectroscopy and physicochemical measurements for improved monitoring of solvent extraction processes using hierarchical multivariate analysis models

    DOE PAGES

    Nee, K.; Bryan, S.; Levitskaia, T.; ...

    2017-12-28

    The reliability of chemical processes can be greatly improved by implementing inline monitoring systems. Combining multivariate analysis with non-destructive sensors can enhance the process without interfering with the operation. Here, we present here hierarchical models using both principal component analysis and partial least square analysis developed for different chemical components representative of solvent extraction process streams. A training set of 380 samples and an external validation set of 95 samples were prepared and Near infrared and Raman spectral data as well as conductivity under variable temperature conditions were collected. The results from the models indicate that careful selection of themore » spectral range is important. By compressing the data through Principal Component Analysis (PCA), we lower the rank of the data set to its most dominant features while maintaining the key principal components to be used in the regression analysis. Within the studied data set, concentration of five chemical components were modeled; total nitrate (NO 3 -), total acid (H +), neodymium (Nd 3+), sodium (Na +), and ionic strength (I.S.). The best overall model prediction for each of the species studied used a combined data set comprised of complementary techniques including NIR, Raman, and conductivity. Finally, our study shows that chemometric models are powerful but requires significant amount of carefully analyzed data to capture variations in the chemistry.« less

  11. Improved accuracy in quantitative laser-induced breakdown spectroscopy using sub-models

    USGS Publications Warehouse

    Anderson, Ryan; Clegg, Samuel M.; Frydenvang, Jens; Wiens, Roger C.; McLennan, Scott M.; Morris, Richard V.; Ehlmann, Bethany L.; Dyar, M. Darby

    2017-01-01

    Accurate quantitative analysis of diverse geologic materials is one of the primary challenges faced by the Laser-Induced Breakdown Spectroscopy (LIBS)-based ChemCam instrument on the Mars Science Laboratory (MSL) rover. The SuperCam instrument on the Mars 2020 rover, as well as other LIBS instruments developed for geochemical analysis on Earth or other planets, will face the same challenge. Consequently, part of the ChemCam science team has focused on the development of improved multivariate analysis calibrations methods. Developing a single regression model capable of accurately determining the composition of very different target materials is difficult because the response of an element’s emission lines in LIBS spectra can vary with the concentration of other elements. We demonstrate a conceptually simple “sub-model” method for improving the accuracy of quantitative LIBS analysis of diverse target materials. The method is based on training several regression models on sets of targets with limited composition ranges and then “blending” these “sub-models” into a single final result. Tests of the sub-model method show improvement in test set root mean squared error of prediction (RMSEP) for almost all cases. The sub-model method, using partial least squares regression (PLS), is being used as part of the current ChemCam quantitative calibration, but the sub-model method is applicable to any multivariate regression method and may yield similar improvements.

  12. Solving large mixed linear models using preconditioned conjugate gradient iteration.

    PubMed

    Strandén, I; Lidauer, M

    1999-12-01

    Continuous evaluation of dairy cattle with a random regression test-day model requires a fast solving method and algorithm. A new computing technique feasible in Jacobi and conjugate gradient based iterative methods using iteration on data is presented. In the new computing technique, the calculations in multiplication of a vector by a matrix were recorded to three steps instead of the commonly used two steps. The three-step method was implemented in a general mixed linear model program that used preconditioned conjugate gradient iteration. Performance of this program in comparison to other general solving programs was assessed via estimation of breeding values using univariate, multivariate, and random regression test-day models. Central processing unit time per iteration with the new three-step technique was, at best, one-third that needed with the old technique. Performance was best with the test-day model, which was the largest and most complex model used. The new program did well in comparison to other general software. Programs keeping the mixed model equations in random access memory required at least 20 and 435% more time to solve the univariate and multivariate animal models, respectively. Computations of the second best iteration on data took approximately three and five times longer for the animal and test-day models, respectively, than did the new program. Good performance was due to fast computing time per iteration and quick convergence to the final solutions. Use of preconditioned conjugate gradient based methods in solving large breeding value problems is supported by our findings.

  13. Comparison between splines and fractional polynomials for multivariable model building with continuous covariates: a simulation study with continuous response.

    PubMed

    Binder, Harald; Sauerbrei, Willi; Royston, Patrick

    2013-06-15

    In observational studies, many continuous or categorical covariates may be related to an outcome. Various spline-based procedures or the multivariable fractional polynomial (MFP) procedure can be used to identify important variables and functional forms for continuous covariates. This is the main aim of an explanatory model, as opposed to a model only for prediction. The type of analysis often guides the complexity of the final model. Spline-based procedures and MFP have tuning parameters for choosing the required complexity. To compare model selection approaches, we perform a simulation study in the linear regression context based on a data structure intended to reflect realistic biomedical data. We vary the sample size, variance explained and complexity parameters for model selection. We consider 15 variables. A sample size of 200 (1000) and R(2)  = 0.2 (0.8) is the scenario with the smallest (largest) amount of information. For assessing performance, we consider prediction error, correct and incorrect inclusion of covariates, qualitative measures for judging selected functional forms and further novel criteria. From limited information, a suitable explanatory model cannot be obtained. Prediction performance from all types of models is similar. With a medium amount of information, MFP performs better than splines on several criteria. MFP better recovers simpler functions, whereas splines better recover more complex functions. For a large amount of information and no local structure, MFP and the spline procedures often select similar explanatory models. Copyright © 2012 John Wiley & Sons, Ltd.

  14. Multivariate random regression analysis for body weight and main morphological traits in genetically improved farmed tilapia (Oreochromis niloticus).

    PubMed

    He, Jie; Zhao, Yunfeng; Zhao, Jingli; Gao, Jin; Han, Dandan; Xu, Pao; Yang, Runqing

    2017-11-02

    Because of their high economic importance, growth traits in fish are under continuous improvement. For growth traits that are recorded at multiple time-points in life, the use of univariate and multivariate animal models is limited because of the variable and irregular timing of these measures. Thus, the univariate random regression model (RRM) was introduced for the genetic analysis of dynamic growth traits in fish breeding. We used a multivariate random regression model (MRRM) to analyze genetic changes in growth traits recorded at multiple time-point of genetically-improved farmed tilapia. Legendre polynomials of different orders were applied to characterize the influences of fixed and random effects on growth trajectories. The final MRRM was determined by optimizing the univariate RRM for the analyzed traits separately via penalizing adaptively the likelihood statistical criterion, which is superior to both the Akaike information criterion and the Bayesian information criterion. In the selected MRRM, the additive genetic effects were modeled by Legendre polynomials of three orders for body weight (BWE) and body length (BL) and of two orders for body depth (BD). By using the covariance functions of the MRRM, estimated heritabilities were between 0.086 and 0.628 for BWE, 0.155 and 0.556 for BL, and 0.056 and 0.607 for BD. Only heritabilities for BD measured from 60 to 140 days of age were consistently higher than those estimated by the univariate RRM. All genetic correlations between growth time-points exceeded 0.5 for either single or pairwise time-points. Moreover, correlations between early and late growth time-points were lower. Thus, for phenotypes that are measured repeatedly in aquaculture, an MRRM can enhance the efficiency of the comprehensive selection for BWE and the main morphological traits.

  15. Fast-NPS-A Markov Chain Monte Carlo-based analysis tool to obtain structural information from single-molecule FRET measurements

    NASA Astrophysics Data System (ADS)

    Eilert, Tobias; Beckers, Maximilian; Drechsler, Florian; Michaelis, Jens

    2017-10-01

    The analysis tool and software package Fast-NPS can be used to analyse smFRET data to obtain quantitative structural information about macromolecules in their natural environment. In the algorithm a Bayesian model gives rise to a multivariate probability distribution describing the uncertainty of the structure determination. Since Fast-NPS aims to be an easy-to-use general-purpose analysis tool for a large variety of smFRET networks, we established an MCMC based sampling engine that approximates the target distribution and requires no parameter specification by the user at all. For an efficient local exploration we automatically adapt the multivariate proposal kernel according to the shape of the target distribution. In order to handle multimodality, the sampler is equipped with a parallel tempering scheme that is fully adaptive with respect to temperature spacing and number of chains. Since the molecular surrounding of a dye molecule affects its spatial mobility and thus the smFRET efficiency, we introduce dye models which can be selected for every dye molecule individually. These models allow the user to represent the smFRET network in great detail leading to an increased localisation precision. Finally, a tool to validate the chosen model combination is provided. Programme Files doi:http://dx.doi.org/10.17632/7ztzj63r68.1 Licencing provisions: Apache-2.0 Programming language: GUI in MATLAB (The MathWorks) and the core sampling engine in C++ Nature of problem: Sampling of highly diverse multivariate probability distributions in order to solve for macromolecular structures from smFRET data. Solution method: MCMC algorithm with fully adaptive proposal kernel and parallel tempering scheme.

  16. Accuracy enhancement of a multivariate calibration for lead determination in soils by laser induced breakdown spectroscopy

    NASA Astrophysics Data System (ADS)

    Zaytsev, Sergey M.; Krylov, Ivan N.; Popov, Andrey M.; Zorov, Nikita B.; Labutin, Timur A.

    2018-02-01

    We have investigated matrix effects and spectral interferences on example of lead determination in different types of soils by laser induced breakdown spectroscopy (LIBS). Comparison between analytical performances of univariate and multivariate calibrations with the use of different laser wavelength for ablation (532, 355 and 266 nm) have been reported. A set of 17 soil samples (Ca-rich, Fe-rich, lean soils etc., 8.5-280 ppm of Pb) was involved into construction of the calibration models. Spectral interferences from main components (Ca, Fe, Ti, Mg) and trace components (Mn, Nb, Zr) were estimated by spectra modeling, and they were a reason for significant differences between the univariate calibration models obtained for a three different soil types (black, red, gray) separately. Implementation of 3rd harmonic of Nd:YAG laser in combination with multivariate calibration model based on PCR with 3 principal components provided the best analytical results: the RMSEC has been lowered down to 8 ppm. The sufficient improvement of the relative uncertainty (up to 5-10%) in comparison with univariate calibration was observed at the Pb concentration level > 50 ppm, while the problem of accuracy still remains for some samples with Pb concentration at the 20 ppm level. We have also discussed a few possible ways to estimate LOD without a blank sample. The most rigorous criterion has resulted in LOD of Pb in soils being 13 ppm. Finally, a good agreement between the values of lead content predicted by LIBS (46 ± 5 ppm) and XRF (42.1 ± 3.3 ppm) in the unknown soil sample from Lomonosov Moscow State University area was demonstrated.

  17. Dynamic Granger causality based on Kalman filter for evaluation of functional network connectivity in fMRI data

    PubMed Central

    Havlicek, Martin; Jan, Jiri; Brazdil, Milan; Calhoun, Vince D.

    2015-01-01

    Increasing interest in understanding dynamic interactions of brain neural networks leads to formulation of sophisticated connectivity analysis methods. Recent studies have applied Granger causality based on standard multivariate autoregressive (MAR) modeling to assess the brain connectivity. Nevertheless, one important flaw of this commonly proposed method is that it requires the analyzed time series to be stationary, whereas such assumption is mostly violated due to the weakly nonstationary nature of functional magnetic resonance imaging (fMRI) time series. Therefore, we propose an approach to dynamic Granger causality in the frequency domain for evaluating functional network connectivity in fMRI data. The effectiveness and robustness of the dynamic approach was significantly improved by combining a forward and backward Kalman filter that improved estimates compared to the standard time-invariant MAR modeling. In our method, the functional networks were first detected by independent component analysis (ICA), a computational method for separating a multivariate signal into maximally independent components. Then the measure of Granger causality was evaluated using generalized partial directed coherence that is suitable for bivariate as well as multivariate data. Moreover, this metric provides identification of causal relation in frequency domain, which allows one to distinguish the frequency components related to the experimental paradigm. The procedure of evaluating Granger causality via dynamic MAR was demonstrated on simulated time series as well as on two sets of group fMRI data collected during an auditory sensorimotor (SM) or auditory oddball discrimination (AOD) tasks. Finally, a comparison with the results obtained from a standard time-invariant MAR model was provided. PMID:20561919

  18. Factors Associated with the Emergence of Highly Pathogenic Avian Influenza A (H5N1) Poultry Outbreaks in China: Evidence from an Epidemiological Investigation in Ningxia, 2012.

    PubMed

    Liu, H; Zhou, X; Zhao, Y; Zheng, D; Wang, J; Wang, X; Castellan, D; Huang, B; Wang, Z; Soares Magalhães, R J

    2017-06-01

    In April 2012, highly pathogenic avian influenza virus of the H5N1 subtype (HPAIV H5N1) emerged in poultry layers in Ningxia. A retrospective case-control study was conducted to identify possible risk factors associated with the emergence of H5N1 infection and describe and quantify the spatial variation in H5N1 infection. A multivariable logistic regression model was used to identify risk factors significantly associated with the presence of infection; residual spatial variation in H5N1 risk unaccounted by the factors included in the multivariable model was investigated using a semivariogram. Our results indicate that HPAIV H5N1-infected farms were three times more likely to improperly dispose farm waste [adjusted OR = 0.37; 95% CI: 0.12-0.82] and five times more likely to have had visitors in their farm within the past month [adjusted OR = 5.47; 95% CI: 1.97-15.64] compared to H5N1-non-infected farms. The variables included in the final multivariable model accounted only 20% for the spatial clustering of H5N1 infection. The average size of a H5N1 cluster was 660 m. Bio-exclusion practices should be strengthened on poultry farms to prevent further emergence of H5N1 infection. For future poultry depopulation, operations should consider H5N1 disease clusters to be as large as 700 m. © 2015 Blackwell Verlag GmbH.

  19. How does pedogenesis drive plant diversity?

    USGS Publications Warehouse

    Laliberté, Etienne; Grace, James B.; Huston, Michael A.; Lambers, Hans; Teste, François P.; Turner, Benjamin L.; Wardle, David A.

    2013-01-01

    Some of the most species-rich plant communities occur on ancient, strongly weathered soils, whereas those on recently developed soils tend to be less diverse. Mechanisms underlying this well-known pattern, however, remain unresolved. Here, we present a conceptual model describing alternative mechanisms by which pedogenesis (the process of soil formation) might drive plant diversity. We suggest that long-term soil chronosequences offer great, yet largely untapped, potential as 'natural experiments' to determine edaphic controls over plant diversity. Finally, we discuss how our conceptual model can be evaluated quantitatively using structural equation modeling to advance multivariate theories about the determinants of local plant diversity. This should help us to understand broader-scale diversity patterns, such as the latitudinal gradient of plant diversity.

  20. Conditional Survival in Anal Carcinoma Using the National Population-Based Survey of Epidemiology and End Results Database (1988-2012).

    PubMed

    Kim, Ellen; Kim, Jong S; Choi, Mehee; Thomas, Charles R

    2016-04-01

    Conditional survival can provide valuable information for both patients and healthcare providers about the changing prognosis in surviving patients over time. This study estimated conditional survival for patients with anal cancer in the United States through analysis of a national population-based cancer registry. Log-rank test identified significant covariates of cause-specific survival (defined as time from diagnosis until death from anal cancer). Significant covariates were considered in the multivariable regression of cause-specific survival using Cox proportional hazards models. Covariates included cancer stage and demographic variables. Patients in Surveillance, Epidemiology, and End Results regions diagnosed with anal squamous cell carcinoma as their first and only cancer diagnosis from 1988 to 2012 were selected from this database, and 5145 patients were included in the retrospective cohort study. Five-year conditional survival stratified by each variable in the final Cox models was measured : The final multivariable models of overall and cause-specific survivals included stage, grade, sex, age, race, and relationship status. Over the first 6 years after diagnosis, conditional survival of distant stage increased from 37% to 89%, whereas regional stage increased from 65% to 93% and localized stage increased from 84% to 96%. The other variables had increasing prognosis as well, but the subgroups increased at a more similar rate over time. The data source used does not include information on chemotherapy treatment, patient comorbidities, or socioeconomic status. Conditional survival showed improvement over time. Patients with advanced stage had the greatest improvement in conditional survival. This is the first study to provide specific conditional survival probabilities for patients with anal cancer.

  1. Correlational analysis of neck/shoulder pain and low back pain with the use of digital products, physical activity and psychological status among adolescents in Shanghai.

    PubMed

    Shan, Zhi; Deng, Guoying; Li, Jipeng; Li, Yangyang; Zhang, Yongxing; Zhao, Qinghua

    2013-01-01

    This study investigates the neck/shoulder pain (NSP) and low back pain (LBP) among current high school students in Shanghai and explores the relationship between these pains and their possible influences, including digital products, physical activity, and psychological status. An anonymous self-assessment was administered to 3,600 students across 30 high schools in Shanghai. This questionnaire examined the prevalence of NSP and LBP and the level of physical activity as well as the use of mobile phones, personal computers (PC) and tablet computers (Tablet). The CES-D (Center for Epidemiological Studies Depression) scale was also included in the survey. The survey data were analyzed using the chi-square test, univariate logistic analyses and a multivariate logistic regression model. Three thousand sixteen valid questionnaires were received including 1,460 (48.41%) from male respondents and 1,556 (51.59%) from female respondents. The high school students in this study showed NSP and LBP rates of 40.8% and 33.1%, respectively, and the prevalence of both influenced by the student's grade, use of digital products, and mental status; these factors affected the rates of NSP and LBP to varying degrees. The multivariate logistic regression analysis revealed that Gender, grade, soreness after exercise, PC using habits, tablet use, sitting time after school and academic stress entered the final model of NSP, while the final model of LBP consisted of gender, grade, soreness after exercise, PC using habits, mobile phone use, sitting time after school, academic stress and CES-D score. High school students in Shanghai showed high prevalence of NSP and LBP that were closely related to multiple factors. Appropriate interventions should be implemented to reduce the occurrences of NSP and LBP.

  2. A multivariate time series approach to modeling and forecasting demand in the emergency department.

    PubMed

    Jones, Spencer S; Evans, R Scott; Allen, Todd L; Thomas, Alun; Haug, Peter J; Welch, Shari J; Snow, Gregory L

    2009-02-01

    The goals of this investigation were to study the temporal relationships between the demands for key resources in the emergency department (ED) and the inpatient hospital, and to develop multivariate forecasting models. Hourly data were collected from three diverse hospitals for the year 2006. Descriptive analysis and model fitting were carried out using graphical and multivariate time series methods. Multivariate models were compared to a univariate benchmark model in terms of their ability to provide out-of-sample forecasts of ED census and the demands for diagnostic resources. Descriptive analyses revealed little temporal interaction between the demand for inpatient resources and the demand for ED resources at the facilities considered. Multivariate models provided more accurate forecasts of ED census and of the demands for diagnostic resources. Our results suggest that multivariate time series models can be used to reliably forecast ED patient census; however, forecasts of the demands for diagnostic resources were not sufficiently reliable to be useful in the clinical setting.

  3. Comparison of Multidimensional Item Response Models: Multivariate Normal Ability Distributions versus Multivariate Polytomous Ability Distributions. Research Report. ETS RR-08-45

    ERIC Educational Resources Information Center

    Haberman, Shelby J.; von Davier, Matthias; Lee, Yi-Hsuan

    2008-01-01

    Multidimensional item response models can be based on multivariate normal ability distributions or on multivariate polytomous ability distributions. For the case of simple structure in which each item corresponds to a unique dimension of the ability vector, some applications of the two-parameter logistic model to empirical data are employed to…

  4. Spatial land-use inventory, modeling, and projection/Denver metropolitan area, with inputs from existing maps, airphotos, and LANDSAT imagery

    NASA Technical Reports Server (NTRS)

    Tom, C.; Miller, L. D.; Christenson, J. W.

    1978-01-01

    A landscape model was constructed with 34 land-use, physiographic, socioeconomic, and transportation maps. A simple Markov land-use trend model was constructed from observed rates of change and nonchange from photointerpreted 1963 and 1970 airphotos. Seven multivariate land-use projection models predicting 1970 spatial land-use changes achieved accuracies from 42 to 57 percent. A final modeling strategy was designed, which combines both Markov trend and multivariate spatial projection processes. Landsat-1 image preprocessing included geometric rectification/resampling, spectral-band, and band/insolation ratioing operations. A new, systematic grid-sampled point training-set approach proved to be useful when tested on the four orginal MSS bands, ten image bands and ratios, and all 48 image and map variables (less land use). Ten variable accuracy was raised over 15 percentage points from 38.4 to 53.9 percent, with the use of the 31 ancillary variables. A land-use classification map was produced with an optimal ten-channel subset of four image bands and six ancillary map variables. Point-by-point verification of 331,776 points against a 1972/1973 U.S. Geological Survey (UGSG) land-use map prepared with airphotos and the same classification scheme showed average first-, second-, and third-order accuracies of 76.3, 58.4, and 33.0 percent, respectively.

  5. Advanced statistics: linear regression, part II: multiple linear regression.

    PubMed

    Marill, Keith A

    2004-01-01

    The applications of simple linear regression in medical research are limited, because in most situations, there are multiple relevant predictor variables. Univariate statistical techniques such as simple linear regression use a single predictor variable, and they often may be mathematically correct but clinically misleading. Multiple linear regression is a mathematical technique used to model the relationship between multiple independent predictor variables and a single dependent outcome variable. It is used in medical research to model observational data, as well as in diagnostic and therapeutic studies in which the outcome is dependent on more than one factor. Although the technique generally is limited to data that can be expressed with a linear function, it benefits from a well-developed mathematical framework that yields unique solutions and exact confidence intervals for regression coefficients. Building on Part I of this series, this article acquaints the reader with some of the important concepts in multiple regression analysis. These include multicollinearity, interaction effects, and an expansion of the discussion of inference testing, leverage, and variable transformations to multivariate models. Examples from the first article in this series are expanded on using a primarily graphic, rather than mathematical, approach. The importance of the relationships among the predictor variables and the dependence of the multivariate model coefficients on the choice of these variables are stressed. Finally, concepts in regression model building are discussed.

  6. F100 Multivariable Control Synthesis Program. Computer Implementation of the F100 Multivariable Control Algorithm

    NASA Technical Reports Server (NTRS)

    Soeder, J. F.

    1983-01-01

    As turbofan engines become more complex, the development of controls necessitate the use of multivariable control techniques. A control developed for the F100-PW-100(3) turbofan engine by using linear quadratic regulator theory and other modern multivariable control synthesis techniques is described. The assembly language implementation of this control on an SEL 810B minicomputer is described. This implementation was then evaluated by using a real-time hybrid simulation of the engine. The control software was modified to run with a real engine. These modifications, in the form of sensor and actuator failure checks and control executive sequencing, are discussed. Finally recommendations for control software implementations are presented.

  7. Stochastic modelling of temperatures affecting the in situ performance of a solar-assisted heat pump: The multivariate approach and physical interpretation

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Loveday, D.L.; Craggs, C.

    Box-Jenkins-based multivariate stochastic modeling is carried out using data recorded from a domestic heating system. The system comprises an air-source heat pump sited in the roof space of a house, solar assistance being provided by the conventional tile roof acting as a radiation absorber. Multivariate models are presented which illustrate the time-dependent relationships between three air temperatures - at external ambient, at entry to, and at exit from, the heat pump evaporator. Using a deterministic modeling approach, physical interpretations are placed on the results of the multivariate technique. It is concluded that the multivariate Box-Jenkins approach is a suitable techniquemore » for building thermal analysis. Application to multivariate Box-Jenkins approach is a suitable technique for building thermal analysis. Application to multivariate model-based control is discussed, with particular reference to building energy management systems. It is further concluded that stochastic modeling of data drawn from a short monitoring period offers a means of retrofitting an advanced model-based control system in existing buildings, which could be used to optimize energy savings. An approach to system simulation is suggested.« less

  8. Characterizing multivariate decoding models based on correlated EEG spectral features

    PubMed Central

    McFarland, Dennis J.

    2013-01-01

    Objective Multivariate decoding methods are popular techniques for analysis of neurophysiological data. The present study explored potential interpretative problems with these techniques when predictors are correlated. Methods Data from sensorimotor rhythm-based cursor control experiments was analyzed offline with linear univariate and multivariate models. Features were derived from autoregressive (AR) spectral analysis of varying model order which produced predictors that varied in their degree of correlation (i.e., multicollinearity). Results The use of multivariate regression models resulted in much better prediction of target position as compared to univariate regression models. However, with lower order AR features interpretation of the spectral patterns of the weights was difficult. This is likely to be due to the high degree of multicollinearity present with lower order AR features. Conclusions Care should be exercised when interpreting the pattern of weights of multivariate models with correlated predictors. Comparison with univariate statistics is advisable. Significance While multivariate decoding algorithms are very useful for prediction their utility for interpretation may be limited when predictors are correlated. PMID:23466267

  9. Multivariate multiscale entropy of financial markets

    NASA Astrophysics Data System (ADS)

    Lu, Yunfan; Wang, Jun

    2017-11-01

    In current process of quantifying the dynamical properties of the complex phenomena in financial market system, the multivariate financial time series are widely concerned. In this work, considering the shortcomings and limitations of univariate multiscale entropy in analyzing the multivariate time series, the multivariate multiscale sample entropy (MMSE), which can evaluate the complexity in multiple data channels over different timescales, is applied to quantify the complexity of financial markets. Its effectiveness and advantages have been detected with numerical simulations with two well-known synthetic noise signals. For the first time, the complexity of four generated trivariate return series for each stock trading hour in China stock markets is quantified thanks to the interdisciplinary application of this method. We find that the complexity of trivariate return series in each hour show a significant decreasing trend with the stock trading time progressing. Further, the shuffled multivariate return series and the absolute multivariate return series are also analyzed. As another new attempt, quantifying the complexity of global stock markets (Asia, Europe and America) is carried out by analyzing the multivariate returns from them. Finally we utilize the multivariate multiscale entropy to assess the relative complexity of normalized multivariate return volatility series with different degrees.

  10. A spatial analysis of the spread of mumps: the importance of college students and their spring-break-associated travel.

    PubMed

    Polgreen, P M; Bohnett, L C; Yang, M; Pentella, M A; Cavanaugh, J E

    2010-03-01

    To characterize the association between county-level risk factors and the incidence of mumps in the 2006 Iowa outbreak, we used generalized linear mixed models with the number of mumps cases per county as the dependent variable. To assess the impact of spring-break travel, we tested for differences in the proportions of mumps cases in three different age groups. In the final multivariable model, the proportion of Iowa's college students per county was positively associated (P<0.0001) with mumps cases, but the number of colleges was negatively associated with cases (P=0.0002). Thus, if the college students in a county were spread among more campuses, this was associated with fewer mumps cases. Finally, we found the proportion of mumps cases in both older and younger persons increased after 1 April (P=0.0029), suggesting that spring-break college travel was associated with the spread of mumps to other age groups.

  11. Forecasted range shifts of arid-land fishes in response to climate change

    USGS Publications Warehouse

    Whitney, James E.; Whittier, Joanna B.; Paukert, Craig P.; Olden, Julian D.; Strecker, Angela L.

    2017-01-01

    Climate change is poised to alter the distributional limits, center, and size of many species. Traits may influence different aspects of range shifts, with trophic generality facilitating shifts at the leading edge, and greater thermal tolerance limiting contractions at the trailing edge. The generality of relationships between traits and range shifts remains ambiguous however, especially for imperiled fishes residing in xeric riverscapes. Our objectives were to quantify contemporary fish distributions in the Lower Colorado River Basin, forecast climate change by 2085 using two general circulation models, and quantify shifts in the limits, center, and size of fish elevational ranges according to fish traits. We examined relationships among traits and range shift metrics either singly using univariate linear modeling or combined with multivariate redundancy analysis. We found that trophic and dispersal traits were associated with shifts at the leading and trailing edges, respectively, although projected range shifts were largely unexplained by traits. As expected, piscivores and omnivores with broader diets shifted upslope most at the leading edge while more specialized invertivores exhibited minimal changes. Fishes that were more mobile shifted upslope most at the trailing edge, defying predictions. No traits explained changes in range center or size. Finally, current preference explained multivariate range shifts, as fishes with faster current preferences exhibited smaller multivariate changes. Although range shifts were largely unexplained by traits, more specialized invertivorous fishes with lower dispersal propensity or greater current preference may require the greatest conservation efforts because of their limited capacity to shift ranges under climate change.

  12. Multivariate Longitudinal Analysis with Bivariate Correlation Test.

    PubMed

    Adjakossa, Eric Houngla; Sadissou, Ibrahim; Hounkonnou, Mahouton Norbert; Nuel, Gregory

    2016-01-01

    In the context of multivariate multilevel data analysis, this paper focuses on the multivariate linear mixed-effects model, including all the correlations between the random effects when the dimensional residual terms are assumed uncorrelated. Using the EM algorithm, we suggest more general expressions of the model's parameters estimators. These estimators can be used in the framework of the multivariate longitudinal data analysis as well as in the more general context of the analysis of multivariate multilevel data. By using a likelihood ratio test, we test the significance of the correlations between the random effects of two dependent variables of the model, in order to investigate whether or not it is useful to model these dependent variables jointly. Simulation studies are done to assess both the parameter recovery performance of the EM estimators and the power of the test. Using two empirical data sets which are of longitudinal multivariate type and multivariate multilevel type, respectively, the usefulness of the test is illustrated.

  13. Clinical predictors of advanced sellar masses.

    PubMed

    Rambaldini, Gloria M; Butalia, Sonia; Ezzat, Shereen; Kucharczyk, Walter; Sawka, Anna M

    2007-10-01

    To identify clinical variables associated with the presence of a structurally advanced sellar mass (ASM). We performed a retrospective study of patients referred for evaluation of suspected new pituitary disease or sellar mass to the Endocrine Oncology Unit of Mount Sinai Hospital in Toronto, Ontario, Canada. By multivariate analysis, we examined predictors of a structurally ASM (a sellar lesion with any of the following characteristics: diameter of >or=1 cm on magnetic resonance imaging [MRI], optic chiasmal compression on MRI, or clinical or biochemical evidence of hypopituitarism). Data from 152 patients were analyzed. Of the 152 sellar masses, 142 (93%) were pituitary adenomas. An ASM was noted in 85 of the 152 patients (56%). In the final multivariate model, male sex (odds ratio [OR], 6.23; 95% confidence interval [CI], 2.84 to 13.56; P<0.001) and self-reported visual field defect (OR, 3.62; 95% CI, 1.07 to 12.25; P = 0.039) were significantly independently associated with the presence of an ASM. The presence of new or changed headaches also tended to be associated with an ASM (OR, 2.11; 95% CI, 0.96 to 4.64; P = 0.063). Age and self-reported galactorrhea were not independently associated with the presence of an ASM and were conditionally removed from the final model. In patients with suspected sellar or pituitary disease, male sex and self-reported visual field defects independently predict the presence of an ASM. New or changed headaches also tend to be related to the presence of an ASM. The presence of predictors of an ASM should prompt expedited sellar MRI and biochemical evaluation.

  14. Implementation of a guideline for local health policy making by regional health services: exploring determinants of use by a web survey.

    PubMed

    Kuunders, Theo J M; Jacobs, Monique A M; Goor, Ien A M van de; Bon-Martens, Marja J H van; Oers, Hans A M van; Paulussen, Theo G W M

    2017-08-15

    Previous evaluation showed insufficient use of a national guideline for integrated local health policy by Regional Health Services (RHS) in the Netherlands. The guideline focuses on five health topics and includes five checklists to support integrated municipal health policies. This study explores the determinants of guideline use by regional Dutch health professionals. A web survey was send to 304 RHS health professionals. The questionnaire was based on a theory- and research-based framework of determinants of public health innovations. Main outcomes were guideline use and completeness of use, defined as the number of health topics and checklists used. Associations between determinants and (completeness of) guideline use were explored by multivariate regression models. The survey was started by 120 professionals (39%). Finally, results from 73 respondents (24%) were eligible for analyses. All 28 Dutch RHS organizations were represented in the final dataset. About half of the respondents (48%) used the guideline. The average score for completeness of use (potential range 1-10) was 2.37 (sd = 1.78; range 1-7). Knowledge, perceived task responsibility and usability were significantly related to guideline use in univariate analyses. Only usability remained significant in the multivariate model on guideline use. Only self-efficacy accounted for significant proportions of variance in completeness of use. The results imply that strategies to improve guideline use by RHSs should primarily target perceived usability. Self-efficacy appeared the primary target for improving completeness of guideline use. Methods for targeting these determinants in RHSs are discussed.

  15. Influence of individual and social contextual factors on changes in leisure-time physical activity in working-class populations: results of the Healthy Directions–Small Businesses Study

    PubMed Central

    Stoddard, Anne; Bennett, Gary G.; Wolin, Kathleen Y.; Sorensen, Glorian G.

    2012-01-01

    Background As part of the Harvard Cancer Prevention Program Project, we sought to address disparities reflected in social class and race/ethnicity by developing and testing a behavioral intervention model that targeted fruit and vegetable consumption, red meat consumption, multivitamin intake, and physical activity in working-class, multiethnic populations. Methods This paper examined the associations between change in leisure-time physical activity and individual and social contextual factors in participants employed in small businesses (n = 850) at both baseline and at 18-month final. Results In bivariate analyses, age, language acculturation, social ties, and workplace social capital were significantly associated with physical activity at final. In multivariable analyses, being younger and having high language acculturation were significantly associated with greater leisuretime physical activity at final; high workplace social capital was significantly associated with a decline in physical activity at final. Conclusion These findings have implications for understanding factors that are integral to promoting change in physical activity among working-class, multiethnic populations. PMID:22806257

  16. Improved accuracy in quantitative laser-induced breakdown spectroscopy using sub-models

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Anderson, Ryan B.; Clegg, Samuel M.; Frydenvang, Jens

    We report that accurate quantitative analysis of diverse geologic materials is one of the primary challenges faced by the Laser-Induced Breakdown Spectroscopy (LIBS)-based ChemCam instrument on the Mars Science Laboratory (MSL) rover. The SuperCam instrument on the Mars 2020 rover, as well as other LIBS instruments developed for geochemical analysis on Earth or other planets, will face the same challenge. Consequently, part of the ChemCam science team has focused on the development of improved multivariate analysis calibrations methods. Developing a single regression model capable of accurately determining the composition of very different target materials is difficult because the response ofmore » an element’s emission lines in LIBS spectra can vary with the concentration of other elements. We demonstrate a conceptually simple “submodel” method for improving the accuracy of quantitative LIBS analysis of diverse target materials. The method is based on training several regression models on sets of targets with limited composition ranges and then “blending” these “sub-models” into a single final result. Tests of the sub-model method show improvement in test set root mean squared error of prediction (RMSEP) for almost all cases. Lastly, the sub-model method, using partial least squares regression (PLS), is being used as part of the current ChemCam quantitative calibration, but the sub-model method is applicable to any multivariate regression method and may yield similar improvements.« less

  17. Improved accuracy in quantitative laser-induced breakdown spectroscopy using sub-models

    DOE PAGES

    Anderson, Ryan B.; Clegg, Samuel M.; Frydenvang, Jens; ...

    2016-12-15

    We report that accurate quantitative analysis of diverse geologic materials is one of the primary challenges faced by the Laser-Induced Breakdown Spectroscopy (LIBS)-based ChemCam instrument on the Mars Science Laboratory (MSL) rover. The SuperCam instrument on the Mars 2020 rover, as well as other LIBS instruments developed for geochemical analysis on Earth or other planets, will face the same challenge. Consequently, part of the ChemCam science team has focused on the development of improved multivariate analysis calibrations methods. Developing a single regression model capable of accurately determining the composition of very different target materials is difficult because the response ofmore » an element’s emission lines in LIBS spectra can vary with the concentration of other elements. We demonstrate a conceptually simple “submodel” method for improving the accuracy of quantitative LIBS analysis of diverse target materials. The method is based on training several regression models on sets of targets with limited composition ranges and then “blending” these “sub-models” into a single final result. Tests of the sub-model method show improvement in test set root mean squared error of prediction (RMSEP) for almost all cases. Lastly, the sub-model method, using partial least squares regression (PLS), is being used as part of the current ChemCam quantitative calibration, but the sub-model method is applicable to any multivariate regression method and may yield similar improvements.« less

  18. Causal diagrams and multivariate analysis III: confound it!

    PubMed

    Jupiter, Daniel C

    2015-01-01

    This commentary concludes my series concerning inclusion of variables in multivariate analyses. We take up the issues of confounding and effect modification and summarize the work we have thus far done. Finally, we provide a rough algorithm to help guide us through the maze of possibilities that we have outlined. Copyright © 2015 American College of Foot and Ankle Surgeons. Published by Elsevier Inc. All rights reserved.

  19. How does pedogenesis drive plant diversity?

    PubMed

    Laliberté, Etienne; Grace, James B; Huston, Michael A; Lambers, Hans; Teste, François P; Turner, Benjamin L; Wardle, David A

    2013-06-01

    Some of the most species-rich plant communities occur on ancient, strongly weathered soils, whereas those on recently developed soils tend to be less diverse. Mechanisms underlying this well-known pattern, however, remain unresolved. Here, we present a conceptual model describing alternative mechanisms by which pedogenesis (the process of soil formation) might drive plant diversity. We suggest that long-term soil chronosequences offer great, yet largely untapped, potential as 'natural experiments' to determine edaphic controls over plant diversity. Finally, we discuss how our conceptual model can be evaluated quantitatively using structural equation modeling to advance multivariate theories about the determinants of local plant diversity. This should help us to understand broader-scale diversity patterns, such as the latitudinal gradient of plant diversity. Copyright © 2013 Elsevier Ltd. All rights reserved.

  20. Workers' compensation costs among construction workers: a robust regression analysis.

    PubMed

    Friedman, Lee S; Forst, Linda S

    2009-11-01

    Workers' compensation data are an important source for evaluating costs associated with construction injuries. We describe the characteristics of injured construction workers filing claims in Illinois between 2000 and 2005 and the factors associated with compensation costs using a robust regression model. In the final multivariable model, the cumulative percent temporary and permanent disability-measures of severity of injury-explained 38.7% of the variance of cost. Attorney costs explained only 0.3% of the variance of the dependent variable. The model used in this study clearly indicated that percent disability was the most important determinant of cost, although the method and uniformity of percent impairment allocation could be better elucidated. There is a need to integrate analytical methods that are suitable for skewed data when analyzing claim costs.

  1. Measurement of non-sugar solids content in Chinese rice wine using near infrared spectroscopy combined with an efficient characteristic variables selection algorithm.

    PubMed

    Ouyang, Qin; Zhao, Jiewen; Chen, Quansheng

    2015-01-01

    The non-sugar solids (NSS) content is one of the most important nutrition indicators of Chinese rice wine. This study proposed a rapid method for the measurement of NSS content in Chinese rice wine using near infrared (NIR) spectroscopy. We also systemically studied the efficient spectral variables selection algorithms that have to go through modeling. A new algorithm of synergy interval partial least square with competitive adaptive reweighted sampling (Si-CARS-PLS) was proposed for modeling. The performance of the final model was back-evaluated using root mean square error of calibration (RMSEC) and correlation coefficient (Rc) in calibration set and similarly tested by mean square error of prediction (RMSEP) and correlation coefficient (Rp) in prediction set. The optimum model by Si-CARS-PLS algorithm was achieved when 7 PLS factors and 18 variables were included, and the results were as follows: Rc=0.95 and RMSEC=1.12 in the calibration set, Rp=0.95 and RMSEP=1.22 in the prediction set. In addition, Si-CARS-PLS algorithm showed its superiority when compared with the commonly used algorithms in multivariate calibration. This work demonstrated that NIR spectroscopy technique combined with a suitable multivariate calibration algorithm has a high potential in rapid measurement of NSS content in Chinese rice wine. Copyright © 2015 Elsevier B.V. All rights reserved.

  2. Predicting early cognitive decline in newly-diagnosed Parkinson's patients: A practical model.

    PubMed

    Hogue, Olivia; Fernandez, Hubert H; Floden, Darlene P

    2018-06-19

    To create a multivariable model to predict early cognitive decline among de novo patients with Parkinson's disease, using brief, inexpensive assessments that are easily incorporated into clinical flow. Data for 351 drug-naïve patients diagnosed with idiopathic Parkinson's disease were obtained from the Parkinson's Progression Markers Initiative. Baseline demographic, disease history, motor, and non-motor features were considered as candidate predictors. Best subsets selection was used to determine the multivariable baseline symptom profile that most accurately predicted individual cognitive decline within three years. Eleven per cent of the sample experienced cognitive decline. The final logistic regression model predicting decline included five baseline variables: verbal memory retention, right-sided bradykinesia, years of education, subjective report of cognitive impairment, and REM behavior disorder. Model discrimination was good (optimism-adjusted concordance index = .749). The associated nomogram provides a tool to determine individual patient risk of meaningful cognitive change in the early stages of the disease. Through the consideration of easily-implemented or routinely-gathered assessments, we have identified a multidimensional baseline profile and created a convenient, inexpensive tool to predict cognitive decline in the earliest stages of Parkinson's disease. The use of this tool would generate prediction at the individual level, allowing clinicians to tailor medical management for each patient and identify at-risk patients for clinical trials aimed at disease modifying therapies. Copyright © 2018. Published by Elsevier Ltd.

  3. Contribution au developpement d'une methode de controle des procedes dans une usine de bouletage

    NASA Astrophysics Data System (ADS)

    Gosselin, Claude

    This thesis, a collaborative effort between Ecole de technologie superieure and ArcelorMittal Company, presents the development of a methodology for monitoring and quality control of multivariable industrial production processes. This innovation research mandate was developed at ArcelorMittal Exploitation Miniere (AMEM) pellet plant in Port-Cartier (Quebec, Canada). With this undertaking, ArcelorMittal is striving to maintain its world class level of excellence and continues to pursue initiatives that can augment its competitive advantage worldwide. The plant's gravimetric classification process was retained as a prototype and development laboratory due to its effect on the company's competitiveness and its impact on subsequent steps leading to final production of iron oxide pellets. Concretely, the development of this expertise in process control and in situ monitoring will establish a firm basic knowledge in the fields of complex system physical modeling, data reconciliation, statistical observers, multivariate command and quality control using real-time monitoring of the desirability function. The hydraulic classifier is mathematically modeled. Using planned disturbances on the production line, an identification procedure was established to provide empirical estimations of the model's structural parameters. A new sampling campaign and a previously unpublished data collection and consolidation policy were implemented plant-wide. Access to these invaluable data sources has enabled the establishment of new thresholds that govern the production process and its control. Finally, as a substitute for the traditional quality control process, we have implemented a new strategy based on the use of the desirability function. Our innovation is not in using this Finally, as a substitute for the traditional quality control process, we have implemented a new strategy based on the use of the desirability function. Our innovation is not in using this function as an indicator of overall (economic) satisfaction in the production process, but rather in proposing it as an "observer" of the system's state. The first implementation steps have already demonstrated the method's feasibility as well as other numerous industrial impacts on production processes within the company. Namely, the emergence of the economical aspect as a strategic variable that assures better governance of production processes where quality variables present strategic issues.

  4. Small Sample Properties of Bayesian Multivariate Autoregressive Time Series Models

    ERIC Educational Resources Information Center

    Price, Larry R.

    2012-01-01

    The aim of this study was to compare the small sample (N = 1, 3, 5, 10, 15) performance of a Bayesian multivariate vector autoregressive (BVAR-SEM) time series model relative to frequentist power and parameter estimation bias. A multivariate autoregressive model was developed based on correlated autoregressive time series vectors of varying…

  5. Characterizing multivariate decoding models based on correlated EEG spectral features.

    PubMed

    McFarland, Dennis J

    2013-07-01

    Multivariate decoding methods are popular techniques for analysis of neurophysiological data. The present study explored potential interpretative problems with these techniques when predictors are correlated. Data from sensorimotor rhythm-based cursor control experiments was analyzed offline with linear univariate and multivariate models. Features were derived from autoregressive (AR) spectral analysis of varying model order which produced predictors that varied in their degree of correlation (i.e., multicollinearity). The use of multivariate regression models resulted in much better prediction of target position as compared to univariate regression models. However, with lower order AR features interpretation of the spectral patterns of the weights was difficult. This is likely to be due to the high degree of multicollinearity present with lower order AR features. Care should be exercised when interpreting the pattern of weights of multivariate models with correlated predictors. Comparison with univariate statistics is advisable. While multivariate decoding algorithms are very useful for prediction their utility for interpretation may be limited when predictors are correlated. Copyright © 2013 International Federation of Clinical Neurophysiology. Published by Elsevier Ireland Ltd. All rights reserved.

  6. PTEN Loss as Determined by Clinical-grade Immunohistochemistry Assay Is Associated with Worse Recurrence-free Survival in Prostate Cancer.

    PubMed

    Lotan, Tamara L; Wei, Wei; Morais, Carlos L; Hawley, Sarah T; Fazli, Ladan; Hurtado-Coll, Antonio; Troyer, Dean; McKenney, Jesse K; Simko, Jeffrey; Carroll, Peter R; Gleave, Martin; Lance, Raymond; Lin, Daniel W; Nelson, Peter S; Thompson, Ian M; True, Lawrence D; Feng, Ziding; Brooks, James D

    2016-06-01

    PTEN is the most commonly deleted tumor suppressor gene in primary prostate cancer (PCa) and its loss is associated with poor clinical outcomes and ERG gene rearrangement. We tested whether PTEN loss is associated with shorter recurrence-free survival (RFS) in surgically treated PCa patients with known ERG status. A genetically validated, automated PTEN immunohistochemistry (IHC) protocol was used for 1275 primary prostate tumors from the Canary Foundation retrospective PCa tissue microarray cohort to assess homogeneous (in all tumor tissue sampled) or heterogeneous (in a subset of tumor tissue sampled) PTEN loss. ERG status as determined by a genetically validated IHC assay was available for a subset of 938 tumors. Associations between PTEN and ERG status were assessed using Fisher's exact test. Kaplan-Meier and multivariate weighted Cox proportional models for RFS were constructed. When compared to intact PTEN, homogeneous (hazard ratio [HR] 1.66, p = 0.001) but not heterogeneous (HR 1.24, p = 0.14) PTEN loss was significantly associated with shorter RFS in multivariate models. Among ERG-positive tumors, homogeneous (HR 3.07, p < 0.0001) but not heterogeneous (HR 1.46, p = 0.10) PTEN loss was significantly associated with shorter RFS. Among ERG-negative tumors, PTEN did not reach significance for inclusion in the final multivariate models. The interaction term for PTEN and ERG status with respect to RFS did not reach statistical significance ( p = 0.11) for the current sample size. These data suggest that PTEN is a useful prognostic biomarker and that there is no statistically significant interaction between PTEN and ERG status for RFS. We found that loss of the PTEN tumor suppressor gene in prostate tumors as assessed by tissue staining is correlated with shorter time to prostate cancer recurrence after radical prostatectomy.

  7. Regression analysis for LED color detection of visual-MIMO system

    NASA Astrophysics Data System (ADS)

    Banik, Partha Pratim; Saha, Rappy; Kim, Ki-Doo

    2018-04-01

    Color detection from a light emitting diode (LED) array using a smartphone camera is very difficult in a visual multiple-input multiple-output (visual-MIMO) system. In this paper, we propose a method to determine the LED color using a smartphone camera by applying regression analysis. We employ a multivariate regression model to identify the LED color. After taking a picture of an LED array, we select the LED array region, and detect the LED using an image processing algorithm. We then apply the k-means clustering algorithm to determine the number of potential colors for feature extraction of each LED. Finally, we apply the multivariate regression model to predict the color of the transmitted LEDs. In this paper, we show our results for three types of environmental light condition: room environmental light, low environmental light (560 lux), and strong environmental light (2450 lux). We compare the results of our proposed algorithm from the analysis of training and test R-Square (%) values, percentage of closeness of transmitted and predicted colors, and we also mention about the number of distorted test data points from the analysis of distortion bar graph in CIE1931 color space.

  8. Mammalian cell culture monitoring using in situ spectroscopy: Is your method really optimised?

    PubMed

    André, Silvère; Lagresle, Sylvain; Hannas, Zahia; Calvosa, Éric; Duponchel, Ludovic

    2017-03-01

    In recent years, as a result of the process analytical technology initiative of the US Food and Drug Administration, many different works have been carried out on direct and in situ monitoring of critical parameters for mammalian cell cultures by Raman spectroscopy and multivariate regression techniques. However, despite interesting results, it cannot be said that the proposed monitoring strategies, which will reduce errors of the regression models and thus confidence limits of the predictions, are really optimized. Hence, the aim of this article is to optimize some critical steps of spectroscopic acquisition and data treatment in order to reach a higher level of accuracy and robustness of bioprocess monitoring. In this way, we propose first an original strategy to assess the most suited Raman acquisition time for the processes involved. In a second part, we demonstrate the importance of the interbatch variability on the accuracy of the predictive models with a particular focus on the optical probes adjustment. Finally, we propose a methodology for the optimization of the spectral variables selection in order to decrease prediction errors of multivariate regressions. © 2017 American Institute of Chemical Engineers Biotechnol. Prog., 33:308-316, 2017. © 2017 American Institute of Chemical Engineers.

  9. Deconstructing multivariate decoding for the study of brain function.

    PubMed

    Hebart, Martin N; Baker, Chris I

    2017-08-04

    Multivariate decoding methods were developed originally as tools to enable accurate predictions in real-world applications. The realization that these methods can also be employed to study brain function has led to their widespread adoption in the neurosciences. However, prior to the rise of multivariate decoding, the study of brain function was firmly embedded in a statistical philosophy grounded on univariate methods of data analysis. In this way, multivariate decoding for brain interpretation grew out of two established frameworks: multivariate decoding for predictions in real-world applications, and classical univariate analysis based on the study and interpretation of brain activation. We argue that this led to two confusions, one reflecting a mixture of multivariate decoding for prediction or interpretation, and the other a mixture of the conceptual and statistical philosophies underlying multivariate decoding and classical univariate analysis. Here we attempt to systematically disambiguate multivariate decoding for the study of brain function from the frameworks it grew out of. After elaborating these confusions and their consequences, we describe six, often unappreciated, differences between classical univariate analysis and multivariate decoding. We then focus on how the common interpretation of what is signal and noise changes in multivariate decoding. Finally, we use four examples to illustrate where these confusions may impact the interpretation of neuroimaging data. We conclude with a discussion of potential strategies to help resolve these confusions in interpreting multivariate decoding results, including the potential departure from multivariate decoding methods for the study of brain function. Copyright © 2017. Published by Elsevier Inc.

  10. Application and validation of Cox regression models in a single-center series of double kidney transplantation.

    PubMed

    Santori, G; Fontana, I; Bertocchi, M; Gasloli, G; Magoni Rossi, A; Tagliamacco, A; Barocci, S; Nocera, A; Valente, U

    2010-05-01

    A useful approach to reduce the number of discarded marginal kidneys and to increase the nephron mass is double kidney transplantation (DKT). In this study, we retrospectively evaluated the potential predictors for patient and graft survival in a single-center series of 59 DKT procedures performed between April 21, 1999, and September 21, 2008. The kidney recipients of mean age 63.27 +/- 5.17 years included 16 women (27%) and 43 men (73%). The donors of mean age 69.54 +/- 7.48 years included 32 women (54%) and 27 men (46%). The mean posttransplant dialysis time was 2.37 +/- 3.61 days. The mean hospitalization was 20.12 +/- 13.65 days. Average serum creatinine (SCr) at discharge was 1.5 +/- 0.59 mg/dL. In view of the limited numbers of recipient deaths (n = 4) and graft losses (n = 8) that occurred in our series, the proportional hazards assumption for each Cox regression model with P < .05 was tested by using correlation coefficients between transformed survival times and scaled Schoenfeld residuals, and checked with smoothed plots of Schoenfeld residuals. For patient survival, the variables that reached statistical significance were donor SCr (P = .007), donor creatinine cleararance (P = .023), and recipient age (P = .047). Each significant model passed the Schoenfeld test. By entering these variables into a multivariate Cox model for patient survival, no further significance was observed. In the univariate Cox models performed for graft survival, statistical significance was noted for donor SCr (P = .027), SCr 3 months post-DKT (P = .043), and SCr 6 months post-DKT (P = .017). All significant univariate models for graft survival passed the Schoenfeld test. A final multivariate model retained SCr at 6 months (beta = 1.746, P = .042) and donor SCr (beta = .767, P = .090). In our analysis, SCr at 6 months seemed to emerge from both univariate and multivariate Cox models as a potential predictor of graft survival among DKT. Multicenter studies with larger recipient populations and more graft losses should be performed to confirm our findings. Copyright (c) 2010 Elsevier Inc. All rights reserved.

  11. Robust Nonlinear Feedback Control of Aircraft Propulsion Systems

    NASA Technical Reports Server (NTRS)

    Garrard, William L.; Balas, Gary J.; Litt, Jonathan (Technical Monitor)

    2001-01-01

    This is the final report on the research performed under NASA Glen grant NASA/NAG-3-1975 concerning feedback control of the Pratt & Whitney (PW) STF 952, a twin spool, mixed flow, after burning turbofan engine. The research focussed on the design of linear and gain-scheduled, multivariable inner-loop controllers for the PW turbofan engine using H-infinity and linear, parameter-varying (LPV) control techniques. The nonlinear turbofan engine simulation was provided by PW within the NASA Rocket Engine Transient Simulator (ROCETS) simulation software environment. ROCETS was used to generate linearized models of the turbofan engine for control design and analysis as well as the simulation environment to evaluate the performance and robustness of the controllers. Comparison between the H-infinity, and LPV controllers are made with the baseline multivariable controller and developed by Pratt & Whitney engineers included in the ROCETS simulation. Simulation results indicate that H-infinity and LPV techniques effectively achieve desired response characteristics with minimal cross coupling between commanded values and are very robust to unmodeled dynamics and sensor noise.

  12. Predictive Utility of Brief Alcohol Use Disorders Identification Test (AUDIT) for human immunodeficiency virus antiretroviral medication nonadherence.

    PubMed

    Broyles, Lauren Matukaitis; Gordon, Adam J; Sereika, Susan M; Ryan, Christopher M; Erlen, Judith A

    2011-10-01

    Alcohol use negatively affects adherence to antiretroviral therapy (ART), thus human immunodeficiency virus/acquired immunodeficiency syndrome (HIV/AIDS) care providers need accurate, efficient assessments of alcohol use. Using existing data from an efficacy trial of 2 cognitive-behavioral ART adherence interventions, the authors sought to determine if results on 2 common alcohol screening tests (Alcohol Use Disorders Identification Test--Consumption [AUDIT-C] and its binge-related question [AUDIT-3]) predict ART nonadherence. Twenty-seven percent of the sample (n = 308) were positive on the AUDIT-C and 34% were positive on the AUDIT-3. In multivariate analyses, AUDIT-C-positive status predicted ART nonadherence after controlling for race, age, conscientiousness, and self-efficacy (P = .036). Although AUDIT-3-positive status was associated with ART nonadherence in unadjusted analyses, this relationship was not maintained in the final multivariate model. The AUDIT-C shows potential as an indirect screening tool for both at-risk drinking and ART nonadherence, underscoring the relationship between alcohol and chronic disease management.

  13. Associations between social support network characteristics and receipt of emotional and material support among a sample of male sexual minority youth

    PubMed Central

    Kapadia, Farzana; Halkitis, Perry; Barton, Staci; Siconolfi, Daniel; Figueroa, Rafael Perez

    2014-01-01

    Few studies have examined how social support network characteristics are related to perceived receipt of social support among male sexual minority youth. Using egocentric network data collected from a study of male sexual minority youth (n=592), multivariable logistic regression analyses examined distinct associations between individual and social network characteristics with receipt of (1) emotional and (2) material support. In multivariable models, frequent communication and having friends in one’s network yielded a two-fold increase in the likelihood of receiving emotional support whereas frequent communication was associated with an almost three-fold higher likelihood of perceived material support. Finally, greater internalized homophobia and personal experiences of gay-related stigma were inversely associated with perceived receipt of emotional and material support, respectively. Understanding the evolving social context and social interactions of this new generation of male sexual minority youth is warranted in order to understand the broader, contextual factors associated with their overall health and well-being. PMID:25214756

  14. Predictive Utility of Brief AUDIT for HIV Antiretroviral Medication Nonadherence

    PubMed Central

    Broyles, Lauren Matukaitis; Gordon, Adam J.; Sereika, Susan M.; Ryan, Christopher M.; Erlen, Judith A.

    2012-01-01

    Alcohol use negatively affects adherence to antiretroviral therapy (ART), thus HIV/AIDS providers need accurate, efficient assessments of alcohol use. Using existing data from an efficacy trial of two cognitive-behavioral ART adherence interventions, we sought to determine if results on two common alcohol screening tests (Alcohol Use Disorders Identification Test—Consumption (AUDIT-C) and its binge-related question (AUDIT-3)) predict ART nonadherence. Twenty seven percent of the sample (n=308) were positive on the AUDIT-C and 34% were positive on the AUDIT-3. In multivariate analyses, AUDIT-C positive status predicted ART nonadherence after controlling for race, age, conscientiousness, and self-efficacy (p=.036). While AUDIT-3 positive status was associated with ART nonadherence in unadjusted analyses, this relationship was not maintained in the final multivariate model. The AUDIT-C shows potential as an indirect screening tool for both at-risk drinking and ART nonadherence, underscoring the relationship between alcohol and chronic disease management. PMID:22014256

  15. Blackberry wines mineral and heavy metal content determination after dry ashing: multivariate data analysis as a tool for fruit wine quality control.

    PubMed

    Amidžić Klarić, Daniela; Klarić, Ilija; Mornar, Ana; Velić, Darko; Velić, Natalija

    2015-08-01

    This study brings out the data on the content of 21 mineral and heavy metal in 15 blackberry wines made of conventionally and organically grown blackberries. The objective of this study was to classify the blackberry wine samples based on their mineral composition and the applied cultivation method of the starting raw material by using chemometric analysis. The metal content of Croatian blackberry wine samples was determined by AAS after dry ashing. The comparison between an organic and conventional group of investigated blackberry wines showed statistically significant difference in concentrations of Si and Li, where the organic group contained higher concentrations of these compounds. According to multivariate data analysis, the model based on the original metal content data set finally included seven original variables (K, Fe, Mn, Cu, Ba, Cd and Cr) and gave a satisfactory separation of two applied cultivation methods of the starting raw material.

  16. Multivariate Analysis and Modeling of Sediment Pollution Using Neural Network Models and Geostatistics

    NASA Astrophysics Data System (ADS)

    Golay, Jean; Kanevski, Mikhaïl

    2013-04-01

    The present research deals with the exploration and modeling of a complex dataset of 200 measurement points of sediment pollution by heavy metals in Lake Geneva. The fundamental idea was to use multivariate Artificial Neural Networks (ANN) along with geostatistical models and tools in order to improve the accuracy and the interpretability of data modeling. The results obtained with ANN were compared to those of traditional geostatistical algorithms like ordinary (co)kriging and (co)kriging with an external drift. Exploratory data analysis highlighted a great variety of relationships (i.e. linear, non-linear, independence) between the 11 variables of the dataset (i.e. Cadmium, Mercury, Zinc, Copper, Titanium, Chromium, Vanadium and Nickel as well as the spatial coordinates of the measurement points and their depth). Then, exploratory spatial data analysis (i.e. anisotropic variography, local spatial correlations and moving window statistics) was carried out. It was shown that the different phenomena to be modeled were characterized by high spatial anisotropies, complex spatial correlation structures and heteroscedasticity. A feature selection procedure based on General Regression Neural Networks (GRNN) was also applied to create subsets of variables enabling to improve the predictions during the modeling phase. The basic modeling was conducted using a Multilayer Perceptron (MLP) which is a workhorse of ANN. MLP models are robust and highly flexible tools which can incorporate in a nonlinear manner different kind of high-dimensional information. In the present research, the input layer was made of either two (spatial coordinates) or three neurons (when depth as auxiliary information could possibly capture an underlying trend) and the output layer was composed of one (univariate MLP) to eight neurons corresponding to the heavy metals of the dataset (multivariate MLP). MLP models with three input neurons can be referred to as Artificial Neural Networks with EXternal drift (ANNEX). Moreover, the exact number of output neurons and the selection of the corresponding variables were based on the subsets created during the exploratory phase. Concerning hidden layers, no restriction were made and multiple architectures were tested. For each MLP model, the quality of the modeling procedure was assessed by variograms: if the variogram of the residuals demonstrates pure nugget effect and if the level of the nugget exactly corresponds to the nugget value of the theoretical variogram of the corresponding variable, all the structured information has been correctly extracted without overfitting. Finally, it is worth mentioning that simple MLP models are not always able to remove all the spatial correlation structure from the data. In that case, Neural Network Residual Kriging (NNRK) can be carried out and risk assessment can be conducted with Neural Network Residual Simulations (NNRS). Finally, the results of the ANNEX models were compared to those of ordinary (co)kriging and (co)kriging with an external drift. It was shown that the ANNEX models performed better than traditional geostatistical algorithms when the relationship between the variable of interest and the auxiliary predictor was not linear. References Kanevski, M. and Maignan, M. (2004). Analysis and Modelling of Spatial Environmental Data. Lausanne: EPFL Press.

  17. Multivariate Models of Parent-Late Adolescent Gender Dyads: The Importance of Parenting Processes in Predicting Adjustment

    ERIC Educational Resources Information Center

    McKinney, Cliff; Renk, Kimberly

    2008-01-01

    Although parent-adolescent interactions have been examined, relevant variables have not been integrated into a multivariate model. As a result, this study examined a multivariate model of parent-late adolescent gender dyads in an attempt to capture important predictors in late adolescents' important and unique transition to adulthood. The sample…

  18. Design of neural networks for classification of remotely sensed imagery

    NASA Technical Reports Server (NTRS)

    Chettri, Samir R.; Cromp, Robert F.; Birmingham, Mark

    1992-01-01

    Classification accuracies of a backpropagation neural network are discussed and compared with a maximum likelihood classifier (MLC) with multivariate normal class models. We have found that, because of its nonparametric nature, the neural network outperforms the MLC in this area. In addition, we discuss techniques for constructing optimal neural nets on parallel hardware like the MasPar MP-1 currently at GSFC. Other important discussions are centered around training and classification times of the two methods, and sensitivity to the training data. Finally, we discuss future work in the area of classification and neural nets.

  19. TH-E-BRF-03: A Multivariate Interaction Model for Assessment of Hippocampal Vascular Dose-Response and Early Prediction of Radiation-Induced Neurocognitive Dysfunction

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Farjam, R; Pramanik, P; Srinivasan, A

    Purpose: Vascular injury could be a cause of hippocampal dysfunction leading to late neurocognitive decline in patients receiving brain radiotherapy (RT). Hence, our aim was to develop a multivariate interaction model for characterization of hippocampal vascular dose-response and early prediction of radiation-induced late neurocognitive impairments. Methods: 27 patients (17 males and 10 females, age 31–80 years) were enrolled in an IRB-approved prospective longitudinal study. All patients were diagnosed with a low-grade glioma or benign tumor and treated by 3-D conformal or intensity-modulated RT with a median dose of 54 Gy (50.4–59.4 Gy in 1.8− Gy fractions). Six DCE-MRI scans weremore » performed from pre-RT to 18 months post-RT. DCE data were fitted to the modified Toft model to obtain the transfer constant of gadolinium influx from the intravascular space into the extravascular extracellular space, Ktrans, and the fraction of blood plasma volume, Vp. The hippocampus vascular property alterations after starting RT were characterized by changes in the hippocampal mean values of, μh(Ktrans)τ and μh(Vp)τ. The dose-response, Δμh(Ktrans/Vp)pre->τ, was modeled using a multivariate linear regression considering integrations of doses with age, sex, hippocampal laterality and presence of tumor/edema near a hippocampus. Finally, the early vascular dose-response in hippocampus was correlated with neurocognitive decline 6 and 18 months post-RT. Results: The μh(Ktrans) increased significantly from pre-RT to 1 month post-RT (p<0.0004). The multivariate model showed that the dose effect on Δμh(Ktrans)pre->1M post-RT was interacted with sex (p<0.0007) and age (p<0.00004), with the dose-response more pronounced in older females. Also, the vascular dose-response in the left hippocampus of females was significantly correlated with memory function decline at 6 (r = − 0.95, p<0.0006) and 18 (r = −0.88, p<0.02) months post-RT. Conclusion: The hippocampal vascular response to radiation could be sex and age dependent. The early hippocampal vascular dose-response could predict late neurocognitive dysfunction. (Support: NIH-RO1NS064973)« less

  20. A multivariate model and statistical method for validating tree grade lumber yield equations

    Treesearch

    Donald W. Seegrist

    1975-01-01

    Lumber yields within lumber grades can be described by a multivariate linear model. A method for validating lumber yield prediction equations when there are several tree grades is presented. The method is based on multivariate simultaneous test procedures.

  1. Multivariate and Multiscale Data Assimilation in Terrestrial Systems: A Review

    PubMed Central

    Montzka, Carsten; Pauwels, Valentijn R. N.; Franssen, Harrie-Jan Hendricks; Han, Xujun; Vereecken, Harry

    2012-01-01

    More and more terrestrial observational networks are being established to monitor climatic, hydrological and land-use changes in different regions of the World. In these networks, time series of states and fluxes are recorded in an automated manner, often with a high temporal resolution. These data are important for the understanding of water, energy, and/or matter fluxes, as well as their biological and physical drivers and interactions with and within the terrestrial system. Similarly, the number and accuracy of variables, which can be observed by spaceborne sensors, are increasing. Data assimilation (DA) methods utilize these observations in terrestrial models in order to increase process knowledge as well as to improve forecasts for the system being studied. The widely implemented automation in observing environmental states and fluxes makes an operational computation more and more feasible, and it opens the perspective of short-time forecasts of the state of terrestrial systems. In this paper, we review the state of the art with respect to DA focusing on the joint assimilation of observational data precedents from different spatial scales and different data types. An introduction is given to different DA methods, such as the Ensemble Kalman Filter (EnKF), Particle Filter (PF) and variational methods (3/4D-VAR). In this review, we distinguish between four major DA approaches: (1) univariate single-scale DA (UVSS), which is the approach used in the majority of published DA applications, (2) univariate multiscale DA (UVMS) referring to a methodology which acknowledges that at least some of the assimilated data are measured at a different scale than the computational grid scale, (3) multivariate single-scale DA (MVSS) dealing with the assimilation of at least two different data types, and (4) combined multivariate multiscale DA (MVMS). Finally, we conclude with a discussion on the advantages and disadvantages of the assimilation of multiple data types in a simulation model. Existing approaches can be used to simultaneously update several model states and model parameters if applicable. In other words, the basic principles for multivariate data assimilation are already available. We argue that a better understanding of the measurement errors for different observation types, improved estimates of observation bias and improved multiscale assimilation methods for data which scale nonlinearly is important to properly weight them in multiscale multivariate data assimilation. In this context, improved cross-validation of different data types, and increased ground truth verification of remote sensing products are required. PMID:23443380

  2. Multivariate and multiscale data assimilation in terrestrial systems: a review.

    PubMed

    Montzka, Carsten; Pauwels, Valentijn R N; Franssen, Harrie-Jan Hendricks; Han, Xujun; Vereecken, Harry

    2012-11-26

    More and more terrestrial observational networks are being established to monitor climatic, hydrological and land-use changes in different regions of the World. In these networks, time series of states and fluxes are recorded in an automated manner, often with a high temporal resolution. These data are important for the understanding of water, energy, and/or matter fluxes, as well as their biological and physical drivers and interactions with and within the terrestrial system. Similarly, the number and accuracy of variables, which can be observed by spaceborne sensors, are increasing. Data assimilation (DA) methods utilize these observations in terrestrial models in order to increase process knowledge as well as to improve forecasts for the system being studied. The widely implemented automation in observing environmental states and fluxes makes an operational computation more and more feasible, and it opens the perspective of short-time forecasts of the state of terrestrial systems. In this paper, we review the state of the art with respect to DA focusing on the joint assimilation of observational data precedents from different spatial scales and different data types. An introduction is given to different DA methods, such as the Ensemble Kalman Filter (EnKF), Particle Filter (PF) and variational methods (3/4D-VAR). In this review, we distinguish between four major DA approaches: (1) univariate single-scale DA (UVSS), which is the approach used in the majority of published DA applications, (2) univariate multiscale DA (UVMS) referring to a methodology which acknowledges that at least some of the assimilated data are measured at a different scale than the computational grid scale, (3) multivariate single-scale DA (MVSS) dealing with the assimilation of at least two different data types, and (4) combined multivariate multiscale DA (MVMS). Finally, we conclude with a discussion on the advantages and disadvantages of the assimilation of multiple data types in a simulation model. Existing approaches can be used to simultaneously update several model states and model parameters if applicable. In other words, the basic principles for multivariate data assimilation are already available. We argue that a better understanding of the measurement errors for different observation types, improved estimates of observation bias and improved multiscale assimilation methods for data which scale nonlinearly is important to properly weight them in multiscale multivariate data assimilation. In this context, improved cross-validation of different data types, and increased ground truth verification of remote sensing products are required.

  3. Multivariate Feature Selection of Image Descriptors Data for Breast Cancer with Computer-Assisted Diagnosis

    PubMed Central

    Galván-Tejada, Carlos E.; Zanella-Calzada, Laura A.; Galván-Tejada, Jorge I.; Celaya-Padilla, José M.; Gamboa-Rosales, Hamurabi; Garza-Veloz, Idalia; Martinez-Fierro, Margarita L.

    2017-01-01

    Breast cancer is an important global health problem, and the most common type of cancer among women. Late diagnosis significantly decreases the survival rate of the patient; however, using mammography for early detection has been demonstrated to be a very important tool increasing the survival rate. The purpose of this paper is to obtain a multivariate model to classify benign and malignant tumor lesions using a computer-assisted diagnosis with a genetic algorithm in training and test datasets from mammography image features. A multivariate search was conducted to obtain predictive models with different approaches, in order to compare and validate results. The multivariate models were constructed using: Random Forest, Nearest centroid, and K-Nearest Neighbor (K-NN) strategies as cost function in a genetic algorithm applied to the features in the BCDR public databases. Results suggest that the two texture descriptor features obtained in the multivariate model have a similar or better prediction capability to classify the data outcome compared with the multivariate model composed of all the features, according to their fitness value. This model can help to reduce the workload of radiologists and present a second opinion in the classification of tumor lesions. PMID:28216571

  4. Multivariate Feature Selection of Image Descriptors Data for Breast Cancer with Computer-Assisted Diagnosis.

    PubMed

    Galván-Tejada, Carlos E; Zanella-Calzada, Laura A; Galván-Tejada, Jorge I; Celaya-Padilla, José M; Gamboa-Rosales, Hamurabi; Garza-Veloz, Idalia; Martinez-Fierro, Margarita L

    2017-02-14

    Breast cancer is an important global health problem, and the most common type of cancer among women. Late diagnosis significantly decreases the survival rate of the patient; however, using mammography for early detection has been demonstrated to be a very important tool increasing the survival rate. The purpose of this paper is to obtain a multivariate model to classify benign and malignant tumor lesions using a computer-assisted diagnosis with a genetic algorithm in training and test datasets from mammography image features. A multivariate search was conducted to obtain predictive models with different approaches, in order to compare and validate results. The multivariate models were constructed using: Random Forest, Nearest centroid, and K-Nearest Neighbor (K-NN) strategies as cost function in a genetic algorithm applied to the features in the BCDR public databases. Results suggest that the two texture descriptor features obtained in the multivariate model have a similar or better prediction capability to classify the data outcome compared with the multivariate model composed of all the features, according to their fitness value. This model can help to reduce the workload of radiologists and present a second opinion in the classification of tumor lesions.

  5. A radiomics model from joint FDG-PET and MRI texture features for the prediction of lung metastases in soft-tissue sarcomas of the extremities

    NASA Astrophysics Data System (ADS)

    Vallières, M.; Freeman, C. R.; Skamene, S. R.; El Naqa, I.

    2015-07-01

    This study aims at developing a joint FDG-PET and MRI texture-based model for the early evaluation of lung metastasis risk in soft-tissue sarcomas (STSs). We investigate if the creation of new composite textures from the combination of FDG-PET and MR imaging information could better identify aggressive tumours. Towards this goal, a cohort of 51 patients with histologically proven STSs of the extremities was retrospectively evaluated. All patients had pre-treatment FDG-PET and MRI scans comprised of T1-weighted and T2-weighted fat-suppression sequences (T2FS). Nine non-texture features (SUV metrics and shape features) and forty-one texture features were extracted from the tumour region of separate (FDG-PET, T1 and T2FS) and fused (FDG-PET/T1 and FDG-PET/T2FS) scans. Volume fusion of the FDG-PET and MRI scans was implemented using the wavelet transform. The influence of six different extraction parameters on the predictive value of textures was investigated. The incorporation of features into multivariable models was performed using logistic regression. The multivariable modeling strategy involved imbalance-adjusted bootstrap resampling in the following four steps leading to final prediction model construction: (1) feature set reduction; (2) feature selection; (3) prediction performance estimation; and (4) computation of model coefficients. Univariate analysis showed that the isotropic voxel size at which texture features were extracted had the most impact on predictive value. In multivariable analysis, texture features extracted from fused scans significantly outperformed those from separate scans in terms of lung metastases prediction estimates. The best performance was obtained using a combination of four texture features extracted from FDG-PET/T1 and FDG-PET/T2FS scans. This model reached an area under the receiver-operating characteristic curve of 0.984 ± 0.002, a sensitivity of 0.955 ± 0.006, and a specificity of 0.926 ± 0.004 in bootstrapping evaluations. Ultimately, lung metastasis risk assessment at diagnosis of STSs could improve patient outcomes by allowing better treatment adaptation.

  6. Multivariate Longitudinal Analysis with Bivariate Correlation Test

    PubMed Central

    Adjakossa, Eric Houngla; Sadissou, Ibrahim; Hounkonnou, Mahouton Norbert; Nuel, Gregory

    2016-01-01

    In the context of multivariate multilevel data analysis, this paper focuses on the multivariate linear mixed-effects model, including all the correlations between the random effects when the dimensional residual terms are assumed uncorrelated. Using the EM algorithm, we suggest more general expressions of the model’s parameters estimators. These estimators can be used in the framework of the multivariate longitudinal data analysis as well as in the more general context of the analysis of multivariate multilevel data. By using a likelihood ratio test, we test the significance of the correlations between the random effects of two dependent variables of the model, in order to investigate whether or not it is useful to model these dependent variables jointly. Simulation studies are done to assess both the parameter recovery performance of the EM estimators and the power of the test. Using two empirical data sets which are of longitudinal multivariate type and multivariate multilevel type, respectively, the usefulness of the test is illustrated. PMID:27537692

  7. Multivariate spatial models of excess crash frequency at area level: case of Costa Rica.

    PubMed

    Aguero-Valverde, Jonathan

    2013-10-01

    Recently, areal models of crash frequency have being used in the analysis of various area-wide factors affecting road crashes. On the other hand, disease mapping methods are commonly used in epidemiology to assess the relative risk of the population at different spatial units. A natural next step is to combine these two approaches to estimate the excess crash frequency at area level as a measure of absolute crash risk. Furthermore, multivariate spatial models of crash severity are explored in order to account for both frequency and severity of crashes and control for the spatial correlation frequently found in crash data. This paper aims to extent the concept of safety performance functions to be used in areal models of crash frequency. A multivariate spatial model is used for that purpose and compared to its univariate counterpart. Full Bayes hierarchical approach is used to estimate the models of crash frequency at canton level for Costa Rica. An intrinsic multivariate conditional autoregressive model is used for modeling spatial random effects. The results show that the multivariate spatial model performs better than its univariate counterpart in terms of the penalized goodness-of-fit measure Deviance Information Criteria. Additionally, the effects of the spatial smoothing due to the multivariate spatial random effects are evident in the estimation of excess equivalent property damage only crashes. Copyright © 2013 Elsevier Ltd. All rights reserved.

  8. Determinants of Paramedic Response Readiness for CBRNE Threats

    PubMed Central

    Jones, Alison; Smith, George; Nelson, Jenny; Agho, Kingsley; Taylor, Melanie; Raphael, Beverley

    2010-01-01

    Paramedics play a pivotal role in the response to major emergencies. Recent evidence indicates that their confidence and willingness to respond to chemical, biological, radiological, nuclear, and explosives-related (CBRNE) incidents differs from that relating to their “routine” emergency work. To further investigate the factors underpinning their readiness to respond to CBRNE incidents, paramedics in New South Wales (NSW), Australia, were asked to complete a validated online survey instrument. Univariate and multivariate analyses were performed to examine associated factors determining readiness. The sample of 663 respondents was weighted to reflect the NSW paramedic population as a whole. The univariate analysis indicated that gender, length of service, deployment concern, perceived personal resilience, CBRNE training, and incident experience were significantly associated with perceived CBRNE response readiness. In the initial multivariate analysis, significantly higher response readiness was associated with male gender, university education, and greater length of service (10-15 years). In the final multivariate model, the combined effect of training/incident experience negated the significant effects observed in the initial model and, importantly, showed that those with recent training reported higher readiness, irrespective of incident experience. Those with lower concern regarding CBRNE deployment and those with higher personal resilience were significantly more likely to report higher readiness (Adjusted Relative Risk [ARR] = 0.91, 95% CI: 0.84-0.99; ARR = 1.40, 95% CI: 1.11-1.72, respectively). These findings will assist emergency medical planners in recognizing occupational and dispositional factors associated with enhanced CBRNE readiness and highlight the important role of training in redressing potential readiness differences associated with these factors. PMID:20569060

  9. DOE Office of Scientific and Technical Information (OSTI.GOV)

    VanderWalde, Noam A.; Lineberger Comprehensive Cancer Center, University of North Carolina at Chapel Hill, Chapel Hill, North Carolina; Meyer, Anne Marie

    Purpose: The purpose of this study was to compare chemoradiation therapy (CRT) with radiation therapy (RT) only in an older patient population with head and neck squamous cell carcinoma (HNSCC). Methods and Materials: Using the Surveillance, Epidemiology, and End Results (SEER)-Medicare linked database (1992-2007), we identified a retrospective cohort of nonmetastatic HNSCC patients and divided them into treatment groups. Comparisons were made between CRT and RT cohorts. Propensity scores for CRT were estimated from covariates associated with receipt of treatment using multivariable logistic regression. Standardized mortality ratio weights (SMRW) were created from the propensity scores and used to balance groupsmore » on measured confounders. Multivariable and SMR-weighted Cox proportional hazard models were used to estimate the hazard ratio (HR) of death for receipt of CRT versus RT among the whole group and for separate patient and tumor categories. Results: The final cohort of 10,599 patients was 68% male and 89% white. Median age was 74 years. Seventy-four percent were treated with RT, 26% were treated with CRT. Median follow-up points for CRT and RT survivors were 4.6 and 6.3 years, respectively. On multivariable analysis, HR for death with CRT was 1.13 (95% confidence interval [CI]: 1.07-1.20; P<.01). Using the SMRW model, the HR for death with CRT was 1.08 (95% CI: 1.02-1.15; P=.01). Conclusions: Although the addition of chemotherapy to radiation has proven efficacious in many randomized controlled trials, it may be less effective in an older patient population treated outside of a controlled trial setting.« less

  10. Multivariable control of the Space Shuttle Remote Manipulator System using linearization by state feedback

    NASA Technical Reports Server (NTRS)

    Gettman, Chang-Ching L.; Adams, Neil; Bedrossian, Nazareth; Valavani, Lena

    1993-01-01

    This paper demonstrates an approach to nonlinear control system design that uses linearization by state feedback to allow faster maneuvering of payloads by the Shuttle Remote Manipulator System (SRMS). A nonlinear feedback law is defined to cancel the nonlinear plant dynamics so that a linear controller can be designed for the SRMS. First a nonlinear design model was generated via SIMULINK. This design model included nonlinear arm dynamics derived from the Lagrangian approach, linearized servo model, and linearized gearbox model. The current SRMS position hold controller was implemented on this system. Next, a trajectory was defined using a rigid body kinematics SRMS tool, KRMS. The maneuver was simulated. Finally, higher bandwidth controllers were developed. Results of the new controllers were compared with the existing SRMS automatic control modes for the Space Station Freedom Mission Build 4 Payload extended on the SRMS.

  11. Microbial Performance of Food Safety Control and Assurance Activities in a Fresh Produce Processing Sector Measured Using a Microbial Assessment Scheme and Statistical Modeling.

    PubMed

    Njage, Patrick Murigu Kamau; Sawe, Chemutai Tonui; Onyango, Cecilia Moraa; Habib, I; Njagi, Edmund Njeru; Aerts, Marc; Molenberghs, Geert

    2017-01-01

    Current approaches such as inspections, audits, and end product testing cannot detect the distribution and dynamics of microbial contamination. Despite the implementation of current food safety management systems, foodborne outbreaks linked to fresh produce continue to be reported. A microbial assessment scheme and statistical modeling were used to systematically assess the microbial performance of core control and assurance activities in five Kenyan fresh produce processing and export companies. Generalized linear mixed models and correlated random-effects joint models for multivariate clustered data followed by empirical Bayes estimates enabled the analysis of the probability of contamination across critical sampling locations (CSLs) and factories as a random effect. Salmonella spp. and Listeria monocytogenes were not detected in the final products. However, none of the processors attained the maximum safety level for environmental samples. Escherichia coli was detected in five of the six CSLs, including the final product. Among the processing-environment samples, the hand or glove swabs of personnel revealed a higher level of predicted contamination with E. coli , and 80% of the factories were E. coli positive at this CSL. End products showed higher predicted probabilities of having the lowest level of food safety compared with raw materials. The final products were E. coli positive despite the raw materials being E. coli negative for 60% of the processors. There was a higher probability of contamination with coliforms in water at the inlet than in the final rinse water. Four (80%) of the five assessed processors had poor to unacceptable counts of Enterobacteriaceae on processing surfaces. Personnel-, equipment-, and product-related hygiene measures to improve the performance of preventive and intervention measures are recommended.

  12. Comprehensive analysis of Polygoni Multiflori Radix of different geographical origins using ultra-high-performance liquid chromatography fingerprints and multivariate chemometric methods.

    PubMed

    Sun, Li-Li; Wang, Meng; Zhang, Hui-Jie; Liu, Ya-Nan; Ren, Xiao-Liang; Deng, Yan-Ru; Qi, Ai-Di

    2018-01-01

    Polygoni Multiflori Radix (PMR) is increasingly being used not just as a traditional herbal medicine but also as a popular functional food. In this study, multivariate chemometric methods and mass spectrometry were combined to analyze the ultra-high-performance liquid chromatograph (UPLC) fingerprints of PMR from six different geographical origins. A chemometric strategy based on multivariate curve resolution-alternating least squares (MCR-ALS) and three classification methods is proposed to analyze the UPLC fingerprints obtained. Common chromatographic problems, including the background contribution, baseline contribution, and peak overlap, were handled by the established MCR-ALS model. A total of 22 components were resolved. Moreover, relative species concentrations were obtained from the MCR-ALS model, which was used for multivariate classification analysis. Principal component analysis (PCA) and Ward's method have been applied to classify 72 PMR samples from six different geographical regions. The PCA score plot showed that the PMR samples fell into four clusters, which related to the geographical location and climate of the source areas. The results were then corroborated by Ward's method. In addition, according to the variance-weighted distance between cluster centers obtained from Ward's method, five components were identified as the most significant variables (chemical markers) for cluster discrimination. A counter-propagation artificial neural network has been applied to confirm and predict the effects of chemical markers on different samples. Finally, the five chemical markers were identified by UPLC-quadrupole time-of-flight mass spectrometer. Components 3, 12, 16, 18, and 19 were identified as 2,3,5,4'-tetrahydroxy-stilbene-2-O-β-d-glucoside, emodin-8-O-β-d-glucopyranoside, emodin-8-O-(6'-O-acetyl)-β-d-glucopyranoside, emodin, and physcion, respectively. In conclusion, the proposed method can be applied for the comprehensive analysis of natural samples. Copyright © 2016. Published by Elsevier B.V.

  13. Groundwater potential mapping using C5.0, random forest, and multivariate adaptive regression spline models in GIS.

    PubMed

    Golkarian, Ali; Naghibi, Seyed Amir; Kalantar, Bahareh; Pradhan, Biswajeet

    2018-02-17

    Ever increasing demand for water resources for different purposes makes it essential to have better understanding and knowledge about water resources. As known, groundwater resources are one of the main water resources especially in countries with arid climatic condition. Thus, this study seeks to provide groundwater potential maps (GPMs) employing new algorithms. Accordingly, this study aims to validate the performance of C5.0, random forest (RF), and multivariate adaptive regression splines (MARS) algorithms for generating GPMs in the eastern part of Mashhad Plain, Iran. For this purpose, a dataset was produced consisting of spring locations as indicator and groundwater-conditioning factors (GCFs) as input. In this research, 13 GCFs were selected including altitude, slope aspect, slope angle, plan curvature, profile curvature, topographic wetness index (TWI), slope length, distance from rivers and faults, rivers and faults density, land use, and lithology. The mentioned dataset was divided into two classes of training and validation with 70 and 30% of the springs, respectively. Then, C5.0, RF, and MARS algorithms were employed using R statistical software, and the final values were transformed into GPMs. Finally, two evaluation criteria including Kappa and area under receiver operating characteristics curve (AUC-ROC) were calculated. According to the findings of this research, MARS had the best performance with AUC-ROC of 84.2%, followed by RF and C5.0 algorithms with AUC-ROC values of 79.7 and 77.3%, respectively. The results indicated that AUC-ROC values for the employed models are more than 70% which shows their acceptable performance. As a conclusion, the produced methodology could be used in other geographical areas. GPMs could be used by water resource managers and related organizations to accelerate and facilitate water resource exploitation.

  14. FAILURE OF RADIOACTIVE IODINE IN TREATMENT OF HYPERTHYROIDISM

    PubMed Central

    Schneider, David F.; Sonderman, Philip E.; Jones, Michaela F.; Ojomo, Kristin A.; Chen, Herbert; Jaume, Juan C.; Elson, Diane F.; Perlman, Scott B.; Sippel, Rebecca S.

    2015-01-01

    Introduction Persistent or recurrent hyperthyroidism after treatment with radioactive iodine (RAI) is common, and many patients require either additional doses or surgery before they are cured. The purpose of this study was to identify patterns and predictors of failure of RAI in patients with hyperthyroidism. Methods We conducted a retrospective review of patients treated with RAI from 2007–2010. Failure of RAI was defined as receipt of additional dose(s) and/or total thyroidectomy. Using a Cox proportional hazards model, we conducted univariate analysis to identify factors associated with failure of RAI. A final multivariate model was then constructed with significant (p < 0.05) variables from the univariate analysis. Results Of the 325 patients analyzed, 74 patients (22.8%) failed initial RAI treatment. 53 (71.6%) received additional RAI, 13 (17.6%) received additional RAI followed by surgery, and the remaining 8 (10.8%) were cured after thyroidectomy. The percentage of patients who failed decreased in a step-wise fashion as RAI dose increased. Similarly, the incidence of failure increased as the presenting T3 level increased. Sensitivity analysis revealed that RAI doses < 12.5 mCi were associated with failure while initial T3 and free T4 levels of at least 4.5 pg/mL and 2.3 ng/dL, respectively, were associated with failure. In the final multivariate analysis, higher T4 (HR 1.13, 95% CI 1.02–1.26, p=0.02) and methimazole treatment (HR 2.55, 95% CI 1.22–5.33, p=0.01) were associated with failure. Conclusions Laboratory values at presentation can predict which patients with hyperthyroidism are at risk for failing RAI treatment. Higher doses of RAI or surgical referral may prevent the need for repeat RAI in selected patients. PMID:25001092

  15. Failure of radioactive iodine in the treatment of hyperthyroidism.

    PubMed

    Schneider, David F; Sonderman, Philip E; Jones, Michaela F; Ojomo, Kristin A; Chen, Herbert; Jaume, Juan C; Elson, Diane F; Perlman, Scott B; Sippel, Rebecca S

    2014-12-01

    Persistent or recurrent hyperthyroidism after treatment with radioactive iodine (RAI) is common and many patients require either additional doses or surgery before they are cured. The purpose of this study was to identify patterns and predictors of failure of RAI in patients with hyperthyroidism. We conducted a retrospective review of patients treated with RAI from 2007 to 2010. Failure of RAI was defined as receipt of additional dose(s) and/or total thyroidectomy. Using a Cox proportional hazards model, we conducted univariate analysis to identify factors associated with failure of RAI. A final multivariate model was then constructed with significant (p < 0.05) variables from the univariate analysis. Of the 325 patients analyzed, 74 patients (22.8 %) failed initial RAI treatment, 53 (71.6 %) received additional RAI, 13 (17.6 %) received additional RAI followed by surgery, and the remaining 8 (10.8 %) were cured after thyroidectomy. The percentage of patients who failed decreased in a stepwise fashion as RAI dose increased. Similarly, the incidence of failure increased as the presenting T3 level increased. Sensitivity analysis revealed that RAI doses <12.5 mCi were associated with failure while initial T3 and free T4 levels of at least 4.5 pg/mL and 2.3 ng/dL, respectively, were associated with failure. In the final multivariate analysis, higher T4 (hazard ratio [HR] 1.13; 95 % confidence interval [CI] 1.02-1.26; p = 0.02) and methimazole treatment (HR 2.55; 95 % CI 1.22-5.33; p = 0.01) were associated with failure. Laboratory values at presentation can predict which patients with hyperthyroidism are at risk for failing RAI treatment. Higher doses of RAI or surgical referral may prevent the need for repeat RAI in selected patients.

  16. A Robust Bayesian Approach for Structural Equation Models with Missing Data

    ERIC Educational Resources Information Center

    Lee, Sik-Yum; Xia, Ye-Mao

    2008-01-01

    In this paper, normal/independent distributions, including but not limited to the multivariate t distribution, the multivariate contaminated distribution, and the multivariate slash distribution, are used to develop a robust Bayesian approach for analyzing structural equation models with complete or missing data. In the context of a nonlinear…

  17. Predicting the chance of live birth for women undergoing IVF: a novel pretreatment counselling tool.

    PubMed

    Dhillon, R K; McLernon, D J; Smith, P P; Fishel, S; Dowell, K; Deeks, J J; Bhattacharya, S; Coomarasamy, A

    2016-01-01

    Which pretreatment patient variables have an effect on live birth rates following assisted conception? The predictors in the final multivariate logistic regression model found to be significantly associated with reduced chances of IVF/ICSI success were increasing age (particularly above 36 years), tubal factor infertility, unexplained infertility and Asian or Black ethnicity. The two most widely recognized prediction models for live birth following IVF were developed on data from 1991 to 2007; pre-dating significant changes in clinical practice. These existing IVF outcome prediction models do not incorporate key pretreatment predictors, such as BMI, ethnicity and ovarian reserve, which are readily available now. In this cohort study a model to predict live birth was derived using data collected from 9915 women who underwent IVF/ICSI treatment at any CARE (Centres for Assisted Reproduction) clinic from 2008 to 2012. Model validation was performed on data collected from 2723 women who underwent treatment in 2013. The primary outcome for the model was live birth, which was defined as any birth event in which at least one baby was born alive and survived for more than 1 month. Data were collected from 12 fertility clinics within the CARE consortium in the UK. Multivariable logistic regression was used to develop the model. Discriminatory ability was assessed using the area under receiver operating characteristic (AUROC) curve, and calibration was assessed using calibration-in-the-large and the calibration slope test. The predictors in the final model were female age, BMI, ethnicity, antral follicle count (AFC), previous live birth, previous miscarriage, cause and duration of infertility. Upon assessing predictive ability, the AUROC curve for the final model and validation cohort was (0.62; 95% confidence interval (CI) 0.61-0.63) and (0.62; 95% CI 0.60-0.64) respectively. Calibration-in-the-large showed a systematic over-estimation of the predicted probability of live birth (Intercept (95% CI) = -0.168 (-0.252 to -0.084), P < 0.001). However, the calibration slope test was not significant (slope (95% CI) = 1.129 (0.893-1.365), P = 0.28). Due to the calibration-in-the-large test being significant we recalibrated the final model. The recalibrated model showed a much-improved calibration. Our model is unable to account for factors such as smoking and alcohol that can affect IVF/ICSI outcome and is somewhat restricted to representing the ethnic distribution and outcomes for the UK population only. We were unable to account for socioeconomic status and it may be that by having 75% of the population paying privately for their treatment, the results cannot be generalized to people of all socioeconomic backgrounds. In addition, patients and clinicians should understand this model is designed for use before treatment begins and does not include variables that become available (oocyte, embryo and endometrial) as treatment progresses. Finally, this model is also limited to use prior to first cycle only. To our knowledge, this is the first study to present a novel, up-to-date model encompassing three readily available prognostic factors; female BMI, ovarian reserve and ethnicity, which have not previously been used in prediction models for IVF outcome. Following geographical validation, the model can be used to build a user-friendly interface to aid decision-making for couples and their clinicians. Thereafter, a feasibility study of its implementation could focus on patient acceptability and quality of decision-making. None. © The Author 2015. Published by Oxford University Press on behalf of the European Society of Human Reproduction and Embryology. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  18. A Multivariate Analysis of Personality, Values and Expectations as Correlates of Career Aspirations of Final Year Medical Students

    ERIC Educational Resources Information Center

    Rogers, Mary E.; Searle, Judy; Creed, Peter A.; Ng, Shu-Kay

    2010-01-01

    This study reports on the career intentions of 179 final year medical students who completed an online survey that included measures of personality, values, professional and lifestyle expectations, and well-being. Logistic regression analyses identified the determinants of preferred medical specialty, practice location and hours of work.…

  19. Brain MR image segmentation based on an improved active contour model

    PubMed Central

    Meng, Xiangrui; Gu, Wenya; Zhang, Jianwei

    2017-01-01

    It is often a difficult task to accurately segment brain magnetic resonance (MR) images with intensity in-homogeneity and noise. This paper introduces a novel level set method for simultaneous brain MR image segmentation and intensity inhomogeneity correction. To reduce the effect of noise, novel anisotropic spatial information, which can preserve more details of edges and corners, is proposed by incorporating the inner relationships among the neighbor pixels. Then the proposed energy function uses the multivariate Student's t-distribution to fit the distribution of the intensities of each tissue. Furthermore, the proposed model utilizes Hidden Markov random fields to model the spatial correlation between neigh-boring pixels/voxels. The means of the multivariate Student's t-distribution can be adaptively estimated by multiplying a bias field to reduce the effect of intensity inhomogeneity. In the end, we reconstructed the energy function to be convex and calculated it by using the Split Bregman method, which allows our framework for random initialization, thereby allowing fully automated applications. Our method can obtain the final result in less than 1 second for 2D image with size 256 × 256 and less than 300 seconds for 3D image with size 256 × 256 × 171. The proposed method was compared to other state-of-the-art segmentation methods using both synthetic and clinical brain MR images and increased the accuracies of the results more than 3%. PMID:28854235

  20. Risk of fall-related injury in people with lower limb amputations: A prospective cohort study.

    PubMed

    Wong, Christopher Kevin; Chihuri, Stanford T; Li, Guohua

    2016-01-01

    To assess fall-related injury risk and risk factors in people with lower limb amputation. Prospective longitudinal cohort with follow-up every 6 months for up to 41 months. Community-dwelling adults with lower limb amputations of any etiology and level recruited from support groups and prosthetic clinics. Demographic and clinical characteristics were obtained by self-reported questionnaire and telephone or in-person follow-up. Fall-related injury incidence requiring medical care per person-month and adjusted hazard ratio of fall-related injury were calculated using multivariable proportional hazards regression modeling. A total of 41 subjects, with 782 follow-up person-months in total, had 11 fall-related injury incidents (14.1/1,000 person-months). During follow-up, 56.1% of subjects reported falling and 26.8% reported fall-related injury. Multivariable proportional hazard modeling showed that women were nearly 6 times more likely as men to experience fall-related injury and people of non-white race were 13 times more likely than people of white race to experience fall-related injury. The final predictive model also included vascular amputation and age. Risk of fall-related injury requiring medical care in people with lower limb amputation appears to be higher than in older adult inpatients. Intervention programs to prevent fall-related injury in people with lower limb amputation should target women and racial minorities.

  1. Application of Multivariate Modeling for Radiation Injury Assessment: A Proof of Concept

    PubMed Central

    Bolduc, David L.; Villa, Vilmar; Sandgren, David J.; Ledney, G. David; Blakely, William F.; Bünger, Rolf

    2014-01-01

    Multivariate radiation injury estimation algorithms were formulated for estimating severe hematopoietic acute radiation syndrome (H-ARS) injury (i.e., response category three or RC3) in a rhesus monkey total-body irradiation (TBI) model. Classical CBC and serum chemistry blood parameters were examined prior to irradiation (d 0) and on d 7, 10, 14, 21, and 25 after irradiation involving 24 nonhuman primates (NHP) (Macaca mulatta) given 6.5-Gy 60Co Υ-rays (0.4 Gy min−1) TBI. A correlation matrix was formulated with the RC3 severity level designated as the “dependent variable” and independent variables down selected based on their radioresponsiveness and relatively low multicollinearity using stepwise-linear regression analyses. Final candidate independent variables included CBC counts (absolute number of neutrophils, lymphocytes, and platelets) in formulating the “CBC” RC3 estimation algorithm. Additionally, the formulation of a diagnostic CBC and serum chemistry “CBC-SCHEM” RC3 algorithm expanded upon the CBC algorithm model with the addition of hematocrit and the serum enzyme levels of aspartate aminotransferase, creatine kinase, and lactate dehydrogenase. Both algorithms estimated RC3 with over 90% predictive power. Only the CBC-SCHEM RC3 algorithm, however, met the critical three assumptions of linear least squares demonstrating slightly greater precision for radiation injury estimation, but with significantly decreased prediction error indicating increased statistical robustness. PMID:25165485

  2. Boosting Higgs pair production in the [Formula: see text] final state with multivariate techniques.

    PubMed

    Behr, J Katharina; Bortoletto, Daniela; Frost, James A; Hartland, Nathan P; Issever, Cigdem; Rojo, Juan

    2016-01-01

    The measurement of Higgs pair production will be a cornerstone of the LHC program in the coming years. Double Higgs production provides a crucial window upon the mechanism of electroweak symmetry breaking and has a unique sensitivity to the Higgs trilinear coupling. We study the feasibility of a measurement of Higgs pair production in the [Formula: see text] final state at the LHC. Our analysis is based on a combination of traditional cut-based methods with state-of-the-art multivariate techniques. We account for all relevant backgrounds, including the contributions from light and charm jet mis-identification, which are ultimately comparable in size to the irreducible 4 b QCD background. We demonstrate the robustness of our analysis strategy in a high pileup environment. For an integrated luminosity of [Formula: see text] ab[Formula: see text], a signal significance of [Formula: see text] is obtained, indicating that the [Formula: see text] final state alone could allow for the observation of double Higgs production at the High Luminosity LHC.

  3. A Comparison of Three Multivariate Models for Estimating Test Battery Reliability.

    ERIC Educational Resources Information Center

    Wood, Terry M.; Safrit, Margaret J.

    1987-01-01

    A comparison of three multivariate models (canonical reliability model, maximum generalizability model, canonical correlation model) for estimating test battery reliability indicated that the maximum generalizability model showed the least degree of bias, smallest errors in estimation, and the greatest relative efficiency across all experimental…

  4. Application of multivariate Gaussian detection theory to known non-Gaussian probability density functions

    NASA Astrophysics Data System (ADS)

    Schwartz, Craig R.; Thelen, Brian J.; Kenton, Arthur C.

    1995-06-01

    A statistical parametric multispectral sensor performance model was developed by ERIM to support mine field detection studies, multispectral sensor design/performance trade-off studies, and target detection algorithm development. The model assumes target detection algorithms and their performance models which are based on data assumed to obey multivariate Gaussian probability distribution functions (PDFs). The applicability of these algorithms and performance models can be generalized to data having non-Gaussian PDFs through the use of transforms which convert non-Gaussian data to Gaussian (or near-Gaussian) data. An example of one such transform is the Box-Cox power law transform. In practice, such a transform can be applied to non-Gaussian data prior to the introduction of a detection algorithm that is formally based on the assumption of multivariate Gaussian data. This paper presents an extension of these techniques to the case where the joint multivariate probability density function of the non-Gaussian input data is known, and where the joint estimate of the multivariate Gaussian statistics, under the Box-Cox transform, is desired. The jointly estimated multivariate Gaussian statistics can then be used to predict the performance of a target detection algorithm which has an associated Gaussian performance model.

  5. A new extranodal scoring system based on the prognostically relevant extranodal sites in diffuse large B-cell lymphoma, not otherwise specified treated with chemoimmunotherapy.

    PubMed

    Hwang, Hee Sang; Yoon, Dok Hyun; Suh, Cheolwon; Huh, Jooryung

    2016-08-01

    Extranodal involvement is a well-known prognostic factor in patients with diffuse large B-cell lymphomas (DLBCL). Nevertheless, the prognostic impact of the extranodal scoring system included in the conventional international prognostic index (IPI) has been questioned in an era where rituximab treatment has become widespread. We investigated the prognostic impacts of individual sites of extranodal involvement in 761 patients with DLBCL who received rituximab-based chemoimmunotherapy. Subsequently, we established a new extranodal scoring system based on extranodal sites, showing significant prognostic correlation, and compared this system with conventional scoring systems, such as the IPI and the National Comprehensive Cancer Network-IPI (NCCN-IPI). An internal validation procedure, using bootstrapped samples, was also performed for both univariate and multivariate models. Using multivariate analysis with a backward variable selection, we found nine extranodal sites (the liver, lung, spleen, central nervous system, bone marrow, kidney, skin, adrenal glands, and peritoneum) that remained significant for use in the final model. Our newly established extranodal scoring system, based on these sites, was better correlated with patient survival than standard scoring systems, such as the IPI and the NCCN-IPI. Internal validation by bootstrapping demonstrated an improvement in model performance of our modified extranodal scoring system. Our new extranodal scoring system, based on the prognostically relevant sites, may improve the performance of conventional prognostic models of DLBCL in the rituximab era and warrants further external validation using large study populations.

  6. Bayesian inference for multivariate meta-analysis Box-Cox transformation models for individual patient data with applications to evaluation of cholesterol lowering drugs

    PubMed Central

    Kim, Sungduk; Chen, Ming-Hui; Ibrahim, Joseph G.; Shah, Arvind K.; Lin, Jianxin

    2013-01-01

    In this paper, we propose a class of Box-Cox transformation regression models with multidimensional random effects for analyzing multivariate responses for individual patient data (IPD) in meta-analysis. Our modeling formulation uses a multivariate normal response meta-analysis model with multivariate random effects, in which each response is allowed to have its own Box-Cox transformation. Prior distributions are specified for the Box-Cox transformation parameters as well as the regression coefficients in this complex model, and the Deviance Information Criterion (DIC) is used to select the best transformation model. Since the model is quite complex, a novel Monte Carlo Markov chain (MCMC) sampling scheme is developed to sample from the joint posterior of the parameters. This model is motivated by a very rich dataset comprising 26 clinical trials involving cholesterol lowering drugs where the goal is to jointly model the three dimensional response consisting of Low Density Lipoprotein Cholesterol (LDL-C), High Density Lipoprotein Cholesterol (HDL-C), and Triglycerides (TG) (LDL-C, HDL-C, TG). Since the joint distribution of (LDL-C, HDL-C, TG) is not multivariate normal and in fact quite skewed, a Box-Cox transformation is needed to achieve normality. In the clinical literature, these three variables are usually analyzed univariately: however, a multivariate approach would be more appropriate since these variables are correlated with each other. A detailed analysis of these data is carried out using the proposed methodology. PMID:23580436

  7. Bayesian inference for multivariate meta-analysis Box-Cox transformation models for individual patient data with applications to evaluation of cholesterol-lowering drugs.

    PubMed

    Kim, Sungduk; Chen, Ming-Hui; Ibrahim, Joseph G; Shah, Arvind K; Lin, Jianxin

    2013-10-15

    In this paper, we propose a class of Box-Cox transformation regression models with multidimensional random effects for analyzing multivariate responses for individual patient data in meta-analysis. Our modeling formulation uses a multivariate normal response meta-analysis model with multivariate random effects, in which each response is allowed to have its own Box-Cox transformation. Prior distributions are specified for the Box-Cox transformation parameters as well as the regression coefficients in this complex model, and the deviance information criterion is used to select the best transformation model. Because the model is quite complex, we develop a novel Monte Carlo Markov chain sampling scheme to sample from the joint posterior of the parameters. This model is motivated by a very rich dataset comprising 26 clinical trials involving cholesterol-lowering drugs where the goal is to jointly model the three-dimensional response consisting of low density lipoprotein cholesterol (LDL-C), high density lipoprotein cholesterol (HDL-C), and triglycerides (TG) (LDL-C, HDL-C, TG). Because the joint distribution of (LDL-C, HDL-C, TG) is not multivariate normal and in fact quite skewed, a Box-Cox transformation is needed to achieve normality. In the clinical literature, these three variables are usually analyzed univariately; however, a multivariate approach would be more appropriate because these variables are correlated with each other. We carry out a detailed analysis of these data by using the proposed methodology. Copyright © 2013 John Wiley & Sons, Ltd.

  8. Correlational Analysis of neck/shoulder Pain and Low Back Pain with the Use of Digital Products, Physical Activity and Psychological Status among Adolescents in Shanghai

    PubMed Central

    Li, Jipeng; Li, Yangyang; Zhang, Yongxing; Zhao, Qinghua

    2013-01-01

    Purpose This study investigates the neck/shoulder pain (NSP) and low back pain (LBP) among current high school students in Shanghai and explores the relationship between these pains and their possible influences, including digital products, physical activity, and psychological status. Methods An anonymous self-assessment was administered to 3,600 students across 30 high schools in Shanghai. This questionnaire examined the prevalence of NSP and LBP and the level of physical activity as well as the use of mobile phones, personal computers (PC) and tablet computers (Tablet). The CES-D (Center for Epidemiological Studies Depression) scale was also included in the survey. The survey data were analyzed using the chi-square test, univariate logistic analyses and a multivariate logistic regression model. Results Three thousand sixteen valid questionnaires were received including 1,460 (48.41%) from male respondents and 1,556 (51.59%) from female respondents. The high school students in this study showed NSP and LBP rates of 40.8% and 33.1%, respectively, and the prevalence of both influenced by the student’s grade, use of digital products, and mental status; these factors affected the rates of NSP and LBP to varying degrees. The multivariate logistic regression analysis revealed that Gender, grade, soreness after exercise, PC using habits, tablet use, sitting time after school and academic stress entered the final model of NSP, while the final model of LBP consisted of gender, grade, soreness after exercise, PC using habits, mobile phone use, sitting time after school, academic stress and CES-D score. Conclusions High school students in Shanghai showed high prevalence of NSP and LBP that were closely related to multiple factors. Appropriate interventions should be implemented to reduce the occurrences of NSP and LBP. PMID:24147114

  9. Self-efficacy is independently associated with brain volume in older women.

    PubMed

    Davis, Jennifer C; Nagamatsu, Lindsay S; Hsu, Chun Liang; Beattie, B Lynn; Liu-Ambrose, Teresa

    2012-07-01

    ageing is highly associated with neurodegeneration and atrophy of the brain. Evidence suggests that personality variables are risk factors for reduced brain volume. We examine whether falls-related self-efficacy is independently associated with brain volume. a cross-sectional analysis of whether falls-related self-efficacy is independently associated with brain volumes (total, grey and white matter). Three multivariate regression models were constructed. Covariates included in the models were age, global cognition, systolic blood pressure, functional comorbidity index and current physical activity level. MRI scans were acquired from 79 community-dwelling senior women aged 65-75 years old. Falls-related self-efficacy was assessed by the activities-specific balance confidence (ABC) scale. after accounting for covariates, falls-related self-efficacy was independently associated with both total brain volume and total grey matter volume. The final model for total brain volume accounted for 17% of the variance, with the ABC score accounting for 8%. For total grey matter volume, the final model accounted for 24% of the variance, with the ABC score accounting for 10%. we provide novel evidence that falls-related self-efficacy, a modifiable risk factor for healthy ageing, is positively associated with total brain volume and total grey matter volume. ClinicalTrials.gov Identifier: NCT00426881.

  10. Towards a Multi-Variable Parametric Cost Model for Ground and Space Telescopes

    NASA Technical Reports Server (NTRS)

    Stahl, H. Philip; Henrichs, Todd

    2016-01-01

    Parametric cost models can be used by designers and project managers to perform relative cost comparisons between major architectural cost drivers and allow high-level design trades; enable cost-benefit analysis for technology development investment; and, provide a basis for estimating total project cost between related concepts. This paper hypothesizes a single model, based on published models and engineering intuition, for both ground and space telescopes: OTA Cost approximately (X) D(exp (1.75 +/- 0.05)) lambda(exp(-0.5 +/- 0.25) T(exp -0.25) e (exp (-0.04)Y). Specific findings include: space telescopes cost 50X to 100X more ground telescopes; diameter is the most important CER; cost is reduced by approximately 50% every 20 years (presumably because of technology advance and process improvements); and, for space telescopes, cost associated with wavelength performance is balanced by cost associated with operating temperature. Finally, duplication only reduces cost for the manufacture of identical systems (i.e. multiple aperture sparse arrays or interferometers). And, while duplication does reduce the cost of manufacturing the mirrors of segmented primary mirror, this cost savings does not appear to manifest itself in the final primary mirror assembly (presumably because the structure for a segmented mirror is more complicated than for a monolithic mirror).

  11. Self-efficacy is independently associated with brain volume in older women

    PubMed Central

    Davis, Jennifer C.; Nagamatsu, Lindsay S.; Hsu, Chun Liang; Beattie, B. Lynn; Liu-Ambrose, Teresa

    2015-01-01

    Background Aging is highly associated with neurodegeneration and atrophy of the brain. Evidence suggests that personality variables are risk factors for reduced brain volume. We examine whether falls-related self-efficacy is independently associated with brain volume. Method A cross-sectional analysis of whether falls-related self-efficacy is independently associated with brain volumes (total, grey, and white matter). Three multivariate regression models were constructed. Covariates included in the models were age, global cognition, systolic blood pressure, functional comorbidity index, and current physical activity level. MRI scans were acquired from 79 community-dwelling senior women aged 65 to 75 years old. Falls-related self-efficacy was assessed by the Activities Specific Balance Confidence (ABC) Scale. Results After accounting for covariates, falls-related self-efficacy was independently associated with both total brain volume and total grey matter volume. The final model for total brain volume accounted for 17% of the variance, with the ABC score accounting for 8%. For total grey matter volume, the final model accounted for 24% of the variance, with the ABC score accounting for 10%. Conclusion We provide novel evidence that falls-related self-efficacy, a modifiable risk factor for healthy aging, is positively associated with total brain volume and total grey matter volume. Trial Registration ClinicalTrials.gov Identifier: NCT00426881. PMID:22436405

  12. Multivariable parametric cost model for space and ground telescopes

    NASA Astrophysics Data System (ADS)

    Stahl, H. Philip; Henrichs, Todd

    2016-09-01

    Parametric cost models can be used by designers and project managers to perform relative cost comparisons between major architectural cost drivers and allow high-level design trades; enable cost-benefit analysis for technology development investment; and, provide a basis for estimating total project cost between related concepts. This paper hypothesizes a single model, based on published models and engineering intuition, for both ground and space telescopes: OTA Cost (X) D (1.75 +/- 0.05) λ (-0.5 +/- 0.25) T-0.25 e (-0.04) Y Specific findings include: space telescopes cost 50X to 100X more ground telescopes; diameter is the most important CER; cost is reduced by approximately 50% every 20 years (presumably because of technology advance and process improvements); and, for space telescopes, cost associated with wavelength performance is balanced by cost associated with operating temperature. Finally, duplication only reduces cost for the manufacture of identical systems (i.e. multiple aperture sparse arrays or interferometers). And, while duplication does reduce the cost of manufacturing the mirrors of segmented primary mirror, this cost savings does not appear to manifest itself in the final primary mirror assembly (presumably because the structure for a segmented mirror is more complicated than for a monolithic mirror).

  13. Epidemiological characteristics of reported sporadic and outbreak cases of E. coli O157 in people from Alberta, Canada (2000-2002): methodological challenges of comparing clustered to unclustered data.

    PubMed

    Pearl, D L; Louie, M; Chui, L; Doré, K; Grimsrud, K M; Martin, S W; Michel, P; Svenson, L W; McEwen, S A

    2008-04-01

    Using multivariable models, we compared whether there were significant differences between reported outbreak and sporadic cases in terms of their sex, age, and mode and site of disease transmission. We also determined the potential role of administrative, temporal, and spatial factors within these models. We compared a variety of approaches to account for clustering of cases in outbreaks including weighted logistic regression, random effects models, general estimating equations, robust variance estimates, and the random selection of one case from each outbreak. Age and mode of transmission were the only epidemiologically and statistically significant covariates in our final models using the above approaches. Weighing observations in a logistic regression model by the inverse of their outbreak size appeared to be a relatively robust and valid means for modelling these data. Some analytical techniques, designed to account for clustering, had difficulty converging or producing realistic measures of association.

  14. Semiparametric Thurstonian Models for Recurrent Choices: A Bayesian Analysis

    ERIC Educational Resources Information Center

    Ansari, Asim; Iyengar, Raghuram

    2006-01-01

    We develop semiparametric Bayesian Thurstonian models for analyzing repeated choice decisions involving multinomial, multivariate binary or multivariate ordinal data. Our modeling framework has multiple components that together yield considerable flexibility in modeling preference utilities, cross-sectional heterogeneity and parameter-driven…

  15. Associations with substance abuse treatment completion in drug court

    PubMed Central

    Brown, Randall T

    2009-01-01

    Subjects in the study included all participants (N = 573) in drug treatment court in a mid-sized U.S. city from 1996 through 2004. Administrative data from the drug court included measures of demographics and socioeconomics, substance use, and criminal justice history. Stepwise multivariate logistic regression yielded a final model of failure to complete drug treatment. Unemployment, lower educational attainment, and cocaine use disorders were associated with failure to complete treatment. The limitations of administrative data should be considered in the interpretation of results. Funding was provided by the National Institutes of Health, National Institute on Drug Abuse (1 K23 DA017283-01). PMID:20380560

  16. Self-perceived health status, gender, and work status.

    PubMed

    Pino-Domínguez, Lara; Navarro-Gil, Patricia; González-Vélez, Abel E; Prieto-Flores, Maria-Eugenia; Ayala, Alba; Rojo-Pérez, Fermina; Fernández-Mayoralas, Gloria; Martínez-Martín, Pablo; Forjaz, Maria João

    2016-01-01

    This study analyzes the relationship between gender and self-perceived health status in Spanish retirees and housewives from a sample of 1,106 community-dwelling older adults. A multivariate linear regression model was used in which self-perceived health status was measured by the EQ-5D visual analogue scale and gender according to work status (retired men and women and housewives). Retired males reported a significantly better health status than housewives. Self-perceived health status was closely associated with physical, mental, and functional health and leisure activities. Finally, being a woman with complete dedication to domestic work is associated with a worse state of self-perceived health.

  17. Error Covariance Penalized Regression: A novel multivariate model combining penalized regression with multivariate error structure.

    PubMed

    Allegrini, Franco; Braga, Jez W B; Moreira, Alessandro C O; Olivieri, Alejandro C

    2018-06-29

    A new multivariate regression model, named Error Covariance Penalized Regression (ECPR) is presented. Following a penalized regression strategy, the proposed model incorporates information about the measurement error structure of the system, using the error covariance matrix (ECM) as a penalization term. Results are reported from both simulations and experimental data based on replicate mid and near infrared (MIR and NIR) spectral measurements. The results for ECPR are better under non-iid conditions when compared with traditional first-order multivariate methods such as ridge regression (RR), principal component regression (PCR) and partial least-squares regression (PLS). Copyright © 2018 Elsevier B.V. All rights reserved.

  18. Effects of Covariance Heterogeneity on Three Procedures for Analyzing Multivariate Repeated Measures Designs.

    ERIC Educational Resources Information Center

    Vallejo, Guillermo; Fidalgo, Angel; Fernandez, Paula

    2001-01-01

    Estimated empirical Type I error rate and power rate for three procedures for analyzing multivariate repeated measures designs: (1) the doubly multivariate model; (2) the Welch-James multivariate solution (H. Keselman, M. Carriere, a nd L. Lix, 1993); and (3) the multivariate version of the modified Brown-Forsythe procedure (M. Brown and A.…

  19. Abandonment of antiretroviral therapy among HIV-positive patients attended at the reference center for HIV/AIDS in Vitória, Brazil.

    PubMed

    Zago, Adriana Marchon; Morelato, Paola; Endringer, Emmanuele de Angeli; Dan, Germano de Freitas; Ribeiro, Evanira Mendes; Miranda, Angelica Espinosa

    2012-01-01

    This study evaluates the risk factors for the abandonment of antiretroviral therapy (ART) among patients receiving care in an AIDS clinic in Vitória, Brazil. We conducted a case-control study of patients with AIDS attending a reference center for sexually transmitted disease (STD)/AIDS. A total of 62 patients, who abandoned therapy in 2008, and 188 HIV-infected patients answered an interview including demographic, social, and clinical characteristics. Risk factors associated with abandon in univariate analysis were entered into logistic regression models. A total of 250 patients were included in the study. Groups were similar regarding age, gender, and monthly income. In the final multivariate model, illicit drug use (adjusted odds ratio [AOR], 2.3; 95% confidence interval [CI], 1.03-5.07), previous abandon of medication (AOR 38.6; 95% CI 10.49-142.25), last CD4 count <200 cells/mm(3) (AOR 1.5; 95% CI 1.03-2.10), and viral load higher than 1000 copies/mL (AOR 2.0 (95% CI 1.34-3.09) were independent predictors of abandonment of ART. In addition to the clinical indicators, behavioral factors remained important throughout the multivariate analysis in our study.

  20. Characterizing Durations of Heroin Abstinence in the California Civil Addict Program: Results From a 33-Year Observational Cohort Study

    PubMed Central

    Nosyk, Bohdan; Anglin, M. Douglas; Brecht, Mary-Lynn; Lima, Viviane Dias; Hser, Yih-Ing

    2013-01-01

    In accordance with the chronic disease model of opioid dependence, cessation is often observed as a longitudinal process rather than a discrete endpoint. We aimed to characterize and identify predictors of periods of heroin abstinence in the natural history of recovery from opioid dependence. Data were collected on participants from California who were enrolled in the Civil Addict Program from 1962 onward by use of a natural history interview. Multivariate regression using proportional hazards frailty models was applied to identify independent predictors and correlates of repeated abstinence episode durations. Among 471 heroin-dependent males, 387 (82.2%) reported 932 abstinence episodes, 60.3% of which lasted at least 1 year. Multivariate analysis revealed several important findings. First, demographic factors such as age and ethnicity did not explain variation in durations of abstinence episodes. However, employment and lower drug use severity predicted longer episodes. Second, abstinence durations were longer following sustained treatment versus incarceration. Third, individuals with multiple abstinence episodes remained abstinent for longer durations in successive episodes. Finally, abstinence episodes initiated >10 and ≤20 years after first use lasted longer than others. Public policy facilitating engagement of opioid-dependent individuals in maintenance-oriented drug treatment and employment is recommended to achieve and sustain opioid abstinence. PMID:23445901

  1. Dynamics of the development of multiple follicles by early versus late hCG administration in ART program.

    PubMed

    Falagario, Maddalena; Trerotoli, Paolo; Chincoli, Annarosa; Cobuzzi, Isabella; Vacca, Margherita P; Falagario, Doriana; Nardelli, Claudia; Depalo, Raffaella

    2017-02-01

    To evaluate, in patients stimulated with recombinant FSH and GnRH antagonists, whether triggering the final maturation of oocytes affects IVF outcomes. Five hundred and six IVF procedures were divided into three groups according to the timing of hCG administration: when at least 2 follicles reached the diameter of 17 mm, at least 2 follicles reached 18 mm and at least 2 follicles reached 20 mm. The main outcome was the number of mature oocyte that was the dependent variable of a multivariate model whose independents were, age, AFC, hCG timing, E2 levels at hCG day, number of follicles in different categories of dimension. Secondary endpoints were to compare fertilization, implantation and pregnancy rates in a multilevel multivariate model whose covariates were age, BMI, AFC, embryo quality and cause of infertility. Timing did not result a statistically significant factor influencing the number of oocytes collected, which was influenced by age, AFC, number of follicles between 12.1 and 15.9 mm and E2 levels. Implantation rate and pregnancy rate appear to be affected only by embryo quality. The number of oocytes collected and the probability of pregnancy are not associated with the time of hCG administration.

  2. H. Pylori as a predictor of marginal ulceration: A nationwide analysis.

    PubMed

    Schulman, Allison R; Abougergi, Marwan S; Thompson, Christopher C

    2017-03-01

    Helicobacter pylori has been implicated as a risk factor for development of marginal ulceration following gastric bypass, although studies have been small and yielded conflicting results. This study sought to determine the relationship between H. pylori infection and development of marginal ulceration following bariatric surgery in a nationwide analysis. This was a retrospective cohort study using the 2012 Nationwide Inpatient Sample (NIS) database. Discharges with ICD-9-CM code indicating marginal ulceration and a secondary ICD-9-CM code for bariatric surgery were included. Primary outcome was incidence of marginal ulceration. A stepwise forward selection model was used to build the multivariate logistic regression model based on known risk factors. A P value of 0.05 was considered significant. There were 253,765 patients who met inclusion criteria. Prevalence of marginal ulceration was 3.90%. Of those patients found to have marginal ulceration, 31.20% of patients were H. pylori-positive. Final multivariate regression analysis revealed that H. pylori was the strongest independent predictor of marginal ulceration. H. pylori is an independent predictor of marginal ulceration using a large national database. Preoperative testing for and eradication of H. pylori prior to bariatric surgery may be an important preventive measure to reduce the incidence of ulcer development. © 2017 The Obesity Society.

  3. High prevalence of forced sex among non-brothel based, wine shop centered sex workers in Chennai, India.

    PubMed

    Go, Vivian F; Srikrishnan, Aylur K; Parker, Corette Breeden; Salter, Megan; Green, Annette M; Sivaram, Sudha; Johnson, Sethulakshmi C; Latkin, Carl; Davis, Wendy; Solomon, Suniti; Celentano, David D

    2011-01-01

    Sexual violence has been shown to increase women's risk of HIV infection. India is a country where the HIV epidemic is growing among women and intimate partner violence (IPV) is pervasive. This study examined prevalence of and factors associated with forced sex among female sex workers (FSWs) in Chennai, India. We conducted a probability survey among FSWs in 24 slum venues and identified predictive factors for recent forced sex using univariate and multivariable proportional odds models. Among 522 FSWs, 28% reported having forced sex with one partner and 35% with 2+ partners. In the final multivariable model, women who had a high number of partners who had a strong tendency to drink alcohol before sex were more likely to have experienced forced sex, and women who had both unprotected sex with a nonspousal partner and > 20 days of alcohol consumption in the last 30 days were more likely to have experienced forced sex. Discussion about family violence with larger social networks was independently associated with lower odds of forced sex among FSWs. HIV interventions for FSWs and their clients aimed at reducing alcohol consumption and encouraging condom use could be enhanced by violence prevention interventions to facilitate discourse about sexual violence.

  4. Linear multivariate evaluation models for spatial perception of soundscape.

    PubMed

    Deng, Zhiyong; Kang, Jian; Wang, Daiwei; Liu, Aili; Kang, Joe Zhengyu

    2015-11-01

    Soundscape is a sound environment that emphasizes the awareness of auditory perception and social or cultural understandings. The case of spatial perception is significant to soundscape. However, previous studies on the auditory spatial perception of the soundscape environment have been limited. Based on 21 native binaural-recorded soundscape samples and a set of auditory experiments for subjective spatial perception (SSP), a study of the analysis among semantic parameters, the inter-aural-cross-correlation coefficient (IACC), A-weighted-equal sound-pressure-level (L(eq)), dynamic (D), and SSP is introduced to verify the independent effect of each parameter and to re-determine some of their possible relationships. The results show that the more noisiness the audience perceived, the worse spatial awareness they received, while the closer and more directional the sound source image variations, dynamics, and numbers of sound sources in the soundscape are, the better the spatial awareness would be. Thus, the sensations of roughness, sound intensity, transient dynamic, and the values of Leq and IACC have a suitable range for better spatial perception. A better spatial awareness seems to promote the preference slightly for the audience. Finally, setting SSPs as functions of the semantic parameters and Leq-D-IACC, two linear multivariate evaluation models of subjective spatial perception are proposed.

  5. On the Numerical Formulation of Parametric Linear Fractional Transformation (LFT) Uncertainty Models for Multivariate Matrix Polynomial Problems

    NASA Technical Reports Server (NTRS)

    Belcastro, Christine M.

    1998-01-01

    Robust control system analysis and design is based on an uncertainty description, called a linear fractional transformation (LFT), which separates the uncertain (or varying) part of the system from the nominal system. These models are also useful in the design of gain-scheduled control systems based on Linear Parameter Varying (LPV) methods. Low-order LFT models are difficult to form for problems involving nonlinear parameter variations. This paper presents a numerical computational method for constructing and LFT model for a given LPV model. The method is developed for multivariate polynomial problems, and uses simple matrix computations to obtain an exact low-order LFT representation of the given LPV system without the use of model reduction. Although the method is developed for multivariate polynomial problems, multivariate rational problems can also be solved using this method by reformulating the rational problem into a polynomial form.

  6. Multivariate Methods for Meta-Analysis of Genetic Association Studies.

    PubMed

    Dimou, Niki L; Pantavou, Katerina G; Braliou, Georgia G; Bagos, Pantelis G

    2018-01-01

    Multivariate meta-analysis of genetic association studies and genome-wide association studies has received a remarkable attention as it improves the precision of the analysis. Here, we review, summarize and present in a unified framework methods for multivariate meta-analysis of genetic association studies and genome-wide association studies. Starting with the statistical methods used for robust analysis and genetic model selection, we present in brief univariate methods for meta-analysis and we then scrutinize multivariate methodologies. Multivariate models of meta-analysis for a single gene-disease association studies, including models for haplotype association studies, multiple linked polymorphisms and multiple outcomes are discussed. The popular Mendelian randomization approach and special cases of meta-analysis addressing issues such as the assumption of the mode of inheritance, deviation from Hardy-Weinberg Equilibrium and gene-environment interactions are also presented. All available methods are enriched with practical applications and methodologies that could be developed in the future are discussed. Links for all available software implementing multivariate meta-analysis methods are also provided.

  7. Quality-by-Design approach to monitor the operation of a batch bioreactor in an industrial avian vaccine manufacturing process.

    PubMed

    Largoni, Martina; Facco, Pierantonio; Bernini, Donatella; Bezzo, Fabrizio; Barolo, Massimiliano

    2015-10-10

    Monitoring batch bioreactors is a complex task, due to the fact that several sources of variability can affect a running batch and impact on the final product quality. Additionally, the product quality itself may not be measurable on line, but requires sampling and lab analysis taking several days to be completed. In this study we show that, by using appropriate process analytical technology tools, the operation of an industrial batch bioreactor used in avian vaccine manufacturing can be effectively monitored as the batch progresses. Multivariate statistical models are built from historical databases of batches already completed, and they are used to enable the real time identification of the variability sources, to reliably predict the final product quality, and to improve process understanding, paving the way to a reduction of final product rejections, as well as to a reduction of the product cycle time. It is also shown that the product quality "builds up" mainly during the first half of a batch, suggesting on the one side that reducing the variability during this period is crucial, and on the other side that the batch length can possibly be shortened. Overall, the study demonstrates that, by using a Quality-by-Design approach centered on the appropriate use of mathematical modeling, quality can indeed be built "by design" into the final product, whereas the role of end-point product testing can progressively reduce its importance in product manufacturing. Copyright © 2015 Elsevier B.V. All rights reserved.

  8. A system to build distributed multivariate models and manage disparate data sharing policies: implementation in the scalable national network for effectiveness research.

    PubMed

    Meeker, Daniella; Jiang, Xiaoqian; Matheny, Michael E; Farcas, Claudiu; D'Arcy, Michel; Pearlman, Laura; Nookala, Lavanya; Day, Michele E; Kim, Katherine K; Kim, Hyeoneui; Boxwala, Aziz; El-Kareh, Robert; Kuo, Grace M; Resnic, Frederic S; Kesselman, Carl; Ohno-Machado, Lucila

    2015-11-01

    Centralized and federated models for sharing data in research networks currently exist. To build multivariate data analysis for centralized networks, transfer of patient-level data to a central computation resource is necessary. The authors implemented distributed multivariate models for federated networks in which patient-level data is kept at each site and data exchange policies are managed in a study-centric manner. The objective was to implement infrastructure that supports the functionality of some existing research networks (e.g., cohort discovery, workflow management, and estimation of multivariate analytic models on centralized data) while adding additional important new features, such as algorithms for distributed iterative multivariate models, a graphical interface for multivariate model specification, synchronous and asynchronous response to network queries, investigator-initiated studies, and study-based control of staff, protocols, and data sharing policies. Based on the requirements gathered from statisticians, administrators, and investigators from multiple institutions, the authors developed infrastructure and tools to support multisite comparative effectiveness studies using web services for multivariate statistical estimation in the SCANNER federated network. The authors implemented massively parallel (map-reduce) computation methods and a new policy management system to enable each study initiated by network participants to define the ways in which data may be processed, managed, queried, and shared. The authors illustrated the use of these systems among institutions with highly different policies and operating under different state laws. Federated research networks need not limit distributed query functionality to count queries, cohort discovery, or independently estimated analytic models. Multivariate analyses can be efficiently and securely conducted without patient-level data transport, allowing institutions with strict local data storage requirements to participate in sophisticated analyses based on federated research networks. © The Author 2015. Published by Oxford University Press on behalf of the American Medical Informatics Association.

  9. MULTIVARIATE RECEPTOR MODELS AND MODEL UNCERTAINTY. (R825173)

    EPA Science Inventory

    Abstract

    Estimation of the number of major pollution sources, the source composition profiles, and the source contributions are the main interests in multivariate receptor modeling. Due to lack of identifiability of the receptor model, however, the estimation cannot be...

  10. Quantifying the impact of between-study heterogeneity in multivariate meta-analyses

    PubMed Central

    Jackson, Dan; White, Ian R; Riley, Richard D

    2012-01-01

    Measures that quantify the impact of heterogeneity in univariate meta-analysis, including the very popular I2 statistic, are now well established. Multivariate meta-analysis, where studies provide multiple outcomes that are pooled in a single analysis, is also becoming more commonly used. The question of how to quantify heterogeneity in the multivariate setting is therefore raised. It is the univariate R2 statistic, the ratio of the variance of the estimated treatment effect under the random and fixed effects models, that generalises most naturally, so this statistic provides our basis. This statistic is then used to derive a multivariate analogue of I2, which we call . We also provide a multivariate H2 statistic, the ratio of a generalisation of Cochran's heterogeneity statistic and its associated degrees of freedom, with an accompanying generalisation of the usual I2 statistic, . Our proposed heterogeneity statistics can be used alongside all the usual estimates and inferential procedures used in multivariate meta-analysis. We apply our methods to some real datasets and show how our statistics are equally appropriate in the context of multivariate meta-regression, where study level covariate effects are included in the model. Our heterogeneity statistics may be used when applying any procedure for fitting the multivariate random effects model. Copyright © 2012 John Wiley & Sons, Ltd. PMID:22763950

  11. Multivariate statistical analysis of low-voltage EDS spectrum images

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Anderson, I.M.

    1998-03-01

    Whereas energy-dispersive X-ray spectrometry (EDS) has been used for compositional analysis in the scanning electron microscope for 30 years, the benefits of using low operating voltages for such analyses have been explored only during the last few years. This paper couples low-voltage EDS with two other emerging areas of characterization: spectrum imaging and multivariate statistical analysis. The specimen analyzed for this study was a finished Intel Pentium processor, with the polyimide protective coating stripped off to expose the final active layers.

  12. Up-scaling of multi-variable flood loss models from objects to land use units at the meso-scale

    NASA Astrophysics Data System (ADS)

    Kreibich, Heidi; Schröter, Kai; Merz, Bruno

    2016-05-01

    Flood risk management increasingly relies on risk analyses, including loss modelling. Most of the flood loss models usually applied in standard practice have in common that complex damaging processes are described by simple approaches like stage-damage functions. Novel multi-variable models significantly improve loss estimation on the micro-scale and may also be advantageous for large-scale applications. However, more input parameters also reveal additional uncertainty, even more in upscaling procedures for meso-scale applications, where the parameters need to be estimated on a regional area-wide basis. To gain more knowledge about challenges associated with the up-scaling of multi-variable flood loss models the following approach is applied: Single- and multi-variable micro-scale flood loss models are up-scaled and applied on the meso-scale, namely on basis of ATKIS land-use units. Application and validation is undertaken in 19 municipalities, which were affected during the 2002 flood by the River Mulde in Saxony, Germany by comparison to official loss data provided by the Saxon Relief Bank (SAB).In the meso-scale case study based model validation, most multi-variable models show smaller errors than the uni-variable stage-damage functions. The results show the suitability of the up-scaling approach, and, in accordance with micro-scale validation studies, that multi-variable models are an improvement in flood loss modelling also on the meso-scale. However, uncertainties remain high, stressing the importance of uncertainty quantification. Thus, the development of probabilistic loss models, like BT-FLEMO used in this study, which inherently provide uncertainty information are the way forward.

  13. An error bound for a discrete reduced order model of a linear multivariable system

    NASA Technical Reports Server (NTRS)

    Al-Saggaf, Ubaid M.; Franklin, Gene F.

    1987-01-01

    The design of feasible controllers for high dimension multivariable systems can be greatly aided by a method of model reduction. In order for the design based on the order reduction to include a guarantee of stability, it is sufficient to have a bound on the model error. Previous work has provided such a bound for continuous-time systems for algorithms based on balancing. In this note an L-infinity bound is derived for model error for a method of order reduction of discrete linear multivariable systems based on balancing.

  14. Prediction of drug transport processes using simple parameters and PLS statistics. The use of ACD/logP and ACD/ChemSketch descriptors.

    PubMed

    Osterberg, T; Norinder, U

    2001-01-01

    A method of modelling and predicting biopharmaceutical properties using simple theoretically computed molecular descriptors and multivariate statistics has been investigated for several data sets related to solubility, IAM chromatography, permeability across Caco-2 cell monolayers, human intestinal perfusion, brain-blood partitioning, and P-glycoprotein ATPase activity. The molecular descriptors (e.g. molar refractivity, molar volume, index of refraction, surface tension and density) and logP were computed with ACD/ChemSketch and ACD/logP, respectively. Good statistical models were derived that permit simple computational prediction of biopharmaceutical properties. All final models derived had R(2) values ranging from 0.73 to 0.95 and Q(2) values ranging from 0.69 to 0.86. The RMSEP values for the external test sets ranged from 0.24 to 0.85 (log scale).

  15. Multivariate analysis of standoff laser-induced breakdown spectroscopy spectra for classification of explosive-containing residues

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    De Lucia, Frank C. Jr.; Gottfried, Jennifer L.; Munson, Chase A.

    2008-11-01

    A technique being evaluated for standoff explosives detection is laser-induced breakdown spectroscopy (LIBS). LIBS is a real-time sensor technology that uses components that can be configured into a ruggedized standoff instrument. The U.S. Army Research Laboratory has been coupling standoff LIBS spectra with chemometrics for several years now in order to discriminate between explosives and nonexplosives. We have investigated the use of partial least squares discriminant analysis (PLS-DA) for explosives detection. We have extended our study of PLS-DA to more complex sample types, including binary mixtures, different types of explosives, and samples not included in the model. We demonstrate themore » importance of building the PLS-DA model by iteratively testing it against sample test sets. Independent test sets are used to test the robustness of the final model.« less

  16. Automatic Cell Segmentation Using a Shape-Classification Model in Immunohistochemically Stained Cytological Images

    NASA Astrophysics Data System (ADS)

    Shah, Shishir

    This paper presents a segmentation method for detecting cells in immunohistochemically stained cytological images. A two-phase approach to segmentation is used where an unsupervised clustering approach coupled with cluster merging based on a fitness function is used as the first phase to obtain a first approximation of the cell locations. A joint segmentation-classification approach incorporating ellipse as a shape model is used as the second phase to detect the final cell contour. The segmentation model estimates a multivariate density function of low-level image features from training samples and uses it as a measure of how likely each image pixel is to be a cell. This estimate is constrained by the zero level set, which is obtained as a solution to an implicit representation of an ellipse. Results of segmentation are presented and compared to ground truth measurements.

  17. An Analysis of Polynomial Chaos Approximations for Modeling Single-Fluid-Phase Flow in Porous Medium Systems

    PubMed Central

    Rupert, C.P.; Miller, C.T.

    2008-01-01

    We examine a variety of polynomial-chaos-motivated approximations to a stochastic form of a steady state groundwater flow model. We consider approaches for truncating the infinite dimensional problem and producing decoupled systems. We discuss conditions under which such decoupling is possible and show that to generalize the known decoupling by numerical cubature, it would be necessary to find new multivariate cubature rules. Finally, we use the acceleration of Monte Carlo to compare the quality of polynomial models obtained for all approaches and find that in general the methods considered are more efficient than Monte Carlo for the relatively small domains considered in this work. A curse of dimensionality in the series expansion of the log-normal stochastic random field used to represent hydraulic conductivity provides a significant impediment to efficient approximations for large domains for all methods considered in this work, other than the Monte Carlo method. PMID:18836519

  18. Improved Geothermometry Through Multivariate Reaction-path Modeling and Evaluation of Geomicrobiological Influences on Geochemical Temperature Indicators: Final Report

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Mattson, Earl; Smith, Robert; Fujita, Yoshiko

    2015-03-01

    The project was aimed at demonstrating that the geothermometric predictions can be improved through the application of multi-element reaction path modeling that accounts for lithologic and tectonic settings, while also accounting for biological influences on geochemical temperature indicators. The limited utilization of chemical signatures by individual traditional geothermometer in the development of reservoir temperature estimates may have been constraining their reliability for evaluation of potential geothermal resources. This project, however, was intended to build a geothermometry tool which can integrate multi-component reaction path modeling with process-optimization capability that can be applied to dilute, low-temperature water samples to consistently predict reservoirmore » temperature within ±30 °C. The project was also intended to evaluate the extent to which microbiological processes can modulate the geochemical signals in some thermal waters and influence the geothermometric predictions.« less

  19. Research and simulation of the decoupling transformation in AC motor vector control

    NASA Astrophysics Data System (ADS)

    He, Jiaojiao; Zhao, Zhongjie; Liu, Ken; Zhang, Yongping; Yao, Tuozhong

    2018-04-01

    Permanent magnet synchronous motor (PMSM) is a nonlinear, strong coupling, multivariable complex object, and transformation decoupling can solve the coupling problem of permanent magnet synchronous motor. This paper gives a permanent magnet synchronous motor (PMSM) mathematical model, introduces the permanent magnet synchronous motor vector control coordinate transformation in the process of modal matrix inductance matrix transform through the matrix related knowledge of different coordinates of diagonalization, which makes the coupling between the independent, realize the control of motor current and excitation the torque current coupling separation, and derived the coordinate transformation matrix, the thought to solve the coupling problem of AC motor. Finally, in the Matlab/Simulink environment, through the establishment and combination between the PMSM ontology, coordinate conversion module, built the simulation model of permanent magnet synchronous motor vector control, introduces the model of each part, and analyzed the simulation results.

  20. Search for a supersymmetric partner to the top quark using a multivariate analysis technique

    NASA Astrophysics Data System (ADS)

    Darmora, Smita

    Supersymmetry (SUSY) is an extension to the Standard Model (SM) which introduces supersymmetric partners of the known fermions and bosons. Top squark (stop) searches are a natural extension of inclusive SUSY searches at the LHC. If SUSY solves the naturalness problem, the stop should be light enough to cancel the top loop contribution to the Higgs mass parameter. The 3rd generation squarks may be the first SUSY particles to be discovered at the LHC. The stop can decay into a variety of final states, depending, amongst other factors, on the hierarchy of the mass eigenstates formed from the linear superposition of the SUSY partners of the Higgs boson and electroweak gauge bosons. In this study the relevant mass eigenstates are the lightest chargino (chi+/-1) and the neutralino (chi +/-0). A search is presented for a heavy SUSY top partner decaying to a lepton, neutrino and the lightest supersymmetric particle (chi+/-0), via a b-quark and a chargino (chi +/-1) in events with two leptons in the final state. The analysis targets searches for a SUSY top partner by means of Multivariate Analysis Technique, used to discriminate between the stop signal and the background with a learning algorithm based on Monte Carlo generated signal and background samples. The analysis uses data corresponding to 20.3 fb --1 of integrated luminosity at √s = 8 TeV, collected by the ATLAS experiment at the Large Hadron Collider in 2012.

  1. Multivariate Radiological-Based Models for the Prediction of Future Knee Pain: Data from the OAI

    PubMed Central

    Galván-Tejada, Jorge I.; Celaya-Padilla, José M.; Treviño, Victor; Tamez-Peña, José G.

    2015-01-01

    In this work, the potential of X-ray based multivariate prognostic models to predict the onset of chronic knee pain is presented. Using X-rays quantitative image assessments of joint-space-width (JSW) and paired semiquantitative central X-ray scores from the Osteoarthritis Initiative (OAI), a case-control study is presented. The pain assessments of the right knee at the baseline and the 60-month visits were used to screen for case/control subjects. Scores were analyzed at the time of pain incidence (T-0), the year prior incidence (T-1), and two years before pain incidence (T-2). Multivariate models were created by a cross validated elastic-net regularized generalized linear models feature selection tool. Univariate differences between cases and controls were reported by AUC, C-statistics, and ODDs ratios. Univariate analysis indicated that the medial osteophytes were significantly more prevalent in cases than controls: C-stat 0.62, 0.62, and 0.61, at T-0, T-1, and T-2, respectively. The multivariate JSW models significantly predicted pain: AUC = 0.695, 0.623, and 0.620, at T-0, T-1, and T-2, respectively. Semiquantitative multivariate models predicted paint with C-stat = 0.671, 0.648, and 0.645 at T-0, T-1, and T-2, respectively. Multivariate models derived from plain X-ray radiography assessments may be used to predict subjects that are at risk of developing knee pain. PMID:26504490

  2. Calibration transfer of a Raman spectroscopic quantification method for the assessment of liquid detergent compositions from at-line laboratory to in-line industrial scale.

    PubMed

    Brouckaert, D; Uyttersprot, J-S; Broeckx, W; De Beer, T

    2018-03-01

    Calibration transfer or standardisation aims at creating a uniform spectral response on different spectroscopic instruments or under varying conditions, without requiring a full recalibration for each situation. In the current study, this strategy is applied to construct at-line multivariate calibration models and consequently employ them in-line in a continuous industrial production line, using the same spectrometer. Firstly, quantitative multivariate models are constructed at-line at laboratory scale for predicting the concentration of two main ingredients in hard surface cleaners. By regressing the Raman spectra of a set of small-scale calibration samples against their reference concentration values, partial least squares (PLS) models are developed to quantify the surfactant levels in the liquid detergent compositions under investigation. After evaluating the models performance with a set of independent validation samples, a univariate slope/bias correction is applied in view of transporting these at-line calibration models to an in-line manufacturing set-up. This standardisation technique allows a fast and easy transfer of the PLS regression models, by simply correcting the model predictions on the in-line set-up, without adjusting anything to the original multivariate calibration models. An extensive statistical analysis is performed in order to assess the predictive quality of the transferred regression models. Before and after transfer, the R 2 and RMSEP of both models is compared for evaluating if their magnitude is similar. T-tests are then performed to investigate whether the slope and intercept of the transferred regression line are not statistically different from 1 and 0, respectively. Furthermore, it is inspected whether no significant bias can be noted. F-tests are executed as well, for assessing the linearity of the transfer regression line and for investigating the statistical coincidence of the transfer and validation regression line. Finally, a paired t-test is performed to compare the original at-line model to the slope/bias corrected in-line model, using interval hypotheses. It is shown that the calibration models of Surfactant 1 and Surfactant 2 yield satisfactory in-line predictions after slope/bias correction. While Surfactant 1 passes seven out of eight statistical tests, the recommended validation parameters are 100% successful for Surfactant 2. It is hence concluded that the proposed strategy for transferring at-line calibration models to an in-line industrial environment via a univariate slope/bias correction of the predicted values offers a successful standardisation approach. Copyright © 2017 Elsevier B.V. All rights reserved.

  3. Sensor failure and multivariable control for airbreathing propulsion systems. Ph.D. Thesis - Dec. 1979 Final Report

    NASA Technical Reports Server (NTRS)

    Behbehani, K.

    1980-01-01

    A new sensor/actuator failure analysis technique for turbofan jet engines was developed. Three phases of failure analysis, namely detection, isolation, and accommodation are considered. Failure detection and isolation techniques are developed by utilizing the concept of Generalized Likelihood Ratio (GLR) tests. These techniques are applicable to both time varying and time invariant systems. Three GLR detectors are developed for: (1) hard-over sensor failure; (2) hard-over actuator failure; and (3) brief disturbances in the actuators. The probability distribution of the GLR detectors and the detectability of sensor/actuator failures are established. Failure type is determined by the maximum of the GLR detectors. Failure accommodation is accomplished by extending the Multivariable Nyquest Array (MNA) control design techniques to nonsquare system designs. The performance and effectiveness of the failure analysis technique are studied by applying the technique to a turbofan jet engine, namely the Quiet Clean Short Haul Experimental Engine (QCSEE). Single and multiple sensor/actuator failures in the QCSEE are simulated and analyzed and the effects of model degradation are studied.

  4. Robust Decentralized Nonlinear Control for a Twin Rotor MIMO System

    PubMed Central

    Belmonte, Lidia María; Morales, Rafael; Fernández-Caballero, Antonio; Somolinos, José Andrés

    2016-01-01

    This article presents the design of a novel decentralized nonlinear multivariate control scheme for an underactuated, nonlinear and multivariate laboratory helicopter denominated the twin rotor MIMO system (TRMS). The TRMS is characterized by a coupling effect between rotor dynamics and the body of the model, which is due to the action-reaction principle originated in the acceleration and deceleration of the motor-propeller groups. The proposed controller is composed of two nested loops that are utilized to achieve stabilization and precise trajectory tracking tasks for the controlled position of the generalized coordinates of the TRMS. The nonlinear internal loop is used to control the electrical dynamics of the platform, and the nonlinear external loop allows the platform to be perfectly stabilized and positioned in space. Finally, we illustrate the theoretical control developments with a set of experiments in order to verify the effectiveness of the proposed nonlinear decentralized feedback controller, in which a comparative study with other controllers is performed, illustrating the excellent performance of the proposed robust decentralized control scheme in both stabilization and trajectory tracking tasks. PMID:27472338

  5. A multiresolution approach for the convergence acceleration of multivariate curve resolution methods.

    PubMed

    Sawall, Mathias; Kubis, Christoph; Börner, Armin; Selent, Detlef; Neymeyr, Klaus

    2015-09-03

    Modern computerized spectroscopic instrumentation can result in high volumes of spectroscopic data. Such accurate measurements rise special computational challenges for multivariate curve resolution techniques since pure component factorizations are often solved via constrained minimization problems. The computational costs for these calculations rapidly grow with an increased time or frequency resolution of the spectral measurements. The key idea of this paper is to define for the given high-dimensional spectroscopic data a sequence of coarsened subproblems with reduced resolutions. The multiresolution algorithm first computes a pure component factorization for the coarsest problem with the lowest resolution. Then the factorization results are used as initial values for the next problem with a higher resolution. Good initial values result in a fast solution on the next refined level. This procedure is repeated and finally a factorization is determined for the highest level of resolution. The described multiresolution approach allows a considerable convergence acceleration. The computational procedure is analyzed and is tested for experimental spectroscopic data from the rhodium-catalyzed hydroformylation together with various soft and hard models. Copyright © 2015 Elsevier B.V. All rights reserved.

  6. Robust Decentralized Nonlinear Control for a Twin Rotor MIMO System.

    PubMed

    Belmonte, Lidia María; Morales, Rafael; Fernández-Caballero, Antonio; Somolinos, José Andrés

    2016-07-27

    This article presents the design of a novel decentralized nonlinear multivariate control scheme for an underactuated, nonlinear and multivariate laboratory helicopter denominated the twin rotor MIMO system (TRMS). The TRMS is characterized by a coupling effect between rotor dynamics and the body of the model, which is due to the action-reaction principle originated in the acceleration and deceleration of the motor-propeller groups. The proposed controller is composed of two nested loops that are utilized to achieve stabilization and precise trajectory tracking tasks for the controlled position of the generalized coordinates of the TRMS. The nonlinear internal loop is used to control the electrical dynamics of the platform, and the nonlinear external loop allows the platform to be perfectly stabilized and positioned in space. Finally, we illustrate the theoretical control developments with a set of experiments in order to verify the effectiveness of the proposed nonlinear decentralized feedback controller, in which a comparative study with other controllers is performed, illustrating the excellent performance of the proposed robust decentralized control scheme in both stabilization and trajectory tracking tasks.

  7. Self-consistent core-pedestal transport simulations with neural network accelerated models

    DOE PAGES

    Meneghini, Orso; Smith, Sterling P.; Snyder, Philip B.; ...

    2017-07-12

    Fusion whole device modeling simulations require comprehensive models that are simultaneously physically accurate, fast, robust, and predictive. In this paper we describe the development of two neural-network (NN) based models as a means to perform a snon-linear multivariate regression of theory-based models for the core turbulent transport fluxes, and the pedestal structure. Specifically, we find that a NN-based approach can be used to consistently reproduce the results of the TGLF and EPED1 theory-based models over a broad range of plasma regimes, and with a computational speedup of several orders of magnitudes. These models are then integrated into a predictive workflowmore » that allows prediction with self-consistent core-pedestal coupling of the kinetic profiles within the last closed flux surface of the plasma. Finally, the NN paradigm is capable of breaking the speed-accuracy trade-off that is expected of traditional numerical physics models, and can provide the missing link towards self-consistent coupled core-pedestal whole device modeling simulations that are physically accurate and yet take only seconds to run.« less

  8. Self-consistent core-pedestal transport simulations with neural network accelerated models

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Meneghini, Orso; Smith, Sterling P.; Snyder, Philip B.

    Fusion whole device modeling simulations require comprehensive models that are simultaneously physically accurate, fast, robust, and predictive. In this paper we describe the development of two neural-network (NN) based models as a means to perform a snon-linear multivariate regression of theory-based models for the core turbulent transport fluxes, and the pedestal structure. Specifically, we find that a NN-based approach can be used to consistently reproduce the results of the TGLF and EPED1 theory-based models over a broad range of plasma regimes, and with a computational speedup of several orders of magnitudes. These models are then integrated into a predictive workflowmore » that allows prediction with self-consistent core-pedestal coupling of the kinetic profiles within the last closed flux surface of the plasma. Finally, the NN paradigm is capable of breaking the speed-accuracy trade-off that is expected of traditional numerical physics models, and can provide the missing link towards self-consistent coupled core-pedestal whole device modeling simulations that are physically accurate and yet take only seconds to run.« less

  9. An Analytic Solution to the Computation of Power and Sample Size for Genetic Association Studies under a Pleiotropic Mode of Inheritance.

    PubMed

    Gordon, Derek; Londono, Douglas; Patel, Payal; Kim, Wonkuk; Finch, Stephen J; Heiman, Gary A

    2016-01-01

    Our motivation here is to calculate the power of 3 statistical tests used when there are genetic traits that operate under a pleiotropic mode of inheritance and when qualitative phenotypes are defined by use of thresholds for the multiple quantitative phenotypes. Specifically, we formulate a multivariate function that provides the probability that an individual has a vector of specific quantitative trait values conditional on having a risk locus genotype, and we apply thresholds to define qualitative phenotypes (affected, unaffected) and compute penetrances and conditional genotype frequencies based on the multivariate function. We extend the analytic power and minimum-sample-size-necessary (MSSN) formulas for 2 categorical data-based tests (genotype, linear trend test [LTT]) of genetic association to the pleiotropic model. We further compare the MSSN of the genotype test and the LTT with that of a multivariate ANOVA (Pillai). We approximate the MSSN for statistics by linear models using a factorial design and ANOVA. With ANOVA decomposition, we determine which factors most significantly change the power/MSSN for all statistics. Finally, we determine which test statistics have the smallest MSSN. In this work, MSSN calculations are for 2 traits (bivariate distributions) only (for illustrative purposes). We note that the calculations may be extended to address any number of traits. Our key findings are that the genotype test usually has lower MSSN requirements than the LTT. More inclusive thresholds (top/bottom 25% vs. top/bottom 10%) have higher sample size requirements. The Pillai test has a much larger MSSN than both the genotype test and the LTT, as a result of sample selection. With these formulas, researchers can specify how many subjects they must collect to localize genes for pleiotropic phenotypes. © 2017 S. Karger AG, Basel.

  10. Multivariate curve resolution based chromatographic peak alignment combined with parallel factor analysis to exploit second-order advantage in complex chromatographic measurements.

    PubMed

    Parastar, Hadi; Akvan, Nadia

    2014-03-13

    In the present contribution, a new combination of multivariate curve resolution-correlation optimized warping (MCR-COW) with trilinear parallel factor analysis (PARAFAC) is developed to exploit second-order advantage in complex chromatographic measurements. In MCR-COW, the complexity of the chromatographic data is reduced by arranging the data in a column-wise augmented matrix, analyzing using MCR bilinear model and aligning the resolved elution profiles using COW in a component-wise manner. The aligned chromatographic data is then decomposed using trilinear model of PARAFAC in order to exploit pure chromatographic and spectroscopic information. The performance of this strategy is evaluated using simulated and real high-performance liquid chromatography-diode array detection (HPLC-DAD) datasets. The obtained results showed that the MCR-COW can efficiently correct elution time shifts of target compounds that are completely overlapped by coeluted interferences in complex chromatographic data. In addition, the PARAFAC analysis of aligned chromatographic data has the advantage of unique decomposition of overlapped chromatographic peaks to identify and quantify the target compounds in the presence of interferences. Finally, to confirm the reliability of the proposed strategy, the performance of the MCR-COW-PARAFAC is compared with the frequently used methods of PARAFAC, COW-PARAFAC, multivariate curve resolution-alternating least squares (MCR-ALS), and MCR-COW-MCR. In general, in most of the cases the MCR-COW-PARAFAC showed an improvement in terms of lack of fit (LOF), relative error (RE) and spectral correlation coefficients in comparison to the PARAFAC, COW-PARAFAC, MCR-ALS and MCR-COW-MCR results. Copyright © 2014 Elsevier B.V. All rights reserved.

  11. A Spatially Constrained Multi-autoencoder Approach for Multivariate Geochemical Anomaly Recognition

    NASA Astrophysics Data System (ADS)

    Lirong, C.; Qingfeng, G.; Renguang, Z.; Yihui, X.

    2017-12-01

    Separating and recognizing geochemical anomalies from the geochemical background is one of the key tasks in geochemical exploration. Many methods have been developed, such as calculating the mean ±2 standard deviation, and fractal/multifractal models. In recent years, deep autoencoder, a deep learning approach, have been used for multivariate geochemical anomaly recognition. While being able to deal with the non-normal distributions of geochemical concentrations and the non-linear relationships among them, this self-supervised learning method does not take into account the spatial heterogeneity of geochemical background and the uncertainty induced by the randomly initialized weights of neurons, leading to ineffective recognition of weak anomalies. In this paper, we introduce a spatially constrained multi-autoencoder (SCMA) approach for multivariate geochemical anomaly recognition, which includes two steps: spatial partitioning and anomaly score computation. The first step divides the study area into multiple sub-regions to segregate the geochemical background, by grouping the geochemical samples through K-means clustering, spatial filtering, and spatial constraining rules. In the second step, for each sub-region, a group of autoencoder neural networks are constructed with an identical structure but different initial weights on neurons. Each autoencoder is trained using the geochemical samples within the corresponding sub-region to learn the sub-regional geochemical background. The best autoencoder of a group is chosen as the final model for the corresponding sub-region. The anomaly score at each location can then be calculated as the euclidean distance between the observed concentrations and reconstructed concentrations of geochemical elements.The experiments using the geochemical data and Fe deposits in the southwestern Fujian province of China showed that our SCMA approach greatly improved the recognition of weak anomalies, achieving the AUC of 0.89, compared with the AUC of 0.77 using a single deep autoencoder approach.

  12. Quality of life in Chronic Pancreatitis is determined by constant pain, disability/unemployment, current smoking and associated co-morbidities

    PubMed Central

    Machicado, Jorge D.; Amann, Stephen T; Anderson, Michelle A.; Abberbock, Judah; Sherman, Stuart; Conwell, Darwin; Cote, Gregory A.; Singh, Vikesh K.; Lewis, Michele; Alkaade, Samer; Sandhu, Bimaljit S.; Guda, Nalini M.; Muniraj, Thiruvengadam; Tang, Gong; Baillie, John; Brand, Randall; Gardner, Timothy B.; Gelrud, Andres; Forsmark, Christopher E.; Banks, Peter A.; Slivka, Adam; Wilcox, C. Mel; Whitcomb, David C.; Yadav, Dhiraj

    2018-01-01

    Background Chronic pancreatitis (CP) has a profound independent effect on quality of life (QOL). Our aim was to identify factors that impact the QOL in CP patients. Methods We used data on 1,024 CP patients enrolled in the three NAPS2 studies. Information on demographics, risk factors, co-morbidities, disease phenotype and treatments was obtained from responses to structured questionnaires. Physical (PCS) and mental (MCS) component summary scores generated using responses to the Short Form-12 (SF-12) survey were used to assess QOL at enrollment. Multivariable linear regression models determined independent predictors of QOL. Results Mean PCS and MCS scores were 36.7±11.7 and 42.4±12.2, respectively. Significant (p<0.05) negative impact on PCS scores in multivariable analyses was noted due to constant mild-moderate pain with episodes of severe pain or constant severe pain (10 points), constant mild-moderate pain (5.2), pain-related disability/unemployment (5.1), current smoking (2.9 points) and medical co-morbidities. Significant (p<0.05) negative impact on MCS scores was related to constant pain irrespective of severity (6.8-6.9 points), current smoking (3.9 points) and pain-related disability/unemployment (2.4 points). In women, disability/unemployment resulted in an additional reduction 3.7 point reduction in MCS score. Final multivariable models explained 27% and 18% of the variance in PCS and MCS scores, respectively. Etiology, disease duration, pancreatic morphology, diabetes, exocrine insufficiency and prior endotherapy/pancreatic surgery had no significant independent effect on QOL. Conclusion Constant pain, pain-related disability/unemployment, current smoking, and concurrent co-morbidities significantly affect the QOL in CP. Further research is needed to identify factors impacting QOL not explained by our analyses. PMID:28244497

  13. Comparison of the prognostic value of pretreatment measurements of systemic inflammatory response in patients undergoing curative resection of clear cell renal cell carcinoma.

    PubMed

    Lucca, Ilaria; de Martino, Michela; Hofbauer, Sebastian L; Zamani, Nura; Shariat, Shahrokh F; Klatte, Tobias

    2015-12-01

    Pretreatment measurements of systemic inflammatory response, including the Glasgow prognostic score (GPS), the neutrophil-to-lymphocyte ratio (NLR), the monocyte-to-lymphocyte ratio (MLR), the platelet-to-lymphocyte ratio (PLR) and the prognostic nutritional index (PNI) have been recognized as prognostic factors in clear cell renal cell carcinoma (CCRCC), but there is at present no study that compared these markers. We evaluated the pretreatment GPS, NLR, MLR, PLR and PNI in 430 patients, who underwent surgery for clinically localized CCRCC (pT1-3N0M0). Associations with disease-free survival were assessed with Cox models. Discrimination was measured with the C-index, and a decision curve analysis was used to evaluate the clinical net benefit. On multivariable analyses, all measures of systemic inflammatory response were significant prognostic factors. The increase in discrimination compared with the stage, size, grade and necrosis (SSIGN) score alone was 5.8 % for the GPS, 1.1-1.4 % for the NLR, 2.9-3.4 % for the MLR, 2.0-3.3 % for the PLR and 1.4-3.0 % for the PNI. On the simultaneous multivariable analysis of all candidate measures, the final multivariable model contained the SSIGN score (HR 1.40, P < 0.001), the GPS (HR 2.32, P < 0.001) and the MLR (HR 5.78, P = 0.003) as significant variables. Adding both the GPS and the MLR increased the discrimination of the SSIGN score by 6.2 % and improved the clinical net benefit. In patients with clinically localized CCRCC, the GPS and the MLR appear to be the most relevant prognostic measures of systemic inflammatory response. They may be used as an adjunct for patient counseling, tailoring management and clinical trial design.

  14. Quality of Life in Chronic Pancreatitis is Determined by Constant Pain, Disability/Unemployment, Current Smoking, and Associated Co-Morbidities.

    PubMed

    Machicado, Jorge D; Amann, Stephen T; Anderson, Michelle A; Abberbock, Judah; Sherman, Stuart; Conwell, Darwin L; Cote, Gregory A; Singh, Vikesh K; Lewis, Michele D; Alkaade, Samer; Sandhu, Bimaljit S; Guda, Nalini M; Muniraj, Thiruvengadam; Tang, Gong; Baillie, John; Brand, Randall E; Gardner, Timothy B; Gelrud, Andres; Forsmark, Christopher E; Banks, Peter A; Slivka, Adam; Wilcox, C Mel; Whitcomb, David C; Yadav, Dhiraj

    2017-04-01

    Chronic pancreatitis (CP) has a profound independent effect on quality of life (QOL). Our aim was to identify factors that impact the QOL in CP patients. We used data on 1,024 CP patients enrolled in the three NAPS2 studies. Information on demographics, risk factors, co-morbidities, disease phenotype, and treatments was obtained from responses to structured questionnaires. Physical and mental component summary (PCS and MCS, respectively) scores generated using responses to the Short Form-12 (SF-12) survey were used to assess QOL at enrollment. Multivariable linear regression models determined independent predictors of QOL. Mean PCS and MCS scores were 36.7±11.7 and 42.4±12.2, respectively. Significant (P<0.05) negative impact on PCS scores in multivariable analyses was noted owing to constant mild-moderate pain with episodes of severe pain or constant severe pain (10 points), constant mild-moderate pain (5.2), pain-related disability/unemployment (5.1), current smoking (2.9 points), and medical co-morbidities. Significant (P<0.05) negative impact on MCS scores was related to constant pain irrespective of severity (6.8-6.9 points), current smoking (3.9 points), and pain-related disability/unemployment (2.4 points). In women, disability/unemployment resulted in an additional 3.7 point reduction in MCS score. Final multivariable models explained 27% and 18% of the variance in PCS and MCS scores, respectively. Etiology, disease duration, pancreatic morphology, diabetes, exocrine insufficiency, and prior endotherapy/pancreatic surgery had no significant independent effect on QOL. Constant pain, pain-related disability/unemployment, current smoking, and concurrent co-morbidities significantly affect the QOL in CP. Further research is needed to identify factors impacting QOL not explained by our analyses.

  15. Preliminary Multi-Variable Parametric Cost Model for Space Telescopes

    NASA Technical Reports Server (NTRS)

    Stahl, H. Philip; Hendrichs, Todd

    2010-01-01

    This slide presentation reviews creating a preliminary multi-variable cost model for the contract costs of making a space telescope. There is discussion of the methodology for collecting the data, definition of the statistical analysis methodology, single variable model results, testing of historical models and an introduction of the multi variable models.

  16. Forecasting high-priority infectious disease surveillance regions: a socioeconomic model.

    PubMed

    Chan, Emily H; Scales, David A; Brewer, Timothy F; Madoff, Lawrence C; Pollack, Marjorie P; Hoen, Anne G; Choden, Tenzin; Brownstein, John S

    2013-02-01

    Few researchers have assessed the relationships between socioeconomic inequality and infectious disease outbreaks at the population level globally. We use a socioeconomic model to forecast national annual rates of infectious disease outbreaks. We constructed a multivariate mixed-effects Poisson model of the number of times a given country was the origin of an outbreak in a given year. The dataset included 389 outbreaks of international concern reported in the World Health Organization's Disease Outbreak News from 1996 to 2008. The initial full model included 9 socioeconomic variables related to education, poverty, population health, urbanization, health infrastructure, gender equality, communication, transportation, and democracy, and 1 composite index. Population, latitude, and elevation were included as potential confounders. The initial model was pared down to a final model by a backwards elimination procedure. The dependent and independent variables were lagged by 2 years to allow for forecasting future rates. Among the socioeconomic variables tested, the final model included child measles immunization rate and telephone line density. The Democratic Republic of Congo, China, and Brazil were predicted to be at the highest risk for outbreaks in 2010, and Colombia and Indonesia were predicted to have the highest percentage of increase in their risk compared to their average over 1996-2008. Understanding socioeconomic factors could help improve the understanding of outbreak risk. The inclusion of the measles immunization variable suggests that there is a fundamental basis in ensuring adequate public health capacity. Increased vigilance and expanding public health capacity should be prioritized in the projected high-risk regions.

  17. Multivariate Models for Normal and Binary Responses in Intervention Studies

    ERIC Educational Resources Information Center

    Pituch, Keenan A.; Whittaker, Tiffany A.; Chang, Wanchen

    2016-01-01

    Use of multivariate analysis (e.g., multivariate analysis of variance) is common when normally distributed outcomes are collected in intervention research. However, when mixed responses--a set of normal and binary outcomes--are collected, standard multivariate analyses are no longer suitable. While mixed responses are often obtained in…

  18. Prediction of MeV electron fluxes throughout the outer radiation belt using multivariate autoregressive models

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Sakaguchi, Kaori; Nagatsuma, Tsutomu; Reeves, Geoffrey D.

    The Van Allen radiation belts surrounding the Earth are filled with MeV-energy electrons. This region poses ionizing radiation risks for spacecraft that operate within it, including those in geostationary orbit (GEO) and medium Earth orbit. In order to provide alerts of electron flux enhancements, 16 prediction models of the electron log-flux variation throughout the equatorial outer radiation belt as a function of the McIlwain L parameter were developed using the multivariate autoregressive model and Kalman filter. Measurements of omnidirectional 2.3 MeV electron flux from the Van Allen Probes mission as well as >2 MeV electrons from the GOES 15 spacecraftmore » were used as the predictors. Furthermore, we selected model explanatory parameters from solar wind parameters, the electron log-flux at GEO, and geomagnetic indices. For the innermost region of the outer radiation belt, the electron flux is best predicted by using the Dst index as the sole input parameter. For the central to outermost regions, at L≥4.8 and L ≥5.6, the electron flux is predicted most accurately by including also the solar wind velocity and then the dynamic pressure, respectively. The Dst index is the best overall single parameter for predicting at 3 ≤ L ≤ 6, while for the GEO flux prediction, the K P index is better than Dst. Finally, a test calculation demonstrates that the model successfully predicts the timing and location of the flux maximum as much as 2 days in advance and that the electron flux decreases faster with time at higher L values, both model features consistent with the actually observed behavior.« less

  19. Prediction of MeV electron fluxes throughout the outer radiation belt using multivariate autoregressive models

    DOE PAGES

    Sakaguchi, Kaori; Nagatsuma, Tsutomu; Reeves, Geoffrey D.; ...

    2015-12-22

    The Van Allen radiation belts surrounding the Earth are filled with MeV-energy electrons. This region poses ionizing radiation risks for spacecraft that operate within it, including those in geostationary orbit (GEO) and medium Earth orbit. In order to provide alerts of electron flux enhancements, 16 prediction models of the electron log-flux variation throughout the equatorial outer radiation belt as a function of the McIlwain L parameter were developed using the multivariate autoregressive model and Kalman filter. Measurements of omnidirectional 2.3 MeV electron flux from the Van Allen Probes mission as well as >2 MeV electrons from the GOES 15 spacecraftmore » were used as the predictors. Furthermore, we selected model explanatory parameters from solar wind parameters, the electron log-flux at GEO, and geomagnetic indices. For the innermost region of the outer radiation belt, the electron flux is best predicted by using the Dst index as the sole input parameter. For the central to outermost regions, at L≥4.8 and L ≥5.6, the electron flux is predicted most accurately by including also the solar wind velocity and then the dynamic pressure, respectively. The Dst index is the best overall single parameter for predicting at 3 ≤ L ≤ 6, while for the GEO flux prediction, the K P index is better than Dst. Finally, a test calculation demonstrates that the model successfully predicts the timing and location of the flux maximum as much as 2 days in advance and that the electron flux decreases faster with time at higher L values, both model features consistent with the actually observed behavior.« less

  20. Combining microwave resonance technology to multivariate data analysis as a novel PAT tool to improve process understanding in fluid bed granulation.

    PubMed

    Lourenço, Vera; Herdling, Thorsten; Reich, Gabriele; Menezes, José C; Lochmann, Dirk

    2011-08-01

    A set of 192 fluid bed granulation batches at industrial scale were in-line monitored using microwave resonance technology (MRT) to determine moisture, temperature and density of the granules. Multivariate data analysis techniques such as multiway partial least squares (PLS), multiway principal component analysis (PCA) and multivariate batch control charts were applied onto collected batch data sets. The combination of all these techniques, along with off-line particle size measurements, led to significantly increased process understanding. A seasonality effect could be put into evidence that impacted further processing through its influence on the final granule size. Moreover, it was demonstrated by means of a PLS that a relation between the particle size and the MRT measurements can be quantitatively defined, highlighting a potential ability of the MRT sensor to predict information about the final granule size. This study has contributed to improve a fluid bed granulation process, and the process knowledge obtained shows that the product quality can be built in process design, following Quality by Design (QbD) and Process Analytical Technology (PAT) principles. Copyright © 2011. Published by Elsevier B.V.

  1. Ethnic differences in self reported health in Malmö in southern Sweden

    PubMed Central

    Lindstrom, M; Sundquist, J; Ostergren, P

    2001-01-01

    STUDY OBJECTIVE—The aim of this study was to investigate ethnic differences in self reported health in the city of Malmö, Sweden, and whether these differences could be explained by psychosocial and economic conditions.
DESIGN/SETTING/PARTICIPANTS—The public health survey in Malmö 1994 was a cross sectional study. A total of 5600 people aged 20-80 years completed a postal questionnaire. The participation rate was 71%. The population was categorised according to country of origin: born in Sweden, other Western countries, Yugoslavia, Poland, Arabic speaking countries and all other countries. The multivariate analysis was performed using a logistic regression model in order to investigate the importance of possible confounders on the differences by country of origin in self reported health. Finally, variables measuring psychosocial and economic conditions were introduced into the model.
MAIN RESULTS—The odds ratios of having poor self reported health were significantly higher among men born in other Western countries, Yugoslavia, Arabic speaking countries and in the category all other countries, as well as among women born in Yugoslavia, Poland and all other countries, compared with men and women born in Sweden. The multivariate analysis including age and education did not change these results. A huge reduction of the odds ratios was observed for men and women born in Yugoslavia, Arabic speaking countries and all other countries, and for women born in Poland after the introduction of the social network, social support and economic factors into the multivariate model.
CONCLUSIONS—There were significant ethnic group differences in self reported health. These differences were greatly reduced by psychosocial and economic factors, which suggest that these factors may be important determinants of self rated health in certain minority groups.


Keywords: self reported health; social network; social support PMID:11154248

  2. PTEN Loss as Determined by Clinical-grade Immunohistochemistry Assay Is Associated with Worse Recurrence-free Survival in Prostate Cancer

    PubMed Central

    Lotan, Tamara L.; Wei, Wei; Morais, Carlos L.; Hawley, Sarah T.; Fazli, Ladan; Hurtado-Coll, Antonio; Troyer, Dean; McKenney, Jesse K.; Simko, Jeffrey; Carroll, Peter R.; Gleave, Martin; Lance, Raymond; Lin, Daniel W.; Nelson, Peter S.; Thompson, Ian M.; True, Lawrence D.; Feng, Ziding; Brooks, James D.

    2015-01-01

    Background PTEN is the most commonly deleted tumor suppressor gene in primary prostate cancer (PCa) and its loss is associated with poor clinical outcomes and ERG gene rearrangement. Objective We tested whether PTEN loss is associated with shorter recurrence-free survival (RFS) in surgically treated PCa patients with known ERG status. Design, setting, and participants A genetically validated, automated PTEN immunohistochemistry (IHC) protocol was used for 1275 primary prostate tumors from the Canary Foundation retrospective PCa tissue microarray cohort to assess homogeneous (in all tumor tissue sampled) or heterogeneous (in a subset of tumor tissue sampled) PTEN loss. ERG status as determined by a genetically validated IHC assay was available for a subset of 938 tumors. Outcome measurements and statistical analysis Associations between PTEN and ERG status were assessed using Fisher’s exact test. Kaplan-Meier and multivariate weighted Cox proportional models for RFS were constructed. Results and limitations When compared to intact PTEN, homogeneous (hazard ratio [HR] 1.66, p = 0.001) but not heterogeneous (HR 1.24, p = 0.14) PTEN loss was significantly associated with shorter RFS in multivariate models. Among ERG-positive tumors, homogeneous (HR 3.07, p < 0.0001) but not heterogeneous (HR 1.46, p = 0.10) PTEN loss was significantly associated with shorter RFS. Among ERG-negative tumors, PTEN did not reach significance for inclusion in the final multivariate models. The interaction term for PTEN and ERG status with respect to RFS did not reach statistical significance (p = 0.11) for the current sample size. Conclusions These data suggest that PTEN is a useful prognostic biomarker and that there is no statistically significant interaction between PTEN and ERG status for RFS. Patient summary We found that loss of the PTEN tumor suppressor gene in prostate tumors as assessed by tissue staining is correlated with shorter time to prostate cancer recurrence after radical prostatectomy. PMID:27617307

  3. Multivariate quantile mapping bias correction: an N-dimensional probability density function transform for climate model simulations of multiple variables

    NASA Astrophysics Data System (ADS)

    Cannon, Alex J.

    2018-01-01

    Most bias correction algorithms used in climatology, for example quantile mapping, are applied to univariate time series. They neglect the dependence between different variables. Those that are multivariate often correct only limited measures of joint dependence, such as Pearson or Spearman rank correlation. Here, an image processing technique designed to transfer colour information from one image to another—the N-dimensional probability density function transform—is adapted for use as a multivariate bias correction algorithm (MBCn) for climate model projections/predictions of multiple climate variables. MBCn is a multivariate generalization of quantile mapping that transfers all aspects of an observed continuous multivariate distribution to the corresponding multivariate distribution of variables from a climate model. When applied to climate model projections, changes in quantiles of each variable between the historical and projection period are also preserved. The MBCn algorithm is demonstrated on three case studies. First, the method is applied to an image processing example with characteristics that mimic a climate projection problem. Second, MBCn is used to correct a suite of 3-hourly surface meteorological variables from the Canadian Centre for Climate Modelling and Analysis Regional Climate Model (CanRCM4) across a North American domain. Components of the Canadian Forest Fire Weather Index (FWI) System, a complicated set of multivariate indices that characterizes the risk of wildfire, are then calculated and verified against observed values. Third, MBCn is used to correct biases in the spatial dependence structure of CanRCM4 precipitation fields. Results are compared against a univariate quantile mapping algorithm, which neglects the dependence between variables, and two multivariate bias correction algorithms, each of which corrects a different form of inter-variable correlation structure. MBCn outperforms these alternatives, often by a large margin, particularly for annual maxima of the FWI distribution and spatiotemporal autocorrelation of precipitation fields.

  4. Partial Least Squares Calibration Modeling Towards the Multivariate Limit of Detection for Enriched Isotopic Mixtures via Laser Ablation Molecular Isotopic Spectroscopy

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Harris, Candace; Profeta, Luisa; Akpovo, Codjo

    The psuedo univariate limit of detection was calculated to compare to the multivariate interval. ompared with results from the psuedounivariate LOD, the multivariate LOD includes other factors (i.e. signal uncertainties) and the reveals the significance in creating models that not only use the analyte’s emission line but also its entire molecular spectra.

  5. Multiple imputation for handling missing outcome data when estimating the relative risk.

    PubMed

    Sullivan, Thomas R; Lee, Katherine J; Ryan, Philip; Salter, Amy B

    2017-09-06

    Multiple imputation is a popular approach to handling missing data in medical research, yet little is known about its applicability for estimating the relative risk. Standard methods for imputing incomplete binary outcomes involve logistic regression or an assumption of multivariate normality, whereas relative risks are typically estimated using log binomial models. It is unclear whether misspecification of the imputation model in this setting could lead to biased parameter estimates. Using simulated data, we evaluated the performance of multiple imputation for handling missing data prior to estimating adjusted relative risks from a correctly specified multivariable log binomial model. We considered an arbitrary pattern of missing data in both outcome and exposure variables, with missing data induced under missing at random mechanisms. Focusing on standard model-based methods of multiple imputation, missing data were imputed using multivariate normal imputation or fully conditional specification with a logistic imputation model for the outcome. Multivariate normal imputation performed poorly in the simulation study, consistently producing estimates of the relative risk that were biased towards the null. Despite outperforming multivariate normal imputation, fully conditional specification also produced somewhat biased estimates, with greater bias observed for higher outcome prevalences and larger relative risks. Deleting imputed outcomes from analysis datasets did not improve the performance of fully conditional specification. Both multivariate normal imputation and fully conditional specification produced biased estimates of the relative risk, presumably since both use a misspecified imputation model. Based on simulation results, we recommend researchers use fully conditional specification rather than multivariate normal imputation and retain imputed outcomes in the analysis when estimating relative risks. However fully conditional specification is not without its shortcomings, and so further research is needed to identify optimal approaches for relative risk estimation within the multiple imputation framework.

  6. A simplified parsimonious higher order multivariate Markov chain model

    NASA Astrophysics Data System (ADS)

    Wang, Chao; Yang, Chuan-sheng

    2017-09-01

    In this paper, a simplified parsimonious higher-order multivariate Markov chain model (SPHOMMCM) is presented. Moreover, parameter estimation method of TPHOMMCM is give. Numerical experiments shows the effectiveness of TPHOMMCM.

  7. Piecewise multivariate modelling of sequential metabolic profiling data.

    PubMed

    Rantalainen, Mattias; Cloarec, Olivier; Ebbels, Timothy M D; Lundstedt, Torbjörn; Nicholson, Jeremy K; Holmes, Elaine; Trygg, Johan

    2008-02-19

    Modelling the time-related behaviour of biological systems is essential for understanding their dynamic responses to perturbations. In metabolic profiling studies, the sampling rate and number of sampling points are often restricted due to experimental and biological constraints. A supervised multivariate modelling approach with the objective to model the time-related variation in the data for short and sparsely sampled time-series is described. A set of piecewise Orthogonal Projections to Latent Structures (OPLS) models are estimated, describing changes between successive time points. The individual OPLS models are linear, but the piecewise combination of several models accommodates modelling and prediction of changes which are non-linear with respect to the time course. We demonstrate the method on both simulated and metabolic profiling data, illustrating how time related changes are successfully modelled and predicted. The proposed method is effective for modelling and prediction of short and multivariate time series data. A key advantage of the method is model transparency, allowing easy interpretation of time-related variation in the data. The method provides a competitive complement to commonly applied multivariate methods such as OPLS and Principal Component Analysis (PCA) for modelling and analysis of short time-series data.

  8. A tridiagonal parsimonious higher order multivariate Markov chain model

    NASA Astrophysics Data System (ADS)

    Wang, Chao; Yang, Chuan-sheng

    2017-09-01

    In this paper, we present a tridiagonal parsimonious higher-order multivariate Markov chain model (TPHOMMCM). Moreover, estimation method of the parameters in TPHOMMCM is give. Numerical experiments illustrate the effectiveness of TPHOMMCM.

  9. MULTIVARIATE LINEAR MIXED MODELS FOR MULTIPLE OUTCOMES. (R824757)

    EPA Science Inventory

    We propose a multivariate linear mixed (MLMM) for the analysis of multiple outcomes, which generalizes the latent variable model of Sammel and Ryan. The proposed model assumes a flexible correlation structure among the multiple outcomes, and allows a global test of the impact of ...

  10. Electricity Consumption in the Industrial Sector of Jordan: Application of Multivariate Linear Regression and Adaptive Neuro-Fuzzy Techniques

    NASA Astrophysics Data System (ADS)

    Samhouri, M.; Al-Ghandoor, A.; Fouad, R. H.

    2009-08-01

    In this study two techniques, for modeling electricity consumption of the Jordanian industrial sector, are presented: (i) multivariate linear regression and (ii) neuro-fuzzy models. Electricity consumption is modeled as function of different variables such as number of establishments, number of employees, electricity tariff, prevailing fuel prices, production outputs, capacity utilizations, and structural effects. It was found that industrial production and capacity utilization are the most important variables that have significant effect on future electrical power demand. The results showed that both the multivariate linear regression and neuro-fuzzy models are generally comparable and can be used adequately to simulate industrial electricity consumption. However, comparison that is based on the square root average squared error of data suggests that the neuro-fuzzy model performs slightly better for future prediction of electricity consumption than the multivariate linear regression model. Such results are in full agreement with similar work, using different methods, for other countries.

  11. Prostate Health Index (PHI) Predicts High-stage Pathology in African American Men.

    PubMed

    Schwen, Zeyad R; Tosoian, Jeffrey J; Sokoll, Lori J; Mangold, Leslie; Humphreys, Elizabeth; Schaeffer, Edward M; Partin, Alan W; Ross, Ashley E

    2016-04-01

    To evaluate the association between the Prostate Health Index (PHI) and adverse pathology in a cohort of African American (AA) men undergoing radical prostatectomy. Eighty AA men with prostate-specific antigen (PSA) of 2-10 ng/mL underwent measurement of PSA, free PSA (fPSA), and p2PSA prior to radical prostatectomy. PHI was calculated as [(p2PSA/fPSA) × (PSA)(½)]. Biomarker association with pT3 disease was assessed using logistic regression, and covariates were added to a baseline multivariable model including digital rectal examination. Biomarker ability to predict pT3 disease was measured using the area under the receiver operator characteristic curve. Sixteen men (20%) demonstrated pT3 disease on final pathology. Mean age, PSA, and %fPSA were similar in men with and without pT3 disease (all P  >  .05), whereas PHI was significantly greater in men with pT3 disease (mean 57.2 vs 46.6, P  =  .04). Addition of PHI to the baseline multivariable model improved discriminative ability by 12.9% (P  =. .04) and yielded greater diagnostic accuracy than models, including other individual biomarkers. In AA men with PSA of 2-10 ng/mL, PHI was predictive of pT3 prostate cancer and may help to identify men at increased risk of adverse pathology. Additional studies are needed to substantiate these findings and identify appropriate thresholds for clinical use. Copyright © 2016 Elsevier Inc. All rights reserved.

  12. Normal tissue complication probability (NTCP) models for late rectal bleeding, stool frequency and fecal incontinence after radiotherapy in prostate cancer patients.

    PubMed

    Schaake, Wouter; van der Schaaf, Arjen; van Dijk, Lisanne V; Bongaerts, Alfons H H; van den Bergh, Alfons C M; Langendijk, Johannes A

    2016-06-01

    Curative radiotherapy for prostate cancer may lead to anorectal side effects, including rectal bleeding, fecal incontinence, increased stool frequency and rectal pain. The main objective of this study was to develop multivariable NTCP models for these side effects. The study sample was composed of 262 patients with localized or locally advanced prostate cancer (stage T1-3). Anorectal toxicity was prospectively assessed using a standardized follow-up program. Different anatomical subregions within and around the anorectum were delineated. A LASSO logistic regression analysis was used to analyze dose volume effects on toxicity. In the univariable analysis, rectal bleeding, increase in stool frequency and fecal incontinence were significantly associated with a large number of dosimetric parameters. The collinearity between these predictors was high (VIF>5). In the multivariable model, rectal bleeding was associated with the anorectum (V70) and anticoagulant use, fecal incontinence was associated with the external sphincter (V15) and the iliococcygeal muscle (V55). Finally, increase in stool frequency was associated with the iliococcygeal muscle (V45) and the levator ani (V40). No significant associations were found for rectal pain. Different anorectal side effects are associated with different anatomical substructures within and around the anorectum. The dosimetric variables associated with these side effects can be used to optimize radiotherapy treatment planning aiming at prevention of specific side effects and to estimate the benefit of new radiation technologies. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.

  13. Genetic polymorphisms and the risk of stroke after cardiac surgery.

    PubMed

    Grocott, Hilary P; White, William D; Morris, Richard W; Podgoreanu, Mihai V; Mathew, Joseph P; Nielsen, Dahlia M; Schwinn, Debra A; Newman, Mark F

    2005-09-01

    Stroke represents a significant cause of morbidity and mortality after cardiac surgery. Although the risk of stroke varies according to both patient and procedural factors, the impact of genetic variants on stroke risk is not well understood. Therefore, we tested the hypothesis that specific genetic polymorphisms are associated with an increased risk of stroke after cardiac surgery. Patients undergoing cardiac surgery utilizing cardiopulmonary bypass surgery were studied. DNA was isolated from preoperative blood and analyzed for 26 different single-nucleotide polymorphisms. Multivariable logistic regression modeling was used to determine the association of clinical and genetic characteristics with stroke. Permutation analysis was used to adjust for multiple comparisons inherent in genetic association studies. A total of 1635 patients experiencing 28 strokes (1.7%) were included in the final genetic model. The combination of the 2 minor alleles of C-reactive protein (CRP; 3'UTR 1846C/T) and interleukin-6 (IL-6; -174G/C) polymorphisms, occurring in 583 (35.7%) patients, was significantly associated with stroke (odds ratio, 3.3; 95% CI, 1.4 to 8.1; P=0.0023). In a multivariable logistic model adjusting for age, the CRP and IL-6 single-nucleotide polymorphism combination remained significantly associated with stroke (P=0.0020). We demonstrate that common genetic variants of CRP (3'UTR 1846C/T) and IL-6 (-174G/C) are significantly associated with the risk of stroke after cardiac surgery, suggesting a pivotal role of inflammation in post-cardiac surgery stroke.

  14. Diagnosing perforated appendicitis in pediatric patients: a new model.

    PubMed

    van den Bogaard, Veerle A B; Euser, Sjoerd M; van der Ploeg, Tjeerd; de Korte, Niels; Sanders, Dave G M; de Winter, Derek; Vergroesen, Diederik; van Groningen, Krijn; de Winter, Peter

    2016-03-01

    Studies have investigated sensitivity and specificity of symptoms and tests for diagnosing appendicitis in children. Less is known with regard to the predictive value of these symptoms and tests with respect to the severity of appendicitis. The aim of this study was to determine the predictive value of patient's characteristics and tests for discriminating between perforated and nonperforated appendicitis in children. Pediatric patients who underwent an appendectomy at Spaarne Hospital Hoofddorp, the Netherlands, between January 1, 2009 and December 31, 2013, were included. Baseline patient's characteristics, history, physical examination, laboratory data and results of ultrasounds were collected. Univariate and multivariate logistic regressions were used to determine predictors of perforation. In total, 375 patients were included in this study of which 97 children (25.9%) had significant signs of perforation. Univariate analysis showed that age, duration of complaints, temperature, vomiting, CRP, WBC, different findings on ultrasound and the diameter of the appendix were good predictors of a perforated appendicitis. The final multivariate prediction model included temperature, CRP, clearly visible appendix and free fluids on ultrasound and diameter of the appendix and resulted in an area under the curve (AUC) of 0.91 showing sensitivity and specificity of respectively 85.2% and 81.2%. This prediction model can be used for identification of 'high-risk' children for a perforated appendicitis and might be helpful to prevent complications and longer hospitalization by bringing these children to theater earlier. Copyright © 2016 Elsevier Inc. All rights reserved.

  15. Biodemographic Modeling of the Links Between Fertility Motivation and Fertility Outcomes in the NLSY79

    PubMed Central

    MILLER, WARREN B.; BARD, DAVID E.; PASTA, DAVID J.; RODGERS, JOSEPH LEE

    2010-01-01

    In spite of long-held beliefs that traits related to reproductive success tend to become fixed by evolution with little or no genetic variation, there is now considerable evidence that the natural variation of fertility within populations is genetically influenced and that a portion of that influence is related to the motivational precursors to fertility. We conduct a two-stage analysis to examine these inferences in a time-ordered multivariate context. First, using data from the National Longitudinal Survey of Youth, 1979, and LISREL analysis, we develop a structural equation model in which five hypothesized motivational precursors to fertility, measured in 1979–1982, predict both a child-timing and a child-number outcome, measured in 2002. Second, having chosen two time-ordered sequences of six variables from the SEM to represent our phenotypic models, we use Mx to conduct both univariate and multivariate behavioral genetic analyses with the selected variables. Our results indicate that one or more genes acting within a gene network have additive effects that operate through child-number desires to affect both the timing of the next child born and the final number of children born, that one or more genes acting through a separate network may have additive effects operating through gender role attitudes to produce downstream effects on the two fertility outcomes, and that no genetic variance is associated with either child-timing intentions or educational intentions. PMID:20608103

  16. Models and analysis for multivariate failure time data

    NASA Astrophysics Data System (ADS)

    Shih, Joanna Huang

    The goal of this research is to develop and investigate models and analytic methods for multivariate failure time data. We compare models in terms of direct modeling of the margins, flexibility of dependency structure, local vs. global measures of association, and ease of implementation. In particular, we study copula models, and models produced by right neutral cumulative hazard functions and right neutral hazard functions. We examine the changes of association over time for families of bivariate distributions induced from these models by displaying their density contour plots, conditional density plots, correlation curves of Doksum et al, and local cross ratios of Oakes. We know that bivariate distributions with same margins might exhibit quite different dependency structures. In addition to modeling, we study estimation procedures. For copula models, we investigate three estimation procedures. the first procedure is full maximum likelihood. The second procedure is two-stage maximum likelihood. At stage 1, we estimate the parameters in the margins by maximizing the marginal likelihood. At stage 2, we estimate the dependency structure by fixing the margins at the estimated ones. The third procedure is two-stage partially parametric maximum likelihood. It is similar to the second procedure, but we estimate the margins by the Kaplan-Meier estimate. We derive asymptotic properties for these three estimation procedures and compare their efficiency by Monte-Carlo simulations and direct computations. For models produced by right neutral cumulative hazards and right neutral hazards, we derive the likelihood and investigate the properties of the maximum likelihood estimates. Finally, we develop goodness of fit tests for the dependency structure in the copula models. We derive a test statistic and its asymptotic properties based on the test of homogeneity of Zelterman and Chen (1988), and a graphical diagnostic procedure based on the empirical Bayes approach. We study the performance of these two methods using actual and computer generated data.

  17. Can patient comorbidities be included in clinical performance measures for radiation oncology?

    PubMed

    Owen, Jean B; Khalid, Najma; Ho, Alex; Kachnic, Lisa A; Komaki, Ritsuko; Tao, May Lin; Currey, Adam; Wilson, J Frank

    2014-05-01

    Patient comorbidities may affect the applicability of performance measures that are inherent in multidisciplinary cancer treatment guidelines. This article describes the distribution of common comorbid conditions by disease site and by patient and facility characteristics in patients who received radiation therapy as part of treatment for cancer of the breast, cervix, lung, prostate, and stomach, and investigates the association of comorbidities with treatment decisions. Stratified two-stage cluster sampling provided a random sample of radiation oncology facilities. Eligible patients were randomly sampled from each participating facility for each disease site, and data were abstracted from medical records. The Adult Comorbidity Evaluation Index (ACE-27) was used to measure comorbid conditions and their severity. National estimates were calculated using SUDAAN statistical software. Multivariable logistic regression models predicted the dependent variable "treatment changed or contraindicated due to comorbidities." The final model showed that ACE-27 was highly associated with change in treatment for patients with severe or moderate index values compared to those with none or mild (P < .001). Two other covariates, age and medical coverage, had no (age) or little (medical coverage) significant contribution to predicting treatment change in the multivariable model. Disease site was associated with treatment change after adjusting for other covariates in the model. ACE-27 is highly predictive of treatment modifications for patients treated for these cancers who receive radiation as part of their care. A standardized tool identifying patients who should be excluded from clinical performance measures allows more accurate use of these measures. Copyright © 2014 by American Society of Clinical Oncology.

  18. Can Patient Comorbidities Be Included in Clinical Performance Measures for Radiation Oncology?

    PubMed Central

    Owen, Jean B.; Khalid, Najma; Ho, Alex; Kachnic, Lisa A.; Komaki, Ritsuko; Tao, May Lin; Currey, Adam; Wilson, J. Frank

    2014-01-01

    Purpose: Patient comorbidities may affect the applicability of performance measures that are inherent in multidisciplinary cancer treatment guidelines. This article describes the distribution of common comorbid conditions by disease site and by patient and facility characteristics in patients who received radiation therapy as part of treatment for cancer of the breast, cervix, lung, prostate, and stomach, and investigates the association of comorbidities with treatment decisions. Materials and Methods: Stratified two-stage cluster sampling provided a random sample of radiation oncology facilities. Eligible patients were randomly sampled from each participating facility for each disease site, and data were abstracted from medical records. The Adult Comorbidity Evaluation Index (ACE-27) was used to measure comorbid conditions and their severity. National estimates were calculated using SUDAAN statistical software. Results: Multivariable logistic regression models predicted the dependent variable “treatment changed or contraindicated due to comorbidities.” The final model showed that ACE-27 was highly associated with change in treatment for patients with severe or moderate index values compared to those with none or mild (P < .001). Two other covariates, age and medical coverage, had no (age) or little (medical coverage) significant contribution to predicting treatment change in the multivariable model. Disease site was associated with treatment change after adjusting for other covariates in the model. Conclusions: ACE-27 is highly predictive of treatment modifications for patients treated for these cancers who receive radiation as part of their care. A standardized tool identifying patients who should be excluded from clinical performance measures allows more accurate use of these measures. PMID:24643573

  19. Comparing Within-Person Effects from Multivariate Longitudinal Models

    ERIC Educational Resources Information Center

    Bainter, Sierra A.; Howard, Andrea L.

    2016-01-01

    Several multivariate models are motivated to answer similar developmental questions regarding within-person (intraindividual) effects between 2 or more constructs over time, yet the within-person effects tested by each model are distinct. In this article, the authors clarify the types of within-person inferences that can be made from each model.…

  20. Applying the multivariate time-rescaling theorem to neural population models

    PubMed Central

    Gerhard, Felipe; Haslinger, Robert; Pipa, Gordon

    2011-01-01

    Statistical models of neural activity are integral to modern neuroscience. Recently, interest has grown in modeling the spiking activity of populations of simultaneously recorded neurons to study the effects of correlations and functional connectivity on neural information processing. However any statistical model must be validated by an appropriate goodness-of-fit test. Kolmogorov-Smirnov tests based upon the time-rescaling theorem have proven to be useful for evaluating point-process-based statistical models of single-neuron spike trains. Here we discuss the extension of the time-rescaling theorem to the multivariate (neural population) case. We show that even in the presence of strong correlations between spike trains, models which neglect couplings between neurons can be erroneously passed by the univariate time-rescaling test. We present the multivariate version of the time-rescaling theorem, and provide a practical step-by-step procedure for applying it towards testing the sufficiency of neural population models. Using several simple analytically tractable models and also more complex simulated and real data sets, we demonstrate that important features of the population activity can only be detected using the multivariate extension of the test. PMID:21395436

  1. Estimation of daily protein intake based on spot urine urea nitrogen concentration in chronic kidney disease patients.

    PubMed

    Kanno, Hiroko; Kanda, Eiichiro; Sato, Asako; Sakamoto, Kaori; Kanno, Yoshihiko

    2016-04-01

    Determination of daily protein intake in the management of chronic kidney disease (CKD) requires precision. Inaccuracies in recording dietary intake occur, and estimation from total urea excretion presents hurdles owing to the difficulty of collecting whole urine for 24 h. Spot urine has been used for measuring daily sodium intake and urinary protein excretion. In this cross-sectional study, we investigated whether urea nitrogen (UN) concentration in spot urine can be used to predict daily protein intake instead of the 24-h urine collection in 193 Japanese CKD patients (Stages G1-G5). After patient randomization into 2 datasets for the development and validation of models, bootstrapping was used to develop protein intake estimation models. The parameters for the candidate multivariate regression models were male gender, age, body mass index (BMI), diabetes mellitus, dyslipidemia, proteinuria, estimated glomerular filtration rate, serum albumin level, spot urinary UN and creatinine level, and spot urinary UN/creatinine levels. The final model contained BMI and spot urinary UN level. The final model was selected because of the higher correlation between the predicted and measured protein intakes r = 0.558 (95 % confidence interval 0.400, 0.683), and the smaller distribution of the difference between the measured and predicted protein intakes than those of the other models. The results suggest that UN concentration in spot urine may be used to estimate daily protein intake and that a prediction formula would be useful for nutritional control in CKD patients.

  2. Comprehensive drought characteristics analysis based on a nonlinear multivariate drought index

    NASA Astrophysics Data System (ADS)

    Yang, Jie; Chang, Jianxia; Wang, Yimin; Li, Yunyun; Hu, Hui; Chen, Yutong; Huang, Qiang; Yao, Jun

    2018-02-01

    It is vital to identify drought events and to evaluate multivariate drought characteristics based on a composite drought index for better drought risk assessment and sustainable development of water resources. However, most composite drought indices are constructed by the linear combination, principal component analysis and entropy weight method assuming a linear relationship among different drought indices. In this study, the multidimensional copulas function was applied to construct a nonlinear multivariate drought index (NMDI) to solve the complicated and nonlinear relationship due to its dependence structure and flexibility. The NMDI was constructed by combining meteorological, hydrological, and agricultural variables (precipitation, runoff, and soil moisture) to better reflect the multivariate variables simultaneously. Based on the constructed NMDI and runs theory, drought events for a particular area regarding three drought characteristics: duration, peak, and severity were identified. Finally, multivariate drought risk was analyzed as a tool for providing reliable support in drought decision-making. The results indicate that: (1) multidimensional copulas can effectively solve the complicated and nonlinear relationship among multivariate variables; (2) compared with single and other composite drought indices, the NMDI is slightly more sensitive in capturing recorded drought events; and (3) drought risk shows a spatial variation; out of the five partitions studied, the Jing River Basin as well as the upstream and midstream of the Wei River Basin are characterized by a higher multivariate drought risk. In general, multidimensional copulas provides a reliable way to solve the nonlinear relationship when constructing a comprehensive drought index and evaluating multivariate drought characteristics.

  3. Multivariate assessment of event-related potentials with the t-CWT method.

    PubMed

    Bostanov, Vladimir

    2015-11-05

    Event-related brain potentials (ERPs) are usually assessed with univariate statistical tests although they are essentially multivariate objects. Brain-computer interface applications are a notable exception to this practice, because they are based on multivariate classification of single-trial ERPs. Multivariate ERP assessment can be facilitated by feature extraction methods. One such method is t-CWT, a mathematical-statistical algorithm based on the continuous wavelet transform (CWT) and Student's t-test. This article begins with a geometric primer on some basic concepts of multivariate statistics as applied to ERP assessment in general and to the t-CWT method in particular. Further, it presents for the first time a detailed, step-by-step, formal mathematical description of the t-CWT algorithm. A new multivariate outlier rejection procedure based on principal component analysis in the frequency domain is presented as an important pre-processing step. The MATLAB and GNU Octave implementation of t-CWT is also made publicly available for the first time as free and open source code. The method is demonstrated on some example ERP data obtained in a passive oddball paradigm. Finally, some conceptually novel applications of the multivariate approach in general and of the t-CWT method in particular are suggested and discussed. Hopefully, the publication of both the t-CWT source code and its underlying mathematical algorithm along with a didactic geometric introduction to some basic concepts of multivariate statistics would make t-CWT more accessible to both users and developers in the field of neuroscience research.

  4. Improved parameter inference in catchment models: 1. Evaluating parameter uncertainty

    NASA Astrophysics Data System (ADS)

    Kuczera, George

    1983-10-01

    A Bayesian methodology is developed to evaluate parameter uncertainty in catchment models fitted to a hydrologic response such as runoff, the goal being to improve the chance of successful regionalization. The catchment model is posed as a nonlinear regression model with stochastic errors possibly being both autocorrelated and heteroscedastic. The end result of this methodology, which may use Box-Cox power transformations and ARMA error models, is the posterior distribution, which summarizes what is known about the catchment model parameters. This can be simplified to a multivariate normal provided a linearization in parameter space is acceptable; means of checking and improving this assumption are discussed. The posterior standard deviations give a direct measure of parameter uncertainty, and study of the posterior correlation matrix can indicate what kinds of data are required to improve the precision of poorly determined parameters. Finally, a case study involving a nine-parameter catchment model fitted to monthly runoff and soil moisture data is presented. It is shown that use of ordinary least squares when its underlying error assumptions are violated gives an erroneous description of parameter uncertainty.

  5. A Copula-Based Conditional Probabilistic Forecast Model for Wind Power Ramps

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Hodge, Brian S; Krishnan, Venkat K; Zhang, Jie

    Efficient management of wind ramping characteristics can significantly reduce wind integration costs for balancing authorities. By considering the stochastic dependence of wind power ramp (WPR) features, this paper develops a conditional probabilistic wind power ramp forecast (cp-WPRF) model based on Copula theory. The WPRs dataset is constructed by extracting ramps from a large dataset of historical wind power. Each WPR feature (e.g., rate, magnitude, duration, and start-time) is separately forecasted by considering the coupling effects among different ramp features. To accurately model the marginal distributions with a copula, a Gaussian mixture model (GMM) is adopted to characterize the WPR uncertaintymore » and features. The Canonical Maximum Likelihood (CML) method is used to estimate parameters of the multivariable copula. The optimal copula model is chosen based on the Bayesian information criterion (BIC) from each copula family. Finally, the best conditions based cp-WPRF model is determined by predictive interval (PI) based evaluation metrics. Numerical simulations on publicly available wind power data show that the developed copula-based cp-WPRF model can predict WPRs with a high level of reliability and sharpness.« less

  6. The role of modern control theory in the design of controls for aircraft turbine engines

    NASA Technical Reports Server (NTRS)

    Zeller, J.; Lehtinen, B.; Merrill, W.

    1982-01-01

    The development, applications, and current research in modern control theory (MCT) are reviewed, noting the importance for fuel-efficient operation of turbines with variable inlet guide vanes, compressor stators, and exhaust nozzle area. The evolution of multivariable propulsion control design is examined, noting a basis in a matrix formulation of the differential equations defining the process, leading to state space formulations. Reports and papers which appeared from 1970-1982 which dealt with problems in MCT applications to turbine engine control design are outlined, including works on linear quadratic regulator methods, frequency domain methods, identification, estimation, and model reduction, detection, isolation, and accommodation, and state space control, adaptive control, and optimization approaches. Finally, NASA programs in frequency domain design, sensor failure detection, computer-aided control design, and plant modeling are explored

  7. Spatial estimation from remotely sensed data via empirical Bayes models

    NASA Technical Reports Server (NTRS)

    Hill, J. R.; Hinkley, D. V.; Kostal, H.; Morris, C. N.

    1984-01-01

    Multichannel satellite image data, available as LANDSAT imagery, are recorded as a multivariate time series (four channels, multiple passovers) in two spatial dimensions. The application of parametric empirical Bayes theory to classification of, and estimating the probability of, each crop type at each of a large number of pixels is considered. This theory involves both the probability distribution of imagery data, conditional on crop types, and the prior spatial distribution of crop types. For the latter Markov models indexed by estimable parameters are used. A broad outline of the general theory reveals several questions for further research. Some detailed results are given for the special case of two crop types when only a line transect is analyzed. Finally, the estimation of an underlying continuous process on the lattice is discussed which would be applicable to such quantities as crop yield.

  8. Remote-sensing data processing with the multivariate regression analysis method for iron mineral resource potential mapping: a case study in the Sarvian area, central Iran

    NASA Astrophysics Data System (ADS)

    Mansouri, Edris; Feizi, Faranak; Jafari Rad, Alireza; Arian, Mehran

    2018-03-01

    This paper uses multivariate regression to create a mathematical model for iron skarn exploration in the Sarvian area, central Iran, using multivariate regression for mineral prospectivity mapping (MPM). The main target of this paper is to apply multivariate regression analysis (as an MPM method) to map iron outcrops in the northeastern part of the study area in order to discover new iron deposits in other parts of the study area. Two types of multivariate regression models using two linear equations were employed to discover new mineral deposits. This method is one of the reliable methods for processing satellite images. ASTER satellite images (14 bands) were used as unique independent variables (UIVs), and iron outcrops were mapped as dependent variables for MPM. According to the results of the probability value (p value), coefficient of determination value (R2) and adjusted determination coefficient (Radj2), the second regression model (which consistent of multiple UIVs) fitted better than other models. The accuracy of the model was confirmed by iron outcrops map and geological observation. Based on field observation, iron mineralization occurs at the contact of limestone and intrusive rocks (skarn type).

  9. Developing a Hierarchical Model for the Spatial Analysis of PM10 Pollution Extremes in the Mexico City Metropolitan Area.

    PubMed

    Aguirre-Salado, Alejandro Ivan; Vaquera-Huerta, Humberto; Aguirre-Salado, Carlos Arturo; Reyes-Mora, Silvia; Olvera-Cervantes, Ana Delia; Lancho-Romero, Guillermo Arturo; Soubervielle-Montalvo, Carlos

    2017-07-06

    We implemented a spatial model for analysing PM 10 maxima across the Mexico City metropolitan area during the period 1995-2016. We assumed that these maxima follow a non-identical generalized extreme value (GEV) distribution and modeled the trend by introducing multivariate smoothing spline functions into the probability GEV distribution. A flexible, three-stage hierarchical Bayesian approach was developed to analyse the distribution of the PM 10 maxima in space and time. We evaluated the statistical model's performance by using a simulation study. The results showed strong evidence of a positive correlation between the PM 10 maxima and the longitude and latitude. The relationship between time and the PM 10 maxima was negative, indicating a decreasing trend over time. Finally, a high risk of PM 10 maxima presenting levels above 1000 μ g/m 3 (return period: 25 yr) was observed in the northwestern region of the study area.

  10. Meta-Analytic Structural Equation Modeling (MASEM): Comparison of the Multivariate Methods

    ERIC Educational Resources Information Center

    Zhang, Ying

    2011-01-01

    Meta-analytic Structural Equation Modeling (MASEM) has drawn interest from many researchers recently. In doing MASEM, researchers usually first synthesize correlation matrices across studies using meta-analysis techniques and then analyze the pooled correlation matrix using structural equation modeling techniques. Several multivariate methods of…

  11. MULTIVARIATE RECEPTOR MODELS-CURRENT PRACTICE AND FUTURE TRENDS. (R826238)

    EPA Science Inventory

    Multivariate receptor models have been applied to the analysis of air quality data for sometime. However, solving the general mixture problem is important in several other fields. This paper looks at the panoply of these models with a view of identifying common challenges and ...

  12. Comparison of connectivity analyses for resting state EEG data

    NASA Astrophysics Data System (ADS)

    Olejarczyk, Elzbieta; Marzetti, Laura; Pizzella, Vittorio; Zappasodi, Filippo

    2017-06-01

    Objective. In the present work, a nonlinear measure (transfer entropy, TE) was used in a multivariate approach for the analysis of effective connectivity in high density resting state EEG data in eyes open and eyes closed. Advantages of the multivariate approach in comparison to the bivariate one were tested. Moreover, the multivariate TE was compared to an effective linear measure, i.e. directed transfer function (DTF). Finally, the existence of a relationship between the information transfer and the level of brain synchronization as measured by phase synchronization value (PLV) was investigated. Approach. The comparison between the connectivity measures, i.e. bivariate versus multivariate TE, TE versus DTF, TE versus PLV, was performed by means of statistical analysis of indexes based on graph theory. Main results. The multivariate approach is less sensitive to false indirect connections with respect to the bivariate estimates. The multivariate TE differentiated better between eyes closed and eyes open conditions compared to DTF. Moreover, the multivariate TE evidenced non-linear phenomena in information transfer, which are not evidenced by the use of DTF. We also showed that the target of information flow, in particular the frontal region, is an area of greater brain synchronization. Significance. Comparison of different connectivity analysis methods pointed to the advantages of nonlinear methods, and indicated a relationship existing between the flow of information and the level of synchronization of the brain.

  13. Detecting event-related changes of multivariate phase coupling in dynamic brain networks.

    PubMed

    Canolty, Ryan T; Cadieu, Charles F; Koepsell, Kilian; Ganguly, Karunesh; Knight, Robert T; Carmena, Jose M

    2012-04-01

    Oscillatory phase coupling within large-scale brain networks is a topic of increasing interest within systems, cognitive, and theoretical neuroscience. Evidence shows that brain rhythms play a role in controlling neuronal excitability and response modulation (Haider B, McCormick D. Neuron 62: 171-189, 2009) and regulate the efficacy of communication between cortical regions (Fries P. Trends Cogn Sci 9: 474-480, 2005) and distinct spatiotemporal scales (Canolty RT, Knight RT. Trends Cogn Sci 14: 506-515, 2010). In this view, anatomically connected brain areas form the scaffolding upon which neuronal oscillations rapidly create and dissolve transient functional networks (Lakatos P, Karmos G, Mehta A, Ulbert I, Schroeder C. Science 320: 110-113, 2008). Importantly, testing these hypotheses requires methods designed to accurately reflect dynamic changes in multivariate phase coupling within brain networks. Unfortunately, phase coupling between neurophysiological signals is commonly investigated using suboptimal techniques. Here we describe how a recently developed probabilistic model, phase coupling estimation (PCE; Cadieu C, Koepsell K Neural Comput 44: 3107-3126, 2010), can be used to investigate changes in multivariate phase coupling, and we detail the advantages of this model over the commonly employed phase-locking value (PLV; Lachaux JP, Rodriguez E, Martinerie J, Varela F. Human Brain Map 8: 194-208, 1999). We show that the N-dimensional PCE is a natural generalization of the inherently bivariate PLV. Using simulations, we show that PCE accurately captures both direct and indirect (network mediated) coupling between network elements in situations where PLV produces erroneous results. We present empirical results on recordings from humans and nonhuman primates and show that the PCE-estimated coupling values are different from those using the bivariate PLV. Critically on these empirical recordings, PCE output tends to be sparser than the PLVs, indicating fewer significant interactions and perhaps a more parsimonious description of the data. Finally, the physical interpretation of PCE parameters is straightforward: the PCE parameters correspond to interaction terms in a network of coupled oscillators. Forward modeling of a network of coupled oscillators with parameters estimated by PCE generates synthetic data with statistical characteristics identical to empirical signals. Given these advantages over the PLV, PCE is a useful tool for investigating multivariate phase coupling in distributed brain networks.

  14. Regression Models For Multivariate Count Data

    PubMed Central

    Zhang, Yiwen; Zhou, Hua; Zhou, Jin; Sun, Wei

    2016-01-01

    Data with multivariate count responses frequently occur in modern applications. The commonly used multinomial-logit model is limiting due to its restrictive mean-variance structure. For instance, analyzing count data from the recent RNA-seq technology by the multinomial-logit model leads to serious errors in hypothesis testing. The ubiquity of over-dispersion and complicated correlation structures among multivariate counts calls for more flexible regression models. In this article, we study some generalized linear models that incorporate various correlation structures among the counts. Current literature lacks a treatment of these models, partly due to the fact that they do not belong to the natural exponential family. We study the estimation, testing, and variable selection for these models in a unifying framework. The regression models are compared on both synthetic and real RNA-seq data. PMID:28348500

  15. Regression Models For Multivariate Count Data.

    PubMed

    Zhang, Yiwen; Zhou, Hua; Zhou, Jin; Sun, Wei

    2017-01-01

    Data with multivariate count responses frequently occur in modern applications. The commonly used multinomial-logit model is limiting due to its restrictive mean-variance structure. For instance, analyzing count data from the recent RNA-seq technology by the multinomial-logit model leads to serious errors in hypothesis testing. The ubiquity of over-dispersion and complicated correlation structures among multivariate counts calls for more flexible regression models. In this article, we study some generalized linear models that incorporate various correlation structures among the counts. Current literature lacks a treatment of these models, partly due to the fact that they do not belong to the natural exponential family. We study the estimation, testing, and variable selection for these models in a unifying framework. The regression models are compared on both synthetic and real RNA-seq data.

  16. Pretreatment Nomogram to Predict the Risk of Acute Urinary Retention After I-125 Prostate Brachytherapy

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Roeloffzen, Ellen M., E-mail: e.m.a.roeloffzen@umcutrecht.nl; Vulpen, Marco van; Battermann, Jan J.

    Purpose: Acute urinary retention (AUR) after iodine-125 (I-125) prostate brachytherapy negatively influences long-term quality of life and therefore should be prevented. We aimed to develop a nomogram to preoperatively predict the risk of AUR. Methods: Using the preoperative data of 714 consecutive patients who underwent I-125 prostate brachytherapy between 2005 and 2008 at our department, we modeled the probability of AUR. Multivariate logistic regression analysis was used to assess the predictive ability of a set of pretreatment predictors and the additional value of a new risk factor (the extent of prostate protrusion into the bladder). The performance of the finalmore » model was assessed with calibration and discrimination measures. Results: Of the 714 patients, 57 patients (8.0%) developed AUR after implantation. Multivariate analysis showed that the combination of prostate volume, IPSS score, neoadjuvant hormonal treatment and the extent of prostate protrusion contribute to the prediction of AUR. The discriminative value (receiver operator characteristic area, ROC) of the basic model (including prostate volume, International Prostate Symptom Score, and neoadjuvant hormonal treatment) to predict the development of AUR was 0.70. The addition of prostate protrusion significantly increased the discriminative power of the model (ROC 0.82). Calibration of this final model was good. The nomogram showed that among patients with a low sum score (<18 points), the risk of AUR was only 0%-5%. However, in patients with a high sum score (>35 points), the risk of AUR was more than 20%. Conclusion: This nomogram is a useful tool for physicians to predict the risk of AUR after I-125 prostate brachytherapy. The nomogram can aid in individualized treatment decision-making and patient counseling.« less

  17. Development and validation of a Partial Least Squares-Discriminant Analysis (PLS-DA) model based on the determination of ethyl glucuronide (EtG) and fatty acid ethyl esters (FAEEs) in hair for the diagnosis of chronic alcohol abuse.

    PubMed

    Alladio, E; Giacomelli, L; Biosa, G; Corcia, D Di; Gerace, E; Salomone, A; Vincenti, M

    2018-01-01

    The chronic intake of an excessive amount of alcohol is currently ascertained by determining the concentration of direct alcohol metabolites in the hair samples of the alleged abusers, including ethyl glucuronide (EtG) and, less frequently, fatty acid ethyl esters (FAEEs). Indirect blood biomarkers of alcohol abuse are still determined to support hair EtG results and diagnose a consequent liver impairment. In the present study, the supporting role of hair FAEEs is compared with indirect blood biomarkers with respect to the contexts in which hair EtG interpretation is uncertain. Receiver Operating Characteristics (ROC) curves and multivariate Principal Component Analysis (PCA) demonstrated much stronger correlation of EtG results with FAEEs than with any single indirect biomarker or their combinations. Partial Least Squares Discriminant Analysis (PLS-DA) models based on hair EtG and FAEEs were developed to maximize the biomarkers information content on a multivariate background. The final PLS-DA model yielded 100% correct classification on a training/evaluation dataset of 155 subjects, including both chronic alcohol abusers and social drinkers. Then, the PLS-DA model was validated on an external dataset of 81 individual providing optimal discrimination ability between chronic alcohol abusers and social drinkers, in terms of specificity and sensitivity. The PLS-DA scores obtained for each subject, with respect to the PLS-DA model threshold that separates the probabilistic distributions for the two classes, furnished a likelihood ratio value, which in turn conveys the strength of the experimental data support to the classification decision, within a Bayesian logic. Typical boundary real cases from daily work are discussed, too. Copyright © 2017 Elsevier B.V. All rights reserved.

  18. Estimation of soil cation exchange capacity using Genetic Expression Programming (GEP) and Multivariate Adaptive Regression Splines (MARS)

    NASA Astrophysics Data System (ADS)

    Emamgolizadeh, S.; Bateni, S. M.; Shahsavani, D.; Ashrafi, T.; Ghorbani, H.

    2015-10-01

    The soil cation exchange capacity (CEC) is one of the main soil chemical properties, which is required in various fields such as environmental and agricultural engineering as well as soil science. In situ measurement of CEC is time consuming and costly. Hence, numerous studies have used traditional regression-based techniques to estimate CEC from more easily measurable soil parameters (e.g., soil texture, organic matter (OM), and pH). However, these models may not be able to adequately capture the complex and highly nonlinear relationship between CEC and its influential soil variables. In this study, Genetic Expression Programming (GEP) and Multivariate Adaptive Regression Splines (MARS) were employed to estimate CEC from more readily measurable soil physical and chemical variables (e.g., OM, clay, and pH) by developing functional relations. The GEP- and MARS-based functional relations were tested at two field sites in Iran. Results showed that GEP and MARS can provide reliable estimates of CEC. Also, it was found that the MARS model (with root-mean-square-error (RMSE) of 0.318 Cmol+ kg-1 and correlation coefficient (R2) of 0.864) generated slightly better results than the GEP model (with RMSE of 0.270 Cmol+ kg-1 and R2 of 0.807). The performance of GEP and MARS models was compared with two existing approaches, namely artificial neural network (ANN) and multiple linear regression (MLR). The comparison indicated that MARS and GEP outperformed the MLP model, but they did not perform as good as ANN. Finally, a sensitivity analysis was conducted to determine the most and the least influential variables affecting CEC. It was found that OM and pH have the most and least significant effect on CEC, respectively.

  19. Arm structure in normal spiral galaxies, 1: Multivariate data for 492 galaxies

    NASA Technical Reports Server (NTRS)

    Magri, Christopher

    1994-01-01

    Multivariate data have been collected as part of an effort to develop a new classification system for spiral galaxies, one which is not necessarily based on subjective morphological properties. A sample of 492 moderately bright northern Sa and Sc spirals was chosen for future statistical analysis. New observations were made at 20 and 21 cm; the latter data are described in detail here. Infrared Astronomy Satellite (IRAS) fluxes were obtained from archival data. Finally, new estimates of arm pattern radomness and of local environmental harshness were compiled for most sample objects.

  20. Surgical Responses of Medial Rectus Muscle Recession in Thyroid Eye Disease-Related Esotropia

    PubMed Central

    Lyu, In Jeong; Lee, Ju-Yeun; Kong, Mingui; Park, Kyung-Ah; Oh, Sei Yeul

    2016-01-01

    We evaluate the surgical outcomes and surgical responses of medial rectus muscle (MR) recession patients with thyroid eye disease (TED)-related esotropia (ET). The surgical dose-response curves 1 week postoperatively and at the final visit were analyzed. Univariable and multivariable linear regression analyses were applied to investigate factors influencing surgical dose-response. A total of 43 patients with TED-related ET that underwent MR recession were included. The final success rate was 86.0% and the rate of undercorrection was 14.0%. The surgical dose-response curves of TED-related ET showed a gentle slope compared with those of standard surgical tables. In the univariable model, simultaneous vertical rectus muscle recession was the only significant factor influencing surgical dose-response of MR recession in TED-related ET (β = -0.397, P = 0.044). In a model adjusted for age, sex, type of surgery, and preoperative horizontal angle of deviation, simultaneous vertical rectus muscle recession showed marginal significance (β = -0.389, P = 0.064). The surgical dose-response curve of TED-related ET was unique. Simultaneous vertical rectus muscle recession was associated with increased surgical dose-response in TED-related ET. PMID:26796354

  1. A "Model" Multivariable Calculus Course.

    ERIC Educational Resources Information Center

    Beckmann, Charlene E.; Schlicker, Steven J.

    1999-01-01

    Describes a rich, investigative approach to multivariable calculus. Introduces a project in which students construct physical models of surfaces that represent real-life applications of their choice. The models, along with student-selected datasets, serve as vehicles to study most of the concepts of the course from both continuous and discrete…

  2. Bayesian Estimation of Multivariate Latent Regression Models: Gauss versus Laplace

    ERIC Educational Resources Information Center

    Culpepper, Steven Andrew; Park, Trevor

    2017-01-01

    A latent multivariate regression model is developed that employs a generalized asymmetric Laplace (GAL) prior distribution for regression coefficients. The model is designed for high-dimensional applications where an approximate sparsity condition is satisfied, such that many regression coefficients are near zero after accounting for all the model…

  3. A Sandwich-Type Standard Error Estimator of SEM Models with Multivariate Time Series

    ERIC Educational Resources Information Center

    Zhang, Guangjian; Chow, Sy-Miin; Ong, Anthony D.

    2011-01-01

    Structural equation models are increasingly used as a modeling tool for multivariate time series data in the social and behavioral sciences. Standard error estimators of SEM models, originally developed for independent data, require modifications to accommodate the fact that time series data are inherently dependent. In this article, we extend a…

  4. Multivariate Autoregressive Modeling and Granger Causality Analysis of Multiple Spike Trains

    PubMed Central

    Krumin, Michael; Shoham, Shy

    2010-01-01

    Recent years have seen the emergence of microelectrode arrays and optical methods allowing simultaneous recording of spiking activity from populations of neurons in various parts of the nervous system. The analysis of multiple neural spike train data could benefit significantly from existing methods for multivariate time-series analysis which have proven to be very powerful in the modeling and analysis of continuous neural signals like EEG signals. However, those methods have not generally been well adapted to point processes. Here, we use our recent results on correlation distortions in multivariate Linear-Nonlinear-Poisson spiking neuron models to derive generalized Yule-Walker-type equations for fitting ‘‘hidden” Multivariate Autoregressive models. We use this new framework to perform Granger causality analysis in order to extract the directed information flow pattern in networks of simulated spiking neurons. We discuss the relative merits and limitations of the new method. PMID:20454705

  5. A joint modeling and estimation method for multivariate longitudinal data with mixed types of responses to analyze physical activity data generated by accelerometers.

    PubMed

    Li, Haocheng; Zhang, Yukun; Carroll, Raymond J; Keadle, Sarah Kozey; Sampson, Joshua N; Matthews, Charles E

    2017-11-10

    A mixed effect model is proposed to jointly analyze multivariate longitudinal data with continuous, proportion, count, and binary responses. The association of the variables is modeled through the correlation of random effects. We use a quasi-likelihood type approximation for nonlinear variables and transform the proposed model into a multivariate linear mixed model framework for estimation and inference. Via an extension to the EM approach, an efficient algorithm is developed to fit the model. The method is applied to physical activity data, which uses a wearable accelerometer device to measure daily movement and energy expenditure information. Our approach is also evaluated by a simulation study. Copyright © 2017 John Wiley & Sons, Ltd.

  6. A multivariate model of plant species richness in forested systems: Old-growth montane forests with a long history of fire

    USGS Publications Warehouse

    Laughlin, D.C.; Grace, J.B.

    2006-01-01

    Recently, efforts to develop multivariate models of plant species richness have been extended to include systems where trees play important roles as overstory elements mediating the influences of environment and disturbance on understory richness. We used structural equation modeling to examine the relationship of understory vascular plant species richness to understory abundance, forest structure, topographic slope, and surface fire history in lower montane forests on the North Rim of Grand Canyon National Park, USA based on data from eighty-two 0.1 ha plots. The questions of primary interest in this analysis were: (1) to what degree are influences of trees on understory richness mediated by effects on understory abundance? (2) To what degree are influences of fire history on richness mediated by effects on trees and/or understory abundance? (3) Can the influences of fire history on this system be related simply to time-since-fire or are there unique influences associated with long-term fire frequency? The results we obtained are consistent with the following inferences. First, it appears that pine trees had a strong inhibitory effect on the abundance of understory plants, which in turn led to lower understory species richness. Second, richness declined over time since the last fire. This pattern appears to result from several processes, including (1) a post-fire stimulation of germination, (2) a decline in understory abundance, and (3) an increase over time in pine abundance (which indirectly leads to reduced richness). Finally, once time-since-fire was statistically controlled, it was seen that areas with higher fire frequency have lower richness than expected, which appears to result from negative effects on understory abundance, possibly by depletions of soil nutrients from repeated surface fire. Overall, it appears that at large temporal and spatial scales, surface fire plays an important and complex role in structuring understory plant communities in old-growth montane forests. These results show how multivariate models of herbaceous richness can be expanded to apply to forested systems. Copyright ?? Oikos 2006.

  7. Load compensation in a lean burn natural gas vehicle

    NASA Astrophysics Data System (ADS)

    Gangopadhyay, Anupam

    A new multivariable PI tuning technique is developed in this research that is primarily developed for regulation purposes. Design guidelines are developed based on closed-loop stability. The new multivariable design is applied in a natural gas vehicle to combine idle and A/F ratio control loops. This results in better recovery during low idle operation of a vehicle under external step torques. A powertrain model of a natural gas engine is developed and validated for steady-state and transient operation. The nonlinear model has three states: engine speed, intake manifold pressure and fuel fraction in the intake manifold. The model includes the effect of fuel partial pressure in the intake manifold filling and emptying dynamics. Due to the inclusion of fuel fraction as a state, fuel flow rate into the cylinders is also accurately modeled. A linear system identification is performed on the nonlinear model. The linear model structure is predicted analytically from the nonlinear model and the coefficients of the predicted transfer function are shown to be functions of key physical parameters in the plant. Simulations of linear system and model parameter identification is shown to converge to the predicted values of the model coefficients. The multivariable controller developed in this research could be designed in an algebraic fashion once the plant model is known. It is thus possible to implement the multivariable PI design in an adaptive fashion combining the controller with identified plant model on-line. This will result in a self-tuning regulator (STR) type controller where the underlying design criteria is the multivariable tuning technique designed in this research.

  8. GPU-powered Shotgun Stochastic Search for Dirichlet process mixtures of Gaussian Graphical Models

    PubMed Central

    Mukherjee, Chiranjit; Rodriguez, Abel

    2016-01-01

    Gaussian graphical models are popular for modeling high-dimensional multivariate data with sparse conditional dependencies. A mixture of Gaussian graphical models extends this model to the more realistic scenario where observations come from a heterogenous population composed of a small number of homogeneous sub-groups. In this paper we present a novel stochastic search algorithm for finding the posterior mode of high-dimensional Dirichlet process mixtures of decomposable Gaussian graphical models. Further, we investigate how to harness the massive thread-parallelization capabilities of graphical processing units to accelerate computation. The computational advantages of our algorithms are demonstrated with various simulated data examples in which we compare our stochastic search with a Markov chain Monte Carlo algorithm in moderate dimensional data examples. These experiments show that our stochastic search largely outperforms the Markov chain Monte Carlo algorithm in terms of computing-times and in terms of the quality of the posterior mode discovered. Finally, we analyze a gene expression dataset in which Markov chain Monte Carlo algorithms are too slow to be practically useful. PMID:28626348

  9. GPU-powered Shotgun Stochastic Search for Dirichlet process mixtures of Gaussian Graphical Models.

    PubMed

    Mukherjee, Chiranjit; Rodriguez, Abel

    2016-01-01

    Gaussian graphical models are popular for modeling high-dimensional multivariate data with sparse conditional dependencies. A mixture of Gaussian graphical models extends this model to the more realistic scenario where observations come from a heterogenous population composed of a small number of homogeneous sub-groups. In this paper we present a novel stochastic search algorithm for finding the posterior mode of high-dimensional Dirichlet process mixtures of decomposable Gaussian graphical models. Further, we investigate how to harness the massive thread-parallelization capabilities of graphical processing units to accelerate computation. The computational advantages of our algorithms are demonstrated with various simulated data examples in which we compare our stochastic search with a Markov chain Monte Carlo algorithm in moderate dimensional data examples. These experiments show that our stochastic search largely outperforms the Markov chain Monte Carlo algorithm in terms of computing-times and in terms of the quality of the posterior mode discovered. Finally, we analyze a gene expression dataset in which Markov chain Monte Carlo algorithms are too slow to be practically useful.

  10. Variable selection based on clustering analysis for improvement of polyphenols prediction in green tea using synchronous fluorescence spectra.

    PubMed

    Shan, Jiajia; Wang, Xue; Zhou, Hao; Han, Shuqing; Riza, Dimas Firmanda Al; Kondo, Naoshi

    2018-03-13

    Synchronous fluorescence spectra, combined with multivariate analysis were used to predict flavonoids content in green tea rapidly and nondestructively. This paper presented a new and efficient spectral intervals selection method called clustering based partial least square (CL-PLS), which selected informative wavelengths by combining clustering concept and partial least square (PLS) methods to improve models' performance by synchronous fluorescence spectra. The fluorescence spectra of tea samples were obtained and k-means and kohonen-self organizing map clustering algorithms were carried out to cluster full spectra into several clusters, and sub-PLS regression model was developed on each cluster. Finally, CL-PLS models consisting of gradually selected clusters were built. Correlation coefficient (R) was used to evaluate the effect on prediction performance of PLS models. In addition, variable influence on projection partial least square (VIP-PLS), selectivity ratio partial least square (SR-PLS), interval partial least square (iPLS) models and full spectra PLS model were investigated and the results were compared. The results showed that CL-PLS presented the best result for flavonoids prediction using synchronous fluorescence spectra.

  11. Multilevel covariance regression with correlated random effects in the mean and variance structure.

    PubMed

    Quintero, Adrian; Lesaffre, Emmanuel

    2017-09-01

    Multivariate regression methods generally assume a constant covariance matrix for the observations. In case a heteroscedastic model is needed, the parametric and nonparametric covariance regression approaches can be restrictive in the literature. We propose a multilevel regression model for the mean and covariance structure, including random intercepts in both components and allowing for correlation between them. The implied conditional covariance function can be different across clusters as a result of the random effect in the variance structure. In addition, allowing for correlation between the random intercepts in the mean and covariance makes the model convenient for skewedly distributed responses. Furthermore, it permits us to analyse directly the relation between the mean response level and the variability in each cluster. Parameter estimation is carried out via Gibbs sampling. We compare the performance of our model to other covariance modelling approaches in a simulation study. Finally, the proposed model is applied to the RN4CAST dataset to identify the variables that impact burnout of nurses in Belgium. © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  12. Practical robustness measures in multivariable control system analysis. Ph.D. Thesis

    NASA Technical Reports Server (NTRS)

    Lehtomaki, N. A.

    1981-01-01

    The robustness of the stability of multivariable linear time invariant feedback control systems with respect to model uncertainty is considered using frequency domain criteria. Available robustness tests are unified under a common framework based on the nature and structure of model errors. These results are derived using a multivariable version of Nyquist's stability theorem in which the minimum singular value of the return difference transfer matrix is shown to be the multivariable generalization of the distance to the critical point on a single input, single output Nyquist diagram. Using the return difference transfer matrix, a very general robustness theorem is presented from which all of the robustness tests dealing with specific model errors may be derived. The robustness tests that explicitly utilized model error structure are able to guarantee feedback system stability in the face of model errors of larger magnitude than those robustness tests that do not. The robustness of linear quadratic Gaussian control systems are analyzed.

  13. A matrix-based method of moments for fitting the multivariate random effects model for meta-analysis and meta-regression

    PubMed Central

    Jackson, Dan; White, Ian R; Riley, Richard D

    2013-01-01

    Multivariate meta-analysis is becoming more commonly used. Methods for fitting the multivariate random effects model include maximum likelihood, restricted maximum likelihood, Bayesian estimation and multivariate generalisations of the standard univariate method of moments. Here, we provide a new multivariate method of moments for estimating the between-study covariance matrix with the properties that (1) it allows for either complete or incomplete outcomes and (2) it allows for covariates through meta-regression. Further, for complete data, it is invariant to linear transformations. Our method reduces to the usual univariate method of moments, proposed by DerSimonian and Laird, in a single dimension. We illustrate our method and compare it with some of the alternatives using a simulation study and a real example. PMID:23401213

  14. Application of multivariate statistical techniques in microbial ecology

    PubMed Central

    Paliy, O.; Shankar, V.

    2016-01-01

    Recent advances in high-throughput methods of molecular analyses have led to an explosion of studies generating large scale ecological datasets. Especially noticeable effect has been attained in the field of microbial ecology, where new experimental approaches provided in-depth assessments of the composition, functions, and dynamic changes of complex microbial communities. Because even a single high-throughput experiment produces large amounts of data, powerful statistical techniques of multivariate analysis are well suited to analyze and interpret these datasets. Many different multivariate techniques are available, and often it is not clear which method should be applied to a particular dataset. In this review we describe and compare the most widely used multivariate statistical techniques including exploratory, interpretive, and discriminatory procedures. We consider several important limitations and assumptions of these methods, and we present examples of how these approaches have been utilized in recent studies to provide insight into the ecology of the microbial world. Finally, we offer suggestions for the selection of appropriate methods based on the research question and dataset structure. PMID:26786791

  15. Early warning indicators for first-line virologic failure independent of adherence measures in a South African urban clinic.

    PubMed

    Marconi, Vincent C; Wu, Baohua; Hampton, Jane; Ordóñez, Claudia E; Johnson, Brent A; Singh, Dinesh; John, Sally; Gordon, Michelle; Hare, Anna; Murphy, Richard; Nachega, Jean; Kuritzkes, Daniel R; del Rio, Carlos; Sunpath, Henry

    2013-12-01

    We sought to develop individual-level Early Warning Indicators (EWI) of virologic failure (VF) for clinicians to use during routine care complementing WHO population-level EWI. A case-control study was conducted at a Durban clinic. Patients after ≥ 5 months of first-line antiretroviral therapy (ART) were defined as cases if they had VF [HIV-1 viral load (VL)>1000 copies/mL] and controls (2:1) if they had VL ≤ 1000 copies/mL. Pharmacy refills and pill counts were used as adherence measures. Participants responded to a questionnaire including validated psychosocial and symptom scales. Data were also collected from the medical record. Multivariable logistic regression models of VF included factors associated with VF (p<0.05) in univariable analyses. We enrolled 158 cases and 300 controls. In the final multivariable model, male gender, not having an active religious faith, practicing unsafe sex, having a family member with HIV, not being pleased with the clinic experience, symptoms of depression, fatigue, or rash, low CD4 counts, family recommending HIV care, and using a TV/radio as ART reminders (compared to mobile phones) were associated with VF independent of adherence measures. In this setting, we identified several key individual-level EWI associated with VF including novel psychosocial factors independent of adherence measures.

  16. Cerebellar contribution to motor and cognitive performance in multiple sclerosis: An MRI sub-regional volumetric analysis.

    PubMed

    D'Ambrosio, Alessandro; Pagani, Elisabetta; Riccitelli, Gianna C; Colombo, Bruno; Rodegher, Mariaemma; Falini, Andrea; Comi, Giancarlo; Filippi, Massimo; Rocca, Maria A

    2017-08-01

    To investigate the role of cerebellar sub-regions on motor and cognitive performance in multiple sclerosis (MS) patients. Whole and sub-regional cerebellar volumes, brain volumes, T2 hyperintense lesion volumes (LV), and motor performance scores were obtained from 95 relapse-onset MS patients and 32 healthy controls (HC). MS patients also underwent an evaluation of working memory and processing speed functions. Cerebellar anterior and posterior lobes were segmented using the Spatially Unbiased Infratentorial Toolbox (SUIT) from Statistical Parametric Mapping (SPM12). Multivariate linear regression models assessed the relationship between magnetic resonance imaging (MRI) measures and motor/cognitive scores. Compared to HC, only secondary progressive multiple sclerosis (SPMS) patients had lower cerebellar volumes (total and posterior cerebellum). In MS patients, lower anterior cerebellar volume and brain T2 LV predicted worse motor performance, whereas lower posterior cerebellar volume and brain T2 LV predicted poor cognitive performance. Global measures of brain volume and infratentorial T2 LV were not selected by the final multivariate models. Cerebellar volumetric abnormalities are likely to play an important contribution to explain motor and cognitive performance in MS patients. Consistently with functional mapping studies, cerebellar posterior-inferior volume accounted for variance in cognitive measures, whereas anterior cerebellar volume accounted for variance in motor performance, supporting the assessment of cerebellar damage at sub-regional level.

  17. Quality control for quantitative PCR based on amplification compatibility test.

    PubMed

    Tichopad, Ales; Bar, Tzachi; Pecen, Ladislav; Kitchen, Robert R; Kubista, Mikael; Pfaffl, Michael W

    2010-04-01

    Quantitative qPCR is a routinely used method for the accurate quantification of nucleic acids. Yet it may generate erroneous results if the amplification process is obscured by inhibition or generation of aberrant side-products such as primer dimers. Several methods have been established to control for pre-processing performance that rely on the introduction of a co-amplified reference sequence, however there is currently no method to allow for reliable control of the amplification process without directly modifying the sample mix. Herein we present a statistical approach based on multivariate analysis of the amplification response data generated in real-time. The amplification trajectory in its most resolved and dynamic phase is fitted with a suitable model. Two parameters of this model, related to amplification efficiency, are then used for calculation of the Z-score statistics. Each studied sample is compared to a predefined reference set of reactions, typically calibration reactions. A probabilistic decision for each individual Z-score is then used to identify the majority of inhibited reactions in our experiments. We compare this approach to univariate methods using only the sample specific amplification efficiency as reporter of the compatibility. We demonstrate improved identification performance using the multivariate approach compared to the univariate approach. Finally we stress that the performance of the amplification compatibility test as a quality control procedure depends on the quality of the reference set. Copyright 2010 Elsevier Inc. All rights reserved.

  18. Geochemical processes controlling water salinization in an irrigated basin in Spain: identification of natural and anthropogenic influence.

    PubMed

    Merchán, D; Auqué, L F; Acero, P; Gimeno, M J; Causapé, J

    2015-01-01

    Salinization of water bodies represents a significant risk in water systems. The salinization of waters in a small irrigated hydrological basin is studied herein through an integrated hydrogeochemical study including multivariate statistical analyses and geochemical modeling. The study zone has two well differentiated geologic materials: (i) Quaternary sediments of low salinity and high permeability and (ii) Tertiary sediments of high salinity and very low permeability. In this work, soil samples were collected and leaching experiments conducted on them in the laboratory. In addition, water samples were collected from precipitation, irrigation, groundwater, spring and surface waters. The waters show an increase in salinity from precipitation and irrigation water to ground- and, finally, surface water. The enrichment in salinity is related to the dissolution of soluble mineral present mainly in the Tertiary materials. Cation exchange, precipitation of calcite and, probably, incongruent dissolution of dolomite, have been inferred from the hydrochemical data set. Multivariate statistical analysis provided information about the structure of the data, differentiating the group of surface waters from the groundwaters and the salinization from the nitrate pollution processes. The available information was included in geochemical models in which hypothesis of consistency and thermodynamic feasibility were checked. The assessment of the collected information pointed to a natural control on salinization processes in the Lerma Basin with minimal influence of anthropogenic factors. Copyright © 2014 Elsevier B.V. All rights reserved.

  19. Describing the Elephant: Structure and Function in Multivariate Data.

    ERIC Educational Resources Information Center

    McDonald, Roderick P.

    1986-01-01

    There is a unity underlying the diversity of models for the analysis of multivariate data. Essentially, they constitute a family of models, most generally nonlinear, for structural/functional relations between variables drawn from a behavior domain. (Author)

  20. Fermentanomics: Relating quality attributes of a monoclonal antibody to cell culture process variables and raw materials using multivariate data analysis.

    PubMed

    Rathore, Anurag S; Kumar Singh, Sumit; Pathak, Mili; Read, Erik K; Brorson, Kurt A; Agarabi, Cyrus D; Khan, Mansoor

    2015-01-01

    Fermentanomics is an emerging field of research and involves understanding the underlying controlled process variables and their effect on process yield and product quality. Although major advancements have occurred in process analytics over the past two decades, accurate real-time measurement of significant quality attributes for a biotech product during production culture is still not feasible. Researchers have used an amalgam of process models and analytical measurements for monitoring and process control during production. This article focuses on using multivariate data analysis as a tool for monitoring the internal bioreactor dynamics, the metabolic state of the cell, and interactions among them during culture. Quality attributes of the monoclonal antibody product that were monitored include glycosylation profile of the final product along with process attributes, such as viable cell density and level of antibody expression. These were related to process variables, raw materials components of the chemically defined hybridoma media, concentration of metabolites formed during the course of the culture, aeration-related parameters, and supplemented raw materials such as glucose, methionine, threonine, tryptophan, and tyrosine. This article demonstrates the utility of multivariate data analysis for correlating the product quality attributes (especially glycosylation) to process variables and raw materials (especially amino acid supplements in cell culture media). The proposed approach can be applied for process optimization to increase product expression, improve consistency of product quality, and target the desired quality attribute profile. © 2015 American Institute of Chemical Engineers.

  1. Clinical risk assessment of patients with chronic kidney disease by using clinical data and multivariate models.

    PubMed

    Chen, Zewei; Zhang, Xin; Zhang, Zhuoyong

    2016-12-01

    Timely risk assessment of chronic kidney disease (CKD) and proper community-based CKD monitoring are important to prevent patients with potential risk from further kidney injuries. As many symptoms are associated with the progressive development of CKD, evaluating risk of CKD through a set of clinical data of symptoms coupled with multivariate models can be considered as an available method for prevention of CKD and would be useful for community-based CKD monitoring. Three common used multivariate models, i.e., K-nearest neighbor (KNN), support vector machine (SVM), and soft independent modeling of class analogy (SIMCA), were used to evaluate risk of 386 patients based on a series of clinical data taken from UCI machine learning repository. Different types of composite data, in which proportional disturbances were added to simulate measurement deviations caused by environment and instrument noises, were also utilized to evaluate the feasibility and robustness of these models in risk assessment of CKD. For the original data set, three mentioned multivariate models can differentiate patients with CKD and non-CKD with the overall accuracies over 93 %. KNN and SVM have better performances than SIMCA has in this study. For the composite data set, SVM model has the best ability to tolerate noise disturbance and thus are more robust than the other two models. Using clinical data set on symptoms coupled with multivariate models has been proved to be feasible approach for assessment of patient with potential CKD risk. SVM model can be used as useful and robust tool in this study.

  2. Cole-Cole, linear and multivariate modeling of capacitance data for on-line monitoring of biomass.

    PubMed

    Dabros, Michal; Dennewald, Danielle; Currie, David J; Lee, Mark H; Todd, Robert W; Marison, Ian W; von Stockar, Urs

    2009-02-01

    This work evaluates three techniques of calibrating capacitance (dielectric) spectrometers used for on-line monitoring of biomass: modeling of cell properties using the theoretical Cole-Cole equation, linear regression of dual-frequency capacitance measurements on biomass concentration, and multivariate (PLS) modeling of scanning dielectric spectra. The performance and robustness of each technique is assessed during a sequence of validation batches in two experimental settings of differing signal noise. In more noisy conditions, the Cole-Cole model had significantly higher biomass concentration prediction errors than the linear and multivariate models. The PLS model was the most robust in handling signal noise. In less noisy conditions, the three models performed similarly. Estimates of the mean cell size were done additionally using the Cole-Cole and PLS models, the latter technique giving more satisfactory results.

  3. Multivariate regression model for predicting lumber grade volumes of northern red oak sawlogs

    Treesearch

    Daniel A. Yaussy; Robert L. Brisbin

    1983-01-01

    A multivariate regression model was developed to predict green board-foot yields for the seven common factory lumber grades processed from northern red oak (Quercus rubra L.) factory grade logs. The model uses the standard log measurements of grade, scaling diameter, length, and percent defect. It was validated with an independent data set. The model...

  4. A Hierarchical Multivariate Bayesian Approach to Ensemble Model output Statistics in Atmospheric Prediction

    DTIC Science & Technology

    2017-09-01

    efficacy of statistical post-processing methods downstream of these dynamical model components with a hierarchical multivariate Bayesian approach to...Bayesian hierarchical modeling, Markov chain Monte Carlo methods , Metropolis algorithm, machine learning, atmospheric prediction 15. NUMBER OF PAGES...scale processes. However, this dissertation explores the efficacy of statistical post-processing methods downstream of these dynamical model components

  5. Predictive and mechanistic multivariate linear regression models for reaction development

    PubMed Central

    Santiago, Celine B.; Guo, Jing-Yao

    2018-01-01

    Multivariate Linear Regression (MLR) models utilizing computationally-derived and empirically-derived physical organic molecular descriptors are described in this review. Several reports demonstrating the effectiveness of this methodological approach towards reaction optimization and mechanistic interrogation are discussed. A detailed protocol to access quantitative and predictive MLR models is provided as a guide for model development and parameter analysis. PMID:29719711

  6. Linear regression analysis and its application to multivariate chromatographic calibration for the quantitative analysis of two-component mixtures.

    PubMed

    Dinç, Erdal; Ozdemir, Abdil

    2005-01-01

    Multivariate chromatographic calibration technique was developed for the quantitative analysis of binary mixtures enalapril maleate (EA) and hydrochlorothiazide (HCT) in tablets in the presence of losartan potassium (LST). The mathematical algorithm of multivariate chromatographic calibration technique is based on the use of the linear regression equations constructed using relationship between concentration and peak area at the five-wavelength set. The algorithm of this mathematical calibration model having a simple mathematical content was briefly described. This approach is a powerful mathematical tool for an optimum chromatographic multivariate calibration and elimination of fluctuations coming from instrumental and experimental conditions. This multivariate chromatographic calibration contains reduction of multivariate linear regression functions to univariate data set. The validation of model was carried out by analyzing various synthetic binary mixtures and using the standard addition technique. Developed calibration technique was applied to the analysis of the real pharmaceutical tablets containing EA and HCT. The obtained results were compared with those obtained by classical HPLC method. It was observed that the proposed multivariate chromatographic calibration gives better results than classical HPLC.

  7. Chemical Attribution of Fentanyl Using Multivariate Statistical Analysis of Orthogonal Mass Spectral Data

    DOE PAGES

    Mayer, Brian P.; DeHope, Alan J.; Mew, Daniel A.; ...

    2016-03-24

    Attribution of the origin of an illicit drug relies on identification of compounds indicative of its clandestine production and is a key component of many modern forensic investigations. Here, the results of these studies can yield detailed information on method of manufacture, starting material source, and final product, all critical forensic evidence. In the present work, chemical attribution signatures (CAS) associated with the synthesis of the analgesic fentanyl, N-(1-phenylethylpiperidin-4-yl)-N-phenylpropanamide, were investigated. Six synthesis methods, all previously published fentanyl synthetic routes or hybrid versions thereof, were studied in an effort to identify and classify route-specific signatures. A total of 160 distinctmore » compounds and inorganic species were identified using gas and liquid chromatographies combined with mass spectrometric methods (gas chromatography/mass spectrometry (GC/MS) and liquid chromatography–tandem mass spectrometry-time of-flight (LC–MS/MS-TOF)) in conjunction with inductively coupled plasma mass spectrometry (ICPMS). The complexity of the resultant data matrix urged the use of multivariate statistical analysis. Using partial least-squares-discriminant analysis (PLS-DA), 87 route-specific CAS were classified and a statistical model capable of predicting the method of fentanyl synthesis was validated and tested against CAS profiles from crude fentanyl products deposited and later extracted from two operationally relevant surfaces: stainless steel and vinyl tile. Finally, this work provides the most detailed fentanyl CAS investigation to date by using orthogonal mass spectral data to identify CAS of forensic significance for illicit drug detection, profiling, and attribution.« less

  8. Chemical Attribution of Fentanyl Using Multivariate Statistical Analysis of Orthogonal Mass Spectral Data

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Mayer, Brian P.; DeHope, Alan J.; Mew, Daniel A.

    Attribution of the origin of an illicit drug relies on identification of compounds indicative of its clandestine production and is a key component of many modern forensic investigations. Here, the results of these studies can yield detailed information on method of manufacture, starting material source, and final product, all critical forensic evidence. In the present work, chemical attribution signatures (CAS) associated with the synthesis of the analgesic fentanyl, N-(1-phenylethylpiperidin-4-yl)-N-phenylpropanamide, were investigated. Six synthesis methods, all previously published fentanyl synthetic routes or hybrid versions thereof, were studied in an effort to identify and classify route-specific signatures. A total of 160 distinctmore » compounds and inorganic species were identified using gas and liquid chromatographies combined with mass spectrometric methods (gas chromatography/mass spectrometry (GC/MS) and liquid chromatography–tandem mass spectrometry-time of-flight (LC–MS/MS-TOF)) in conjunction with inductively coupled plasma mass spectrometry (ICPMS). The complexity of the resultant data matrix urged the use of multivariate statistical analysis. Using partial least-squares-discriminant analysis (PLS-DA), 87 route-specific CAS were classified and a statistical model capable of predicting the method of fentanyl synthesis was validated and tested against CAS profiles from crude fentanyl products deposited and later extracted from two operationally relevant surfaces: stainless steel and vinyl tile. Finally, this work provides the most detailed fentanyl CAS investigation to date by using orthogonal mass spectral data to identify CAS of forensic significance for illicit drug detection, profiling, and attribution.« less

  9. Power of Models in Longitudinal Study: Findings from a Full-Crossed Simulation Design

    ERIC Educational Resources Information Center

    Fang, Hua; Brooks, Gordon P.; Rizzo, Maria L.; Espy, Kimberly Andrews; Barcikowski, Robert S.

    2009-01-01

    Because the power properties of traditional repeated measures and hierarchical multivariate linear models have not been clearly determined in the balanced design for longitudinal studies in the literature, the authors present a power comparison study of traditional repeated measures and hierarchical multivariate linear models under 3…

  10. Species distribution modelling for plant communities: Stacked single species or multivariate modelling approaches?

    Treesearch

    Emilie B. Henderson; Janet L. Ohmann; Matthew J. Gregory; Heather M. Roberts; Harold S.J. Zald

    2014-01-01

    Landscape management and conservation planning require maps of vegetation composition and structure over large regions. Species distribution models (SDMs) are often used for individual species, but projects mapping multiple species are rarer. We compare maps of plant community composition assembled by stacking results from many SDMs with multivariate maps constructed...

  11. IRT-ZIP Modeling for Multivariate Zero-Inflated Count Data

    ERIC Educational Resources Information Center

    Wang, Lijuan

    2010-01-01

    This study introduces an item response theory-zero-inflated Poisson (IRT-ZIP) model to investigate psychometric properties of multiple items and predict individuals' latent trait scores for multivariate zero-inflated count data. In the model, two link functions are used to capture two processes of the zero-inflated count data. Item parameters are…

  12. Forecasting High-Priority Infectious Disease Surveillance Regions: A Socioeconomic Model

    PubMed Central

    Chan, Emily H.; Scales, David A.; Brewer, Timothy F.; Madoff, Lawrence C.; Pollack, Marjorie P.; Hoen, Anne G.; Choden, Tenzin; Brownstein, John S.

    2013-01-01

    Background. Few researchers have assessed the relationships between socioeconomic inequality and infectious disease outbreaks at the population level globally. We use a socioeconomic model to forecast national annual rates of infectious disease outbreaks. Methods. We constructed a multivariate mixed-effects Poisson model of the number of times a given country was the origin of an outbreak in a given year. The dataset included 389 outbreaks of international concern reported in the World Health Organization's Disease Outbreak News from 1996 to 2008. The initial full model included 9 socioeconomic variables related to education, poverty, population health, urbanization, health infrastructure, gender equality, communication, transportation, and democracy, and 1 composite index. Population, latitude, and elevation were included as potential confounders. The initial model was pared down to a final model by a backwards elimination procedure. The dependent and independent variables were lagged by 2 years to allow for forecasting future rates. Results. Among the socioeconomic variables tested, the final model included child measles immunization rate and telephone line density. The Democratic Republic of Congo, China, and Brazil were predicted to be at the highest risk for outbreaks in 2010, and Colombia and Indonesia were predicted to have the highest percentage of increase in their risk compared to their average over 1996–2008. Conclusions. Understanding socioeconomic factors could help improve the understanding of outbreak risk. The inclusion of the measles immunization variable suggests that there is a fundamental basis in ensuring adequate public health capacity. Increased vigilance and expanding public health capacity should be prioritized in the projected high-risk regions. PMID:23118271

  13. A flexible model for multivariate interval-censored survival times with complex correlation structure.

    PubMed

    Falcaro, Milena; Pickles, Andrew

    2007-02-10

    We focus on the analysis of multivariate survival times with highly structured interdependency and subject to interval censoring. Such data are common in developmental genetics and genetic epidemiology. We propose a flexible mixed probit model that deals naturally with complex but uninformative censoring. The recorded ages of onset are treated as possibly censored ordinal outcomes with the interval censoring mechanism seen as arising from a coarsened measurement of a continuous variable observed as falling between subject-specific thresholds. This bypasses the requirement for the failure times to be observed as falling into non-overlapping intervals. The assumption of a normal age-of-onset distribution of the standard probit model is relaxed by embedding within it a multivariate Box-Cox transformation whose parameters are jointly estimated with the other parameters of the model. Complex decompositions of the underlying multivariate normal covariance matrix of the transformed ages of onset become possible. The new methodology is here applied to a multivariate study of the ages of first use of tobacco and first consumption of alcohol without parental permission in twins. The proposed model allows estimation of the genetic and environmental effects that are shared by both of these risk behaviours as well as those that are specific. 2006 John Wiley & Sons, Ltd.

  14. Perceived justice and popular support for public health laws: a case study around comprehensive smoke-free legislation in Mexico City.

    PubMed

    Thrasher, James F; Besley, John C; González, Wendy

    2010-03-01

    The World Health Organization's Framework Convention on Tobacco Control promotes comprehensive smoke-free laws. The effective implementation of these laws requires citizen participation and support. Risk communication research suggests that citizens' perceptions of the fairness of smoke-free laws would help explain their support for the law. This study aimed to assess the factors that correlate with citizens' perceptions of the distributive, procedural and interpersonal justice of smoke-free laws, as well as how these perceptions are related to support for and intention to help enforce these laws. Study data came from a cross-sectional, population-based survey of 800 Mexico City inhabitants before a comprehensive smoke-free policy was implemented there in 2008. Structural equation modeling was used to estimate the bivariate and multivariate adjusted paths relating study variables. In the final multivariate model, the three justice concepts mediated the influence of smoking status, perceived dangers of secondhand smoke exposure, strength of home smoking ban, and perceived rights of smokers on the two distal constructs of support for smoke-free policy and intention to help enforce it. Statistically significant paths were estimated from distributive and procedural justice to support for the law and intention help enforce it. The path from interpersonal justice to support for the law was not significant, but the path to intention to help enforce the law was. Finally, the path from support for the law to the intention to enforce it was statistically significant. These results suggest that three distinct dimensions of perceived justice help explain citizen support for smoke-free policies. These dimensions of perceived justice may explain the conditions under which smoke-free policies are effectively implemented and could help shape the focus for communication strategies that aim to ensure effective implementation of this and other public health policies. 2009 Elsevier Ltd. All rights reserved.

  15. 64Cu-DOTATATE PET/MRI for Detection of Activated Macrophages in Carotid Atherosclerotic Plaques: Studies in Patients Undergoing Endarterectomy.

    PubMed

    Pedersen, Sune Folke; Sandholt, Benjamin Vikjær; Keller, Sune Høgild; Hansen, Adam Espe; Clemmensen, Andreas Ettrup; Sillesen, Henrik; Højgaard, Liselotte; Ripa, Rasmus Sejersten; Kjær, Andreas

    2015-07-01

    A feature of vulnerable atherosclerotic plaques of the carotid artery is high activity and abundance of lesion macrophages. There is consensus that this is of importance for plaque vulnerability, which may lead to clinical events, such as stroke and transient ischemic attack. We used positron emission tomography (PET) and the novel PET ligand [(64)Cu] [1,4,7,10-tetraazacyclododecane-N,N',N″,N‴-tetraacetic acid]-d-Phe1,Tyr3-octreotate ((64)Cu-DOTATATE) to specifically target macrophages via the somatostatin receptor subtype-2 in vivo. Ten patients underwent simultaneous PET/MRI to measure (64)Cu-DOTATATE uptake in carotid artery plaques before carotid endarterectomy. (64)Cu-DOTATATE uptake was significantly higher in symptomatic plaque versus the contralateral carotid artery (P<0.001). Subsequently, a total of 62 plaque segments were assessed for gene expression of selected markers of plaque vulnerability using real-time quantitative polymerase chain reaction. These results were compared with in vivo (64)Cu-DOTATATE uptake calculated as the mean standardized uptake value. Univariate analysis of real-time quantitative polymerase chain reaction and PET showed that cluster of differentiation 163 (CD163) and CD68 gene expression correlated significantly but weakly with mean standardized uptake value in scans performed 85 minutes post injection (P<0.001 and P=0.015, respectively). Subsequent multivariate analysis showed that CD163 correlated independently with (64)Cu-DOTATATE uptake (P=0.031) whereas CD68 did not contribute significantly to the final model. The novel PET tracer (64)Cu-DOTATATE accumulates in atherosclerotic plaques of the carotid artery. CD163 gene expression correlated independently with (64)Cu-DOTATATE uptake measured by real-time quantitative polymerase chain reaction in the final multivariate model, indicating that (64)Cu-DOTATATE PET is detecting alternatively activated macrophages. This association could potentially improve noninvasive identification and characterization of vulnerable plaques. © 2015 The Authors.

  16. Can multivariate models based on MOAKS predict OA knee pain? Data from the Osteoarthritis Initiative

    NASA Astrophysics Data System (ADS)

    Luna-Gómez, Carlos D.; Zanella-Calzada, Laura A.; Galván-Tejada, Jorge I.; Galván-Tejada, Carlos E.; Celaya-Padilla, José M.

    2017-03-01

    Osteoarthritis is the most common rheumatic disease in the world. Knee pain is the most disabling symptom in the disease, the prediction of pain is one of the targets in preventive medicine, this can be applied to new therapies or treatments. Using the magnetic resonance imaging and the grading scales, a multivariate model based on genetic algorithms is presented. Using a predictive model can be useful to associate minor structure changes in the joint with the future knee pain. Results suggest that multivariate models can be predictive with future knee chronic pain. All models; T0, T1 and T2, were statistically significant, all p values were < 0.05 and all AUC > 0.60.

  17. Drought forecasting in Luanhe River basin involving climatic indices

    NASA Astrophysics Data System (ADS)

    Ren, Weinan; Wang, Yixuan; Li, Jianzhu; Feng, Ping; Smith, Ronald J.

    2017-11-01

    Drought is regarded as one of the most severe natural disasters globally. This is especially the case in Tianjin City, Northern China, where drought can affect economic development and people's livelihoods. Drought forecasting, the basis of drought management, is an important mitigation strategy. In this paper, we evolve a probabilistic forecasting model, which forecasts transition probabilities from a current Standardized Precipitation Index (SPI) value to a future SPI class, based on conditional distribution of multivariate normal distribution to involve two large-scale climatic indices at the same time, and apply the forecasting model to 26 rain gauges in the Luanhe River basin in North China. The establishment of the model and the derivation of the SPI are based on the hypothesis of aggregated monthly precipitation that is normally distributed. Pearson correlation and Shapiro-Wilk normality tests are used to select appropriate SPI time scale and large-scale climatic indices. Findings indicated that longer-term aggregated monthly precipitation, in general, was more likely to be considered normally distributed and forecasting models should be applied to each gauge, respectively, rather than to the whole basin. Taking Liying Gauge as an example, we illustrate the impact of the SPI time scale and lead time on transition probabilities. Then, the controlled climatic indices of every gauge are selected by Pearson correlation test and the multivariate normality of SPI, corresponding climatic indices for current month and SPI 1, 2, and 3 months later are demonstrated using Shapiro-Wilk normality test. Subsequently, we illustrate the impact of large-scale oceanic-atmospheric circulation patterns on transition probabilities. Finally, we use a score method to evaluate and compare the performance of the three forecasting models and compare them with two traditional models which forecast transition probabilities from a current to a future SPI class. The results show that the three proposed models outperform the two traditional models and involving large-scale climatic indices can improve the forecasting accuracy.

  18. The Multivariate Largest Lyapunov Exponent as an Age-Related Metric of Quiet Standing Balance

    PubMed Central

    Liu, Kun; Wang, Hongrui; Xiao, Jinzhuang

    2015-01-01

    The largest Lyapunov exponent has been researched as a metric of the balance ability during human quiet standing. However, the sensitivity and accuracy of this measurement method are not good enough for clinical use. The present research proposes a metric of the human body's standing balance ability based on the multivariate largest Lyapunov exponent which can quantify the human standing balance. The dynamic multivariate time series of ankle, knee, and hip were measured by multiple electrical goniometers. Thirty-six normal people of different ages participated in the test. With acquired data, the multivariate largest Lyapunov exponent was calculated. Finally, the results of the proposed approach were analysed and compared with the traditional method, for which the largest Lyapunov exponent and power spectral density from the centre of pressure were also calculated. The following conclusions can be obtained. The multivariate largest Lyapunov exponent has a higher degree of differentiation in differentiating balance in eyes-closed conditions. The MLLE value reflects the overall coordination between multisegment movements. Individuals of different ages can be distinguished by their MLLE values. The standing stability of human is reduced with the increment of age. PMID:26064182

  19. Multivariate-$t$ nonlinear mixed models with application to censored multi-outcome AIDS studies.

    PubMed

    Lin, Tsung-I; Wang, Wan-Lun

    2017-10-01

    In multivariate longitudinal HIV/AIDS studies, multi-outcome repeated measures on each patient over time may contain outliers, and the viral loads are often subject to a upper or lower limit of detection depending on the quantification assays. In this article, we consider an extension of the multivariate nonlinear mixed-effects model by adopting a joint multivariate-$t$ distribution for random effects and within-subject errors and taking the censoring information of multiple responses into account. The proposed model is called the multivariate-$t$ nonlinear mixed-effects model with censored responses (MtNLMMC), allowing for analyzing multi-outcome longitudinal data exhibiting nonlinear growth patterns with censorship and fat-tailed behavior. Utilizing the Taylor-series linearization method, a pseudo-data version of expectation conditional maximization either (ECME) algorithm is developed for iteratively carrying out maximum likelihood estimation. We illustrate our techniques with two data examples from HIV/AIDS studies. Experimental results signify that the MtNLMMC performs favorably compared to its Gaussian analogue and some existing approaches. © The Author 2017. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  20. Multivariate analysis of longitudinal rates of change.

    PubMed

    Bryan, Matthew; Heagerty, Patrick J

    2016-12-10

    Longitudinal data allow direct comparison of the change in patient outcomes associated with treatment or exposure. Frequently, several longitudinal measures are collected that either reflect a common underlying health status, or characterize processes that are influenced in a similar way by covariates such as exposure or demographic characteristics. Statistical methods that can combine multivariate response variables into common measures of covariate effects have been proposed in the literature. Current methods for characterizing the relationship between covariates and the rate of change in multivariate outcomes are limited to select models. For example, 'accelerated time' methods have been developed which assume that covariates rescale time in longitudinal models for disease progression. In this manuscript, we detail an alternative multivariate model formulation that directly structures longitudinal rates of change and that permits a common covariate effect across multiple outcomes. We detail maximum likelihood estimation for a multivariate longitudinal mixed model. We show via asymptotic calculations the potential gain in power that may be achieved with a common analysis of multiple outcomes. We apply the proposed methods to the analysis of a trivariate outcome for infant growth and compare rates of change for HIV infected and uninfected infants. Copyright © 2016 John Wiley & Sons, Ltd. Copyright © 2016 John Wiley & Sons, Ltd.

  1. Voxelwise multivariate analysis of multimodality magnetic resonance imaging

    PubMed Central

    Naylor, Melissa G.; Cardenas, Valerie A.; Tosun, Duygu; Schuff, Norbert; Weiner, Michael; Schwartzman, Armin

    2015-01-01

    Most brain magnetic resonance imaging (MRI) studies concentrate on a single MRI contrast or modality, frequently structural MRI. By performing an integrated analysis of several modalities, such as structural, perfusion-weighted, and diffusion-weighted MRI, new insights may be attained to better understand the underlying processes of brain diseases. We compare two voxelwise approaches: (1) fitting multiple univariate models, one for each outcome and then adjusting for multiple comparisons among the outcomes and (2) fitting a multivariate model. In both cases, adjustment for multiple comparisons is performed over all voxels jointly to account for the search over the brain. The multivariate model is able to account for the multiple comparisons over outcomes without assuming independence because the covariance structure between modalities is estimated. Simulations show that the multivariate approach is more powerful when the outcomes are correlated and, even when the outcomes are independent, the multivariate approach is just as powerful or more powerful when at least two outcomes are dependent on predictors in the model. However, multiple univariate regressions with Bonferroni correction remains a desirable alternative in some circumstances. To illustrate the power of each approach, we analyze a case control study of Alzheimer's disease, in which data from three MRI modalities are available. PMID:23408378

  2. Preliminary Multivariable Cost Model for Space Telescopes

    NASA Technical Reports Server (NTRS)

    Stahl, H. Philip

    2010-01-01

    Parametric cost models are routinely used to plan missions, compare concepts and justify technology investments. Previously, the authors published two single variable cost models based on 19 flight missions. The current paper presents the development of a multi-variable space telescopes cost model. The validity of previously published models are tested. Cost estimating relationships which are and are not significant cost drivers are identified. And, interrelationships between variables are explored

  3. On the effect of model parameters on forecast objects

    NASA Astrophysics Data System (ADS)

    Marzban, Caren; Jones, Corinne; Li, Ning; Sandgathe, Scott

    2018-04-01

    Many physics-based numerical models produce a gridded, spatial field of forecasts, e.g., a temperature map. The field for some quantities generally consists of spatially coherent and disconnected objects. Such objects arise in many problems, including precipitation forecasts in atmospheric models, eddy currents in ocean models, and models of forest fires. Certain features of these objects (e.g., location, size, intensity, and shape) are generally of interest. Here, a methodology is developed for assessing the impact of model parameters on the features of forecast objects. The main ingredients of the methodology include the use of (1) Latin hypercube sampling for varying the values of the model parameters, (2) statistical clustering algorithms for identifying objects, (3) multivariate multiple regression for assessing the impact of multiple model parameters on the distribution (across the forecast domain) of object features, and (4) methods for reducing the number of hypothesis tests and controlling the resulting errors. The final output of the methodology is a series of box plots and confidence intervals that visually display the sensitivities. The methodology is demonstrated on precipitation forecasts from a mesoscale numerical weather prediction model.

  4. DUALITY IN MULTIVARIATE RECEPTOR MODEL. (R831078)

    EPA Science Inventory

    Multivariate receptor models are used for source apportionment of multiple observations of compositional data of air pollutants that obey mass conservation. Singular value decomposition of the data leads to two sets of eigenvectors. One set of eigenvectors spans a space in whi...

  5. Variable selection based on clustering analysis for improvement of polyphenols prediction in green tea using synchronous fluorescence spectra

    NASA Astrophysics Data System (ADS)

    Shan, Jiajia; Wang, Xue; Zhou, Hao; Han, Shuqing; Riza, Dimas Firmanda Al; Kondo, Naoshi

    2018-04-01

    Synchronous fluorescence spectra, combined with multivariate analysis were used to predict flavonoids content in green tea rapidly and nondestructively. This paper presented a new and efficient spectral intervals selection method called clustering based partial least square (CL-PLS), which selected informative wavelengths by combining clustering concept and partial least square (PLS) methods to improve models’ performance by synchronous fluorescence spectra. The fluorescence spectra of tea samples were obtained and k-means and kohonen-self organizing map clustering algorithms were carried out to cluster full spectra into several clusters, and sub-PLS regression model was developed on each cluster. Finally, CL-PLS models consisting of gradually selected clusters were built. Correlation coefficient (R) was used to evaluate the effect on prediction performance of PLS models. In addition, variable influence on projection partial least square (VIP-PLS), selectivity ratio partial least square (SR-PLS), interval partial least square (iPLS) models and full spectra PLS model were investigated and the results were compared. The results showed that CL-PLS presented the best result for flavonoids prediction using synchronous fluorescence spectra.

  6. Coordinated control system modelling of ultra-supercritical unit based on a new T-S fuzzy structure.

    PubMed

    Hou, Guolian; Du, Huan; Yang, Yu; Huang, Congzhi; Zhang, Jianhua

    2018-03-01

    The thermal power plant, especially the ultra-supercritical unit is featured with severe nonlinearity, strong multivariable coupling. In order to deal with these difficulties, it is of great importance to build an accurate and simple model of the coordinated control system (CCS) in the ultra-supercritical unit. In this paper, an improved T-S fuzzy model identification approach is proposed. First of all, the k-means++ algorithm is employed to identify the premise parameters so as to guarantee the number of fuzzy rules. Then, the local linearized models are determined by using the incremental historical data around the cluster centers, which are obtained via the stochastic gradient descent algorithm with momentum and variable learning rate. Finally, with the proposed method, the CCS model of a 1000 MW USC unit in Tai Zhou power plant is developed. The effectiveness of the proposed approach is validated by the given extensive simulation results, and it can be further employed to design the overall advanced controllers for the CCS in an USC unit. Copyright © 2018 ISA. Published by Elsevier Ltd. All rights reserved.

  7. Multivariate modelling of endophenotypes associated with the metabolic syndrome in Chinese twins.

    PubMed

    Pang, Z; Zhang, D; Li, S; Duan, H; Hjelmborg, J; Kruse, T A; Kyvik, K O; Christensen, K; Tan, Q

    2010-12-01

    The common genetic and environmental effects on endophenotypes related to the metabolic syndrome have been investigated using bivariate and multivariate twin models. This paper extends the pairwise analysis approach by introducing independent and common pathway models to Chinese twin data. The aim was to explore the common genetic architecture in the development of these phenotypes in the Chinese population. Three multivariate models including the full saturated Cholesky decomposition model, the common factor independent pathway model and the common factor common pathway model were fitted to 695 pairs of Chinese twins representing six phenotypes including BMI, total cholesterol, total triacylglycerol, fasting glucose, HDL and LDL. Performances of the nested models were compared with that of the full Cholesky model. Cross-phenotype correlation coefficients gave clear indication of common genetic or environmental backgrounds in the phenotypes. Decomposition of phenotypic correlation by the Cholesky model revealed that the observed phenotypic correlation among lipid phenotypes had genetic and unique environmental backgrounds. Both pathway models suggest a common genetic architecture for lipid phenotypes, which is distinct from that of the non-lipid phenotypes. The declining performance with model restriction indicates biological heterogeneity in development among some of these phenotypes. Our multivariate analyses revealed common genetic and environmental backgrounds for the studied lipid phenotypes in Chinese twins. Model performance showed that physiologically distinct endophenotypes may follow different genetic regulations.

  8. Methodological challenges to multivariate syndromic surveillance: a case study using Swiss animal health data.

    PubMed

    Vial, Flavie; Wei, Wei; Held, Leonhard

    2016-12-20

    In an era of ubiquitous electronic collection of animal health data, multivariate surveillance systems (which concurrently monitor several data streams) should have a greater probability of detecting disease events than univariate systems. However, despite their limitations, univariate aberration detection algorithms are used in most active syndromic surveillance (SyS) systems because of their ease of application and interpretation. On the other hand, a stochastic modelling-based approach to multivariate surveillance offers more flexibility, allowing for the retention of historical outbreaks, for overdispersion and for non-stationarity. While such methods are not new, they are yet to be applied to animal health surveillance data. We applied an example of such stochastic model, Held and colleagues' two-component model, to two multivariate animal health datasets from Switzerland. In our first application, multivariate time series of the number of laboratories test requests were derived from Swiss animal diagnostic laboratories. We compare the performance of the two-component model to parallel monitoring using an improved Farrington algorithm and found both methods yield a satisfactorily low false alarm rate. However, the calibration test of the two-component model on the one-step ahead predictions proved satisfactory, making such an approach suitable for outbreak prediction. In our second application, the two-component model was applied to the multivariate time series of the number of cattle abortions and the number of test requests for bovine viral diarrhea (a disease that often results in abortions). We found that there is a two days lagged effect from the number of abortions to the number of test requests. We further compared the joint modelling and univariate modelling of the number of laboratory test requests time series. The joint modelling approach showed evidence of superiority in terms of forecasting abilities. Stochastic modelling approaches offer the potential to address more realistic surveillance scenarios through, for example, the inclusion of times series specific parameters, or of covariates known to have an impact on syndrome counts. Nevertheless, many methodological challenges to multivariate surveillance of animal SyS data still remain. Deciding on the amount of corroboration among data streams that is required to escalate into an alert is not a trivial task given the sparse data on the events under consideration (e.g. disease outbreaks).

  9. Higher-order Multivariable Polynomial Regression to Estimate Human Affective States

    NASA Astrophysics Data System (ADS)

    Wei, Jie; Chen, Tong; Liu, Guangyuan; Yang, Jiemin

    2016-03-01

    From direct observations, facial, vocal, gestural, physiological, and central nervous signals, estimating human affective states through computational models such as multivariate linear-regression analysis, support vector regression, and artificial neural network, have been proposed in the past decade. In these models, linear models are generally lack of precision because of ignoring intrinsic nonlinearities of complex psychophysiological processes; and nonlinear models commonly adopt complicated algorithms. To improve accuracy and simplify model, we introduce a new computational modeling method named as higher-order multivariable polynomial regression to estimate human affective states. The study employs standardized pictures in the International Affective Picture System to induce thirty subjects’ affective states, and obtains pure affective patterns of skin conductance as input variables to the higher-order multivariable polynomial model for predicting affective valence and arousal. Experimental results show that our method is able to obtain efficient correlation coefficients of 0.98 and 0.96 for estimation of affective valence and arousal, respectively. Moreover, the method may provide certain indirect evidences that valence and arousal have their brain’s motivational circuit origins. Thus, the proposed method can serve as a novel one for efficiently estimating human affective states.

  10. Higher-order Multivariable Polynomial Regression to Estimate Human Affective States

    PubMed Central

    Wei, Jie; Chen, Tong; Liu, Guangyuan; Yang, Jiemin

    2016-01-01

    From direct observations, facial, vocal, gestural, physiological, and central nervous signals, estimating human affective states through computational models such as multivariate linear-regression analysis, support vector regression, and artificial neural network, have been proposed in the past decade. In these models, linear models are generally lack of precision because of ignoring intrinsic nonlinearities of complex psychophysiological processes; and nonlinear models commonly adopt complicated algorithms. To improve accuracy and simplify model, we introduce a new computational modeling method named as higher-order multivariable polynomial regression to estimate human affective states. The study employs standardized pictures in the International Affective Picture System to induce thirty subjects’ affective states, and obtains pure affective patterns of skin conductance as input variables to the higher-order multivariable polynomial model for predicting affective valence and arousal. Experimental results show that our method is able to obtain efficient correlation coefficients of 0.98 and 0.96 for estimation of affective valence and arousal, respectively. Moreover, the method may provide certain indirect evidences that valence and arousal have their brain’s motivational circuit origins. Thus, the proposed method can serve as a novel one for efficiently estimating human affective states. PMID:26996254

  11. Esophageal wall dose-surface maps do not improve the predictive performance of a multivariable NTCP model for acute esophageal toxicity in advanced stage NSCLC patients treated with intensity-modulated (chemo-)radiotherapy.

    PubMed

    Dankers, Frank; Wijsman, Robin; Troost, Esther G C; Monshouwer, René; Bussink, Johan; Hoffmann, Aswin L

    2017-05-07

    In our previous work, a multivariable normal-tissue complication probability (NTCP) model for acute esophageal toxicity (AET) Grade  ⩾2 after highly conformal (chemo-)radiotherapy for non-small cell lung cancer (NSCLC) was developed using multivariable logistic regression analysis incorporating clinical parameters and mean esophageal dose (MED). Since the esophagus is a tubular organ, spatial information of the esophageal wall dose distribution may be important in predicting AET. We investigated whether the incorporation of esophageal wall dose-surface data with spatial information improves the predictive power of our established NTCP model. For 149 NSCLC patients treated with highly conformal radiation therapy esophageal wall dose-surface histograms (DSHs) and polar dose-surface maps (DSMs) were generated. DSMs were used to generate new DSHs and dose-length-histograms that incorporate spatial information of the dose-surface distribution. From these histograms dose parameters were derived and univariate logistic regression analysis showed that they correlated significantly with AET. Following our previous work, new multivariable NTCP models were developed using the most significant dose histogram parameters based on univariate analysis (19 in total). However, the 19 new models incorporating esophageal wall dose-surface data with spatial information did not show improved predictive performance (area under the curve, AUC range 0.79-0.84) over the established multivariable NTCP model based on conventional dose-volume data (AUC  =  0.84). For prediction of AET, based on the proposed multivariable statistical approach, spatial information of the esophageal wall dose distribution is of no added value and it is sufficient to only consider MED as a predictive dosimetric parameter.

  12. Esophageal wall dose-surface maps do not improve the predictive performance of a multivariable NTCP model for acute esophageal toxicity in advanced stage NSCLC patients treated with intensity-modulated (chemo-)radiotherapy

    NASA Astrophysics Data System (ADS)

    Dankers, Frank; Wijsman, Robin; Troost, Esther G. C.; Monshouwer, René; Bussink, Johan; Hoffmann, Aswin L.

    2017-05-01

    In our previous work, a multivariable normal-tissue complication probability (NTCP) model for acute esophageal toxicity (AET) Grade  ⩾2 after highly conformal (chemo-)radiotherapy for non-small cell lung cancer (NSCLC) was developed using multivariable logistic regression analysis incorporating clinical parameters and mean esophageal dose (MED). Since the esophagus is a tubular organ, spatial information of the esophageal wall dose distribution may be important in predicting AET. We investigated whether the incorporation of esophageal wall dose-surface data with spatial information improves the predictive power of our established NTCP model. For 149 NSCLC patients treated with highly conformal radiation therapy esophageal wall dose-surface histograms (DSHs) and polar dose-surface maps (DSMs) were generated. DSMs were used to generate new DSHs and dose-length-histograms that incorporate spatial information of the dose-surface distribution. From these histograms dose parameters were derived and univariate logistic regression analysis showed that they correlated significantly with AET. Following our previous work, new multivariable NTCP models were developed using the most significant dose histogram parameters based on univariate analysis (19 in total). However, the 19 new models incorporating esophageal wall dose-surface data with spatial information did not show improved predictive performance (area under the curve, AUC range 0.79-0.84) over the established multivariable NTCP model based on conventional dose-volume data (AUC  =  0.84). For prediction of AET, based on the proposed multivariable statistical approach, spatial information of the esophageal wall dose distribution is of no added value and it is sufficient to only consider MED as a predictive dosimetric parameter.

  13. Non-medical use of prescription drugs in a national sample of college women

    PubMed Central

    McCauley, Jenna L.; Amstadter, Ananda B.; Macdonald, Alexandra; Danielson, Carla Kmett; Ruggiero, Kenneth J.; Resnick, Heidi S.; Kilpatrick, Dean G.

    2015-01-01

    Non-medical use of prescription drugs (NMUPD) is one of the fastest growing forms of illicit drug use, with research indicating that college students represent a particularly high risk population. The current study examined demographic characteristics, health/mental health, substance misuse, and rape experiences as potential risk correlates of NMUPD among a national sample of college women (N=2000). Interviews were conducted via telephone using Computer-Assisted Telephone Interviewing technology. NMUPD was assessed by asking if, participants had used a prescription drug non-medically in the past year. NMUPD was endorsed by 7.8% of the sample (n=155). Although incapacitated and drug–alcohol facilitated rape were associated with NMUPD in the initial model, the final multivariable model showed that only lifetime major depression and other forms of substance use/abuse were significantly uniquely associated with an increased likelihood of NMUPD. Implications for primary and secondary prevention and subsequent research are addressed. PMID:21356576

  14. Multivariate meta-analysis: potential and promise.

    PubMed

    Jackson, Dan; Riley, Richard; White, Ian R

    2011-09-10

    The multivariate random effects model is a generalization of the standard univariate model. Multivariate meta-analysis is becoming more commonly used and the techniques and related computer software, although continually under development, are now in place. In order to raise awareness of the multivariate methods, and discuss their advantages and disadvantages, we organized a one day 'Multivariate meta-analysis' event at the Royal Statistical Society. In addition to disseminating the most recent developments, we also received an abundance of comments, concerns, insights, critiques and encouragement. This article provides a balanced account of the day's discourse. By giving others the opportunity to respond to our assessment, we hope to ensure that the various view points and opinions are aired before multivariate meta-analysis simply becomes another widely used de facto method without any proper consideration of it by the medical statistics community. We describe the areas of application that multivariate meta-analysis has found, the methods available, the difficulties typically encountered and the arguments for and against the multivariate methods, using four representative but contrasting examples. We conclude that the multivariate methods can be useful, and in particular can provide estimates with better statistical properties, but also that these benefits come at the price of making more assumptions which do not result in better inference in every case. Although there is evidence that multivariate meta-analysis has considerable potential, it must be even more carefully applied than its univariate counterpart in practice. Copyright © 2011 John Wiley & Sons, Ltd.

  15. Multivariate meta-analysis: Potential and promise

    PubMed Central

    Jackson, Dan; Riley, Richard; White, Ian R

    2011-01-01

    The multivariate random effects model is a generalization of the standard univariate model. Multivariate meta-analysis is becoming more commonly used and the techniques and related computer software, although continually under development, are now in place. In order to raise awareness of the multivariate methods, and discuss their advantages and disadvantages, we organized a one day ‘Multivariate meta-analysis’ event at the Royal Statistical Society. In addition to disseminating the most recent developments, we also received an abundance of comments, concerns, insights, critiques and encouragement. This article provides a balanced account of the day's discourse. By giving others the opportunity to respond to our assessment, we hope to ensure that the various view points and opinions are aired before multivariate meta-analysis simply becomes another widely used de facto method without any proper consideration of it by the medical statistics community. We describe the areas of application that multivariate meta-analysis has found, the methods available, the difficulties typically encountered and the arguments for and against the multivariate methods, using four representative but contrasting examples. We conclude that the multivariate methods can be useful, and in particular can provide estimates with better statistical properties, but also that these benefits come at the price of making more assumptions which do not result in better inference in every case. Although there is evidence that multivariate meta-analysis has considerable potential, it must be even more carefully applied than its univariate counterpart in practice. Copyright © 2011 John Wiley & Sons, Ltd. PMID:21268052

  16. Stress and Personal Resource as Predictors of the Adjustment of Parents to Autistic Children: A Multivariate Model

    ERIC Educational Resources Information Center

    Siman-Tov, Ayelet; Kaniel, Shlomo

    2011-01-01

    The research validates a multivariate model that predicts parental adjustment to coping successfully with an autistic child. The model comprises four elements: parental stress, parental resources, parental adjustment and the child's autism symptoms. 176 parents of children aged between 6 to 16 diagnosed with PDD answered several questionnaires…

  17. Multivariate mixed linear model analysis of longitudinal data: an information-rich statistical technique for analyzing disease resistance data

    USDA-ARS?s Scientific Manuscript database

    The mixed linear model (MLM) is currently among the most advanced and flexible statistical modeling techniques and its use in tackling problems in plant pathology has begun surfacing in the literature. The longitudinal MLM is a multivariate extension that handles repeatedly measured data, such as r...

  18. Decomposing biodiversity data using the Latent Dirichlet Allocation model, a probabilistic multivariate statistical method

    Treesearch

    Denis Valle; Benjamin Baiser; Christopher W. Woodall; Robin Chazdon; Jerome Chave

    2014-01-01

    We propose a novel multivariate method to analyse biodiversity data based on the Latent Dirichlet Allocation (LDA) model. LDA, a probabilistic model, reduces assemblages to sets of distinct component communities. It produces easily interpretable results, can represent abrupt and gradual changes in composition, accommodates missing data and allows for coherent estimates...

  19. Theoretical constraints in the design of multivariable control systems

    NASA Technical Reports Server (NTRS)

    Rynaski, E. G.; Mook, D. J.

    1993-01-01

    The theoretical constraints inherent in the design of multivariable control systems were defined and investigated. These constraints are manifested by the system transmission zeros that limit or bound the areas in which closed loop poles and individual transfer function zeros may be placed. These constraints were investigated primarily in the context of system decoupling or non-interaction. It was proven that decoupling requires the placement of closed loop poles at the system transmission zeros. Therefore, the system transmission zeros must be minimum phase to guarantee a stable decoupled system. Once decoupling has been accomplished, the remaining part of the system exhibits transmission zeros at infinity, so nearly complete design freedom is possible in terms of placing both poles and zeros of individual closed loop transfer functions. A general, dynamic inversion model following system architecture was developed that encompasses both the implicit and explicit configuration. Robustness properties are developed along with other attributes of this type of system. Finally, a direct design is developed for the longitudinal-vertical degrees of freedom of aircraft motion to show how a direct lift flap can be used to improve the pitch-heave maneuvering coordination for enhanced flying qualities.

  20. Attitudes and exercise adherence: test of the Theories of Reasoned Action and Planned Behaviour.

    PubMed

    Smith, R A; Biddle, S J

    1999-04-01

    Three studies of exercise adherence and attitudes are reported that tested the Theory of Reasoned Action and the Theory of Planned Behaviour. In a prospective study of adherence to a private fitness club, structural equation modelling path analysis showed that attitudinal and social normative components of the Theory of Reasoned Action accounted for 13.1% of the variance in adherence 4 months later, although only social norm significantly predicted intention. In a second study, the Theory of Planned Behaviour was used to predict both physical activity and sedentary behaviour. Path analyses showed that attitude and perceived control, but not social norm, predicted total physical activity. Physical activity was predicted from intentions and control over sedentary behaviour. Finally, an intervention study with previously sedentary adults showed that intentions to be active measured at the start and end of a 10-week intervention were associated with the planned behaviour variables. A multivariate analysis of variance revealed no significant multivariate effects for time on the planned behaviour variables measured before and after intervention. Qualitative data provided evidence that participants had a positive experience on the intervention programme and supported the role of social normative factors in the adherence process.

  1. Longitudinal Consistency in Self-Reported Age of First Vaginal Intercourse Among Young Adults

    PubMed Central

    Goldberg, Shoshana K.; Haydon, Abigail A.; Herring, Amy H.; Halpern, Carolyn T.

    2014-01-01

    We examined consistency in self-reports of age at first vaginal sex among 9,399 male and female respondents who participated in Waves III and IV (separated by approximately 7 years) of the National Longitudinal Study of Adolescent Health (Add Health). Respondents were coded as consistent if they reported an age at first vaginal intercourse at Wave IV that was within 1 year of the age they reported at Wave III. Sociodemographic, behavioral, and cognitive predictors of consistency were examined using bivariate and multivariate logistic regression. Overall, 85.43% of respondents were able to provide consistent reports. Among both males and females, consistency was associated with age, years since first vaginal intercourse, race/ethnicity, and lifetime number of other-sex partners in final multivariate models. Respondents who were older and had more recently had their first sexual experience were more likely to be consistent. For females only, those who reported a history of non-parental, physically forced sex were less likely to be consistent. Most young adults consistently report age at first vaginal intercourse, supporting the credibility of retrospective self-reports about salient sexual events such as timing of first vaginal intercourse. PMID:23237101

  2. Multivariate Regression Analysis and Slaughter Livestock,

    DTIC Science & Technology

    AGRICULTURE, *ECONOMICS), (*MEAT, PRODUCTION), MULTIVARIATE ANALYSIS, REGRESSION ANALYSIS , ANIMALS, WEIGHT, COSTS, PREDICTIONS, STABILITY, MATHEMATICAL MODELS, STORAGE, BEEF, PORK, FOOD, STATISTICAL DATA, ACCURACY

  3. Virus replication as a phenotypic version of polynucleotide evolution.

    PubMed

    Antoneli, Fernando; Bosco, Francisco; Castro, Diogo; Janini, Luiz Mario

    2013-04-01

    In this paper, we revisit and adapt to viral evolution an approach based on the theory of branching process advanced by Demetrius et al. (Bull. Math. Biol. 46:239-262, 1985), in their study of polynucleotide evolution. By taking into account beneficial effects, we obtain a non-trivial multivariate generalization of their single-type branching process model. Perturbative techniques allows us to obtain analytical asymptotic expressions for the main global parameters of the model, which lead to the following rigorous results: (i) a new criterion for "no sure extinction", (ii) a generalization and proof, for this particular class of models, of the lethal mutagenesis criterion proposed by Bull et al. (J. Virol. 18:2930-2939, 2007), (iii) a new proposal for the notion of relaxation time with a quantitative prescription for its evaluation, (iv) the quantitative description of the evolution of the expected values in four distinct "stages": extinction threshold, lethal mutagenesis, stationary "equilibrium", and transient. Finally, based on these quantitative results, we are able to draw some qualitative conclusions.

  4. Search for the electroweak production of supersymmetric particles in $$\\sqrt{s}$$ = 8 TeV $pp$ collisions with the ATLAS detector

    DOE PAGES

    Aad, G.; Abbott, B.; Abdallah, J.; ...

    2016-03-04

    In this study, the ATLAS experiment has performed extensive searches for the electroweak production of charginos, neutralinos, and staus. This article summarizes and extends the search for electroweak supersymmetry with new analyses targeting scenarios not covered by previously published searches. New searches use vector-boson fusion production, initial-state radiation jets, and low-momentum lepton final states, as well as multivariate analysis techniques to improve the sensitivity to scenarios with small mass splittings and low-production cross sections. Results are based on 20 fb -1 of proton-proton collision data at √s = 8 TeV recorded with the ATLAS experiment at the Large Hadron Collider.more » No significant excess beyond Standard Model expectations is observed. The new and existing searches are combined and interpreted in terms of 95% confidence-level exclusion limits in simplified models, where a single production process and decay mode is assumed, as well as within phenomenological supersymmetric models.« less

  5. "Photographing money" task pricing

    NASA Astrophysics Data System (ADS)

    Jia, Zhongxiang

    2018-05-01

    "Photographing money" [1]is a self-service model under the mobile Internet. The task pricing is reasonable, related to the success of the commodity inspection. First of all, we analyzed the position of the mission and the membership, and introduced the factor of membership density, considering the influence of the number of members around the mission on the pricing. Multivariate regression of task location and membership density using MATLAB to establish the mathematical model of task pricing. At the same time, we can see from the life experience that membership reputation and the intensity of the task will also affect the pricing, and the data of the task success point is more reliable. Therefore, the successful point of the task is selected, and its reputation, task density, membership density and Multiple regression of task positions, according to which a nhew task pricing program. Finally, an objective evaluation is given of the advantages and disadvantages of the established model and solution method, and the improved method is pointed out.

  6. Developing a Hierarchical Model for the Spatial Analysis of PM10 Pollution Extremes in the Mexico City Metropolitan Area

    PubMed Central

    Aguirre-Salado, Alejandro Ivan; Vaquera-Huerta, Humberto; Aguirre-Salado, Carlos Arturo; Reyes-Mora, Silvia; Olvera-Cervantes, Ana Delia; Lancho-Romero, Guillermo Arturo; Soubervielle-Montalvo, Carlos

    2017-01-01

    We implemented a spatial model for analysing PM10 maxima across the Mexico City metropolitan area during the period 1995–2016. We assumed that these maxima follow a non-identical generalized extreme value (GEV) distribution and modeled the trend by introducing multivariate smoothing spline functions into the probability GEV distribution. A flexible, three-stage hierarchical Bayesian approach was developed to analyse the distribution of the PM10 maxima in space and time. We evaluated the statistical model’s performance by using a simulation study. The results showed strong evidence of a positive correlation between the PM10 maxima and the longitude and latitude. The relationship between time and the PM10 maxima was negative, indicating a decreasing trend over time. Finally, a high risk of PM10 maxima presenting levels above 1000 μg/m3 (return period: 25 yr) was observed in the northwestern region of the study area. PMID:28684720

  7. Characteristics and associations of pain intensity in patients referred to a specialist cancer pain clinic.

    PubMed

    Pina, Paulo; Sabri, Elham; Lawlor, Peter G

    2015-01-01

    Uncontrolled cancer pain (CP) may impair quality of life. Given the multidimensional nature of CP, its poor control is often attributed to poor assessment and classification. To determine the characteristics and associations of pain intensity in a specialist CP clinic. Consecutive patients referred to the CP clinic of the Portuguese Cancer Institute (Lisbon, Portugal) had standardized initial assessments and status documentation of the following: Brief Pain Inventory ratings for 'pain now' as the outcome variable; initial pain intensity (iPI) on a 0 to 10 scale; pain mechanism (using the Douleur Neuropathique 4 tool to assess neuropathic pain); episodic pain; Eastern Cooperative Oncology Group rating; oral morphine equivalent daily dose (MEDD); Hospital Anxiety Depression Scale and Emotional Thermometer scores; and cancer diagnosis, metastases, treatment and pain duration. Univariable analyses were conducted to test the association of independent variables with iPI. Variables with P<0.1 were entered into a multivariable regression model, using backward elimination and a cut-point of P=0.2 for final model selection. Of 371 participants, 285 (77%) had moderate (4 to 6) or severe (7 to 10) iPI. The initial median MEDD was relatively low (30 mg [range 20 mg to 60 mg]). In the multivariable model, higher income, Eastern Cooperative Oncology Group rating 3 to 4, cancer diagnosis (head and neck, genitourinary and gastrointestinal), adjuvant use and initial MEDD were associated with iPI (P<0.05). The model's R2 was 18.6, which explained only 19% of iPI variance. The diversity of factors associated with pain intensity and their limited explanation of its variance underscore the biopsychosocial complexity of CP. Adequacy of CP management warrants further exploration.

  8. Quality by design approach for viral clearance by protein a chromatography

    PubMed Central

    Zhang, Min; Miesegaes, George R; Lee, Michael; Coleman, Daniel; Yang, Bin; Trexler-Schmidt, Melody; Norling, Lenore; Lester, Philip; Brorson, Kurt A; Chen, Qi

    2014-01-01

    Protein A chromatography is widely used as a capture step in monoclonal antibody (mAb) purification processes. Antibodies and Fc fusion proteins can be efficiently purified from the majority of other complex components in harvested cell culture fluid (HCCF). Protein A chromatography is also capable of removing modest levels of viruses and is often validated for viral clearance. Historical data mining of Genentech and FDA/CDER databases systematically evaluated the removal of model viruses by Protein A chromatography. First, we found that for each model virus, removal by Protein A chromatography varies significantly across mAbs, while remains consistent within a specific mAb product, even across the acceptable ranges of the process parameters. In addition, our analysis revealed a correlation between retrovirus and parvovirus removal, with retrovirus data generally possessing a greater clearance factor. Finally, we describe a multivariate approach used to evaluate process parameter impacts on viral clearance, based on the levels of retrovirus-like particles (RVLP) present among process characterization study samples. It was shown that RVLP removal by Protein A is robust, that is, parameter effects were not observed across the ranges tested. Robustness of RVLP removal by Protein A also correlates with that for other model viruses such as X-MuLV, MMV, and SV40. The data supports that evaluating RVLP removal using process characterization study samples can establish multivariate acceptable ranges for virus removal by the protein A step for QbD. By measuring RVLP instead of a model retrovirus, it may alleviate some of the technical and economic challenges associated with performing large, design-of-experiment (DoE)—type virus spiking studies. This approach could also serve to provide useful insight when designing strategies to ensure viral safety in the manufacturing of a biopharmaceutical product. PMID:23860745

  9. DEPRESSION AND INCIDENT ALZHEIMER’S DISEASE: THE IMPACT OF DEPRESSION SEVERITY

    PubMed Central

    Gracia-García, Patricia; de-la-Cámara, Concepción; Santabárbara, Javier; Lopez-Anton, Raúl; Quintanilla, Miguel Angel; Ventura, Tirso; Marcos, Guillermo; Campayo, Antonio; Saz, Pedro; Lyketsos, Constantine; Lobo, Antonio

    2014-01-01

    Objective We test the hypothesis that clinically significant depression, severe depression in particular, increases the risk of Alzheimer’s Disease (AD). Design A longitudinal, three-wave epidemiological enquiry was implemented in a sample of individuals aged ≥55 years (n = 4,803) followed-up at 2.5 years and 4.5 years. Setting Population-based cohort drawn from the ZARADEMP Project, in Zaragoza, Spain. Participants Cognitively intact individuals at baseline (n = 3,864). Main outcome measures Depression was assessed by a standardized diagnostic interview (Geriatric Mental State, GMS-AGECAT). A panel of research psychiatrist diagnosed AD according to DSM-IV criteria. Fine and Gray multivariate regression model was used in the analysis, accounting for mortality. Results At baseline, clinically significant depression was diagnosed in 452 participants (11.7%). Among the depressed, 16.4% had severe depression. Seventy incident cases of AD were found at follow-up. Compared with non-depressed individuals, the incidence rate of AD was significantly higher in the depressed (incidence rate ratio, IRR = 1.91 (95%CI: 1.04–3.51) and particularly in the severely depressed (IRR = 3.59 (95%CI: 1.30–9.94). A consistent, significant association was observed between severe depression at baseline and incident AD in the multivariate model (hazard ratio, HR = 4.30 (95%CI: 1.39–13.33). Untreated depression was associated with incident AD in the unadjusted model, although in the final model this association was attenuated and non-significant. Conclusions Severe depression increases the risk of AD, even after controlling for the competing risk of death. This finding may stimulate studies about the effect of treating depression in relation to the risk of AD. PMID:23791538

  10. Univariate and multivariate spatial models of health facility utilisation for childhood fevers in an area on the coast of Kenya.

    PubMed

    Ouma, Paul O; Agutu, Nathan O; Snow, Robert W; Noor, Abdisalan M

    2017-09-18

    Precise quantification of health service utilisation is important for the estimation of disease burden and allocation of health resources. Current approaches to mapping health facility utilisation rely on spatial accessibility alone as the predictor. However, other spatially varying social, demographic and economic factors may affect the use of health services. The exclusion of these factors can lead to the inaccurate estimation of health facility utilisation. Here, we compare the accuracy of a univariate spatial model, developed only from estimated travel time, to a multivariate model that also includes relevant social, demographic and economic factors. A theoretical surface of travel time to the nearest public health facility was developed. These were assigned to each child reported to have had fever in the Kenya demographic and health survey of 2014 (KDHS 2014). The relationship of child treatment seeking for fever with travel time, household and individual factors from the KDHS2014 were determined using multilevel mixed modelling. Bayesian information criterion (BIC) and likelihood ratio test (LRT) tests were carried out to measure how selected factors improve parsimony and goodness of fit of the time model. Using the mixed model, a univariate spatial model of health facility utilisation was fitted using travel time as the predictor. The mixed model was also used to compute a multivariate spatial model of utilisation, using travel time and modelled surfaces of selected household and individual factors as predictors. The univariate and multivariate spatial models were then compared using the receiver operating area under the curve (AUC) and a percent correct prediction (PCP) test. The best fitting multivariate model had travel time, household wealth index and number of children in household as the predictors. These factors reduced BIC of the time model from 4008 to 2959, a change which was confirmed by the LRT test. Although there was a high correlation of the two modelled probability surfaces (Adj R 2  = 88%), the multivariate model had better AUC compared to the univariate model; 0.83 versus 0.73 and PCP 0.61 versus 0.45 values. Our study shows that a model that uses travel time, as well as household and individual-level socio-demographic factors, results in a more accurate estimation of use of health facilities for the treatment of childhood fever, compared to one that relies on only travel time.

  11. Adjustment of automatic control systems of production facilities at coal processing plants using multivariant physico- mathematical models

    NASA Astrophysics Data System (ADS)

    Evtushenko, V. F.; Myshlyaev, L. P.; Makarov, G. V.; Ivushkin, K. A.; Burkova, E. V.

    2016-10-01

    The structure of multi-variant physical and mathematical models of control system is offered as well as its application for adjustment of automatic control system (ACS) of production facilities on the example of coal processing plant.

  12. A simplified parsimonious higher order multivariate Markov chain model with new convergence condition

    NASA Astrophysics Data System (ADS)

    Wang, Chao; Yang, Chuan-sheng

    2017-09-01

    In this paper, we present a simplified parsimonious higher-order multivariate Markov chain model with new convergence condition. (TPHOMMCM-NCC). Moreover, estimation method of the parameters in TPHOMMCM-NCC is give. Numerical experiments illustrate the effectiveness of TPHOMMCM-NCC.

  13. Various forms of indexing HDMR for modelling multivariate classification problems

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Aksu, Çağrı; Tunga, M. Alper

    2014-12-10

    The Indexing HDMR method was recently developed for modelling multivariate interpolation problems. The method uses the Plain HDMR philosophy in partitioning the given multivariate data set into less variate data sets and then constructing an analytical structure through these partitioned data sets to represent the given multidimensional problem. Indexing HDMR makes HDMR be applicable to classification problems having real world data. Mostly, we do not know all possible class values in the domain of the given problem, that is, we have a non-orthogonal data structure. However, Plain HDMR needs an orthogonal data structure in the given problem to be modelled.more » In this sense, the main idea of this work is to offer various forms of Indexing HDMR to successfully model these real life classification problems. To test these different forms, several well-known multivariate classification problems given in UCI Machine Learning Repository were used and it was observed that the accuracy results lie between 80% and 95% which are very satisfactory.« less

  14. Multivariate random-parameters zero-inflated negative binomial regression model: an application to estimate crash frequencies at intersections.

    PubMed

    Dong, Chunjiao; Clarke, David B; Yan, Xuedong; Khattak, Asad; Huang, Baoshan

    2014-09-01

    Crash data are collected through police reports and integrated with road inventory data for further analysis. Integrated police reports and inventory data yield correlated multivariate data for roadway entities (e.g., segments or intersections). Analysis of such data reveals important relationships that can help focus on high-risk situations and coming up with safety countermeasures. To understand relationships between crash frequencies and associated variables, while taking full advantage of the available data, multivariate random-parameters models are appropriate since they can simultaneously consider the correlation among the specific crash types and account for unobserved heterogeneity. However, a key issue that arises with correlated multivariate data is the number of crash-free samples increases, as crash counts have many categories. In this paper, we describe a multivariate random-parameters zero-inflated negative binomial (MRZINB) regression model for jointly modeling crash counts. The full Bayesian method is employed to estimate the model parameters. Crash frequencies at urban signalized intersections in Tennessee are analyzed. The paper investigates the performance of MZINB and MRZINB regression models in establishing the relationship between crash frequencies, pavement conditions, traffic factors, and geometric design features of roadway intersections. Compared to the MZINB model, the MRZINB model identifies additional statistically significant factors and provides better goodness of fit in developing the relationships. The empirical results show that MRZINB model possesses most of the desirable statistical properties in terms of its ability to accommodate unobserved heterogeneity and excess zero counts in correlated data. Notably, in the random-parameters MZINB model, the estimated parameters vary significantly across intersections for different crash types. Copyright © 2014 Elsevier Ltd. All rights reserved.

  15. Insights on multivariate updates of physical and biogeochemical ocean variables using an Ensemble Kalman Filter and an idealized model of upwelling

    NASA Astrophysics Data System (ADS)

    Yu, Liuqian; Fennel, Katja; Bertino, Laurent; Gharamti, Mohamad El; Thompson, Keith R.

    2018-06-01

    Effective data assimilation methods for incorporating observations into marine biogeochemical models are required to improve hindcasts, nowcasts and forecasts of the ocean's biogeochemical state. Recent assimilation efforts have shown that updating model physics alone can degrade biogeochemical fields while only updating biogeochemical variables may not improve a model's predictive skill when the physical fields are inaccurate. Here we systematically investigate whether multivariate updates of physical and biogeochemical model states are superior to only updating either physical or biogeochemical variables. We conducted a series of twin experiments in an idealized ocean channel that experiences wind-driven upwelling. The forecast model was forced with biased wind stress and perturbed biogeochemical model parameters compared to the model run representing the "truth". Taking advantage of the multivariate nature of the deterministic Ensemble Kalman Filter (DEnKF), we assimilated different combinations of synthetic physical (sea surface height, sea surface temperature and temperature profiles) and biogeochemical (surface chlorophyll and nitrate profiles) observations. We show that when biogeochemical and physical properties are highly correlated (e.g., thermocline and nutricline), multivariate updates of both are essential for improving model skill and can be accomplished by assimilating either physical (e.g., temperature profiles) or biogeochemical (e.g., nutrient profiles) observations. In our idealized domain, the improvement is largely due to a better representation of nutrient upwelling, which results in a more accurate nutrient input into the euphotic zone. In contrast, assimilating surface chlorophyll improves the model state only slightly, because surface chlorophyll contains little information about the vertical density structure. We also show that a degradation of the correlation between observed subsurface temperature and nutrient fields, which has been an issue in several previous assimilation studies, can be reduced by multivariate updates of physical and biogeochemical fields.

  16. Extended behavioural modelling of FET and lattice-mismatched HEMT devices

    NASA Astrophysics Data System (ADS)

    Khawam, Yahya; Albasha, Lutfi

    2017-07-01

    This study presents an improved large signal model that can be used for high electron mobility transistors (HEMTs) and field effect transistors using measurement-based behavioural modelling techniques. The steps for accurate large and small signal modelling for transistor are also discussed. The proposed DC model is based on the Fager model since it compensates between the number of model's parameters and accuracy. The objective is to increase the accuracy of the drain-source current model with respect to any change in gate or drain voltages. Also, the objective is to extend the improved DC model to account for soft breakdown and kink effect found in some variants of HEMT devices. A hybrid Newton's-Genetic algorithm is used in order to determine the unknown parameters in the developed model. In addition to accurate modelling of a transistor's DC characteristics, the complete large signal model is modelled using multi-bias s-parameter measurements. The way that the complete model is performed is by using a hybrid multi-objective optimisation technique (Non-dominated Sorting Genetic Algorithm II) and local minimum search (multivariable Newton's method) for parasitic elements extraction. Finally, the results of DC modelling and multi-bias s-parameters modelling are presented, and three-device modelling recommendations are discussed.

  17. A Multivariate Model of Stakeholder Preference for Lethal Cat Management

    PubMed Central

    Wald, Dara M.; Jacobson, Susan K.

    2014-01-01

    Identifying stakeholder beliefs and attitudes is critical for resolving management conflicts. Debate over outdoor cat management is often described as a conflict between two groups, environmental advocates and animal welfare advocates, but little is known about the variables predicting differences among these critical stakeholder groups. We administered a mail survey to randomly selected stakeholders representing both of these groups (n = 1,596) in Florida, where contention over the management of outdoor cats has been widespread. We used a structural equation model to evaluate stakeholder intention to support non-lethal management. The cognitive hierarchy model predicted that values influenced beliefs, which predicted general and specific attitudes, which in turn, influenced behavioral intentions. We posited that specific attitudes would mediate the effect of general attitudes, beliefs, and values on management support. Model fit statistics suggested that the final model fit the data well (CFI = 0.94, RMSEA = 0.062). The final model explained 74% of the variance in management support, and positive attitudes toward lethal management (humaneness) had the largest direct effect on management support. Specific attitudes toward lethal management and general attitudes toward outdoor cats mediated the relationship between positive (p<0.05) and negative cat-related impact beliefs (p<0.05) and support for management. These results supported the specificity hypothesis and the use of the cognitive hierarchy to assess stakeholder intention to support non-lethal cat management. Our findings suggest that stakeholders can simultaneously perceive both positive and negative beliefs about outdoor cats, which influence attitudes toward and support for non-lethal management. PMID:24736744

  18. A multivariate model of stakeholder preference for lethal cat management.

    PubMed

    Wald, Dara M; Jacobson, Susan K

    2014-01-01

    Identifying stakeholder beliefs and attitudes is critical for resolving management conflicts. Debate over outdoor cat management is often described as a conflict between two groups, environmental advocates and animal welfare advocates, but little is known about the variables predicting differences among these critical stakeholder groups. We administered a mail survey to randomly selected stakeholders representing both of these groups (n=1,596) in Florida, where contention over the management of outdoor cats has been widespread. We used a structural equation model to evaluate stakeholder intention to support non-lethal management. The cognitive hierarchy model predicted that values influenced beliefs, which predicted general and specific attitudes, which in turn, influenced behavioral intentions. We posited that specific attitudes would mediate the effect of general attitudes, beliefs, and values on management support. Model fit statistics suggested that the final model fit the data well (CFI=0.94, RMSEA=0.062). The final model explained 74% of the variance in management support, and positive attitudes toward lethal management (humaneness) had the largest direct effect on management support. Specific attitudes toward lethal management and general attitudes toward outdoor cats mediated the relationship between positive (p<0.05) and negative cat-related impact beliefs (p<0.05) and support for management. These results supported the specificity hypothesis and the use of the cognitive hierarchy to assess stakeholder intention to support non-lethal cat management. Our findings suggest that stakeholders can simultaneously perceive both positive and negative beliefs about outdoor cats, which influence attitudes toward and support for non-lethal management.

  19. Multivariate analysis of gamma spectra to characterize used nuclear fuel

    DOE PAGES

    Coble, Jamie; Orton, Christopher; Schwantes, Jon

    2017-01-17

    The Multi-Isotope Process (MIP) Monitor provides an efficient means to monitor the process conditions in used nuclear fuel reprocessing facilities to support process verification and validation. The MIP Monitor applies multivariate analysis to gamma spectroscopy of key stages in the reprocessing stream in order to detect small changes in the gamma spectrum, which may indicate changes in process conditions. This research extends the MIP Monitor by characterizing a used fuel sample after initial dissolution according to the type of reactor of origin (pressurized or boiling water reactor; PWR and BWR, respectively), initial enrichment, burn up, and cooling time. Simulated gammamore » spectra were used in this paper to develop and test three fuel characterization algorithms. The classification and estimation models employed are based on the partial least squares regression (PLS) algorithm. A PLS discriminate analysis model was developed which perfectly classified reactor type for the three PWR and three BWR reactor designs studied. Locally weighted PLS models were fitted on-the-fly to estimate the remaining fuel characteristics. For the simulated gamma spectra considered, burn up was predicted with 0.1% root mean squared percent error (RMSPE) and both cooling time and initial enrichment with approximately 2% RMSPE. Finally, this approach to automated fuel characterization can be used to independently verify operator declarations of used fuel characteristics and to inform the MIP Monitor anomaly detection routines at later stages of the fuel reprocessing stream to improve sensitivity to changes in operational parameters that may indicate issues with operational control or malicious activities.« less

  20. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Piepel, Gregory F.; Pasquini, Benedetta; Cooley, Scott K.

    In recent years, multivariate optimization has played an increasing role in analytical method development. ICH guidelines recommend using statistical design of experiments to identify the design space, in which multivariate combinations of composition variables and process variables have been demonstrated to provide quality results. Considering a microemulsion electrokinetic chromatography method (MEEKC), the performance of the electrophoretic run depends on the proportions of mixture components (MCs) of the microemulsion and on the values of process variables (PVs). In the present work, for the first time in the literature, a mixture-process variable (MPV) approach was applied to optimize a MEEKC method formore » the analysis of coenzyme Q10 (Q10), ascorbic acid (AA), and folic acid (FA) contained in nutraceuticals. The MCs (buffer, surfactant-cosurfactant, oil) and the PVs (voltage, buffer concentration, buffer pH) were simultaneously changed according to a MPV experimental design. A 62-run MPV design was generated using the I-optimality criterion, assuming a 46-term MPV model allowing for special-cubic blending of the MCs, quadratic effects of the PVs, and some MC-PV interactions. The obtained data were used to develop MPV models that express the performance of an electrophoretic run (measured as peak efficiencies of Q10, AA, and FA) in terms of the MCs and PVs. Contour and perturbation plots were drawn for each of the responses. Finally, the MPV models and criteria for the peak efficiencies were used to develop the design space and an optimal subregion (i.e., the settings of the mixture MCs and PVs that satisfy the respective criteria), as well as a unique optimal combination of MCs and PVs.« less

  1. Multivariate analysis of gamma spectra to characterize used nuclear fuel

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Coble, Jamie; Orton, Christopher; Schwantes, Jon

    The Multi-Isotope Process (MIP) Monitor provides an efficient means to monitor the process conditions in used nuclear fuel reprocessing facilities to support process verification and validation. The MIP Monitor applies multivariate analysis to gamma spectroscopy of key stages in the reprocessing stream in order to detect small changes in the gamma spectrum, which may indicate changes in process conditions. This research extends the MIP Monitor by characterizing a used fuel sample after initial dissolution according to the type of reactor of origin (pressurized or boiling water reactor; PWR and BWR, respectively), initial enrichment, burn up, and cooling time. Simulated gammamore » spectra were used in this paper to develop and test three fuel characterization algorithms. The classification and estimation models employed are based on the partial least squares regression (PLS) algorithm. A PLS discriminate analysis model was developed which perfectly classified reactor type for the three PWR and three BWR reactor designs studied. Locally weighted PLS models were fitted on-the-fly to estimate the remaining fuel characteristics. For the simulated gamma spectra considered, burn up was predicted with 0.1% root mean squared percent error (RMSPE) and both cooling time and initial enrichment with approximately 2% RMSPE. Finally, this approach to automated fuel characterization can be used to independently verify operator declarations of used fuel characteristics and to inform the MIP Monitor anomaly detection routines at later stages of the fuel reprocessing stream to improve sensitivity to changes in operational parameters that may indicate issues with operational control or malicious activities.« less

  2. Monthly income, standard of living and erectile function in late life.

    PubMed

    Cheng, J Y W; Ng, E M L; Ko, J S N; Chen, R Y L

    2007-01-01

    This was a cross-sectional study that enrolled 160 men aged 50 and above who were sexually active (sexual intercourse in the preceding 6 months) from a large primary care treatment centre. The subjects of interest were elderly aged 65 and above, and men aged 50-65 were used for comparison. The overall response rate was 66.9%. The men who participated were generally more affluent. Standard of living was measured by the presence of maid and housing type. Erectile function (EF) score was significantly higher in those who hired maids (P=0.02). Housing type was not associated with erectile dysfunction (ED). In Model A (included both monthly income and education), significant non-parametric correlations were found between monthly income and EF, intercourse satisfaction (IS), orgasmic function (OF) and sexual desire (SD) domains. After statistical adjustments, only EF (P<0.01) and IS (P=0.04) remained positively associated with monthly income. OF was negatively associated with age (P<0.01) and diabetes (P=0.04), whereas SD was negatively associated with age (P<0.01) in the multivariate analysis. Overall satisfaction was not significantly associated with any factor. In Model B (excluded monthly income from analysis), education attainment was positively associated with OF (P=0.04), but was not significant after adjustment for multiple testing. In the final multivariate model, only monthly income (P<0.01) and age (P<0.01), but not education (P=0.47), remained significantly associated with EF. This study suggests the influence of social determinants on EF and that this influence can extend into late life.

  3. Predicting frequent emergency department visits among children with asthma using EHR data.

    PubMed

    Das, Lala T; Abramson, Erika L; Stone, Anne E; Kondrich, Janienne E; Kern, Lisa M; Grinspan, Zachary M

    2017-07-01

    For children with asthma, emergency department (ED) visits are common, expensive, and often avoidable. Though several factors are associated with ED use (demographics, comorbidities, insurance, medications), its predictability using electronic health record (EHR) data is understudied. We used a retrospective cohort study design and EHR data from one center to examine the relationship of patient factors in 1 year (2013) and the likelihood of frequent ED use (≥2 visits) in the following year (2014), using bivariate and multivariable statistics. We applied and compared several machine-learning algorithms to predict frequent ED use, then selected a model based on accuracy, parsimony, and interpretability. We identified 2691 children. In bivariate analyses, future frequent ED use was associated with demographics, co-morbidities, insurance status, medication history, and use of healthcare resources. Machine learning algorithms had very good AUC (area under the curve) values [0.66-0.87], though fair PPV (positive predictive value) [48-70%] and poor sensitivity [16-27%]. Our final multivariable logistic regression model contained two variables: insurance status and prior ED use. For publicly insured patients, the odds of frequent ED use were 3.1 [2.2-4.5] times that of privately insured patients. Publicly insured patients with 4+ ED visits and privately insured patients with 6+ ED visits in a year had ≥50% probability of frequent ED use the following year. The model had an AUC of 0.86, PPV of 56%, and sensitivity of 23%. Among children with asthma, prior frequent ED use and insurance status strongly predict future ED use. © 2017 Wiley Periodicals, Inc.

  4. Kinetic Risk Factors of Running-Related Injuries in Female Recreational Runners.

    PubMed

    Napier, Christopher; MacLean, Christopher L; Maurer, Jessica; Taunton, Jack E; Hunt, Michael A

    2018-05-30

    Our objective was to prospectively investigate the association of kinetic variables with running-related injury (RRI) risk. Seventy-four healthy female recreational runners ran on an instrumented treadmill while 3D kinetic and kinematic data were collected. Kinetic outcomes were vertical impact transient, average vertical loading rate, instantaneous vertical loading rate, active peak, vertical impulse, and peak braking force (PBF). Participants followed a 15-week half-marathon training program. Exposure time (hours of running) was calculated from start of program until onset of injury, loss to follow-up, or end of program. After converting kinetic variables from continuous to ordinal variables based on tertiles, Cox proportional hazard models with competing risks were fit for each variable independently, before analysis in a forward stepwise multivariable model. Sixty-five participants were included in the final analysis, with a 33.8% injury rate. PBF was the only kinetic variable that was a significant predictor of RRI. Runners in the highest tertile (PBF <-0.27 BW) were injured at 5.08 times the rate of those in the middle tertile and 7.98 times the rate of those in the lowest tertile. When analyzed in the multivariable model, no kinetic variables made a significant contribution to predicting injury beyond what had already been accounted for by PBF alone. Findings from this study suggest PBF is associated with a significantly higher injury hazard ratio in female recreational runners and should be considered as a target for gait retraining interventions. This article is protected by copyright. All rights reserved. This article is protected by copyright. All rights reserved.

  5. Multivariate generalized multifactor dimensionality reduction to detect gene-gene interactions

    PubMed Central

    2013-01-01

    Background Recently, one of the greatest challenges in genome-wide association studies is to detect gene-gene and/or gene-environment interactions for common complex human diseases. Ritchie et al. (2001) proposed multifactor dimensionality reduction (MDR) method for interaction analysis. MDR is a combinatorial approach to reduce multi-locus genotypes into high-risk and low-risk groups. Although MDR has been widely used for case-control studies with binary phenotypes, several extensions have been proposed. One of these methods, a generalized MDR (GMDR) proposed by Lou et al. (2007), allows adjusting for covariates and applying to both dichotomous and continuous phenotypes. GMDR uses the residual score of a generalized linear model of phenotypes to assign either high-risk or low-risk group, while MDR uses the ratio of cases to controls. Methods In this study, we propose multivariate GMDR, an extension of GMDR for multivariate phenotypes. Jointly analysing correlated multivariate phenotypes may have more power to detect susceptible genes and gene-gene interactions. We construct generalized estimating equations (GEE) with multivariate phenotypes to extend generalized linear models. Using the score vectors from GEE we discriminate high-risk from low-risk groups. We applied the multivariate GMDR method to the blood pressure data of the 7,546 subjects from the Korean Association Resource study: systolic blood pressure (SBP) and diastolic blood pressure (DBP). We compare the results of multivariate GMDR for SBP and DBP to the results from separate univariate GMDR for SBP and DBP, respectively. We also applied the multivariate GMDR method to the repeatedly measured hypertension status from 5,466 subjects and compared its result with those of univariate GMDR at each time point. Results Results from the univariate GMDR and multivariate GMDR in two-locus model with both blood pressures and hypertension phenotypes indicate best combinations of SNPs whose interaction has significant association with risk for high blood pressures or hypertension. Although the test balanced accuracy (BA) of multivariate analysis was not always greater than that of univariate analysis, the multivariate BAs were more stable with smaller standard deviations. Conclusions In this study, we have developed multivariate GMDR method using GEE approach. It is useful to use multivariate GMDR with correlated multiple phenotypes of interests. PMID:24565370

  6. Estimation and model selection of semiparametric multivariate survival functions under general censorship.

    PubMed

    Chen, Xiaohong; Fan, Yanqin; Pouzo, Demian; Ying, Zhiliang

    2010-07-01

    We study estimation and model selection of semiparametric models of multivariate survival functions for censored data, which are characterized by possibly misspecified parametric copulas and nonparametric marginal survivals. We obtain the consistency and root- n asymptotic normality of a two-step copula estimator to the pseudo-true copula parameter value according to KLIC, and provide a simple consistent estimator of its asymptotic variance, allowing for a first-step nonparametric estimation of the marginal survivals. We establish the asymptotic distribution of the penalized pseudo-likelihood ratio statistic for comparing multiple semiparametric multivariate survival functions subject to copula misspecification and general censorship. An empirical application is provided.

  7. Estimation and model selection of semiparametric multivariate survival functions under general censorship

    PubMed Central

    Chen, Xiaohong; Fan, Yanqin; Pouzo, Demian; Ying, Zhiliang

    2013-01-01

    We study estimation and model selection of semiparametric models of multivariate survival functions for censored data, which are characterized by possibly misspecified parametric copulas and nonparametric marginal survivals. We obtain the consistency and root-n asymptotic normality of a two-step copula estimator to the pseudo-true copula parameter value according to KLIC, and provide a simple consistent estimator of its asymptotic variance, allowing for a first-step nonparametric estimation of the marginal survivals. We establish the asymptotic distribution of the penalized pseudo-likelihood ratio statistic for comparing multiple semiparametric multivariate survival functions subject to copula misspecification and general censorship. An empirical application is provided. PMID:24790286

  8. Usual Dietary Intakes: SAS Macros for Fitting Multivariate Measurement Error Models & Estimating Multivariate Usual Intake Distributions

    Cancer.gov

    The following SAS macros can be used to create a multivariate usual intake distribution for multiple dietary components that are consumed nearly every day or episodically. A SAS macro for performing balanced repeated replication (BRR) variance estimation is also included.

  9. Comparative Robustness of Recent Methods for Analyzing Multivariate Repeated Measures Designs

    ERIC Educational Resources Information Center

    Seco, Guillermo Vallejo; Gras, Jaime Arnau; Garcia, Manuel Ato

    2007-01-01

    This study evaluated the robustness of two recent methods for analyzing multivariate repeated measures when the assumptions of covariance homogeneity and multivariate normality are violated. Specifically, the authors' work compares the performance of the modified Brown-Forsythe (MBF) procedure and the mixed-model procedure adjusted by the…

  10. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Sirunyan, Albert M; et al.

    A measurement is presented of the associated production of a single top quark and a W boson in proton-proton collisions atmore » $$\\sqrt{s}=$$ 13 TeV by the CMS Collaboration at the CERN LHC. The data collected corresponds to an integrated luminosity of 35.9 fb$$^{-1}$$. The measurement is performed using events with one electron and one muon in the final state along with at least one jet originated from a bottom quark. A multivariate discriminant, exploiting the kinematic properties of the events, is used to separate the signal from the dominant $$\\mathrm{t\\overline{t}}$$ background. The measured cross section of 63.1 $$\\pm$$ 1.8 (stat) $$\\pm$$ 6.4 (syst) $$\\pm$$ 2.1 (lum) pb is in agreement with the standard model expectation.« less

  11. Variables related to tobacco use among young adults: are there differences between males and females?

    PubMed

    Pederson, Linda L; Koval, John J; Chan, Stella S H; Zhang, Xiaohe

    2007-02-01

    We sought to determine the association of four categories (chunks) of variables: (1) demographic characteristics, (2) family and friends smoking and other drug use, (3) psychosocial factors and attitude, and (4) lifestyle factors to current smoking as compared to never smoking among Canadian young adults. A cohort of 1270 young adults, followed for 10 years, completed a self-administered questionnaire. In multivariable analyses, the best final model for both genders did not include the psychosocial and attitudinal categories, but did contain variables in the demographic, family and friends, and lifestyle categories. Interventions for reducing smoking among young adults may be similar for males and females, a conclusion that differs from conclusions based on findings from younger age groups.

  12. Sequential deconvolution from wave-front sensing using bivariate simplex splines

    NASA Astrophysics Data System (ADS)

    Guo, Shiping; Zhang, Rongzhi; Li, Jisheng; Zou, Jianhua; Xu, Rong; Liu, Changhai

    2015-05-01

    Deconvolution from wave-front sensing (DWFS) is an imaging compensation technique for turbulence degraded images based on simultaneous recording of short exposure images and wave-front sensor data. This paper employs the multivariate splines method for the sequential DWFS: a bivariate simplex splines based average slopes measurement model is built firstly for Shack-Hartmann wave-front sensor; next, a well-conditioned least squares estimator for the spline coefficients is constructed using multiple Shack-Hartmann measurements; then, the distorted wave-front is uniquely determined by the estimated spline coefficients; the object image is finally obtained by non-blind deconvolution processing. Simulated experiments in different turbulence strength show that our method performs superior image restoration results and noise rejection capability especially when extracting the multidirectional phase derivatives.

  13. Exploring intentions to discriminate against patients living with HIV/AIDS among future healthcare providers in Malaysia

    PubMed Central

    Earnshaw, Valerie A.; Jin, Harry; Wickersham, Jeffrey; Kamarulzaman, Adeeba; John, Jacob; Altice, Frederick L.

    2015-01-01

    OBJECTIVES Stigma towards people living with HIV/AIDS (PLWHA) is strong in Malaysia. Although stigma has been understudied, it may be a barrier to treating the approximately 81 000 Malaysian PLWHA. The current study explores correlates of intentions to discriminate against PLWHA among medical and dental students, the future healthcare providers of Malaysia. METHODS An online, cross-sectional survey of 1296 medical and dental students was conducted in 2012 at seven Malaysian universities; 1165 (89.9%) completed the survey and were analysed. Sociodemographic characteristics, stigma-related constructs and intentions to discriminate against PLWHA were measured. Linear mixed models were conducted, controlling for clustering by university. RESULTS The final multivariate model demonstrated that students who intended to discriminate more against PLWHA were female, less advanced in their training, and studying dentistry. They further endorsed more negative attitudes towards PLWHA, internalised greater HIV-related shame, reported more HIV-related fear and disagreed more strongly that PLWHA deserve good care. The final model accounted for 38% of the variance in discrimination intent, with 10% accounted for by sociodemographic characteristics and 28% accounted for by stigma-related constructs. CONCLUSIONS It is critical to reduce stigma among medical and dental students to eliminate intentions to discriminate and achieve equitable care for Malaysian PLWHA. Stigma-reduction interventions should be multipronged, addressing attitudes, internalised shame, fear and perceptions of deservingness of care. PMID:24666546

  14. Exploring intentions to discriminate against patients living with HIV/AIDS among future healthcare providers in Malaysia.

    PubMed

    Earnshaw, Valerie A; Jin, Harry; Wickersham, Jeffrey; Kamarulzaman, Adeeba; John, Jacob; Altice, Frederick L

    2014-06-01

    Stigma towards people living with HIV/AIDS (PLWHA) is strong in Malaysia. Although stigma has been understudied, it may be a barrier to treating the approximately 81 000 Malaysian PLWHA. The current study explores correlates of intentions to discriminate against PLWHA among medical and dental students, the future healthcare providers of Malaysia. An online, cross-sectional survey of 1296 medical and dental students was conducted in 2012 at seven Malaysian universities; 1165 (89.9%) completed the survey and were analysed. Socio-demographic characteristics, stigma-related constructs and intentions to discriminate against PLWHA were measured. Linear mixed models were conducted, controlling for clustering by university. The final multivariate model demonstrated that students who intended to discriminate more against PLWHA were female, less advanced in their training, and studying dentistry. They further endorsed more negative attitudes towards PLWHA, internalised greater HIV-related shame, reported more HIV-related fear and disagreed more strongly that PLWHA deserve good care. The final model accounted for 38% of the variance in discrimination intent, with 10% accounted for by socio-demographic characteristics and 28% accounted for by stigma-related constructs. It is critical to reduce stigma among medical and dental students to eliminate intentions to discriminate and achieve equitable care for Malaysian PLWHA. Stigma-reduction interventions should be multipronged, addressing attitudes, internalised shame, fear and perceptions of deservingness of care. © 2014 John Wiley & Sons Ltd.

  15. Analysis/forecast experiments with a multivariate statistical analysis scheme using FGGE data

    NASA Technical Reports Server (NTRS)

    Baker, W. E.; Bloom, S. C.; Nestler, M. S.

    1985-01-01

    A three-dimensional, multivariate, statistical analysis method, optimal interpolation (OI) is described for modeling meteorological data from widely dispersed sites. The model was developed to analyze FGGE data at the NASA-Goddard Laboratory of Atmospherics. The model features a multivariate surface analysis over the oceans, including maintenance of the Ekman balance and a geographically dependent correlation function. Preliminary comparisons are made between the OI model and similar schemes employed at the European Center for Medium Range Weather Forecasts and the National Meteorological Center. The OI scheme is used to provide input to a GCM, and model error correlations are calculated for forecasts of 500 mb vertical water mixing ratios and the wind profiles. Comparisons are made between the predictions and measured data. The model is shown to be as accurate as a successive corrections model out to 4.5 days.

  16. Multivariate Bayesian modeling of known and unknown causes of events--an application to biosurveillance.

    PubMed

    Shen, Yanna; Cooper, Gregory F

    2012-09-01

    This paper investigates Bayesian modeling of known and unknown causes of events in the context of disease-outbreak detection. We introduce a multivariate Bayesian approach that models multiple evidential features of every person in the population. This approach models and detects (1) known diseases (e.g., influenza and anthrax) by using informative prior probabilities and (2) unknown diseases (e.g., a new, highly contagious respiratory virus that has never been seen before) by using relatively non-informative prior probabilities. We report the results of simulation experiments which support that this modeling method can improve the detection of new disease outbreaks in a population. A contribution of this paper is that it introduces a multivariate Bayesian approach for jointly modeling both known and unknown causes of events. Such modeling has general applicability in domains where the space of known causes is incomplete. Copyright © 2010 Elsevier Ireland Ltd. All rights reserved.

  17. FACTOR ANALYTIC MODELS OF CLUSTERED MULTIVARIATE DATA WITH INFORMATIVE CENSORING

    EPA Science Inventory

    This paper describes a general class of factor analytic models for the analysis of clustered multivariate data in the presence of informative missingness. We assume that there are distinct sets of cluster-level latent variables related to the primary outcomes and to the censorin...

  18. An Examination of the Domain of Multivariable Functions Using the Pirie-Kieren Model

    ERIC Educational Resources Information Center

    Sengul, Sare; Yildiz, Sevda Goktepe

    2016-01-01

    The aim of this study is to employ the Pirie-Kieren model so as to examine the understandings relating to the domain of multivariable functions held by primary school mathematics preservice teachers. The data obtained was categorized according to Pirie-Kieren model and demonstrated visually in tables and bar charts. The study group consisted of…

  19. Multivariate regression model for predicting yields of grade lumber from yellow birch sawlogs

    Treesearch

    Andrew F. Howard; Daniel A. Yaussy

    1986-01-01

    A multivariate regression model was developed to predict green board-foot yields for the common grades of factory lumber processed from yellow birch factory-grade logs. The model incorporates the standard log measurements of scaling diameter, length, proportion of scalable defects, and the assigned USDA Forest Service log grade. Differences in yields between band and...

  20. A Multivariate Model for the Meta-Analysis of Study Level Survival Data at Multiple Times

    ERIC Educational Resources Information Center

    Jackson, Dan; Rollins, Katie; Coughlin, Patrick

    2014-01-01

    Motivated by our meta-analytic dataset involving survival rates after treatment for critical leg ischemia, we develop and apply a new multivariate model for the meta-analysis of study level survival data at multiple times. Our data set involves 50 studies that provide mortality rates at up to seven time points, which we model simultaneously, and…

  1. A generalized K statistic for estimating phylogenetic signal from shape and other high-dimensional multivariate data.

    PubMed

    Adams, Dean C

    2014-09-01

    Phylogenetic signal is the tendency for closely related species to display similar trait values due to their common ancestry. Several methods have been developed for quantifying phylogenetic signal in univariate traits and for sets of traits treated simultaneously, and the statistical properties of these approaches have been extensively studied. However, methods for assessing phylogenetic signal in high-dimensional multivariate traits like shape are less well developed, and their statistical performance is not well characterized. In this article, I describe a generalization of the K statistic of Blomberg et al. that is useful for quantifying and evaluating phylogenetic signal in highly dimensional multivariate data. The method (K(mult)) is found from the equivalency between statistical methods based on covariance matrices and those based on distance matrices. Using computer simulations based on Brownian motion, I demonstrate that the expected value of K(mult) remains at 1.0 as trait variation among species is increased or decreased, and as the number of trait dimensions is increased. By contrast, estimates of phylogenetic signal found with a squared-change parsimony procedure for multivariate data change with increasing trait variation among species and with increasing numbers of trait dimensions, confounding biological interpretations. I also evaluate the statistical performance of hypothesis testing procedures based on K(mult) and find that the method displays appropriate Type I error and high statistical power for detecting phylogenetic signal in high-dimensional data. Statistical properties of K(mult) were consistent for simulations using bifurcating and random phylogenies, for simulations using different numbers of species, for simulations that varied the number of trait dimensions, and for different underlying models of trait covariance structure. Overall these findings demonstrate that K(mult) provides a useful means of evaluating phylogenetic signal in high-dimensional multivariate traits. Finally, I illustrate the utility of the new approach by evaluating the strength of phylogenetic signal for head shape in a lineage of Plethodon salamanders. © The Author(s) 2014. Published by Oxford University Press, on behalf of the Society of Systematic Biologists. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  2. Analytical framework for reconstructing heterogeneous environmental variables from mammal community structure.

    PubMed

    Louys, Julien; Meloro, Carlo; Elton, Sarah; Ditchfield, Peter; Bishop, Laura C

    2015-01-01

    We test the performance of two models that use mammalian communities to reconstruct multivariate palaeoenvironments. While both models exploit the correlation between mammal communities (defined in terms of functional groups) and arboreal heterogeneity, the first uses a multiple multivariate regression of community structure and arboreal heterogeneity, while the second uses a linear regression of the principal components of each ecospace. The success of these methods means the palaeoenvironment of a particular locality can be reconstructed in terms of the proportions of heavy, moderate, light, and absent tree canopy cover. The linear regression is less biased, and more precisely and accurately reconstructs heavy tree canopy cover than the multiple multivariate model. However, the multiple multivariate model performs better than the linear regression for all other canopy cover categories. Both models consistently perform better than randomly generated reconstructions. We apply both models to the palaeocommunity of the Upper Laetolil Beds, Tanzania. Our reconstructions indicate that there was very little heavy tree cover at this site (likely less than 10%), with the palaeo-landscape instead comprising a mixture of light and absent tree cover. These reconstructions help resolve the previous conflicting palaeoecological reconstructions made for this site. Copyright © 2014 Elsevier Ltd. All rights reserved.

  3. The NLS-Based Nonlinear Grey Multivariate Model for Forecasting Pollutant Emissions in China.

    PubMed

    Pei, Ling-Ling; Li, Qin; Wang, Zheng-Xin

    2018-03-08

    The relationship between pollutant discharge and economic growth has been a major research focus in environmental economics. To accurately estimate the nonlinear change law of China's pollutant discharge with economic growth, this study establishes a transformed nonlinear grey multivariable (TNGM (1, N )) model based on the nonlinear least square (NLS) method. The Gauss-Seidel iterative algorithm was used to solve the parameters of the TNGM (1, N ) model based on the NLS basic principle. This algorithm improves the precision of the model by continuous iteration and constantly approximating the optimal regression coefficient of the nonlinear model. In our empirical analysis, the traditional grey multivariate model GM (1, N ) and the NLS-based TNGM (1, N ) models were respectively adopted to forecast and analyze the relationship among wastewater discharge per capita (WDPC), and per capita emissions of SO₂ and dust, alongside GDP per capita in China during the period 1996-2015. Results indicated that the NLS algorithm is able to effectively help the grey multivariable model identify the nonlinear relationship between pollutant discharge and economic growth. The results show that the NLS-based TNGM (1, N ) model presents greater precision when forecasting WDPC, SO₂ emissions and dust emissions per capita, compared to the traditional GM (1, N ) model; WDPC indicates a growing tendency aligned with the growth of GDP, while the per capita emissions of SO₂ and dust reduce accordingly.

  4. Development of the statistical ARIMA model: an application for predicting the upcoming of MJO index

    NASA Astrophysics Data System (ADS)

    Hermawan, Eddy; Nurani Ruchjana, Budi; Setiawan Abdullah, Atje; Gede Nyoman Mindra Jaya, I.; Berliana Sipayung, Sinta; Rustiana, Shailla

    2017-10-01

    This study is mainly concerned in development one of the most important equatorial atmospheric phenomena that we call as the Madden Julian Oscillation (MJO) which having strong impacts to the extreme rainfall anomalies over the Indonesian Maritime Continent (IMC). In this study, we focused to the big floods over Jakarta and surrounded area that suspecting caused by the impacts of MJO. We concentrated to develop the MJO index using the statistical model that we call as Box-Jenkis (ARIMA) ini 1996, 2002, and 2007, respectively. They are the RMM (Real Multivariate MJO) index as represented by RMM1 and RMM2, respectively. There are some steps to develop that model, starting from identification of data, estimated, determined model, before finally we applied that model for investigation some big floods that occurred at Jakarta in 1996, 2002, and 2007 respectively. We found the best of estimated model for the RMM1 and RMM2 prediction is ARIMA (2,1,2). Detailed steps how that model can be extracted and applying to predict the rainfall anomalies over Jakarta for 3 to 6 months later is discussed at this paper.

  5. Voxelwise multivariate analysis of multimodality magnetic resonance imaging.

    PubMed

    Naylor, Melissa G; Cardenas, Valerie A; Tosun, Duygu; Schuff, Norbert; Weiner, Michael; Schwartzman, Armin

    2014-03-01

    Most brain magnetic resonance imaging (MRI) studies concentrate on a single MRI contrast or modality, frequently structural MRI. By performing an integrated analysis of several modalities, such as structural, perfusion-weighted, and diffusion-weighted MRI, new insights may be attained to better understand the underlying processes of brain diseases. We compare two voxelwise approaches: (1) fitting multiple univariate models, one for each outcome and then adjusting for multiple comparisons among the outcomes and (2) fitting a multivariate model. In both cases, adjustment for multiple comparisons is performed over all voxels jointly to account for the search over the brain. The multivariate model is able to account for the multiple comparisons over outcomes without assuming independence because the covariance structure between modalities is estimated. Simulations show that the multivariate approach is more powerful when the outcomes are correlated and, even when the outcomes are independent, the multivariate approach is just as powerful or more powerful when at least two outcomes are dependent on predictors in the model. However, multiple univariate regressions with Bonferroni correction remain a desirable alternative in some circumstances. To illustrate the power of each approach, we analyze a case control study of Alzheimer's disease, in which data from three MRI modalities are available. Copyright © 2013 Wiley Periodicals, Inc.

  6. Multivariate Analysis of Longitudinal Rates of Change

    PubMed Central

    Bryan, Matthew; Heagerty, Patrick J.

    2016-01-01

    Longitudinal data allow direct comparison of the change in patient outcomes associated with treatment or exposure. Frequently, several longitudinal measures are collected that either reflect a common underlying health status, or characterize processes that are influenced in a similar way by covariates such as exposure or demographic characteristics. Statistical methods that can combine multivariate response variables into common measures of covariate effects have been proposed by Roy and Lin [1]; Proust-Lima, Letenneur and Jacqmin-Gadda [2]; and Gray and Brookmeyer [3] among others. Current methods for characterizing the relationship between covariates and the rate of change in multivariate outcomes are limited to select models. For example, Gray and Brookmeyer [3] introduce an “accelerated time” method which assumes that covariates rescale time in longitudinal models for disease progression. In this manuscript we detail an alternative multivariate model formulation that directly structures longitudinal rates of change, and that permits a common covariate effect across multiple outcomes. We detail maximum likelihood estimation for a multivariate longitudinal mixed model. We show via asymptotic calculations the potential gain in power that may be achieved with a common analysis of multiple outcomes. We apply the proposed methods to the analysis of a trivariate outcome for infant growth and compare rates of change for HIV infected and uninfected infants. PMID:27417129

  7. A Multivariate Descriptive Model of Motivation for Orthodontic Treatment.

    ERIC Educational Resources Information Center

    Hackett, Paul M. W.; And Others

    1993-01-01

    Motivation for receiving orthodontic treatment was studied among 109 young adults, and a multivariate model of the process is proposed. The combination of smallest scale analysis and Partial Order Scalogram Analysis by base Coordinates (POSAC) illustrates an interesting methodology for health treatment studies and explores motivation for dental…

  8. Mathematical Formulation of Multivariate Euclidean Models for Discrimination Methods.

    ERIC Educational Resources Information Center

    Mullen, Kenneth; Ennis, Daniel M.

    1987-01-01

    Multivariate models for the triangular and duo-trio methods are described, and theoretical methods are compared to a Monte Carlo simulation. Implications are discussed for a new theory of multidimensional scaling which challenges the traditional assumption that proximity measures and perceptual distances are monotonically related. (Author/GDC)

  9. A Multivariate Model of Parent-Adolescent Relationship Variables in Early Adolescence

    ERIC Educational Resources Information Center

    McKinney, Cliff; Renk, Kimberly

    2011-01-01

    Given the importance of predicting outcomes for early adolescents, this study examines a multivariate model of parent-adolescent relationship variables, including parenting, family environment, and conflict. Participants, who completed measures assessing these variables, included 710 culturally diverse 11-14-year-olds who were attending a middle…

  10. Application of multivariate statistical techniques in microbial ecology.

    PubMed

    Paliy, O; Shankar, V

    2016-03-01

    Recent advances in high-throughput methods of molecular analyses have led to an explosion of studies generating large-scale ecological data sets. In particular, noticeable effect has been attained in the field of microbial ecology, where new experimental approaches provided in-depth assessments of the composition, functions and dynamic changes of complex microbial communities. Because even a single high-throughput experiment produces large amount of data, powerful statistical techniques of multivariate analysis are well suited to analyse and interpret these data sets. Many different multivariate techniques are available, and often it is not clear which method should be applied to a particular data set. In this review, we describe and compare the most widely used multivariate statistical techniques including exploratory, interpretive and discriminatory procedures. We consider several important limitations and assumptions of these methods, and we present examples of how these approaches have been utilized in recent studies to provide insight into the ecology of the microbial world. Finally, we offer suggestions for the selection of appropriate methods based on the research question and data set structure. © 2016 John Wiley & Sons Ltd.

  11. Hierarchical Bayesian spatial models for predicting multiple forest variables using waveform LiDAR, hyperspectral imagery, and large inventory datasets

    USGS Publications Warehouse

    Finley, Andrew O.; Banerjee, Sudipto; Cook, Bruce D.; Bradford, John B.

    2013-01-01

    In this paper we detail a multivariate spatial regression model that couples LiDAR, hyperspectral and forest inventory data to predict forest outcome variables at a high spatial resolution. The proposed model is used to analyze forest inventory data collected on the US Forest Service Penobscot Experimental Forest (PEF), ME, USA. In addition to helping meet the regression model's assumptions, results from the PEF analysis suggest that the addition of multivariate spatial random effects improves model fit and predictive ability, compared with two commonly applied modeling approaches. This improvement results from explicitly modeling the covariation among forest outcome variables and spatial dependence among observations through the random effects. Direct application of such multivariate models to even moderately large datasets is often computationally infeasible because of cubic order matrix algorithms involved in estimation. We apply a spatial dimension reduction technique to help overcome this computational hurdle without sacrificing richness in modeling.

  12. Multivariate missing data in hydrology - Review and applications

    NASA Astrophysics Data System (ADS)

    Ben Aissia, Mohamed-Aymen; Chebana, Fateh; Ouarda, Taha B. M. J.

    2017-12-01

    Water resources planning and management require complete data sets of a number of hydrological variables, such as flood peaks and volumes. However, hydrologists are often faced with the problem of missing data (MD) in hydrological databases. Several methods are used to deal with the imputation of MD. During the last decade, multivariate approaches have gained popularity in the field of hydrology, especially in hydrological frequency analysis (HFA). However, treating the MD remains neglected in the multivariate HFA literature whereas the focus has been mainly on the modeling component. For a complete analysis and in order to optimize the use of data, MD should also be treated in the multivariate setting prior to modeling and inference. Imputation of MD in the multivariate hydrological framework can have direct implications on the quality of the estimation. Indeed, the dependence between the series represents important additional information that can be included in the imputation process. The objective of the present paper is to highlight the importance of treating MD in multivariate hydrological frequency analysis by reviewing and applying multivariate imputation methods and by comparing univariate and multivariate imputation methods. An application is carried out for multiple flood attributes on three sites in order to evaluate the performance of the different methods based on the leave-one-out procedure. The results indicate that, the performance of imputation methods can be improved by adopting the multivariate setting, compared to mean substitution and interpolation methods, especially when using the copula-based approach.

  13. A stochastic storm surge generator for the German North Sea and the multivariate statistical assessment of the simulation results

    NASA Astrophysics Data System (ADS)

    Wahl, Thomas; Jensen, Jürgen; Mudersbach, Christoph

    2010-05-01

    Storm surges along the German North Sea coastline led to major damages in the past and the risk of inundation is expected to increase in the course of an ongoing climate change. The knowledge of the characteristics of possible storm surges is essential for the performance of integrated risk analyses, e.g. based on the source-pathway-receptor concept. The latter includes the storm surge simulation/analyses (source), modelling of dike/dune breach scenarios (pathway) and the quantification of potential losses (receptor). In subproject 1b of the German joint research project XtremRisK (www.xtremrisk.de), a stochastic storm surge generator for the south-eastern North Sea area is developed. The input data for the multivariate model are high resolution sea level observations from tide gauges during extreme events. Based on 25 parameters (19 sea level parameters and 6 time parameters) observed storm surge hydrographs consisting of three tides are parameterised. Followed by the adaption of common parametric probability distributions and a large number of Monte-Carlo-Simulations, the final reconstruction leads to a set of 100.000 (default) synthetic storm surge events with a one-minute resolution. Such a data set can potentially serve as the basis for a large number of applications. For risk analyses, storm surges with peak water levels exceeding the design water levels are of special interest. The occurrence probabilities of the simulated extreme events are estimated based on multivariate statistics, considering the parameters "peak water level" and "fullness/intensity". In the past, most studies considered only the peak water levels during extreme events, which might not be the most important parameter in any cases. Here, a 2D-Archimedian copula model is used for the estimation of the joint probabilities of the selected parameters, accounting for the structures of dependence overlooking the margins. In coordination with subproject 1a, the results will be used as the input for the XtremRisK subprojects 2 to 4. The project is funded by the German Federal Ministry of Education and Research (BMBF) (Project No. 03 F 0483 B).

  14. Universal Pressure Ulcer Prevention Bundle With WOC Nurse Support.

    PubMed

    Anderson, Megan; Finch Guthrie, Patricia; Kraft, Wendy; Reicks, Patty; Skay, Carol; Beal, Alan L

    2015-01-01

    This study examined the effectiveness of a universal pressure ulcer prevention bundle (UPUPB) applied to intensive care unit (ICU) patients combined with proactive, semiweekly WOC nurse rounds. The UPUBP was compared to a standard guideline with referral-based WOC nurse involvement measuring adherence to 5 evidence-based prevention interventions and incidence of pressure ulcers. The study used a quasi-experimental, pre-, and postintervention design in which each phase included different subjects. Descriptive methods assisted in exploring the content of WOC nurse rounds. One hundred eighty-one pre- and 146 postintervention subjects who met inclusion criteria and were admitted to ICU for more than 24 hours participated in the study. The research setting was 3 ICUs located at North Memorial Medical Center in Minneapolis, Minnesota. Data collection included admission/discharge skin assessments, chart reviews for 5 evidence-based interventions and patient characteristics, and WOC nurse rounding logs. Study subjects with intact skin on admission identified with an initial skin assessment were enrolled in which prephase subjects received standard care and postphase subjects received the UPUPB. Skin assessments on ICU discharge and chart reviews throughout the stay determined the presence of unit-acquired pressure ulcers and skin care received. Analysis included description of WOC nurse rounds, t-tests for guideline adherence, and multivariate analysis for intervention effect on pressure ulcer incidence. Unit assignment, Braden Scale score, and ICU length of stay were covariates for a multivariate model based on bivariate logistic regression screening. The incidence of unit-acquired pressure ulcers decreased from 15.5% to 2.1%. WOC nurses logged 204 rounds over 6 months, focusing primarily on early detection of pressure sources. Data analysis revealed significantly increased adherence to heel elevation (t = -3.905, df = 325, P < .001) and repositioning (t = -2.441, df = 325, P < .015). Multivariate logistic regression modeling showed a significant reduction in unit-acquired pressure ulcers (P < .001). The intervention increased the Nagelkerke R-Square value by 0.099 (P < .001) more than 0.297 (P < .001) when including only covariates, for a final model value of 0.396 (P < .001). The UPUPB with WOC nurse rounds resulted in a statistically significant and clinically relevant reduction in the incidence of pressure ulcers.

  15. Socioeconomic Drought in a Changing Climate: Modeling and Management

    NASA Astrophysics Data System (ADS)

    AghaKouchak, Amir; Mehran, Ali; Mazdiyasni, Omid

    2016-04-01

    Drought is typically defined based on meteorological, hydrological and land surface conditions. However, in many parts of the world, anthropogenic changes and water management practices have significantly altered local water availability. Socioeconomic drought refers to conditions whereby the available water supply cannot satisfy the human and environmental water needs. Surface water reservoirs provide resilience against local climate variability (e.g., droughts), and play a major role in regional water management. This presentation focuses on a framework for describing socioeconomic drought based on both water supply and demand information. We present a multivariate approach as a measure of socioeconomic drought, termed Multivariate Standardized Reliability and Resilience Index (MSRRI; Mehran et al., 2015). This model links the information on inflow and surface reservoir storage to water demand. MSRRI integrates a "top-down" and a "bottom-up" approach for describing socioeconomic drought. The "top-down" component describes processes that cannot be simply controlled or altered by local decision-makers and managers (e.g., precipitation, climate variability, climate change), whereas the "bottom-up" component focuses on the local resilience, and societal capacity to respond to droughts. The two components (termed, Inflow-Demand Reliability (IDR) indicator and Water Storage Resilience (WSR) indicator) are integrated using a nonparametric multivariate approach. We use this framework to assess the socioeconomic drought during the Australian Millennium Drought (1998-2010) and the 2011-2014 California Droughts. MSRRI provides additional information on socioeconomic drought onset, development and termination based on local resilience and human demand that cannot be obtained from the commonly used drought indicators. We show that MSRRI can be used for water management scenario analysis (e.g., local water availability based on different human water demands scenarios). Finally, we provide examples of using the proposed modeling framework for analyzing water availability in a changing climate considering local conditions. Reference: Mehran A., Mazdiyasni O., AghaKouchak A., 2015, A Hybrid Framework for Assessing Socioeconomic Drought: Linking Climate Variability, Local Resilience, and Demand, Journal of Geophysical Research, 120 (15), 7520-7533, doi: 10.1002/2015JD023147

  16. A mixed-effects regression model for longitudinal multivariate ordinal data.

    PubMed

    Liu, Li C; Hedeker, Donald

    2006-03-01

    A mixed-effects item response theory model that allows for three-level multivariate ordinal outcomes and accommodates multiple random subject effects is proposed for analysis of multivariate ordinal outcomes in longitudinal studies. This model allows for the estimation of different item factor loadings (item discrimination parameters) for the multiple outcomes. The covariates in the model do not have to follow the proportional odds assumption and can be at any level. Assuming either a probit or logistic response function, maximum marginal likelihood estimation is proposed utilizing multidimensional Gauss-Hermite quadrature for integration of the random effects. An iterative Fisher scoring solution, which provides standard errors for all model parameters, is used. An analysis of a longitudinal substance use data set, where four items of substance use behavior (cigarette use, alcohol use, marijuana use, and getting drunk or high) are repeatedly measured over time, is used to illustrate application of the proposed model.

  17. Error propagation of partial least squares for parameters optimization in NIR modeling.

    PubMed

    Du, Chenzhao; Dai, Shengyun; Qiao, Yanjiang; Wu, Zhisheng

    2018-03-05

    A novel methodology is proposed to determine the error propagation of partial least-square (PLS) for parameters optimization in near-infrared (NIR) modeling. The parameters include spectral pretreatment, latent variables and variable selection. In this paper, an open source dataset (corn) and a complicated dataset (Gardenia) were used to establish PLS models under different modeling parameters. And error propagation of modeling parameters for water quantity in corn and geniposide quantity in Gardenia were presented by both type І and type II error. For example, when variable importance in the projection (VIP), interval partial least square (iPLS) and backward interval partial least square (BiPLS) variable selection algorithms were used for geniposide in Gardenia, compared with synergy interval partial least squares (SiPLS), the error weight varied from 5% to 65%, 55% and 15%. The results demonstrated how and what extent the different modeling parameters affect error propagation of PLS for parameters optimization in NIR modeling. The larger the error weight, the worse the model. Finally, our trials finished a powerful process in developing robust PLS models for corn and Gardenia under the optimal modeling parameters. Furthermore, it could provide a significant guidance for the selection of modeling parameters of other multivariate calibration models. Copyright © 2017. Published by Elsevier B.V.

  18. Error propagation of partial least squares for parameters optimization in NIR modeling

    NASA Astrophysics Data System (ADS)

    Du, Chenzhao; Dai, Shengyun; Qiao, Yanjiang; Wu, Zhisheng

    2018-03-01

    A novel methodology is proposed to determine the error propagation of partial least-square (PLS) for parameters optimization in near-infrared (NIR) modeling. The parameters include spectral pretreatment, latent variables and variable selection. In this paper, an open source dataset (corn) and a complicated dataset (Gardenia) were used to establish PLS models under different modeling parameters. And error propagation of modeling parameters for water quantity in corn and geniposide quantity in Gardenia were presented by both type І and type II error. For example, when variable importance in the projection (VIP), interval partial least square (iPLS) and backward interval partial least square (BiPLS) variable selection algorithms were used for geniposide in Gardenia, compared with synergy interval partial least squares (SiPLS), the error weight varied from 5% to 65%, 55% and 15%. The results demonstrated how and what extent the different modeling parameters affect error propagation of PLS for parameters optimization in NIR modeling. The larger the error weight, the worse the model. Finally, our trials finished a powerful process in developing robust PLS models for corn and Gardenia under the optimal modeling parameters. Furthermore, it could provide a significant guidance for the selection of modeling parameters of other multivariate calibration models.

  19. A multivariate spatial mixture model for areal data: examining regional differences in standardized test scores

    PubMed Central

    Neelon, Brian; Gelfand, Alan E.; Miranda, Marie Lynn

    2013-01-01

    Summary Researchers in the health and social sciences often wish to examine joint spatial patterns for two or more related outcomes. Examples include infant birth weight and gestational length, psychosocial and behavioral indices, and educational test scores from different cognitive domains. We propose a multivariate spatial mixture model for the joint analysis of continuous individual-level outcomes that are referenced to areal units. The responses are modeled as a finite mixture of multivariate normals, which accommodates a wide range of marginal response distributions and allows investigators to examine covariate effects within subpopulations of interest. The model has a hierarchical structure built at the individual level (i.e., individuals are nested within areal units), and thus incorporates both individual- and areal-level predictors as well as spatial random effects for each mixture component. Conditional autoregressive (CAR) priors on the random effects provide spatial smoothing and allow the shape of the multivariate distribution to vary flexibly across geographic regions. We adopt a Bayesian modeling approach and develop an efficient Markov chain Monte Carlo model fitting algorithm that relies primarily on closed-form full conditionals. We use the model to explore geographic patterns in end-of-grade math and reading test scores among school-age children in North Carolina. PMID:26401059

  20. Thermal regulation in multiple-source arc welding involving material transformations

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Doumanidis, C.C.

    1995-06-01

    This article addresses regulation of the thermal field generated during arc welding, as the cause of solidification, heat-affected zone and cooling rate related metallurgical transformations affecting the final microstructure and mechanical properties of various welded materials. This temperature field is described by a dynamic real-time process model, consisting of an analytical composite conduction expression for the solid region, and a lumped-state, double-stream circulation model in the weld pool, integrated with a Gaussian heat input and calibrated experimentally through butt joint GMAW tests on plain steel plates. This model serves as the basis of an in-process thermal control system employing feedbackmore » of part surface temperatures measured by infrared pyrometry; and real-time identification of the model parameters with a multivariable adaptive control strategy. Multiple heat inputs and continuous power distributions are implemented by a single time-multiplexed torch, scanning the weld surface to ensure independent, decoupled control of several thermal characteristics. Their regulation is experimentally obtained in longitudinal GTAW of stainless steel pipes, despite the presence of several geometrical, thermal and process condition disturbances of arc welding.« less

  1. Predicting Failure of Glyburide Therapy in Gestational Diabetes

    PubMed Central

    Harper, Lorie M.; Glover, Angelica V.; Biggio, Joseph R.; Tita, Alan

    2016-01-01

    Objective We sought to develop a prediction model to identify women with gestational diabetes (GDM) who require insulin to achieve glycemic control. Study Design Retrospective cohort of all singletons with GDM treated with glyburide 2007–2013. Glyburide failure was defined as reaching glyburide 20 mg/day and receiving insulin. Glyburide success was defined as any glyburide dose without insulin and >70% of visits with glycemic control. Multivariable logistic regression analysis was performed to create a prediction model. Results Of 360 women, 63 (17.5%) qualified as glyburide failure and 157 (43.6%) glyburide success. The final prediction model for glyburide failure included prior GDM, GDM diagnosis ≤26 weeks, 1-hour GCT ≥228 mg/dL, 3-hour GTT 1-hour value ≥221 mg/dL, ≥7 post-prandial blood sugars >120 mg/dL in the week glyburide started, and ≥1 blood sugar >200 mg/dL. The model accurately classified 81% of subjects. Conclusions Women with GDM who will require insulin can be identified at initiation of pharmacologic therapy. PMID:26796130

  2. Predicting failure of glyburide therapy in gestational diabetes.

    PubMed

    Harper, L M; Glover, A V; Biggio, J R; Tita, A

    2016-05-01

    We sought to develop a prediction model to identify women with gestational diabetes (GDM) who require insulin to achieve glycemic control. Retrospective cohort of all singletons with GDM treated with glyburide from 2007 to 2013. Glyburide failure was defined as reaching glyburide 20 mg day(-1) and receiving insulin. Glyburide success was defined as any glyburide dose without insulin and >70% of visits with glycemic control. Multivariable logistic regression analysis was performed to create a prediction model. Of the 360 women, 63 (17.5%) qualified as glyburide failure and 157 (43.6%) as glyburide success. The final prediction model for glyburide failure included prior GDM, GDM diagnosis ⩽26 weeks, 1-h glucose challenge test ⩾228 mg dl(-1), 3-h glucose tolerance test 1-h value ⩾221 mg dl(-1), ⩾7 postprandial blood sugars >120 mg dl(-1) in the week glyburide started and ⩾1 blood sugar >200 mg dl(-1). The model accurately classified 81% of subjects. Women with GDM who will require insulin can be identified at the initiation of pharmacological therapy.

  3. Geographical origin discrimination of lentils (Lens culinaris Medik.) using 1H NMR fingerprinting and multivariate statistical analyses.

    PubMed

    Longobardi, Francesco; Innamorato, Valentina; Di Gioia, Annalisa; Ventrella, Andrea; Lippolis, Vincenzo; Logrieco, Antonio F; Catucci, Lucia; Agostiano, Angela

    2017-12-15

    Lentil samples coming from two different countries, i.e. Italy and Canada, were analysed using untargeted 1 H NMR fingerprinting in combination with chemometrics in order to build models able to classify them according to their geographical origin. For such aim, Soft Independent Modelling of Class Analogy (SIMCA), k-Nearest Neighbor (k-NN), Principal Component Analysis followed by Linear Discriminant Analysis (PCA-LDA) and Partial Least Squares-Discriminant Analysis (PLS-DA) were applied to the NMR data and the results were compared. The best combination of average recognition (100%) and cross-validation prediction abilities (96.7%) was obtained for the PCA-LDA. All the statistical models were validated both by using a test set and by carrying out a Monte Carlo Cross Validation: the obtained performances were found to be satisfying for all the models, with prediction abilities higher than 95% demonstrating the suitability of the developed methods. Finally, the metabolites that mostly contributed to the lentil discrimination were indicated. Copyright © 2017 Elsevier Ltd. All rights reserved.

  4. Multivariate meta-analysis of individual participant data helped externally validate the performance and implementation of a prediction model.

    PubMed

    Snell, Kym I E; Hua, Harry; Debray, Thomas P A; Ensor, Joie; Look, Maxime P; Moons, Karel G M; Riley, Richard D

    2016-01-01

    Our aim was to improve meta-analysis methods for summarizing a prediction model's performance when individual participant data are available from multiple studies for external validation. We suggest multivariate meta-analysis for jointly synthesizing calibration and discrimination performance, while accounting for their correlation. The approach estimates a prediction model's average performance, the heterogeneity in performance across populations, and the probability of "good" performance in new populations. This allows different implementation strategies (e.g., recalibration) to be compared. Application is made to a diagnostic model for deep vein thrombosis (DVT) and a prognostic model for breast cancer mortality. In both examples, multivariate meta-analysis reveals that calibration performance is excellent on average but highly heterogeneous across populations unless the model's intercept (baseline hazard) is recalibrated. For the cancer model, the probability of "good" performance (defined by C statistic ≥0.7 and calibration slope between 0.9 and 1.1) in a new population was 0.67 with recalibration but 0.22 without recalibration. For the DVT model, even with recalibration, there was only a 0.03 probability of "good" performance. Multivariate meta-analysis can be used to externally validate a prediction model's calibration and discrimination performance across multiple populations and to evaluate different implementation strategies. Crown Copyright © 2016. Published by Elsevier Inc. All rights reserved.

  5. Prognostic nomogram and score to predict overall survival in locally advanced untreated pancreatic cancer (PROLAP)

    PubMed Central

    Vernerey, Dewi; Huguet, Florence; Vienot, Angélique; Goldstein, David; Paget-Bailly, Sophie; Van Laethem, Jean-Luc; Glimelius, Bengt; Artru, Pascal; Moore, Malcolm J; André, Thierry; Mineur, Laurent; Chibaudel, Benoist; Benetkiewicz, Magdalena; Louvet, Christophe; Hammel, Pascal; Bonnetain, Franck

    2016-01-01

    Background: The management of locally advanced pancreatic cancer (LAPC) patients remains controversial. Better discrimination for overall survival (OS) at diagnosis is needed. We address this issue by developing and validating a prognostic nomogram and a score for OS in LAPC (PROLAP). Methods: Analyses were derived from 442 LAPC patients enrolled in the LAP07 trial. The prognostic ability of 30 baseline parameters was evaluated using univariate and multivariate Cox regression analyses. Performance assessment and internal validation of the final model were done with Harrell's C-index, calibration plot and bootstrap sample procedures. On the basis of the final model, a prognostic nomogram and a score were developed, and externally validated in 106 consecutive LAPC patients treated in Besançon Hospital, France. Results: Age, pain, tumour size, albumin and CA 19-9 were independent prognostic factors for OS. The final model had good calibration, acceptable discrimination (C-index=0.60) and robust internal validity. The PROLAP score has the potential to delineate three different prognosis groups with median OS of 15.4, 11.7 and 8.5 months (log-rank P<0.0001). The score ability to discriminate OS was externally confirmed in 63 (59%) patients with complete clinical data derived from a data set of 106 consecutive LAPC patients; median OS of 18.3, 14.1 and 7.6 months for the three groups (log-rank P<0.0001). Conclusions: The PROLAP nomogram and score can accurately predict OS before initiation of induction chemotherapy in LAPC-untreated patients. They may help to optimise clinical trials design and might offer the opportunity to define risk-adapted strategies for LAPC management in the future. PMID:27404456

  6. Multivariate Formation Pressure Prediction with Seismic-derived Petrophysical Properties from Prestack AVO inversion and Poststack Seismic Motion Inversion

    NASA Astrophysics Data System (ADS)

    Yu, H.; Gu, H.

    2017-12-01

    A novel multivariate seismic formation pressure prediction methodology is presented, which incorporates high-resolution seismic velocity data from prestack AVO inversion, and petrophysical data (porosity and shale volume) derived from poststack seismic motion inversion. In contrast to traditional seismic formation prediction methods, the proposed methodology is based on a multivariate pressure prediction model and utilizes a trace-by-trace multivariate regression analysis on seismic-derived petrophysical properties to calibrate model parameters in order to make accurate predictions with higher resolution in both vertical and lateral directions. With prestack time migration velocity as initial velocity model, an AVO inversion was first applied to prestack dataset to obtain high-resolution seismic velocity with higher frequency that is to be used as the velocity input for seismic pressure prediction, and the density dataset to calculate accurate Overburden Pressure (OBP). Seismic Motion Inversion (SMI) is an inversion technique based on Markov Chain Monte Carlo simulation. Both structural variability and similarity of seismic waveform are used to incorporate well log data to characterize the variability of the property to be obtained. In this research, porosity and shale volume are first interpreted on well logs, and then combined with poststack seismic data using SMI to build porosity and shale volume datasets for seismic pressure prediction. A multivariate effective stress model is used to convert velocity, porosity and shale volume datasets to effective stress. After a thorough study of the regional stratigraphic and sedimentary characteristics, a regional normally compacted interval model is built, and then the coefficients in the multivariate prediction model are determined in a trace-by-trace multivariate regression analysis on the petrophysical data. The coefficients are used to convert velocity, porosity and shale volume datasets to effective stress and then to calculate formation pressure with OBP. Application of the proposed methodology to a research area in East China Sea has proved that the method can bridge the gap between seismic and well log pressure prediction and give predicted pressure values close to pressure meassurements from well testing.

  7. Dose-dependent effect of mammographic breast density on the risk of contralateral breast cancer.

    PubMed

    Chowdhury, Marzana; Euhus, David; O'Donnell, Maureen; Onega, Tracy; Choudhary, Pankaj K; Biswas, Swati

    2018-07-01

    Increased mammographic breast density is a significant risk factor for breast cancer. It is not clear if it is also a risk factor for the development of contralateral breast cancer. The data were obtained from Breast Cancer Surveillance Consortium and included women diagnosed with invasive breast cancer or ductal carcinoma in situ between ages 18 and 88 and years 1995 and 2009. Each case of contralateral breast cancer was matched with three controls based on year of first breast cancer diagnosis, race, and length of follow-up. A total of 847 cases and 2541 controls were included. The risk factors included in the study were mammographic breast density, age of first breast cancer diagnosis, family history of breast cancer, anti-estrogen treatment, hormone replacement therapy, menopausal status, and estrogen receptor status, all from the time of first breast cancer diagnosis. Both univariate analysis and multivariate conditional logistic regression analysis were performed. In the final multivariate model, breast density, family history of breast cancer, and anti-estrogen treatment remained significant with p values less than 0.01. Increasing breast density had a dose-dependent effect on the risk of contralateral breast cancer. Relative to 'almost entirely fat' category of breast density, the adjusted odds ratios (and p values) in the multivariate analysis for 'scattered density,' 'heterogeneously dense,' and 'extremely dense' categories were 1.65 (0.036), 2.10 (0.002), and 2.32 (0.001), respectively. Breast density is an independent and significant risk factor for development of contralateral breast cancer. This risk factor should contribute to clinical decision making.

  8. Time Series Model Identification by Estimating Information.

    DTIC Science & Technology

    1982-11-01

    principle, Applications of Statistics, P. R. Krishnaiah , ed., North-Holland: Amsterdam, 27-41. Anderson, T. W. (1971). The Statistical Analysis of Time Series...E. (1969). Multiple Time Series Modeling, Multivariate Analysis II, edited by P. Krishnaiah , Academic Press: New York, 389-409. Parzen, E. (1981...Newton, H. J. (1980). Multiple Time Series Modeling, II Multivariate Analysis - V, edited by P. Krishnaiah , North Holland: Amsterdam, 181-197. Shibata, R

  9. Determining the Relationship Between Moral Waivers and Marine Corps Unsuitability Attrition

    DTIC Science & Technology

    2008-03-01

    observed characteristics. However, econometric research indicates that the magnitude of interaction effects estimated via probit or logit models may...1997 to 2005. Multivariate probit models were used to analyze the effects of moral waivers on unsatisfactory service separations. 15. NUMBER OF...files from fiscal years 1997 to 2005. Multivariate probit models were used to analyze the effects of moral waivers on unsatisfactory service

  10. Measuring multiple spike train synchrony.

    PubMed

    Kreuz, Thomas; Chicharro, Daniel; Andrzejak, Ralph G; Haas, Julie S; Abarbanel, Henry D I

    2009-10-15

    Measures of multiple spike train synchrony are essential in order to study issues such as spike timing reliability, network synchronization, and neuronal coding. These measures can broadly be divided in multivariate measures and averages over bivariate measures. One of the most recent bivariate approaches, the ISI-distance, employs the ratio of instantaneous interspike intervals (ISIs). In this study we propose two extensions of the ISI-distance, the straightforward averaged bivariate ISI-distance and the multivariate ISI-diversity based on the coefficient of variation. Like the original measure these extensions combine many properties desirable in applications to real data. In particular, they are parameter-free, time scale independent, and easy to visualize in a time-resolved manner, as we illustrate with in vitro recordings from a cortical neuron. Using a simulated network of Hindemarsh-Rose neurons as a controlled configuration we compare the performance of our methods in distinguishing different levels of multi-neuron spike train synchrony to the performance of six other previously published measures. We show and explain why the averaged bivariate measures perform better than the multivariate ones and why the multivariate ISI-diversity is the best performer among the multivariate methods. Finally, in a comparison against standard methods that rely on moving window estimates, we use single-unit monkey data to demonstrate the advantages of the instantaneous nature of our methods.

  11. Mapping Informative Clusters in a Hierarchial Framework of fMRI Multivariate Analysis

    PubMed Central

    Xu, Rui; Zhen, Zonglei; Liu, Jia

    2010-01-01

    Pattern recognition methods have become increasingly popular in fMRI data analysis, which are powerful in discriminating between multi-voxel patterns of brain activities associated with different mental states. However, when they are used in functional brain mapping, the location of discriminative voxels varies significantly, raising difficulties in interpreting the locus of the effect. Here we proposed a hierarchical framework of multivariate approach that maps informative clusters rather than voxels to achieve reliable functional brain mapping without compromising the discriminative power. In particular, we first searched for local homogeneous clusters that consisted of voxels with similar response profiles. Then, a multi-voxel classifier was built for each cluster to extract discriminative information from the multi-voxel patterns. Finally, through multivariate ranking, outputs from the classifiers were served as a multi-cluster pattern to identify informative clusters by examining interactions among clusters. Results from both simulated and real fMRI data demonstrated that this hierarchical approach showed better performance in the robustness of functional brain mapping than traditional voxel-based multivariate methods. In addition, the mapped clusters were highly overlapped for two perceptually equivalent object categories, further confirming the validity of our approach. In short, the hierarchical framework of multivariate approach is suitable for both pattern classification and brain mapping in fMRI studies. PMID:21152081

  12. Presence of Ureaplasma diversum in the genital tracts of female dairy cattle in Mato Grosso State, Brazil.

    PubMed

    Azevedo, Jaqueline B; Silva, Gustavo S; Rocha, Priscylla S; Pitchenin, Letícia C; Dutra, Valéria; Nakazato, Luciano; de Oliveira, Anderson Castro Soares; Pescador, Caroline A

    2017-02-01

    Ureaplasma diversum infection in bovine females may result in various reproductive problems, including granular vulvovaginitis, abortion, weak calves, salpingitis, and spontaneous abortion. The presence of U. diversum in a dairy bovine population from midwestern Brazil has not been established. The aim of this study was to determine whether U. diversum was present in dairy cattle from midwestern Brazil using polymerase chain reaction (PCR). Vulvovaginal mucus was analyzed from 203 cows located in six municipalities in the north region of Mato Grosso State, Brazil. A total of 25% of dairy cows with vulvovaginitis were positive for U. diversum. The factors evaluated were included in a multivariable logistic regression model with the presence of at least one positive cow in the herd serving as the dependent variable. Three variables were significantly associated with a U. diversum-positive PCR and were included in the final multivariable model: number of parities, vulvar lesions, and reproductive problems. For each new parity, the chance of U. diversum infection decreased 0.03-fold, indicating that cows with the highest number of parities were more protected. The presence of vulvar lesions was increased 17.6-fold in females positive for U. diversum, suggesting that this bacterium could be related to the red granular lesions in the vulvar mucosa, whereas reproductive problems were increased 7.6-fold. However, further investigations should be conducted to ascertain the effects of U. diversum in association with other mycoplasma species in the herds studied.

  13. Spectral Mining for Discriminating Blood Origins in the Presence of Substrate Interference via Attenuated Total Reflection Fourier Transform Infrared Spectroscopy: Postmortem or Antemortem Blood?

    PubMed

    Takamura, Ayari; Watanabe, Ken; Akutsu, Tomoko; Ikegaya, Hiroshi; Ozawa, Takeaki

    2017-09-19

    Often in criminal investigations, discrimination of types of body fluid evidence is crucially important to ascertain how a crime was committed. Compared to current methods using biochemical techniques, vibrational spectroscopic approaches can provide versatile applicability to identify various body fluid types without sample invasion. However, their applicability is limited to pure body fluid samples because important signals from body fluids incorporated in a substrate are affected strongly by interference from substrate signals. Herein, we describe a novel approach to recover body fluid signals that are embedded in strong substrate interferences using attenuated total reflection Fourier transform infrared (ATR FT-IR) spectroscopy and an innovative multivariate spectral processing. This technique supported detection of covert features of body fluid signals, and then identified origins of body fluid stains on substrates. We discriminated between ATR FT-IR spectra of postmortem blood (PB) and those of antemortem blood (AB) by creating a multivariate statistics model. From ATR FT-IR spectra of PB and AB stains on interfering substrates (polyester, cotton, and denim), blood-originated signals were extracted by a weighted linear regression approach we developed originally using principal components of both blood and substrate spectra. The blood-originated signals were finally classified by the discriminant model, demonstrating high discriminant accuracy. The present method can identify body fluid evidence independently of the substrate type, which is expected to promote the application of vibrational spectroscopic techniques in forensic body fluid analysis.

  14. Estimation of typhoon rainfall in GaoPing River: A Multivariate Maximum Entropy Method

    NASA Astrophysics Data System (ADS)

    Pei-Jui, Wu; Hwa-Lung, Yu

    2016-04-01

    The heavy rainfall from typhoons is the main factor of the natural disaster in Taiwan, which causes the significant loss of human lives and properties. Statistically average 3.5 typhoons invade Taiwan every year, and the serious typhoon, Morakot in 2009, impacted Taiwan in recorded history. Because the duration, path and intensity of typhoon, also affect the temporal and spatial rainfall type in specific region , finding the characteristics of the typhoon rainfall type is advantageous when we try to estimate the quantity of rainfall. This study developed a rainfall prediction model and can be divided three parts. First, using the EEOF(extended empirical orthogonal function) to classify the typhoon events, and decompose the standard rainfall type of all stations of each typhoon event into the EOF and PC(principal component). So we can classify the typhoon events which vary similarly in temporally and spatially as the similar typhoon types. Next, according to the classification above, we construct the PDF(probability density function) in different space and time by means of using the multivariate maximum entropy from the first to forth moment statistically. Therefore, we can get the probability of each stations of each time. Final we use the BME(Bayesian Maximum Entropy method) to construct the typhoon rainfall prediction model , and to estimate the rainfall for the case of GaoPing river which located in south of Taiwan.This study could be useful for typhoon rainfall predictions in future and suitable to government for the typhoon disaster prevention .

  15. Forgotten marriages? Measuring the reliability of marriage histories

    PubMed Central

    Chae, Sophia

    2016-01-01

    BACKGROUND Marriage histories are a valuable data source for investigating nuptiality. While researchers typically acknowledge the problems associated with their use, it is unknown to what extent these problems occur and how marriage analyses are affected. OBJECTIVE This paper seeks to investigate the quality of marriage histories by measuring levels of misreporting, examining the characteristics associated with misreporting, and assessing whether misreporting biases marriage indicators. METHODS Using data from the Malawi Longitudinal Study of Families and Health (MLSFH), I compare marriage histories reported by the same respondents at two different points in time. I investigate whether respondents consistently report their spouses (by name), status of marriage, and dates of marriage. I use multivariate regression models to investigate the characteristics associated with misreporting. Finally, I examine whether misreporting marriages and marriage dates affects marriage indicators. RESULTS Results indicate that 28.3% of men and 17.9% of women omitted at least one marriage in one of the survey waves. Multivariate regression models show that misreporting is not random: marriage, individual, interviewer, and survey characteristics are associated with marriage omission and marriage date inconsistencies. Misreporting also affects marriage indicators. CONCLUSIONS This is the first study of its kind to examine the reliability of marriage histories collected in the context of Sub-Saharan Africa. Although marriage histories are frequently used to study marriage dynamics, until now no knowledge has existed on the degree of misreporting. Misreporting in marriage histories is shown to be non-negligent and could potentially affect analyses. PMID:27152090

  16. Measurement error and timing of predictor values for multivariable risk prediction models are poorly reported.

    PubMed

    Whittle, Rebecca; Peat, George; Belcher, John; Collins, Gary S; Riley, Richard D

    2018-05-18

    Measurement error in predictor variables may threaten the validity of clinical prediction models. We sought to evaluate the possible extent of the problem. A secondary objective was to examine whether predictors are measured at the intended moment of model use. A systematic search of Medline was used to identify a sample of articles reporting the development of a clinical prediction model published in 2015. After screening according to a predefined inclusion criteria, information on predictors, strategies to control for measurement error and intended moment of model use were extracted. Susceptibility to measurement error for each predictor was classified into low and high risk. Thirty-three studies were reviewed, including 151 different predictors in the final prediction models. Fifty-one (33.7%) predictors were categorised as high risk of error, however this was not accounted for in the model development. Only 8 (24.2%) studies explicitly stated the intended moment of model use and when the predictors were measured. Reporting of measurement error and intended moment of model use is poor in prediction model studies. There is a need to identify circumstances where ignoring measurement error in prediction models is consequential and whether accounting for the error will improve the predictions. Copyright © 2018. Published by Elsevier Inc.

  17. Validation of a Radiosensitivity Molecular Signature in Breast Cancer

    PubMed Central

    Eschrich, Steven A.; Fulp, William J.; Pawitan, Yudi; Foekens, John A.; Smid, Marcel; Martens, John W. M.; Echevarria, Michelle; Kamath, Vidya; Lee, Ji-Hyun; Harris, Eleanor E.; Bergh, Jonas; Torres-Roca, Javier F.

    2014-01-01

    Purpose Previously, we developed a radiosensitivity molecular signature (RSI) that was clinically-validated in three independent datasets (rectal, esophageal, head and neck) in 118 patients. Here, we test RSI in radiotherapy (RT) treated breast cancer patients. Experimental Design RSI was tested in two previously published breast cancer datasets. Patients were treated at the Karolinska University Hospital (n=159) and Erasmus Medical Center (n=344). RSI was applied as previously described. Results We tested RSI in RT-treated patients (Karolinska). Patients predicted to be radiosensitive (RS) had an improved 5 yr relapse-free survival when compared with radioresistant (RR) patients (95% vs. 75%, p=0.0212) but there was no difference between RS/RR patients treated without RT (71% vs. 77%, p=0.6744), consistent with RSI being RT-specific (interaction term RSIxRT, p=0.05). Similarly, in the Erasmus dataset RT-treated RS patients had an improved 5-year distant-metastasis-free survival over RR patients (77% vs. 64%, p=0.0409) but no difference was observed in patients treated without RT (RS vs. RR, 80% vs. 81%, p=0.9425). Multivariable analysis showed RSI is the strongest variable in RT-treated patients (Karolinska, HR=5.53, p=0.0987, Erasmus, HR=1.64, p=0.0758) and in backward selection (removal alpha of 0.10) RSI was the only variable remaining in the final model. Finally, RSI is an independent predictor of outcome in RT-treated ER+ patients (Erasmus, multivariable analysis, HR=2.64, p=0.0085). Conclusions RSI is validated in two independent breast cancer datasets totaling 503 patients. Including prior data, RSI is validated in five independent cohorts (621 patients) and represents, to our knowledge, the most extensively validated molecular signature in radiation oncology. PMID:22832933

  18. Are hospitalized Parkinson's disease patients more likely to carry a do-not-resuscitate order?

    PubMed

    Mahajan, Abhimanyu; Patel, Achint; Nadkarni, Girish; Sidiropoulos, Christos

    2017-03-01

    While DNR utilization is a complex subjective phenomenon, the effect of such a decision can collectively influence attitudes of care. The role of palliative care in advanced PD has been under appreciated. We reviewed the Healthcare Cost and Utilization Project's National Inpatient Sample (NIS) database from 2012 for all hospitalizations ⩾65years. We identified PD by using ICD-9-CM code 332.0 and DNR status with ICD code - V49.86 entered during the same admission as a secondary diagnosis. We estimated risk of mortality by the 3M™ All Patient Refined DRG (APR DRG) classification System and generated multivariate regression models to assess associations between DNR and PD after adjusting for confounders. Finally, we tested for interaction by risk of mortality. We analyzed 12,700,000 hospitalizations with age ⩾65years in 2012, of which 246625 (1.94%) pts had PD. Proportion of DNR utilization was higher among PD patients vs. those without, 20895 (8.47%) vs. 723090 (5.8%) (p<0.01). In multivariable regression analysis, PD patients were associated with higher odds of DNR utilization [Adjusted Odds ratio (aOR): 1.26, 95% CI: 1.21, 1.30, p<0.001]. Finally, the odds of DNR utilization increased significantly with APR-DRG stage [aOR: 1 vs. 1.61 (Stage 2) vs. 2.46 (Stage 3) vs. 3.61 (Stage 4); p<0.0001]. PD patients have higher odds of DNR utilization than the general population, which worsens with increasing objective risk of mortality. This is likely correlated with perception of end of life and importance of QOL with increasing severity of overall illness. Copyright © 2016 Elsevier Ltd. All rights reserved.

  19. The role of drinking water sources, consumption of vegetables and seafood in relation to blood arsenic concentrations of Jamaican children with and without Autism Spectrum Disorders

    PubMed Central

    Rahbar, Mohammad H.; Samms-Vaughan, Maureen; Ardjomand-Hessabi, Manouchehr; Loveland, Katherine A.; Dickerson, Aisha S.; Chen, Zhongxue; Bressler, Jan; Shakespeare-Pellington, Sydonnie; Grove, Megan L.; Bloom, Kari; Wirth, Julie; Pearson, Deborah A.; Boerwinkle, Eric

    2012-01-01

    Arsenic is a toxic metal with harmful effects on human health, particularly on cognitive function. Autism Spectrum Disorders (ASDs) are lifelong neurodevelopmental and behavioral disorders manifesting in infancy or early childhood. We used data from 130 children between 2-8 years (65 pairs of ASD cases with age- and sex-matched control), to compare the mean total blood arsenic concentrations in children with and without ASDs in Kingston, Jamaica. Based on univariable analysis, we observed a significant difference between ASD cases and controls (4.03μg/L for cases vs. 4.48μg/L for controls, P < 0.01). In the final multivariable General Linear Model (GLM), after controlling for car ownership, maternal age, parental education levels, source of drinking water, consumption of “yam, sweet potato, or dasheen”, “carrot or pumpkin”, “callaloo, broccoli, or pak choi”, cabbage, avocado, and the frequency of seafood consumption per week, we did not find a significant association between blood arsenic concentrations and ASD status (4.36μg/L for cases vs. 4.65μg/L for controls, P = 0.23). Likewise, in a separate final multivariable GLM, we found that source of drinking water, eating avocado, and eating “callaloo, broccoli, or pak choi” were significantly associated with higher blood arsenic concentrations (all three P < 0.05). Based on our findings, we recommend assessment of arsenic levels in water, fruits, and vegetables, as well as increased awareness among the Jamaican population regarding potential risks for various exposures to arsenic. PMID:22819887

  20. A General Multivariate Latent Growth Model with Applications to Student Achievement

    ERIC Educational Resources Information Center

    Bianconcini, Silvia; Cagnone, Silvia

    2012-01-01

    The evaluation of the formative process in the University system has been assuming an ever increasing importance in the European countries. Within this context, the analysis of student performance and capabilities plays a fundamental role. In this work, the authors propose a multivariate latent growth model for studying the performances of a…

  1. Bayesian Estimation of Random Coefficient Dynamic Factor Models

    ERIC Educational Resources Information Center

    Song, Hairong; Ferrer, Emilio

    2012-01-01

    Dynamic factor models (DFMs) have typically been applied to multivariate time series data collected from a single unit of study, such as a single individual or dyad. The goal of DFMs application is to capture dynamics of multivariate systems. When multiple units are available, however, DFMs are not suited to capture variations in dynamics across…

  2. Rotation in the Dynamic Factor Modeling of Multivariate Stationary Time Series.

    ERIC Educational Resources Information Center

    Molenaar, Peter C. M.; Nesselroade, John R.

    2001-01-01

    Proposes a special rotation procedure for the exploratory dynamic factor model for stationary multivariate time series. The rotation procedure applies separately to each univariate component series of a q-variate latent factor series and transforms such a component, initially represented as white noise, into a univariate moving-average.…

  3. Modeling Associations among Multivariate Longitudinal Categorical Variables in Survey Data: A Semiparametric Bayesian Approach

    ERIC Educational Resources Information Center

    Tchumtchoua, Sylvie; Dey, Dipak K.

    2012-01-01

    This paper proposes a semiparametric Bayesian framework for the analysis of associations among multivariate longitudinal categorical variables in high-dimensional data settings. This type of data is frequent, especially in the social and behavioral sciences. A semiparametric hierarchical factor analysis model is developed in which the…

  4. Parametric Cost Models for Space Telescopes

    NASA Technical Reports Server (NTRS)

    Stahl, H. Philip

    2010-01-01

    A study is in-process to develop a multivariable parametric cost model for space telescopes. Cost and engineering parametric data has been collected on 30 different space telescopes. Statistical correlations have been developed between 19 variables of 59 variables sampled. Single Variable and Multi-Variable Cost Estimating Relationships have been developed. Results are being published.

  5. A Dynamic Intrusion Detection System Based on Multivariate Hotelling's T2 Statistics Approach for Network Environments

    PubMed Central

    Avalappampatty Sivasamy, Aneetha; Sundan, Bose

    2015-01-01

    The ever expanding communication requirements in today's world demand extensive and efficient network systems with equally efficient and reliable security features integrated for safe, confident, and secured communication and data transfer. Providing effective security protocols for any network environment, therefore, assumes paramount importance. Attempts are made continuously for designing more efficient and dynamic network intrusion detection models. In this work, an approach based on Hotelling's T2 method, a multivariate statistical analysis technique, has been employed for intrusion detection, especially in network environments. Components such as preprocessing, multivariate statistical analysis, and attack detection have been incorporated in developing the multivariate Hotelling's T2 statistical model and necessary profiles have been generated based on the T-square distance metrics. With a threshold range obtained using the central limit theorem, observed traffic profiles have been classified either as normal or attack types. Performance of the model, as evaluated through validation and testing using KDD Cup'99 dataset, has shown very high detection rates for all classes with low false alarm rates. Accuracy of the model presented in this work, in comparison with the existing models, has been found to be much better. PMID:26357668

  6. A Dynamic Intrusion Detection System Based on Multivariate Hotelling's T2 Statistics Approach for Network Environments.

    PubMed

    Sivasamy, Aneetha Avalappampatty; Sundan, Bose

    2015-01-01

    The ever expanding communication requirements in today's world demand extensive and efficient network systems with equally efficient and reliable security features integrated for safe, confident, and secured communication and data transfer. Providing effective security protocols for any network environment, therefore, assumes paramount importance. Attempts are made continuously for designing more efficient and dynamic network intrusion detection models. In this work, an approach based on Hotelling's T(2) method, a multivariate statistical analysis technique, has been employed for intrusion detection, especially in network environments. Components such as preprocessing, multivariate statistical analysis, and attack detection have been incorporated in developing the multivariate Hotelling's T(2) statistical model and necessary profiles have been generated based on the T-square distance metrics. With a threshold range obtained using the central limit theorem, observed traffic profiles have been classified either as normal or attack types. Performance of the model, as evaluated through validation and testing using KDD Cup'99 dataset, has shown very high detection rates for all classes with low false alarm rates. Accuracy of the model presented in this work, in comparison with the existing models, has been found to be much better.

  7. The choice of prior distribution for a covariance matrix in multivariate meta-analysis: a simulation study.

    PubMed

    Hurtado Rúa, Sandra M; Mazumdar, Madhu; Strawderman, Robert L

    2015-12-30

    Bayesian meta-analysis is an increasingly important component of clinical research, with multivariate meta-analysis a promising tool for studies with multiple endpoints. Model assumptions, including the choice of priors, are crucial aspects of multivariate Bayesian meta-analysis (MBMA) models. In a given model, two different prior distributions can lead to different inferences about a particular parameter. A simulation study was performed in which the impact of families of prior distributions for the covariance matrix of a multivariate normal random effects MBMA model was analyzed. Inferences about effect sizes were not particularly sensitive to prior choice, but the related covariance estimates were. A few families of prior distributions with small relative biases, tight mean squared errors, and close to nominal coverage for the effect size estimates were identified. Our results demonstrate the need for sensitivity analysis and suggest some guidelines for choosing prior distributions in this class of problems. The MBMA models proposed here are illustrated in a small meta-analysis example from the periodontal field and a medium meta-analysis from the study of stroke. Copyright © 2015 John Wiley & Sons, Ltd. Copyright © 2015 John Wiley & Sons, Ltd.

  8. Studying Resist Stochastics with the Multivariate Poisson Propagation Model

    DOE PAGES

    Naulleau, Patrick; Anderson, Christopher; Chao, Weilun; ...

    2014-01-01

    Progress in the ultimate performance of extreme ultraviolet resist has arguably decelerated in recent years suggesting an approach to stochastic limits both in photon counts and material parameters. Here we report on the performance of a variety of leading extreme ultraviolet resist both with and without chemical amplification. The measured performance is compared to stochastic modeling results using the Multivariate Poisson Propagation Model. The results show that the best materials are indeed nearing modeled performance limits.

  9. Multivariable Parametric Cost Model for Ground Optical Telescope Assembly

    NASA Technical Reports Server (NTRS)

    Stahl, H. Philip; Rowell, Ginger Holmes; Reese, Gayle; Byberg, Alicia

    2005-01-01

    A parametric cost model for ground-based telescopes is developed using multivariable statistical analysis of both engineering and performance parameters. While diameter continues to be the dominant cost driver, diffraction-limited wavelength is found to be a secondary driver. Other parameters such as radius of curvature are examined. The model includes an explicit factor for primary mirror segmentation and/or duplication (i.e., multi-telescope phased-array systems). Additionally, single variable models Based on aperture diameter are derived.

  10. Order-restricted inference for multivariate longitudinal data with applications to the natural history of hearing loss.

    PubMed

    Rosen, Sophia; Davidov, Ori

    2012-07-20

    Multivariate outcomes are often measured longitudinally. For example, in hearing loss studies, hearing thresholds for each subject are measured repeatedly over time at several frequencies. Thus, each patient is associated with a multivariate longitudinal outcome. The multivariate mixed-effects model is a useful tool for the analysis of such data. There are situations in which the parameters of the model are subject to some restrictions or constraints. For example, it is known that hearing thresholds, at every frequency, increase with age. Moreover, this age-related threshold elevation is monotone in frequency, that is, the higher the frequency, the higher, on average, is the rate of threshold elevation. This means that there is a natural ordering among the different frequencies in the rate of hearing loss. In practice, this amounts to imposing a set of constraints on the different frequencies' regression coefficients modeling the mean effect of time and age at entry to the study on hearing thresholds. The aforementioned constraints should be accounted for in the analysis. The result is a multivariate longitudinal model with restricted parameters. We propose estimation and testing procedures for such models. We show that ignoring the constraints may lead to misleading inferences regarding the direction and the magnitude of various effects. Moreover, simulations show that incorporating the constraints substantially improves the mean squared error of the estimates and the power of the tests. We used this methodology to analyze a real hearing loss study. Copyright © 2012 John Wiley & Sons, Ltd.

  11. The NLS-Based Nonlinear Grey Multivariate Model for Forecasting Pollutant Emissions in China

    PubMed Central

    Pei, Ling-Ling; Li, Qin

    2018-01-01

    The relationship between pollutant discharge and economic growth has been a major research focus in environmental economics. To accurately estimate the nonlinear change law of China’s pollutant discharge with economic growth, this study establishes a transformed nonlinear grey multivariable (TNGM (1, N)) model based on the nonlinear least square (NLS) method. The Gauss–Seidel iterative algorithm was used to solve the parameters of the TNGM (1, N) model based on the NLS basic principle. This algorithm improves the precision of the model by continuous iteration and constantly approximating the optimal regression coefficient of the nonlinear model. In our empirical analysis, the traditional grey multivariate model GM (1, N) and the NLS-based TNGM (1, N) models were respectively adopted to forecast and analyze the relationship among wastewater discharge per capita (WDPC), and per capita emissions of SO2 and dust, alongside GDP per capita in China during the period 1996–2015. Results indicated that the NLS algorithm is able to effectively help the grey multivariable model identify the nonlinear relationship between pollutant discharge and economic growth. The results show that the NLS-based TNGM (1, N) model presents greater precision when forecasting WDPC, SO2 emissions and dust emissions per capita, compared to the traditional GM (1, N) model; WDPC indicates a growing tendency aligned with the growth of GDP, while the per capita emissions of SO2 and dust reduce accordingly. PMID:29517985

  12. Population level determinants of acute mountain sickness among young men: a retrospective study.

    PubMed

    Li, Xiaoxiao; Tao, Fasheng; Pei, Tao; You, Haiyan; Liu, Yan; Gao, Yuqi

    2011-09-28

    Many visitors, including military troops, who enter highland regions from low altitude areas may suffer from acute mountain sickness (AMS), which negatively impacts workable man-hours and increases healthcare costs. The aim of this study was to evaluate the population level risk factors and build a multivariate model, which might be applicable to reduce the effects of AMS on Chinese young men traveling to this region. Chinese highland military medical records were used to obtain data of young men (n = 3727) who entered the Tibet plateau between the years of 2006-2009. The relationship between AMS and travel profile, demographic characteristics, and health behaviors were evaluated by logistic regression. Univariate logistic models estimated the crude odds ratio. The variables that showed significance in the univariate model were included in a multivariate model to derive adjusted odds ratios and build the final model. Data corresponding to odd and even years (2 subsets) were analyzed separately and used in a simple cross-validation. Univariate analysis indicated that travel profile, prophylactic use, ethnicity, and province of birth were all associated with AMS in both subsets. In multivariate analysis, young men who traveled from lower altitude (600-800 m vs. 1300-1500 m, adjusted odds ratio (AOR) = 1.32-1.44) to higher altitudes (4100-4300 m vs. 2900-3100 m, AOR = 3.94-4.12; 3600-3700 m vs. 2900-3100 m, AOR = 2.71-2.74) by air or rapid land transport for emergency mission deployment (emergency land deployment vs. normal land deployment, AOR = 2.08-2.11; normal air deployment vs. normal land deployment, AOR = 2.00-2.20; emergency air deployment vs. normal land deployment, AOR = 2.40-3.34) during the cold season (cold vs. warm, AOR = 1.25-1.28) are at great risk for developing AMS. Non-Tibetan male soldiers (Tibetan vs. Han, AOR = 0.03-0.08), born and raised in lower provinces (eastern vs. northwestern, AOR = 1.32-1.39), and deployed without prophylaxis (prophylactic drug vs. none, AOR = 0.75-0.76), also represented a population at significantly increased risk for AMS. The predicted model was built; the area under receiver operating characteristic curve was 0.703. Before a group of young men first enter a high altitude area, it is important that a health service plan should be made referring to the group's travel profile and with respect to young men's ethnicity and province of birth. Low-cost Chinese traditional prophylactic drugs might have some effect on decreasing the risk of AMS, although this needs further verification.

  13. Autochthonous lactic acid bacteria with probiotic aptitudes as starter cultures for fish-based products.

    PubMed

    Speranza, Barbara; Racioppo, Angela; Beneduce, Luciano; Bevilacqua, Antonio; Sinigaglia, Milena; Corbo, Maria Rosaria

    2017-08-01

    This study focused on the selection of lactic starters with probiotic properties for the production of fermented fish-products by the use of a multivariate approach (Cluster Analysis and Principal Component Analysis). Seventy-five isolates were recovered from fish intestinal microbiota and characterized by evaluating phenotypical, technological and probiotic traits; the most promising isolates were molecularly identified and then used into fish fermented sausage production. Namely, data from technological characterization were modelled through Growth Index and used as input to run a preliminary selection. Thus, 15 promising strains were selected and subjected to probiotic characterization; considering the results from probiotic tests, 3 promising strains were finally chosen (11, 68 and 69), identified as members of the genus Lactobacillus and used for the validation at laboratory level through the assessment of their performances for the production of fermented fish sausages. The results were promising as the use of the selected strains reduced the fermentation time (2 days) ensuring a good microbiological quality of the final product. Copyright © 2017 Elsevier Ltd. All rights reserved.

  14. A Review of PAT Strategies in Secondary Solid Oral Dosage Manufacturing of Small Molecules.

    PubMed

    Laske, Stephan; Paudel, Amrit; Scheibelhofer, Otto

    2017-03-01

    Pharmaceutical solid oral dosage product manufacturing is a well-established, yet revolutionizing area. To this end, process analytical technology (PAT) involves interdisciplinary and multivariate (chemical, physical, microbiological, and mathematical) methods for material (e.g., materials, intermediates, products) and process (e.g., temperature, pressure, throughput, etc.) analysis. This supports rational process modeling and enhanced control strategies for improved product quality and process efficiency. Therefore, it is often difficult to orient and find the relevant, integrated aspects of the current state-of-the-art. Especially, the link between fundamental research, in terms of sensor and control system development, to the application both in laboratory and manufacturing scale, is difficult to comprehend. This review compiles a nonexhaustive overview on current approaches from the recognized academia and industrial practices of PAT, including screening, selection, and final implementations in solid oral dosage manufacturing, through a wide diversity of use cases. Finally, the authors attempt to extract a common consensus toward developing PAT application guidance for different unit operations of drug product manufacturing. Copyright © 2017 American Pharmacists Association®. Published by Elsevier Inc. All rights reserved.

  15. Sequence-independent construction of ordered combinatorial libraries with predefined crossover points.

    PubMed

    Jézéquel, Laetitia; Loeper, Jacqueline; Pompon, Denis

    2008-11-01

    Combinatorial libraries coding for mosaic enzymes with predefined crossover points constitute useful tools to address and model structure-function relationships and for functional optimization of enzymes based on multivariate statistics. The presented method, called sequence-independent generation of a chimera-ordered library (SIGNAL), allows easy shuffling of any predefined amino acid segment between two or more proteins. This method is particularly well adapted to the exchange of protein structural modules. The procedure could also be well suited to generate ordered combinatorial libraries independent of sequence similarities in a robotized manner. Sequence segments to be recombined are first extracted by PCR from a single-stranded template coding for an enzyme of interest using a biotin-avidin-based method. This technique allows the reduction of parental template contamination in the final library. Specific PCR primers allow amplification of two complementary mosaic DNA fragments, overlapping in the region to be exchanged. Fragments are finally reassembled using a fusion PCR. The process is illustrated via the construction of a set of mosaic CYP2B enzymes using this highly modular approach.

  16. Space-time variation of respiratory cancers in South Carolina: a flexible multivariate mixture modeling approach to risk estimation.

    PubMed

    Carroll, Rachel; Lawson, Andrew B; Kirby, Russell S; Faes, Christel; Aregay, Mehreteab; Watjou, Kevin

    2017-01-01

    Many types of cancer have an underlying spatiotemporal distribution. Spatiotemporal mixture modeling can offer a flexible approach to risk estimation via the inclusion of latent variables. In this article, we examine the application and benefits of using four different spatiotemporal mixture modeling methods in the modeling of cancer of the lung and bronchus as well as "other" respiratory cancer incidences in the state of South Carolina. Of the methods tested, no single method outperforms the other methods; which method is best depends on the cancer under consideration. The lung and bronchus cancer incidence outcome is best described by the univariate modeling formulation, whereas the "other" respiratory cancer incidence outcome is best described by the multivariate modeling formulation. Spatiotemporal multivariate mixture methods can aid in the modeling of cancers with small and sparse incidences when including information from a related, more common type of cancer. Copyright © 2016 Elsevier Inc. All rights reserved.

  17. Multivariate Time Series Decomposition into Oscillation Components.

    PubMed

    Matsuda, Takeru; Komaki, Fumiyasu

    2017-08-01

    Many time series are considered to be a superposition of several oscillation components. We have proposed a method for decomposing univariate time series into oscillation components and estimating their phases (Matsuda & Komaki, 2017 ). In this study, we extend that method to multivariate time series. We assume that several oscillators underlie the given multivariate time series and that each variable corresponds to a superposition of the projections of the oscillators. Thus, the oscillators superpose on each variable with amplitude and phase modulation. Based on this idea, we develop gaussian linear state-space models and use them to decompose the given multivariate time series. The model parameters are estimated from data using the empirical Bayes method, and the number of oscillators is determined using the Akaike information criterion. Therefore, the proposed method extracts underlying oscillators in a data-driven manner and enables investigation of phase dynamics in a given multivariate time series. Numerical results show the effectiveness of the proposed method. From monthly mean north-south sunspot number data, the proposed method reveals an interesting phase relationship.

  18. Multivariate meta-analysis using individual participant data

    PubMed Central

    Riley, R. D.; Price, M. J.; Jackson, D.; Wardle, M.; Gueyffier, F.; Wang, J.; Staessen, J. A.; White, I. R.

    2016-01-01

    When combining results across related studies, a multivariate meta-analysis allows the joint synthesis of correlated effect estimates from multiple outcomes. Joint synthesis can improve efficiency over separate univariate syntheses, may reduce selective outcome reporting biases, and enables joint inferences across the outcomes. A common issue is that within-study correlations needed to fit the multivariate model are unknown from published reports. However, provision of individual participant data (IPD) allows them to be calculated directly. Here, we illustrate how to use IPD to estimate within-study correlations, using a joint linear regression for multiple continuous outcomes and bootstrapping methods for binary, survival and mixed outcomes. In a meta-analysis of 10 hypertension trials, we then show how these methods enable multivariate meta-analysis to address novel clinical questions about continuous, survival and binary outcomes; treatment–covariate interactions; adjusted risk/prognostic factor effects; longitudinal data; prognostic and multiparameter models; and multiple treatment comparisons. Both frequentist and Bayesian approaches are applied, with example software code provided to derive within-study correlations and to fit the models. PMID:26099484

  19. CONSTRUCTING KNOWLEDGE FROM MULTIVARIATE SPATIOTEMPORAL DATA: INTEGRATING GEOGRAPHIC VISUALIZATION WITH KNOWLEDGE DISCOVERY IN DATABASE METHODS. (R825195)

    EPA Science Inventory

    The perspectives, information and conclusions conveyed in research project abstracts, progress reports, final reports, journal abstracts and journal publications convey the viewpoints of the principal investigator and may not represent the views and policies of ORD and EPA. Concl...

  20. DEVELOPMENT OF HOURLY PROBABILISTIC UTILITY NOX EMISSION INVENTORIES USING TIME SERIES TECHNIQUES: PART 2-MULTIVARIATE APPROACH. (R826766)

    EPA Science Inventory

    The perspectives, information and conclusions conveyed in research project abstracts, progress reports, final reports, journal abstracts and journal publications convey the viewpoints of the principal investigator and may not represent the views and policies of ORD and EPA. Concl...

  1. Airborne gamma-ray spectrometer and magnetometer survey, Durango B, Colorado. Final report Volume II C. Detail area

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Not Available

    1983-01-01

    This volume contains eight appendices: flight line maps, geology maps, explanation of geologic legend, flight line/geology maps, radiometric contour maps, magnetic contour maps, multi-variant analysis maps, and geochemical factor analysis maps. These appendices pertain to the Durango B detail area.

  2. Regression Model Optimization for the Analysis of Experimental Data

    NASA Technical Reports Server (NTRS)

    Ulbrich, N.

    2009-01-01

    A candidate math model search algorithm was developed at Ames Research Center that determines a recommended math model for the multivariate regression analysis of experimental data. The search algorithm is applicable to classical regression analysis problems as well as wind tunnel strain gage balance calibration analysis applications. The algorithm compares the predictive capability of different regression models using the standard deviation of the PRESS residuals of the responses as a search metric. This search metric is minimized during the search. Singular value decomposition is used during the search to reject math models that lead to a singular solution of the regression analysis problem. Two threshold dependent constraints are also applied. The first constraint rejects math models with insignificant terms. The second constraint rejects math models with near-linear dependencies between terms. The math term hierarchy rule may also be applied as an optional constraint during or after the candidate math model search. The final term selection of the recommended math model depends on the regressor and response values of the data set, the user s function class combination choice, the user s constraint selections, and the result of the search metric minimization. A frequently used regression analysis example from the literature is used to illustrate the application of the search algorithm to experimental data.

  3. Separation of cognitive impairments in attention-deficit/hyperactivity disorder into 2 familial factors.

    PubMed

    Kuntsi, Jonna; Wood, Alexis C; Rijsdijk, Frühling; Johnson, Katherine A; Andreou, Penelope; Albrecht, Björn; Arias-Vasquez, Alejandro; Buitelaar, Jan K; McLoughlin, Gráinne; Rommelse, Nanda N J; Sergeant, Joseph A; Sonuga-Barke, Edmund J; Uebel, Henrik; van der Meere, Jaap J; Banaschewski, Tobias; Gill, Michael; Manor, Iris; Miranda, Ana; Mulas, Fernando; Oades, Robert D; Roeyers, Herbert; Rothenberger, Aribert; Steinhausen, Hans-Christoph; Faraone, Stephen V; Asherson, Philip

    2010-11-01

    Attention-deficit/hyperactivity disorder (ADHD) is associated with widespread cognitive impairments, but it is not known whether the apparent multiple impairments share etiological roots or separate etiological pathways exist. A better understanding of the etiological pathways is important for the development of targeted interventions and for identification of suitable intermediate phenotypes for molecular genetic investigations. To determine, by using a multivariate familial factor analysis approach, whether 1 or more familial factors underlie the slow and variable reaction times, impaired response inhibition, and choice impulsivity associated with ADHD. An ADHD and control sibling-pair design. Belgium, Germany, Ireland, Israel, Spain, Switzerland, and the United Kingdom. A total of 1265 participants, aged 6 to 18 years: 464 probands with ADHD and 456 of their siblings (524 with combined-subtype ADHD), and 345 control participants. Performance on a 4-choice reaction time task, a go/no-go inhibition task, and a choice-delay task. The final model consisted of 2 familial factors. The larger factor, reflecting 85% of the familial variance of ADHD, captured 98% to 100% of the familial influences on mean reaction time and reaction time variability. The second, smaller factor, reflecting 13% of the familial variance of ADHD, captured 62% to 82% of the familial influences on commission and omission errors on the go/no-go task. Choice impulsivity was excluded in the final model because of poor fit. The findings suggest the existence of 2 familial pathways to cognitive impairments in ADHD and indicate promising cognitive targets for future molecular genetic investigations. The familial distinction between the 2 cognitive impairments is consistent with recent theoretical models--a developmental model and an arousal-attention model--of 2 separable underlying processes in ADHD. Future research that tests the familial model within a developmental framework may inform developmentally sensitive interventions.

  4. Multivariable Parametric Cost Model for Ground Optical: Telescope Assembly

    NASA Technical Reports Server (NTRS)

    Stahl, H. Philip; Rowell, Ginger Holmes; Reese, Gayle; Byberg, Alicia

    2004-01-01

    A parametric cost model for ground-based telescopes is developed using multi-variable statistical analysis of both engineering and performance parameters. While diameter continues to be the dominant cost driver, diffraction limited wavelength is found to be a secondary driver. Other parameters such as radius of curvature were examined. The model includes an explicit factor for primary mirror segmentation and/or duplication (i.e. multi-telescope phased-array systems). Additionally, single variable models based on aperture diameter were derived.

  5. A Multivariate Multilevel Approach to the Modeling of Accuracy and Speed of Test Takers

    ERIC Educational Resources Information Center

    Klein Entink, R. H.; Fox, J. P.; van der Linden, W. J.

    2009-01-01

    Response times on test items are easily collected in modern computerized testing. When collecting both (binary) responses and (continuous) response times on test items, it is possible to measure the accuracy and speed of test takers. To study the relationships between these two constructs, the model is extended with a multivariate multilevel…

  6. Multivariate regression model for partitioning tree volume of white oak into round-product classes

    Treesearch

    Daniel A. Yaussy; David L. Sonderman

    1984-01-01

    Describes the development of multivariate equations that predict the expected cubic volume of four round-product classes from independent variables composed of individual tree-quality characteristics. Although the model has limited application at this time, it does demonstrate the feasibility of partitioning total tree cubic volume into round-product classes based on...

  7. The Dirichlet-Multinomial Model for Multivariate Randomized Response Data and Small Samples

    ERIC Educational Resources Information Center

    Avetisyan, Marianna; Fox, Jean-Paul

    2012-01-01

    In survey sampling the randomized response (RR) technique can be used to obtain truthful answers to sensitive questions. Although the individual answers are masked due to the RR technique, individual (sensitive) response rates can be estimated when observing multivariate response data. The beta-binomial model for binary RR data will be generalized…

  8. Tracking Problem Solving by Multivariate Pattern Analysis and Hidden Markov Model Algorithms

    ERIC Educational Resources Information Center

    Anderson, John R.

    2012-01-01

    Multivariate pattern analysis can be combined with Hidden Markov Model algorithms to track the second-by-second thinking as people solve complex problems. Two applications of this methodology are illustrated with a data set taken from children as they interacted with an intelligent tutoring system for algebra. The first "mind reading" application…

  9. Four Families of Multi-Variant Issues in Graduate-Level Asynchronous Online Courses

    ERIC Educational Resources Information Center

    Gisburne, Jaclyn M.; Fairchild, Patricia J.

    2004-01-01

    This is the first of several papers developed from a faculty and student perspective describing a new distance learning (DL) model. Integral to the model are four interrelated families of multi-variant issues, referred to here as (a) the academic divide, (b) student misalignment, (c) administrative influences, and (d) the use of student…

  10. Assessing Reliability of Student Ratings of Advisor: A Comparison of Univariate and Multivariate Generalizability Approaches.

    ERIC Educational Resources Information Center

    Sun, Anji; Valiga, Michael J.

    In this study, the reliability of the American College Testing (ACT) Program's "Survey of Academic Advising" (SAA) was examined using both univariate and multivariate generalizability theory approaches. The primary purpose of the study was to compare the results of three generalizability theory models (a random univariate model, a mixed…

  11. Web-Based Tools for Modelling and Analysis of Multivariate Data: California Ozone Pollution Activity

    ERIC Educational Resources Information Center

    Dinov, Ivo D.; Christou, Nicolas

    2011-01-01

    This article presents a hands-on web-based activity motivated by the relation between human health and ozone pollution in California. This case study is based on multivariate data collected monthly at 20 locations in California between 1980 and 2006. Several strategies and tools for data interrogation and exploratory data analysis, model fitting…

  12. Multivariate Generalizations of Student's t-Distribution. ONR Technical Report. [Biometric Lab Report No. 90-3.

    ERIC Educational Resources Information Center

    Gibbons, Robert D.; And Others

    In the process of developing a conditionally-dependent item response theory (IRT) model, the problem arose of modeling an underlying multivariate normal (MVN) response process with general correlation among the items. Without the assumption of conditional independence, for which the underlying MVN cdf takes on comparatively simple forms and can be…

  13. Bias and Precision of Measures of Association for a Fixed-Effect Multivariate Analysis of Variance Model

    ERIC Educational Resources Information Center

    Kim, Soyoung; Olejnik, Stephen

    2005-01-01

    The sampling distributions of five popular measures of association with and without two bias adjusting methods were examined for the single factor fixed-effects multivariate analysis of variance model. The number of groups, sample sizes, number of outcomes, and the strength of association were manipulated. The results indicate that all five…

  14. Market interdependence among commodity prices based on information transmission on the Internet

    NASA Astrophysics Data System (ADS)

    Ji, Qiang; Guo, Jian-Feng

    2015-05-01

    Human behaviour on the Internet has become a synchro-projection of real society. In this paper, we introduce the public concern derived from query volumes on the Web to empirically analyse the influence of information on commodity markets (e.g., crude oil, heating oil, corn and gold) using multivariate GARCH models based on dynamic conditional correlations. The analysis found that the changes of public concern on the Internet can well depict the changes of market prices, as the former has significant Granger causality effects on market prices. The findings indicate that the information of external shocks to commodity markets could be transmitted quickly, and commodity markets easily absorb the public concern of the information-sensitive traders. Finally, the conditional correlation among commodity prices varies dramatically over time.

  15. Reporting quality of multivariable logistic regression in selected Indian medical journals.

    PubMed

    Kumar, R; Indrayan, A; Chhabra, P

    2012-01-01

    Use of multivariable logistic regression (MLR) modeling has steeply increased in the medical literature over the past few years. Testing of model assumptions and adequate reporting of MLR allow the reader to interpret results more accurately. To review the fulfillment of assumptions and reporting quality of MLR in selected Indian medical journals using established criteria. Analysis of published literature. Medknow.com publishes 68 Indian medical journals with open access. Eight of these journals had at least five articles using MLR between the years 1994 to 2008. Articles from each of these journals were evaluated according to the previously established 10-point quality criteria for reporting and to test the MLR model assumptions. SPSS 17 software and non-parametric test (Kruskal-Wallis H, Mann Whitney U, Spearman Correlation). One hundred and nine articles were finally found using MLR for analyzing the data in the selected eight journals. The number of such articles gradually increased after year 2003, but quality score remained almost similar over time. P value, odds ratio, and 95% confidence interval for coefficients in MLR was reported in 75.2% and sufficient cases (>10) per covariate of limiting sample size were reported in the 58.7% of the articles. No article reported the test for conformity of linear gradient for continuous covariates. Total score was not significantly different across the journals. However, involvement of statistician or epidemiologist as a co-author improved the average quality score significantly (P=0.014). Reporting of MLR in many Indian journals is incomplete. Only one article managed to score 8 out of 10 among 109 articles under review. All others scored less. Appropriate guidelines in instructions to authors, and pre-publication review of articles using MLR by a qualified statistician may improve quality of reporting.

  16. Intermittent and daily smokers: two different socioeconomic patterns, and diverging influence of social participation

    PubMed Central

    Lindstrom, M.; Ostergren, P.

    2001-01-01

    OBJECTIVE—To investigate socioeconomic differences in intermittent and daily smoking, and to assess the association between social participation and these two smoking behaviours.
DESIGN/SETTING/PARTICIPANTS/MEASUREMENTS—A population of 11 837 individuals interviewed in 1992-94, aged 45-64 years, was investigated in this cross sectional study. A multivariate logistic regression model was used to assess socioeconomic differences in daily and intermittent smoking, adjusting for age, country of origin, previous/current diseases, and marital status. Finally, social participation as a measure of social capital was introduced in the multivariate model.
RESULTS—When unskilled manual workers were compared to high level non-manual employees, odds ratios of 2.3 (95% confidence interval (CI) 1.7 to 3.0) for men and 1.9 (95% CI 1.4 to 2.5) for women were found in regard to daily smoking, but odd ratios of only 0.7 (95% CI 0.4 to 1.2) for men and 1.3 (95% CI 0.7 to 2.4) for women were found in regard to intermittent smoking. A decrease in the daily smoking odds ratios was found when social participation was introduced in the model, while the odds ratios regarding intermittent smoking were unaffected.
CONCLUSIONS—There were no socioeconomic differences in intermittent smoking and no association with social participation, a result that contrasts sharply with the patterns of daily smoking. These findings have important implications for the discussion concerning social capital and preventive measures.


Keywords: intermittent smoking; daily smoking; socioeconomic status; social participation; social capital PMID:11544391

  17. The role of footwear in the prevention of foot lesions in patients with NIDDM. Conventional wisdom or evidence-based practice?

    PubMed

    Litzelman, D K; Marriott, D J; Vinicor, F

    1997-02-01

    To conduct a prospective evaluation of footwear characteristics as predictors of diabetic foot wounds. A total of 352 patients with NIDDM enrolled in a randomized controlled trial aimed at preventing diabetic foot lesions in an academic general medicine practice were studied. Foot wounds (n = 63) were modeled univariately and multivariably using generalized estimating equations. The dependent variable was a wound classified as a 1.2 or greater according to the Seattle Wound Classification System, indicating at least a superficial or healing minor lesion with no functional interruption of the protective cutaneous barrier. Independent variables included detailed measures of style and material of patients' indoor and outdoor shoes, appropriate length and width, sock fibers, whether the patient had bought new shoes in the past 6 months, and if the patient had been recommended for special shoes. Modeling controlled for intervention status and physiological measures (baseline wound, monofilament abnormalities, and serum HDL level). Initial screening (P < 0.20) suggested that a recommendation for special shoes, shoe length, and shoe width were indicative of wounds at follow-up (odds ratios [ORs] 2.19, 1.84, 1.86, respectively), while having bought shoes in the past 6 months was associated with no wound at follow-up (OR 0.60). The final multivariable model included only the recommendation for special shoes (OR 2.19; 95% CI 1.07-4.49). Many variables commonly cited as protective measures in footwear for diabetic patients were not prospectively predictive when controlling for physiological risk factors. Rigorous analyses are needed to examine the many assumptions regarding footwear recommendations for diabetic patients.

  18. Clinical predictors and outcome implications of early readmission in lung transplant recipients.

    PubMed

    Osho, Asishana A; Castleberry, Anthony W; Yerokun, Babatunde A; Mulvihill, Michael S; Rucker, Justin; Snyder, Laurie D; Davis, Robert D; Hartwig, Matthew G

    2017-05-01

    The purpose of this study was to identify risk factors and outcome implications for 30-day hospital readmission in lung transplant recipients. We conducted a retrospective cohort study of lung transplant cases from a single, high-volume lung transplant program between January 2000 and March 2012. Demographic and health data were reviewed for all patients. Risk factors for 30-day readmission (defined as readmission within 30 days of discharge from index lung transplant hospitalization) were modeled using logistic regression, with selection of parameters by backward elimination. The sample comprised 795 patients after excluding scheduled readmissions and in-hospital deaths. Overall 30-day readmission rate was 45.4% (n = 361). Readmission rates were similar across different diagnosis categories and procedure types. By univariate analysis, post-operative complications that predisposed to 30-day readmission included pneumonia, any infection, and atrial fibrillation (all p < 0.05). In the final multivariate model, occurrence of any post-transplant complication was the most significant risk factor for 30-day readmission (odds ratio = 1.764; 95% confidence interval, 1.259-2.470). Even for patients with no documented perioperative complication, readmission rates were still >35%. Kaplan-Meier analysis and multi-variate regression modeling to assess readmission as a predictor of long-term outcomes showed that 30-day readmission was not a significant predictor of worse survival in lung recipients. Occurrence of at least 1 post-transplant complication increases risk for 30-day readmission in lung transplant recipients. In this patient population, 30-day readmission does not predispose to adverse long-term survival. Quality indicators other than 30-day readmission may be needed to assess hospitals that perform lung transplantation. Copyright © 2017 International Society for Heart and Lung Transplantation. Published by Elsevier Inc. All rights reserved.

  19. Investigation of the Sensitivity of Transmission Raman Spectroscopy for Polymorph Detection in Pharmaceutical Tablets.

    PubMed

    Feng, Hanzhou; Bondi, Robert W; Anderson, Carl A; Drennen, James K; Igne, Benoît

    2017-08-01

    Polymorph detection is critical for ensuring pharmaceutical product quality in drug substances exhibiting polymorphism. Conventional analytical techniques such as X-ray powder diffraction and solid-state nuclear magnetic resonance are utilized primarily for characterizing the presence and identity of specific polymorphs in a sample. These techniques have encountered challenges in analyzing the constitution of polymorphs in the presence of other components commonly found in pharmaceutical dosage forms. Laborious sample preparation procedures are usually required to achieve satisfactory data interpretability. There is a need for alternative techniques capable of probing pharmaceutical dosage forms rapidly and nondestructively, which is dictated by the practical requirements of applications such as quality monitoring on production lines or when quantifying product shelf lifetime. The sensitivity of transmission Raman spectroscopy for detecting polymorphs in final tablet cores was investigated in this work. Carbamazepine was chosen as a model drug, polymorph form III is the commercial form, whereas form I is an undesired polymorph that requires effective detection. The concentration of form I in a direct compression tablet formulation containing 20% w/w of carbamazepine, 74.00% w/w of fillers (mannitol and microcrystalline cellulose), and 6% w/w of croscarmellose sodium, silicon dioxide, and magnesium stearate was estimated using transmission Raman spectroscopy. Quantitative models were generated and optimized using multivariate regression and data preprocessing. Prediction uncertainty was estimated for each validation sample by accounting for all the main variables contributing to the prediction. Multivariate detection limits were calculated based on statistical hypothesis testing. The transmission Raman spectroscopic model had an absolute prediction error of 0.241% w/w for the independent validation set. The method detection limit was estimated at 1.31% w/w. The results demonstrated that transmission Raman spectroscopy is a sensitive tool for polymorphs detection in pharmaceutical tablets.

  20. Comparison of perioperative outcomes between open and robotic radical cystectomy: a population based analysis.

    PubMed

    Nazzani, Sebastiano; Mazzone, Elio; Preisser, Felix; Bandini, Marco; Tian, Zhe; Marchioni, Michele; Ratti, Dario; Motta, Gloria; Zorn, Kevin Christopher; Briganti, Alberto; Shariat, Shahrokh F; Montanari, Emanuele; Carmignani, Luca; Karakiewicz, Pierre I

    2018-05-30

    Radical cystectomy represents the standard of care for muscle invasive bladder cancer (MIBC). Due to its novelty the use of robotic radical cystectomy (RARC) is still under debate. We examined intraoperative and postoperative morbidity and mortality as well as impact on length of stay (LOS) and total hospital charges (THCGs) of RARC compared to open radical cystectomy (ORC). Within National Inpatient Sample (NIS) (2008-2013), we identified patients with non-metastatic bladder cancer treated with either ORC or RARC. We relied on inverse probability of treatment weighting (IPTW) to reduce the effect of inherent differences between ORC vs. RARC. Multivariable logistic regression (MLR) and multivariable Poisson regression models (MPR) were used. Of all 10 027 patients, 12.6% underwent RARC. Between 2008 and 2013, RARC rates increased from 0.8 to 20.4% [Estimated annual percentage change (EAPC): +26.5%, CI: +11.1 to +48.3; p=0.035] and RARC THCGs decreased from 45 981 to 31 749 United States Dollars (EAPC: -6.8%, CI: -9.6 to -3.9; p=0.01). In MLR models RARC resulted in lower rates of overall complications (OR: 0.6; p <0.001) and transfusions (OR: 0.44; p <0.001). In MPR models, RARC was associated with shorter LOS [relative risk(RR)0.91 ; p <0.001]. Finally, higher THCGs (OR: 1.09; p <0.001) were recorded for RARC. Data are retrospective and no tumor characteristics were available. RARC is related to lower rates of overall complications and transfusions rates. In consequence, RARC is a safe and feasible technique in select muscle invasive bladder cancer patients. Moreover, RARC is associated with shorter LOS albeit higher THCGs.  .

  1. Refining the Association of Fever with Functional Outcome in Aneurysmal Subarachnoid Hemorrhage.

    PubMed

    Kramer, Christopher L; Pegoli, Marianna; Mandrekar, Jay; Lanzino, Giuseppe; Rabinstein, Alejandro A

    2017-02-01

    We analyzed the impact of cause, severity, and duration of fever on functional outcome in patients with aneurysmal subarachnoid hemorrhage (aSAH). Fever characteristics were analyzed in 584 consecutive patients with aSAH. Fever was defined as core body temperature ≥38.3 °C on ≥2 consecutive days. Subfebrile measurements were those between 37 and 38.2 °C. Febrile and subfebrile loads were the number of hours with fever or subfebrile measurements, respectively. Univariate and multivariate logistic regression models were developed to define predictors of outcome using various categorizations of fever cause, severity, and duration. Febrile measurements were observed in 281/584 (48.1 %) patients, recurring over a mean duration of 2.1 ± 3.0 days. Early fever within 24 and 72 h was encountered in 69 (11.9 %) and 110 (18.9 %) of patients, respectively. An infectious source was discovered in 126 (44.8 %) febrile patients. On univariate analysis, days of fever, febrile load, and fever onset within 24 and 72 h were associated with poor outcome (all p < 0.001); but subfebrile load was not (p = 0.56). On multivariate model constructed with all variables associated with outcome on univariate analyses, days of fever remained independently associated with poor outcome (OR 1.14 of poor outcome per day of fever, 95 % CI 1.06-1.22; p = 0.0006) displacing all other fever measures from the final model. Early onset of fever, number of hours with fever, and especially days of fever are associated with poor functional outcome. Conversely, subfebrile load does not influence clinical outcome. These data suggest prolonged fever should be avoided, but subfebrile temperatures may not justify intervention.

  2. Lifetime risks for aneurysmal subarachnoid haemorrhage: multivariable risk stratification.

    PubMed

    Vlak, Monique H M; Rinkel, Gabriel J E; Greebe, Paut; Greving, Jacoba P; Algra, Ale

    2013-06-01

    The overall incidence of aneurysmal subarachnoid haemorrhage (aSAH) in western populations is around 9 per 100 000 person-years, which confers to a lifetime risk of around half per cent. Risk factors for aSAH are usually expressed as relative risks and suggest that absolute risks vary considerably according to risk factor profiles, but such estimates are lacking. We aimed to estimate incidence and lifetime risks of aSAH according to risk factor profiles. We used data from 250 patients admitted with aSAH and 574 sex-matched and age-matched controls, who were randomly retrieved from general practitioners files. We determined independent prognostic factors with multivariable logistic regression analyses and assessed discriminatory performance using the area under the receiver operating characteristic curve. Based on the prognostic model we predicted incidences and lifetime risks of aSAH for different risk factor profiles. The four strongest independent predictors for aSAH, namely current smoking (OR 6.0; 95% CI 4.1 to 8.6), a positive family history for aSAH (4.0; 95% CI 2.3 to 7.0), hypertension (2.4; 95% CI 1.5 to 3.8) and hypercholesterolaemia (0.2; 95% CI 0.1 to 0.4), were used in the final prediction model. This model had an area under the receiver operating characteristic curve of 0.73 (95% CI 0.69 to 0.76). Depending on sex, age and the four predictors, the incidence of aSAH ranged from 0.4/100 000 to 298/100 000 person-years and lifetime risk between 0.02% and 7.2%. The incidence and lifetime risk of aSAH in the general population varies widely according to risk factor profiles. Whether persons with high risks benefit from screening should be assessed in cost-effectiveness studies.

  3. Robustness analysis of an air heating plant and control law by using polynomial chaos

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Colón, Diego; Ferreira, Murillo A. S.; Bueno, Átila M.

    2014-12-10

    This paper presents a robustness analysis of an air heating plant with a multivariable closed-loop control law by using the polynomial chaos methodology (MPC). The plant consists of a PVC tube with a fan in the air input (that forces the air through the tube) and a mass flux sensor in the output. A heating resistance warms the air as it flows inside the tube, and a thermo-couple sensor measures the air temperature. The plant has thus two inputs (the fan's rotation intensity and heat generated by the resistance, both measured in percent of the maximum value) and two outputsmore » (air temperature and air mass flux, also in percent of the maximal value). The mathematical model is obtained by System Identification techniques. The mass flux sensor, which is nonlinear, is linearized and the delays in the transfer functions are properly approximated by non-minimum phase transfer functions. The resulting model is transformed to a state-space model, which is used for control design purposes. The multivariable robust control design techniques used is the LQG/LTR, and the controllers are validated in simulation software and in the real plant. Finally, the MPC is applied by considering some of the system's parameters as random variables (one at a time, and the system's stochastic differential equations are solved by expanding the solution (a stochastic process) in an orthogonal basis of polynomial functions of the basic random variables. This method transforms the stochastic equations in a set of deterministic differential equations, which can be solved by traditional numerical methods (That is the MPC). Statistical data for the system (like expected values and variances) are then calculated. The effects of randomness in the parameters are evaluated in the open-loop and closed-loop pole's positions.« less

  4. Clostridium difficile Associated Risk of Death Score (CARDS): A novel severity score to predict mortality among hospitalized patients with Clostridium difficile infection

    PubMed Central

    Kassam, Zain; Fabersunne, Camila Cribb; Smith, Mark B.; Alm, Eric J.; Kaplan, Gilaad G.; Nguyen, Geoffrey C.; Ananthakrishnan, Ashwin N.

    2016-01-01

    Background Clostridium difficile infection (CDI) is public health threat and associated with significant mortality. However, there is a paucity of objectively derived CDI severity scoring systems to predict mortality. Aims To develop a novel CDI risk score to predict mortality entitled: Clostridium difficile Associated Risk of Death Score (CARDS). Methods We obtained data from the United States 2011 Nationwide Inpatient Sample (NIS) database. All CDI-associated hospitalizations were identified using discharge codes (ICD-9-CM, 008.45). Multivariate logistic regression was utilized to identify independent predictors of mortality. CARDS was calculated by assigning a numeric weight to each parameter based on their odds ratio in the final logistic model. Predictive properties of model discrimination were assessed using the c-statistic and validated in an independent sample using the 2010 NIS database. Results We identified 77,776 hospitalizations, yielding an estimate of 374,747 cases with an associated diagnosis of CDI in the United States, 8% of whom died in the hospital. The 8 severity score predictors were identified on multivariate analysis: age, cardiopulmonary disease, malignancy, diabetes, inflammatory bowel disease, acute renal failure, liver disease and ICU admission, with weights ranging from −1 (for diabetes) to 5 (for ICU admission). The overall risk score in the cohort ranged from 0 to 18. Mortality increased significantly as CARDS increased. CDI-associated mortality was 1.2% with a CARDS of 0 compared to 100% with CARDS of 18. The model performed equally well in our validation cohort. Conclusion CARDS is a promising simple severity score to predict mortality among those hospitalized with CDI. PMID:26849527

  5. Quality of life after lacunar stroke: the Secondary Prevention of Small Subcortical Strokes study.

    PubMed

    Dhamoon, Mandip S; McClure, Leslie A; White, Carole L; Lau, Helena; Benavente, Oscar; Elkind, Mitchell S V

    2014-01-01

    We sought to describe the course and predictors of quality of life (QOL) after lacunar stroke. We hypothesized that there is a decline in QOL after recovery from lacunar stroke. The Secondary Prevention of Small Subcortical Strokes is a clinical trial in lacunar stroke patients with annual assessments of QOL with the stroke-specific QOL score. The overall score was used and analyzed as a continuous variable (range 0-5). We fit linear mixed models to assess the trend in QOL over time, assuming linearity of time, and adjusted for demographics, medical risk factors, cognitive factors, and functional status in univariable and multivariable models. Among 2870 participants, mean age was 63.4 years (SD 10.7), 63% were men, 51% White, 32% Hispanic, 36% had college education, 36% had diabetes, 89% had hypertension, and 10% had prior stroke. Mean poststroke Barthel Index (BI) score was 95.4 (assessed on average 6 months after stroke). In the final multivariable model, there was an average increase in QOL of .6% per year, and factors associated with decline in QOL over time included age (-.0003 per year, P < .0001), any college education (-.0013 per year, .01), prior stroke (-.004 per year, P < .0001), and BI (-.0002 per year, P < .0001). In this clinical trial of lacunar stroke patients, there was a slight annual increase in QOL overall, and age, level of education, and prior stroke were associated with changes in QOL over time. Multiple strokes may cause decline in QOL over time in the absence of recurrent events. Copyright © 2014 National Stroke Association. Published by Elsevier Inc. All rights reserved.

  6. MULTIVARIATE ANALYSES (CONONICAL CORRELATION AND PARTIAL LEAST SQUARE, PLS) TO MODEL AND ASSESS THE ASSOCIATION OF LANDSCAPE METRICS TO SURFACE WATER CHEMICAL AND BIOLOGICAL PROPERTIES USING SAVANNAH RIVER BASIN DATA.

    EPA Science Inventory

    Many multivariate methods are used in describing and predicting relation; each has its unique usage of categorical and non-categorical data. In multivariate analysis of variance (MANOVA), many response variables (y's) are related to many independent variables that are categorical...

  7. Modeling and simulation of maintenance treatment in first-line non-small cell lung cancer with external validation.

    PubMed

    Han, Kelong; Claret, Laurent; Sandler, Alan; Das, Asha; Jin, Jin; Bruno, Rene

    2016-07-13

    Maintenance treatment (MTx) in responders following first-line treatment has been investigated and practiced for many cancers. Modeling and simulation may support interpretation of interim data and development decisions. We aimed to develop a modeling framework to simulate overall survival (OS) for MTx in NSCLC using tumor growth inhibition (TGI) data. TGI metrics were estimated using longitudinal tumor size data from two Phase III first-line NSCLC studies evaluating bevacizumab and erlotinib as MTx in 1632 patients. Baseline prognostic factors and TGI metric estimates were assessed in multivariate parametric models to predict OS. The OS model was externally validated by simulating a third independent NSCLC study (n = 253) based on interim TGI data (up to progression-free survival database lock). The third study evaluated pemetrexed + bevacizumab vs. bevacizumab alone as MTx. Time-to-tumor-growth (TTG) was the best TGI metric to predict OS. TTG, baseline tumor size, ECOG score, Asian ethnicity, age, and gender were significant covariates in the final OS model. The OS model was qualified by simulating OS distributions and hazard ratios (HR) in the two studies used for model-building. Simulations of the third independent study based on interim TGI data showed that pemetrexed + bevacizumab MTx was unlikely to significantly prolong OS vs. bevacizumab alone given the current sample size (predicted HR: 0.81; 95 % prediction interval: 0.59-1.09). Predicted median OS was 17.3 months and 14.7 months in both arms, respectively. These simulations are consistent with the results of the final OS analysis published 2 years later (observed HR: 0.87; 95 % confidence interval: 0.63-1.21). Final observed median OS was 17.1 months and 13.2 months in both arms, respectively, consistent with our predictions. A robust TGI-OS model was developed for MTx in NSCLC. TTG captures treatment effect. The model successfully predicted the OS outcomes of an independent study based on interim TGI data and thus may facilitate trial simulation and interpretation of interim data. The model was built based on erlotinib data and externally validated using pemetrexed data, suggesting that TGI-OS models may be treatment-independent. The results supported the use of longitudinal tumor size and TTG as endpoints in early clinical oncology studies.

  8. An area-level model of vehicle-pedestrian injury collisions with implications for land use and transportation planning.

    PubMed

    Wier, Megan; Weintraub, June; Humphreys, Elizabeth H; Seto, Edmund; Bhatia, Rajiv

    2009-01-01

    There is growing awareness among urban planning, public health, and transportation professionals that design decisions and investments that promote walking can be beneficial for human and ecological health. Planners need practical tools to consider the impact of development on pedestrian safety, a key requirement for the promotion of walking. Simple bivariate models have been used to predict changes in vehicle-pedestrian injury collisions based on changes in traffic volume. We describe the development of a multivariate, area-level regression model of vehicle-pedestrian injury collisions based on environmental and population data in 176 San Francisco, California census tracts. Predictor variables examined included street, land use, and population characteristics, including commute behaviors. The final model explained approximately 72% of the systematic variation in census-tract vehicle-pedestrian injury collisions and included measures of traffic volume, arterial streets without transit, land area, proportion of land area zoned for neighborhood commercial and residential-neighborhood commercial uses, employee and resident populations, proportion of people living in poverty and proportion aged 65 and older. We have begun to apply this model to predict area-level change in vehicle-pedestrian injury collisions associated with land use development and transportation planning decisions.

  9. Multivariate analysis of the impacts of the turbine fuel JP-4 in a microcosm toxicity test with implications for the evaluation of ecosystem dynamics and risk assessment.

    PubMed

    Landis, W G; Matthews, R A; Markiewicz, A J; Matthews, G B

    1993-12-01

    Turbine fuels are often the only aviation fuel available in most of the world. Turbine fuels consist of numerous constituents with varying water solubilities, volatilities and toxicities. This study investigates the toxicity of the water soluble fraction (WSF) of JP-4 using the Standard Aquatic Microcosm (SAM). Multivariate analysis of the complex data, including the relatively new method of nonmetric clustering, was used and compared to more traditional analyses. Particular emphasis is placed on ecosystem dynamics in multivariate space.The WSF is prepared by vigorously mixing the fuel and the SAM microcosm media in a separatory funnel. The water phase, which contains the water-soluble fraction of JP-4 is then collected. The SAM experiment was conducted using concentrations of 0.0, 1.5 and 15% WSF. The WSF is added on day 7 of the experiments by removing 450 ml from each microcosm including the controls, then adding the appropriate amount of toxicant solution and finally bringing the final volume to 3 L with microcosm media. Analysis of the WSF was performed by purge and trap gas chromatography. The organic constituents of the WSF were not recoverable from the water column within several days of the addition of the toxicant. However, the impact of the WSF on the microcosm was apparent. In the highest initial concentration treatment group an algal bloom ensued, generated by the apparent toxicity of the WSF of JP-4 to the daphnids. As the daphnid populations recovered the algal populations decreased to control values. Multivariate methods clearly demonstrated this initial impact along with an additional oscillation seperating the four treatment groups in the latter segment of the experiment. Apparent recovery may be an artifact of the projections used to describe the multivariate data. The variables that were most important in distinguishing the four groups shifted during the course of the 63 day experiment. Even this simple microcosm exhibited a variety of dynamics, with implications for biomonitoring schemes and ecological risk assessments.

  10. Toward the Multivariate Modeling of Achievement, Aptitude, and Personality.

    ERIC Educational Resources Information Center

    Foshay, Wellesley R.; Misanchuk, Earl R.

    1981-01-01

    A multivariate investigation of the dynamics of cumulative achievement studied the influence of course grades, personality traits, environmental variables, and previous performance. The latter was the best single predictor of performance. (CJ)

  11. Prevalence and determinants of cardiovascular disease risk factors among the residents of urban community housing projects in Malaysia.

    PubMed

    Amiri, Mohammadreza; Majid, Hazreen Abdul; Hairi, FarizahMohd; Thangiah, Nithiah; Bulgiba, Awang; Su, Tin Tin

    2014-01-01

    The objectives are to assess the prevalence and determinants of cardiovascular disease (CVD) risk factors among the residents of Community Housing Projects in metropolitan Kuala Lumpur, Malaysia. By using simple random sampling, we selected and surveyed 833 households which comprised of 3,722 individuals. Out of the 2,360 adults, 50.5% participated in blood sampling and anthropometric measurement sessions. Uni and bivariate data analysis and multivariate binary logistic regression were applied to identify demographic and socioeconomic determinants of the existence of having at least one CVD risk factor. As a Result, while obesity (54.8%), hypercholesterolemia (51.5%), and hypertension (39.3%) were the most common CVD risk factors among the low-income respondents, smoking (16.3%), diabetes mellitus (7.8%) and alcohol consumption (1.4%) were the least prevalent. Finally, the results from the multivariate binary logistic model illustrated that compared to the Malays, the Indians were 41% less likely to have at least one of the CVD risk factors (OR = 0.59; 95% CI: 0.37 - 0.93). In Conclusion, the low-income individuals were at higher risk of developing CVDs. Prospective policies addressing preventive actions and increased awareness focusing on low-income communities are highly recommended and to consider age, gender, ethnic backgrounds, and occupation classes.

  12. Understanding the relationship between access to care and facility-based delivery through analysis of the 2008 Ghana Demographic Health Survey.

    PubMed

    Moyer, Cheryl A; McLaren, Zoë M; Adanu, Richard M; Lantz, Paula M

    2013-09-01

    To determine the types of access to care most strongly associated with facility-based delivery among women in Ghana. Data relating to the "5 As of Access" framework were extracted from the 2008 Ghana Demographic Health Survey and analyzed using multivariate logistic regression. In all, 55.5% of a weighted sample of 1102 women delivered in a healthcare facility, whereas 45.5% delivered at home. Affordability was the strongest access factor associated with delivery location, with health insurance coverage tripling the odds of facility delivery. Availability, accessibility (except urban residence), acceptability, and social access variables were not significant factors in the final models. Social access variables, including needing permission to seek healthcare and not being involved in decisions regarding healthcare, were associated with a reduced likelihood of facility-based delivery when examined individually. Multivariate analysis suggested that these variables reflected maternal literacy, health insurance coverage, and household wealth, all of which attenuated the effects of social access. Affordability was an important determinant of facility delivery in Ghana-even among women with health insurance-but social access variables had a mediating role. Copyright © 2013 International Federation of Gynecology and Obstetrics. Published by Elsevier Ireland Ltd. All rights reserved.

  13. A case-control study of pathogen and lifestyle risk factors for diarrhoea in dogs.

    PubMed

    Stavisky, Jenny; Radford, Alan David; Gaskell, Rosalind; Dawson, Susan; German, Alex; Parsons, Bryony; Clegg, Simon; Newman, Jenny; Pinchbeck, Gina

    2011-05-01

    Diarrhoea is a common and multi-factorial condition in dogs, the aetiology of which is often incompletely understood. A case-control study was carried out to compare the carriage of some common canine enteric pathogens (enteric coronavirus, parvovirus, distemper, endoparasites, Campylobacter and Salmonella spp.), as well as lifestyle factors such as vaccination history, diet and contact with other species, in dogs presenting at first opinion veterinary practices with and without diarrhoea. Multivariable conditional logistic regression showed that dogs in the study which scavenged or had had a recent change of diet (OR 3.5, p=0.002), had recently stayed in kennels (OR 9.5, p=0.01), or were fed a home-cooked diet (OR 4, p=0.002) were at a significantly greater risk of diarrhoea, whilst being female (OR 0.4, p=0.01), currently up to date with routine vaccinations (OR 0.4, p=0.05) and having contact with horse faeces (OR 0.4, p=0.06) were associated with a reduced risk. None of the pathogens tested for was a significant factor in the final multivariable model suggesting that in this predominantly vaccinated population, diarrhoea may be more associated with lifestyle risk factors than specific pathogens. Copyright © 2011 Elsevier B.V. All rights reserved.

  14. Uterine fibroids at routine second-trimester ultrasound survey and risk of sonographic short cervix.

    PubMed

    Blitz, Matthew J; Rochelson, Burton; Augustine, Stephanie; Greenberg, Meir; Sison, Cristina P; Vohra, Nidhi

    2016-11-01

    To determine whether women with sonographically identified uterine fibroids are at higher risk for a short cervix. This retrospective cohort study evaluated all women with singleton gestations who had a routine second-trimester ultrasound at 17-23 weeks gestational age from 2010 to 2013. When fibroids were noted, their presence, number, location and size were recorded. Exclusion criteria included a history of cervical conization or loop electrosurgical excision procedure (LEEP), uterine anomalies, maternal age greater than 40 years, and a previously placed cerclage. The primary variable of interest was short cervix (<25 mm). Secondary variables of interest included gestational age at delivery, mode of delivery, indication for cesarean, malpresentation, birth weight, and Apgar scores. A multivariable logistic regression analysis was performed. Fibroids were identified in 522/10 314 patients (5.1%). In the final multivariable logistic regression model, short cervix was increased in women with fibroids (OR 2.29, 95% CI: 1.40, 3.74). The number of fibroids did not affect the frequency of short cervix. Fibroids were significantly associated with preterm delivery (<37 weeks), primary cesarean, breech presentation, lower birth weight infants, and lower Apgar scores. Women with uterine fibroids may be at higher risk for a short cervix. Fibroids are also associated with several adverse obstetric and neonatal outcomes.

  15. Targeted metabolomic profiling in rat tissues reveals sex differences.

    PubMed

    Ruoppolo, Margherita; Caterino, Marianna; Albano, Lucia; Pecce, Rita; Di Girolamo, Maria Grazia; Crisci, Daniela; Costanzo, Michele; Milella, Luigi; Franconi, Flavia; Campesi, Ilaria

    2018-03-16

    Sex differences affect several diseases and are organ-and parameter-specific. In humans and animals, sex differences also influence the metabolism and homeostasis of amino acids and fatty acids, which are linked to the onset of diseases. Thus, the use of targeted metabolite profiles in tissues represents a powerful approach to examine the intermediary metabolism and evidence for any sex differences. To clarify the sex-specific activities of liver, heart and kidney tissues, we used targeted metabolomics, linear discriminant analysis (LDA), principal component analysis (PCA), cluster analysis and linear correlation models to evaluate sex and organ-specific differences in amino acids, free carnitine and acylcarnitine levels in male and female Sprague-Dawley rats. Several intra-sex differences affect tissues, indicating that metabolite profiles in rat hearts, livers and kidneys are organ-dependent. Amino acids and carnitine levels in rat hearts, livers and kidneys are affected by sex: male and female hearts show the greatest sexual dimorphism, both qualitatively and quantitatively. Finally, multivariate analysis confirmed the influence of sex on the metabolomics profiling. Our data demonstrate that the metabolomics approach together with a multivariate approach can capture the dynamics of physiological and pathological states, which are essential for explaining the basis of the sex differences observed in physiological and pathological conditions.

  16. Hepatotoxicity during Treatment for Tuberculosis in People Living with HIV/AIDS.

    PubMed

    Araújo-Mariz, Carolline; Lopes, Edmundo Pessoa; Acioli-Santos, Bartolomeu; Maruza, Magda; Montarroyos, Ulisses Ramos; Ximenes, Ricardo Arraes de Alencar; Lacerda, Heloísa Ramos; Miranda-Filho, Demócrito de Barros; Albuquerque, Maria de Fátima P Militão de

    2016-01-01

    Hepatotoxicity is frequently reported as an adverse reaction during the treatment of tuberculosis. The aim of this study was to determine the incidence of hepatotoxicity and to identify predictive factors for developing hepatotoxicity after people living with HIV/AIDS (PLWHA) start treatment for tuberculosis. This was a prospective cohort study with PLWHA who were monitored during the first 60 days of tuberculosis treatment in Pernambuco, Brazil. Hepatotoxicity was considered increased levels of aminotransferase, namely those that rose to three times higher than the level before initiating tuberculosis treatment, these levels being associated with symptoms of hepatitis. We conducted a multivariate logistic regression analysis and the magnitude of the associations was expressed by the odds ratio with a confidence interval of 95%. Hepatotoxicity was observed in 53 (30.6%) of the 173 patients who started tuberculosis treatment. The final multivariate logistic regression model demonstrated that the use of fluconazole, malnutrition and the subject being classified as a phenotypically slow acetylator increased the risk of hepatotoxicity significantly. The incidence of hepatotoxicity during treatment for tuberculosis in PLWHA was high. Those classified as phenotypically slow acetylators and as malnourished should be targeted for specific care to reduce the risk of hepatotoxicity during treatment for tuberculosis. The use of fluconazole should be avoided during tuberculosis treatment in PLWHA.

  17. Comparing Medical and Recreational Cannabis Users on Socio-Demographic, Substance and Medication Use, and Health and Disability Characteristics.

    PubMed

    Goulet-Stock, Sybil; Rueda, Sergio; Vafaei, Afshin; Ialomiteanu, Anca; Manthey, Jakob; Rehm, Jürgen; Fischer, Benedikt

    2017-01-01

    While recreational cannabis use is common, medical cannabis programs have proliferated across North America, including a federal program in Canada. Few comparisons of medical and recreational cannabis users (RCUs) exist; this study compared these groups on key characteristics. Data came from a community-recruited sample of formally approved medical cannabis users (MCUs; n = 53), and a sub-sample of recreational cannabis users (RCUs; n = 169) from a representative adult survey in Ontario (Canada). Samples were telephone-surveyed on identical measures, including select socio-demographic, substance and medication use, and health and disability measures. Based on initial bivariate comparisons, multivariate logistical regression with a progressive adjustment approach was performed to assess independent predictors of group status. In bivariate analyses, older age, lower household income, lower alcohol use, higher cocaine, prescription opioid, depression and anxiety medication use, and lower health and disability status were significantly associated with medical cannabis use. In the multivariate analysis, final model, household income, alcohol use, and disability levels were associated with medical cannabis use. Conclusions/Scientific Significance: Compared to RCUs, medical users appear to be mainly characterized by factors negatively influencing their overall health status. Future studies should investigate the actual impact and net benefits of medical cannabis use on these health problems. © 2017 S. Karger AG, Basel.

  18. Predicting stress urinary incontinence during pregnancy: combination of pelvic floor ultrasound parameters and clinical factors.

    PubMed

    Chen, Ling; Luo, Dan; Yu, Xiajuan; Jin, Mei; Cai, Wenzhi

    2018-05-12

    The aim of this study was to develop and validate a predictive tool that combining pelvic floor ultrasound parameters and clinical factors for stress urinary incontinence during pregnancy. A total of 535 women in first or second trimester were included for an interview and transperineal ultrasound assessment from two hospitals. Imaging data sets were analyzed offline to assess for bladder neck vertical position, urethra angles (α, β, and γ angles), hiatal area and bladder neck funneling. All significant continuous variables at univariable analysis were analyzed by receiver-operating characteristics. Three multivariable logistic models were built on clinical factor, and combined with ultrasound parameters. The final predictive model with best performance and fewest variables was selected to establish a nomogram. Internal and external validation of the nomogram were performed by both discrimination represented by C-index and calibration measured by Hosmer-Lemeshow test. A decision curve analysis was conducted to determine the clinical utility of the nomogram. After excluding 14 women with invalid data, 521 women were analyzed. β angle, γ angle and hiatal area had limited predictive value for stress urinary incontinence during pregnancy, with area under curves of 0.558-0.648. The final predictive model included body mass index gain since pregnancy, constipation, previous delivery mode, β angle at rest, and bladder neck funneling. The nomogram based on the final model showed good discrimination with a C-index of 0.789 and satisfactory calibration (P=0.828), both of which were supported by external validation. Decision curve analysis showed that the nomogram was clinical useful. The nomogram incorporating both the pelvic floor ultrasound parameters and clinical factors has been validated to show good discrimination and calibration, and could be an important tool for stress urinary incontinence risk prediction at an early stage of pregnancy. This article is protected by copyright. All rights reserved. This article is protected by copyright. All rights reserved.

  19. Prognostic factors in Acanthamoeba keratitis.

    PubMed

    Kaiserman, Igor; Bahar, Irit; McAllum, Penny; Srinivasan, Sathish; Elbaz, Uri; Slomovic, Allan R; Rootman, David S

    2012-06-01

    To assess the prognostic factors influencing visual prognosis and length of treatment after acanthamoeba keratitis (AK). Forty-two AK eyes of 41 patients treated between 1999 and 2006 were included. A diagnosis of AK was made on the basis of culture results with a corresponding clinical presentation. We calculated the prognostic effect of the various factors on final visual acuity and the length of treatment. Multivariate regression analysis was used to adjust for the simultaneous effects of the various prognostic factors. Mean follow-up was 19.7 ± 21.0 months. Sixty-four percent of cases had > 1 identified risk factor for AK, the most common risk factor being contact lens wear (92.9% of eyes). At presentation, median best spectacle corrected visual acuity (BCVA) was 20/200 (20/30 to Hand Motion [HM]) that improved after treatment to 20/50 (20/20 to Counting Fingers [CF]). Infection acquired by swimming or related to contact lenses had significantly better final BCVA (p = 0.03 and p = 0.007, respectively). Neuritis and pseudodendrites were also associated with better final BCVA (p = 0.04 and p = 0.05, respectively). Having had an epithelial defect on presentation and having been treated with topical steroid were associated with worse final best spectacle corrected visual acuity (BSCVA) (p = 0.0006 and p = 0.04). Multivariate regression analysis found a good initial visual acuity (p = 0.002), infections related to swimming (p = 0.01), the absence of an epithelial defect (p = 0.03), having been treated with chlorhexidine (p = 0.05), and not having receive steroids (p = 0.003) to significantly forecast a good final BCVA. We identified several prognostic factors that can help clinicians evaluate the expected visual damage of the AK infection and thus tailor treatment accordingly. Copyright © 2012 Canadian Ophthalmological Society. All rights reserved.

  20. Bayesian transformation cure frailty models with multivariate failure time data.

    PubMed

    Yin, Guosheng

    2008-12-10

    We propose a class of transformation cure frailty models to accommodate a survival fraction in multivariate failure time data. Established through a general power transformation, this family of cure frailty models includes the proportional hazards and the proportional odds modeling structures as two special cases. Within the Bayesian paradigm, we obtain the joint posterior distribution and the corresponding full conditional distributions of the model parameters for the implementation of Gibbs sampling. Model selection is based on the conditional predictive ordinate statistic and deviance information criterion. As an illustration, we apply the proposed method to a real data set from dentistry.

  1. Socioeconomic and Demographic Disparities in Knowledge of Reproductive Healthcare among Female University Students in Bangladesh

    PubMed Central

    Islam Mondal, Md. Nazrul; Nasir Ullah, Md. Monzur Morshad; Khan, Md. Nuruzzaman; Islam, Mohammad Zamirul; Islam, Md. Nurul; Moni, Sabiha Yasmin; Hoque, Md. Nazrul; Rahman, Md. Mashiur

    2015-01-01

    Background: Reproductive health (RH) is a critical component of women’s health and overall well-being around the world, especially in developing countries. We examine the factors that determine knowledge of RH care among female university students in Bangladesh. Methods: Data on 300 female students were collected from Rajshahi University, Bangladesh through a structured questionnaire using purposive sampling technique. The data were used for univariate analysis, to carry out the description of the variables; bivariate analysis was used to examine the associations between the variables; and finally, multivariate analysis (binary logistic regression model) was used to examine and fit the model and interpret the parameter estimates, especially in terms of odds ratios. Results: The results revealed that more than one-third (34.3%) respondents do not have sufficient knowledge of RH care. The χ2-test identified the significant (p < 0.05) associations between respondents’ knowledge of RH care with respondents’ age, education, family type, watching television; and knowledge about pregnancy, family planning, and contraceptive use. Finally, the binary logistic regression model identified respondents’ age, education, family type; and knowledge about family planning, and contraceptive use as the significant (p < 0.05) predictors of RH care. Conclusions and Global Health Implications: Knowledge of RH care among female university students was found unsatisfactory. Government and concerned organizations should promote and strengthen various health education programs to focus on RH care especially for the female university students in Bangladesh. PMID:27622005

  2. Multivariate exploration of non-intrusive load monitoring via spatiotemporal pattern network

    DOE PAGES

    Liu, Chao; Akintayo, Adedotun; Jiang, Zhanhong; ...

    2017-12-18

    Non-intrusive load monitoring (NILM) of electrical demand for the purpose of identifying load components has thus far mostly been studied using univariate data, e.g., using only whole building electricity consumption time series to identify a certain type of end-use such as lighting load. However, using additional variables in the form of multivariate time series data may provide more information in terms of extracting distinguishable features in the context of energy disaggregation. In this work, a novel probabilistic graphical modeling approach, namely the spatiotemporal pattern network (STPN) is proposed for energy disaggregation using multivariate time-series data. The STPN framework is shownmore » to be capable of handling diverse types of multivariate time-series to improve the energy disaggregation performance. The technique outperforms the state of the art factorial hidden Markov models (FHMM) and combinatorial optimization (CO) techniques in multiple real-life test cases. Furthermore, based on two homes' aggregate electric consumption data, a similarity metric is defined for the energy disaggregation of one home using a trained model based on the other home (i.e., out-of-sample case). The proposed similarity metric allows us to enhance scalability via learning supervised models for a few homes and deploying such models to many other similar but unmodeled homes with significantly high disaggregation accuracy.« less

  3. Multivariate exploration of non-intrusive load monitoring via spatiotemporal pattern network

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Liu, Chao; Akintayo, Adedotun; Jiang, Zhanhong

    Non-intrusive load monitoring (NILM) of electrical demand for the purpose of identifying load components has thus far mostly been studied using univariate data, e.g., using only whole building electricity consumption time series to identify a certain type of end-use such as lighting load. However, using additional variables in the form of multivariate time series data may provide more information in terms of extracting distinguishable features in the context of energy disaggregation. In this work, a novel probabilistic graphical modeling approach, namely the spatiotemporal pattern network (STPN) is proposed for energy disaggregation using multivariate time-series data. The STPN framework is shownmore » to be capable of handling diverse types of multivariate time-series to improve the energy disaggregation performance. The technique outperforms the state of the art factorial hidden Markov models (FHMM) and combinatorial optimization (CO) techniques in multiple real-life test cases. Furthermore, based on two homes' aggregate electric consumption data, a similarity metric is defined for the energy disaggregation of one home using a trained model based on the other home (i.e., out-of-sample case). The proposed similarity metric allows us to enhance scalability via learning supervised models for a few homes and deploying such models to many other similar but unmodeled homes with significantly high disaggregation accuracy.« less

  4. Multivariate dynamical modelling of structural change during development.

    PubMed

    Ziegler, Gabriel; Ridgway, Gerard R; Blakemore, Sarah-Jayne; Ashburner, John; Penny, Will

    2017-02-15

    Here we introduce a multivariate framework for characterising longitudinal changes in structural MRI using dynamical systems. The general approach enables modelling changes of states in multiple imaging biomarkers typically observed during brain development, plasticity, ageing and degeneration, e.g. regional gray matter volume of multiple regions of interest (ROIs). Structural brain states follow intrinsic dynamics according to a linear system with additional inputs accounting for potential driving forces of brain development. In particular, the inputs to the system are specified to account for known or latent developmental growth/decline factors, e.g. due to effects of growth hormones, puberty, or sudden behavioural changes etc. Because effects of developmental factors might be region-specific, the sensitivity of each ROI to contributions of each factor is explicitly modelled. In addition to the external effects of developmental factors on regional change, the framework enables modelling and inference about directed (potentially reciprocal) interactions between brain regions, due to competition for space, or structural connectivity, and suchlike. This approach accounts for repeated measures in typical MRI studies of development and aging. Model inversion and posterior distributions are obtained using earlier established variational methods enabling Bayesian evidence-based comparisons between various models of structural change. Using this approach we demonstrate dynamic cortical changes during brain maturation between 6 and 22 years of age using a large openly available longitudinal paediatric dataset with 637 scans from 289 individuals. In particular, we model volumetric changes in 26 bilateral ROIs, which cover large portions of cortical and subcortical gray matter. We account for (1) puberty-related effects on gray matter regions; (2) effects of an early transient growth process with additional time-lag parameter; (3) sexual dimorphism by modelling parameter differences between boys and girls. There is evidence that the regional pattern of sensitivity to dynamic hidden growth factors in late childhood is similar across genders and shows a consistent anterior-posterior gradient with strongest impact to prefrontal cortex (PFC) brain changes. Finally, we demonstrate the potential of the framework to explore the coupling of structural changes across a priori defined subnetworks using an example of previously established resting state functional connectivity. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.

  5. A sparse reconstruction method for the estimation of multi-resolution emission fields via atmospheric inversion

    DOE PAGES

    Ray, J.; Lee, J.; Yadav, V.; ...

    2015-04-29

    Atmospheric inversions are frequently used to estimate fluxes of atmospheric greenhouse gases (e.g., biospheric CO 2 flux fields) at Earth's surface. These inversions typically assume that flux departures from a prior model are spatially smoothly varying, which are then modeled using a multi-variate Gaussian. When the field being estimated is spatially rough, multi-variate Gaussian models are difficult to construct and a wavelet-based field model may be more suitable. Unfortunately, such models are very high dimensional and are most conveniently used when the estimation method can simultaneously perform data-driven model simplification (removal of model parameters that cannot be reliably estimated) andmore » fitting. Such sparse reconstruction methods are typically not used in atmospheric inversions. In this work, we devise a sparse reconstruction method, and illustrate it in an idealized atmospheric inversion problem for the estimation of fossil fuel CO 2 (ffCO 2) emissions in the lower 48 states of the USA. Our new method is based on stagewise orthogonal matching pursuit (StOMP), a method used to reconstruct compressively sensed images. Our adaptations bestow three properties to the sparse reconstruction procedure which are useful in atmospheric inversions. We have modified StOMP to incorporate prior information on the emission field being estimated and to enforce non-negativity on the estimated field. Finally, though based on wavelets, our method allows for the estimation of fields in non-rectangular geometries, e.g., emission fields inside geographical and political boundaries. Our idealized inversions use a recently developed multi-resolution (i.e., wavelet-based) random field model developed for ffCO 2 emissions and synthetic observations of ffCO 2 concentrations from a limited set of measurement sites. We find that our method for limiting the estimated field within an irregularly shaped region is about a factor of 10 faster than conventional approaches. It also reduces the overall computational cost by a factor of 2. Further, the sparse reconstruction scheme imposes non-negativity without introducing strong nonlinearities, such as those introduced by employing log-transformed fields, and thus reaps the benefits of simplicity and computational speed that are characteristic of linear inverse problems.« less

  6. A sparse reconstruction method for the estimation of multi-resolution emission fields via atmospheric inversion

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Ray, J.; Lee, J.; Yadav, V.

    Atmospheric inversions are frequently used to estimate fluxes of atmospheric greenhouse gases (e.g., biospheric CO 2 flux fields) at Earth's surface. These inversions typically assume that flux departures from a prior model are spatially smoothly varying, which are then modeled using a multi-variate Gaussian. When the field being estimated is spatially rough, multi-variate Gaussian models are difficult to construct and a wavelet-based field model may be more suitable. Unfortunately, such models are very high dimensional and are most conveniently used when the estimation method can simultaneously perform data-driven model simplification (removal of model parameters that cannot be reliably estimated) andmore » fitting. Such sparse reconstruction methods are typically not used in atmospheric inversions. In this work, we devise a sparse reconstruction method, and illustrate it in an idealized atmospheric inversion problem for the estimation of fossil fuel CO 2 (ffCO 2) emissions in the lower 48 states of the USA. Our new method is based on stagewise orthogonal matching pursuit (StOMP), a method used to reconstruct compressively sensed images. Our adaptations bestow three properties to the sparse reconstruction procedure which are useful in atmospheric inversions. We have modified StOMP to incorporate prior information on the emission field being estimated and to enforce non-negativity on the estimated field. Finally, though based on wavelets, our method allows for the estimation of fields in non-rectangular geometries, e.g., emission fields inside geographical and political boundaries. Our idealized inversions use a recently developed multi-resolution (i.e., wavelet-based) random field model developed for ffCO 2 emissions and synthetic observations of ffCO 2 concentrations from a limited set of measurement sites. We find that our method for limiting the estimated field within an irregularly shaped region is about a factor of 10 faster than conventional approaches. It also reduces the overall computational cost by a factor of 2. Further, the sparse reconstruction scheme imposes non-negativity without introducing strong nonlinearities, such as those introduced by employing log-transformed fields, and thus reaps the benefits of simplicity and computational speed that are characteristic of linear inverse problems.« less

  7. SU-F-R-51: Radiomics in CT Perfusion Maps of Head and Neck Cancer

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Nesteruk, M; Riesterer, O; Veit-Haibach, P

    2016-06-15

    Purpose: The aim of this study was to test the predictive value of radiomics features of CT perfusion (CTP) for tumor control, based on a preselection of radiomics features in a robustness study. Methods: 11 patients with head and neck cancer (HNC) and 11 patients with lung cancer were included in the robustness study to preselect stable radiomics parameters. Data from 36 HNC patients treated with definitive radiochemotherapy (median follow-up 30 months) was used to build a predictive model based on these parameters. All patients underwent pre-treatment CTP. 315 texture parameters were computed for three perfusion maps: blood volume, bloodmore » flow and mean transit time. The variability of texture parameters was tested with respect to non-standardizable perfusion computation factors (noise level and artery contouring) using intraclass correlation coefficients (ICC). The parameter with the highest ICC in the correlated group of parameters (inter-parameter Spearman correlations) was tested for its predictive value. The final model to predict tumor control was built using multivariate Cox regression analysis with backward selection of the variables. For comparison, a predictive model based on tumor volume was created. Results: Ten parameters were found to be stable in both HNC and lung cancer regarding potentially non-standardizable factors after the correction for inter-parameter correlations. In the multivariate backward selection of the variables, blood flow entropy showed a highly significant impact on tumor control (p=0.03) with concordance index (CI) of 0.76. Blood flow entropy was significantly lower in the patient group with controlled tumors at 18 months (p<0.1). The new model showed a higher concordance index compared to the tumor volume model (CI=0.68). Conclusion: The preselection of variables in the robustness study allowed building a predictive radiomics-based model of tumor control in HNC despite a small patient cohort. This model was found to be superior to the volume-based model. The project was supported by the KFSP Tumor Oxygenation of the University of Zurich, by a grant of the Center for Clinical Research, University and University Hospital Zurich and by a research grant from Merck (Schweiz) AG.« less

  8. A High-Dimensional, Multivariate Copula Approach to Modeling Multivariate Agricultural Price Relationships and Tail Dependencies

    Treesearch

    Xuan Chi; Barry Goodwin

    2012-01-01

    Spatial and temporal relationships among agricultural prices have been an important topic of applied research for many years. Such research is used to investigate the performance of markets and to examine linkages up and down the marketing chain. This research has empirically evaluated price linkages by using correlation and regression models and, later, linear and...

  9. Validation of cross-sectional time series and multivariate adaptive regression splines models for the prediction of energy expenditure in children and adolescents using doubly labeled water

    USDA-ARS?s Scientific Manuscript database

    Accurate, nonintrusive, and inexpensive techniques are needed to measure energy expenditure (EE) in free-living populations. Our primary aim in this study was to validate cross-sectional time series (CSTS) and multivariate adaptive regression splines (MARS) models based on observable participant cha...

  10. Identifying pleiotropic genes in genome-wide association studies from related subjects using the linear mixed model and Fisher combination function.

    PubMed

    Yang, James J; Williams, L Keoki; Buu, Anne

    2017-08-24

    A multivariate genome-wide association test is proposed for analyzing data on multivariate quantitative phenotypes collected from related subjects. The proposed method is a two-step approach. The first step models the association between the genotype and marginal phenotype using a linear mixed model. The second step uses the correlation between residuals of the linear mixed model to estimate the null distribution of the Fisher combination test statistic. The simulation results show that the proposed method controls the type I error rate and is more powerful than the marginal tests across different population structures (admixed or non-admixed) and relatedness (related or independent). The statistical analysis on the database of the Study of Addiction: Genetics and Environment (SAGE) demonstrates that applying the multivariate association test may facilitate identification of the pleiotropic genes contributing to the risk for alcohol dependence commonly expressed by four correlated phenotypes. This study proposes a multivariate method for identifying pleiotropic genes while adjusting for cryptic relatedness and population structure between subjects. The two-step approach is not only powerful but also computationally efficient even when the number of subjects and the number of phenotypes are both very large.

  11. Copula-based prediction of economic movements

    NASA Astrophysics Data System (ADS)

    García, J. E.; González-López, V. A.; Hirsh, I. D.

    2016-06-01

    In this paper we model the discretized returns of two paired time series BM&FBOVESPA Dividend Index and BM&FBOVESPA Public Utilities Index using multivariate Markov models. The discretization corresponds to three categories, high losses, high profits and the complementary periods of the series. In technical terms, the maximal memory that can be considered for a Markov model, can be derived from the size of the alphabet and dataset. The number of parameters needed to specify a discrete multivariate Markov chain grows exponentially with the order and dimension of the chain. In this case the size of the database is not large enough for a consistent estimation of the model. We apply a strategy to estimate a multivariate process with an order greater than the order achieved using standard procedures. The new strategy consist on obtaining a partition of the state space which is constructed from a combination, of the partitions corresponding to the two marginal processes and the partition corresponding to the multivariate Markov chain. In order to estimate the transition probabilities, all the partitions are linked using a copula. In our application this strategy provides a significant improvement in the movement predictions.

  12. Cross-country transferability of multi-variable damage models

    NASA Astrophysics Data System (ADS)

    Wagenaar, Dennis; Lüdtke, Stefan; Kreibich, Heidi; Bouwer, Laurens

    2017-04-01

    Flood damage assessment is often done with simple damage curves based only on flood water depth. Additionally, damage models are often transferred in space and time, e.g. from region to region or from one flood event to another. Validation has shown that depth-damage curve estimates are associated with high uncertainties, particularly when applied in regions outside the area where the data for curve development was collected. Recently, progress has been made with multi-variable damage models created with data-mining techniques, i.e. Bayesian Networks and random forest. However, it is still unknown to what extent and under which conditions model transfers are possible and reliable. Model validations in different countries will provide valuable insights into the transferability of multi-variable damage models. In this study we compare multi-variable models developed on basis of flood damage datasets from Germany as well as from The Netherlands. Data from several German floods was collected using computer aided telephone interviews. Data from the 1993 Meuse flood in the Netherlands is available, based on compensations paid by the government. The Bayesian network and random forest based models are applied and validated in both countries on basis of the individual datasets. A major challenge was the harmonization of the variables between both datasets due to factors like differences in variable definitions, and regional and temporal differences in flood hazard and exposure characteristics. Results of model validations and comparisons in both countries are discussed, particularly in respect to encountered challenges and possible solutions for an improvement of model transferability.

  13. Multivariate Prediction Equations for HbA1c Lowering, Weight Change, and Hypoglycemic Events Associated with Insulin Rescue Medication in Type 2 Diabetes Mellitus: Informing Economic Modeling.

    PubMed

    Willis, Michael; Asseburg, Christian; Nilsson, Andreas; Johnsson, Kristina; Kartman, Bernt

    2017-03-01

    Type 2 diabetes mellitus (T2DM) is chronic and progressive and the cost-effectiveness of new treatment interventions must be established over long time horizons. Given the limited durability of drugs, assumptions regarding downstream rescue medication can drive results. Especially for insulin, for which treatment effects and adverse events are known to depend on patient characteristics, this can be problematic for health economic evaluation involving modeling. To estimate parsimonious multivariate equations of treatment effects and hypoglycemic event risks for use in parameterizing insulin rescue therapy in model-based cost-effectiveness analysis. Clinical evidence for insulin use in T2DM was identified in PubMed and from published reviews and meta-analyses. Study and patient characteristics and treatment effects and adverse event rates were extracted and the data used to estimate parsimonious treatment effect and hypoglycemic event risk equations using multivariate regression analysis. Data from 91 studies featuring 171 usable study arms were identified, mostly for premix and basal insulin types. Multivariate prediction equations for glycated hemoglobin A 1c lowering and weight change were estimated separately for insulin-naive and insulin-experienced patients. Goodness of fit (R 2 ) for both outcomes were generally good, ranging from 0.44 to 0.84. Multivariate prediction equations for symptomatic, nocturnal, and severe hypoglycemic events were also estimated, though considerable heterogeneity in definitions limits their usefulness. Parsimonious and robust multivariate prediction equations were estimated for glycated hemoglobin A 1c and weight change, separately for insulin-naive and insulin-experienced patients. Using these in economic simulation modeling in T2DM can improve realism and flexibility in modeling insulin rescue medication. Copyright © 2017 International Society for Pharmacoeconomics and Outcomes Research (ISPOR). Published by Elsevier Inc. All rights reserved.

  14. Prediction versus aetiology: common pitfalls and how to avoid them.

    PubMed

    van Diepen, Merel; Ramspek, Chava L; Jager, Kitty J; Zoccali, Carmine; Dekker, Friedo W

    2017-04-01

    Prediction research is a distinct field of epidemiologic research, which should be clearly separated from aetiological research. Both prediction and aetiology make use of multivariable modelling, but the underlying research aim and interpretation of results are very different. Aetiology aims at uncovering the causal effect of a specific risk factor on an outcome, adjusting for confounding factors that are selected based on pre-existing knowledge of causal relations. In contrast, prediction aims at accurately predicting the risk of an outcome using multiple predictors collectively, where the final prediction model is usually based on statistically significant, but not necessarily causal, associations in the data at hand.In both scientific and clinical practice, however, the two are often confused, resulting in poor-quality publications with limited interpretability and applicability. A major problem is the frequently encountered aetiological interpretation of prediction results, where individual variables in a prediction model are attributed causal meaning. This article stresses the differences in use and interpretation of aetiological and prediction studies, and gives examples of common pitfalls. © The Author 2017. Published by Oxford University Press on behalf of ERA-EDTA. All rights reserved.

  15. Combination of multivariate curve resolution and multivariate classification techniques for comprehensive high-performance liquid chromatography-diode array absorbance detection fingerprints analysis of Salvia reuterana extracts.

    PubMed

    Hakimzadeh, Neda; Parastar, Hadi; Fattahi, Mohammad

    2014-01-24

    In this study, multivariate curve resolution (MCR) and multivariate classification methods are proposed to develop a new chemometric strategy for comprehensive analysis of high-performance liquid chromatography-diode array absorbance detection (HPLC-DAD) fingerprints of sixty Salvia reuterana samples from five different geographical regions. Different chromatographic problems occurred during HPLC-DAD analysis of S. reuterana samples, such as baseline/background contribution and noise, low signal-to-noise ratio (S/N), asymmetric peaks, elution time shifts, and peak overlap are handled using the proposed strategy. In this way, chromatographic fingerprints of sixty samples are properly segmented to ten common chromatographic regions using local rank analysis and then, the corresponding segments are column-wise augmented for subsequent MCR analysis. Extended multivariate curve resolution-alternating least squares (MCR-ALS) is used to obtain pure component profiles in each segment. In general, thirty-one chemical components were resolved using MCR-ALS in sixty S. reuterana samples and the lack of fit (LOF) values of MCR-ALS models were below 10.0% in all cases. Pure spectral profiles are considered for identification of chemical components by comparing their resolved spectra with the standard ones and twenty-four components out of thirty-one components were identified. Additionally, pure elution profiles are used to obtain relative concentrations of chemical components in different samples for multivariate classification analysis by principal component analysis (PCA) and k-nearest neighbors (kNN). Inspection of the PCA score plot (explaining 76.1% of variance accounted for three PCs) showed that S. reuterana samples belong to four clusters. The degree of class separation (DCS) which quantifies the distance separating clusters in relation to the scatter within each cluster is calculated for four clusters and it was in the range of 1.6-5.8. These results are then confirmed by kNN. In addition, according to the PCA loading plot and kNN dendrogram of thirty-one variables, five chemical constituents of luteolin-7-o-glucoside, salvianolic acid D, rosmarinic acid, lithospermic acid and trijuganone A are identified as the most important variables (i.e., chemical markers) for clusters discrimination. Finally, the effect of different chemical markers on samples differentiation is investigated using counter-propagation artificial neural network (CP-ANN) method. It is concluded that the proposed strategy can be successfully applied for comprehensive analysis of chromatographic fingerprints of complex natural samples. Copyright © 2013 Elsevier B.V. All rights reserved.

  16. A multivariate mixed model system for wood specific gravity and moisture content of planted loblolly pine stands in the southern United States

    Treesearch

    Finto Antony; Laurence R. Schimleck; Alex Clark; Richard F. Daniels

    2012-01-01

    Specific gravity (SG) and moisture content (MC) both have a strong influence on the quantity and quality of wood fiber. We proposed a multivariate mixed model system to model the two properties simultaneously. Disk SG and MC at different height levels were measured from 3 trees in 135 stands across the natural range of loblolly pine and the stand level values were used...

  17. Multivariate curve resolution-alternating least squares and kinetic modeling applied to near-infrared data from curing reactions of epoxy resins: mechanistic approach and estimation of kinetic rate constants.

    PubMed

    Garrido, M; Larrechi, M S; Rius, F X

    2006-02-01

    This study describes the combination of multivariate curve resolution-alternating least squares with a kinetic modeling strategy for obtaining the kinetic rate constants of a curing reaction of epoxy resins. The reaction between phenyl glycidyl ether and aniline is monitored by near-infrared spectroscopy under isothermal conditions for several initial molar ratios of the reagents. The data for all experiments, arranged in a column-wise augmented data matrix, are analyzed using multivariate curve resolution-alternating least squares. The concentration profiles recovered are fitted to a chemical model proposed for the reaction. The selection of the kinetic model is assisted by the information contained in the recovered concentration profiles. The nonlinear fitting provides the kinetic rate constants. The optimized rate constants are in agreement with values reported in the literature.

  18. Tropical Pacific moisture variability: Its detection, synoptic structure and consequences in the general circulation

    NASA Technical Reports Server (NTRS)

    Mcguirk, James P.

    1990-01-01

    Satellite data analysis tools are developed and implemented for the diagnosis of atmospheric circulation systems over the tropical Pacific Ocean. The tools include statistical multi-variate procedures, a multi-spectral radiative transfer model, and the global spectral forecast model at NMC. Data include in-situ observations; satellite observations from VAS (moisture, infrared and visible) NOAA polar orbiters (including Tiros Operational Satellite System (TOVS) multi-channel sounding data and OLR grids) and scanning multichannel microwave radiometer (SMMR); and European Centre for Medium Weather Forecasts (ECHMWF) analyses. A primary goal is a better understanding of the relation between synoptic structures of the area, particularly tropical plumes, and the general circulation, especially the Hadley circulation. A second goal is the definition of the quantitative structure and behavior of all Pacific tropical synoptic systems. Finally, strategies are examined for extracting new and additional information from existing satellite observations. Although moisture structure is emphasized, thermal patterns are also analyzed. Both horizontal and vertical structures are studied and objective quantitative results are emphasized.

  19. Reliability Stress-Strength Models for Dependent Observations with Applications in Clinical Trials

    NASA Technical Reports Server (NTRS)

    Kushary, Debashis; Kulkarni, Pandurang M.

    1995-01-01

    We consider the applications of stress-strength models in studies involving clinical trials. When studying the effects and side effects of certain procedures (treatments), it is often the case that observations are correlated due to subject effect, repeated measurements and observing many characteristics simultaneously. We develop maximum likelihood estimator (MLE) and uniform minimum variance unbiased estimator (UMVUE) of the reliability which in clinical trial studies could be considered as the chances of increased side effects due to a particular procedure compared to another. The results developed apply to both univariate and multivariate situations. Also, for the univariate situations we develop simple to use lower confidence bounds for the reliability. Further, we consider the cases when both stress and strength constitute time dependent processes. We define the future reliability and obtain methods of constructing lower confidence bounds for this reliability. Finally, we conduct simulation studies to evaluate all the procedures developed and also to compare the MLE and the UMVUE.

  20. Modification of Gaussian mixture models for data classification in high energy physics

    NASA Astrophysics Data System (ADS)

    Štěpánek, Michal; Franc, Jiří; Kůs, Václav

    2015-01-01

    In high energy physics, we deal with demanding task of signal separation from background. The Model Based Clustering method involves the estimation of distribution mixture parameters via the Expectation-Maximization algorithm in the training phase and application of Bayes' rule in the testing phase. Modifications of the algorithm such as weighting, missing data processing, and overtraining avoidance will be discussed. Due to the strong dependence of the algorithm on initialization, genetic optimization techniques such as mutation, elitism, parasitism, and the rank selection of individuals will be mentioned. Data pre-processing plays a significant role for the subsequent combination of final discriminants in order to improve signal separation efficiency. Moreover, the results of the top quark separation from the Tevatron collider will be compared with those of standard multivariate techniques in high energy physics. Results from this study has been used in the measurement of the inclusive top pair production cross section employing DØ Tevatron full Runll data (9.7 fb-1).

Top