Sample records for multivariate state estimation

  1. Higher-order Multivariable Polynomial Regression to Estimate Human Affective States

    NASA Astrophysics Data System (ADS)

    Wei, Jie; Chen, Tong; Liu, Guangyuan; Yang, Jiemin

    2016-03-01

    From direct observations, facial, vocal, gestural, physiological, and central nervous signals, estimating human affective states through computational models such as multivariate linear-regression analysis, support vector regression, and artificial neural network, have been proposed in the past decade. In these models, linear models are generally lack of precision because of ignoring intrinsic nonlinearities of complex psychophysiological processes; and nonlinear models commonly adopt complicated algorithms. To improve accuracy and simplify model, we introduce a new computational modeling method named as higher-order multivariable polynomial regression to estimate human affective states. The study employs standardized pictures in the International Affective Picture System to induce thirty subjects’ affective states, and obtains pure affective patterns of skin conductance as input variables to the higher-order multivariable polynomial model for predicting affective valence and arousal. Experimental results show that our method is able to obtain efficient correlation coefficients of 0.98 and 0.96 for estimation of affective valence and arousal, respectively. Moreover, the method may provide certain indirect evidences that valence and arousal have their brain’s motivational circuit origins. Thus, the proposed method can serve as a novel one for efficiently estimating human affective states.

  2. Higher-order Multivariable Polynomial Regression to Estimate Human Affective States

    PubMed Central

    Wei, Jie; Chen, Tong; Liu, Guangyuan; Yang, Jiemin

    2016-01-01

    From direct observations, facial, vocal, gestural, physiological, and central nervous signals, estimating human affective states through computational models such as multivariate linear-regression analysis, support vector regression, and artificial neural network, have been proposed in the past decade. In these models, linear models are generally lack of precision because of ignoring intrinsic nonlinearities of complex psychophysiological processes; and nonlinear models commonly adopt complicated algorithms. To improve accuracy and simplify model, we introduce a new computational modeling method named as higher-order multivariable polynomial regression to estimate human affective states. The study employs standardized pictures in the International Affective Picture System to induce thirty subjects’ affective states, and obtains pure affective patterns of skin conductance as input variables to the higher-order multivariable polynomial model for predicting affective valence and arousal. Experimental results show that our method is able to obtain efficient correlation coefficients of 0.98 and 0.96 for estimation of affective valence and arousal, respectively. Moreover, the method may provide certain indirect evidences that valence and arousal have their brain’s motivational circuit origins. Thus, the proposed method can serve as a novel one for efficiently estimating human affective states. PMID:26996254

  3. Model transformations for state-space self-tuning control of multivariable stochastic systems

    NASA Technical Reports Server (NTRS)

    Shieh, Leang S.; Bao, Yuan L.; Coleman, Norman P.

    1988-01-01

    The design of self-tuning controllers for multivariable stochastic systems is considered analytically. A long-division technique for finding the similarity transformation matrix and transforming the estimated left MFD to the right MFD is developed; the derivation is given in detail, and the procedures involved are briefly characterized.

  4. Multivariate Error Covariance Estimates by Monte-Carlo Simulation for Assimilation Studies in the Pacific Ocean

    NASA Technical Reports Server (NTRS)

    Borovikov, Anna; Rienecker, Michele M.; Keppenne, Christian; Johnson, Gregory C.

    2004-01-01

    One of the most difficult aspects of ocean state estimation is the prescription of the model forecast error covariances. The paucity of ocean observations limits our ability to estimate the covariance structures from model-observation differences. In most practical applications, simple covariances are usually prescribed. Rarely are cross-covariances between different model variables used. Here a comparison is made between a univariate Optimal Interpolation (UOI) scheme and a multivariate OI algorithm (MvOI) in the assimilation of ocean temperature. In the UOI case only temperature is updated using a Gaussian covariance function and in the MvOI salinity, zonal and meridional velocities as well as temperature, are updated using an empirically estimated multivariate covariance matrix. Earlier studies have shown that a univariate OI has a detrimental effect on the salinity and velocity fields of the model. Apparently, in a sequential framework it is important to analyze temperature and salinity together. For the MvOI an estimation of the model error statistics is made by Monte-Carlo techniques from an ensemble of model integrations. An important advantage of using an ensemble of ocean states is that it provides a natural way to estimate cross-covariances between the fields of different physical variables constituting the model state vector, at the same time incorporating the model's dynamical and thermodynamical constraints as well as the effects of physical boundaries. Only temperature observations from the Tropical Atmosphere-Ocean array have been assimilated in this study. In order to investigate the efficacy of the multivariate scheme two data assimilation experiments are validated with a large independent set of recently published subsurface observations of salinity, zonal velocity and temperature. For reference, a third control run with no data assimilation is used to check how the data assimilation affects systematic model errors. While the performance of the UOI and MvOI is similar with respect to the temperature field, the salinity and velocity fields are greatly improved when multivariate correction is used, as evident from the analyses of the rms differences of these fields and independent observations. The MvOI assimilation is found to improve upon the control run in generating the water masses with properties close to the observed, while the UOI failed to maintain the temperature and salinity structure.

  5. Multivariate Time Series Decomposition into Oscillation Components.

    PubMed

    Matsuda, Takeru; Komaki, Fumiyasu

    2017-08-01

    Many time series are considered to be a superposition of several oscillation components. We have proposed a method for decomposing univariate time series into oscillation components and estimating their phases (Matsuda & Komaki, 2017 ). In this study, we extend that method to multivariate time series. We assume that several oscillators underlie the given multivariate time series and that each variable corresponds to a superposition of the projections of the oscillators. Thus, the oscillators superpose on each variable with amplitude and phase modulation. Based on this idea, we develop gaussian linear state-space models and use them to decompose the given multivariate time series. The model parameters are estimated from data using the empirical Bayes method, and the number of oscillators is determined using the Akaike information criterion. Therefore, the proposed method extracts underlying oscillators in a data-driven manner and enables investigation of phase dynamics in a given multivariate time series. Numerical results show the effectiveness of the proposed method. From monthly mean north-south sunspot number data, the proposed method reveals an interesting phase relationship.

  6. Copula-based analysis of rhythm

    NASA Astrophysics Data System (ADS)

    García, J. E.; González-López, V. A.; Viola, M. L. Lanfredi

    2016-06-01

    In this paper we establish stochastic profiles of the rhythm for three languages: English, Japanese and Spanish. We model the increase or decrease of the acoustical energy, collected into three bands coming from the acoustic signal. The number of parameters needed to specify a discrete multivariate Markov chain grows exponentially with the order and dimension of the chain. In this case the size of the database is not large enough for a consistent estimation of the model. We apply a strategy to estimate a multivariate process with an order greater than the order achieved using standard procedures. The new strategy consist on obtaining a partition of the state space which is constructed from a combination of the partitions corresponding to the three marginal processes, one for each band of energy, and the partition coming from to the multivariate Markov chain. Then, all the partitions are linked using a copula, in order to estimate the transition probabilities.

  7. Novel health monitoring method using an RGB camera.

    PubMed

    Hassan, M A; Malik, A S; Fofi, D; Saad, N; Meriaudeau, F

    2017-11-01

    In this paper we present a novel health monitoring method by estimating the heart rate and respiratory rate using an RGB camera. The heart rate and the respiratory rate are estimated from the photoplethysmography (PPG) and the respiratory motion. The method mainly operates by using the green spectrum of the RGB camera to generate a multivariate PPG signal to perform multivariate de-noising on the video signal to extract the resultant PPG signal. A periodicity based voting scheme (PVS) was used to measure the heart rate and respiratory rate from the estimated PPG signal. We evaluated our proposed method with a state of the art heart rate measuring method for two scenarios using the MAHNOB-HCI database and a self collected naturalistic environment database. The methods were furthermore evaluated for various scenarios at naturalistic environments such as a motion variance session and a skin tone variance session. Our proposed method operated robustly during the experiments and outperformed the state of the art heart rate measuring methods by compensating the effects of the naturalistic environment.

  8. Robustness enhancement of neurocontroller and state estimator

    NASA Technical Reports Server (NTRS)

    Troudet, Terry

    1993-01-01

    The feasibility of enhancing neurocontrol robustness, through training of the neurocontroller and state estimator in the presence of system uncertainties, is investigated on the example of a multivariable aircraft control problem. The performance and robustness of the newly trained neurocontroller are compared to those for an existing neurocontrol design scheme. The newly designed dynamic neurocontroller exhibits a better trade-off between phase and gain stability margins, and it is significantly more robust to degradations of the plant dynamics.

  9. Copula-based prediction of economic movements

    NASA Astrophysics Data System (ADS)

    García, J. E.; González-López, V. A.; Hirsh, I. D.

    2016-06-01

    In this paper we model the discretized returns of two paired time series BM&FBOVESPA Dividend Index and BM&FBOVESPA Public Utilities Index using multivariate Markov models. The discretization corresponds to three categories, high losses, high profits and the complementary periods of the series. In technical terms, the maximal memory that can be considered for a Markov model, can be derived from the size of the alphabet and dataset. The number of parameters needed to specify a discrete multivariate Markov chain grows exponentially with the order and dimension of the chain. In this case the size of the database is not large enough for a consistent estimation of the model. We apply a strategy to estimate a multivariate process with an order greater than the order achieved using standard procedures. The new strategy consist on obtaining a partition of the state space which is constructed from a combination, of the partitions corresponding to the two marginal processes and the partition corresponding to the multivariate Markov chain. In order to estimate the transition probabilities, all the partitions are linked using a copula. In our application this strategy provides a significant improvement in the movement predictions.

  10. State-space self-tuner for on-line adaptive control

    NASA Technical Reports Server (NTRS)

    Shieh, L. S.

    1994-01-01

    Dynamic systems, such as flight vehicles, satellites and space stations, operating in real environments, constantly face parameter and/or structural variations owing to nonlinear behavior of actuators, failure of sensors, changes in operating conditions, disturbances acting on the system, etc. In the past three decades, adaptive control has been shown to be effective in dealing with dynamic systems in the presence of parameter uncertainties, structural perturbations, random disturbances and environmental variations. Among the existing adaptive control methodologies, the state-space self-tuning control methods, initially proposed by us, are shown to be effective in designing advanced adaptive controllers for multivariable systems. In our approaches, we have embedded the standard Kalman state-estimation algorithm into an online parameter estimation algorithm. Thus, the advanced state-feedback controllers can be easily established for digital adaptive control of continuous-time stochastic multivariable systems. A state-space self-tuner for a general multivariable stochastic system has been developed and successfully applied to the space station for on-line adaptive control. Also, a technique for multistage design of an optimal momentum management controller for the space station has been developed and reported in. Moreover, we have successfully developed various digital redesign techniques which can convert a continuous-time controller to an equivalent digital controller. As a result, the expensive and unreliable continuous-time controller can be implemented using low-cost and high performance microprocessors. Recently, we have developed a new hybrid state-space self tuner using a new dual-rate sampling scheme for on-line adaptive control of continuous-time uncertain systems.

  11. Data driven discrete-time parsimonious identification of a nonlinear state-space model for a weakly nonlinear system with short data record

    NASA Astrophysics Data System (ADS)

    Relan, Rishi; Tiels, Koen; Marconato, Anna; Dreesen, Philippe; Schoukens, Johan

    2018-05-01

    Many real world systems exhibit a quasi linear or weakly nonlinear behavior during normal operation, and a hard saturation effect for high peaks of the input signal. In this paper, a methodology to identify a parsimonious discrete-time nonlinear state space model (NLSS) for the nonlinear dynamical system with relatively short data record is proposed. The capability of the NLSS model structure is demonstrated by introducing two different initialisation schemes, one of them using multivariate polynomials. In addition, a method using first-order information of the multivariate polynomials and tensor decomposition is employed to obtain the parsimonious decoupled representation of the set of multivariate real polynomials estimated during the identification of NLSS model. Finally, the experimental verification of the model structure is done on the cascaded water-benchmark identification problem.

  12. Alternative method to validate the seasonal land cover regions of the conterminous United States

    Treesearch

    Zhiliang Zhu; Donald O. Ohlen; Raymond L. Czaplewski; Robert E. Burgan

    1996-01-01

    An accuracy assessment method involving double sampling and the multivariate composite estimator has been used to validate the prototype seasonal land cover characteristics database of the conterminous United States. The database consists of 159 land cover classes, classified using time series of 1990 1-km satellite data and augmented with ancillary data including...

  13. SPReM: Sparse Projection Regression Model For High-dimensional Linear Regression *

    PubMed Central

    Sun, Qiang; Zhu, Hongtu; Liu, Yufeng; Ibrahim, Joseph G.

    2014-01-01

    The aim of this paper is to develop a sparse projection regression modeling (SPReM) framework to perform multivariate regression modeling with a large number of responses and a multivariate covariate of interest. We propose two novel heritability ratios to simultaneously perform dimension reduction, response selection, estimation, and testing, while explicitly accounting for correlations among multivariate responses. Our SPReM is devised to specifically address the low statistical power issue of many standard statistical approaches, such as the Hotelling’s T2 test statistic or a mass univariate analysis, for high-dimensional data. We formulate the estimation problem of SPREM as a novel sparse unit rank projection (SURP) problem and propose a fast optimization algorithm for SURP. Furthermore, we extend SURP to the sparse multi-rank projection (SMURP) by adopting a sequential SURP approximation. Theoretically, we have systematically investigated the convergence properties of SURP and the convergence rate of SURP estimates. Our simulation results and real data analysis have shown that SPReM out-performs other state-of-the-art methods. PMID:26527844

  14. Exploring connectivity with large-scale Granger causality on resting-state functional MRI.

    PubMed

    DSouza, Adora M; Abidin, Anas Z; Leistritz, Lutz; Wismüller, Axel

    2017-08-01

    Large-scale Granger causality (lsGC) is a recently developed, resting-state functional MRI (fMRI) connectivity analysis approach that estimates multivariate voxel-resolution connectivity. Unlike most commonly used multivariate approaches, which establish coarse-resolution connectivity by aggregating voxel time-series avoiding an underdetermined problem, lsGC estimates voxel-resolution, fine-grained connectivity by incorporating an embedded dimension reduction. We investigate application of lsGC on realistic fMRI simulations, modeling smoothing of neuronal activity by the hemodynamic response function and repetition time (TR), and empirical resting-state fMRI data. Subsequently, functional subnetworks are extracted from lsGC connectivity measures for both datasets and validated quantitatively. We also provide guidelines to select lsGC free parameters. Results indicate that lsGC reliably recovers underlying network structure with area under receiver operator characteristic curve (AUC) of 0.93 at TR=1.5s for a 10-min session of fMRI simulations. Furthermore, subnetworks of closely interacting modules are recovered from the aforementioned lsGC networks. Results on empirical resting-state fMRI data demonstrate recovery of visual and motor cortex in close agreement with spatial maps obtained from (i) visuo-motor fMRI stimulation task-sequence (Accuracy=0.76) and (ii) independent component analysis (ICA) of resting-state fMRI (Accuracy=0.86). Compared with conventional Granger causality approach (AUC=0.75), lsGC produces better network recovery on fMRI simulations. Furthermore, it cannot recover functional subnetworks from empirical fMRI data, since quantifying voxel-resolution connectivity is not possible as consequence of encountering an underdetermined problem. Functional network recovery from fMRI data suggests that lsGC gives useful insight into connectivity patterns from resting-state fMRI at a multivariate voxel-resolution. Copyright © 2017 Elsevier B.V. All rights reserved.

  15. Comparison of connectivity analyses for resting state EEG data

    NASA Astrophysics Data System (ADS)

    Olejarczyk, Elzbieta; Marzetti, Laura; Pizzella, Vittorio; Zappasodi, Filippo

    2017-06-01

    Objective. In the present work, a nonlinear measure (transfer entropy, TE) was used in a multivariate approach for the analysis of effective connectivity in high density resting state EEG data in eyes open and eyes closed. Advantages of the multivariate approach in comparison to the bivariate one were tested. Moreover, the multivariate TE was compared to an effective linear measure, i.e. directed transfer function (DTF). Finally, the existence of a relationship between the information transfer and the level of brain synchronization as measured by phase synchronization value (PLV) was investigated. Approach. The comparison between the connectivity measures, i.e. bivariate versus multivariate TE, TE versus DTF, TE versus PLV, was performed by means of statistical analysis of indexes based on graph theory. Main results. The multivariate approach is less sensitive to false indirect connections with respect to the bivariate estimates. The multivariate TE differentiated better between eyes closed and eyes open conditions compared to DTF. Moreover, the multivariate TE evidenced non-linear phenomena in information transfer, which are not evidenced by the use of DTF. We also showed that the target of information flow, in particular the frontal region, is an area of greater brain synchronization. Significance. Comparison of different connectivity analysis methods pointed to the advantages of nonlinear methods, and indicated a relationship existing between the flow of information and the level of synchronization of the brain.

  16. A system to build distributed multivariate models and manage disparate data sharing policies: implementation in the scalable national network for effectiveness research.

    PubMed

    Meeker, Daniella; Jiang, Xiaoqian; Matheny, Michael E; Farcas, Claudiu; D'Arcy, Michel; Pearlman, Laura; Nookala, Lavanya; Day, Michele E; Kim, Katherine K; Kim, Hyeoneui; Boxwala, Aziz; El-Kareh, Robert; Kuo, Grace M; Resnic, Frederic S; Kesselman, Carl; Ohno-Machado, Lucila

    2015-11-01

    Centralized and federated models for sharing data in research networks currently exist. To build multivariate data analysis for centralized networks, transfer of patient-level data to a central computation resource is necessary. The authors implemented distributed multivariate models for federated networks in which patient-level data is kept at each site and data exchange policies are managed in a study-centric manner. The objective was to implement infrastructure that supports the functionality of some existing research networks (e.g., cohort discovery, workflow management, and estimation of multivariate analytic models on centralized data) while adding additional important new features, such as algorithms for distributed iterative multivariate models, a graphical interface for multivariate model specification, synchronous and asynchronous response to network queries, investigator-initiated studies, and study-based control of staff, protocols, and data sharing policies. Based on the requirements gathered from statisticians, administrators, and investigators from multiple institutions, the authors developed infrastructure and tools to support multisite comparative effectiveness studies using web services for multivariate statistical estimation in the SCANNER federated network. The authors implemented massively parallel (map-reduce) computation methods and a new policy management system to enable each study initiated by network participants to define the ways in which data may be processed, managed, queried, and shared. The authors illustrated the use of these systems among institutions with highly different policies and operating under different state laws. Federated research networks need not limit distributed query functionality to count queries, cohort discovery, or independently estimated analytic models. Multivariate analyses can be efficiently and securely conducted without patient-level data transport, allowing institutions with strict local data storage requirements to participate in sophisticated analyses based on federated research networks. © The Author 2015. Published by Oxford University Press on behalf of the American Medical Informatics Association.

  17. A methodology for designing robust multivariable nonlinear control systems. Ph.D. Thesis

    NASA Technical Reports Server (NTRS)

    Grunberg, D. B.

    1986-01-01

    A new methodology is described for the design of nonlinear dynamic controllers for nonlinear multivariable systems providing guarantees of closed-loop stability, performance, and robustness. The methodology is an extension of the Linear-Quadratic-Gaussian with Loop-Transfer-Recovery (LQG/LTR) methodology for linear systems, thus hinging upon the idea of constructing an approximate inverse operator for the plant. A major feature of the methodology is a unification of both the state-space and input-output formulations. In addition, new results on stability theory, nonlinear state estimation, and optimal nonlinear regulator theory are presented, including the guaranteed global properties of the extended Kalman filter and optimal nonlinear regulators.

  18. Space-time variation of respiratory cancers in South Carolina: a flexible multivariate mixture modeling approach to risk estimation.

    PubMed

    Carroll, Rachel; Lawson, Andrew B; Kirby, Russell S; Faes, Christel; Aregay, Mehreteab; Watjou, Kevin

    2017-01-01

    Many types of cancer have an underlying spatiotemporal distribution. Spatiotemporal mixture modeling can offer a flexible approach to risk estimation via the inclusion of latent variables. In this article, we examine the application and benefits of using four different spatiotemporal mixture modeling methods in the modeling of cancer of the lung and bronchus as well as "other" respiratory cancer incidences in the state of South Carolina. Of the methods tested, no single method outperforms the other methods; which method is best depends on the cancer under consideration. The lung and bronchus cancer incidence outcome is best described by the univariate modeling formulation, whereas the "other" respiratory cancer incidence outcome is best described by the multivariate modeling formulation. Spatiotemporal multivariate mixture methods can aid in the modeling of cancers with small and sparse incidences when including information from a related, more common type of cancer. Copyright © 2016 Elsevier Inc. All rights reserved.

  19. Use of uninformative priors to initialize state estimation for dynamical systems

    NASA Astrophysics Data System (ADS)

    Worthy, Johnny L.; Holzinger, Marcus J.

    2017-10-01

    The admissible region must be expressed probabilistically in order to be used in Bayesian estimation schemes. When treated as a probability density function (PDF), a uniform admissible region can be shown to have non-uniform probability density after a transformation. An alternative approach can be used to express the admissible region probabilistically according to the Principle of Transformation Groups. This paper uses a fundamental multivariate probability transformation theorem to show that regardless of which state space an admissible region is expressed in, the probability density must remain the same under the Principle of Transformation Groups. The admissible region can be shown to be analogous to an uninformative prior with a probability density that remains constant under reparameterization. This paper introduces requirements on how these uninformative priors may be transformed and used for state estimation and the difference in results when initializing an estimation scheme via a traditional transformation versus the alternative approach.

  20. Stochastic spectral projection of electrochemical thermal model for lithium-ion cell state estimation

    NASA Astrophysics Data System (ADS)

    Tagade, Piyush; Hariharan, Krishnan S.; Kolake, Subramanya Mayya; Song, Taewon; Oh, Dukjin

    2017-03-01

    A novel approach for integrating a pseudo-two dimensional electrochemical thermal (P2D-ECT) model and data assimilation algorithm is presented for lithium-ion cell state estimation. This approach refrains from making any simplifications in the P2D-ECT model while making it amenable for online state estimation. Though deterministic, uncertainty in the initial states induces stochasticity in the P2D-ECT model. This stochasticity is resolved by spectrally projecting the stochastic P2D-ECT model on a set of orthogonal multivariate Hermite polynomials. Volume averaging in the stochastic dimensions is proposed for efficient numerical solution of the resultant model. A state estimation framework is developed using a transformation of the orthogonal basis to assimilate the measurables with this system of equations. Effectiveness of the proposed method is first demonstrated by assimilating the cell voltage and temperature data generated using a synthetic test bed. This validated method is used with the experimentally observed cell voltage and temperature data for state estimation at different operating conditions and drive cycle protocols. The results show increased prediction accuracy when the data is assimilated every 30s. High accuracy of the estimated states is exploited to infer temperature dependent behavior of the lithium-ion cell.

  1. Numerically stable algorithm for combining census and sample estimates with the multivariate composite estimator

    Treesearch

    R. L. Czaplewski

    2009-01-01

    The minimum variance multivariate composite estimator is a relatively simple sequential estimator for complex sampling designs (Czaplewski 2009). Such designs combine a probability sample of expensive field data with multiple censuses and/or samples of relatively inexpensive multi-sensor, multi-resolution remotely sensed data. Unfortunately, the multivariate composite...

  2. The Impact of State Laws Limiting Malpractice Damage Awards on Health Care Expenditures

    PubMed Central

    Hellinger, Fred J.; Encinosa, William E.

    2006-01-01

    Twenty-eight states have laws that limit payments in malpractice cases, and several studies indicate that these laws reduce the frequency and severity of malpractice claims and lower premiums. Moreover, proponents believe that such laws reduce health care expenditures by reducing the practice of defensive medicine. However, there is a dearth of empirical evidence about the impact of these laws on the cost of health care. We used multivariate models and relatively recent data to estimate the impact of state tort reform laws that directly limit malpractice damage payments on health care expenditures. Estimates from these models suggest that laws limiting malpractice payments lower state health care expenditures by between 3% and 4%. PMID:16809580

  3. State-granted practice authority: do nurse practitioners vote with their feet?

    PubMed

    Perry, John J

    2012-01-01

    Nurse practitioners have become an increasingly important part of the US medical workforce as they have gained greater practice authority through state-level regulatory changes. This study investigates one labor market impact of this large change in nurse practitioner regulation. Using data from the National Sample Survey of Registered Nurses and a dataset of state-level nurse practitioner prescribing authority, a multivariate estimation is performed analysing the impact of greater practice authority on the probability of a nurse practitioner moving from a state. The empirical results indicate that nurse practitioners in states that grant expanded practice are less likely to move from the state than nurse practitioners in states that have not granted expanded practice authority. The estimated effect is robust and is statistically and economically meaningful. This finding is in concert with and strengthens the wider literature which finds states that grant expanded practice authority to nurse practitioners tend to have larger nurse practitioner populations.

  4. Anomaly Monitoring Method for Key Components of Satellite

    PubMed Central

    Fan, Linjun; Xiao, Weidong; Tang, Jun

    2014-01-01

    This paper presented a fault diagnosis method for key components of satellite, called Anomaly Monitoring Method (AMM), which is made up of state estimation based on Multivariate State Estimation Techniques (MSET) and anomaly detection based on Sequential Probability Ratio Test (SPRT). On the basis of analysis failure of lithium-ion batteries (LIBs), we divided the failure of LIBs into internal failure, external failure, and thermal runaway and selected electrolyte resistance (R e) and the charge transfer resistance (R ct) as the key parameters of state estimation. Then, through the actual in-orbit telemetry data of the key parameters of LIBs, we obtained the actual residual value (R X) and healthy residual value (R L) of LIBs based on the state estimation of MSET, and then, through the residual values (R X and R L) of LIBs, we detected the anomaly states based on the anomaly detection of SPRT. Lastly, we conducted an example of AMM for LIBs, and, according to the results of AMM, we validated the feasibility and effectiveness of AMM by comparing it with the results of threshold detective method (TDM). PMID:24587703

  5. Multivariate survivorship analysis using two cross-sectional samples.

    PubMed

    Hill, M E

    1999-11-01

    As an alternative to survival analysis with longitudinal data, I introduce a method that can be applied when one observes the same cohort in two cross-sectional samples collected at different points in time. The method allows for the estimation of log-probability survivorship models that estimate the influence of multiple time-invariant factors on survival over a time interval separating two samples. This approach can be used whenever the survival process can be adequately conceptualized as an irreversible single-decrement process (e.g., mortality, the transition to first marriage among a cohort of never-married individuals). Using data from the Integrated Public Use Microdata Series (Ruggles and Sobek 1997), I illustrate the multivariate method through an investigation of the effects of race, parity, and educational attainment on the survival of older women in the United States.

  6. A New Multivariate Approach for Prognostics Based on Extreme Learning Machine and Fuzzy Clustering.

    PubMed

    Javed, Kamran; Gouriveau, Rafael; Zerhouni, Noureddine

    2015-12-01

    Prognostics is a core process of prognostics and health management (PHM) discipline, that estimates the remaining useful life (RUL) of a degrading machinery to optimize its service delivery potential. However, machinery operates in a dynamic environment and the acquired condition monitoring data are usually noisy and subject to a high level of uncertainty/unpredictability, which complicates prognostics. The complexity further increases, when there is absence of prior knowledge about ground truth (or failure definition). For such issues, data-driven prognostics can be a valuable solution without deep understanding of system physics. This paper contributes a new data-driven prognostics approach namely, an "enhanced multivariate degradation modeling," which enables modeling degrading states of machinery without assuming a homogeneous pattern. In brief, a predictability scheme is introduced to reduce the dimensionality of the data. Following that, the proposed prognostics model is achieved by integrating two new algorithms namely, the summation wavelet-extreme learning machine and subtractive-maximum entropy fuzzy clustering to show evolution of machine degradation by simultaneous predictions and discrete state estimation. The prognostics model is equipped with a dynamic failure threshold assignment procedure to estimate RUL in a realistic manner. To validate the proposition, a case study is performed on turbofan engines data from PHM challenge 2008 (NASA), and results are compared with recent publications.

  7. Multiple imputation for handling missing outcome data when estimating the relative risk.

    PubMed

    Sullivan, Thomas R; Lee, Katherine J; Ryan, Philip; Salter, Amy B

    2017-09-06

    Multiple imputation is a popular approach to handling missing data in medical research, yet little is known about its applicability for estimating the relative risk. Standard methods for imputing incomplete binary outcomes involve logistic regression or an assumption of multivariate normality, whereas relative risks are typically estimated using log binomial models. It is unclear whether misspecification of the imputation model in this setting could lead to biased parameter estimates. Using simulated data, we evaluated the performance of multiple imputation for handling missing data prior to estimating adjusted relative risks from a correctly specified multivariable log binomial model. We considered an arbitrary pattern of missing data in both outcome and exposure variables, with missing data induced under missing at random mechanisms. Focusing on standard model-based methods of multiple imputation, missing data were imputed using multivariate normal imputation or fully conditional specification with a logistic imputation model for the outcome. Multivariate normal imputation performed poorly in the simulation study, consistently producing estimates of the relative risk that were biased towards the null. Despite outperforming multivariate normal imputation, fully conditional specification also produced somewhat biased estimates, with greater bias observed for higher outcome prevalences and larger relative risks. Deleting imputed outcomes from analysis datasets did not improve the performance of fully conditional specification. Both multivariate normal imputation and fully conditional specification produced biased estimates of the relative risk, presumably since both use a misspecified imputation model. Based on simulation results, we recommend researchers use fully conditional specification rather than multivariate normal imputation and retain imputed outcomes in the analysis when estimating relative risks. However fully conditional specification is not without its shortcomings, and so further research is needed to identify optimal approaches for relative risk estimation within the multiple imputation framework.

  8. DasPy – Open Source Multivariate Land Data Assimilation Framework with High Performance Computing

    NASA Astrophysics Data System (ADS)

    Han, Xujun; Li, Xin; Montzka, Carsten; Kollet, Stefan; Vereecken, Harry; Hendricks Franssen, Harrie-Jan

    2015-04-01

    Data assimilation has become a popular method to integrate observations from multiple sources with land surface models to improve predictions of the water and energy cycles of the soil-vegetation-atmosphere continuum. In recent years, several land data assimilation systems have been developed in different research agencies. Because of the software availability or adaptability, these systems are not easy to apply for the purpose of multivariate land data assimilation research. Multivariate data assimilation refers to the simultaneous assimilation of observation data for multiple model state variables into a simulation model. Our main motivation was to develop an open source multivariate land data assimilation framework (DasPy) which is implemented using the Python script language mixed with C++ and Fortran language. This system has been evaluated in several soil moisture, L-band brightness temperature and land surface temperature assimilation studies. The implementation allows also parameter estimation (soil properties and/or leaf area index) on the basis of the joint state and parameter estimation approach. LETKF (Local Ensemble Transform Kalman Filter) is implemented as the main data assimilation algorithm, and uncertainties in the data assimilation can be represented by perturbed atmospheric forcings, perturbed soil and vegetation properties and model initial conditions. The CLM4.5 (Community Land Model) was integrated as the model operator. The CMEM (Community Microwave Emission Modelling Platform), COSMIC (COsmic-ray Soil Moisture Interaction Code) and the two source formulation were integrated as observation operators for assimilation of L-band passive microwave, cosmic-ray soil moisture probe and land surface temperature measurements, respectively. DasPy is parallelized using the hybrid MPI (Message Passing Interface) and OpenMP (Open Multi-Processing) techniques. All the input and output data flow is organized efficiently using the commonly used NetCDF file format. Online 1D and 2D visualization of data assimilation results is also implemented to facilitate the post simulation analysis. In summary, DasPy is a ready to use open source parallel multivariate land data assimilation framework.

  9. Optimized tuner selection for engine performance estimation

    NASA Technical Reports Server (NTRS)

    Simon, Donald L. (Inventor); Garg, Sanjay (Inventor)

    2013-01-01

    A methodology for minimizing the error in on-line Kalman filter-based aircraft engine performance estimation applications is presented. This technique specifically addresses the underdetermined estimation problem, where there are more unknown parameters than available sensor measurements. A systematic approach is applied to produce a model tuning parameter vector of appropriate dimension to enable estimation by a Kalman filter, while minimizing the estimation error in the parameters of interest. Tuning parameter selection is performed using a multi-variable iterative search routine which seeks to minimize the theoretical mean-squared estimation error. Theoretical Kalman filter estimation error bias and variance values are derived at steady-state operating conditions, and the tuner selection routine is applied to minimize these values. The new methodology yields an improvement in on-line engine performance estimation accuracy.

  10. Nonparametric estimation of the multivariate survivor function: the multivariate Kaplan-Meier estimator.

    PubMed

    Prentice, Ross L; Zhao, Shanshan

    2018-01-01

    The Dabrowska (Ann Stat 16:1475-1489, 1988) product integral representation of the multivariate survivor function is extended, leading to a nonparametric survivor function estimator for an arbitrary number of failure time variates that has a simple recursive formula for its calculation. Empirical process methods are used to sketch proofs for this estimator's strong consistency and weak convergence properties. Summary measures of pairwise and higher-order dependencies are also defined and nonparametrically estimated. Simulation evaluation is given for the special case of three failure time variates.

  11. Scalable Joint Models for Reliable Uncertainty-Aware Event Prediction.

    PubMed

    Soleimani, Hossein; Hensman, James; Saria, Suchi

    2017-08-21

    Missing data and noisy observations pose significant challenges for reliably predicting events from irregularly sampled multivariate time series (longitudinal) data. Imputation methods, which are typically used for completing the data prior to event prediction, lack a principled mechanism to account for the uncertainty due to missingness. Alternatively, state-of-the-art joint modeling techniques can be used for jointly modeling the longitudinal and event data and compute event probabilities conditioned on the longitudinal observations. These approaches, however, make strong parametric assumptions and do not easily scale to multivariate signals with many observations. Our proposed approach consists of several key innovations. First, we develop a flexible and scalable joint model based upon sparse multiple-output Gaussian processes. Unlike state-of-the-art joint models, the proposed model can explain highly challenging structure including non-Gaussian noise while scaling to large data. Second, we derive an optimal policy for predicting events using the distribution of the event occurrence estimated by the joint model. The derived policy trades-off the cost of a delayed detection versus incorrect assessments and abstains from making decisions when the estimated event probability does not satisfy the derived confidence criteria. Experiments on a large dataset show that the proposed framework significantly outperforms state-of-the-art techniques in event prediction.

  12. A symmetric multivariate leakage correction for MEG connectomes

    PubMed Central

    Colclough, G.L.; Brookes, M.J.; Smith, S.M.; Woolrich, M.W.

    2015-01-01

    Ambiguities in the source reconstruction of magnetoencephalographic (MEG) measurements can cause spurious correlations between estimated source time-courses. In this paper, we propose a symmetric orthogonalisation method to correct for these artificial correlations between a set of multiple regions of interest (ROIs). This process enables the straightforward application of network modelling methods, including partial correlation or multivariate autoregressive modelling, to infer connectomes, or functional networks, from the corrected ROIs. Here, we apply the correction to simulated MEG recordings of simple networks and to a resting-state dataset collected from eight subjects, before computing the partial correlations between power envelopes of the corrected ROItime-courses. We show accurate reconstruction of our simulated networks, and in the analysis of real MEGresting-state connectivity, we find dense bilateral connections within the motor and visual networks, together with longer-range direct fronto-parietal connections. PMID:25862259

  13. Modeling a multivariable reactor and on-line model predictive control.

    PubMed

    Yu, D W; Yu, D L

    2005-10-01

    A nonlinear first principle model is developed for a laboratory-scaled multivariable chemical reactor rig in this paper and the on-line model predictive control (MPC) is implemented to the rig. The reactor has three variables-temperature, pH, and dissolved oxygen with nonlinear dynamics-and is therefore used as a pilot system for the biochemical industry. A nonlinear discrete-time model is derived for each of the three output variables and their model parameters are estimated from the real data using an adaptive optimization method. The developed model is used in a nonlinear MPC scheme. An accurate multistep-ahead prediction is obtained for MPC, where the extended Kalman filter is used to estimate system unknown states. The on-line control is implemented and a satisfactory tracking performance is achieved. The MPC is compared with three decentralized PID controllers and the advantage of the nonlinear MPC over the PID is clearly shown.

  14. Usual Dietary Intakes: SAS Macros for Fitting Multivariate Measurement Error Models & Estimating Multivariate Usual Intake Distributions

    Cancer.gov

    The following SAS macros can be used to create a multivariate usual intake distribution for multiple dietary components that are consumed nearly every day or episodically. A SAS macro for performing balanced repeated replication (BRR) variance estimation is also included.

  15. A Bayesian Multivariate Receptor Model for Estimating Source Contributions to Particulate Matter Pollution using National Databases.

    PubMed

    Hackstadt, Amber J; Peng, Roger D

    2014-11-01

    Time series studies have suggested that air pollution can negatively impact health. These studies have typically focused on the total mass of fine particulate matter air pollution or the individual chemical constituents that contribute to it, and not source-specific contributions to air pollution. Source-specific contribution estimates are useful from a regulatory standpoint by allowing regulators to focus limited resources on reducing emissions from sources that are major contributors to air pollution and are also desired when estimating source-specific health effects. However, researchers often lack direct observations of the emissions at the source level. We propose a Bayesian multivariate receptor model to infer information about source contributions from ambient air pollution measurements. The proposed model incorporates information from national databases containing data on both the composition of source emissions and the amount of emissions from known sources of air pollution. The proposed model is used to perform source apportionment analyses for two distinct locations in the United States (Boston, Massachusetts and Phoenix, Arizona). Our results mirror previous source apportionment analyses that did not utilize the information from national databases and provide additional information about uncertainty that is relevant to the estimation of health effects.

  16. Predicting Potential Changes in Suitable Habitat and Distribution by 2100 for Tree Species of the Eastern United States

    Treesearch

    Louis R Iverson; Anantha M. Prasad; Mark W. Schwartz; Mark W. Schwartz

    2005-01-01

    We predict current distribution and abundance for tree species present in eastern North America, and subsequently estimate potential suitable habitat for those species under a changed climate with 2 x CO2. We used a series of statistical models (i.e., Regression Tree Analysis (RTA), Multivariate Adaptive Regression Splines (MARS), Bagging Trees (...

  17. Inter-hospital transfer is associated with increased mortality and costs in severe sepsis and septic shock: An instrumental variables approach.

    PubMed

    Mohr, Nicholas M; Harland, Karisa K; Shane, Dan M; Ahmed, Azeemuddin; Fuller, Brian M; Torner, James C

    2016-12-01

    The objective of this study was to evaluate the impact of regionalization on sepsis survival, to describe the role of inter-hospital transfer in rural sepsis care, and to measure the cost of inter-hospital transfer in a predominantly rural state. Observational case-control study using statewide administrative claims data from 2005 to 2014 in a predominantly rural Midwestern state. Mortality and marginal costs were estimated with multivariable generalized estimating equations models and with instrumental variables models. A total of 18 246 patients were included, of which 59% were transferred between hospitals. Transferred patients had higher mortality and longer hospital length-of-stay than non-transferred patients. Using a multivariable generalized estimating equations (GEE) model to adjust for potentially confounding factors, inter-hospital transfer was associated with increased mortality (aOR 1.7, 95% CI 1.5-1.9). Using an instrumental variables model, transfer was associated with a 9.2% increased risk of death. Transfer was associated with additional costs of $6897 (95% CI $5769-8024). Even when limiting to only those patients who received care in the largest hospitals, transfer was still associated with $5167 (95% CI $3696-6638) in additional cost. The majority of rural sepsis patients are transferred, and these transferred patients have higher mortality and significantly increased cost of care. Copyright © 2016 Elsevier Inc. All rights reserved.

  18. Multivariate Longitudinal Analysis with Bivariate Correlation Test

    PubMed Central

    Adjakossa, Eric Houngla; Sadissou, Ibrahim; Hounkonnou, Mahouton Norbert; Nuel, Gregory

    2016-01-01

    In the context of multivariate multilevel data analysis, this paper focuses on the multivariate linear mixed-effects model, including all the correlations between the random effects when the dimensional residual terms are assumed uncorrelated. Using the EM algorithm, we suggest more general expressions of the model’s parameters estimators. These estimators can be used in the framework of the multivariate longitudinal data analysis as well as in the more general context of the analysis of multivariate multilevel data. By using a likelihood ratio test, we test the significance of the correlations between the random effects of two dependent variables of the model, in order to investigate whether or not it is useful to model these dependent variables jointly. Simulation studies are done to assess both the parameter recovery performance of the EM estimators and the power of the test. Using two empirical data sets which are of longitudinal multivariate type and multivariate multilevel type, respectively, the usefulness of the test is illustrated. PMID:27537692

  19. Multivariate Longitudinal Analysis with Bivariate Correlation Test.

    PubMed

    Adjakossa, Eric Houngla; Sadissou, Ibrahim; Hounkonnou, Mahouton Norbert; Nuel, Gregory

    2016-01-01

    In the context of multivariate multilevel data analysis, this paper focuses on the multivariate linear mixed-effects model, including all the correlations between the random effects when the dimensional residual terms are assumed uncorrelated. Using the EM algorithm, we suggest more general expressions of the model's parameters estimators. These estimators can be used in the framework of the multivariate longitudinal data analysis as well as in the more general context of the analysis of multivariate multilevel data. By using a likelihood ratio test, we test the significance of the correlations between the random effects of two dependent variables of the model, in order to investigate whether or not it is useful to model these dependent variables jointly. Simulation studies are done to assess both the parameter recovery performance of the EM estimators and the power of the test. Using two empirical data sets which are of longitudinal multivariate type and multivariate multilevel type, respectively, the usefulness of the test is illustrated.

  20. Exploring image data assimilation in the prospect of high-resolution satellite oceanic observations

    NASA Astrophysics Data System (ADS)

    Durán Moro, Marina; Brankart, Jean-Michel; Brasseur, Pierre; Verron, Jacques

    2017-07-01

    Satellite sensors increasingly provide high-resolution (HR) observations of the ocean. They supply observations of sea surface height (SSH) and of tracers of the dynamics such as sea surface salinity (SSS) and sea surface temperature (SST). In particular, the Surface Water Ocean Topography (SWOT) mission will provide measurements of the surface ocean topography at very high-resolution (HR) delivering unprecedented information on the meso-scale and submeso-scale dynamics. This study investigates the feasibility to use these measurements to reconstruct meso-scale features simulated by numerical models, in particular on the vertical dimension. A methodology to reconstruct three-dimensional (3D) multivariate meso-scale scenes is developed by using a HR numerical model of the Solomon Sea region. An inverse problem is defined in the framework of a twin experiment where synthetic observations are used. A true state is chosen among the 3D multivariate states which is considered as a reference state. In order to correct a first guess of this true state, a two-step analysis is carried out. A probability distribution of the first guess is defined and updated at each step of the analysis: (i) the first step applies the analysis scheme of a reduced-order Kalman filter to update the first guess probability distribution using SSH observation; (ii) the second step minimizes a cost function using observations of HR image structure and a new probability distribution is estimated. The analysis is extended to the vertical dimension using 3D multivariate empirical orthogonal functions (EOFs) and the probabilistic approach allows the update of the probability distribution through the two-step analysis. Experiments show that the proposed technique succeeds in correcting a multivariate state using meso-scale and submeso-scale information contained in HR SSH and image structure observations. It also demonstrates how the surface information can be used to reconstruct the ocean state below the surface.

  1. Estimation and model selection of semiparametric multivariate survival functions under general censorship.

    PubMed

    Chen, Xiaohong; Fan, Yanqin; Pouzo, Demian; Ying, Zhiliang

    2010-07-01

    We study estimation and model selection of semiparametric models of multivariate survival functions for censored data, which are characterized by possibly misspecified parametric copulas and nonparametric marginal survivals. We obtain the consistency and root- n asymptotic normality of a two-step copula estimator to the pseudo-true copula parameter value according to KLIC, and provide a simple consistent estimator of its asymptotic variance, allowing for a first-step nonparametric estimation of the marginal survivals. We establish the asymptotic distribution of the penalized pseudo-likelihood ratio statistic for comparing multiple semiparametric multivariate survival functions subject to copula misspecification and general censorship. An empirical application is provided.

  2. Estimation and model selection of semiparametric multivariate survival functions under general censorship

    PubMed Central

    Chen, Xiaohong; Fan, Yanqin; Pouzo, Demian; Ying, Zhiliang

    2013-01-01

    We study estimation and model selection of semiparametric models of multivariate survival functions for censored data, which are characterized by possibly misspecified parametric copulas and nonparametric marginal survivals. We obtain the consistency and root-n asymptotic normality of a two-step copula estimator to the pseudo-true copula parameter value according to KLIC, and provide a simple consistent estimator of its asymptotic variance, allowing for a first-step nonparametric estimation of the marginal survivals. We establish the asymptotic distribution of the penalized pseudo-likelihood ratio statistic for comparing multiple semiparametric multivariate survival functions subject to copula misspecification and general censorship. An empirical application is provided. PMID:24790286

  3. Work and retirement among a cohort of older men in the United States, 1966-1983.

    PubMed

    Hayward, M D; Grady, W R

    1990-08-01

    Multivariate increment-decrement working life tables are estimated for a cohort of older men in the United States for the period 1966-1983. The approach taken allows multiple processes to be simultaneously incorporated into a single model, resulting in a more realistic portrayal of a cohort's late-life labor force behavior. In addition, because the life table model is developed from multivariate hazard equations, we identify the effects of sociodemographic characteristics on the potentially complex process by which the labor force career is ended. In contrast to the assumed homogeneity of previous working life table analyses, the present study shows marked differences in labor force mobility and working and nonworking life expectancy according to occupation, class of worker, education, race, and marital status. We briefly discuss the implications of these findings for inequities of access to retirement, private and public pension consumption, and future changes in the retirement process.

  4. Promoting accountability: hospital charity care in California, Washington state, and Texas.

    PubMed

    Sutton, Janet P; Stensland, Jeffrey

    2004-05-01

    Debate as to whether private hospitals meet their charitable obligations is heated. This study examines how alternative state approaches for ensuring hospital accountability to the community affects charitable expenditures and potentially affects access to care for the uninsured. Descriptive and multivariate analyses were used to compare private California hospitals' charity care expenditures with those of hospitals in Texas and Washington state. The key finding from this study is that net of hospital characteristics, market characteristics and community need, Texas hospitals were estimated to provide over 3 times more charity care and Washington hospitals were estimated to provide 66% more charity care than California hospitals. This finding suggests that more prescriptive community benefit or charity care requirements may be necessary to ensure that private hospitals assume a larger role in the care of the uninsured.

  5. Statistical inferences for data from studies conducted with an aggregated multivariate outcome-dependent sample design

    PubMed Central

    Lu, Tsui-Shan; Longnecker, Matthew P.; Zhou, Haibo

    2016-01-01

    Outcome-dependent sampling (ODS) scheme is a cost-effective sampling scheme where one observes the exposure with a probability that depends on the outcome. The well-known such design is the case-control design for binary response, the case-cohort design for the failure time data and the general ODS design for a continuous response. While substantial work has been done for the univariate response case, statistical inference and design for the ODS with multivariate cases remain under-developed. Motivated by the need in biological studies for taking the advantage of the available responses for subjects in a cluster, we propose a multivariate outcome dependent sampling (Multivariate-ODS) design that is based on a general selection of the continuous responses within a cluster. The proposed inference procedure for the Multivariate-ODS design is semiparametric where all the underlying distributions of covariates are modeled nonparametrically using the empirical likelihood methods. We show that the proposed estimator is consistent and developed the asymptotically normality properties. Simulation studies show that the proposed estimator is more efficient than the estimator obtained using only the simple-random-sample portion of the Multivariate-ODS or the estimator from a simple random sample with the same sample size. The Multivariate-ODS design together with the proposed estimator provides an approach to further improve study efficiency for a given fixed study budget. We illustrate the proposed design and estimator with an analysis of association of PCB exposure to hearing loss in children born to the Collaborative Perinatal Study. PMID:27966260

  6. The public costs of births resulting from unintended pregnancies: national and state-level estimates.

    PubMed

    Sonfield, Adam; Kost, Kathryn; Gold, Rachel Benson; Finer, Lawrence B

    2011-06-01

    Births resulting from unintended pregnancies are associated with substantial maternity and infant care costs to the federal and state governments; these costs have never been estimated at the national and state levels. The proportions of births paid for by public insurance programs in 2006 were estimated, by pregnancy intention status, using data from the Pregnancy Risk Assessment Monitoring System and similar state surveys, or were predicted by multivariate linear regression. Public costs were calculated using state-level estimates of the number of births, by intention status, and of the cost of a publicly funded birth. In 2006, 64% of births resulting from unintended pregnancies were publicly funded, compared with 48% of all births and 35% of births resulting from intended pregnancies. The proportion of births resulting from unintended pregnancies that were publicly funded varied by state, from 42% to 81%. Of the 2.0 million publicly funded births, 51% resulted from unintended pregnancies, accounting for $11.1 billion in costs-half of the total public expenditures on births. In seven states, the costs for births from unintended pregnancies exceeded a half billion dollars. Public insurance programs are central in assisting American families in affording pregnancy and childbirth; however, they pay for a disproportionately high number of births resulting from unintended pregnancy. The resulting budgetary impact warrants increased public efforts to reduce unintended pregnancy. Copyright © 2011 by the Guttmacher Institute.

  7. Joint coverage probability in a simulation study on Continuous-Time Markov Chain parameter estimation.

    PubMed

    Benoit, Julia S; Chan, Wenyaw; Doody, Rachelle S

    2015-01-01

    Parameter dependency within data sets in simulation studies is common, especially in models such as Continuous-Time Markov Chains (CTMC). Additionally, the literature lacks a comprehensive examination of estimation performance for the likelihood-based general multi-state CTMC. Among studies attempting to assess the estimation, none have accounted for dependency among parameter estimates. The purpose of this research is twofold: 1) to develop a multivariate approach for assessing accuracy and precision for simulation studies 2) to add to the literature a comprehensive examination of the estimation of a general 3-state CTMC model. Simulation studies are conducted to analyze longitudinal data with a trinomial outcome using a CTMC with and without covariates. Measures of performance including bias, component-wise coverage probabilities, and joint coverage probabilities are calculated. An application is presented using Alzheimer's disease caregiver stress levels. Comparisons of joint and component-wise parameter estimates yield conflicting inferential results in simulations from models with and without covariates. In conclusion, caution should be taken when conducting simulation studies aiming to assess performance and choice of inference should properly reflect the purpose of the simulation.

  8. Design and evaluation of a robust dynamic neurocontroller for a multivariable aircraft control problem

    NASA Technical Reports Server (NTRS)

    Troudet, T.; Garg, S.; Merrill, W.

    1992-01-01

    The design of a dynamic neurocontroller with good robustness properties is presented for a multivariable aircraft control problem. The internal dynamics of the neurocontroller are synthesized by a state estimator feedback loop. The neurocontrol is generated by a multilayer feedforward neural network which is trained through backpropagation to minimize an objective function that is a weighted sum of tracking errors, and control input commands and rates. The neurocontroller exhibits good robustness through stability margins in phase and vehicle output gains. By maintaining performance and stability in the presence of sensor failures in the error loops, the structure of the neurocontroller is also consistent with the classical approach of flight control design.

  9. A Comparison of Three Multivariate Models for Estimating Test Battery Reliability.

    ERIC Educational Resources Information Center

    Wood, Terry M.; Safrit, Margaret J.

    1987-01-01

    A comparison of three multivariate models (canonical reliability model, maximum generalizability model, canonical correlation model) for estimating test battery reliability indicated that the maximum generalizability model showed the least degree of bias, smallest errors in estimation, and the greatest relative efficiency across all experimental…

  10. Evolution of the Marine Officer Fitness Report: A Multivariate Analysis

    DTIC Science & Technology

    This thesis explores the evaluation behavior of United States Marine Corps (USMC) Reporting Seniors (RSs) from 2010 to 2017. Using fitness report...RSs evaluate the performance of subordinate active component unrestricted officer MROs over time. I estimate logistic regression models of the...lowest. However, these correlations indicating the effects of race matching on FITREP evaluations narrow in significance when performance-based factors

  11. Photos for estimating fuel loadings before and after prescribed burning in the upper coastal plain of the southeast

    Treesearch

    Eric R. Scholl; Thomas A. Waldrop

    1999-01-01

    Although prescribed burning is common in the Southeastern United States, most fuel models apply to only western forests. This paper documents a fuel classification system that was developed for plantations of loblolly and longleaf pines for the Upper Coastal Plain region. Multivariate analysis of variance and discriminant function analysis were used to confirm eight...

  12. Alpha-canonical form representation of the open loop dynamics of the Space Shuttle main engine

    NASA Technical Reports Server (NTRS)

    Duyar, Almet; Eldem, Vasfi; Merrill, Walter C.; Guo, Ten-Huei

    1991-01-01

    A parameter and structure estimation technique for multivariable systems is used to obtain a state space representation of open loop dynamics of the space shuttle main engine in alpha-canonical form. The parameterization being used is both minimal and unique. The simplified linear model may be used for fault detection studies and control system design and development.

  13. DasPy 1.0 - the Open Source Multivariate Land Data Assimilation Framework in combination with the Community Land Model 4.5

    NASA Astrophysics Data System (ADS)

    Han, X.; Li, X.; He, G.; Kumbhar, P.; Montzka, C.; Kollet, S.; Miyoshi, T.; Rosolem, R.; Zhang, Y.; Vereecken, H.; Franssen, H.-J. H.

    2015-08-01

    Data assimilation has become a popular method to integrate observations from multiple sources with land surface models to improve predictions of the water and energy cycles of the soil-vegetation-atmosphere continuum. Multivariate data assimilation refers to the simultaneous assimilation of observation data from multiple model state variables into a simulation model. In recent years, several land data assimilation systems have been developed in different research agencies. Because of the software availability or adaptability, these systems are not easy to apply for the purpose of multivariate land data assimilation research. We developed an open source multivariate land data assimilation framework (DasPy) which is implemented using the Python script language mixed with the C++ and Fortran programming languages. LETKF (Local Ensemble Transform Kalman Filter) is implemented as the main data assimilation algorithm, and uncertainties in the data assimilation can be introduced by perturbed atmospheric forcing data, and represented by perturbed soil and vegetation parameters and model initial conditions. The Community Land Model (CLM) was integrated as the model operator. The implementation allows also parameter estimation (soil properties and/or leaf area index) on the basis of the joint state and parameter estimation approach. The Community Microwave Emission Modelling platform (CMEM), COsmic-ray Soil Moisture Interaction Code (COSMIC) and the Two-Source Formulation (TSF) were integrated as observation operators for the assimilation of L-band passive microwave, cosmic-ray soil moisture probe and land surface temperature measurements, respectively. DasPy has been evaluated in several assimilation studies of neutron count intensity (soil moisture), L-band brightness temperature and land surface temperature. DasPy is parallelized using the hybrid Message Passing Interface and Open Multi-Processing techniques. All the input and output data flows are organized efficiently using the commonly used NetCDF file format. Online 1-D and 2-D visualization of data assimilation results is also implemented to facilitate the post simulation analysis. In summary, DasPy is a ready to use open source parallel multivariate land data assimilation framework.

  14. Understanding characteristics in multivariate traffic flow time series from complex network structure

    NASA Astrophysics Data System (ADS)

    Yan, Ying; Zhang, Shen; Tang, Jinjun; Wang, Xiaofei

    2017-07-01

    Discovering dynamic characteristics in traffic flow is the significant step to design effective traffic managing and controlling strategy for relieving traffic congestion in urban cities. A new method based on complex network theory is proposed to study multivariate traffic flow time series. The data were collected from loop detectors on freeway during a year. In order to construct complex network from original traffic flow, a weighted Froenius norm is adopt to estimate similarity between multivariate time series, and Principal Component Analysis is implemented to determine the weights. We discuss how to select optimal critical threshold for networks at different hour in term of cumulative probability distribution of degree. Furthermore, two statistical properties of networks: normalized network structure entropy and cumulative probability of degree, are utilized to explore hourly variation in traffic flow. The results demonstrate these two statistical quantities express similar pattern to traffic flow parameters with morning and evening peak hours. Accordingly, we detect three traffic states: trough, peak and transitional hours, according to the correlation between two aforementioned properties. The classifying results of states can actually represent hourly fluctuation in traffic flow by analyzing annual average hourly values of traffic volume, occupancy and speed in corresponding hours.

  15. Multivariate Granger causality: an estimation framework based on factorization of the spectral density matrix

    PubMed Central

    Wen, Xiaotong; Rangarajan, Govindan; Ding, Mingzhou

    2013-01-01

    Granger causality is increasingly being applied to multi-electrode neurophysiological and functional imaging data to characterize directional interactions between neurons and brain regions. For a multivariate dataset, one might be interested in different subsets of the recorded neurons or brain regions. According to the current estimation framework, for each subset, one conducts a separate autoregressive model fitting process, introducing the potential for unwanted variability and uncertainty. In this paper, we propose a multivariate framework for estimating Granger causality. It is based on spectral density matrix factorization and offers the advantage that the estimation of such a matrix needs to be done only once for the entire multivariate dataset. For any subset of recorded data, Granger causality can be calculated through factorizing the appropriate submatrix of the overall spectral density matrix. PMID:23858479

  16. Multivariate Density Estimation and Remote Sensing

    NASA Technical Reports Server (NTRS)

    Scott, D. W.

    1983-01-01

    Current efforts to develop methods and computer algorithms to effectively represent multivariate data commonly encountered in remote sensing applications are described. While this may involve scatter diagrams, multivariate representations of nonparametric probability density estimates are emphasized. The density function provides a useful graphical tool for looking at data and a useful theoretical tool for classification. This approach is called a thunderstorm data analysis.

  17. Statistical inferences for data from studies conducted with an aggregated multivariate outcome-dependent sample design.

    PubMed

    Lu, Tsui-Shan; Longnecker, Matthew P; Zhou, Haibo

    2017-03-15

    Outcome-dependent sampling (ODS) scheme is a cost-effective sampling scheme where one observes the exposure with a probability that depends on the outcome. The well-known such design is the case-control design for binary response, the case-cohort design for the failure time data, and the general ODS design for a continuous response. While substantial work has been carried out for the univariate response case, statistical inference and design for the ODS with multivariate cases remain under-developed. Motivated by the need in biological studies for taking the advantage of the available responses for subjects in a cluster, we propose a multivariate outcome-dependent sampling (multivariate-ODS) design that is based on a general selection of the continuous responses within a cluster. The proposed inference procedure for the multivariate-ODS design is semiparametric where all the underlying distributions of covariates are modeled nonparametrically using the empirical likelihood methods. We show that the proposed estimator is consistent and developed the asymptotically normality properties. Simulation studies show that the proposed estimator is more efficient than the estimator obtained using only the simple-random-sample portion of the multivariate-ODS or the estimator from a simple random sample with the same sample size. The multivariate-ODS design together with the proposed estimator provides an approach to further improve study efficiency for a given fixed study budget. We illustrate the proposed design and estimator with an analysis of association of polychlorinated biphenyl exposure to hearing loss in children born to the Collaborative Perinatal Study. Copyright © 2016 John Wiley & Sons, Ltd. Copyright © 2016 John Wiley & Sons, Ltd.

  18. Noise source and reactor stability estimation in a boiling water reactor using a multivariate autoregressive model

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Kanemoto, S.; Andoh, Y.; Sandoz, S.A.

    1984-10-01

    A method for evaluating reactor stability in boiling water reactors has been developed. The method is based on multivariate autoregressive (M-AR) modeling of steady-state neutron and process noise signals. In this method, two kinds of power spectral densities (PSDs) for the measured neutron signal and the corresponding noise source signal are separately identified by the M-AR modeling. The closed- and open-loop stability parameters are evaluated from these PSDs. The method is applied to actual plant noise data that were measured together with artificial perturbation test data. Stability parameters identified from noise data are compared to those from perturbation test data,more » and it is shown that both results are in good agreement. In addition to these stability estimations, driving noise sources for the neutron signal are evaluated by the M-AR modeling. Contributions from void, core flow, and pressure noise sources are quantitatively evaluated, and the void noise source is shown to be the most dominant.« less

  19. Meal Detection in Patients With Type 1 Diabetes: A New Module for the Multivariable Adaptive Artificial Pancreas Control System.

    PubMed

    Turksoy, Kamuran; Samadi, Sediqeh; Feng, Jianyuan; Littlejohn, Elizabeth; Quinn, Laurie; Cinar, Ali

    2016-01-01

    A novel meal-detection algorithm is developed based on continuous glucose measurements. Bergman's minimal model is modified and used in an unscented Kalman filter for state estimations. The estimated rate of appearance of glucose is used for meal detection. Data from nine subjects are used to assess the performance of the algorithm. The results indicate that the proposed algorithm works successfully with high accuracy. The average change in glucose levels between the meals and the detection points is 16(±9.42) [mg/dl] for 61 successfully detected meals and snacks. The algorithm is developed as a new module of an integrated multivariable adaptive artificial pancreas control system. Meal detection with the proposed method is used to administer insulin boluses and prevent most of postprandial hyperglycemia without any manual meal announcements. A novel meal bolus calculation method is proposed and tested with the UVA/Padova simulator. The results indicate significant reduction in hyperglycemia.

  20. Multivariate Meta-Analysis of Genetic Association Studies: A Simulation Study

    PubMed Central

    Neupane, Binod; Beyene, Joseph

    2015-01-01

    In a meta-analysis with multiple end points of interests that are correlated between or within studies, multivariate approach to meta-analysis has a potential to produce more precise estimates of effects by exploiting the correlation structure between end points. However, under random-effects assumption the multivariate estimation is more complex (as it involves estimation of more parameters simultaneously) than univariate estimation, and sometimes can produce unrealistic parameter estimates. Usefulness of multivariate approach to meta-analysis of the effects of a genetic variant on two or more correlated traits is not well understood in the area of genetic association studies. In such studies, genetic variants are expected to roughly maintain Hardy-Weinberg equilibrium within studies, and also their effects on complex traits are generally very small to modest and could be heterogeneous across studies for genuine reasons. We carried out extensive simulation to explore the comparative performance of multivariate approach with most commonly used univariate inverse-variance weighted approach under random-effects assumption in various realistic meta-analytic scenarios of genetic association studies of correlated end points. We evaluated the performance with respect to relative mean bias percentage, and root mean square error (RMSE) of the estimate and coverage probability of corresponding 95% confidence interval of the effect for each end point. Our simulation results suggest that multivariate approach performs similarly or better than univariate method when correlations between end points within or between studies are at least moderate and between-study variation is similar or larger than average within-study variation for meta-analyses of 10 or more genetic studies. Multivariate approach produces estimates with smaller bias and RMSE especially for the end point that has randomly or informatively missing summary data in some individual studies, when the missing data in the endpoint are imputed with null effects and quite large variance. PMID:26196398

  1. Multivariate Meta-Analysis of Genetic Association Studies: A Simulation Study.

    PubMed

    Neupane, Binod; Beyene, Joseph

    2015-01-01

    In a meta-analysis with multiple end points of interests that are correlated between or within studies, multivariate approach to meta-analysis has a potential to produce more precise estimates of effects by exploiting the correlation structure between end points. However, under random-effects assumption the multivariate estimation is more complex (as it involves estimation of more parameters simultaneously) than univariate estimation, and sometimes can produce unrealistic parameter estimates. Usefulness of multivariate approach to meta-analysis of the effects of a genetic variant on two or more correlated traits is not well understood in the area of genetic association studies. In such studies, genetic variants are expected to roughly maintain Hardy-Weinberg equilibrium within studies, and also their effects on complex traits are generally very small to modest and could be heterogeneous across studies for genuine reasons. We carried out extensive simulation to explore the comparative performance of multivariate approach with most commonly used univariate inverse-variance weighted approach under random-effects assumption in various realistic meta-analytic scenarios of genetic association studies of correlated end points. We evaluated the performance with respect to relative mean bias percentage, and root mean square error (RMSE) of the estimate and coverage probability of corresponding 95% confidence interval of the effect for each end point. Our simulation results suggest that multivariate approach performs similarly or better than univariate method when correlations between end points within or between studies are at least moderate and between-study variation is similar or larger than average within-study variation for meta-analyses of 10 or more genetic studies. Multivariate approach produces estimates with smaller bias and RMSE especially for the end point that has randomly or informatively missing summary data in some individual studies, when the missing data in the endpoint are imputed with null effects and quite large variance.

  2. Multivariate generalized hidden Markov regression models with random covariates: Physical exercise in an elderly population.

    PubMed

    Punzo, Antonio; Ingrassia, Salvatore; Maruotti, Antonello

    2018-04-22

    A time-varying latent variable model is proposed to jointly analyze multivariate mixed-support longitudinal data. The proposal can be viewed as an extension of hidden Markov regression models with fixed covariates (HMRMFCs), which is the state of the art for modelling longitudinal data, with a special focus on the underlying clustering structure. HMRMFCs are inadequate for applications in which a clustering structure can be identified in the distribution of the covariates, as the clustering is independent from the covariates distribution. Here, hidden Markov regression models with random covariates are introduced by explicitly specifying state-specific distributions for the covariates, with the aim of improving the recovering of the clusters in the data with respect to a fixed covariates paradigm. The hidden Markov regression models with random covariates class is defined focusing on the exponential family, in a generalized linear model framework. Model identifiability conditions are sketched, an expectation-maximization algorithm is outlined for parameter estimation, and various implementation and operational issues are discussed. Properties of the estimators of the regression coefficients, as well as of the hidden path parameters, are evaluated through simulation experiments and compared with those of HMRMFCs. The method is applied to physical activity data. Copyright © 2018 John Wiley & Sons, Ltd.

  3. A Penalized Likelihood Framework For High-Dimensional Phylogenetic Comparative Methods And An Application To New-World Monkeys Brain Evolution.

    PubMed

    Julien, Clavel; Leandro, Aristide; Hélène, Morlon

    2018-06-19

    Working with high-dimensional phylogenetic comparative datasets is challenging because likelihood-based multivariate methods suffer from low statistical performances as the number of traits p approaches the number of species n and because some computational complications occur when p exceeds n. Alternative phylogenetic comparative methods have recently been proposed to deal with the large p small n scenario but their use and performances are limited. Here we develop a penalized likelihood framework to deal with high-dimensional comparative datasets. We propose various penalizations and methods for selecting the intensity of the penalties. We apply this general framework to the estimation of parameters (the evolutionary trait covariance matrix and parameters of the evolutionary model) and model comparison for the high-dimensional multivariate Brownian (BM), Early-burst (EB), Ornstein-Uhlenbeck (OU) and Pagel's lambda models. We show using simulations that our penalized likelihood approach dramatically improves the estimation of evolutionary trait covariance matrices and model parameters when p approaches n, and allows for their accurate estimation when p equals or exceeds n. In addition, we show that penalized likelihood models can be efficiently compared using Generalized Information Criterion (GIC). We implement these methods, as well as the related estimation of ancestral states and the computation of phylogenetic PCA in the R package RPANDA and mvMORPH. Finally, we illustrate the utility of the new proposed framework by evaluating evolutionary models fit, analyzing integration patterns, and reconstructing evolutionary trajectories for a high-dimensional 3-D dataset of brain shape in the New World monkeys. We find a clear support for an Early-burst model suggesting an early diversification of brain morphology during the ecological radiation of the clade. Penalized likelihood offers an efficient way to deal with high-dimensional multivariate comparative data.

  4. Testing for Human Immunodeficiency Virus Among Cancer Survivors Under Age 65 in the United States

    PubMed Central

    Thompson, Trevor D.; Tai, Eric; Zhao, Guixiang; Oster, Alexandra M.

    2014-01-01

    Introduction Knowing the human immunodeficiency virus (HIV) serostatus of patients at the time of cancer diagnosis or cancer recurrence is prerequisite to coordinating HIV and cancer treatments and improving treatment outcomes. However, there are no published data about HIV testing among cancer survivors in the United States. We sought to provide estimates of the proportion of cancer survivors tested for HIV and to characterize factors associated with having had HIV testing. Methods We used data from the 2009 Behavioral Risk Factor Surveillance System to calculate the proportion of cancer survivors under age 65 who had undergone HIV testing, by demographic and health-related factors and by state. Adjusted proportion estimates were calculated by multivariable logistic regression. Results Only 41% of cancer survivors in the United States under the age of 65 reported ever having had an HIV test. The highest proportion of survivors tested was among patients aged 25 to 34 years (72.2%), non-Hispanic blacks (59.5%), and cervical cancer survivors (51.2%). The proportion tested was highest in the District of Columbia (68.3%) and lowest in Nebraska (24.1%). Multivariable analysis showed that factors associated with HIV testing included being non-Hispanic black or Hispanic, being younger, having higher education, not being married or living with a partner, not being disabled, and having medical cost concerns. Having an AIDS-related cancer was associated with HIV testing only among females. Conclusion The proportions of HIV testing varied substantially by demographic and health-related factors and by state. Our study points to the need for public health interventions to promote HIV testing among cancer survivors. PMID:25393748

  5. Differentially Private Synthesization of Multi-Dimensional Data using Copula Functions

    PubMed Central

    Li, Haoran; Xiong, Li; Jiang, Xiaoqian

    2014-01-01

    Differential privacy has recently emerged in private statistical data release as one of the strongest privacy guarantees. Most of the existing techniques that generate differentially private histograms or synthetic data only work well for single dimensional or low-dimensional histograms. They become problematic for high dimensional and large domain data due to increased perturbation error and computation complexity. In this paper, we propose DPCopula, a differentially private data synthesization technique using Copula functions for multi-dimensional data. The core of our method is to compute a differentially private copula function from which we can sample synthetic data. Copula functions are used to describe the dependence between multivariate random vectors and allow us to build the multivariate joint distribution using one-dimensional marginal distributions. We present two methods for estimating the parameters of the copula functions with differential privacy: maximum likelihood estimation and Kendall’s τ estimation. We present formal proofs for the privacy guarantee as well as the convergence property of our methods. Extensive experiments using both real datasets and synthetic datasets demonstrate that DPCopula generates highly accurate synthetic multi-dimensional data with significantly better utility than state-of-the-art techniques. PMID:25405241

  6. Modeling stochastic frontier based on vine copulas

    NASA Astrophysics Data System (ADS)

    Constantino, Michel; Candido, Osvaldo; Tabak, Benjamin M.; da Costa, Reginaldo Brito

    2017-11-01

    This article models a production function and analyzes the technical efficiency of listed companies in the United States, Germany and England between 2005 and 2012 based on the vine copula approach. Traditional estimates of the stochastic frontier assume that data is multivariate normally distributed and there is no source of asymmetry. The proposed method based on vine copulas allow us to explore different types of asymmetry and multivariate distribution. Using data on product, capital and labor, we measure the relative efficiency of the vine production function and estimate the coefficient used in the stochastic frontier literature for comparison purposes. This production vine copula predicts the value added by firms with given capital and labor in a probabilistic way. It thereby stands in sharp contrast to the production function, where the output of firms is completely deterministic. The results show that, on average, S&P500 companies are more efficient than companies listed in England and Germany, which presented similar average efficiency coefficients. For comparative purposes, the traditional stochastic frontier was estimated and the results showed discrepancies between the coefficients obtained by the application of the two methods, traditional and frontier-vine, opening new paths of non-linear research.

  7. Fresh Biomass Estimation in Heterogeneous Grassland Using Hyperspectral Measurements and Multivariate Statistical Analysis

    NASA Astrophysics Data System (ADS)

    Darvishzadeh, R.; Skidmore, A. K.; Mirzaie, M.; Atzberger, C.; Schlerf, M.

    2014-12-01

    Accurate estimation of grassland biomass at their peak productivity can provide crucial information regarding the functioning and productivity of the rangelands. Hyperspectral remote sensing has proved to be valuable for estimation of vegetation biophysical parameters such as biomass using different statistical techniques. However, in statistical analysis of hyperspectral data, multicollinearity is a common problem due to large amount of correlated hyper-spectral reflectance measurements. The aim of this study was to examine the prospect of above ground biomass estimation in a heterogeneous Mediterranean rangeland employing multivariate calibration methods. Canopy spectral measurements were made in the field using a GER 3700 spectroradiometer, along with concomitant in situ measurements of above ground biomass for 170 sample plots. Multivariate calibrations including partial least squares regression (PLSR), principal component regression (PCR), and Least-Squared Support Vector Machine (LS-SVM) were used to estimate the above ground biomass. The prediction accuracy of the multivariate calibration methods were assessed using cross validated R2 and RMSE. The best model performance was obtained using LS_SVM and then PLSR both calibrated with first derivative reflectance dataset with R2cv = 0.88 & 0.86 and RMSEcv= 1.15 & 1.07 respectively. The weakest prediction accuracy was appeared when PCR were used (R2cv = 0.31 and RMSEcv= 2.48). The obtained results highlight the importance of multivariate calibration methods for biomass estimation when hyperspectral data are used.

  8. A novel multivariate STeady-state index during general ANesthesia (STAN).

    PubMed

    Castro, Ana; de Almeida, Fernando Gomes; Amorim, Pedro; Nunes, Catarina S

    2017-08-01

    The assessment of the adequacy of general anesthesia for surgery, namely the nociception/anti-nociception balance, has received wide attention from the scientific community. Monitoring systems based on the frontal EEG/EMG, or autonomic state reactions (e.g. heart rate and blood pressure) have been developed aiming to objectively assess this balance. In this study a new multivariate indicator of patients' steady-state during anesthesia (STAN) is proposed, based on wavelet analysis of signals linked to noxious activation. A clinical protocol was designed to analyze precise noxious stimuli (laryngoscopy/intubation, tetanic, and incision), under three different analgesic doses; patients were randomized to receive either remifentanil 2.0, 3.0 or 4.0 ng/ml. ECG, PPG, BP, BIS, EMG and [Formula: see text] were continuously recorded. ECG, PPG and BP were processed to extract beat-to-beat information, and [Formula: see text] curve used to estimate the respiration rate. A combined steady-state index based on wavelet analysis of these variables, was applied and compared between the three study groups and stimuli (Wilcoxon signed ranks, Kruskal-Wallis and Mann-Whitney tests). Following institutional approval and signing the informed consent thirty four patients were enrolled in this study (3 excluded due to signal loss during data collection). The BIS index of the EEG, frontal EMG, heart rate, BP, and PPG wave amplitude changed in response to different noxious stimuli. Laryngoscopy/intubation was the stimulus with the more pronounced response [Formula: see text]. These variables were used in the construction of the combined index STAN; STAN responded adequately to noxious stimuli, with a more pronounced response to laryngoscopy/intubation (18.5-43.1 %, [Formula: see text]), and the attenuation provided by the analgesic, detecting steady-state periods in the different physiological signals analyzed (approximately 50 % of the total study time). A new multivariate approach for the assessment of the patient steady-state during general anesthesia was developed. The proposed wavelet based multivariate index responds adequately to different noxious stimuli, and attenuation provided by the analgesic in a dose-dependent manner for each stimulus analyzed in this study.

  9. Developpement d'un estimateur d'etat energetique d'un chauffe-eau pour un controle par champ moyen

    NASA Astrophysics Data System (ADS)

    Solis, Jerome

    Domestic electric water heaters (EWH) provide an interesting energy storage potential that can be exploited in the context of smart grids. For example, a variation of 1 °C of the water temperature of an EWH of 270 liters corresponds to 0.313 kWh of energy. For utilities like Hydro-Quebec, which has about 2 million EWH connected to its grid, this represents a strong storage potential. This storage source already present on the network could allow, using an adequate control strategy, to smooth the electricity consumption curve by moving some of the load occurring during peak periods to trough periods. This thesis position itself in the context of developing a simulation platform to implement a mean field control strategy of water heaters on the grid. This innovative approach enables control of all water heaters to meet a global objective while respecting local constraints of each. To be functional, the strategy requires several components to be integrated into the simulation platform. Specifically, this thesis focuses on the development of a numerical model and the estimation of the state in temperature and energy of an EWH. The proposed model for predicting the behavior of a EWH is a multi-nodal model. This approach allows discretizing the tank of an EWH into smaller volumes. The energy conservation equation is solved for each of these sub-volumes and it is possible to predict changes in the output mean temperature with an average error of 2.4 °C. The results from the multi-nodal model compare favorably with numerical reference models, found in the TRNSYS thermal simulation software, as well as experimental measurements of temperature of an electric water heater. In this model, the temperature is assumed to be uniform for each sub-volume and the standby losses are considered constants. Although the model is linear, a correction routine allows redistributing adequately heat within the tank while respecting the thermal dynamics of the heater, including the presence of temperature stratification at the heating elements, hence breaking the linearity. The multi-nodal model is used primarily to simulate the load of a large water heater population on the power grid. It is also useful to the scheduler, which generates the desired temperature for the whole population, as well as by mean field controller, which generates the optimal control for this same group. The estimation process takes place in two distinct steps: estimating the sequence of hot water draw (discrete state of the system) and the reconstruction of the temperature inside the tank (continuous system state). To perform the estimation of the drawing sequence, a maximum likelihood test is used with a set of measurements of the average temperature of the system. Under the assumption that the statistics of random events, modeled by a two state Markov chain (active and inactive) are known a priori, a set of possible drawing scenarios is generated. By comparing the prediction for each scenario with the actions of the system, the one minimizing the error between the predicted and measured temperature is retained. This method predicts the discrete state of the system correctly 93.7 % of the time. Then, based on the multi-nodal model, a routine for estimating the continuous state of an EWH is developed. This model should be simplified by removing the temperature correction routine in order to keep it linear. This is required in order to build more easily a state observer, a tool that allows the full reconstruction of the temperature state from measurements. This simplification results in limiting the use of the observer to periods of drawing, because without drawing, there is no heat exchange in the system, resulting in aberrations in the simplified model. Those aberrations are corrected under the assumption that the power of the heating elements is distributed in all layers of the system in proportion to their respective volume. Each node is hence heated by an intermediate element. To realize the reconstruction of the continuous state of temperature, two different approaches are considered: the multivariate observer and the Kalman filter. With at most 3 temperature measurements, either in the highest layer of the tank, near the top heating element and the latter at the bottom of the tank of the water heater, a global average error of -440 Wh on all estimated components of the water heater state is obtained by using the multivariate observer, which translates into a global mean percentage error (MPE) of -2.9 % on the energy stocking capacity. Taking advantage of an independent use of the discrete state of the system, the Kalman filter was able to reconstruct the continuous state with a global MPE of -2.0 %, or -270 Wh. For these reasons, the Kalman filter is preferred to the multivariate observer to complete the process of estimating the complete state of the water heater. Moreover, the estimation made by the Kalman filter or the multivariate observer may deviate around +/-10-15 % compared to the total energy stored in the water heater. This thesis presents innovative tools allowing the use of a mean field control strategy in a context of smart grids. This brand new approach presents many challenges and significant improvements can be envisioned, especially for estimating the state of the water heater. Mainly, a more accurate description of the process of extracting hot water, integrating several types of different drawing intensity, allow a more accurate and robust estimation of the hot water drawing sequence.

  10. Effects of Covariance Heterogeneity on Three Procedures for Analyzing Multivariate Repeated Measures Designs.

    ERIC Educational Resources Information Center

    Vallejo, Guillermo; Fidalgo, Angel; Fernandez, Paula

    2001-01-01

    Estimated empirical Type I error rate and power rate for three procedures for analyzing multivariate repeated measures designs: (1) the doubly multivariate model; (2) the Welch-James multivariate solution (H. Keselman, M. Carriere, a nd L. Lix, 1993); and (3) the multivariate version of the modified Brown-Forsythe procedure (M. Brown and A.…

  11. SMURC: High-Dimension Small-Sample Multivariate Regression With Covariance Estimation.

    PubMed

    Bayar, Belhassen; Bouaynaya, Nidhal; Shterenberg, Roman

    2017-03-01

    We consider a high-dimension low sample-size multivariate regression problem that accounts for correlation of the response variables. The system is underdetermined as there are more parameters than samples. We show that the maximum likelihood approach with covariance estimation is senseless because the likelihood diverges. We subsequently propose a normalization of the likelihood function that guarantees convergence. We call this method small-sample multivariate regression with covariance (SMURC) estimation. We derive an optimization problem and its convex approximation to compute SMURC. Simulation results show that the proposed algorithm outperforms the regularized likelihood estimator with known covariance matrix and the sparse conditional Gaussian graphical model. We also apply SMURC to the inference of the wing-muscle gene network of the Drosophila melanogaster (fruit fly).

  12. Enhancing e-waste estimates: Improving data quality by multivariate Input–Output Analysis

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Wang, Feng, E-mail: fwang@unu.edu; Design for Sustainability Lab, Faculty of Industrial Design Engineering, Delft University of Technology, Landbergstraat 15, 2628CE Delft; Huisman, Jaco

    2013-11-15

    Highlights: • A multivariate Input–Output Analysis method for e-waste estimates is proposed. • Applying multivariate analysis to consolidate data can enhance e-waste estimates. • We examine the influence of model selection and data quality on e-waste estimates. • Datasets of all e-waste related variables in a Dutch case study have been provided. • Accurate modeling of time-variant lifespan distributions is critical for estimate. - Abstract: Waste electrical and electronic equipment (or e-waste) is one of the fastest growing waste streams, which encompasses a wide and increasing spectrum of products. Accurate estimation of e-waste generation is difficult, mainly due to lackmore » of high quality data referred to market and socio-economic dynamics. This paper addresses how to enhance e-waste estimates by providing techniques to increase data quality. An advanced, flexible and multivariate Input–Output Analysis (IOA) method is proposed. It links all three pillars in IOA (product sales, stock and lifespan profiles) to construct mathematical relationships between various data points. By applying this method, the data consolidation steps can generate more accurate time-series datasets from available data pool. This can consequently increase the reliability of e-waste estimates compared to the approach without data processing. A case study in the Netherlands is used to apply the advanced IOA model. As a result, for the first time ever, complete datasets of all three variables for estimating all types of e-waste have been obtained. The result of this study also demonstrates significant disparity between various estimation models, arising from the use of data under different conditions. It shows the importance of applying multivariate approach and multiple sources to improve data quality for modelling, specifically using appropriate time-varying lifespan parameters. Following the case study, a roadmap with a procedural guideline is provided to enhance e-waste estimation studies.« less

  13. Local polynomial estimation of heteroscedasticity in a multivariate linear regression model and its applications in economics.

    PubMed

    Su, Liyun; Zhao, Yanyong; Yan, Tianshun; Li, Fenglan

    2012-01-01

    Multivariate local polynomial fitting is applied to the multivariate linear heteroscedastic regression model. Firstly, the local polynomial fitting is applied to estimate heteroscedastic function, then the coefficients of regression model are obtained by using generalized least squares method. One noteworthy feature of our approach is that we avoid the testing for heteroscedasticity by improving the traditional two-stage method. Due to non-parametric technique of local polynomial estimation, it is unnecessary to know the form of heteroscedastic function. Therefore, we can improve the estimation precision, when the heteroscedastic function is unknown. Furthermore, we verify that the regression coefficients is asymptotic normal based on numerical simulations and normal Q-Q plots of residuals. Finally, the simulation results and the local polynomial estimation of real data indicate that our approach is surely effective in finite-sample situations.

  14. MULTIVARIATE RECEPTOR MODELS AND MODEL UNCERTAINTY. (R825173)

    EPA Science Inventory

    Abstract

    Estimation of the number of major pollution sources, the source composition profiles, and the source contributions are the main interests in multivariate receptor modeling. Due to lack of identifiability of the receptor model, however, the estimation cannot be...

  15. Using Copula Distributions to Support More Accurate Imaging-Based Diagnostic Classifiers for Neuropsychiatric Disorders

    PubMed Central

    Bansal, Ravi; Hao, Xuejun; Liu, Jun; Peterson, Bradley S.

    2014-01-01

    Many investigators have tried to apply machine learning techniques to magnetic resonance images (MRIs) of the brain in order to diagnose neuropsychiatric disorders. Usually the number of brain imaging measures (such as measures of cortical thickness and measures of local surface morphology) derived from the MRIs (i.e., their dimensionality) has been large (e.g. >10) relative to the number of participants who provide the MRI data (<100). Sparse data in a high dimensional space increases the variability of the classification rules that machine learning algorithms generate, thereby limiting the validity, reproducibility, and generalizability of those classifiers. The accuracy and stability of the classifiers can improve significantly if the multivariate distributions of the imaging measures can be estimated accurately. To accurately estimate the multivariate distributions using sparse data, we propose to estimate first the univariate distributions of imaging data and then combine them using a Copula to generate more accurate estimates of their multivariate distributions. We then sample the estimated Copula distributions to generate dense sets of imaging measures and use those measures to train classifiers. We hypothesize that the dense sets of brain imaging measures will generate classifiers that are stable to variations in brain imaging measures, thereby improving the reproducibility, validity, and generalizability of diagnostic classification algorithms in imaging datasets from clinical populations. In our experiments, we used both computer-generated and real-world brain imaging datasets to assess the accuracy of multivariate Copula distributions in estimating the corresponding multivariate distributions of real-world imaging data. Our experiments showed that diagnostic classifiers generated using imaging measures sampled from the Copula were significantly more accurate and more reproducible than were the classifiers generated using either the real-world imaging measures or their multivariate Gaussian distributions. Thus, our findings demonstrate that estimated multivariate Copula distributions can generate dense sets of brain imaging measures that can in turn be used to train classifiers, and those classifiers are significantly more accurate and more reproducible than are those generated using real-world imaging measures alone. PMID:25093634

  16. Probabilistic, meso-scale flood loss modelling

    NASA Astrophysics Data System (ADS)

    Kreibich, Heidi; Botto, Anna; Schröter, Kai; Merz, Bruno

    2016-04-01

    Flood risk analyses are an important basis for decisions on flood risk management and adaptation. However, such analyses are associated with significant uncertainty, even more if changes in risk due to global change are expected. Although uncertainty analysis and probabilistic approaches have received increased attention during the last years, they are still not standard practice for flood risk assessments and even more for flood loss modelling. State of the art in flood loss modelling is still the use of simple, deterministic approaches like stage-damage functions. Novel probabilistic, multi-variate flood loss models have been developed and validated on the micro-scale using a data-mining approach, namely bagging decision trees (Merz et al. 2013). In this presentation we demonstrate and evaluate the upscaling of the approach to the meso-scale, namely on the basis of land-use units. The model is applied in 19 municipalities which were affected during the 2002 flood by the River Mulde in Saxony, Germany (Botto et al. submitted). The application of bagging decision tree based loss models provide a probability distribution of estimated loss per municipality. Validation is undertaken on the one hand via a comparison with eight deterministic loss models including stage-damage functions as well as multi-variate models. On the other hand the results are compared with official loss data provided by the Saxon Relief Bank (SAB). The results show, that uncertainties of loss estimation remain high. Thus, the significant advantage of this probabilistic flood loss estimation approach is that it inherently provides quantitative information about the uncertainty of the prediction. References: Merz, B.; Kreibich, H.; Lall, U. (2013): Multi-variate flood damage assessment: a tree-based data-mining approach. NHESS, 13(1), 53-64. Botto A, Kreibich H, Merz B, Schröter K (submitted) Probabilistic, multi-variable flood loss modelling on the meso-scale with BT-FLEMO. Risk Analysis.

  17. Importance of Preserving Cross-correlation in developing Statistically Downscaled Climate Forcings and in estimating Land-surface Fluxes and States

    NASA Astrophysics Data System (ADS)

    Das Bhowmik, R.; Arumugam, S.

    2015-12-01

    Multivariate downscaling techniques exhibited superiority over univariate regression schemes in terms of preserving cross-correlations between multiple variables- precipitation and temperature - from GCMs. This study focuses on two aspects: (a) develop an analytical solutions on estimating biases in cross-correlations from univariate downscaling approaches and (b) quantify the uncertainty in land-surface states and fluxes due to biases in cross-correlations in downscaled climate forcings. Both these aspects are evaluated using climate forcings available from both historical climate simulations and CMIP5 hindcasts over the entire US. The analytical solution basically relates the univariate regression parameters, co-efficient of determination of regression and the co-variance ratio between GCM and downscaled values. The analytical solutions are compared with the downscaled univariate forcings by choosing the desired p-value (Type-1 error) in preserving the observed cross-correlation. . For quantifying the impacts of biases on cross-correlation on estimating streamflow and groundwater, we corrupt the downscaled climate forcings with different cross-correlation structure.

  18. A framework for multivariate data-based at-site flood frequency analysis: Essentiality of the conjugal application of parametric and nonparametric approaches

    NASA Astrophysics Data System (ADS)

    Vittal, H.; Singh, Jitendra; Kumar, Pankaj; Karmakar, Subhankar

    2015-06-01

    In watershed management, flood frequency analysis (FFA) is performed to quantify the risk of flooding at different spatial locations and also to provide guidelines for determining the design periods of flood control structures. The traditional FFA was extensively performed by considering univariate scenario for both at-site and regional estimation of return periods. However, due to inherent mutual dependence of the flood variables or characteristics [i.e., peak flow (P), flood volume (V) and flood duration (D), which are random in nature], analysis has been further extended to multivariate scenario, with some restrictive assumptions. To overcome the assumption of same family of marginal density function for all flood variables, the concept of copula has been introduced. Although, the advancement from univariate to multivariate analyses drew formidable attention to the FFA research community, the basic limitation was that the analyses were performed with the implementation of only parametric family of distributions. The aim of the current study is to emphasize the importance of nonparametric approaches in the field of multivariate FFA; however, the nonparametric distribution may not always be a good-fit and capable of replacing well-implemented multivariate parametric and multivariate copula-based applications. Nevertheless, the potential of obtaining best-fit using nonparametric distributions might be improved because such distributions reproduce the sample's characteristics, resulting in more accurate estimations of the multivariate return period. Hence, the current study shows the importance of conjugating multivariate nonparametric approach with multivariate parametric and copula-based approaches, thereby results in a comprehensive framework for complete at-site FFA. Although the proposed framework is designed for at-site FFA, this approach can also be applied to regional FFA because regional estimations ideally include at-site estimations. The framework is based on the following steps: (i) comprehensive trend analysis to assess nonstationarity in the observed data; (ii) selection of the best-fit univariate marginal distribution with a comprehensive set of parametric and nonparametric distributions for the flood variables; (iii) multivariate frequency analyses with parametric, copula-based and nonparametric approaches; and (iv) estimation of joint and various conditional return periods. The proposed framework for frequency analysis is demonstrated using 110 years of observed data from Allegheny River at Salamanca, New York, USA. The results show that for both univariate and multivariate cases, the nonparametric Gaussian kernel provides the best estimate. Further, we perform FFA for twenty major rivers over continental USA, which shows for seven rivers, all the flood variables followed nonparametric Gaussian kernel; whereas for other rivers, parametric distributions provide the best-fit either for one or two flood variables. Thus the summary of results shows that the nonparametric method cannot substitute the parametric and copula-based approaches, but should be considered during any at-site FFA to provide the broadest choices for best estimation of the flood return periods.

  19. Hospitalisation charges for fibromyalgia in the United States, 1999-2007.

    PubMed

    Haviland, Mark G; Banta, Jim E; Przekop, Peter

    2012-01-01

    To estimate fibromyalgia (FM) hospitalisation costs (i.e. charges) for patients in the United States from 1999 to 2007; to determine factors associated with variation in costs of FM and non-FM hospitalisations; and to investigate hospital procedures associated with FM hospitalisations. Data were from the Nationwide Inpatient Sample, a large database of hospitalisations in the U.S. Over the study period, an estimated 63,772 patients - two-thirds women, one-third men - had been hospitalised for FM (FM criterion was the International Classification of Diseases, 9th Revision, Clinical Modification diagnosis code 729.1, Myositis and Myalgia, unspecified). Demographics and hospital characteristics were described with frequencies and mean inflation-adjusted charges. Two multivariable linear regressions (one for FM and a second for non-FM patients), with Consumer Price Index (CPI)-adjusted charges (hospital and related services category) in thousands of dollars as the dependent variable, were performed, excluding cases with masked or missing data. Procedures were categorised with a standard classification scheme. Survey-adjusted total CPI-adjusted charges over the study period were estimated to be approximately $1.0 billion. Hospital procedures and Charlson-Deyo Index (co-morbidity severity) scores were the strongest predictors of charges in bivariate and multivariate analyses (for both FM and non-FM patients). The majority of procedures for FM patients were related to musculoskeletal, gastrointestinal, or cardiovascular systems. Most FM patients, however, did not have any procedure or a life-threatening co-morbid illness. Over the nine-year period, hospital charges for FM were substantial. Studies of how to reduce or avoid these costs in the treatment of FM need to be undertaken.

  20. An efficient parallel sampling technique for Multivariate Poisson-Lognormal model: Analysis with two crash count datasets

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Zhan, Xianyuan; Aziz, H. M. Abdul; Ukkusuri, Satish V.

    Our study investigates the Multivariate Poisson-lognormal (MVPLN) model that jointly models crash frequency and severity accounting for correlations. The ordinary univariate count models analyze crashes of different severity level separately ignoring the correlations among severity levels. The MVPLN model is capable to incorporate the general correlation structure and takes account of the over dispersion in the data that leads to a superior data fitting. But, the traditional estimation approach for MVPLN model is computationally expensive, which often limits the use of MVPLN model in practice. In this work, a parallel sampling scheme is introduced to improve the original Markov Chainmore » Monte Carlo (MCMC) estimation approach of the MVPLN model, which significantly reduces the model estimation time. Two MVPLN models are developed using the pedestrian vehicle crash data collected in New York City from 2002 to 2006, and the highway-injury data from Washington State (5-year data from 1990 to 1994) The Deviance Information Criteria (DIC) is used to evaluate the model fitting. The estimation results show that the MVPLN models provide a superior fit over univariate Poisson-lognormal (PLN), univariate Poisson, and Negative Binomial models. Moreover, the correlations among the latent effects of different severity levels are found significant in both datasets that justifies the importance of jointly modeling crash frequency and severity accounting for correlations.« less

  1. An efficient parallel sampling technique for Multivariate Poisson-Lognormal model: Analysis with two crash count datasets

    DOE PAGES

    Zhan, Xianyuan; Aziz, H. M. Abdul; Ukkusuri, Satish V.

    2015-11-19

    Our study investigates the Multivariate Poisson-lognormal (MVPLN) model that jointly models crash frequency and severity accounting for correlations. The ordinary univariate count models analyze crashes of different severity level separately ignoring the correlations among severity levels. The MVPLN model is capable to incorporate the general correlation structure and takes account of the over dispersion in the data that leads to a superior data fitting. But, the traditional estimation approach for MVPLN model is computationally expensive, which often limits the use of MVPLN model in practice. In this work, a parallel sampling scheme is introduced to improve the original Markov Chainmore » Monte Carlo (MCMC) estimation approach of the MVPLN model, which significantly reduces the model estimation time. Two MVPLN models are developed using the pedestrian vehicle crash data collected in New York City from 2002 to 2006, and the highway-injury data from Washington State (5-year data from 1990 to 1994) The Deviance Information Criteria (DIC) is used to evaluate the model fitting. The estimation results show that the MVPLN models provide a superior fit over univariate Poisson-lognormal (PLN), univariate Poisson, and Negative Binomial models. Moreover, the correlations among the latent effects of different severity levels are found significant in both datasets that justifies the importance of jointly modeling crash frequency and severity accounting for correlations.« less

  2. Improved estimation of PM2.5 using Lagrangian satellite-measured aerosol optical depth

    NASA Astrophysics Data System (ADS)

    Olivas Saunders, Rolando

    Suspended particulate matter (aerosols) with aerodynamic diameters less than 2.5 mum (PM2.5) has negative effects on human health, plays an important role in climate change and also causes the corrosion of structures by acid deposition. Accurate estimates of PM2.5 concentrations are thus relevant in air quality, epidemiology, cloud microphysics and climate forcing studies. Aerosol optical depth (AOD) retrieved by the Moderate Resolution Imaging Spectroradiometer (MODIS) satellite instrument has been used as an empirical predictor to estimate ground-level concentrations of PM2.5 . These estimates usually have large uncertainties and errors. The main objective of this work is to assess the value of using upwind (Lagrangian) MODIS-AOD as predictors in empirical models of PM2.5. The upwind locations of the Lagrangian AOD were estimated using modeled backward air trajectories. Since the specification of an arrival elevation is somewhat arbitrary, trajectories were calculated to arrive at four different elevations at ten measurement sites within the continental United States. A systematic examination revealed trajectory model calculations to be sensitive to starting elevation. With a 500 m difference in starting elevation, the 48-hr mean horizontal separation of trajectory endpoints was 326 km. When the difference in starting elevation was doubled and tripled to 1000 m and 1500m, the mean horizontal separation of trajectory endpoints approximately doubled and tripled to 627 km and 886 km, respectively. A seasonal dependence of this sensitivity was also found: the smallest mean horizontal separation of trajectory endpoints was exhibited during the summer and the largest separations during the winter. A daily average AOD product was generated and coupled to the trajectory model in order to determine AOD values upwind of the measurement sites during the period 2003-2007. Empirical models that included in situ AOD and upwind AOD as predictors of PM2.5 were generated by multivariate linear regressions using the least squares method. The multivariate models showed improved performance over the single variable regression (PM2.5 and in situ AOD) models. The statistical significance of the improvement of the multivariate models over the single variable regression models was tested using the extra sum of squares principle. In many cases, even when the R-squared was high for the multivariate models, the improvement over the single models was not statistically significant. The R-squared of these multivariate models varied with respect to seasons, with the best performance occurring during the summer months. A set of seasonal categorical variables was included in the regressions to exploit this variability. The multivariate regression models that included these categorical seasonal variables performed better than the models that didn't account for seasonal variability. Furthermore, 71% of these regressions exhibited improvement over the single variable models that was statistically significant at a 95% confidence level.

  3. A matrix-based method of moments for fitting the multivariate random effects model for meta-analysis and meta-regression

    PubMed Central

    Jackson, Dan; White, Ian R; Riley, Richard D

    2013-01-01

    Multivariate meta-analysis is becoming more commonly used. Methods for fitting the multivariate random effects model include maximum likelihood, restricted maximum likelihood, Bayesian estimation and multivariate generalisations of the standard univariate method of moments. Here, we provide a new multivariate method of moments for estimating the between-study covariance matrix with the properties that (1) it allows for either complete or incomplete outcomes and (2) it allows for covariates through meta-regression. Further, for complete data, it is invariant to linear transformations. Our method reduces to the usual univariate method of moments, proposed by DerSimonian and Laird, in a single dimension. We illustrate our method and compare it with some of the alternatives using a simulation study and a real example. PMID:23401213

  4. Inter-Hospital Transfer is Associated with Increased Mortality and Costs in Severe Sepsis and Septic Shock: An Instrumental Variables Approach

    PubMed Central

    Mohr, Nicholas M.; Harland, Karisa K.; Shane, Dan M.; Ahmed, Azeemuddin; Fuller, Brian M.; Torner, James C.

    2016-01-01

    Purpose The objective of this study was to evaluate the impact of regionalization on sepsis survival, to describe the role of inter-hospital transfer in rural sepsis care, and to measure the cost of inter-hospital transfer in a predominantly rural state. Materials and Methods Observational case-control study using statewide administrative claims data from 2005-2014 in a predominantly rural Midwestern state. Mortality and marginal costs were estimated with multivariable generalized estimating equations (GEE) models and with instrumental variables models. Results A total of 18,246 patients were included, of which 59% were transferred between hospitals. Transferred patients had higher mortality and longer hospital length-of-stay than non-transferred patients. Using a multivariable GEE model to adjust for potentially confounding factors, inter-hospital transfer was associated with increased mortality (aOR 1.7, 95%CI 1.5 – 1.9). Using an instrumental variables model, transfer was associated with a 9.2% increased risk of death. Transfer was associated with additional costs of $6,897 (95%CI $5,769-8,024). Even when limiting to only those patients who received care in the largest hospitals, transfer was still associated with $5,167 (95%CI $3,696-6,638) in additional cost. Conclusions The majority of rural sepsis patients are transferred, and these transferred patients have higher mortality and significantly increased cost of care. PMID:27546770

  5. Multivariate localization methods for ensemble Kalman filtering

    NASA Astrophysics Data System (ADS)

    Roh, S.; Jun, M.; Szunyogh, I.; Genton, M. G.

    2015-05-01

    In ensemble Kalman filtering (EnKF), the small number of ensemble members that is feasible to use in a practical data assimilation application leads to sampling variability of the estimates of the background error covariances. The standard approach to reducing the effects of this sampling variability, which has also been found to be highly efficient in improving the performance of EnKF, is the localization of the estimates of the covariances. One family of localization techniques is based on taking the Schur (entry-wise) product of the ensemble-based sample covariance matrix and a correlation matrix whose entries are obtained by the discretization of a distance-dependent correlation function. While the proper definition of the localization function for a single state variable has been extensively investigated, a rigorous definition of the localization function for multiple state variables has been seldom considered. This paper introduces two strategies for the construction of localization functions for multiple state variables. The proposed localization functions are tested by assimilating simulated observations experiments into the bivariate Lorenz 95 model with their help.

  6. Trends in Fatalities From Distracted Driving in the United States, 1999 to 2008

    PubMed Central

    Stimpson, Jim P.

    2010-01-01

    Objectives. We examined trends in distracted driving fatalities and their relation to cell phone use and texting volume. Methods. The Fatality Analysis Reporting System (FARS) records data on all road fatalities that occurred on public roads in the United States from 1999 to 2008. We studied trends in distracted driving fatalities, driver and crash characteristics, and trends in cell phone use and texting volume. We used multivariate regression analysis to estimate the relation between state-level distracted driving fatalities and texting volumes. Results. After declining from 1999 to 2005, fatalities from distracted driving increased 28% after 2005, rising from 4572 fatalities to 5870 in 2008. Crashes increasingly involved male drivers driving alone in collisions with roadside obstructions in urban areas. By use of multivariate analyses, we predicted that increasing texting volumes resulted in more than 16 000 additional road fatalities from 2001 to 2007. Conclusions. Distracted driving is a growing public safety hazard. Specifically, the dramatic rise in texting volume since 2005 appeared to be contributing to an alarming rise in distracted driving fatalities. Legislation enacting texting bans should be paired with effective enforcement to deter drivers from using cell phones while driving. PMID:20864709

  7. Trends in fatalities from distracted driving in the United States, 1999 to 2008.

    PubMed

    Wilson, Fernando A; Stimpson, Jim P

    2010-11-01

    We examined trends in distracted driving fatalities and their relation to cell phone use and texting volume. The Fatality Analysis Reporting System (FARS) records data on all road fatalities that occurred on public roads in the United States from 1999 to 2008. We studied trends in distracted driving fatalities, driver and crash characteristics, and trends in cell phone use and texting volume. We used multivariate regression analysis to estimate the relation between state-level distracted driving fatalities and texting volumes. After declining from 1999 to 2005, fatalities from distracted driving increased 28% after 2005, rising from 4572 fatalities to 5870 in 2008. Crashes increasingly involved male drivers driving alone in collisions with roadside obstructions in urban areas. By use of multivariate analyses, we predicted that increasing texting volumes resulted in more than 16,000 additional road fatalities from 2001 to 2007. Distracted driving is a growing public safety hazard. Specifically, the dramatic rise in texting volume since 2005 appeared to be contributing to an alarming rise in distracted driving fatalities. Legislation enacting texting bans should be paired with effective enforcement to deter drivers from using cell phones while driving.

  8. Parameter estimation of multivariate multiple regression model using bayesian with non-informative Jeffreys’ prior distribution

    NASA Astrophysics Data System (ADS)

    Saputro, D. R. S.; Amalia, F.; Widyaningsih, P.; Affan, R. C.

    2018-05-01

    Bayesian method is a method that can be used to estimate the parameters of multivariate multiple regression model. Bayesian method has two distributions, there are prior and posterior distributions. Posterior distribution is influenced by the selection of prior distribution. Jeffreys’ prior distribution is a kind of Non-informative prior distribution. This prior is used when the information about parameter not available. Non-informative Jeffreys’ prior distribution is combined with the sample information resulting the posterior distribution. Posterior distribution is used to estimate the parameter. The purposes of this research is to estimate the parameters of multivariate regression model using Bayesian method with Non-informative Jeffreys’ prior distribution. Based on the results and discussion, parameter estimation of β and Σ which were obtained from expected value of random variable of marginal posterior distribution function. The marginal posterior distributions for β and Σ are multivariate normal and inverse Wishart. However, in calculation of the expected value involving integral of a function which difficult to determine the value. Therefore, approach is needed by generating of random samples according to the posterior distribution characteristics of each parameter using Markov chain Monte Carlo (MCMC) Gibbs sampling algorithm.

  9. Support vector machines for nuclear reactor state estimation

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Zavaljevski, N.; Gross, K. C.

    2000-02-14

    Validation of nuclear power reactor signals is often performed by comparing signal prototypes with the actual reactor signals. The signal prototypes are often computed based on empirical data. The implementation of an estimation algorithm which can make predictions on limited data is an important issue. A new machine learning algorithm called support vector machines (SVMS) recently developed by Vladimir Vapnik and his coworkers enables a high level of generalization with finite high-dimensional data. The improved generalization in comparison with standard methods like neural networks is due mainly to the following characteristics of the method. The input data space is transformedmore » into a high-dimensional feature space using a kernel function, and the learning problem is formulated as a convex quadratic programming problem with a unique solution. In this paper the authors have applied the SVM method for data-based state estimation in nuclear power reactors. In particular, they implemented and tested kernels developed at Argonne National Laboratory for the Multivariate State Estimation Technique (MSET), a nonlinear, nonparametric estimation technique with a wide range of applications in nuclear reactors. The methodology has been applied to three data sets from experimental and commercial nuclear power reactor applications. The results are promising. The combination of MSET kernels with the SVM method has better noise reduction and generalization properties than the standard MSET algorithm.« less

  10. A modified NARMAX model-based self-tuner with fault tolerance for unknown nonlinear stochastic hybrid systems with an input-output direct feed-through term.

    PubMed

    Tsai, Jason S-H; Hsu, Wen-Teng; Lin, Long-Guei; Guo, Shu-Mei; Tann, Joseph W

    2014-01-01

    A modified nonlinear autoregressive moving average with exogenous inputs (NARMAX) model-based state-space self-tuner with fault tolerance is proposed in this paper for the unknown nonlinear stochastic hybrid system with a direct transmission matrix from input to output. Through the off-line observer/Kalman filter identification method, one has a good initial guess of modified NARMAX model to reduce the on-line system identification process time. Then, based on the modified NARMAX-based system identification, a corresponding adaptive digital control scheme is presented for the unknown continuous-time nonlinear system, with an input-output direct transmission term, which also has measurement and system noises and inaccessible system states. Besides, an effective state space self-turner with fault tolerance scheme is presented for the unknown multivariable stochastic system. A quantitative criterion is suggested by comparing the innovation process error estimated by the Kalman filter estimation algorithm, so that a weighting matrix resetting technique by adjusting and resetting the covariance matrices of parameter estimate obtained by the Kalman filter estimation algorithm is utilized to achieve the parameter estimation for faulty system recovery. Consequently, the proposed method can effectively cope with partially abrupt and/or gradual system faults and input failures by the fault detection. Copyright © 2013 ISA. Published by Elsevier Ltd. All rights reserved.

  11. H1N1pdm in the Americas

    PubMed Central

    Lessler, Justin; Santos, Thais dos; Aguilera, Ximena; Brookmeyer, Ron; Cummings, Derek AT

    2010-01-01

    In late April 2009 the emergence of 2009 pandemic influenza A (H1N1pdm) virus was detected in humans. From its detection through July 18th, 2009, confirmed cases of H1N1pdm in the Americas were periodically reported to the Pan-American Health Organization (PAHO) by member states. Because the Americas span much of the world’s latitudes, this data provides an excellent opportunity to examine variation in H1N1pdm transmission by season. Using reports from PAHO member states from April 26th, 2009 through July 18th, 2009, we characterize the early spread of the H1N1 pandemic in the Americas. For a geographically representative sample of member states we estimate the reproductive number (R) of H1N1pdm over the reporting period. The association between these estimates and latitude, temperature, humidity and population age structure was estimated. Estimates of the peak reproductive number of H1N1pdm ranged from 1.3 (for Panama, Colombia) to 2.1 (for Chile). We found that reproductive number estimates were most associated with latitude in both univariate and multivariate analyses. To the extent that latitude is a proxy for seasonal changes in climate and behavior, this association suggests a strong seasonal component to H1N1pdm transmission. However, the reasons for this seasonality remain unclear. PMID:20847900

  12. Estimating the decomposition of predictive information in multivariate systems

    NASA Astrophysics Data System (ADS)

    Faes, Luca; Kugiumtzis, Dimitris; Nollo, Giandomenico; Jurysta, Fabrice; Marinazzo, Daniele

    2015-03-01

    In the study of complex systems from observed multivariate time series, insight into the evolution of one system may be under investigation, which can be explained by the information storage of the system and the information transfer from other interacting systems. We present a framework for the model-free estimation of information storage and information transfer computed as the terms composing the predictive information about the target of a multivariate dynamical process. The approach tackles the curse of dimensionality employing a nonuniform embedding scheme that selects progressively, among the past components of the multivariate process, only those that contribute most, in terms of conditional mutual information, to the present target process. Moreover, it computes all information-theoretic quantities using a nearest-neighbor technique designed to compensate the bias due to the different dimensionality of individual entropy terms. The resulting estimators of prediction entropy, storage entropy, transfer entropy, and partial transfer entropy are tested on simulations of coupled linear stochastic and nonlinear deterministic dynamic processes, demonstrating the superiority of the proposed approach over the traditional estimators based on uniform embedding. The framework is then applied to multivariate physiologic time series, resulting in physiologically well-interpretable information decompositions of cardiovascular and cardiorespiratory interactions during head-up tilt and of joint brain-heart dynamics during sleep.

  13. In Quest of the Alanine R3 Radical: Multivariate EPR Spectral Analyses of X-Irradiated Alanine in the Solid State.

    PubMed

    Jåstad, Eirik O; Torheim, Turid; Villeneuve, Kathleen M; Kvaal, Knut; Hole, Eli O; Sagstuen, Einar; Malinen, Eirik; Futsaether, Cecilia M

    2017-09-28

    The amino acid l-α-alanine is the most commonly used material for solid-state electron paramagnetic resonance (EPR) dosimetry, due to the formation of highly stable radicals upon irradiation, with yields proportional to the radiation dose. Two major alanine radical components designated R1 and R2 have previously been uniquely characterized from EPR and electron-nuclear double resonance (ENDOR) studies as well as from quantum chemical calculations. There is also convincing experimental evidence of a third minor radical component R3, and a tentative radical structure has been suggested, even though no well-defined spectral signature has been observed experimentally. In the present study, temperature dependent EPR spectra of X-ray irradiated polycrystalline alanine were analyzed using five multivariate methods in further attempts to understand the composite nature of the alanine dosimeter EPR spectrum. Principal component analysis (PCA), maximum likelihood common factor analysis (MLCFA), independent component analysis (ICA), self-modeling mixture analysis (SMA), and multivariate curve resolution (MCR) were used to extract pure radical spectra and their fractional contributions from the experimental EPR spectra. All methods yielded spectral estimates resembling the established R1 spectrum. Furthermore, SMA and MCR consistently predicted both the established R2 spectrum and the shape of the R3 spectrum. The predicted shape of the R3 spectrum corresponded well with the proposed tentative spectrum derived from spectrum simulations. Thus, results from two independent multivariate data analysis techniques strongly support the previous evidence that three radicals are indeed present in irradiated alanine samples.

  14. A silver lining to higher prices at the pump? Gasoline prices and teen driving behaviors.

    PubMed

    Sen, Bisakha; Patidar, Nitish; Thomas, Sheikilya

    2014-01-01

    Existing literature shows negative relationships between gasoline price and motor vehicle crashes, particularly among teens. This paper extends that literature by evaluating the relationship between gasoline price and self-reported risky driving among teens. Observational study using multivariate empirical analysis, using pooled data from the Youth Risk Behavior Survey, waves 2003-2009. Secondary data from survey administered in private and public high schools across the United States. Students in grades 9 through 12, surveyed biennially from 2003 to 2009 (n = 58,749). Outcomes are (self-reported) driving without seatbelts, driving after consuming alcohol, and moderate physical activity (like walking or bicycling). State-level retail gasoline prices constitute the main predictor variable. Multivariate logistic models are estimated for the full sample, as well as by gender, race/ethnicity, and age. Individual characteristics, state unemployment, and state driving policies are controlled for. Standard errors are clustered at the state level. Results are reported in form of risk differences. Higher gasoline prices are negatively and significantly associated with driving without seatbelts. Associations are particularly strong for males and minorities. There are fewer statistical associations between gasoline prices and driving after drinking. Higher gasoline prices are positively associated with more moderate physical activity. Higher gasoline prices are associated with less risky driving behaviors among teens, and they may be associated with more active forms of transportation, like walking and bicycling. The study limitations are discussed.

  15. The impact of covariance misspecification in multivariate Gaussian mixtures on estimation and inference: an application to longitudinal modeling.

    PubMed

    Heggeseth, Brianna C; Jewell, Nicholas P

    2013-07-20

    Multivariate Gaussian mixtures are a class of models that provide a flexible parametric approach for the representation of heterogeneous multivariate outcomes. When the outcome is a vector of repeated measurements taken on the same subject, there is often inherent dependence between observations. However, a common covariance assumption is conditional independence-that is, given the mixture component label, the outcomes for subjects are independent. In this paper, we study, through asymptotic bias calculations and simulation, the impact of covariance misspecification in multivariate Gaussian mixtures. Although maximum likelihood estimators of regression and mixing probability parameters are not consistent under misspecification, they have little asymptotic bias when mixture components are well separated or if the assumed correlation is close to the truth even when the covariance is misspecified. We also present a robust standard error estimator and show that it outperforms conventional estimators in simulations and can indicate that the model is misspecified. Body mass index data from a national longitudinal study are used to demonstrate the effects of misspecification on potential inferences made in practice. Copyright © 2013 John Wiley & Sons, Ltd.

  16. Stress state estimation in multilayer support of vertical shafts, considering off-design cross-sectional deformation

    NASA Astrophysics Data System (ADS)

    Antsiferov, SV; Sammal, AS; Deev, PV

    2018-03-01

    To determine the stress-strain state of multilayer support of vertical shafts, including cross-sectional deformation of the tubing rings as against the design, the authors propose an analytical method based on the provision of the mechanics of underground structures and surrounding rock mass as the elements of an integrated deformable system. The method involves a rigorous solution of the corresponding problem of elasticity, obtained using the mathematical apparatus of the theory of analytic functions of a complex variable. The design method is implemented as a software program allowing multivariate applied computation. Examples of the calculation are given.

  17. Predicting trauma patient mortality: ICD [or ICD-10-AM] versus AIS based approaches.

    PubMed

    Willis, Cameron D; Gabbe, Belinda J; Jolley, Damien; Harrison, James E; Cameron, Peter A

    2010-11-01

    The International Classification of Diseases Injury Severity Score (ICISS) has been proposed as an International Classification of Diseases (ICD)-10-based alternative to mortality prediction tools that use Abbreviated Injury Scale (AIS) data, including the Trauma and Injury Severity Score (TRISS). To date, studies have not examined the performance of ICISS using Australian trauma registry data. This study aimed to compare the performance of ICISS with other mortality prediction tools in an Australian trauma registry. This was a retrospective review of prospectively collected data from the Victorian State Trauma Registry. A training dataset was created for model development and a validation dataset for evaluation. The multiplicative ICISS model was compared with a worst injury ICISS approach, Victorian TRISS (V-TRISS, using local coefficients), maximum AIS severity and a multivariable model including ICD-10-AM codes as predictors. Models were investigated for discrimination (C-statistic) and calibration (Hosmer-Lemeshow statistic). The multivariable approach had the highest level of discrimination (C-statistic 0.90) and calibration (H-L 7.65, P= 0.468). Worst injury ICISS, V-TRISS and maximum AIS had similar performance. The multiplicative ICISS produced the lowest level of discrimination (C-statistic 0.80) and poorest calibration (H-L 50.23, P < 0.001). The performance of ICISS may be affected by the data used to develop estimates, the ICD version employed, the methods for deriving estimates and the inclusion of covariates. In this analysis, a multivariable approach using ICD-10-AM codes was the best-performing method. A multivariable ICISS approach may therefore be a useful alternative to AIS-based methods and may have comparable predictive performance to locally derived TRISS models. © 2010 The Authors. ANZ Journal of Surgery © 2010 Royal Australasian College of Surgeons.

  18. Measuring agreement of multivariate discrete survival times using a modified weighted kappa coefficient.

    PubMed

    Guo, Ying; Manatunga, Amita K

    2009-03-01

    Assessing agreement is often of interest in clinical studies to evaluate the similarity of measurements produced by different raters or methods on the same subjects. We present a modified weighted kappa coefficient to measure agreement between bivariate discrete survival times. The proposed kappa coefficient accommodates censoring by redistributing the mass of censored observations within the grid where the unobserved events may potentially happen. A generalized modified weighted kappa is proposed for multivariate discrete survival times. We estimate the modified kappa coefficients nonparametrically through a multivariate survival function estimator. The asymptotic properties of the kappa estimators are established and the performance of the estimators are examined through simulation studies of bivariate and trivariate survival times. We illustrate the application of the modified kappa coefficient in the presence of censored observations with data from a prostate cancer study.

  19. Robust multivariate nonparametric tests for detection of two-sample location shift in clinical trials

    PubMed Central

    Jiang, Xuejun; Guo, Xu; Zhang, Ning; Wang, Bo

    2018-01-01

    This article presents and investigates performance of a series of robust multivariate nonparametric tests for detection of location shift between two multivariate samples in randomized controlled trials. The tests are built upon robust estimators of distribution locations (medians, Hodges-Lehmann estimators, and an extended U statistic) with both unscaled and scaled versions. The nonparametric tests are robust to outliers and do not assume that the two samples are drawn from multivariate normal distributions. Bootstrap and permutation approaches are introduced for determining the p-values of the proposed test statistics. Simulation studies are conducted and numerical results are reported to examine performance of the proposed statistical tests. The numerical results demonstrate that the robust multivariate nonparametric tests constructed from the Hodges-Lehmann estimators are more efficient than those based on medians and the extended U statistic. The permutation approach can provide a more stringent control of Type I error and is generally more powerful than the bootstrap procedure. The proposed robust nonparametric tests are applied to detect multivariate distributional difference between the intervention and control groups in the Thai Healthy Choices study and examine the intervention effect of a four-session motivational interviewing-based intervention developed in the study to reduce risk behaviors among youth living with HIV. PMID:29672555

  20. Development of a Pattern Recognition Methodology for Determining Operationally Optimal Heat Balance Instrumentation Calibration Schedules

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Kurt Beran; John Christenson; Dragos Nica

    2002-12-15

    The goal of the project is to enable plant operators to detect with high sensitivity and reliability the onset of decalibration drifts in all of the instrumentation used as input to the reactor heat balance calculations. To achieve this objective, the collaborators developed and implemented at DBNPS an extension of the Multivariate State Estimation Technique (MSET) pattern recognition methodology pioneered by ANAL. The extension was implemented during the second phase of the project and fully achieved the project goal.

  1. Alterations of functional connectivities from early to middle adulthood: Clues from multivariate pattern analysis of resting-state fMRI data.

    PubMed

    Tian, Lixia; Ma, Lin; Wang, Linlin

    2016-04-01

    In contrast to extended research interests in the maturation and aging of human brain, alterations of brain structure and function from early to middle adulthood have been much less studied. The aim of the present study was to investigate the extent and pattern of the alterations of functional interactions between brain regions from early to middle adulthood. We carried out the study by multivariate pattern analysis of resting-state fMRI (RS-fMRI) data of 63 adults aged 18 to 45 years. Specifically, using elastic net, we performed brain age estimation and age-group classification (young adults aged 18-28 years vs. middle-aged adults aged 35-45 years) based on the resting-state functional connectivities (RSFCs) between 160 regions of interest (ROIs) evaluated on the RS-fMRI data of each subject. The results indicate that the estimated brain ages were significantly correlated with the chronological age (R=0.78, MAE=4.81), and a classification rate of 94.44% and area under the receiver operating characteristic curve (AUC) of 0.99 were obtained when classifying the young and middle-aged adults. These results provide strong evidence that functional interactions between brain regions undergo notable alterations from early to middle adulthood. By analyzing the RSFCs that contribute to brain age estimation/age-group classification, we found that a majority of the RSFCs were inter-network, and we speculate that inter-network RSFCs might mature late but age early as compared to intra-network ones. In addition, the strengthening/weakening of the RSFCs associated with the left/right hemispheric ROIs, the weakening of cortico-cerebellar RSFCs and the strengthening of the RSFCs between the default mode network and other networks contributed much to both brain age estimation and age-group classification. All these alterations might reflect that aging of brain function is already in progress in middle adulthood. Overall, the present study indicated that the RSFCs undergo notable alterations from early to middle adulthood and highlighted the necessity of careful considerations of possible influences of these alterations in related studies. Copyright © 2016 Elsevier Inc. All rights reserved.

  2. A case study of real-time monitoring of solid-state phase transformations in acoustically levitated particles using near infrared and Raman spectroscopy.

    PubMed

    Rehder, Sönke; Wu, Jian X; Laackmann, Julian; Moritz, Hans-Ulrich; Rantanen, Jukka; Rades, Thomas; Leopold, Claudia S

    2013-01-23

    The objective of this study was to monitor the amorphous-to-crystalline solid-state phase transformation kinetics of the model drug ibuprofen with spectroscopic methods during acoustic levitation. Chemical and physical information was obtained by real-time near infrared (NIRS) and Raman spectroscopy measurements. The recrystallisation kinetic parameters (overall recrystallisation rate constant β and the time needed to reach 50% of the equilibrated level t(50)), were determined using a multivariate curve resolution approach. The acoustic levitation device coupled with non-invasive spectroscopy enabled monitoring of the recrystallisation process of the difficult-to-handle (adhesive) amorphous sample. The application of multivariate curve resolution enabled isolation of the underlying pure spectra, which corresponded well with the reference spectra of amorphous and crystalline ibuprofen. The recrystallisation kinetic parameters were estimated from the recrystallisation profiles. While the empirical recrystallisation rate constant determined by NIR and Raman spectroscopy were comparable, the lag time for recrystallisation was significantly lower with Raman spectroscopy as compared to NIRS. This observation was explained by the high energy density of the Raman laser beam, which might have led to local heating effects of the sample and thus reduced the recrystallisation onset time. It was concluded that acoustic levitation with NIR and Raman spectroscopy combined with multivariate curve resolution allowed direct determination of the recrystallisation kinetics of amorphous drugs and thus is a promising technique for monitoring solid-state phase transformations of adhesive small-sized samples during the early phase of drug development. Copyright © 2012 Elsevier B.V. All rights reserved.

  3. A Sandwich-Type Standard Error Estimator of SEM Models with Multivariate Time Series

    ERIC Educational Resources Information Center

    Zhang, Guangjian; Chow, Sy-Miin; Ong, Anthony D.

    2011-01-01

    Structural equation models are increasingly used as a modeling tool for multivariate time series data in the social and behavioral sciences. Standard error estimators of SEM models, originally developed for independent data, require modifications to accommodate the fact that time series data are inherently dependent. In this article, we extend a…

  4. From point process observations to collective neural dynamics: Nonlinear Hawkes process GLMs, low-dimensional dynamics and coarse graining

    PubMed Central

    Truccolo, Wilson

    2017-01-01

    This review presents a perspective on capturing collective dynamics in recorded neuronal ensembles based on multivariate point process models, inference of low-dimensional dynamics and coarse graining of spatiotemporal measurements. A general probabilistic framework for continuous time point processes reviewed, with an emphasis on multivariate nonlinear Hawkes processes with exogenous inputs. A point process generalized linear model (PP-GLM) framework for the estimation of discrete time multivariate nonlinear Hawkes processes is described. The approach is illustrated with the modeling of collective dynamics in neocortical neuronal ensembles recorded in human and non-human primates, and prediction of single-neuron spiking. A complementary approach to capture collective dynamics based on low-dimensional dynamics (“order parameters”) inferred via latent state-space models with point process observations is presented. The approach is illustrated by inferring and decoding low-dimensional dynamics in primate motor cortex during naturalistic reach and grasp movements. Finally, we briefly review hypothesis tests based on conditional inference and spatiotemporal coarse graining for assessing collective dynamics in recorded neuronal ensembles. PMID:28336305

  5. From point process observations to collective neural dynamics: Nonlinear Hawkes process GLMs, low-dimensional dynamics and coarse graining.

    PubMed

    Truccolo, Wilson

    2016-11-01

    This review presents a perspective on capturing collective dynamics in recorded neuronal ensembles based on multivariate point process models, inference of low-dimensional dynamics and coarse graining of spatiotemporal measurements. A general probabilistic framework for continuous time point processes reviewed, with an emphasis on multivariate nonlinear Hawkes processes with exogenous inputs. A point process generalized linear model (PP-GLM) framework for the estimation of discrete time multivariate nonlinear Hawkes processes is described. The approach is illustrated with the modeling of collective dynamics in neocortical neuronal ensembles recorded in human and non-human primates, and prediction of single-neuron spiking. A complementary approach to capture collective dynamics based on low-dimensional dynamics ("order parameters") inferred via latent state-space models with point process observations is presented. The approach is illustrated by inferring and decoding low-dimensional dynamics in primate motor cortex during naturalistic reach and grasp movements. Finally, we briefly review hypothesis tests based on conditional inference and spatiotemporal coarse graining for assessing collective dynamics in recorded neuronal ensembles. Published by Elsevier Ltd.

  6. Optimal Tuner Selection for Kalman Filter-Based Aircraft Engine Performance Estimation

    NASA Technical Reports Server (NTRS)

    Simon, Donald L.; Garg, Sanjay

    2010-01-01

    A linear point design methodology for minimizing the error in on-line Kalman filter-based aircraft engine performance estimation applications is presented. This technique specifically addresses the underdetermined estimation problem, where there are more unknown parameters than available sensor measurements. A systematic approach is applied to produce a model tuning parameter vector of appropriate dimension to enable estimation by a Kalman filter, while minimizing the estimation error in the parameters of interest. Tuning parameter selection is performed using a multi-variable iterative search routine which seeks to minimize the theoretical mean-squared estimation error. This paper derives theoretical Kalman filter estimation error bias and variance values at steady-state operating conditions, and presents the tuner selection routine applied to minimize these values. Results from the application of the technique to an aircraft engine simulation are presented and compared to the conventional approach of tuner selection. Experimental simulation results are found to be in agreement with theoretical predictions. The new methodology is shown to yield a significant improvement in on-line engine performance estimation accuracy

  7. Renewable energy rebound effect?: Estimating the impact of state renewable energy financial incentives on residential electricity consumption

    NASA Astrophysics Data System (ADS)

    Stephenson, Beth A.

    Climate change is a well-documented phenomenon. If left unchecked greenhouse gas emissions will continue global surface warming, likely leading to severe and irreversible impacts. Generating renewable energy has become an increasingly salient topic in energy policy as it may mitigate the impact of climate change. State renewable energy financial incentives have been in place since the mid-1970s in some states and over 40 states have adopted one or more incentives at some point since then. Using multivariate linear and fixed effects regression for the years 2002 through 2012, I estimate the relationship between state renewable energy financial incentives and residential electricity consumption, along with the associated policy implications. My hypothesis is that a renewable energy rebound effect is present; therefore, states with renewable energy financial incentives have a higher rate of residential electricity consumption. I find a renewable energy rebound effect is present in varying degrees for each model, but the results do not definitively indicate how particular incentives influence consumer behavior. States should use caution when adopting and keeping renewable energy financial incentives as this may increase consumption in the short-term. The long-term impact is unclear, making it worthwhile for policymakers to continue studying the potential for renewable energy financial incentives to alter consumer behavior.

  8. Multivariate and Multiscale Data Assimilation in Terrestrial Systems: A Review

    PubMed Central

    Montzka, Carsten; Pauwels, Valentijn R. N.; Franssen, Harrie-Jan Hendricks; Han, Xujun; Vereecken, Harry

    2012-01-01

    More and more terrestrial observational networks are being established to monitor climatic, hydrological and land-use changes in different regions of the World. In these networks, time series of states and fluxes are recorded in an automated manner, often with a high temporal resolution. These data are important for the understanding of water, energy, and/or matter fluxes, as well as their biological and physical drivers and interactions with and within the terrestrial system. Similarly, the number and accuracy of variables, which can be observed by spaceborne sensors, are increasing. Data assimilation (DA) methods utilize these observations in terrestrial models in order to increase process knowledge as well as to improve forecasts for the system being studied. The widely implemented automation in observing environmental states and fluxes makes an operational computation more and more feasible, and it opens the perspective of short-time forecasts of the state of terrestrial systems. In this paper, we review the state of the art with respect to DA focusing on the joint assimilation of observational data precedents from different spatial scales and different data types. An introduction is given to different DA methods, such as the Ensemble Kalman Filter (EnKF), Particle Filter (PF) and variational methods (3/4D-VAR). In this review, we distinguish between four major DA approaches: (1) univariate single-scale DA (UVSS), which is the approach used in the majority of published DA applications, (2) univariate multiscale DA (UVMS) referring to a methodology which acknowledges that at least some of the assimilated data are measured at a different scale than the computational grid scale, (3) multivariate single-scale DA (MVSS) dealing with the assimilation of at least two different data types, and (4) combined multivariate multiscale DA (MVMS). Finally, we conclude with a discussion on the advantages and disadvantages of the assimilation of multiple data types in a simulation model. Existing approaches can be used to simultaneously update several model states and model parameters if applicable. In other words, the basic principles for multivariate data assimilation are already available. We argue that a better understanding of the measurement errors for different observation types, improved estimates of observation bias and improved multiscale assimilation methods for data which scale nonlinearly is important to properly weight them in multiscale multivariate data assimilation. In this context, improved cross-validation of different data types, and increased ground truth verification of remote sensing products are required. PMID:23443380

  9. Multivariate and multiscale data assimilation in terrestrial systems: a review.

    PubMed

    Montzka, Carsten; Pauwels, Valentijn R N; Franssen, Harrie-Jan Hendricks; Han, Xujun; Vereecken, Harry

    2012-11-26

    More and more terrestrial observational networks are being established to monitor climatic, hydrological and land-use changes in different regions of the World. In these networks, time series of states and fluxes are recorded in an automated manner, often with a high temporal resolution. These data are important for the understanding of water, energy, and/or matter fluxes, as well as their biological and physical drivers and interactions with and within the terrestrial system. Similarly, the number and accuracy of variables, which can be observed by spaceborne sensors, are increasing. Data assimilation (DA) methods utilize these observations in terrestrial models in order to increase process knowledge as well as to improve forecasts for the system being studied. The widely implemented automation in observing environmental states and fluxes makes an operational computation more and more feasible, and it opens the perspective of short-time forecasts of the state of terrestrial systems. In this paper, we review the state of the art with respect to DA focusing on the joint assimilation of observational data precedents from different spatial scales and different data types. An introduction is given to different DA methods, such as the Ensemble Kalman Filter (EnKF), Particle Filter (PF) and variational methods (3/4D-VAR). In this review, we distinguish between four major DA approaches: (1) univariate single-scale DA (UVSS), which is the approach used in the majority of published DA applications, (2) univariate multiscale DA (UVMS) referring to a methodology which acknowledges that at least some of the assimilated data are measured at a different scale than the computational grid scale, (3) multivariate single-scale DA (MVSS) dealing with the assimilation of at least two different data types, and (4) combined multivariate multiscale DA (MVMS). Finally, we conclude with a discussion on the advantages and disadvantages of the assimilation of multiple data types in a simulation model. Existing approaches can be used to simultaneously update several model states and model parameters if applicable. In other words, the basic principles for multivariate data assimilation are already available. We argue that a better understanding of the measurement errors for different observation types, improved estimates of observation bias and improved multiscale assimilation methods for data which scale nonlinearly is important to properly weight them in multiscale multivariate data assimilation. In this context, improved cross-validation of different data types, and increased ground truth verification of remote sensing products are required.

  10. Critical elements on fitting the Bayesian multivariate Poisson Lognormal model

    NASA Astrophysics Data System (ADS)

    Zamzuri, Zamira Hasanah binti

    2015-10-01

    Motivated by a problem on fitting multivariate models to traffic accident data, a detailed discussion of the Multivariate Poisson Lognormal (MPL) model is presented. This paper reveals three critical elements on fitting the MPL model: the setting of initial estimates, hyperparameters and tuning parameters. These issues have not been highlighted in the literature. Based on simulation studies conducted, we have shown that to use the Univariate Poisson Model (UPM) estimates as starting values, at least 20,000 iterations are needed to obtain reliable final estimates. We also illustrated the sensitivity of the specific hyperparameter, which if it is not given extra attention, may affect the final estimates. The last issue is regarding the tuning parameters where they depend on the acceptance rate. Finally, a heuristic algorithm to fit the MPL model is presented. This acts as a guide to ensure that the model works satisfactorily given any data set.

  11. Estimation and Psychometric Analysis of Component Profile Scores via Multivariate Generalizability Theory

    ERIC Educational Resources Information Center

    Grochowalski, Joseph H.

    2015-01-01

    Component Universe Score Profile analysis (CUSP) is introduced in this paper as a psychometric alternative to multivariate profile analysis. The theoretical foundations of CUSP analysis are reviewed, which include multivariate generalizability theory and constrained principal components analysis. Because CUSP is a combination of generalizability…

  12. Kernel canonical-correlation Granger causality for multiple time series

    NASA Astrophysics Data System (ADS)

    Wu, Guorong; Duan, Xujun; Liao, Wei; Gao, Qing; Chen, Huafu

    2011-04-01

    Canonical-correlation analysis as a multivariate statistical technique has been applied to multivariate Granger causality analysis to infer information flow in complex systems. It shows unique appeal and great superiority over the traditional vector autoregressive method, due to the simplified procedure that detects causal interaction between multiple time series, and the avoidance of potential model estimation problems. However, it is limited to the linear case. Here, we extend the framework of canonical correlation to include the estimation of multivariate nonlinear Granger causality for drawing inference about directed interaction. Its feasibility and effectiveness are verified on simulated data.

  13. Resilience and tipping points of an exploited fish population over six decades.

    PubMed

    Vasilakopoulos, Paraskevas; Marshall, C Tara

    2015-05-01

    Complex natural systems with eroded resilience, such as populations, ecosystems and socio-ecological systems, respond to small perturbations with abrupt, discontinuous state shifts, or critical transitions. Theory of critical transitions suggests that such systems exhibit fold bifurcations featuring folded response curves, tipping points and alternate attractors. However, there is little empirical evidence of fold bifurcations occurring in actual complex natural systems impacted by multiple stressors. Moreover, resilience of complex systems to change currently lacks clear operational measures with generic application. Here, we provide empirical evidence for the occurrence of a fold bifurcation in an exploited fish population and introduce a generic measure of ecological resilience based on the observed fold bifurcation attributes. We analyse the multivariate development of Barents Sea cod (Gadus morhua), which is currently the world's largest cod stock, over six decades (1949-2009), and identify a population state shift in 1981. By plotting a multivariate population index against a multivariate stressor index, the shift mechanism was revealed suggesting that the observed population shift was a nonlinear response to the combined effects of overfishing and climate change. Annual resilience values were estimated based on the position of each year in relation to the fitted attractors and assumed tipping points of the fold bifurcation. By interpolating the annual resilience values, a folded stability landscape was fit, which was shaped as predicted by theory. The resilience assessment suggested that the population may be close to another tipping point. This study illustrates how a multivariate analysis, supported by theory of critical transitions and accompanied by a quantitative resilience assessment, can clarify shift mechanisms in data-rich complex natural systems. © 2014 John Wiley & Sons Ltd.

  14. Multivariate spatial models of excess crash frequency at area level: case of Costa Rica.

    PubMed

    Aguero-Valverde, Jonathan

    2013-10-01

    Recently, areal models of crash frequency have being used in the analysis of various area-wide factors affecting road crashes. On the other hand, disease mapping methods are commonly used in epidemiology to assess the relative risk of the population at different spatial units. A natural next step is to combine these two approaches to estimate the excess crash frequency at area level as a measure of absolute crash risk. Furthermore, multivariate spatial models of crash severity are explored in order to account for both frequency and severity of crashes and control for the spatial correlation frequently found in crash data. This paper aims to extent the concept of safety performance functions to be used in areal models of crash frequency. A multivariate spatial model is used for that purpose and compared to its univariate counterpart. Full Bayes hierarchical approach is used to estimate the models of crash frequency at canton level for Costa Rica. An intrinsic multivariate conditional autoregressive model is used for modeling spatial random effects. The results show that the multivariate spatial model performs better than its univariate counterpart in terms of the penalized goodness-of-fit measure Deviance Information Criteria. Additionally, the effects of the spatial smoothing due to the multivariate spatial random effects are evident in the estimation of excess equivalent property damage only crashes. Copyright © 2013 Elsevier Ltd. All rights reserved.

  15. Empirical Bayes approach to the estimation of "unsafety": the multivariate regression method.

    PubMed

    Hauer, E

    1992-10-01

    There are two kinds of clues to the unsafety of an entity: its traits (such as traffic, geometry, age, or gender) and its historical accident record. The Empirical Bayes approach to unsafety estimation makes use of both kinds of clues. It requires information about the mean and the variance of the unsafety in a "reference population" of similar entities. The method now in use for this purpose suffers from several shortcomings. First, a very large reference population is required. Second, the choice of reference population is to some extent arbitrary. Third, entities in the reference population usually cannot match the traits of the entity the unsafety of which is estimated. To alleviate these shortcomings the multivariate regression method for estimating the mean and variance of unsafety in reference populations is offered. Its logical foundations are described and its soundness is demonstrated. The use of the multivariate method makes the Empirical Bayes approach to unsafety estimation applicable to a wider range of circumstances and yields better estimates of unsafety. The application of the method to the tasks of identifying deviant entities and of estimating the effect of interventions on unsafety are discussed and illustrated by numerical examples.

  16. Toward estimating the impact of changes in immigrants' insurance eligibility on hospital expenditures for uncompensated care

    PubMed Central

    Castel, Liana D; Timbie, Justin W; Sendersky, Veronica; Curtis, Lesley H; Feather, Keith A; Schulman, Kevin A

    2003-01-01

    Background The Personal Responsibility and Work Opportunity Reconciliation Act (PRWORA) of 1996 gave states the option to withdraw Medicaid coverage of nonemergency care from most legal immigrants. Our goal was to assess the effect of PRWORA on hospital uncompensated care in the United States. Methods We collected the following state-level data for the period from 1994 through 1999: foreign-born, noncitizen population and health uninsurance rates (US Census Current Population Survey); percentage of teaching hospitals (American Hospital Association Annual Survey of Hospitals); and each state's decision whether to implement the PRWORA Medicaid bar for legal permanent residents or to continue offering nonemergency Medicaid coverage using state-only funds (Urban Institute). We modeled uncompensated care expenditures by state (also from the Annual Survey of Hospitals) in both univariate and multivariable regression analyses. Results When measured at the state level, there was no significant relationship between uncompensated care expenditures and states' percentage of noncitizen immigrants. Uninsurance rates were the only significant factor in predicting uncompensated hospital care expenditures by state. Conclusions Reducing the number of uninsured patients would most surely reduce hospital expenditures for uncompensated care. However, data limitations hampered our efforts to obtain a monetary estimate of hospitals' financial losses due specifically to the immigrant eligibility changes in PRWORA. Quantifying the impact of these provisions on hospitals will require better data sources. PMID:12523939

  17. Network meta-analysis of multiple outcome measures accounting for borrowing of information across outcomes.

    PubMed

    Achana, Felix A; Cooper, Nicola J; Bujkiewicz, Sylwia; Hubbard, Stephanie J; Kendrick, Denise; Jones, David R; Sutton, Alex J

    2014-07-21

    Network meta-analysis (NMA) enables simultaneous comparison of multiple treatments while preserving randomisation. When summarising evidence to inform an economic evaluation, it is important that the analysis accurately reflects the dependency structure within the data, as correlations between outcomes may have implication for estimating the net benefit associated with treatment. A multivariate NMA offers a framework for evaluating multiple treatments across multiple outcome measures while accounting for the correlation structure between outcomes. The standard NMA model is extended to multiple outcome settings in two stages. In the first stage, information is borrowed across outcomes as well across studies through modelling the within-study and between-study correlation structure. In the second stage, we make use of the additional assumption that intervention effects are exchangeable between outcomes to predict effect estimates for all outcomes, including effect estimates on outcomes where evidence is either sparse or the treatment had not been considered by any one of the studies included in the analysis. We apply the methods to binary outcome data from a systematic review evaluating the effectiveness of nine home safety interventions on uptake of three poisoning prevention practices (safe storage of medicines, safe storage of other household products, and possession of poison centre control telephone number) in households with children. Analyses are conducted in WinBUGS using Markov Chain Monte Carlo (MCMC) simulations. Univariate and the first stage multivariate models produced broadly similar point estimates of intervention effects but the uncertainty around the multivariate estimates varied depending on the prior distribution specified for the between-study covariance structure. The second stage multivariate analyses produced more precise effect estimates while enabling intervention effects to be predicted for all outcomes, including intervention effects on outcomes not directly considered by the studies included in the analysis. Accounting for the dependency between outcomes in a multivariate meta-analysis may or may not improve the precision of effect estimates from a network meta-analysis compared to analysing each outcome separately.

  18. Decoding of visual activity patterns from fMRI responses using multivariate pattern analyses and convolutional neural network.

    PubMed

    Zafar, Raheel; Kamel, Nidal; Naufal, Mohamad; Malik, Aamir Saeed; Dass, Sarat C; Ahmad, Rana Fayyaz; Abdullah, Jafri M; Reza, Faruque

    2017-01-01

    Decoding of human brain activity has always been a primary goal in neuroscience especially with functional magnetic resonance imaging (fMRI) data. In recent years, Convolutional neural network (CNN) has become a popular method for the extraction of features due to its higher accuracy, however it needs a lot of computation and training data. In this study, an algorithm is developed using Multivariate pattern analysis (MVPA) and modified CNN to decode the behavior of brain for different images with limited data set. Selection of significant features is an important part of fMRI data analysis, since it reduces the computational burden and improves the prediction performance; significant features are selected using t-test. MVPA uses machine learning algorithms to classify different brain states and helps in prediction during the task. General linear model (GLM) is used to find the unknown parameters of every individual voxel and the classification is done using multi-class support vector machine (SVM). MVPA-CNN based proposed algorithm is compared with region of interest (ROI) based method and MVPA based estimated values. The proposed method showed better overall accuracy (68.6%) compared to ROI (61.88%) and estimation values (64.17%).

  19. Quantifying the impact of between-study heterogeneity in multivariate meta-analyses

    PubMed Central

    Jackson, Dan; White, Ian R; Riley, Richard D

    2012-01-01

    Measures that quantify the impact of heterogeneity in univariate meta-analysis, including the very popular I2 statistic, are now well established. Multivariate meta-analysis, where studies provide multiple outcomes that are pooled in a single analysis, is also becoming more commonly used. The question of how to quantify heterogeneity in the multivariate setting is therefore raised. It is the univariate R2 statistic, the ratio of the variance of the estimated treatment effect under the random and fixed effects models, that generalises most naturally, so this statistic provides our basis. This statistic is then used to derive a multivariate analogue of I2, which we call . We also provide a multivariate H2 statistic, the ratio of a generalisation of Cochran's heterogeneity statistic and its associated degrees of freedom, with an accompanying generalisation of the usual I2 statistic, . Our proposed heterogeneity statistics can be used alongside all the usual estimates and inferential procedures used in multivariate meta-analysis. We apply our methods to some real datasets and show how our statistics are equally appropriate in the context of multivariate meta-regression, where study level covariate effects are included in the model. Our heterogeneity statistics may be used when applying any procedure for fitting the multivariate random effects model. Copyright © 2012 John Wiley & Sons, Ltd. PMID:22763950

  20. Differential flatness properties and multivariable adaptive control of ovarian system dynamics

    NASA Astrophysics Data System (ADS)

    Rigatos, Gerasimos

    2016-12-01

    The ovarian system exhibits nonlinear dynamics which is modeled by a set of coupled nonlinear differential equations. The paper proposes adaptive fuzzy control based on differential flatness theory for the complex dynamics of the ovarian system. It is proven that the dynamic model of the ovarian system, having as state variables the LH and the FSH hormones and their derivatives, is a differentially flat one. This means that all its state variables and its control inputs can be described as differential functions of the flat output. By exploiting differential flatness properties the system's dynamic model is written in the multivariable linear canonical (Brunovsky) form, for which the design of a state feedback controller becomes possible. After this transformation, the new control inputs of the system contain unknown nonlinear parts, which are identified with the use of neurofuzzy approximators. The learning procedure for these estimators is determined by the requirement the first derivative of the closed-loop's Lyapunov function to be a negative one. Moreover, Lyapunov stability analysis shows that H-infinity tracking performance is succeeded for the feedback control loop and this assures improved robustness to the aforementioned model uncertainty as well as to external perturbations. The efficiency of the proposed adaptive fuzzy control scheme is confirmed through simulation experiments.

  1. Multivariate Prediction Equations for HbA1c Lowering, Weight Change, and Hypoglycemic Events Associated with Insulin Rescue Medication in Type 2 Diabetes Mellitus: Informing Economic Modeling.

    PubMed

    Willis, Michael; Asseburg, Christian; Nilsson, Andreas; Johnsson, Kristina; Kartman, Bernt

    2017-03-01

    Type 2 diabetes mellitus (T2DM) is chronic and progressive and the cost-effectiveness of new treatment interventions must be established over long time horizons. Given the limited durability of drugs, assumptions regarding downstream rescue medication can drive results. Especially for insulin, for which treatment effects and adverse events are known to depend on patient characteristics, this can be problematic for health economic evaluation involving modeling. To estimate parsimonious multivariate equations of treatment effects and hypoglycemic event risks for use in parameterizing insulin rescue therapy in model-based cost-effectiveness analysis. Clinical evidence for insulin use in T2DM was identified in PubMed and from published reviews and meta-analyses. Study and patient characteristics and treatment effects and adverse event rates were extracted and the data used to estimate parsimonious treatment effect and hypoglycemic event risk equations using multivariate regression analysis. Data from 91 studies featuring 171 usable study arms were identified, mostly for premix and basal insulin types. Multivariate prediction equations for glycated hemoglobin A 1c lowering and weight change were estimated separately for insulin-naive and insulin-experienced patients. Goodness of fit (R 2 ) for both outcomes were generally good, ranging from 0.44 to 0.84. Multivariate prediction equations for symptomatic, nocturnal, and severe hypoglycemic events were also estimated, though considerable heterogeneity in definitions limits their usefulness. Parsimonious and robust multivariate prediction equations were estimated for glycated hemoglobin A 1c and weight change, separately for insulin-naive and insulin-experienced patients. Using these in economic simulation modeling in T2DM can improve realism and flexibility in modeling insulin rescue medication. Copyright © 2017 International Society for Pharmacoeconomics and Outcomes Research (ISPOR). Published by Elsevier Inc. All rights reserved.

  2. Selection Indices and Multivariate Analysis Show Similar Results in the Evaluation of Growth and Carcass Traits in Beef Cattle

    PubMed Central

    Brito Lopes, Fernando; da Silva, Marcelo Corrêa; Magnabosco, Cláudio Ulhôa; Goncalves Narciso, Marcelo; Sainz, Roberto Daniel

    2016-01-01

    This research evaluated a multivariate approach as an alternative tool for the purpose of selection regarding expected progeny differences (EPDs). Data were fitted using a multi-trait model and consisted of growth traits (birth weight and weights at 120, 210, 365 and 450 days of age) and carcass traits (longissimus muscle area (LMA), back-fat thickness (BF), and rump fat thickness (RF)), registered over 21 years in extensive breeding systems of Polled Nellore cattle in Brazil. Multivariate analyses were performed using standardized (zero mean and unit variance) EPDs. The k mean method revealed that the best fit of data occurred using three clusters (k = 3) (P < 0.001). Estimates of genetic correlation among growth and carcass traits and the estimates of heritability were moderate to high, suggesting that a correlated response approach is suitable for practical decision making. Estimates of correlation between selection indices and the multivariate index (LD1) were moderate to high, ranging from 0.48 to 0.97. This reveals that both types of indices give similar results and that the multivariate approach is reliable for the purpose of selection. The alternative tool seems very handy when economic weights are not available or in cases where more rapid identification of the best animals is desired. Interestingly, multivariate analysis allowed forecasting information based on the relationships among breeding values (EPDs). Also, it enabled fine discrimination, rapid data summarization after genetic evaluation, and permitted accounting for maternal ability and the genetic direct potential of the animals. In addition, we recommend the use of longissimus muscle area and subcutaneous fat thickness as selection criteria, to allow estimation of breeding values before the first mating season in order to accelerate the response to individual selection. PMID:26789008

  3. Selection Indices and Multivariate Analysis Show Similar Results in the Evaluation of Growth and Carcass Traits in Beef Cattle.

    PubMed

    Brito Lopes, Fernando; da Silva, Marcelo Corrêa; Magnabosco, Cláudio Ulhôa; Goncalves Narciso, Marcelo; Sainz, Roberto Daniel

    2016-01-01

    This research evaluated a multivariate approach as an alternative tool for the purpose of selection regarding expected progeny differences (EPDs). Data were fitted using a multi-trait model and consisted of growth traits (birth weight and weights at 120, 210, 365 and 450 days of age) and carcass traits (longissimus muscle area (LMA), back-fat thickness (BF), and rump fat thickness (RF)), registered over 21 years in extensive breeding systems of Polled Nellore cattle in Brazil. Multivariate analyses were performed using standardized (zero mean and unit variance) EPDs. The k mean method revealed that the best fit of data occurred using three clusters (k = 3) (P < 0.001). Estimates of genetic correlation among growth and carcass traits and the estimates of heritability were moderate to high, suggesting that a correlated response approach is suitable for practical decision making. Estimates of correlation between selection indices and the multivariate index (LD1) were moderate to high, ranging from 0.48 to 0.97. This reveals that both types of indices give similar results and that the multivariate approach is reliable for the purpose of selection. The alternative tool seems very handy when economic weights are not available or in cases where more rapid identification of the best animals is desired. Interestingly, multivariate analysis allowed forecasting information based on the relationships among breeding values (EPDs). Also, it enabled fine discrimination, rapid data summarization after genetic evaluation, and permitted accounting for maternal ability and the genetic direct potential of the animals. In addition, we recommend the use of longissimus muscle area and subcutaneous fat thickness as selection criteria, to allow estimation of breeding values before the first mating season in order to accelerate the response to individual selection.

  4. Estimating an Effect Size in One-Way Multivariate Analysis of Variance (MANOVA)

    ERIC Educational Resources Information Center

    Steyn, H. S., Jr.; Ellis, S. M.

    2009-01-01

    When two or more univariate population means are compared, the proportion of variation in the dependent variable accounted for by population group membership is eta-squared. This effect size can be generalized by using multivariate measures of association, based on the multivariate analysis of variance (MANOVA) statistics, to establish whether…

  5. Small Sample Properties of Bayesian Multivariate Autoregressive Time Series Models

    ERIC Educational Resources Information Center

    Price, Larry R.

    2012-01-01

    The aim of this study was to compare the small sample (N = 1, 3, 5, 10, 15) performance of a Bayesian multivariate vector autoregressive (BVAR-SEM) time series model relative to frequentist power and parameter estimation bias. A multivariate autoregressive model was developed based on correlated autoregressive time series vectors of varying…

  6. Multivariate localization methods for ensemble Kalman filtering

    NASA Astrophysics Data System (ADS)

    Roh, S.; Jun, M.; Szunyogh, I.; Genton, M. G.

    2015-12-01

    In ensemble Kalman filtering (EnKF), the small number of ensemble members that is feasible to use in a practical data assimilation application leads to sampling variability of the estimates of the background error covariances. The standard approach to reducing the effects of this sampling variability, which has also been found to be highly efficient in improving the performance of EnKF, is the localization of the estimates of the covariances. One family of localization techniques is based on taking the Schur (element-wise) product of the ensemble-based sample covariance matrix and a correlation matrix whose entries are obtained by the discretization of a distance-dependent correlation function. While the proper definition of the localization function for a single state variable has been extensively investigated, a rigorous definition of the localization function for multiple state variables that exist at the same locations has been seldom considered. This paper introduces two strategies for the construction of localization functions for multiple state variables. The proposed localization functions are tested by assimilating simulated observations experiments into the bivariate Lorenz 95 model with their help.

  7. Estimation of failure criteria in multivariate sensory shelf life testing using survival analysis.

    PubMed

    Giménez, Ana; Gagliardi, Andrés; Ares, Gastón

    2017-09-01

    For most food products, shelf life is determined by changes in their sensory characteristics. A predetermined increase or decrease in the intensity of a sensory characteristic has frequently been used to signal that a product has reached the end of its shelf life. Considering all attributes change simultaneously, the concept of multivariate shelf life allows a single measurement of deterioration that takes into account all these sensory changes at a certain storage time. The aim of the present work was to apply survival analysis to estimate failure criteria in multivariate sensory shelf life testing using two case studies, hamburger buns and orange juice, by modelling the relationship between consumers' rejection of the product and the deterioration index estimated using PCA. In both studies, a panel of 13 trained assessors evaluated the samples using descriptive analysis whereas a panel of 100 consumers answered a "yes" or "no" question regarding intention to buy or consume the product. PC1 explained the great majority of the variance, indicating all sensory characteristics evolved similarly with storage time. Thus, PC1 could be regarded as index of sensory deterioration and a single failure criterion could be estimated through survival analysis for 25 and 50% consumers' rejection. The proposed approach based on multivariate shelf life testing may increase the accuracy of shelf life estimations. Copyright © 2017 Elsevier Ltd. All rights reserved.

  8. [Factors associated with work-related accidents in the informal commercial sector].

    PubMed

    Rios, Marcela Andrade; Nery, Adriana Alves; Rios, Polianna Alves Andrade; Casotti, Cezar Augusto; Cardoso, Jefferson Paixão

    2015-06-01

    This study aimed to estimate the incidence of non-fatal work-related accidents in the informal commercial sector and analyze associated socio-demographic, occupational, workplace, and health factors, in a cross-sectional survey of 434 workers in the business district of Jequié, Bahia State, Brazil, in 2013. Logistic regression was used to assess factors associated with accidents. Incidence of accidents in the previous 12 months was estimated at 32.3%, and multivariate analysis showed higher odds of accidents in male sex workers (OR = 1.61), young individuals (OR = 4.62), meat or poultry workers (OR = 9.55), and workers performing heavy physical work (OR = 1.71). The results show the need for public policies to prevent accidents in the informal commercial sector.

  9. Estimation of regions of attraction and ultimate boundedness for multiloop LQ regulators. [Linear Quadratic

    NASA Technical Reports Server (NTRS)

    Joshi, S. M.

    1984-01-01

    Closed-loop stability is investigated for multivariable linear time-invariant systems controlled by optimal full state feedback linear quadratic (LQ) regulators, with nonlinear gains present in the feedback channels. Estimates are obtained for the region of attraction when the nonlinearities escape the (0.5, infinity) sector in regions away from the origin and for the region of ultimate boundedness when the nonlinearities escape the sector near the origin. The expressions for these regions also provide methods for selecting the performance function parameters in order to obtain LQ designs with better tolerance for nonlinearities. The analytical results are illustrated by applying them to the problem of controlling the rigid-body pitch angle and elastic motion of a large, flexible space antenna.

  10. Declining estimates of infertility in the United States: 1982-2002.

    PubMed

    Stephen, Elizabeth Hervey; Chandra, Anjani

    2006-09-01

    To determine if the decline in infertility has been uniform across subgroups. Periodic data from the National Fertility Survey and the National Survey of Family Growth were used to determine which factors contributed to the decline in 12-month infertility in the United States. National Survey of Family Growth, a periodic US nationally representative study. A nationally representative sample of married women aged 15-44 years, N = 15,303 for pooled data across 4 survey years. None. Estimates of infertility prevalence among married women aged 15-44 years. The decline in 12-month infertility in the United States from 8.5% in 1982 and 7.4% in 2002 was significant. This decline was evident in nearly all subgroups of married women. In the multivariate analysis, 12-month infertility was more likely among women who were older and nulliparous, were non-Hispanic black or Hispanic, and did not have a college degree. The decline in 12-month infertility was observed even after controlling for the compositional differences of the population over time. Among married women in the United States, there has been a significant decline in 12-month infertility, which cannot be explained by changes in the composition of the population from 1982-2002.

  11. A simplified parsimonious higher order multivariate Markov chain model

    NASA Astrophysics Data System (ADS)

    Wang, Chao; Yang, Chuan-sheng

    2017-09-01

    In this paper, a simplified parsimonious higher-order multivariate Markov chain model (SPHOMMCM) is presented. Moreover, parameter estimation method of TPHOMMCM is give. Numerical experiments shows the effectiveness of TPHOMMCM.

  12. Estimating multivariate similarity between neuroimaging datasets with sparse canonical correlation analysis: an application to perfusion imaging.

    PubMed

    Rosa, Maria J; Mehta, Mitul A; Pich, Emilio M; Risterucci, Celine; Zelaya, Fernando; Reinders, Antje A T S; Williams, Steve C R; Dazzan, Paola; Doyle, Orla M; Marquand, Andre F

    2015-01-01

    An increasing number of neuroimaging studies are based on either combining more than one data modality (inter-modal) or combining more than one measurement from the same modality (intra-modal). To date, most intra-modal studies using multivariate statistics have focused on differences between datasets, for instance relying on classifiers to differentiate between effects in the data. However, to fully characterize these effects, multivariate methods able to measure similarities between datasets are needed. One classical technique for estimating the relationship between two datasets is canonical correlation analysis (CCA). However, in the context of high-dimensional data the application of CCA is extremely challenging. A recent extension of CCA, sparse CCA (SCCA), overcomes this limitation, by regularizing the model parameters while yielding a sparse solution. In this work, we modify SCCA with the aim of facilitating its application to high-dimensional neuroimaging data and finding meaningful multivariate image-to-image correspondences in intra-modal studies. In particular, we show how the optimal subset of variables can be estimated independently and we look at the information encoded in more than one set of SCCA transformations. We illustrate our framework using Arterial Spin Labeling data to investigate multivariate similarities between the effects of two antipsychotic drugs on cerebral blood flow.

  13. Implementation of the Iterative Proportion Fitting Algorithm for Geostatistical Facies Modeling

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Li Yupeng, E-mail: yupeng@ualberta.ca; Deutsch, Clayton V.

    2012-06-15

    In geostatistics, most stochastic algorithm for simulation of categorical variables such as facies or rock types require a conditional probability distribution. The multivariate probability distribution of all the grouped locations including the unsampled location permits calculation of the conditional probability directly based on its definition. In this article, the iterative proportion fitting (IPF) algorithm is implemented to infer this multivariate probability. Using the IPF algorithm, the multivariate probability is obtained by iterative modification to an initial estimated multivariate probability using lower order bivariate probabilities as constraints. The imposed bivariate marginal probabilities are inferred from profiles along drill holes or wells.more » In the IPF process, a sparse matrix is used to calculate the marginal probabilities from the multivariate probability, which makes the iterative fitting more tractable and practical. This algorithm can be extended to higher order marginal probability constraints as used in multiple point statistics. The theoretical framework is developed and illustrated with estimation and simulation example.« less

  14. Medicaid enrollment after liver transplantation: Effects of medicaid expansion.

    PubMed

    Tumin, Dmitry; Hayes, Don; Washburn, W Kenneth; Tobias, Joseph D; Black, Sylvester M

    2016-08-01

    Liver transplantation (LT) recipients in the United States have low rates of paid employment, making some eligible for Medicaid public health insurance after transplant. We test whether recent expansions of Medicaid eligibility increased Medicaid enrollment and insurance coverage in this population. Patients of ages 18-59 years receiving first-time LTs in 2009-2013 were identified in the United Network for Organ Sharing registry and stratified according to insurance at transplantation (private versus Medicaid/Medicare). Posttransplant insurance status was assessed through June 2015. Difference-in-difference multivariate competing-risks models stratified on state of residence estimated effects of Medicaid expansion on Medicaid enrollment or use of uninsured care after LT. Of 12,837 patients meeting inclusion criteria, 6554 (51%) lived in a state that expanded Medicaid eligibility. Medicaid participation after LT was more common in Medicaid-expansion states (25%) compared to nonexpansion states (19%; P < 0.001). Multivariate analysis of 7279 patients with private insurance at transplantation demonstrated that after the effective date of Medicaid expansion (January 1, 2014), the hazard of posttransplant Medicaid enrollment increased in states participating in Medicaid expansion (hazard ratio [HR] = 1.5; 95% confidence interval [CI] = 1.1-2.0; P = 0.01), but not in states opting out of Medicaid expansion (HR = 0.8; 95% CI = 0.5-1.3; P = 0.37), controlling for individual characteristics and time-invariant state-level factors. No effects of Medicaid expansion on the use of posttransplant uninsured care were found, regardless of private or government insurance status at transplantation. Medicaid expansion increased posttransplant Medicaid enrollment among patients who had private insurance at transplantation, but it did not improve overall access to health insurance among LT recipients. Liver Transplantation 22 1075-1084 2016 AASLD. © 2016 American Association for the Study of Liver Diseases.

  15. Delirium superimposed on dementia: defining disease states and course from longitudinal measurements of a multivariate index using latent class analysis and hidden Markov chains.

    PubMed

    Ciampi, Antonio; Dyachenko, Alina; Cole, Martin; McCusker, Jane

    2011-12-01

    The study of mental disorders in the elderly presents substantial challenges due to population heterogeneity, coexistence of different mental disorders, and diagnostic uncertainty. While reliable tools have been developed to collect relevant data, new approaches to study design and analysis are needed. We focus on a new analytic approach. Our framework is based on latent class analysis and hidden Markov chains. From repeated measurements of a multivariate disease index, we extract the notion of underlying state of a patient at a time point. The course of the disorder is then a sequence of transitions among states. States and transitions are not observable; however, the probability of being in a state at a time point, and the transition probabilities from one state to another over time can be estimated. Data from 444 patients with and without diagnosis of delirium and dementia were available from a previous study. The Delirium Index was measured at diagnosis, and at 2 and 6 months from diagnosis. Four latent classes were identified: fairly healthy, moderately ill, clearly sick, and very sick. Dementia and delirium could not be separated on the basis of these data alone. Indeed, as the probability of delirium increased, so did the probability of decline of mental functions. Eight most probable courses were identified, including good and poor stable courses, and courses exhibiting various patterns of improvement. Latent class analysis and hidden Markov chains offer a promising tool for studying mental disorders in the elderly. Its use may show its full potential as new data become available.

  16. A multivariate cure model for left-censored and right-censored data with application to colorectal cancer screening patterns.

    PubMed

    Hagar, Yolanda C; Harvey, Danielle J; Beckett, Laurel A

    2016-08-30

    We develop a multivariate cure survival model to estimate lifetime patterns of colorectal cancer screening. Screening data cover long periods of time, with sparse observations for each person. Some events may occur before the study begins or after the study ends, so the data are both left-censored and right-censored, and some individuals are never screened (the 'cured' population). We propose a multivariate parametric cure model that can be used with left-censored and right-censored data. Our model allows for the estimation of the time to screening as well as the average number of times individuals will be screened. We calculate likelihood functions based on the observations for each subject using a distribution that accounts for within-subject correlation and estimate parameters using Markov chain Monte Carlo methods. We apply our methods to the estimation of lifetime colorectal cancer screening behavior in the SEER-Medicare data set. Copyright © 2016 John Wiley & Sons, Ltd. Copyright © 2016 John Wiley & Sons, Ltd.

  17. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Thompson, Stephen R., E-mail: stephen.thompson@sesiahs.health.nsw.gov.au; Department of Radiation Oncology, Prince of Wales Hospital, Sydney; University of New South Wales, Sydney

    Purpose: We aimed to estimate the optimal proportion of all gynecological cancers that should be treated with brachytherapy (BT)-the optimal brachytherapy utilization rate (BTU)-to compare this with actual gynecological BTU and to assess the effects of nonmedical factors on access to BT. Methods and Materials: The previously constructed inter/multinational guideline-based peer-reviewed models of optimal BTU for cancers of the uterine cervix, uterine corpus, and vagina were combined to estimate optimal BTU for all gynecological cancers. The robustness of the model was tested by univariate and multivariate sensitivity analyses. The resulting model was applied to New South Wales (NSW), the Unitedmore » States, and Western Europe. Actual BTU was determined for NSW by a retrospective patterns-of-care study of BT; for Western Europe from published reports; and for the United States from Surveillance, Epidemiology, and End Results data. Differences between optimal and actual BTU were assessed. The effect of nonmedical factors on access to BT in NSW were analyzed. Results: Gynecological BTU was as follows: NSW 28% optimal (95% confidence interval [CI] 26%-33%) compared with 14% actual; United States 30% optimal (95% CI 26%-34%) and 10% actual; and Western Europe 27% optimal (95% CI 25%-32%) and 16% actual. On multivariate analysis, NSW patients were more likely to undergo gynecological BT if residing in Area Health Service equipped with BT (odds ratio 1.76, P=.008) and if residing in socioeconomically disadvantaged postcodes (odds ratio 1.12, P=.05), but remoteness of residence was not significant. Conclusions: Gynecological BT is underutilized in NSW, Western Europe, and the United States given evidence-based guidelines. Access to BT equipment in NSW was significantly associated with higher utilization rates. Causes of underutilization elsewhere were undetermined. Our model of optimal BTU can be used as a quality assurance tool, providing an evidence-based benchmark against which actual patterns of practice can be measured. It can also be used to assist in determining the adequacy of BT resource allocation.« less

  18. A tridiagonal parsimonious higher order multivariate Markov chain model

    NASA Astrophysics Data System (ADS)

    Wang, Chao; Yang, Chuan-sheng

    2017-09-01

    In this paper, we present a tridiagonal parsimonious higher-order multivariate Markov chain model (TPHOMMCM). Moreover, estimation method of the parameters in TPHOMMCM is give. Numerical experiments illustrate the effectiveness of TPHOMMCM.

  19. Identifying Pleiotropic Genes in Genome-Wide Association Studies for Multivariate Phenotypes with Mixed Measurement Scales

    PubMed Central

    Williams, L. Keoki; Buu, Anne

    2017-01-01

    We propose a multivariate genome-wide association test for mixed continuous, binary, and ordinal phenotypes. A latent response model is used to estimate the correlation between phenotypes with different measurement scales so that the empirical distribution of the Fisher’s combination statistic under the null hypothesis is estimated efficiently. The simulation study shows that our proposed correlation estimation methods have high levels of accuracy. More importantly, our approach conservatively estimates the variance of the test statistic so that the type I error rate is controlled. The simulation also shows that the proposed test maintains the power at the level very close to that of the ideal analysis based on known latent phenotypes while controlling the type I error. In contrast, conventional approaches–dichotomizing all observed phenotypes or treating them as continuous variables–could either reduce the power or employ a linear regression model unfit for the data. Furthermore, the statistical analysis on the database of the Study of Addiction: Genetics and Environment (SAGE) demonstrates that conducting a multivariate test on multiple phenotypes can increase the power of identifying markers that may not be, otherwise, chosen using marginal tests. The proposed method also offers a new approach to analyzing the Fagerström Test for Nicotine Dependence as multivariate phenotypes in genome-wide association studies. PMID:28081206

  20. Baseline estimation in flame's spectra by using neural networks and robust statistics

    NASA Astrophysics Data System (ADS)

    Garces, Hugo; Arias, Luis; Rojas, Alejandro

    2014-09-01

    This work presents a baseline estimation method in flame spectra based on artificial intelligence structure as a neural network, combining robust statistics with multivariate analysis to automatically discriminate measured wavelengths belonging to continuous feature for model adaptation, surpassing restriction of measuring target baseline for training. The main contributions of this paper are: to analyze a flame spectra database computing Jolliffe statistics from Principal Components Analysis detecting wavelengths not correlated with most of the measured data corresponding to baseline; to systematically determine the optimal number of neurons in hidden layers based on Akaike's Final Prediction Error; to estimate baseline in full wavelength range sampling measured spectra; and to train an artificial intelligence structure as a Neural Network which allows to generalize the relation between measured and baseline spectra. The main application of our research is to compute total radiation with baseline information, allowing to diagnose combustion process state for optimization in early stages.

  1. An approach to online network monitoring using clustered patterns

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Kim, Jinoh; Sim, Alex; Suh, Sang C.

    Network traffic monitoring is a core element in network operations and management for various purposes such as anomaly detection, change detection, and fault/failure detection. In this study, we introduce a new approach to online monitoring using a pattern-based representation of the network traffic. Unlike the past online techniques limited to a single variable to summarize (e.g., sketch), the focus of this study is on capturing the network state from the multivariate attributes under consideration. To this end, we employ clustering with its benefit of the aggregation of multidimensional variables. The clustered result represents the state of the network with regardmore » to the monitored variables, which can also be compared with the previously observed patterns visually and quantitatively. Finally, we demonstrate the proposed method with two popular use cases, one for estimating state changes and the other for identifying anomalous states, to confirm its feasibility.« less

  2. An approach to online network monitoring using clustered patterns

    DOE PAGES

    Kim, Jinoh; Sim, Alex; Suh, Sang C.; ...

    2017-03-13

    Network traffic monitoring is a core element in network operations and management for various purposes such as anomaly detection, change detection, and fault/failure detection. In this study, we introduce a new approach to online monitoring using a pattern-based representation of the network traffic. Unlike the past online techniques limited to a single variable to summarize (e.g., sketch), the focus of this study is on capturing the network state from the multivariate attributes under consideration. To this end, we employ clustering with its benefit of the aggregation of multidimensional variables. The clustered result represents the state of the network with regardmore » to the monitored variables, which can also be compared with the previously observed patterns visually and quantitatively. Finally, we demonstrate the proposed method with two popular use cases, one for estimating state changes and the other for identifying anomalous states, to confirm its feasibility.« less

  3. The impact of family planning clinic programs on adolescent pregnancy.

    PubMed

    Forrest, J D; Hermalin, A I; Henshaw, S K

    1981-01-01

    During the 1970s, there was a decline in adolescent childbearing in the United States and, among teenagers who were sexually active, there was a decline in pregnancy rates as well. To what extent was increased enrollment by teenagers in federally funded family planning clinics responsible for these declines? Areal multivariate analysis reveals that adolescent birthrates were reduced between 1970 and 1975 as the result of enrollment by teenagers in family planning clinics, independent of the effects of other factors also affecting fertility, such as poverty status, education and urbanization. Using a model which controls for differences in adolescent sexual activity in different areas in 1970 and 1975, the analysis found that for every 10 teenage patients enrolled in family planning clinics in 1975, about one birth was averted in 1976. Other multivariate models, which did not control for differences in sexual activity, showed changes in the same direction, though of smaller dimension. Since the family planning program averts not only births but also pregnancies that result in abortions and miscarriages, an estimate was made of the total number of pregnancies averted by the program. Based on the proportion of unintended pregnancies among adolescents that resulted in live births in 1976 (36 percent), it was estimated that for every 10 teen patients enrolled in 1975, almost three pregnancies were averted in the following year. Over the 1970s, an estimated 2.6 million unintended adolescent pregnancies were averted by the program--944,000 births, 1,376,000 abortions and 326,000 miscarriages. In 1979 alone, an estimated 417,000 unintended pregnancies were prevented by the program.

  4. Random forests on Hadoop for genome-wide association studies of multivariate neuroimaging phenotypes

    PubMed Central

    2013-01-01

    Motivation Multivariate quantitative traits arise naturally in recent neuroimaging genetics studies, in which both structural and functional variability of the human brain is measured non-invasively through techniques such as magnetic resonance imaging (MRI). There is growing interest in detecting genetic variants associated with such multivariate traits, especially in genome-wide studies. Random forests (RFs) classifiers, which are ensembles of decision trees, are amongst the best performing machine learning algorithms and have been successfully employed for the prioritisation of genetic variants in case-control studies. RFs can also be applied to produce gene rankings in association studies with multivariate quantitative traits, and to estimate genetic similarities measures that are predictive of the trait. However, in studies involving hundreds of thousands of SNPs and high-dimensional traits, a very large ensemble of trees must be inferred from the data in order to obtain reliable rankings, which makes the application of these algorithms computationally prohibitive. Results We have developed a parallel version of the RF algorithm for regression and genetic similarity learning tasks in large-scale population genetic association studies involving multivariate traits, called PaRFR (Parallel Random Forest Regression). Our implementation takes advantage of the MapReduce programming model and is deployed on Hadoop, an open-source software framework that supports data-intensive distributed applications. Notable speed-ups are obtained by introducing a distance-based criterion for node splitting in the tree estimation process. PaRFR has been applied to a genome-wide association study on Alzheimer's disease (AD) in which the quantitative trait consists of a high-dimensional neuroimaging phenotype describing longitudinal changes in the human brain structure. PaRFR provides a ranking of SNPs associated to this trait, and produces pair-wise measures of genetic proximity that can be directly compared to pair-wise measures of phenotypic proximity. Several known AD-related variants have been identified, including APOE4 and TOMM40. We also present experimental evidence supporting the hypothesis of a linear relationship between the number of top-ranked mutated states, or frequent mutation patterns, and an indicator of disease severity. Availability The Java codes are freely available at http://www2.imperial.ac.uk/~gmontana. PMID:24564704

  5. Random forests on Hadoop for genome-wide association studies of multivariate neuroimaging phenotypes.

    PubMed

    Wang, Yue; Goh, Wilson; Wong, Limsoon; Montana, Giovanni

    2013-01-01

    Multivariate quantitative traits arise naturally in recent neuroimaging genetics studies, in which both structural and functional variability of the human brain is measured non-invasively through techniques such as magnetic resonance imaging (MRI). There is growing interest in detecting genetic variants associated with such multivariate traits, especially in genome-wide studies. Random forests (RFs) classifiers, which are ensembles of decision trees, are amongst the best performing machine learning algorithms and have been successfully employed for the prioritisation of genetic variants in case-control studies. RFs can also be applied to produce gene rankings in association studies with multivariate quantitative traits, and to estimate genetic similarities measures that are predictive of the trait. However, in studies involving hundreds of thousands of SNPs and high-dimensional traits, a very large ensemble of trees must be inferred from the data in order to obtain reliable rankings, which makes the application of these algorithms computationally prohibitive. We have developed a parallel version of the RF algorithm for regression and genetic similarity learning tasks in large-scale population genetic association studies involving multivariate traits, called PaRFR (Parallel Random Forest Regression). Our implementation takes advantage of the MapReduce programming model and is deployed on Hadoop, an open-source software framework that supports data-intensive distributed applications. Notable speed-ups are obtained by introducing a distance-based criterion for node splitting in the tree estimation process. PaRFR has been applied to a genome-wide association study on Alzheimer's disease (AD) in which the quantitative trait consists of a high-dimensional neuroimaging phenotype describing longitudinal changes in the human brain structure. PaRFR provides a ranking of SNPs associated to this trait, and produces pair-wise measures of genetic proximity that can be directly compared to pair-wise measures of phenotypic proximity. Several known AD-related variants have been identified, including APOE4 and TOMM40. We also present experimental evidence supporting the hypothesis of a linear relationship between the number of top-ranked mutated states, or frequent mutation patterns, and an indicator of disease severity. The Java codes are freely available at http://www2.imperial.ac.uk/~gmontana.

  6. Comparison between the univariate and multivariate analysis on the partial characterization of the endoglucanase produced in the solid state fermentation by Aspergillus oryzae ATCC 10124.

    PubMed

    de Brito, Aila Riany; Santos Reis, Nadabe Dos; Silva, Tatielle Pereira; Ferreira Bonomo, Renata Cristina; Trovatti Uetanabaro, Ana Paula; de Assis, Sandra Aparecida; da Silva, Erik Galvão Paranhos; Aguiar-Oliveira, Elizama; Oliveira, Julieta Rangel; Franco, Marcelo

    2017-11-26

    Endoglucanase production by Aspergillus oryzae ATCC 10124 cultivated in rice husks or peanut shells was optimized by experimental design as a function of humidity, time, and temperature. The optimum temperature for the endoglucanase activity was estimated by a univariate analysis (one factor at the time) as 50°C (rice husks) and 60°C (peanut shells), however, by a multivariate analysis (synergism of factors), it was determined a different temperature (56°C) for endoglucanase from peanut shells. For the optimum pH, values determined by univariate and multivariate analysis were 5 and 5.2 (rice husk) and 5 and 7.6 (peanut shells). In addition, the best half-lives were observed at 50°C as 22.8 hr (rice husks) and 7.3 hr (peanut shells), also, 80% of residual activities was obtained between 30 and 50°C for both substrates, and the pH stability was improved at 5-7 (rice hulls) and 6-9 (peanut shells). Both endoglucanases obtained presented different characteristics as a result of the versatility of fungi in different substrates.

  7. An alternative derivation of the stationary distribution of the multivariate neutral Wright-Fisher model for low mutation rates with a view to mutation rate estimation from site frequency data.

    PubMed

    Schrempf, Dominik; Hobolth, Asger

    2017-04-01

    Recently, Burden and Tang (2016) provided an analytical expression for the stationary distribution of the multivariate neutral Wright-Fisher model with low mutation rates. In this paper we present a simple, alternative derivation that illustrates the approximation. Our proof is based on the discrete multivariate boundary mutation model which has three key ingredients. First, the decoupled Moran model is used to describe genetic drift. Second, low mutation rates are assumed by limiting mutations to monomorphic states. Third, the mutation rate matrix is separated into a time-reversible part and a flux part, as suggested by Burden and Tang (2016). An application of our result to data from several great apes reveals that the assumption of stationarity may be inadequate or that other evolutionary forces like selection or biased gene conversion are acting. Furthermore we find that the model with a reversible mutation rate matrix provides a reasonably good fit to the data compared to the one with a non-reversible mutation rate matrix. Copyright © 2016 The Author(s). Published by Elsevier Inc. All rights reserved.

  8. Social Context and Dental Pain in Adults of Colombian Ethnic Minority Groups: A Multilevel Cross-Sectional Study.

    PubMed

    Ardila, Carlos M; Agudelo-Suárez, Andrés A

    2016-01-01

    To estimate the effect of social context on dental pain in adults of Colombian ethnic minority groups (CEGs). Information from 34,843 participants was used. A multilevel model was constructed that had ethnic groups (ie, CEGs and non-CEGs) at level 1 and Colombian states at level 2. Contextual variables included gross domestic product (GDP), Human Development Index (HDI), and Unmet Basic Needs Index (UBNI). Dental pain was observed in 12.3% of 6,440 CEGs. In an unadjusted logistic regression model, dental pain was associated with being a CEG (odds ratio [95% confidence interval], 1.34 [1.22-1.46]; P = .0001). This association remained significant after adjusting for possible confounding variables. An unconditional multilevel analysis showed that the variance in dental pain was statistically significant at the ethnic group level (β = 0.047 ± 0.015; P = .0009) and at the state level (β = 0.038 ± 0.019; P = .02) and that the variation between ethnic groups was higher than the variation between states (55% vs 45%, respectively). In a multivariate model, the variance in dental pain was also statistically significant at the ethnic group level (β = 0.029 ± 0.012; P = .007) and the state level (β = 0.042 ± .019; P = .01), but the variation between states was higher (40% vs 60%). The results of multilevel multivariate analyses showed that dental pain was associated with increasing age (β = 0.009 ± 0.001; P = .0001), lower education level (β = 0.302 ± 0.103; P = .0001), female sex (β = 0.031 ± 0.069; P = .003), GDP (β = 5.136 ± 2.009; P = .002) and HDI (β = 6.862 ± 5.550; P = .004); however, UBNI was not associated with dental pain. The variance in dental pain was higher between states than between ethnic groups in the multivariate multilevel model. Dental pain in CEGs was associated with contextual and individual factors. Considering contextual factors, GDP and HDI may play a major role in dental pain prevalence.

  9. On the degrees of freedom of reduced-rank estimators in multivariate regression

    PubMed Central

    Mukherjee, A.; Chen, K.; Wang, N.; Zhu, J.

    2015-01-01

    Summary We study the effective degrees of freedom of a general class of reduced-rank estimators for multivariate regression in the framework of Stein's unbiased risk estimation. A finite-sample exact unbiased estimator is derived that admits a closed-form expression in terms of the thresholded singular values of the least-squares solution and hence is readily computable. The results continue to hold in the high-dimensional setting where both the predictor and the response dimensions may be larger than the sample size. The derived analytical form facilitates the investigation of theoretical properties and provides new insights into the empirical behaviour of the degrees of freedom. In particular, we examine the differences and connections between the proposed estimator and a commonly-used naive estimator. The use of the proposed estimator leads to efficient and accurate prediction risk estimation and model selection, as demonstrated by simulation studies and a data example. PMID:26702155

  10. Direct observation of magnetic domains by Kerr microscopy in a Ni-Mn-Ga magnetic shape-memory alloy

    NASA Astrophysics Data System (ADS)

    Perevertov, O.; Heczko, O.; Schäfer, R.

    2017-04-01

    The magnetic domains in a magnetic shape-memory Ni-Mn-Ga alloy were observed by magneto-optical Kerr microscopy using monochromatic blue LED light. The domains were observed for both single- and multivariant ferroelastic states of modulated martensite. The multivariant state with very fine twins was spontaneously formed after transformation from high-temperature austenite. For both cases, bar domains separated by 180∘ domain walls were found and their dynamics was studied. A quasidomain model was applied to explain the domains in the multivariant state.

  11. Deterministic annealing for density estimation by multivariate normal mixtures

    NASA Astrophysics Data System (ADS)

    Kloppenburg, Martin; Tavan, Paul

    1997-03-01

    An approach to maximum-likelihood density estimation by mixtures of multivariate normal distributions for large high-dimensional data sets is presented. Conventionally that problem is tackled by notoriously unstable expectation-maximization (EM) algorithms. We remove these instabilities by the introduction of soft constraints, enabling deterministic annealing. Our developments are motivated by the proof that algorithmically stable fuzzy clustering methods that are derived from statistical physics analogs are special cases of EM procedures.

  12. Spatio-temporal interpolation of precipitation during monsoon periods in Pakistan

    NASA Astrophysics Data System (ADS)

    Hussain, Ijaz; Spöck, Gunter; Pilz, Jürgen; Yu, Hwa-Lung

    2010-08-01

    Spatio-temporal estimation of precipitation over a region is essential to the modeling of hydrologic processes for water resources management. The changes of magnitude and space-time heterogeneity of rainfall observations make space-time estimation of precipitation a challenging task. In this paper we propose a Box-Cox transformed hierarchical Bayesian multivariate spatio-temporal interpolation method for the skewed response variable. The proposed method is applied to estimate space-time monthly precipitation in the monsoon periods during 1974-2000, and 27-year monthly average precipitation data are obtained from 51 stations in Pakistan. The results of transformed hierarchical Bayesian multivariate spatio-temporal interpolation are compared to those of non-transformed hierarchical Bayesian interpolation by using cross-validation. The software developed by [11] is used for Bayesian non-stationary multivariate space-time interpolation. It is observed that the transformed hierarchical Bayesian method provides more accuracy than the non-transformed hierarchical Bayesian method.

  13. Application of multivariate Gaussian detection theory to known non-Gaussian probability density functions

    NASA Astrophysics Data System (ADS)

    Schwartz, Craig R.; Thelen, Brian J.; Kenton, Arthur C.

    1995-06-01

    A statistical parametric multispectral sensor performance model was developed by ERIM to support mine field detection studies, multispectral sensor design/performance trade-off studies, and target detection algorithm development. The model assumes target detection algorithms and their performance models which are based on data assumed to obey multivariate Gaussian probability distribution functions (PDFs). The applicability of these algorithms and performance models can be generalized to data having non-Gaussian PDFs through the use of transforms which convert non-Gaussian data to Gaussian (or near-Gaussian) data. An example of one such transform is the Box-Cox power law transform. In practice, such a transform can be applied to non-Gaussian data prior to the introduction of a detection algorithm that is formally based on the assumption of multivariate Gaussian data. This paper presents an extension of these techniques to the case where the joint multivariate probability density function of the non-Gaussian input data is known, and where the joint estimate of the multivariate Gaussian statistics, under the Box-Cox transform, is desired. The jointly estimated multivariate Gaussian statistics can then be used to predict the performance of a target detection algorithm which has an associated Gaussian performance model.

  14. A sparse reconstruction method for the estimation of multi-resolution emission fields via atmospheric inversion

    DOE PAGES

    Ray, J.; Lee, J.; Yadav, V.; ...

    2015-04-29

    Atmospheric inversions are frequently used to estimate fluxes of atmospheric greenhouse gases (e.g., biospheric CO 2 flux fields) at Earth's surface. These inversions typically assume that flux departures from a prior model are spatially smoothly varying, which are then modeled using a multi-variate Gaussian. When the field being estimated is spatially rough, multi-variate Gaussian models are difficult to construct and a wavelet-based field model may be more suitable. Unfortunately, such models are very high dimensional and are most conveniently used when the estimation method can simultaneously perform data-driven model simplification (removal of model parameters that cannot be reliably estimated) andmore » fitting. Such sparse reconstruction methods are typically not used in atmospheric inversions. In this work, we devise a sparse reconstruction method, and illustrate it in an idealized atmospheric inversion problem for the estimation of fossil fuel CO 2 (ffCO 2) emissions in the lower 48 states of the USA. Our new method is based on stagewise orthogonal matching pursuit (StOMP), a method used to reconstruct compressively sensed images. Our adaptations bestow three properties to the sparse reconstruction procedure which are useful in atmospheric inversions. We have modified StOMP to incorporate prior information on the emission field being estimated and to enforce non-negativity on the estimated field. Finally, though based on wavelets, our method allows for the estimation of fields in non-rectangular geometries, e.g., emission fields inside geographical and political boundaries. Our idealized inversions use a recently developed multi-resolution (i.e., wavelet-based) random field model developed for ffCO 2 emissions and synthetic observations of ffCO 2 concentrations from a limited set of measurement sites. We find that our method for limiting the estimated field within an irregularly shaped region is about a factor of 10 faster than conventional approaches. It also reduces the overall computational cost by a factor of 2. Further, the sparse reconstruction scheme imposes non-negativity without introducing strong nonlinearities, such as those introduced by employing log-transformed fields, and thus reaps the benefits of simplicity and computational speed that are characteristic of linear inverse problems.« less

  15. A sparse reconstruction method for the estimation of multi-resolution emission fields via atmospheric inversion

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Ray, J.; Lee, J.; Yadav, V.

    Atmospheric inversions are frequently used to estimate fluxes of atmospheric greenhouse gases (e.g., biospheric CO 2 flux fields) at Earth's surface. These inversions typically assume that flux departures from a prior model are spatially smoothly varying, which are then modeled using a multi-variate Gaussian. When the field being estimated is spatially rough, multi-variate Gaussian models are difficult to construct and a wavelet-based field model may be more suitable. Unfortunately, such models are very high dimensional and are most conveniently used when the estimation method can simultaneously perform data-driven model simplification (removal of model parameters that cannot be reliably estimated) andmore » fitting. Such sparse reconstruction methods are typically not used in atmospheric inversions. In this work, we devise a sparse reconstruction method, and illustrate it in an idealized atmospheric inversion problem for the estimation of fossil fuel CO 2 (ffCO 2) emissions in the lower 48 states of the USA. Our new method is based on stagewise orthogonal matching pursuit (StOMP), a method used to reconstruct compressively sensed images. Our adaptations bestow three properties to the sparse reconstruction procedure which are useful in atmospheric inversions. We have modified StOMP to incorporate prior information on the emission field being estimated and to enforce non-negativity on the estimated field. Finally, though based on wavelets, our method allows for the estimation of fields in non-rectangular geometries, e.g., emission fields inside geographical and political boundaries. Our idealized inversions use a recently developed multi-resolution (i.e., wavelet-based) random field model developed for ffCO 2 emissions and synthetic observations of ffCO 2 concentrations from a limited set of measurement sites. We find that our method for limiting the estimated field within an irregularly shaped region is about a factor of 10 faster than conventional approaches. It also reduces the overall computational cost by a factor of 2. Further, the sparse reconstruction scheme imposes non-negativity without introducing strong nonlinearities, such as those introduced by employing log-transformed fields, and thus reaps the benefits of simplicity and computational speed that are characteristic of linear inverse problems.« less

  16. Network meta-analysis of multiple outcome measures accounting for borrowing of information across outcomes

    PubMed Central

    2014-01-01

    Background Network meta-analysis (NMA) enables simultaneous comparison of multiple treatments while preserving randomisation. When summarising evidence to inform an economic evaluation, it is important that the analysis accurately reflects the dependency structure within the data, as correlations between outcomes may have implication for estimating the net benefit associated with treatment. A multivariate NMA offers a framework for evaluating multiple treatments across multiple outcome measures while accounting for the correlation structure between outcomes. Methods The standard NMA model is extended to multiple outcome settings in two stages. In the first stage, information is borrowed across outcomes as well across studies through modelling the within-study and between-study correlation structure. In the second stage, we make use of the additional assumption that intervention effects are exchangeable between outcomes to predict effect estimates for all outcomes, including effect estimates on outcomes where evidence is either sparse or the treatment had not been considered by any one of the studies included in the analysis. We apply the methods to binary outcome data from a systematic review evaluating the effectiveness of nine home safety interventions on uptake of three poisoning prevention practices (safe storage of medicines, safe storage of other household products, and possession of poison centre control telephone number) in households with children. Analyses are conducted in WinBUGS using Markov Chain Monte Carlo (MCMC) simulations. Results Univariate and the first stage multivariate models produced broadly similar point estimates of intervention effects but the uncertainty around the multivariate estimates varied depending on the prior distribution specified for the between-study covariance structure. The second stage multivariate analyses produced more precise effect estimates while enabling intervention effects to be predicted for all outcomes, including intervention effects on outcomes not directly considered by the studies included in the analysis. Conclusions Accounting for the dependency between outcomes in a multivariate meta-analysis may or may not improve the precision of effect estimates from a network meta-analysis compared to analysing each outcome separately. PMID:25047164

  17. The effects of driving age, driver education, and curfew laws on traffic fatalities of 15-17 year olds.

    PubMed

    Levy, D T

    1988-12-01

    This study examines the effect of state driving age, learning permit, driver's education, and curfew laws on 15-17-year-old driver fatality rates. A multivariate regression model is estimated for 47 states and nine years. The minimum legal driving age and curfew laws are found to be important determinants of fatalities. Driver's education and learning permits have smaller effects. The relationship between rates of licensure and driving age, education, and curfew laws is also examined. In each case, a more restrictive policy is found to reduce licensure of 15-17 year olds. The results suggest that the imposition of curfew laws and higher minimum driving ages are particularly effective traffic safety policies.

  18. A refined method for multivariate meta-analysis and meta-regression.

    PubMed

    Jackson, Daniel; Riley, Richard D

    2014-02-20

    Making inferences about the average treatment effect using the random effects model for meta-analysis is problematic in the common situation where there is a small number of studies. This is because estimates of the between-study variance are not precise enough to accurately apply the conventional methods for testing and deriving a confidence interval for the average effect. We have found that a refined method for univariate meta-analysis, which applies a scaling factor to the estimated effects' standard error, provides more accurate inference. We explain how to extend this method to the multivariate scenario and show that our proposal for refined multivariate meta-analysis and meta-regression can provide more accurate inferences than the more conventional approach. We explain how our proposed approach can be implemented using standard output from multivariate meta-analysis software packages and apply our methodology to two real examples. Copyright © 2013 John Wiley & Sons, Ltd.

  19. Bayesian state space models for dynamic genetic network construction across multiple tissues.

    PubMed

    Liang, Yulan; Kelemen, Arpad

    2016-08-01

    Construction of gene-gene interaction networks and potential pathways is a challenging and important problem in genomic research for complex diseases while estimating the dynamic changes of the temporal correlations and non-stationarity are the keys in this process. In this paper, we develop dynamic state space models with hierarchical Bayesian settings to tackle this challenge for inferring the dynamic profiles and genetic networks associated with disease treatments. We treat both the stochastic transition matrix and the observation matrix time-variant and include temporal correlation structures in the covariance matrix estimations in the multivariate Bayesian state space models. The unevenly spaced short time courses with unseen time points are treated as hidden state variables. Hierarchical Bayesian approaches with various prior and hyper-prior models with Monte Carlo Markov Chain and Gibbs sampling algorithms are used to estimate the model parameters and the hidden state variables. We apply the proposed Hierarchical Bayesian state space models to multiple tissues (liver, skeletal muscle, and kidney) Affymetrix time course data sets following corticosteroid (CS) drug administration. Both simulation and real data analysis results show that the genomic changes over time and gene-gene interaction in response to CS treatment can be well captured by the proposed models. The proposed dynamic Hierarchical Bayesian state space modeling approaches could be expanded and applied to other large scale genomic data, such as next generation sequence (NGS) combined with real time and time varying electronic health record (EHR) for more comprehensive and robust systematic and network based analysis in order to transform big biomedical data into predictions and diagnostics for precision medicine and personalized healthcare with better decision making and patient outcomes.

  20. Job insecurity and risk of diabetes: a meta-analysis of individual participant data.

    PubMed

    Ferrie, Jane E; Virtanen, Marianna; Jokela, Markus; Madsen, Ida E H; Heikkilä, Katriina; Alfredsson, Lars; Batty, G David; Bjorner, Jakob B; Borritz, Marianne; Burr, Hermann; Dragano, Nico; Elovainio, Marko; Fransson, Eleonor I; Knutsson, Anders; Koskenvuo, Markku; Koskinen, Aki; Kouvonen, Anne; Kumari, Meena; Nielsen, Martin L; Nordin, Maria; Oksanen, Tuula; Pahkin, Krista; Pejtersen, Jan H; Pentti, Jaana; Salo, Paula; Shipley, Martin J; Suominen, Sakari B; Tabák, Adam; Theorell, Töres; Väänänen, Ari; Vahtera, Jussi; Westerholm, Peter J M; Westerlund, Hugo; Rugulies, Reiner; Nyberg, Solja T; Kivimäki, Mika

    2016-12-06

    Job insecurity has been associated with certain health outcomes. We examined the role of job insecurity as a risk factor for incident diabetes. We used individual participant data from 8 cohort studies identified in 2 open-access data archives and 11 cohort studies participating in the Individual-Participant-Data Meta-analysis in Working Populations Consortium. We calculated study-specific estimates of the association between job insecurity reported at baseline and incident diabetes over the follow-up period. We pooled the estimates in a meta-analysis to produce a summary risk estimate. The 19 studies involved 140 825 participants from Australia, Europe and the United States, with a mean follow-up of 9.4 years and 3954 incident cases of diabetes. In the preliminary analysis adjusted for age and sex, high job insecurity was associated with an increased risk of incident diabetes compared with low job insecurity (adjusted odds ratio [OR] 1.19, 95% confidence interval [CI] 1.09-1.30). In the multivariable-adjusted analysis restricted to 15 studies with baseline data for all covariates (age, sex, socioeconomic status, obesity, physical activity, alcohol and smoking), the association was slightly attenuated (adjusted OR 1.12, 95% CI 1.01-1.24). Heterogeneity between the studies was low to moderate (age- and sex-adjusted model: I 2 = 24%, p = 0.2; multivariable-adjusted model: I 2 = 27%, p = 0.2). In the multivariable-adjusted analysis restricted to high-quality studies, in which the diabetes diagnosis was ascertained from electronic medical records or clinical examination, the association was similar to that in the main analysis (adjusted OR 1.19, 95% CI 1.04-1.35). Our findings suggest that self-reported job insecurity is associated with a modest increased risk of incident diabetes. Health care personnel should be aware of this association among workers reporting job insecurity. © 2016 Canadian Medical Association or its licensors.

  1. Pyrogenic carbon distribution in mineral topsoils of the northeastern United States

    USGS Publications Warehouse

    Jauss, Verena; Sullivan, Patrick J.; Sanderman, Jonathan; Smith, David; Lehmann, Johannes

    2017-01-01

    Due to its slow turnover rates in soil, pyrogenic carbon (PyC) is considered an important C pool and relevant to climate change processes. Therefore, the amounts of soil PyC were compared to environmental covariates over an area of 327,757 km2 in the northeastern United States in order to understand the controls on PyC distribution over large areas. Topsoil (defined as the soil A horizon, after removal of any organic horizons) samples were collected at 165 field sites in a generalised random tessellation stratified design that corresponded to approximately 1 site per 1600 km2 and PyC was estimated from diffuse reflectance mid-infrared spectroscopy measurements using a partial least-squares regression analysis in conjunction with a large database of PyC measurements based on a solid-state 13C nuclear magnetic resonance spectroscopy technique. Three spatial models were applied to the data in order to relate critical environmental covariates to the changes in spatial density of PyC over the landscape. Regional mean density estimates of PyC were 11.0 g kg− 1 (0.84 Gg km− 2) for Ordinary Kriging, 25.8 g kg− 1(12.2 Gg km− 2) for Multivariate Linear Regression, and 26.1 g kg− 1 (12.4 Gg km− 2) for Bayesian Regression Kriging. Akaike Information Criterion (AIC) indicated that the Multivariate Linear Regression model performed best (AIC = 842.6; n = 165) compared to Ordinary Kriging (AIC = 982.4) and Bayesian Regression Kriging (AIC = 979.2). Soil PyC concentrations correlated well with total soil sulphur (P < 0.001; n = 165), plant tissue lignin (P = 0.003), and drainage class (P = 0.008). This suggests the opportunity of including related environmental parameters in the spatial assessment of PyC in soils. Better estimates of the contribution of PyC to the global carbon cycle will thus also require more accurate assessments of these covariates.

  2. Robust Modal Filtering and Control of the X-56A Model with Simulated Fiber Optic Sensor Failures

    NASA Technical Reports Server (NTRS)

    Suh, Peter M.; Chin, Alexander W.; Marvis, Dimitri N.

    2014-01-01

    The X-56A aircraft is a remotely-piloted aircraft with flutter modes intentionally designed into the flight envelope. The X-56A program must demonstrate flight control while suppressing all unstable modes. A previous X-56A model study demonstrated a distributed-sensing-based active shape and active flutter suppression controller. The controller relies on an estimator which is sensitive to bias. This estimator is improved herein, and a real-time robust estimator is derived and demonstrated on 1530 fiber optic sensors. It is shown in simulation that the estimator can simultaneously reject 230 worst-case fiber optic sensor failures automatically. These sensor failures include locations with high leverage (or importance). To reduce the impact of leverage outliers, concentration based on a Mahalanobis trim criterion is introduced. A redescending M-estimator with Tukey bisquare weights is used to improve location and dispersion estimates within each concentration step in the presence of asymmetry (or leverage). A dynamic simulation is used to compare the concentrated robust estimator to a state-of-the-art real-time robust multivariate estimator. The estimators support a previously-derived mu-optimal shape controller. It is found that during the failure scenario, the concentrated modal estimator keeps the system stable.

  3. Robust Modal Filtering and Control of the X-56A Model with Simulated Fiber Optic Sensor Failures

    NASA Technical Reports Server (NTRS)

    Suh, Peter M.; Chin, Alexander W.; Mavris, Dimitri N.

    2016-01-01

    The X-56A aircraft is a remotely-piloted aircraft with flutter modes intentionally designed into the flight envelope. The X-56A program must demonstrate flight control while suppressing all unstable modes. A previous X-56A model study demonstrated a distributed-sensing-based active shape and active flutter suppression controller. The controller relies on an estimator which is sensitive to bias. This estimator is improved herein, and a real-time robust estimator is derived and demonstrated on 1530 fiber optic sensors. It is shown in simulation that the estimator can simultaneously reject 230 worst-case fiber optic sensor failures automatically. These sensor failures include locations with high leverage (or importance). To reduce the impact of leverage outliers, concentration based on a Mahalanobis trim criterion is introduced. A redescending M-estimator with Tukey bisquare weights is used to improve location and dispersion estimates within each concentration step in the presence of asymmetry (or leverage). A dynamic simulation is used to compare the concentrated robust estimator to a state-of-the-art real-time robust multivariate estimator. The estimators support a previously-derived mu-optimal shape controller. It is found that during the failure scenario, the concentrated modal estimator keeps the system stable.

  4. Resemblance profiles as clustering decision criteria: Estimating statistical power, error, and correspondence for a hypothesis test for multivariate structure.

    PubMed

    Kilborn, Joshua P; Jones, David L; Peebles, Ernst B; Naar, David F

    2017-04-01

    Clustering data continues to be a highly active area of data analysis, and resemblance profiles are being incorporated into ecological methodologies as a hypothesis testing-based approach to clustering multivariate data. However, these new clustering techniques have not been rigorously tested to determine the performance variability based on the algorithm's assumptions or any underlying data structures. Here, we use simulation studies to estimate the statistical error rates for the hypothesis test for multivariate structure based on dissimilarity profiles (DISPROF). We concurrently tested a widely used algorithm that employs the unweighted pair group method with arithmetic mean (UPGMA) to estimate the proficiency of clustering with DISPROF as a decision criterion. We simulated unstructured multivariate data from different probability distributions with increasing numbers of objects and descriptors, and grouped data with increasing overlap, overdispersion for ecological data, and correlation among descriptors within groups. Using simulated data, we measured the resolution and correspondence of clustering solutions achieved by DISPROF with UPGMA against the reference grouping partitions used to simulate the structured test datasets. Our results highlight the dynamic interactions between dataset dimensionality, group overlap, and the properties of the descriptors within a group (i.e., overdispersion or correlation structure) that are relevant to resemblance profiles as a clustering criterion for multivariate data. These methods are particularly useful for multivariate ecological datasets that benefit from distance-based statistical analyses. We propose guidelines for using DISPROF as a clustering decision tool that will help future users avoid potential pitfalls during the application of methods and the interpretation of results.

  5. Multivariate dynamic Tobit models with lagged observed dependent variables: An effectiveness analysis of highway safety laws.

    PubMed

    Dong, Chunjiao; Xie, Kun; Zeng, Jin; Li, Xia

    2018-04-01

    Highway safety laws aim to influence driver behaviors so as to reduce the frequency and severity of crashes, and their outcomes. For one specific highway safety law, it would have different effects on the crashes across severities. Understanding such effects can help policy makers upgrade current laws and hence improve traffic safety. To investigate the effects of highway safety laws on crashes across severities, multivariate models are needed to account for the interdependency issues in crash counts across severities. Based on the characteristics of the dependent variables, multivariate dynamic Tobit (MVDT) models are proposed to analyze crash counts that are aggregated at the state level. Lagged observed dependent variables are incorporated into the MVDT models to account for potential temporal correlation issues in crash data. The state highway safety law related factors are used as the explanatory variables and socio-demographic and traffic factors are used as the control variables. Three models, a MVDT model with lagged observed dependent variables, a MVDT model with unobserved random variables, and a multivariate static Tobit (MVST) model are developed and compared. The results show that among the investigated models, the MVDT models with lagged observed dependent variables have the best goodness-of-fit. The findings indicate that, compared to the MVST, the MVDT models have better explanatory power and prediction accuracy. The MVDT model with lagged observed variables can better handle the stochasticity and dependency in the temporal evolution of the crash counts and the estimated values from the model are closer to the observed values. The results show that more lives could be saved if law enforcement agencies can make a sustained effort to educate the public about the importance of motorcyclists wearing helmets. Motor vehicle crash-related deaths, injuries, and property damages could be reduced if states enact laws for stricter text messaging rules, higher speeding fines, older licensing age, and stronger graduated licensing provisions. Injury and PDO crashes would be significantly reduced with stricter laws prohibiting the use of hand-held communication devices and higher fines for drunk driving. Copyright © 2018 Elsevier Ltd. All rights reserved.

  6. Multivariate meta-analysis: a robust approach based on the theory of U-statistic.

    PubMed

    Ma, Yan; Mazumdar, Madhu

    2011-10-30

    Meta-analysis is the methodology for combining findings from similar research studies asking the same question. When the question of interest involves multiple outcomes, multivariate meta-analysis is used to synthesize the outcomes simultaneously taking into account the correlation between the outcomes. Likelihood-based approaches, in particular restricted maximum likelihood (REML) method, are commonly utilized in this context. REML assumes a multivariate normal distribution for the random-effects model. This assumption is difficult to verify, especially for meta-analysis with small number of component studies. The use of REML also requires iterative estimation between parameters, needing moderately high computation time, especially when the dimension of outcomes is large. A multivariate method of moments (MMM) is available and is shown to perform equally well to REML. However, there is a lack of information on the performance of these two methods when the true data distribution is far from normality. In this paper, we propose a new nonparametric and non-iterative method for multivariate meta-analysis on the basis of the theory of U-statistic and compare the properties of these three procedures under both normal and skewed data through simulation studies. It is shown that the effect on estimates from REML because of non-normal data distribution is marginal and that the estimates from MMM and U-statistic-based approaches are very similar. Therefore, we conclude that for performing multivariate meta-analysis, the U-statistic estimation procedure is a viable alternative to REML and MMM. Easy implementation of all three methods are illustrated by their application to data from two published meta-analysis from the fields of hip fracture and periodontal disease. We discuss ideas for future research based on U-statistic for testing significance of between-study heterogeneity and for extending the work to meta-regression setting. Copyright © 2011 John Wiley & Sons, Ltd.

  7. Gendered Social Learning, Nonfamily Institutions, and Attitudes About Recourse After Partner Violence.

    PubMed

    Yount, Kathryn M; Krause, Kathleen H

    2017-01-01

    To provide the first study in Vietnam of how gendered social learning about violence and exposure to non-family institutions influence women's attitudes about a wife's recourse after physical IPV. A probability sample of 532 married women, ages 18-50 years, was surveyed in July-August, 2012 in Mỹ Hào district. We fit a multivariate linear regression model to estimate correlates of favoring recourse in six situations using a validated attitudinal scale. We split attitudes towards recourse into three subscales (disfavor silence, favor informal recourse, favor formal recourse) and fit one multivariate ordinal logistic regression model for each behavior to estimate correlates of favoring recourse. On average, women favored recourse in 2.8 situations. Women who were older and had witnessed physical IPV in childhood had less favorable attitudes about recourse. Women who were hit as children, had completed more schooling, worked outside agriculture, and had sought recourse after IPV had more favorable attitudes about recourse. Normative change among women may require efforts to curb family violence, counsel those exposed to violence in childhood, and enhance women's opportunities for higher schooling and non-agricultural wage work. The state and organizations working on IPV might overcome pockets of unfavorable public opinion by enforcing accountability for IPV rather than seeking to alter ideas about recourse among women.

  8. Association of cutaneous melanoma incidence with area-based socioeconomic indicators-United States, 2004-2006.

    PubMed

    Singh, Simple D; Ajani, Umed A; Johnson, Christopher J; Roland, Katherine B; Eide, Melody; Jemal, Ahmedin; Negoita, Serban; Bayakly, Rana A; Ekwueme, Donatus U

    2011-11-01

    Socioeconomic status (SES) has been associated with melanoma incidence and outcomes. Examination of the relationship between melanoma and SES at the national level in the United States is limited. Expanding knowledge of this association is needed to improve early detection and eliminate disparities. We sought to provide a detailed description of cutaneous melanoma incidence and stage of disease in relationship to area-based socioeconomic measures including poverty level, education, income, and unemployment in the United States. Invasive cutaneous melanoma data reported by 44 population-based central cancer registries for 2004 to 2006 were merged with county-level SES estimates from the US Census Bureau. Age-adjusted incidence rates were calculated by gender, race/ethnicity, poverty, education, income, unemployment, and metro/urban/rural status using software. Poisson multilevel mixed models were fitted, and incidence density ratios were calculated by stage for area-based SES measures, controlling for age, gender, and state random effects. Counties with lower poverty, higher education, higher income, and lower unemployment had higher age-adjusted melanoma incidence rates for both early and late stage. In multivariate models, SES effects persisted for early-stage but not late-stage melanoma incidence. Individual-level measures of SES were unavailable, and estimates were based on county-level SES measures. Our findings show that melanoma incidence in the United States is associated with aggregate county-level measures of high SES. Analyses using finer-level SES measures, such as individual or census tract level, are needed to provide more precise estimates of these associations. Copyright © 2011 American Academy of Dermatology, Inc. Published by Mosby, Inc. All rights reserved.

  9. Divergences and estimating tight bounds on Bayes error with applications to multivariate Gaussian copula and latent Gaussian copula

    NASA Astrophysics Data System (ADS)

    Thelen, Brian J.; Xique, Ismael J.; Burns, Joseph W.; Goley, G. Steven; Nolan, Adam R.; Benson, Jonathan W.

    2017-04-01

    In Bayesian decision theory, there has been a great amount of research into theoretical frameworks and information- theoretic quantities that can be used to provide lower and upper bounds for the Bayes error. These include well-known bounds such as Chernoff, Battacharrya, and J-divergence. Part of the challenge of utilizing these various metrics in practice is (i) whether they are "loose" or "tight" bounds, (ii) how they might be estimated via either parametric or non-parametric methods, and (iii) how accurate the estimates are for limited amounts of data. In general what is desired is a methodology for generating relatively tight lower and upper bounds, and then an approach to estimate these bounds efficiently from data. In this paper, we explore the so-called triangle divergence which has been around for a while, but was recently made more prominent in some recent research on non-parametric estimation of information metrics. Part of this work is motivated by applications for quantifying fundamental information content in SAR/LIDAR data, and to help in this, we have developed a flexible multivariate modeling framework based on multivariate Gaussian copula models which can be combined with the triangle divergence framework to quantify this information, and provide approximate bounds on Bayes error. In this paper we present an overview of the bounds, including those based on triangle divergence and verify that under a number of multivariate models, the upper and lower bounds derived from triangle divergence are significantly tighter than the other common bounds, and often times, dramatically so. We also propose some simple but effective means for computing the triangle divergence using Monte Carlo methods, and then discuss estimation of the triangle divergence from empirical data based on Gaussian Copula models.

  10. Sampling effort affects multivariate comparisons of stream assemblages

    USGS Publications Warehouse

    Cao, Y.; Larsen, D.P.; Hughes, R.M.; Angermeier, P.L.; Patton, T.M.

    2002-01-01

    Multivariate analyses are used widely for determining patterns of assemblage structure, inferring species-environment relationships and assessing human impacts on ecosystems. The estimation of ecological patterns often depends on sampling effort, so the degree to which sampling effort affects the outcome of multivariate analyses is a concern. We examined the effect of sampling effort on site and group separation, which was measured using a mean similarity method. Two similarity measures, the Jaccard Coefficient and Bray-Curtis Index were investigated with 1 benthic macroinvertebrate and 2 fish data sets. Site separation was significantly improved with increased sampling effort because the similarity between replicate samples of a site increased more rapidly than between sites. Similarly, the faster increase in similarity between sites of the same group than between sites of different groups caused clearer separation between groups. The strength of site and group separation completely stabilized only when the mean similarity between replicates reached 1. These results are applicable to commonly used multivariate techniques such as cluster analysis and ordination because these multivariate techniques start with a similarity matrix. Completely stable outcomes of multivariate analyses are not feasible. Instead, we suggest 2 criteria for estimating the stability of multivariate analyses of assemblage data: 1) mean within-site similarity across all sites compared, indicating sample representativeness, and 2) the SD of within-site similarity across sites, measuring sample comparability.

  11. Geographic variation and risk of skin cancer in US women. Differences between melanoma, squamous cell carcinoma, and basal cell carcinoma.

    PubMed

    Qureshi, Abrar A; Laden, Francine; Colditz, Graham A; Hunter, David J

    2008-03-10

    Occurrences of melanoma, squamous cell carcinoma (SCC), and basal cell carcinoma (BCC) have been associated with varying geography. Our goal was to evaluate differences in risk of these skin cancers according to residence at varying UV indices at 3 time points. Prospective 1984-2002 study of 84 836 female nurses who lived in different UV index regions of the United States at birth and at 15 or 30 years of age. The outcome measure was diagnosis of melanoma, SCC, or BCC. During the 18-year study, 420 cases of melanoma, 863 cases of SCC, and 8215 cases of BCC occurred. At 30 years of age, age-adjusted risks for SCC were 1.47 (95% confidence interval [CI], 1.22-1.76) and 1.90 (95% CI, 1.51-2.36) for women residing in states with a UV index of 6 (medium) and 7 or more (high), respectively. Although elevated, the age-adjusted risk of BCC at 30 years of age associated with residence in these states was substantially less. Although the risk of melanoma was not elevated for women living in these states at 30 years of age, it was significantly elevated among women living in states with UV indices of 6 at birth and at 15 years of age. There was no material change in risk estimates with multivariate adjustment. For women who reported living in states with UV indices of 7 or more at all 3 time points, the multivariate risk of SCC was highest. The risk of SCC is independently affected by residence in locations with medium and high UV indices; the gradient of risk is weaker for BCC; and the risk of melanoma does not change significantly across this gradient.

  12. Utah Cancer Survivors: A Comprehensive Comparison of Health-Related Outcomes Between Survivors and Individuals Without a History of Cancer.

    PubMed

    Fowler, Brynn; Ding, Qian; Pappas, Lisa; Wu, Yelena P; Linder, Lauri; Yancey, Jeff; Wright, Jennifer; Clayton, Margaret; Kepka, Deanna; Kirchhoff, Anne C

    2018-02-01

    Assessments of cancer survivors' health-related needs are often limited to national estimates. State-specific information is vital to inform state comprehensive cancer control efforts developed to support patients and providers. We investigated demographics, health status/quality of life, health behaviors, and health care characteristics of long-term Utah cancer survivors compared to Utahans without a history of cancer. Utah Behavioral Risk Factor Surveillance System (BRFSS) 2009 and 2010 data were used. Individuals diagnosed with cancer within the past 5 years were excluded. Multivariable survey weighted logistic regressions and computed predictive marginals were used to estimate age-adjusted percentages and 95 % confidence intervals (CI). A total of 11,320 eligible individuals (727 cancer survivors, 10,593 controls) were included. Respondents were primarily non-Hispanic White (95.3 % of survivors, 84.1 % of controls). Survivors were older (85 % of survivors ≥40 years of age vs. 47 % of controls). Survivors reported the majority of their cancer survivorship care was managed by primary care physicians or non-cancer specialists (93.5 %, 95 % CI = 87.9-99.1). Furthermore, 71.1 % (95 % CI = 59.2-82.9) of survivors reported that they did not receive a cancer treatment summary. In multivariable estimates, fair/poor general health was more common among survivors compared to controls (17.8 %, 95 % CI = 12.5-23.1 vs. 14.2 %, 95 % CI = 12.4-16.0). Few survivors in Utah receive follow-up care from a cancer specialist. Provider educational efforts are needed to promote knowledge of cancer survivor issues. Efforts should be made to improve continuity in follow-up care that addresses the known issues of long-term survivors that preclude optimal quality of life, resulting in a patient-centered approach to survivorship.

  13. Dynamic connectivity regression: Determining state-related changes in brain connectivity

    PubMed Central

    Cribben, Ivor; Haraldsdottir, Ragnheidur; Atlas, Lauren Y.; Wager, Tor D.; Lindquist, Martin A.

    2014-01-01

    Most statistical analyses of fMRI data assume that the nature, timing and duration of the psychological processes being studied are known. However, often it is hard to specify this information a priori. In this work we introduce a data-driven technique for partitioning the experimental time course into distinct temporal intervals with different multivariate functional connectivity patterns between a set of regions of interest (ROIs). The technique, called Dynamic Connectivity Regression (DCR), detects temporal change points in functional connectivity and estimates a graph, or set of relationships between ROIs, for data in the temporal partition that falls between pairs of change points. Hence, DCR allows for estimation of both the time of change in connectivity and the connectivity graph for each partition, without requiring prior knowledge of the nature of the experimental design. Permutation and bootstrapping methods are used to perform inference on the change points. The method is applied to various simulated data sets as well as to an fMRI data set from a study (N=26) of a state anxiety induction using a socially evaluative threat challenge. The results illustrate the method’s ability to observe how the networks between different brain regions changed with subjects’ emotional state. PMID:22484408

  14. An improved method for bivariate meta-analysis when within-study correlations are unknown.

    PubMed

    Hong, Chuan; D Riley, Richard; Chen, Yong

    2018-03-01

    Multivariate meta-analysis, which jointly analyzes multiple and possibly correlated outcomes in a single analysis, is becoming increasingly popular in recent years. An attractive feature of the multivariate meta-analysis is its ability to account for the dependence between multiple estimates from the same study. However, standard inference procedures for multivariate meta-analysis require the knowledge of within-study correlations, which are usually unavailable. This limits standard inference approaches in practice. Riley et al proposed a working model and an overall synthesis correlation parameter to account for the marginal correlation between outcomes, where the only data needed are those required for a separate univariate random-effects meta-analysis. As within-study correlations are not required, the Riley method is applicable to a wide variety of evidence synthesis situations. However, the standard variance estimator of the Riley method is not entirely correct under many important settings. As a consequence, the coverage of a function of pooled estimates may not reach the nominal level even when the number of studies in the multivariate meta-analysis is large. In this paper, we improve the Riley method by proposing a robust variance estimator, which is asymptotically correct even when the model is misspecified (ie, when the likelihood function is incorrect). Simulation studies of a bivariate meta-analysis, in a variety of settings, show a function of pooled estimates has improved performance when using the proposed robust variance estimator. In terms of individual pooled estimates themselves, the standard variance estimator and robust variance estimator give similar results to the original method, with appropriate coverage. The proposed robust variance estimator performs well when the number of studies is relatively large. Therefore, we recommend the use of the robust method for meta-analyses with a relatively large number of studies (eg, m≥50). When the sample size is relatively small, we recommend the use of the robust method under the working independence assumption. We illustrate the proposed method through 2 meta-analyses. Copyright © 2017 John Wiley & Sons, Ltd.

  15. A multivariate extension of mutual information for growing neural networks.

    PubMed

    Ball, Kenneth R; Grant, Christopher; Mundy, William R; Shafer, Timothy J

    2017-11-01

    Recordings of neural network activity in vitro are increasingly being used to assess the development of neural network activity and the effects of drugs, chemicals and disease states on neural network function. The high-content nature of the data derived from such recordings can be used to infer effects of compounds or disease states on a variety of important neural functions, including network synchrony. Historically, synchrony of networks in vitro has been assessed either by determination of correlation coefficients (e.g. Pearson's correlation), by statistics estimated from cross-correlation histograms between pairs of active electrodes, and/or by pairwise mutual information and related measures. The present study examines the application of Normalized Multiinformation (NMI) as a scalar measure of shared information content in a multivariate network that is robust with respect to changes in network size. Theoretical simulations are designed to investigate NMI as a measure of complexity and synchrony in a developing network relative to several alternative approaches. The NMI approach is applied to these simulations and also to data collected during exposure of in vitro neural networks to neuroactive compounds during the first 12 days in vitro, and compared to other common measures, including correlation coefficients and mean firing rates of neurons. NMI is shown to be more sensitive to developmental effects than first order synchronous and nonsynchronous measures of network complexity. Finally, NMI is a scalar measure of global (rather than pairwise) mutual information in a multivariate network, and hence relies on less assumptions for cross-network comparisons than historical approaches. Copyright © 2017 Elsevier Ltd. All rights reserved.

  16. A joint modeling and estimation method for multivariate longitudinal data with mixed types of responses to analyze physical activity data generated by accelerometers.

    PubMed

    Li, Haocheng; Zhang, Yukun; Carroll, Raymond J; Keadle, Sarah Kozey; Sampson, Joshua N; Matthews, Charles E

    2017-11-10

    A mixed effect model is proposed to jointly analyze multivariate longitudinal data with continuous, proportion, count, and binary responses. The association of the variables is modeled through the correlation of random effects. We use a quasi-likelihood type approximation for nonlinear variables and transform the proposed model into a multivariate linear mixed model framework for estimation and inference. Via an extension to the EM approach, an efficient algorithm is developed to fit the model. The method is applied to physical activity data, which uses a wearable accelerometer device to measure daily movement and energy expenditure information. Our approach is also evaluated by a simulation study. Copyright © 2017 John Wiley & Sons, Ltd.

  17. A multivariate fall risk assessment model for VHA nursing homes using the minimum data set.

    PubMed

    French, Dustin D; Werner, Dennis C; Campbell, Robert R; Powell-Cope, Gail M; Nelson, Audrey L; Rubenstein, Laurence Z; Bulat, Tatjana; Spehar, Andrea M

    2007-02-01

    The purpose of this study was to develop a multivariate fall risk assessment model beyond the current fall Resident Assessment Protocol (RAP) triggers for nursing home residents using the Minimum Data Set (MDS). Retrospective, clustered secondary data analysis. National Veterans Health Administration (VHA) long-term care nursing homes (N = 136). The study population consisted of 6577 national VHA nursing home residents who had an annual assessment during FY 2005, identified from the MDS, as well as an earlier annual or admission assessment within a 1-year look-back period. A dichotomous multivariate model of nursing home residents coded with a fall on selected fall risk characteristics from the MDS, estimated with general estimation equations (GEE). There were 17 170 assessments corresponding to 6577 long-term care nursing home residents. The increased odds ratio (OR) of being classified as a faller relative to the omitted "dependent" category of activities of daily living (ADL) ranged from OR = 1.35 for "limited" ADL category up to OR = 1.57 for "extensive-2" ADL (P < .0001). Unsteady gait more than doubles the odds of being a faller (OR = 2.63, P < .0001). The use of assistive devices such as canes, walkers, or crutches, or the use of wheelchairs increases the odds of being a faller (OR = 1.17, P < .0005) or (OR = 1.19, P < .0002), respectively. Foot problems may also increase the odds of being a faller (OR = 1.26, P < .0016). Alzheimer's or other dementias also increase the odds of being classified as a faller (OR = 1.18, P < .0219) or (OR=1.22, P < .0001), respectively. In addition, anger (OR = 1.19, P < .0065); wandering (OR = 1.53, P < .0001); or use of antipsychotic medications (OR = 1.15, P < .0039), antianxiety medications (OR = 1.13, P < .0323), or antidepressant medications (OR = 1.39, P < .0001) was also associated with the odds of being a faller. This national study in one of the largest managed healthcare systems in the United States has empirically confirmed the relative importance of certain risk factors for falls in long-term care settings. The model incorporated an ADL index and adjusted for case mix by including only long-term care nursing home residents. The study offers clinicians practical estimates by combining multiple univariate MDS elements in an empirically based, multivariate fall risk assessment model.

  18. Advancing the detection of steady-state visual evoked potentials in brain-computer interfaces.

    PubMed

    Abu-Alqumsan, Mohammad; Peer, Angelika

    2016-06-01

    Spatial filtering has proved to be a powerful pre-processing step in detection of steady-state visual evoked potentials and boosted typical detection rates both in offline analysis and online SSVEP-based brain-computer interface applications. State-of-the-art detection methods and the spatial filters used thereby share many common foundations as they all build upon the second order statistics of the acquired Electroencephalographic (EEG) data, that is, its spatial autocovariance and cross-covariance with what is assumed to be a pure SSVEP response. The present study aims at highlighting the similarities and differences between these methods. We consider the canonical correlation analysis (CCA) method as a basis for the theoretical and empirical (with real EEG data) analysis of the state-of-the-art detection methods and the spatial filters used thereby. We build upon the findings of this analysis and prior research and propose a new detection method (CVARS) that combines the power of the canonical variates and that of the autoregressive spectral analysis in estimating the signal and noise power levels. We found that the multivariate synchronization index method and the maximum contrast combination method are variations of the CCA method. All three methods were found to provide relatively unreliable detections in low signal-to-noise ratio (SNR) regimes. CVARS and the minimum energy combination methods were found to provide better estimates for different SNR levels. Our theoretical and empirical results demonstrate that the proposed CVARS method outperforms other state-of-the-art detection methods when used in an unsupervised fashion. Furthermore, when used in a supervised fashion, a linear classifier learned from a short training session is able to estimate the hidden user intention, including the idle state (when the user is not attending to any stimulus), rapidly, accurately and reliably.

  19. Estimated incidence of erythema migrans in five regions of France and ecological correlations with environmental characteristics.

    PubMed

    Mariet, Anne-Sophie; Retel, Olivier; Avocat, Hélène; Serre, Anne; Schapman, Lucie; Schmitt, Marielle; Charron, Martine; Monnet, Elisabeth

    2013-09-01

    While several studies conducted on Lyme borreliosis (LB) risk in the United States showed an association with environmental characteristics, most of European studies considered solely the effect of climate characteristics. The aims of this study were to estimate incidence of erythema migrans (EM) in five regions of France and to analyze associations with several environmental characteristics of the place of residence. LB surveillance networks of general practitioners (GPs) were set up for a period of 2 years in five regions of France. Participating GPs reported all patients with EM during the study period. Data were pooled according to a standardized EM case definition. For each area with a participating GP, age-standardized incidence rates and ratios were estimated. Associations with altitude, indicators of landscape composition, and indicators of landscape configuration were tested with multivariate Poisson regression. Standardized estimated incidence rates of EM per 10(5) person-years were 8.8 [95% confidence interval (CI)=7.9-9.7] in Aquitaine, 40.0 (95% CI 36.4-43.6) in Limousin, 76.0 (95% CI 72.9-79.1) in the three participating départements of Rhône-Alpes, 46.1 (95% CI 43.0-49.2) in Franche-Comté, and 87.7 (95% CI 84.6-90.8) in Alsace. In multivariate analysis, age-adjusted incidence rates increased with the altitude (p<0.0001) and decreased with forest patch density (p<0.0001). The marked variations in EM risk among the five regions were partly related to differences in landscape and environmental characteristics. The latter may point out potential risk areas and provide information for targeting preventive actions.

  20. HIV-1 disease progression during highly active antiretroviral therapy: an application using population-level data in British Columbia: 1996-2011.

    PubMed

    Nosyk, Bohdan; Min, Jeong; Lima, Viviane D; Yip, Benita; Hogg, Robert S; Montaner, Julio S G

    2013-08-15

    Accurately estimating rates of disease progression is of central importance in developing mathematical models used to project outcomes and guide resource allocation decisions. Our objective was to specify a multivariate regression model to estimate changes in disease progression among individuals on highly active antiretroviral treatment in British Columbia, Canada, 1996-2011. We used population-level data on disease progression and antiretroviral treatment utilization from the BC HIV Drug Treatment Program. Disease progression was captured using longitudinal CD4 and plasma viral load testing data, linked with data on antiretroviral treatment. The study outcome was categorized into (CD4 count ≥ 500, 500-350, 350-200, <200 cells/mm, and mortality). A 5-state continuous-time Markov model was used to estimate covariate-specific probabilities of CD4 progression, focusing on temporal changes during the study period. A total of 210,083 CD4 measurements among 7421 individuals with HIV/AIDS were included in the study. Results of the multivariate model suggested that current highly active antiretroviral treatment at baseline, lower baseline CD4 (<200 cells/mm), and extended durations of elevated plasma viral load were each associated with accelerated progression. Immunological improvement was accelerated significantly from 2004 onward, with 23% and 46% increases in the probability of CD4 improvement from the fourth CD4 stratum (CD4 < 200) in 2004-2008 and 2008-2011, respectively. Our results demonstrate the impact of innovations in antiretroviral treatment and treatment delivery at the population level. These results can be used to estimate a transition probability matrix flexible to changes in the observed mix of clients in different clinical stages and treatment regimens over time.

  1. Control design for robust stability in linear regulators: Application to aerospace flight control

    NASA Technical Reports Server (NTRS)

    Yedavalli, R. K.

    1986-01-01

    Time domain stability robustness analysis and design for linear multivariable uncertain systems with bounded uncertainties is the central theme of the research. After reviewing the recently developed upper bounds on the linear elemental (structured), time varying perturbation of an asymptotically stable linear time invariant regulator, it is shown that it is possible to further improve these bounds by employing state transformations. Then introducing a quantitative measure called the stability robustness index, a state feedback conrol design algorithm is presented for a general linear regulator problem and then specialized to the case of modal systems as well as matched systems. The extension of the algorithm to stochastic systems with Kalman filter as the state estimator is presented. Finally an algorithm for robust dynamic compensator design is presented using Parameter Optimization (PO) procedure. Applications in a aircraft control and flexible structure control are presented along with a comparison with other existing methods.

  2. Steady State Fluorescence Spectroscopy for Medical Diagnosis

    NASA Astrophysics Data System (ADS)

    Mahadevan-Jansen, Anita; Gebhart, Steven C.

    Light can react with tissue in different ways and provide information for identifying the physiological state of tissue or detecting the presence of disease. The light used to probe tissue does so in a non-intrusive manner and typically uses very low levels of light far below the requirements for therapeutic applications. The use of fiber optics simplifies the delivery and collection of this light in a minimally invasive manner. Since tissue response is virtually instantaneous, the results are obtained in real-time and the use of data processing techniques and multi-variate statistical analysis allows for automated detection and therefore provides an objective estimation of the tissue state. These then form the fundamental basis for the application of optical techniques for the detection of tissue physiology as well as pathology. These distinct advantages have encouraged many researchers to pursue the development of the different optical interactions for biological and medical detection.

  3. Modelling daily dissolved oxygen concentration using least square support vector machine, multivariate adaptive regression splines and M5 model tree

    NASA Astrophysics Data System (ADS)

    Heddam, Salim; Kisi, Ozgur

    2018-04-01

    In the present study, three types of artificial intelligence techniques, least square support vector machine (LSSVM), multivariate adaptive regression splines (MARS) and M5 model tree (M5T) are applied for modeling daily dissolved oxygen (DO) concentration using several water quality variables as inputs. The DO concentration and water quality variables data from three stations operated by the United States Geological Survey (USGS) were used for developing the three models. The water quality data selected consisted of daily measured of water temperature (TE, °C), pH (std. unit), specific conductance (SC, μS/cm) and discharge (DI cfs), are used as inputs to the LSSVM, MARS and M5T models. The three models were applied for each station separately and compared to each other. According to the results obtained, it was found that: (i) the DO concentration could be successfully estimated using the three models and (ii) the best model among all others differs from one station to another.

  4. Accuracy enhancement of a multivariate calibration for lead determination in soils by laser induced breakdown spectroscopy

    NASA Astrophysics Data System (ADS)

    Zaytsev, Sergey M.; Krylov, Ivan N.; Popov, Andrey M.; Zorov, Nikita B.; Labutin, Timur A.

    2018-02-01

    We have investigated matrix effects and spectral interferences on example of lead determination in different types of soils by laser induced breakdown spectroscopy (LIBS). Comparison between analytical performances of univariate and multivariate calibrations with the use of different laser wavelength for ablation (532, 355 and 266 nm) have been reported. A set of 17 soil samples (Ca-rich, Fe-rich, lean soils etc., 8.5-280 ppm of Pb) was involved into construction of the calibration models. Spectral interferences from main components (Ca, Fe, Ti, Mg) and trace components (Mn, Nb, Zr) were estimated by spectra modeling, and they were a reason for significant differences between the univariate calibration models obtained for a three different soil types (black, red, gray) separately. Implementation of 3rd harmonic of Nd:YAG laser in combination with multivariate calibration model based on PCR with 3 principal components provided the best analytical results: the RMSEC has been lowered down to 8 ppm. The sufficient improvement of the relative uncertainty (up to 5-10%) in comparison with univariate calibration was observed at the Pb concentration level > 50 ppm, while the problem of accuracy still remains for some samples with Pb concentration at the 20 ppm level. We have also discussed a few possible ways to estimate LOD without a blank sample. The most rigorous criterion has resulted in LOD of Pb in soils being 13 ppm. Finally, a good agreement between the values of lead content predicted by LIBS (46 ± 5 ppm) and XRF (42.1 ± 3.3 ppm) in the unknown soil sample from Lomonosov Moscow State University area was demonstrated.

  5. Hot spots of multivariate extreme anomalies in Earth observations

    NASA Astrophysics Data System (ADS)

    Flach, M.; Sippel, S.; Bodesheim, P.; Brenning, A.; Denzler, J.; Gans, F.; Guanche, Y.; Reichstein, M.; Rodner, E.; Mahecha, M. D.

    2016-12-01

    Anomalies in Earth observations might indicate data quality issues, extremes or the change of underlying processes within a highly multivariate system. Thus, considering the multivariate constellation of variables for extreme detection yields crucial additional information over conventional univariate approaches. We highlight areas in which multivariate extreme anomalies are more likely to occur, i.e. hot spots of extremes in global atmospheric Earth observations that impact the Biosphere. In addition, we present the year of the most unusual multivariate extreme between 2001 and 2013 and show that these coincide with well known high impact extremes. Technically speaking, we account for multivariate extremes by using three sophisticated algorithms adapted from computer science applications. Namely an ensemble of the k-nearest neighbours mean distance, a kernel density estimation and an approach based on recurrences is used. However, the impact of atmosphere extremes on the Biosphere might largely depend on what is considered to be normal, i.e. the shape of the mean seasonal cycle and its inter-annual variability. We identify regions with similar mean seasonality by means of dimensionality reduction in order to estimate in each region both the `normal' variance and robust thresholds for detecting the extremes. In addition, we account for challenges like heteroscedasticity in Northern latitudes. Apart from hot spot areas, those anomalies in the atmosphere time series are of particular interest, which can only be detected by a multivariate approach but not by a simple univariate approach. Such an anomalous constellation of atmosphere variables is of interest if it impacts the Biosphere. The multivariate constellation of such an anomalous part of a time series is shown in one case study indicating that multivariate anomaly detection can provide novel insights into Earth observations.

  6. Multivariate Copula Analysis Toolbox (MvCAT): Describing dependence and underlying uncertainty using a Bayesian framework

    NASA Astrophysics Data System (ADS)

    Sadegh, Mojtaba; Ragno, Elisa; AghaKouchak, Amir

    2017-06-01

    We present a newly developed Multivariate Copula Analysis Toolbox (MvCAT) which includes a wide range of copula families with different levels of complexity. MvCAT employs a Bayesian framework with a residual-based Gaussian likelihood function for inferring copula parameters and estimating the underlying uncertainties. The contribution of this paper is threefold: (a) providing a Bayesian framework to approximate the predictive uncertainties of fitted copulas, (b) introducing a hybrid-evolution Markov Chain Monte Carlo (MCMC) approach designed for numerical estimation of the posterior distribution of copula parameters, and (c) enabling the community to explore a wide range of copulas and evaluate them relative to the fitting uncertainties. We show that the commonly used local optimization methods for copula parameter estimation often get trapped in local minima. The proposed method, however, addresses this limitation and improves describing the dependence structure. MvCAT also enables evaluation of uncertainties relative to the length of record, which is fundamental to a wide range of applications such as multivariate frequency analysis.

  7. Estimating multivariate response surface model with data outliers, case study in enhancing surface layer properties of an aircraft aluminium alloy

    NASA Astrophysics Data System (ADS)

    Widodo, Edy; Kariyam

    2017-03-01

    To determine the input variable settings that create the optimal compromise in response variable used Response Surface Methodology (RSM). There are three primary steps in the RSM problem, namely data collection, modelling, and optimization. In this study focused on the establishment of response surface models, using the assumption that the data produced is correct. Usually the response surface model parameters are estimated by OLS. However, this method is highly sensitive to outliers. Outliers can generate substantial residual and often affect the estimator models. Estimator models produced can be biased and could lead to errors in the determination of the optimal point of fact, that the main purpose of RSM is not reached. Meanwhile, in real life, the collected data often contain some response variable and a set of independent variables. Treat each response separately and apply a single response procedures can result in the wrong interpretation. So we need a development model for the multi-response case. Therefore, it takes a multivariate model of the response surface that is resistant to outliers. As an alternative, in this study discussed on M-estimation as a parameter estimator in multivariate response surface models containing outliers. As an illustration presented a case study on the experimental results to the enhancement of the surface layer of aluminium alloy air by shot peening.

  8. A simplified parsimonious higher order multivariate Markov chain model with new convergence condition

    NASA Astrophysics Data System (ADS)

    Wang, Chao; Yang, Chuan-sheng

    2017-09-01

    In this paper, we present a simplified parsimonious higher-order multivariate Markov chain model with new convergence condition. (TPHOMMCM-NCC). Moreover, estimation method of the parameters in TPHOMMCM-NCC is give. Numerical experiments illustrate the effectiveness of TPHOMMCM-NCC.

  9. Borrowing of strength and study weights in multivariate and network meta-analysis.

    PubMed

    Jackson, Dan; White, Ian R; Price, Malcolm; Copas, John; Riley, Richard D

    2017-12-01

    Multivariate and network meta-analysis have the potential for the estimated mean of one effect to borrow strength from the data on other effects of interest. The extent of this borrowing of strength is usually assessed informally. We present new mathematical definitions of 'borrowing of strength'. Our main proposal is based on a decomposition of the score statistic, which we show can be interpreted as comparing the precision of estimates from the multivariate and univariate models. Our definition of borrowing of strength therefore emulates the usual informal assessment. We also derive a method for calculating study weights, which we embed into the same framework as our borrowing of strength statistics, so that percentage study weights can accompany the results from multivariate and network meta-analyses as they do in conventional univariate meta-analyses. Our proposals are illustrated using three meta-analyses involving correlated effects for multiple outcomes, multiple risk factor associations and multiple treatments (network meta-analysis).

  10. Borrowing of strength and study weights in multivariate and network meta-analysis

    PubMed Central

    Jackson, Dan; White, Ian R; Price, Malcolm; Copas, John; Riley, Richard D

    2016-01-01

    Multivariate and network meta-analysis have the potential for the estimated mean of one effect to borrow strength from the data on other effects of interest. The extent of this borrowing of strength is usually assessed informally. We present new mathematical definitions of ‘borrowing of strength’. Our main proposal is based on a decomposition of the score statistic, which we show can be interpreted as comparing the precision of estimates from the multivariate and univariate models. Our definition of borrowing of strength therefore emulates the usual informal assessment. We also derive a method for calculating study weights, which we embed into the same framework as our borrowing of strength statistics, so that percentage study weights can accompany the results from multivariate and network meta-analyses as they do in conventional univariate meta-analyses. Our proposals are illustrated using three meta-analyses involving correlated effects for multiple outcomes, multiple risk factor associations and multiple treatments (network meta-analysis). PMID:26546254

  11. A refined method for multivariate meta-analysis and meta-regression

    PubMed Central

    Jackson, Daniel; Riley, Richard D

    2014-01-01

    Making inferences about the average treatment effect using the random effects model for meta-analysis is problematic in the common situation where there is a small number of studies. This is because estimates of the between-study variance are not precise enough to accurately apply the conventional methods for testing and deriving a confidence interval for the average effect. We have found that a refined method for univariate meta-analysis, which applies a scaling factor to the estimated effects’ standard error, provides more accurate inference. We explain how to extend this method to the multivariate scenario and show that our proposal for refined multivariate meta-analysis and meta-regression can provide more accurate inferences than the more conventional approach. We explain how our proposed approach can be implemented using standard output from multivariate meta-analysis software packages and apply our methodology to two real examples. © 2013 The Authors. Statistics in Medicine published by John Wiley & Sons, Ltd. PMID:23996351

  12. Multivariate meta-analysis using individual participant data

    PubMed Central

    Riley, R. D.; Price, M. J.; Jackson, D.; Wardle, M.; Gueyffier, F.; Wang, J.; Staessen, J. A.; White, I. R.

    2016-01-01

    When combining results across related studies, a multivariate meta-analysis allows the joint synthesis of correlated effect estimates from multiple outcomes. Joint synthesis can improve efficiency over separate univariate syntheses, may reduce selective outcome reporting biases, and enables joint inferences across the outcomes. A common issue is that within-study correlations needed to fit the multivariate model are unknown from published reports. However, provision of individual participant data (IPD) allows them to be calculated directly. Here, we illustrate how to use IPD to estimate within-study correlations, using a joint linear regression for multiple continuous outcomes and bootstrapping methods for binary, survival and mixed outcomes. In a meta-analysis of 10 hypertension trials, we then show how these methods enable multivariate meta-analysis to address novel clinical questions about continuous, survival and binary outcomes; treatment–covariate interactions; adjusted risk/prognostic factor effects; longitudinal data; prognostic and multiparameter models; and multiple treatment comparisons. Both frequentist and Bayesian approaches are applied, with example software code provided to derive within-study correlations and to fit the models. PMID:26099484

  13. Inference of reactive transport model parameters using a Bayesian multivariate approach

    NASA Astrophysics Data System (ADS)

    Carniato, Luca; Schoups, Gerrit; van de Giesen, Nick

    2014-08-01

    Parameter estimation of subsurface transport models from multispecies data requires the definition of an objective function that includes different types of measurements. Common approaches are weighted least squares (WLS), where weights are specified a priori for each measurement, and weighted least squares with weight estimation (WLS(we)) where weights are estimated from the data together with the parameters. In this study, we formulate the parameter estimation task as a multivariate Bayesian inference problem. The WLS and WLS(we) methods are special cases in this framework, corresponding to specific prior assumptions about the residual covariance matrix. The Bayesian perspective allows for generalizations to cases where residual correlation is important and for efficient inference by analytically integrating out the variances (weights) and selected covariances from the joint posterior. Specifically, the WLS and WLS(we) methods are compared to a multivariate (MV) approach that accounts for specific residual correlations without the need for explicit estimation of the error parameters. When applied to inference of reactive transport model parameters from column-scale data on dissolved species concentrations, the following results were obtained: (1) accounting for residual correlation between species provides more accurate parameter estimation for high residual correlation levels whereas its influence for predictive uncertainty is negligible, (2) integrating out the (co)variances leads to an efficient estimation of the full joint posterior with a reduced computational effort compared to the WLS(we) method, and (3) in the presence of model structural errors, none of the methods is able to identify the correct parameter values.

  14. Enhancing e-waste estimates: improving data quality by multivariate Input-Output Analysis.

    PubMed

    Wang, Feng; Huisman, Jaco; Stevels, Ab; Baldé, Cornelis Peter

    2013-11-01

    Waste electrical and electronic equipment (or e-waste) is one of the fastest growing waste streams, which encompasses a wide and increasing spectrum of products. Accurate estimation of e-waste generation is difficult, mainly due to lack of high quality data referred to market and socio-economic dynamics. This paper addresses how to enhance e-waste estimates by providing techniques to increase data quality. An advanced, flexible and multivariate Input-Output Analysis (IOA) method is proposed. It links all three pillars in IOA (product sales, stock and lifespan profiles) to construct mathematical relationships between various data points. By applying this method, the data consolidation steps can generate more accurate time-series datasets from available data pool. This can consequently increase the reliability of e-waste estimates compared to the approach without data processing. A case study in the Netherlands is used to apply the advanced IOA model. As a result, for the first time ever, complete datasets of all three variables for estimating all types of e-waste have been obtained. The result of this study also demonstrates significant disparity between various estimation models, arising from the use of data under different conditions. It shows the importance of applying multivariate approach and multiple sources to improve data quality for modelling, specifically using appropriate time-varying lifespan parameters. Following the case study, a roadmap with a procedural guideline is provided to enhance e-waste estimation studies. Copyright © 2013 Elsevier Ltd. All rights reserved.

  15. Information spreading by a combination of MEG source estimation and multivariate pattern classification.

    PubMed

    Sato, Masashi; Yamashita, Okito; Sato, Masa-Aki; Miyawaki, Yoichi

    2018-01-01

    To understand information representation in human brain activity, it is important to investigate its fine spatial patterns at high temporal resolution. One possible approach is to use source estimation of magnetoencephalography (MEG) signals. Previous studies have mainly quantified accuracy of this technique according to positional deviations and dispersion of estimated sources, but it remains unclear how accurately MEG source estimation restores information content represented by spatial patterns of brain activity. In this study, using simulated MEG signals representing artificial experimental conditions, we performed MEG source estimation and multivariate pattern analysis to examine whether MEG source estimation can restore information content represented by patterns of cortical current in source brain areas. Classification analysis revealed that the corresponding artificial experimental conditions were predicted accurately from patterns of cortical current estimated in the source brain areas. However, accurate predictions were also possible from brain areas whose original sources were not defined. Searchlight decoding further revealed that this unexpected prediction was possible across wide brain areas beyond the original source locations, indicating that information contained in the original sources can spread through MEG source estimation. This phenomenon of "information spreading" may easily lead to false-positive interpretations when MEG source estimation and classification analysis are combined to identify brain areas that represent target information. Real MEG data analyses also showed that presented stimuli were able to be predicted in the higher visual cortex at the same latency as in the primary visual cortex, also suggesting that information spreading took place. These results indicate that careful inspection is necessary to avoid false-positive interpretations when MEG source estimation and multivariate pattern analysis are combined.

  16. Information spreading by a combination of MEG source estimation and multivariate pattern classification

    PubMed Central

    Sato, Masashi; Yamashita, Okito; Sato, Masa-aki

    2018-01-01

    To understand information representation in human brain activity, it is important to investigate its fine spatial patterns at high temporal resolution. One possible approach is to use source estimation of magnetoencephalography (MEG) signals. Previous studies have mainly quantified accuracy of this technique according to positional deviations and dispersion of estimated sources, but it remains unclear how accurately MEG source estimation restores information content represented by spatial patterns of brain activity. In this study, using simulated MEG signals representing artificial experimental conditions, we performed MEG source estimation and multivariate pattern analysis to examine whether MEG source estimation can restore information content represented by patterns of cortical current in source brain areas. Classification analysis revealed that the corresponding artificial experimental conditions were predicted accurately from patterns of cortical current estimated in the source brain areas. However, accurate predictions were also possible from brain areas whose original sources were not defined. Searchlight decoding further revealed that this unexpected prediction was possible across wide brain areas beyond the original source locations, indicating that information contained in the original sources can spread through MEG source estimation. This phenomenon of “information spreading” may easily lead to false-positive interpretations when MEG source estimation and classification analysis are combined to identify brain areas that represent target information. Real MEG data analyses also showed that presented stimuli were able to be predicted in the higher visual cortex at the same latency as in the primary visual cortex, also suggesting that information spreading took place. These results indicate that careful inspection is necessary to avoid false-positive interpretations when MEG source estimation and multivariate pattern analysis are combined. PMID:29912968

  17. The Covariance Adjustment Approaches for Combining Incomparable Cox Regressions Caused by Unbalanced Covariates Adjustment: A Multivariate Meta-Analysis Study.

    PubMed

    Dehesh, Tania; Zare, Najaf; Ayatollahi, Seyyed Mohammad Taghi

    2015-01-01

    Univariate meta-analysis (UM) procedure, as a technique that provides a single overall result, has become increasingly popular. Neglecting the existence of other concomitant covariates in the models leads to loss of treatment efficiency. Our aim was proposing four new approximation approaches for the covariance matrix of the coefficients, which is not readily available for the multivariate generalized least square (MGLS) method as a multivariate meta-analysis approach. We evaluated the efficiency of four new approaches including zero correlation (ZC), common correlation (CC), estimated correlation (EC), and multivariate multilevel correlation (MMC) on the estimation bias, mean square error (MSE), and 95% probability coverage of the confidence interval (CI) in the synthesis of Cox proportional hazard models coefficients in a simulation study. Comparing the results of the simulation study on the MSE, bias, and CI of the estimated coefficients indicated that MMC approach was the most accurate procedure compared to EC, CC, and ZC procedures. The precision ranking of the four approaches according to all above settings was MMC ≥ EC ≥ CC ≥ ZC. This study highlights advantages of MGLS meta-analysis on UM approach. The results suggested the use of MMC procedure to overcome the lack of information for having a complete covariance matrix of the coefficients.

  18. U.S. Citizen Children of Undocumented Parents: The Link Between State Immigration Policy and the Health of Latino Children.

    PubMed

    Vargas, Edward D; Ybarra, Vickie D

    2017-08-01

    We examine Latino citizen children in mixed-status families and how their physical health status compares to their U.S. citizen, co-ethnic counterparts. We also examine Latino parents' perceptions of state immigration policy and its implications for child health status. Using the 2015 Latino National Health and Immigration Survey (n = 1493), we estimate a series of multivariate ordered logistic regression models with mixed-status family and perceptions of state immigration policy as primary predictors. We find that mixed-status families report worse physical health for their children as compared to their U.S. citizen co-ethnics. We also find that parental perceptions of their states' immigration status further exacerbate health disparities between families. These findings have implications for scholars and policy makers interested in immigrant health, family wellbeing, and health disparities in complex family structures. They contribute to the scholarship on Latino child health and on the erosion of the Latino immigrant health advantage.

  19. The impact of pediatric obesity on hospitalized children with lower respiratory tract infections in the United States.

    PubMed

    Okubo, Yusuke; Nochioka, Kotaro; Testa, Marcia A

    2018-04-01

    Obesity is the most common public health problem and is a clinically complicating risk factor among hospitalized children. The impact of pediatric obesity on the severity and morbidity of lower respiratory tract infections remains unclear. We conducted a retrospective cohort study of bronchitis and pneumonia among children aged 2-20 years using hospital discharge records. The data were obtained from the Kid's Inpatient Database in 2003, 2006, 2009, and 2012, and were weighted to estimate the number of hospitalizations in the United States. We used the International Classification of Diseases, Ninth Revision, Clinical Modification code (278.0×) to classify whether the patient was obese or not. We investigated the associations between pediatric obesity and use of mechanical ventilation using multivariable logistic regression model. In addition, we ascertained the relationships between pediatric obesity, comorbid blood stream infections, mean healthcare cost, and length of hospital stay. We estimated a total of 133 602 hospitalizations with pneumonia and bronchitis among children aged between 2 and 20 years. Obesity was significantly associated with use of mechanical ventilation (adjusted OR 2.90, 95% CI 2.15-3.90), comorbid bacteremia or septicemia (adjusted OR 1.58, 95% CI 1.03-2.44), elevated healthcare costs (adjusted difference $383, 95%CI $276-$476), and prolonged length of hospital stay (difference 0.32 days, 95%CI 0.23-0.40 days), after adjusting for patient and hospital characteristics using multivariable logistic regression models. Pediatric obesity is an independent risk factor for severity and morbidity among pediatric patients with lower respiratory tract infections. These findings suggest the importance of obesity prevention for pediatric populations. © 2017 John Wiley & Sons Ltd.

  20. Hybrid least squares multivariate spectral analysis methods

    DOEpatents

    Haaland, David M.

    2002-01-01

    A set of hybrid least squares multivariate spectral analysis methods in which spectral shapes of components or effects not present in the original calibration step are added in a following estimation or calibration step to improve the accuracy of the estimation of the amount of the original components in the sampled mixture. The "hybrid" method herein means a combination of an initial classical least squares analysis calibration step with subsequent analysis by an inverse multivariate analysis method. A "spectral shape" herein means normally the spectral shape of a non-calibrated chemical component in the sample mixture but can also mean the spectral shapes of other sources of spectral variation, including temperature drift, shifts between spectrometers, spectrometer drift, etc. The "shape" can be continuous, discontinuous, or even discrete points illustrative of the particular effect.

  1. Improved Accuracy of Automated Estimation of Cardiac Output Using Circulation Time in Patients with Heart Failure.

    PubMed

    Dajani, Hilmi R; Hosokawa, Kazuya; Ando, Shin-Ichi

    2016-11-01

    Lung-to-finger circulation time of oxygenated blood during nocturnal periodic breathing in heart failure patients measured using polysomnography correlates negatively with cardiac function but possesses limited accuracy for cardiac output (CO) estimation. CO was recalculated from lung-to-finger circulation time using a multivariable linear model with information on age and average overnight heart rate in 25 patients who underwent evaluation of heart failure. The multivariable model decreased the percentage error to 22.3% relative to invasive CO measured during cardiac catheterization. This improved automated noninvasive CO estimation using multiple variables meets a recently proposed performance criterion for clinical acceptability of noninvasive CO estimation, and compares very favorably with other available methods. Copyright © 2016 Elsevier Inc. All rights reserved.

  2. A Longitudinal Study on Human Outdoor Decomposition in Central Texas.

    PubMed

    Suckling, Joanna K; Spradley, M Katherine; Godde, Kanya

    2016-01-01

    The development of a methodology that estimates the postmortem interval (PMI) from stages of decomposition is a goal for which forensic practitioners strive. A proposed equation (Megyesi et al. 2005) that utilizes total body score (TBS) and accumulated degree days (ADD) was tested using longitudinal data collected from human remains donated to the Forensic Anthropology Research Facility (FARF) at Texas State University-San Marcos. Exact binomial tests examined the rate of the equation to successfully predict ADD. Statistically significant differences were found between ADD estimated by the equation and the observed value for decomposition stage. Differences remained significant after carnivore scavenged donations were removed from analysis. Low success rates for the equation to predict ADD from TBS and the wide standard errors demonstrate the need to re-evaluate the use of this equation and methodology for PMI estimation in different environments; rather, multivariate methods and equations should be derived that are environmentally specific. © 2015 American Academy of Forensic Sciences.

  3. Unreported workers' compensation claims to the BLS Survey of Occupational Injuries and Illnesses: Establishment factors.

    PubMed

    Wuellner, Sara E; Adams, Darrin A; Bonauto, David K

    2016-04-01

    Studies suggest employers underreport injuries to the Bureau of Labor Statistics Survey of Occupational Injuries and Illnesses (SOII); less is known about reporting differences by establishment characteristics. We linked SOII data to Washington State workers' compensation claims data, using unemployment insurance data to improve linking accuracy. We used multivariable regression models to estimate incidence ratios (IR) of unreported workers' compensation claims for establishment characteristics. An estimated 70% of workers' compensation claims were reported in SOII. Claims among state and local government establishments were most likely to be reported. Compared to large manufacturing establishments, unreported claims were most common among small educational services establishments (IR = 2.47, 95%CI: 1.52-4.01) and large construction establishments (IR = 2.05, 95%CI: 1.77-2.37). Underreporting of workers' compensation claims to SOII varies by establishment characteristics, obscuring true differences in work injury incidence. Findings may differ from previous research due to differences in study methods. © 2016 The Authors. American Journal of Industrial Medicine Published by Wiley Periodicals, Inc.

  4. [Determinants of dental services utilization by adults: a population-based study in Florianópolis, Santa Catarina State, Brazil].

    PubMed

    Miranda, Camila Dal-Bó Coradini; Peres, Marco Aurélio

    2013-11-01

    This study aimed to estimate the prevalence of dental services utilization by adults and to identify associated socioeconomic, demographic, behavioral, and self-awareness factors. A cross-sectional population-based study was conducted with adults living in the urban area of Florianópolis, Santa Catarina State, Brazil, in 2009. Associations were tested between use of dental services and predisposing, enabling, and needs-based variables. Multivariate analysis was conducted using Poisson regression with estimates of prevalence ratios and was stratified by place of last dental appointment. Prevalence of dental services utilization was 66% (95%CI: 62.9-70.7). Dental visits were 20% more frequent among women and 72% more frequent among individuals with more schooling (the latter in both public and private dental services). Individuals with private dental plans used dental services 13% more than those without. Schooling was the most important variable in predicting utilization. The study's results show the importance of monitoring associated factors in order to promote more equitable use of dental services.

  5. Doula care, birth outcomes, and costs among Medicaid beneficiaries.

    PubMed

    Kozhimannil, Katy Backes; Hardeman, Rachel R; Attanasio, Laura B; Blauer-Peterson, Cori; O'Brien, Michelle

    2013-04-01

    We compared childbirth-related outcomes for Medicaid recipients who received prenatal education and childbirth support from trained doulas with outcomes from a national sample of similar women and estimated potential cost savings. We calculated descriptive statistics for Medicaid-funded births nationally (from the 2009 Nationwide Inpatient Sample; n = 279,008) and births supported by doula care (n = 1079) in Minneapolis, Minnesota, in 2010 to 2012; used multivariate regression to estimate impacts of doula care; and modeled potential cost savings associated with reductions in cesarean delivery for doula-supported births. The cesarean rate was 22.3% among doula-supported births and 31.5% among Medicaid beneficiaries nationally. The corresponding preterm birth rates were 6.1% and 7.3%, respectively. After control for clinical and sociodemographic factors, odds of cesarean delivery were 40.9% lower for doula-supported births (adjusted odds ratio = 0.59; P < .001). Potential cost savings to Medicaid programs associated with such cesarean rate reductions are substantial but depend on states' reimbursement rates, birth volume, and current cesarean rates. State Medicaid programs should consider offering coverage for birth doulas to realize potential cost savings associated with reduced cesarean rates.

  6. MULTIVARIATE ANALYSIS ON LEVELS OF SELECTED METALS, PARTICULATE MATTER, VOC, AND HOUSEHOLD CHARACTERISTICS AND ACTIVITIES FROM THE MIDWESTERN STATES NHEXAS

    EPA Science Inventory

    Microenvironmental and biological/personal monitoring information were collected during the National Human Exposure Assessment Survey (NHEXAS), conducted in the six states comprising U.S. EPA Region Five. They have been analyzed by multivariate analysis techniques with general ...

  7. Geographic variation in characteristics of postpartum women using female sterilization

    PubMed Central

    White, Kari; Potter, Joseph E.; Zite, Nikki

    2015-01-01

    Background Southern states have higher rates of female sterilization compared to other areas of the United States, and the reasons for this are not well understood. We examined whether low-income and racial/ethnic minority women, who were previous targets of coercive practices, disproportionately report using sterilization in the South. Methods We used data from 12 states participating in the Pregnancy Risk Assessment Monitoring System that collected information on women’s contraceptive method use between 2006–2009. We categorized states according to geographic region: South, Midwest/West and Northeast. Within each region, we computed the percentage of women using sterilization according to their demographic and obstetric characteristics and estimated multivariable-adjusted prevalence ratios to evaluate whether the same characteristics were associated with sterilization use. Findings The percentage of postpartum women using sterilization ranged from 5.0–9.9% in the Northeast, 8.9–10.6% in the Midwest/West and 11.6–22.4% in the South. Women in nearly all sub-groups in Southern states were more likely to use sterilization than women in the Northeast. After multivariable adjustment, there were no significant differences in the prevalence of sterilization for Blacks compared to Whites in the Northeast (0.76 [95% CI: 0.55–1.06]), Midwest/West (0.91 [95%CI: 0.80–1.04]) and South (0.96 [95%CI: 0.85–1.07]). Women with Medicaid-paid deliveries (versus private insurance) had a higher prevalence of sterilization in all regions (p<0.05). Conclusions These findings do not indicate that low-income and racial/ethnic minority women in the South use sterilization at disproportionately higher rates compared to other regions, and suggest that other differences, such as social norms and family planning policies, may contribute to this geographic variation. PMID:26232310

  8. Dietary diversity decreases the risk of cognitive decline among Japanese older adults.

    PubMed

    Otsuka, Rei; Nishita, Yukiko; Tange, Chikako; Tomida, Makiko; Kato, Yuki; Nakamoto, Mariko; Imai, Tomoko; Ando, Fujiko; Shimokata, Hiroshi

    2017-06-01

    To clarify the effectiveness of dietary diversity, calculated by dietary records, on cognitive decline. Data were derived from the National Institute for Longevity Sciences - Longitudinal Study of Aging. Participants comprised 298 men and 272 women aged 60-81 years at baseline (second wave) who participated in the follow-up study (third to seventh wave) at least once. Cognitive function was assessed with the Mini-Mental State Examination in all study waves. Dietary diversity was determined using the Quantitative Index for Dietary Diversity based on a 3-day dietary record in the second wave. Cumulative data among participants with a Mini-Mental State Examination score >27 in the second wave were analyzed using a generalized estimating equation. Multivariate adjusted odds ratios and 95% confidence intervals for Mini-Mental State Examination scores ≤27 in each study wave according to a 1 standard deviation (increase), or quartiles of the Quantitative Index for Dietary Diversity at baseline, were adjusted for sex, age, follow-up time, baseline Mini-Mental State Examination score, education, body mass index, annual household income, current smoking status, energy intake and disease history. Multivariate adjusted odds ratio for a decline in Mini-Mental State Examination score was 0.79 (95% CI 0.70-0.89; P < 0.001) with a 1 SD increase in dietary diversity score, or 1.00 (reference), 0.99 (95% CI 0.70-1.43), 0.68 (95% CI 0.46-0.99) and 0.56 (95% CI 0.38-0.83) according to the lowest through highest quartiles of dietary diversity score, respectively (trend P = 0.001). Daily intake of various kinds of food might be a protective factor against cognitive decline in community-dwelling Japanese older adults. Geriatr Gerontol Int 2017; 17: 937-944. © 2016 Japan Geriatrics Society.

  9. Multivariate Meta-Analysis Using Individual Participant Data

    ERIC Educational Resources Information Center

    Riley, R. D.; Price, M. J.; Jackson, D.; Wardle, M.; Gueyffier, F.; Wang, J.; Staessen, J. A.; White, I. R.

    2015-01-01

    When combining results across related studies, a multivariate meta-analysis allows the joint synthesis of correlated effect estimates from multiple outcomes. Joint synthesis can improve efficiency over separate univariate syntheses, may reduce selective outcome reporting biases, and enables joint inferences across the outcomes. A common issue is…

  10. A Probabilistic Approach for Real-Time Volcano Surveillance

    NASA Astrophysics Data System (ADS)

    Cannavo, F.; Cannata, A.; Cassisi, C.; Di Grazia, G.; Maronno, P.; Montalto, P.; Prestifilippo, M.; Privitera, E.; Gambino, S.; Coltelli, M.

    2016-12-01

    Continuous evaluation of the state of potentially dangerous volcanos plays a key role for civil protection purposes. Presently, real-time surveillance of most volcanoes worldwide is essentially delegated to one or more human experts in volcanology, who interpret data coming from different kind of monitoring networks. Unfavorably, the coupling of highly non-linear and complex volcanic dynamic processes leads to measurable effects that can show a large variety of different behaviors. Moreover, due to intrinsic uncertainties and possible failures in some recorded data, the volcano state needs to be expressed in probabilistic terms, thus making the fast volcano state assessment sometimes impracticable for the personnel on duty at the control rooms. With the aim of aiding the personnel on duty in volcano surveillance, we present a probabilistic graphical model to estimate automatically the ongoing volcano state from all the available different kind of measurements. The model consists of a Bayesian network able to represent a set of variables and their conditional dependencies via a directed acyclic graph. The model variables are both the measurements and the possible states of the volcano through the time. The model output is an estimation of the probability distribution of the feasible volcano states. We tested the model on the Mt. Etna (Italy) case study by considering a long record of multivariate data from 2011 to 2015 and cross-validated it. Results indicate that the proposed model is effective and of great power for decision making purposes.

  11. A model-based approach to wildland fire reconstruction using sediment charcoal records

    USGS Publications Warehouse

    Itter, Malcolm S.; Finley, Andrew O.; Hooten, Mevin B.; Higuera, Philip E.; Marlon, Jennifer R.; Kelly, Ryan; McLachlan, Jason S.

    2017-01-01

    Lake sediment charcoal records are used in paleoecological analyses to reconstruct fire history, including the identification of past wildland fires. One challenge of applying sediment charcoal records to infer fire history is the separation of charcoal associated with local fire occurrence and charcoal originating from regional fire activity. Despite a variety of methods to identify local fires from sediment charcoal records, an integrated statistical framework for fire reconstruction is lacking. We develop a Bayesian point process model to estimate the probability of fire associated with charcoal counts from individual-lake sediments and estimate mean fire return intervals. A multivariate extension of the model combines records from multiple lakes to reduce uncertainty in local fire identification and estimate a regional mean fire return interval. The univariate and multivariate models are applied to 13 lakes in the Yukon Flats region of Alaska. Both models resulted in similar mean fire return intervals (100–350 years) with reduced uncertainty under the multivariate model due to improved estimation of regional charcoal deposition. The point process model offers an integrated statistical framework for paleofire reconstruction and extends existing methods to infer regional fire history from multiple lake records with uncertainty following directly from posterior distributions.

  12. Multivariate Non-Symmetric Stochastic Models for Spatial Dependence Models

    NASA Astrophysics Data System (ADS)

    Haslauer, C. P.; Bárdossy, A.

    2017-12-01

    A copula based multivariate framework allows more flexibility to describe different kind of dependences than what is possible using models relying on the confining assumption of symmetric Gaussian models: different quantiles can be modelled with a different degree of dependence; it will be demonstrated how this can be expected given process understanding. maximum likelihood based multivariate quantitative parameter estimation yields stable and reliable results; not only improved results in cross-validation based measures of uncertainty are obtained but also a more realistic spatial structure of uncertainty compared to second order models of dependence; as much information as is available is included in the parameter estimation: incorporation of censored measurements (e.g., below detection limit, or ones that are above the sensitive range of the measurement device) yield to more realistic spatial models; the proportion of true zeros can be jointly estimated with and distinguished from censored measurements which allow estimates about the age of a contaminant in the system; secondary information (categorical and on the rational scale) has been used to improve the estimation of the primary variable; These copula based multivariate statistical techniques are demonstrated based on hydraulic conductivity observations at the Borden (Canada) site, the MADE site (USA), and a large regional groundwater quality data-set in south-west Germany. Fields of spatially distributed K were simulated with identical marginal simulation, identical second order spatial moments, yet substantially differing solute transport characteristics when numerical tracer tests were performed. A statistical methodology is shown that allows the delineation of a boundary layer separating homogenous parts of a spatial data-set. The effects of this boundary layer (macro structure) and the spatial dependence of K (micro structure) on solute transport behaviour is shown.

  13. Assessing agreement among alternative climate change projections to inform conservation recommendations in the contiguous United States.

    PubMed

    Belote, R Travis; Carroll, Carlos; Martinuzzi, Sebastián; Michalak, Julia; Williams, John W; Williamson, Matthew A; Aplet, Gregory H

    2018-06-21

    Addressing uncertainties in climate vulnerability remains a challenge for conservation planning. We evaluate how confidence in conservation recommendations may change with agreement among alternative climate projections and metrics of climate exposure. We assessed agreement among three multivariate estimates of climate exposure (forward velocity, backward velocity, and climate dissimilarity) using 18 alternative climate projections for the contiguous United States. For each metric, we classified maps into quartiles for each alternative climate projections, and calculated the frequency of quartiles assigned for each gridded location (high quartile frequency = more agreement among climate projections). We evaluated recommendations using a recent climate adaptation heuristic framework that recommends emphasizing various conservation strategies to land based on current conservation value and expected climate exposure. We found that areas where conservation strategies would be confidently assigned based on high agreement among climate projections varied substantially across regions. In general, there was more agreement in forward and backward velocity estimates among alternative projections than agreement in estimates of local dissimilarity. Consensus of climate predictions resulted in the same conservation recommendation assignments in a few areas, but patterns varied by climate exposure metric. This work demonstrates an approach for explicitly evaluating alternative predictions in geographic patterns of climate change.

  14. A Bayesian approach to multivariate measurement system assessment

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Hamada, Michael Scott

    This article considers system assessment for multivariate measurements and presents a Bayesian approach to analyzing gauge R&R study data. The evaluation of variances for univariate measurement becomes the evaluation of covariance matrices for multivariate measurements. The Bayesian approach ensures positive definite estimates of the covariance matrices and easily provides their uncertainty. Furthermore, various measurement system assessment criteria are easily evaluated. The approach is illustrated with data from a real gauge R&R study as well as simulated data.

  15. A Bayesian approach to multivariate measurement system assessment

    DOE PAGES

    Hamada, Michael Scott

    2016-07-01

    This article considers system assessment for multivariate measurements and presents a Bayesian approach to analyzing gauge R&R study data. The evaluation of variances for univariate measurement becomes the evaluation of covariance matrices for multivariate measurements. The Bayesian approach ensures positive definite estimates of the covariance matrices and easily provides their uncertainty. Furthermore, various measurement system assessment criteria are easily evaluated. The approach is illustrated with data from a real gauge R&R study as well as simulated data.

  16. Univariate Analysis of Multivariate Outcomes in Educational Psychology.

    ERIC Educational Resources Information Center

    Hubble, L. M.

    1984-01-01

    The author examined the prevalence of multiple operational definitions of outcome constructs and an estimate of the incidence of Type I error rates when univariate procedures were applied to multiple variables in educational psychology. Multiple operational definitions of constructs were advocated and wider use of multivariate analysis was…

  17. Bayesian Estimation of Multivariate Latent Regression Models: Gauss versus Laplace

    ERIC Educational Resources Information Center

    Culpepper, Steven Andrew; Park, Trevor

    2017-01-01

    A latent multivariate regression model is developed that employs a generalized asymmetric Laplace (GAL) prior distribution for regression coefficients. The model is designed for high-dimensional applications where an approximate sparsity condition is satisfied, such that many regression coefficients are near zero after accounting for all the model…

  18. Gendered Social Learning, Nonfamily Institutions, and Attitudes About Recourse After Partner Violence

    PubMed Central

    Krause, Kathleen H.

    2015-01-01

    Objective To provide the first study in Vietnam of how gendered social learning about violence and exposure to non-family institutions influence women’s attitudes about a wife’s recourse after physical IPV. Method A probability sample of 532 married women, ages 18–50 years, was surveyed in July–August, 2012 in Mỹ Hào district. We fit a multivariate linear regression model to estimate correlates of favoring recourse in six situations using a validated attitudinal scale. We split attitudes towards recourse into three subscales (disfavor silence, favor informal recourse, favor formal recourse) and fit one multivariate ordinal logistic regression model for each behavior to estimate correlates of favoring recourse. Results On average, women favored recourse in 2.8 situations. Women who were older and had witnessed physical IPV in childhood had less favorable attitudes about recourse. Women who were hit as children, had completed more schooling, worked outside agriculture, and had sought recourse after IPV had more favorable attitudes about recourse. Conclusions Normative change among women may require efforts to curb family violence, counsel those exposed to violence in childhood, and enhance women’s opportunities for higher schooling and non-agricultural wage work. The state and organizations working on IPV might overcome pockets of unfavorable public opinion by enforcing accountability for IPV rather than seeking to alter ideas about recourse among women. PMID:28392967

  19. Part 2. Development of Enhanced Statistical Methods for Assessing Health Effects Associated with an Unknown Number of Major Sources of Multiple Air Pollutants.

    PubMed

    Park, Eun Sug; Symanski, Elaine; Han, Daikwon; Spiegelman, Clifford

    2015-06-01

    A major difficulty with assessing source-specific health effects is that source-specific exposures cannot be measured directly; rather, they need to be estimated by a source-apportionment method such as multivariate receptor modeling. The uncertainty in source apportionment (uncertainty in source-specific exposure estimates and model uncertainty due to the unknown number of sources and identifiability conditions) has been largely ignored in previous studies. Also, spatial dependence of multipollutant data collected from multiple monitoring sites has not yet been incorporated into multivariate receptor modeling. The objectives of this project are (1) to develop a multipollutant approach that incorporates both sources of uncertainty in source-apportionment into the assessment of source-specific health effects and (2) to develop enhanced multivariate receptor models that can account for spatial correlations in the multipollutant data collected from multiple sites. We employed a Bayesian hierarchical modeling framework consisting of multivariate receptor models, health-effects models, and a hierarchical model on latent source contributions. For the health model, we focused on the time-series design in this project. Each combination of number of sources and identifiability conditions (additional constraints on model parameters) defines a different model. We built a set of plausible models with extensive exploratory data analyses and with information from previous studies, and then computed posterior model probability to estimate model uncertainty. Parameter estimation and model uncertainty estimation were implemented simultaneously by Markov chain Monte Carlo (MCMC*) methods. We validated the methods using simulated data. We illustrated the methods using PM2.5 (particulate matter ≤ 2.5 μm in aerodynamic diameter) speciation data and mortality data from Phoenix, Arizona, and Houston, Texas. The Phoenix data included counts of cardiovascular deaths and daily PM2.5 speciation data from 1995-1997. The Houston data included respiratory mortality data and 24-hour PM2.5 speciation data sampled every six days from a region near the Houston Ship Channel in years 2002-2005. We also developed a Bayesian spatial multivariate receptor modeling approach that, while simultaneously dealing with the unknown number of sources and identifiability conditions, incorporated spatial correlations in the multipollutant data collected from multiple sites into the estimation of source profiles and contributions based on the discrete process convolution model for multivariate spatial processes. This new modeling approach was applied to 24-hour ambient air concentrations of 17 volatile organic compounds (VOCs) measured at nine monitoring sites in Harris County, Texas, during years 2000 to 2005. Simulation results indicated that our methods were accurate in identifying the true model and estimated parameters were close to the true values. The results from our methods agreed in general with previous studies on the source apportionment of the Phoenix data in terms of estimated source profiles and contributions. However, we had a greater number of statistically insignificant findings, which was likely a natural consequence of incorporating uncertainty in the estimated source contributions into the health-effects parameter estimation. For the Houston data, a model with five sources (that seemed to be Sulfate-Rich Secondary Aerosol, Motor Vehicles, Industrial Combustion, Soil/Crustal Matter, and Sea Salt) showed the highest posterior model probability among the candidate models considered when fitted simultaneously to the PM2.5 and mortality data. There was a statistically significant positive association between respiratory mortality and same-day PM2.5 concentrations attributed to one of the sources (probably industrial combustion). The Bayesian spatial multivariate receptor modeling approach applied to the VOC data led to a highest posterior model probability for a model with five sources (that seemed to be refinery, petrochemical production, gasoline evaporation, natural gas, and vehicular exhaust) among several candidate models, with the number of sources varying between three and seven and with different identifiability conditions. Our multipollutant approach assessing source-specific health effects is more advantageous than a single-pollutant approach in that it can estimate total health effects from multiple pollutants and can also identify emission sources that are responsible for adverse health effects. Our Bayesian approach can incorporate not only uncertainty in the estimated source contributions, but also model uncertainty that has not been addressed in previous studies on assessing source-specific health effects. The new Bayesian spatial multivariate receptor modeling approach enables predictions of source contributions at unmonitored sites, minimizing exposure misclassification and providing improved exposure estimates along with their uncertainty estimates, as well as accounting for uncertainty in the number of sources and identifiability conditions.

  20. PM10 modeling in the Oviedo urban area (Northern Spain) by using multivariate adaptive regression splines

    NASA Astrophysics Data System (ADS)

    Nieto, Paulino José García; Antón, Juan Carlos Álvarez; Vilán, José Antonio Vilán; García-Gonzalo, Esperanza

    2014-10-01

    The aim of this research work is to build a regression model of the particulate matter up to 10 micrometers in size (PM10) by using the multivariate adaptive regression splines (MARS) technique in the Oviedo urban area (Northern Spain) at local scale. This research work explores the use of a nonparametric regression algorithm known as multivariate adaptive regression splines (MARS) which has the ability to approximate the relationship between the inputs and outputs, and express the relationship mathematically. In this sense, hazardous air pollutants or toxic air contaminants refer to any substance that may cause or contribute to an increase in mortality or serious illness, or that may pose a present or potential hazard to human health. To accomplish the objective of this study, the experimental dataset of nitrogen oxides (NOx), carbon monoxide (CO), sulfur dioxide (SO2), ozone (O3) and dust (PM10) were collected over 3 years (2006-2008) and they are used to create a highly nonlinear model of the PM10 in the Oviedo urban nucleus (Northern Spain) based on the MARS technique. One main objective of this model is to obtain a preliminary estimate of the dependence between PM10 pollutant in the Oviedo urban area at local scale. A second aim is to determine the factors with the greatest bearing on air quality with a view to proposing health and lifestyle improvements. The United States National Ambient Air Quality Standards (NAAQS) establishes the limit values of the main pollutants in the atmosphere in order to ensure the health of healthy people. Firstly, this MARS regression model captures the main perception of statistical learning theory in order to obtain a good prediction of the dependence among the main pollutants in the Oviedo urban area. Secondly, the main advantages of MARS are its capacity to produce simple, easy-to-interpret models, its ability to estimate the contributions of the input variables, and its computational efficiency. Finally, on the basis of these numerical calculations, using the multivariate adaptive regression splines (MARS) technique, conclusions of this research work are exposed.

  1. Reparametrization-based estimation of genetic parameters in multi-trait animal model using Integrated Nested Laplace Approximation.

    PubMed

    Mathew, Boby; Holand, Anna Marie; Koistinen, Petri; Léon, Jens; Sillanpää, Mikko J

    2016-02-01

    A novel reparametrization-based INLA approach as a fast alternative to MCMC for the Bayesian estimation of genetic parameters in multivariate animal model is presented. Multi-trait genetic parameter estimation is a relevant topic in animal and plant breeding programs because multi-trait analysis can take into account the genetic correlation between different traits and that significantly improves the accuracy of the genetic parameter estimates. Generally, multi-trait analysis is computationally demanding and requires initial estimates of genetic and residual correlations among the traits, while those are difficult to obtain. In this study, we illustrate how to reparametrize covariance matrices of a multivariate animal model/animal models using modified Cholesky decompositions. This reparametrization-based approach is used in the Integrated Nested Laplace Approximation (INLA) methodology to estimate genetic parameters of multivariate animal model. Immediate benefits are: (1) to avoid difficulties of finding good starting values for analysis which can be a problem, for example in Restricted Maximum Likelihood (REML); (2) Bayesian estimation of (co)variance components using INLA is faster to execute than using Markov Chain Monte Carlo (MCMC) especially when realized relationship matrices are dense. The slight drawback is that priors for covariance matrices are assigned for elements of the Cholesky factor but not directly to the covariance matrix elements as in MCMC. Additionally, we illustrate the concordance of the INLA results with the traditional methods like MCMC and REML approaches. We also present results obtained from simulated data sets with replicates and field data in rice.

  2. The role of modern control theory in the design of controls for aircraft turbine engines

    NASA Technical Reports Server (NTRS)

    Zeller, J.; Lehtinen, B.; Merrill, W.

    1982-01-01

    The development, applications, and current research in modern control theory (MCT) are reviewed, noting the importance for fuel-efficient operation of turbines with variable inlet guide vanes, compressor stators, and exhaust nozzle area. The evolution of multivariable propulsion control design is examined, noting a basis in a matrix formulation of the differential equations defining the process, leading to state space formulations. Reports and papers which appeared from 1970-1982 which dealt with problems in MCT applications to turbine engine control design are outlined, including works on linear quadratic regulator methods, frequency domain methods, identification, estimation, and model reduction, detection, isolation, and accommodation, and state space control, adaptive control, and optimization approaches. Finally, NASA programs in frequency domain design, sensor failure detection, computer-aided control design, and plant modeling are explored

  3. Estimating and Testing the Sources of Evoked Potentials in the Brain.

    ERIC Educational Resources Information Center

    Huizenga, Hilde M.; Molenaar, Peter C. M.

    1994-01-01

    The source of an event-related brain potential (ERP) is estimated from multivariate measures of ERP on the head under several mathematical and physical constraints on the parameters of the source model. Statistical aspects of estimation are discussed, and new tests are proposed. (SLD)

  4. Early impact of the Affordable Care Act on health insurance coverage of young adults.

    PubMed

    Cantor, Joel C; Monheit, Alan C; DeLia, Derek; Lloyd, Kristen

    2012-10-01

    To evaluate one of the first implemented provisions of the Patient Protection and Affordable Care Act (ACA), which permits young adults up to age 26 to enroll as dependents on a parent's private health plan. Nearly one-in-three young adults lacked coverage before the ACA. STUDY DESIGN, METHODS, AND DATA: Data from the Current Population Survey 2005-2011 are used to estimate linear probability models within a difference-in-differences framework to estimate how the ACA affected coverage of eligible young adults compared to slightly older adults. Multivariate models control for individual characteristics, economic trends, and prior state-dependent coverage laws. This ACA provision led to a rapid and substantial increase in the share of young adults with dependent coverage and a reduction in their uninsured rate in the early months of implementation. Models accounting for prior state dependent expansions suggest greater policy impact in 2010 among young adults who were also eligible under a state law. ACA-dependent coverage expansion represents a rare public policy success in the effort to cover the uninsured. Still, this policy may have later unintended consequences for premiums for alternative forms of coverage and employer-offered rates for young adult workers. © Health Research and Educational Trust.

  5. Tort law and medical malpractice insurance premiums.

    PubMed

    Kilgore, Meredith L; Morrisey, Michael A; Nelson, Leonard J

    2006-01-01

    This paper estimated the effects of tort law and insurer investment returns on physician malpractice insurance premiums. Data were collected on tort law from 1991 through 2004, and multivariate regression models, including fixed effects for state and year, were used to estimate the effect of changes in tort law on medical malpractice premiums. The premium consequences of national policy changes were simulated. The analysis found that the introduction of a new damage cap lowered malpractice premiums for internal medicine, general surgery, and obstetrics/gynecology by 17.3%, 20.7%, and 25.5%, respectively. Lowering damage caps by dollar 100,000 reduced premiums by 4%. Statutes of repose also resulted in lower premiums. No other tort law changes had the effect of lowering premiums. Simulation results indicate that a national cap of dollar 250,000 on awards for noneconomic damages in all states would imply premium savings of dollar 16.9 billion. Extending a dollar 250,000 cap to all states that do not currently have them would save dollar 1.4 billion annually, or about 8% of the total. A negative effect on malpractice premiums was found for the Dow Jones industrial average, but not for bond prices; effects of the Nasdaq index were not significant for internal medicine, but were marginally significant for surgery and obstetrics premiums.

  6. Early Impact of the Affordable Care Act on Health Insurance Coverage of Young Adults

    PubMed Central

    Cantor, Joel C; Monheit, Alan C; DeLia, Derek; Lloyd, Kristen

    2012-01-01

    Research Objective To evaluate one of the first implemented provisions of the Patient Protection and Affordable Care Act (ACA), which permits young adults up to age 26 to enroll as dependents on a parent's private health plan. Nearly one-in-three young adults lacked coverage before the ACA. Study Design, Methods, and Data Data from the Current Population Survey 2005–2011 are used to estimate linear probability models within a difference-in-differences framework to estimate how the ACA affected coverage of eligible young adults compared to slightly older adults. Multivariate models control for individual characteristics, economic trends, and prior state-dependent coverage laws. Principal Findings This ACA provision led to a rapid and substantial increase in the share of young adults with dependent coverage and a reduction in their uninsured rate in the early months of implementation. Models accounting for prior state dependent expansions suggest greater policy impact in 2010 among young adults who were also eligible under a state law. Conclusions and Implications ACA-dependent coverage expansion represents a rare public policy success in the effort to cover the uninsured. Still, this policy may have later unintended consequences for premiums for alternative forms of coverage and employer-offered rates for young adult workers. PMID:22924684

  7. Boosted Multivariate Trees for Longitudinal Data

    PubMed Central

    Pande, Amol; Li, Liang; Rajeswaran, Jeevanantham; Ehrlinger, John; Kogalur, Udaya B.; Blackstone, Eugene H.; Ishwaran, Hemant

    2017-01-01

    Machine learning methods provide a powerful approach for analyzing longitudinal data in which repeated measurements are observed for a subject over time. We boost multivariate trees to fit a novel flexible semi-nonparametric marginal model for longitudinal data. In this model, features are assumed to be nonparametric, while feature-time interactions are modeled semi-nonparametrically utilizing P-splines with estimated smoothing parameter. In order to avoid overfitting, we describe a relatively simple in sample cross-validation method which can be used to estimate the optimal boosting iteration and which has the surprising added benefit of stabilizing certain parameter estimates. Our new multivariate tree boosting method is shown to be highly flexible, robust to covariance misspecification and unbalanced designs, and resistant to overfitting in high dimensions. Feature selection can be used to identify important features and feature-time interactions. An application to longitudinal data of forced 1-second lung expiratory volume (FEV1) for lung transplant patients identifies an important feature-time interaction and illustrates the ease with which our method can find complex relationships in longitudinal data. PMID:29249866

  8. Improving the realism of hydrologic model through multivariate parameter estimation

    NASA Astrophysics Data System (ADS)

    Rakovec, Oldrich; Kumar, Rohini; Attinger, Sabine; Samaniego, Luis

    2017-04-01

    Increased availability and quality of near real-time observations should improve understanding of predictive skills of hydrological models. Recent studies have shown the limited capability of river discharge data alone to adequately constrain different components of distributed model parameterizations. In this study, the GRACE satellite-based total water storage (TWS) anomaly is used to complement the discharge data with an aim to improve the fidelity of mesoscale hydrologic model (mHM) through multivariate parameter estimation. The study is conducted in 83 European basins covering a wide range of hydro-climatic regimes. The model parameterization complemented with the TWS anomalies leads to statistically significant improvements in (1) discharge simulations during low-flow period, and (2) evapotranspiration estimates which are evaluated against independent (FLUXNET) data. Overall, there is no significant deterioration in model performance for the discharge simulations when complemented by information from the TWS anomalies. However, considerable changes in the partitioning of precipitation into runoff components are noticed by in-/exclusion of TWS during the parameter estimation. A cross-validation test carried out to assess the transferability and robustness of the calibrated parameters to other locations further confirms the benefit of complementary TWS data. In particular, the evapotranspiration estimates show more robust performance when TWS data are incorporated during the parameter estimation, in comparison with the benchmark model constrained against discharge only. This study highlights the value for incorporating multiple data sources during parameter estimation to improve the overall realism of hydrologic model and its applications over large domains. Rakovec, O., Kumar, R., Attinger, S. and Samaniego, L. (2016): Improving the realism of hydrologic model functioning through multivariate parameter estimation. Water Resour. Res., 52, http://dx.doi.org/10.1002/2016WR019430

  9. Disparities in eye care utilization among the United States adults with visual impairment: findings from the behavioral risk factor surveillance system 2006-2009.

    PubMed

    Chou, Chiu-Fang; Barker, Lawrence E; Crews, John E; Primo, Susan A; Zhang, Xinzhi; Elliott, Amanda F; McKeever Bullard, Kai; Geiss, Linda S; Saaddine, Jinan B

    2012-12-01

    To estimate the prevalence of annual eye care among visually impaired United States residents aged 40 years or older, by state, race/ethnicity, education, and annual income. Cross-sectional study. In analyses of 2006-2009 Behavioral Risk Factor Surveillance System data from 21 states, we used multivariate regression to estimate the state-level prevalence of yearly eye doctor visit in the study population by race/ethnicity (non-Hispanic white, non-Hispanic black, Hispanic, and other), annual income (≥$35,000 and <$35,000), and education (< high school, high school, and > high school). The age-adjusted state-level prevalence of yearly eye doctor visits ranged from 48% (Missouri) to 69% (Maryland). In Alabama, Colorado, Indiana, Iowa, New Mexico, and North Carolina, the prevalence was significantly higher among respondents with more than a high school education than among those with a high school education or less (P < .05). The prevalence was positively associated with annual income levels in Alabama, Georgia, New Mexico, New York, Texas, and West Virginia and negatively associated with annual income levels in Massachusetts. After controlling for age, sex, race/ethnicity, education, and income, we also found significant disparities in the prevalence of yearly eye doctor visits among states. Among visually impaired US residents aged 40 or older, the prevalence of yearly eye examinations varied significantly by race/ethnicity, income, and education, both overall and within states. Continued and possibly enhanced collection of eye care utilization data, such as we analyzed here, may help states address disparities in vision health and identify population groups most in need of intervention programs. Copyright © 2012 Elsevier Inc. All rights reserved.

  10. Bayesian Estimation of Random Coefficient Dynamic Factor Models

    ERIC Educational Resources Information Center

    Song, Hairong; Ferrer, Emilio

    2012-01-01

    Dynamic factor models (DFMs) have typically been applied to multivariate time series data collected from a single unit of study, such as a single individual or dyad. The goal of DFMs application is to capture dynamics of multivariate systems. When multiple units are available, however, DFMs are not suited to capture variations in dynamics across…

  11. Parametric Cost Models for Space Telescopes

    NASA Technical Reports Server (NTRS)

    Stahl, H. Philip

    2010-01-01

    A study is in-process to develop a multivariable parametric cost model for space telescopes. Cost and engineering parametric data has been collected on 30 different space telescopes. Statistical correlations have been developed between 19 variables of 59 variables sampled. Single Variable and Multi-Variable Cost Estimating Relationships have been developed. Results are being published.

  12. A Simpli ed, General Approach to Simulating from Multivariate Copula Functions

    Treesearch

    Barry Goodwin

    2012-01-01

    Copulas have become an important analytic tool for characterizing multivariate distributions and dependence. One is often interested in simulating data from copula estimates. The process can be analytically and computationally complex and usually involves steps that are unique to a given parametric copula. We describe an alternative approach that uses \\probability{...

  13. Analyzing Multivariate Repeated Measures Designs: A Comparison of Two Approximate Degrees of Freedom Procedures

    ERIC Educational Resources Information Center

    Lix, Lisa M.; Algina, James; Keselman, H. J.

    2003-01-01

    The approximate degrees of freedom Welch-James (WJ) and Brown-Forsythe (BF) procedures for testing within-subjects effects in multivariate groups by trials repeated measures designs were investigated under departures from covariance homogeneity and normality. Empirical Type I error and power rates were obtained for least-squares estimators and…

  14. Controlled Multivariate Evaluation of Open Education: Application of a Critical Model.

    ERIC Educational Resources Information Center

    Sewell, Alan F.; And Others

    This paper continues previous reports of a controlled multivariate evaluation of a junior high school open-education program. A new method of estimating program objectives and implementation is presented, together with the nature and degree of obtained student outcomes. Open-program students were found to approve more highly of their learning…

  15. Multivariate meta-analysis using individual participant data.

    PubMed

    Riley, R D; Price, M J; Jackson, D; Wardle, M; Gueyffier, F; Wang, J; Staessen, J A; White, I R

    2015-06-01

    When combining results across related studies, a multivariate meta-analysis allows the joint synthesis of correlated effect estimates from multiple outcomes. Joint synthesis can improve efficiency over separate univariate syntheses, may reduce selective outcome reporting biases, and enables joint inferences across the outcomes. A common issue is that within-study correlations needed to fit the multivariate model are unknown from published reports. However, provision of individual participant data (IPD) allows them to be calculated directly. Here, we illustrate how to use IPD to estimate within-study correlations, using a joint linear regression for multiple continuous outcomes and bootstrapping methods for binary, survival and mixed outcomes. In a meta-analysis of 10 hypertension trials, we then show how these methods enable multivariate meta-analysis to address novel clinical questions about continuous, survival and binary outcomes; treatment-covariate interactions; adjusted risk/prognostic factor effects; longitudinal data; prognostic and multiparameter models; and multiple treatment comparisons. Both frequentist and Bayesian approaches are applied, with example software code provided to derive within-study correlations and to fit the models. © 2014 The Authors. Research Synthesis Methods published by John Wiley & Sons, Ltd.

  16. A flexible model for multivariate interval-censored survival times with complex correlation structure.

    PubMed

    Falcaro, Milena; Pickles, Andrew

    2007-02-10

    We focus on the analysis of multivariate survival times with highly structured interdependency and subject to interval censoring. Such data are common in developmental genetics and genetic epidemiology. We propose a flexible mixed probit model that deals naturally with complex but uninformative censoring. The recorded ages of onset are treated as possibly censored ordinal outcomes with the interval censoring mechanism seen as arising from a coarsened measurement of a continuous variable observed as falling between subject-specific thresholds. This bypasses the requirement for the failure times to be observed as falling into non-overlapping intervals. The assumption of a normal age-of-onset distribution of the standard probit model is relaxed by embedding within it a multivariate Box-Cox transformation whose parameters are jointly estimated with the other parameters of the model. Complex decompositions of the underlying multivariate normal covariance matrix of the transformed ages of onset become possible. The new methodology is here applied to a multivariate study of the ages of first use of tobacco and first consumption of alcohol without parental permission in twins. The proposed model allows estimation of the genetic and environmental effects that are shared by both of these risk behaviours as well as those that are specific. 2006 John Wiley & Sons, Ltd.

  17. Standard Error of Linear Observed-Score Equating for the NEAT Design with Nonnormally Distributed Data

    ERIC Educational Resources Information Center

    Zu, Jiyun; Yuan, Ke-Hai

    2012-01-01

    In the nonequivalent groups with anchor test (NEAT) design, the standard error of linear observed-score equating is commonly estimated by an estimator derived assuming multivariate normality. However, real data are seldom normally distributed, causing this normal estimator to be inconsistent. A general estimator, which does not rely on the…

  18. State policy influence on the early diffusion of buprenorphine in community treatment programs.

    PubMed

    Ducharme, Lori J; Abraham, Amanda J

    2008-06-20

    Buprenorphine was approved for use in the treatment of opioid dependence in 2002, but its diffusion into everyday clinical practice in community-based treatment programs has been slow. This study examines the net impact of efforts by state agencies, including provision of Medicaid coverage, on program-level adoption of buprenorphine as of 2006. Interviews were conducted with key informants in 49 of the 50 state agencies with oversight responsibility for addiction treatment services. Information from these interviews was integrated with organizational data from the 2006 National Survey of Substance Abuse Treatment Services. A multivariate logistic regression model was estimated to identify the effects of state efforts to promote the use of this medication, net of a host of organizational characteristics. The availability of Medicaid coverage for buprenorphine was a significant predictor of its adoption by treatment organizations. Inclusion of buprenorphine on state Medicaid formularies appears to be a key element in ensuring that patients have access to this state-of-the-art treatment option. Other potential barriers to the diffusion of buprenorphine require identification, and the value of additional state-level policies to promote its use should be evaluated.

  19. Genetic analysis of Holstein cattle populations in Brazil and the United States.

    PubMed

    Costa, C N; Blake, R W; Pollak, E J; Oltenacu, P A; Quaas, R L; Searle, S R

    2000-12-01

    Genetic relationships between Brazilian and US Holstein cattle populations were studied using first-lactation records of 305-d mature equivalent (ME) yields of milk and fat of daughters of 705 sires in Brazil and 701 sires in the United States, 358 of which had progeny in both countries. Components of(co)variance and genetic parameters were estimated from all data and from within herd-year standard deviation for milk (HYSD) data files using bivariate and multivariate sire models and DFREML procedures distinguishing the two countries. Sire (residual) variances from all data for milk yield were 51 to 59% (58 to 101%) as large in Brazil as those obtained from half-sisters in the average US herd. Corresponding proportions of the US variance in fat yield that were found in Brazil were 30 to 41% for the sire component of variance and 48 to 80% for the residual. Heritabilities for milk and fat yields from multivariate analysis of all the data were 0.25 and 0.22 in Brazil, and 0.34 and 0.35 in the United States. Genetic correlations between milk and fat were 0.79 in Brazil and 0.62 in the United States. Genetic correlations between countries were 0.85 for milk, 0.88 for fat, 0.55 for milk in Brazil and fat in the US, and 0.67 for fat in Brazil and milk in the United States. Correlated responses in Brazil from sire selection based on the US information increased with average HYSD in Brazil. Largest daughter yield response was predicted from information from half-sisters in low HYSD US herds (0.75 kg/kg for milk; 0.63 kg/kg for fat), which was 14% to 17% greater than estimates from all US herds because the scaling effects were less severe from heterogeneous variances. Unequal daughter response from unequal genetic (co)variances under restrictive Brazilian conditions is evidence for the interaction of genotype and environment. The smaller and variable yield expectations of daughters of US sires in Brazilian environments suggest the need for specific genetic improvement strategies in Brazilian Holstein herds. A US data file restricting daughter information to low HYSD US environments would be a wise choice for across-country evaluation. Procedures to incorporate such foreign evaluations should be explored to improve the accuracy of genetic evaluations for the Brazilian Holstein population.

  20. The Dose–Response Relationship Between Tobacco Education Advertising and Calls to Quitlines in the United States, March–June, 2012

    PubMed Central

    Alexander, Robert L.; Shafer, Paul; Mann, Nathan; Malarcher, Ann; Zhang, Lei

    2015-01-01

    Introduction We estimated changes in call volume in the United States in response to increases in advertising doses of the Tips From Former Smokers (Tips) campaign, the first federal national tobacco education campaign, which aired for 12 weeks from March 19 to June 10, 2012. We also measured the effectiveness of ad taglines that promoted calls directly with a quitline number (1-800-QUIT-NOW) and indirectly with a cessation help website (Smokefree.gov). Methods Multivariate regressions estimated the weekly number of calls to 1–800-QUIT-NOW by area code as a function of weekly market-level gross rating points (GRPs) from CDC’s Tips campaign in 2012. The number of quitline calls attributable solely to Tips was predicted. Results For quitline-tagged ads, an additional 100 television GRPs per week was associated with an increase of 89 calls per week in a typical area code in the United States (P < .001). The same unit increase in advertising GRPs for ads tagged with Smokefree.gov was associated with an increase of 29 calls per week in any given area code (P < .001). We estimated that the Tips campaign was responsible for more than 170,000 additional calls to 1–800-QUIT-NOW during the campaign and that it would have generated approximately 140,000 additional calls if all ads were tagged with 1–800-QUIT-NOW. Conclusion For campaign planners, these results make it possible to estimate 1) the likely impact of tobacco prevention media buys and 2) the additional quitline capacity needed at the national level should future campaigns of similar scale use 1–800-QUIT-NOW taglines exclusively. PMID:26542143

  1. The Dose-Response Relationship Between Tobacco Education Advertising and Calls to Quitlines in the United States, March-June, 2012.

    PubMed

    Davis, Kevin C; Alexander, Robert L; Shafer, Paul; Mann, Nathan; Malarcher, Ann; Zhang, Lei

    2015-11-05

    We estimated changes in call volume in the United States in response to increases in advertising doses of the Tips From Former Smokers (Tips) campaign, the first federal national tobacco education campaign, which aired for 12 weeks from March 19 to June 10, 2012. We also measured the effectiveness of ad taglines that promoted calls directly with a quitline number (1-800-QUIT-NOW) and indirectly with a cessation help website (Smokefree.gov). Multivariate regressions estimated the weekly number of calls to 1-800-QUIT-NOW by area code as a function of weekly market-level gross rating points (GRPs) from CDC's Tips campaign in 2012. The number of quitline calls attributable solely to Tips was predicted. For quitline-tagged ads, an additional 100 television GRPs per week was associated with an increase of 89 calls per week in a typical area code in the United States (P < .001). The same unit increase in advertising GRPs for ads tagged with Smokefree.gov was associated with an increase of 29 calls per week in any given area code (P < .001). We estimated that the Tips campaign was responsible for more than 170,000 additional calls to 1-800-QUIT-NOW during the campaign and that it would have generated approximately 140,000 additional calls if all ads were tagged with 1-800-QUIT-NOW. For campaign planners, these results make it possible to estimate 1) the likely impact of tobacco prevention media buys and 2) the additional quitline capacity needed at the national level should future campaigns of similar scale use 1-800-QUIT-NOW taglines exclusively.

  2. Geostatistics and petroleum geology

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Hohn, M.E.

    1988-01-01

    This book examines purpose and use of geostatistics in exploration and development of oil and gas with an emphasis on appropriate and pertinent case studies. It present an overview of geostatistics. Topics covered include: The semivariogram; Linear estimation; Multivariate geostatistics; Nonlinear estimation; From indicator variables to nonparametric estimation; and More detail, less certainty; conditional simulation.

  3. Estimating a graphical intra-class correlation coefficient (GICC) using multivariate probit-linear mixed models.

    PubMed

    Yue, Chen; Chen, Shaojie; Sair, Haris I; Airan, Raag; Caffo, Brian S

    2015-09-01

    Data reproducibility is a critical issue in all scientific experiments. In this manuscript, the problem of quantifying the reproducibility of graphical measurements is considered. The image intra-class correlation coefficient (I2C2) is generalized and the graphical intra-class correlation coefficient (GICC) is proposed for such purpose. The concept for GICC is based on multivariate probit-linear mixed effect models. A Markov Chain Monte Carlo EM (mcm-cEM) algorithm is used for estimating the GICC. Simulation results with varied settings are demonstrated and our method is applied to the KIRBY21 test-retest dataset.

  4. Quantifying the individual-level association between income and mortality risk in the United States using the National Longitudinal Mortality Study.

    PubMed

    Brodish, Paul Henry; Hakes, Jahn K

    2016-12-01

    Policy makers would benefit from being able to estimate the likely impact of potential interventions to reverse the effects of rapidly rising income inequality on mortality rates. Using multiple cohorts of the National Longitudinal Mortality Study (NLMS), we estimate the absolute income effect on premature mortality in the United States. A multivariate Poisson regression using the natural logarithm of equivilized household income establishes the magnitude of the absolute income effect on mortality. We calculate mortality rates for each income decile of the study sample and mortality rate ratios relative to the decile containing mean income. We then apply the estimated income effect to two kinds of hypothetical interventions that would redistribute income. The first lifts everyone with an equivalized household income at or below the U.S. poverty line (in 2000$) out of poverty, to the income category just above the poverty line. The second shifts each family's equivalized income by, in turn, 10%, 20%, 30%, or 40% toward the mean household income, equivalent to reducing the Gini coefficient by the same percentage in each scenario. We also assess mortality disparities of the hypothetical interventions using ratios of mortality rates of the ninth and second income deciles, and test sensitivity to the assumption of causality of income on mortality by halving the mortality effect per unit of equivalized household income. The estimated absolute income effect would produce a three to four percent reduction in mortality for a 10% reduction in the Gini coefficient. Larger mortality reductions result from larger reductions in the Gini, but with diminishing returns. Inequalities in estimated mortality rates are reduced by a larger percentage than overall estimated mortality rates under the same hypothetical redistributions. Copyright © 2016 Elsevier Ltd. All rights reserved.

  5. What factors influence smoking prevalence and smoke free policy enactment across the European Union Member States.

    PubMed

    Bogdanovica, Ilze; McNeill, Ann; Murray, Rachael; Britton, John

    2011-01-01

    Smoking prevention should be a primary public health priority for all governments, and effective preventive policies have been identified for decades. The heterogeneity of smoking prevalence between European Union (EU) Member States therefore reflects, at least in part, a failure by governments to prioritise public health over tobacco industry or possibly other financial interests, and hence potentially government corruption. The aims of this study were to test the hypothesis that smoking prevalence is higher in countries with high levels of public sector corruption, and explore the ecological association between smoking prevalence and a range of other national characteristics in current EU Member States. Ecological data from 27 EU Member States were used to estimate univariate and multivariate correlations between smoking prevalence and the Transparency International Corruption Perceptions Index, and a range of other national characteristics including economic development, social inclusion, quality of life and importance of religion. We also explored the association between the Corruption Perceptions Index and measures of the extent to which smoke-free policies have been enacted and are enforced. In univariate analysis, smoking prevalence was significantly higher in countries with higher scores for corruption, material deprivation, and gender inequality; and lower in countries with higher per capita Gross Domestic Product, social spending, life satisfaction and human development scores. In multivariate analysis, only the corruption perception index was independently related to smoking prevalence. Exposure to tobacco smoke in the workplace was also correlated with corruption, independently from smoking prevalence, but not with the measures of national smoke-free policy implementation. Corruption appears to be an important risk factor for failure of national tobacco control activity in EU countries, and the extent to which key tobacco control policies have been implemented. Further research is needed to assess the causal relationships involved.

  6. Estimation of railroad capacity using parametric methods.

    DOT National Transportation Integrated Search

    2013-12-01

    This paper reviews different methodologies used for railroad capacity estimation and presents a user-friendly method to measure capacity. The objective of this paper is to use multivariate regression analysis to develop a continuous relation of the d...

  7. A mixed-effects regression model for longitudinal multivariate ordinal data.

    PubMed

    Liu, Li C; Hedeker, Donald

    2006-03-01

    A mixed-effects item response theory model that allows for three-level multivariate ordinal outcomes and accommodates multiple random subject effects is proposed for analysis of multivariate ordinal outcomes in longitudinal studies. This model allows for the estimation of different item factor loadings (item discrimination parameters) for the multiple outcomes. The covariates in the model do not have to follow the proportional odds assumption and can be at any level. Assuming either a probit or logistic response function, maximum marginal likelihood estimation is proposed utilizing multidimensional Gauss-Hermite quadrature for integration of the random effects. An iterative Fisher scoring solution, which provides standard errors for all model parameters, is used. An analysis of a longitudinal substance use data set, where four items of substance use behavior (cigarette use, alcohol use, marijuana use, and getting drunk or high) are repeatedly measured over time, is used to illustrate application of the proposed model.

  8. Multivariate random-parameters zero-inflated negative binomial regression model: an application to estimate crash frequencies at intersections.

    PubMed

    Dong, Chunjiao; Clarke, David B; Yan, Xuedong; Khattak, Asad; Huang, Baoshan

    2014-09-01

    Crash data are collected through police reports and integrated with road inventory data for further analysis. Integrated police reports and inventory data yield correlated multivariate data for roadway entities (e.g., segments or intersections). Analysis of such data reveals important relationships that can help focus on high-risk situations and coming up with safety countermeasures. To understand relationships between crash frequencies and associated variables, while taking full advantage of the available data, multivariate random-parameters models are appropriate since they can simultaneously consider the correlation among the specific crash types and account for unobserved heterogeneity. However, a key issue that arises with correlated multivariate data is the number of crash-free samples increases, as crash counts have many categories. In this paper, we describe a multivariate random-parameters zero-inflated negative binomial (MRZINB) regression model for jointly modeling crash counts. The full Bayesian method is employed to estimate the model parameters. Crash frequencies at urban signalized intersections in Tennessee are analyzed. The paper investigates the performance of MZINB and MRZINB regression models in establishing the relationship between crash frequencies, pavement conditions, traffic factors, and geometric design features of roadway intersections. Compared to the MZINB model, the MRZINB model identifies additional statistically significant factors and provides better goodness of fit in developing the relationships. The empirical results show that MRZINB model possesses most of the desirable statistical properties in terms of its ability to accommodate unobserved heterogeneity and excess zero counts in correlated data. Notably, in the random-parameters MZINB model, the estimated parameters vary significantly across intersections for different crash types. Copyright © 2014 Elsevier Ltd. All rights reserved.

  9. Is Exposure to Income Inequality a Public Health Concern? Lagged Effects of Income Inequality on Individual and Population Health

    PubMed Central

    Mellor, Jennifer M; Milyo, Jeffrey

    2003-01-01

    Objective To examine the health consequences of exposure to income inequality. Data Sources Secondary analysis employing data from several publicly available sources. Measures of individual health status and other individual characteristics are obtained from the March Current Population Survey (CPS). State-level income inequality is measured by the Gini coefficient based on family income, as reported by the U.S. Census Bureau and Al-Samarrie and Miller (1967). State-level mortality rates are from the Vital Statistics of the United States; other state-level characteristics are from U.S. census data as reported in the Statistical Abstract of the United States. Study Design We examine the effects of state-level income inequality lagged from 5 to 29 years on individual health by estimating probit models of poor/fair health status for samples of adults aged 25–74 in the 1995 through 1999 March CPS. We control for several individual characteristics, including educational attainment and household income, as well as regional fixed effects. We use multivariate regression to estimate the effects of income inequality lagged 10 and 20 years on state-level mortality rates for 1990, 1980, 1970, and 1960. Principal Findings Lagged income inequality is not significantly associated with individual health status after controlling for regional fixed effects. Lagged income inequality is not associated with all cause mortality, but associated with reduced mortality from cardiovascular disease and malignant neoplasms, after controlling for state fixed-effects. Conclusions In contrast to previous studies that fail to control for regional variations in health outcomes, we find little support for the contention that exposure to income inequality is detrimental to either individual or population health. PMID:12650385

  10. The effect of physician and health plan market concentration on prices in commercial health insurance markets.

    PubMed

    Schneider, John E; Li, Pengxiang; Klepser, Donald G; Peterson, N Andrew; Brown, Timothy T; Scheffler, Richard M

    2008-03-01

    The objective of this paper is to describe the market structure of health plans (HPs) and physician organizations (POs) in California, a state with high levels of managed care penetration and selective contracting. First we calculate Herfindahl-Hirschman (HHI) concentration indices for HPs and POs in 42 California counties. We then estimate a multivariable regression model to examine the relationship between concentration measures and the prices paid by HPs to POs. Price data is from Medstat MarketScan databases. The findings show that any California counties exhibit what the Department of Justice would consider high HHI concentration measures, in excess of 1,800. More than three quarters of California counties exhibit HP concentration indices over 1,800, and 83% of counties have PO concentration levels in excess of 1,800. Half of the study counties exhibited PO concentration levels in excess of 3,600, compared to only 24% for plans. Multivariate price models suggest that PO concentration is associated with higher physician prices (p < or = 0.05), whereas HP concentration does not appear to be significantly associated with higher outpatient commercial payer prices.

  11. Development of a robust framework for controlling high performance turbofan engines

    NASA Astrophysics Data System (ADS)

    Miklosovic, Robert

    This research involves the development of a robust framework for controlling complex and uncertain multivariable systems. Where mathematical modeling is often tedious or inaccurate, the new method uses an extended state observer (ESO) to estimate and cancel dynamic information in real time and dynamically decouple the system. As a result, controller design and tuning become transparent as the number of required model parameters is reduced. Much research has been devoted towards the application of modern multivariable control techniques on aircraft engines. However, few, if any, have been implemented on an operational aircraft, partially due to the difficulty in tuning the controller for satisfactory performance. The new technique is applied to a modern two-spool, high-pressure ratio, low-bypass turbofan with mixed-flow afterburning. A realistic Modular Aero-Propulsion System Simulation (MAPSS) package, developed by NASA, is used to demonstrate the new design process and compare its performance with that of a supplied nominal controller. This approach is expected to reduce gain scheduling over the full operating envelope of the engine and allow a controller to be tuned for engine-to-engine variations.

  12. Estimation of parameters in Shot-Noise-Driven Doubly Stochastic Poisson processes using the EM algorithm--modeling of pre- and postsynaptic spike trains.

    PubMed

    Mino, H

    2007-01-01

    To estimate the parameters, the impulse response (IR) functions of some linear time-invariant systems generating intensity processes, in Shot-Noise-Driven Doubly Stochastic Poisson Process (SND-DSPP) in which multivariate presynaptic spike trains and postsynaptic spike trains can be assumed to be modeled by the SND-DSPPs. An explicit formula for estimating the IR functions from observations of multivariate input processes of the linear systems and the corresponding counting process (output process) is derived utilizing the expectation maximization (EM) algorithm. The validity of the estimation formula was verified through Monte Carlo simulations in which two presynaptic spike trains and one postsynaptic spike train were assumed to be observable. The IR functions estimated on the basis of the proposed identification method were close to the true IR functions. The proposed method will play an important role in identifying the input-output relationship of pre- and postsynaptic neural spike trains in practical situations.

  13. Real-Time Parameter Estimation Method Applied to a MIMO Process and its Comparison with an Offline Identification Method

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Kaplanoglu, Erkan; Safak, Koray K.; Varol, H. Selcuk

    2009-01-12

    An experiment based method is proposed for parameter estimation of a class of linear multivariable systems. The method was applied to a pressure-level control process. Experimental time domain input/output data was utilized in a gray-box modeling approach. Prior knowledge of the form of the system transfer function matrix elements is assumed to be known. Continuous-time system transfer function matrix parameters were estimated in real-time by the least-squares method. Simulation results of experimentally determined system transfer function matrix compare very well with the experimental results. For comparison and as an alternative to the proposed real-time estimation method, we also implemented anmore » offline identification method using artificial neural networks and obtained fairly good results. The proposed methods can be implemented conveniently on a desktop PC equipped with a data acquisition board for parameter estimation of moderately complex linear multivariable systems.« less

  14. U.S. Citizen Children of Undocumented Parents: The Link Between State Immigration Policy and the Health of Latino Children

    PubMed Central

    Vargas, Edward D.; Ybarra, Vickie D.

    2016-01-01

    Background We examine Latino citizen children in mixed-status families and how their physical health status compares to their U.S. citizen, co-ethnic counterparts. We also examine Latino parents’ perceptions of state immigration policy and its implications for child health status. Methods Using the 2015 Latino National Health and Immigration Survey (n=1493), we estimate a series of multivariate ordered logistic regression models with mixed-status family and perceptions of state immigration policy as primary predictors. Results We find that mixed-status families report worse physical health for their children as compared to their U.S. citizen co-ethnics. We also find that parental perceptions of their states’ immigration status further exacerbate health disparities between families. Discussion These findings have implications for scholars and policy makers interested in immigrant health, family wellbeing, and health disparities in complex family structures. They contribute to the scholarship on Latino child health and on the erosion of the Latino immigrant health advantage. PMID:27435476

  15. The Dirichlet-Multinomial Model for Multivariate Randomized Response Data and Small Samples

    ERIC Educational Resources Information Center

    Avetisyan, Marianna; Fox, Jean-Paul

    2012-01-01

    In survey sampling the randomized response (RR) technique can be used to obtain truthful answers to sensitive questions. Although the individual answers are masked due to the RR technique, individual (sensitive) response rates can be estimated when observing multivariate response data. The beta-binomial model for binary RR data will be generalized…

  16. Decomposing biodiversity data using the Latent Dirichlet Allocation model, a probabilistic multivariate statistical method

    Treesearch

    Denis Valle; Benjamin Baiser; Christopher W. Woodall; Robin Chazdon; Jerome Chave

    2014-01-01

    We propose a novel multivariate method to analyse biodiversity data based on the Latent Dirichlet Allocation (LDA) model. LDA, a probabilistic model, reduces assemblages to sets of distinct component communities. It produces easily interpretable results, can represent abrupt and gradual changes in composition, accommodates missing data and allows for coherent estimates...

  17. Outlier Detection in Hyperspectral Imagery Using Closest Distance to Center with Ellipsoidal Multivariate Trimming

    DTIC Science & Technology

    2011-01-01

    where r << P. The use of PCA for finding outliers in multivariate data is surveyed by Gnanadesikan and Kettenring16 and Rao.17 As alluded to earlier...1984. 16. Gnanadesikan R and Kettenring JR. Robust estimates, residu­ als, and outlier detection with multiresponse data. Biometrics 1972; 28: 81–124

  18. Kalman filter for statistical monitoring of forest cover across sub-continental regions

    Treesearch

    Raymond L. Czaplewski

    1991-01-01

    The Kalman filter is a multivariate generalization of the composite estimator which recursively combines a current direct estimate with a past estimate that is updated for expected change over time with a prediction model. The Kalman filter can estimate proportions of different cover types for sub-continental regions each year. A random sample of high-resolution...

  19. Observational needs for estimating Alaskan soil carbon stocks under current and future climate

    DOE PAGES

    Vitharana, U. W. A.; Mishra, U.; Jastrow, J. D.; ...

    2017-01-24

    Representing land surface spatial heterogeneity when designing observation networks is a critical scientific challenge. Here we present a geospatial approach that utilizes the multivariate spatial heterogeneity of soil-forming factors—namely, climate, topography, land cover types, and surficial geology—to identify observation sites to improve soil organic carbon (SOC) stock estimates across the State of Alaska, USA. Standard deviations in existing SOC samples indicated that 657, 870, and 906 randomly distributed pedons would be required to quantify the average SOC stocks for 0–1 m, 0–2 m, and whole-profile depths, respectively, at a confidence interval of 5 kg C m -2. Using the spatialmore » correlation range of existing SOC samples, we identified that 309, 446, and 484 new observation sites are needed to estimate current SOC stocks to 1 m, 2 m, and whole-profile depths, respectively. We also investigated whether the identified sites might change under future climate by using eight decadal (2020–2099) projections of precipitation, temperature, and length of growing season for three representative concentration pathway (RCP 4.5, 6.0, and 8.5) scenarios of the Intergovernmental Panel on Climate Change. These analyses determined that 12 to 41 additional sites (906 + 12 to 41; depending upon the emission scenarios) would be needed to capture the impact of future climate on Alaskan whole-profile SOC stocks by 2100. The identified observation sites represent spatially distributed locations across Alaska that captures the multivariate heterogeneity of soil-forming factors under current and future climatic conditions. This information is needed for designing monitoring networks and benchmarking of Earth system model results.« less

  20. Investigation of the Sensitivity of Transmission Raman Spectroscopy for Polymorph Detection in Pharmaceutical Tablets.

    PubMed

    Feng, Hanzhou; Bondi, Robert W; Anderson, Carl A; Drennen, James K; Igne, Benoît

    2017-08-01

    Polymorph detection is critical for ensuring pharmaceutical product quality in drug substances exhibiting polymorphism. Conventional analytical techniques such as X-ray powder diffraction and solid-state nuclear magnetic resonance are utilized primarily for characterizing the presence and identity of specific polymorphs in a sample. These techniques have encountered challenges in analyzing the constitution of polymorphs in the presence of other components commonly found in pharmaceutical dosage forms. Laborious sample preparation procedures are usually required to achieve satisfactory data interpretability. There is a need for alternative techniques capable of probing pharmaceutical dosage forms rapidly and nondestructively, which is dictated by the practical requirements of applications such as quality monitoring on production lines or when quantifying product shelf lifetime. The sensitivity of transmission Raman spectroscopy for detecting polymorphs in final tablet cores was investigated in this work. Carbamazepine was chosen as a model drug, polymorph form III is the commercial form, whereas form I is an undesired polymorph that requires effective detection. The concentration of form I in a direct compression tablet formulation containing 20% w/w of carbamazepine, 74.00% w/w of fillers (mannitol and microcrystalline cellulose), and 6% w/w of croscarmellose sodium, silicon dioxide, and magnesium stearate was estimated using transmission Raman spectroscopy. Quantitative models were generated and optimized using multivariate regression and data preprocessing. Prediction uncertainty was estimated for each validation sample by accounting for all the main variables contributing to the prediction. Multivariate detection limits were calculated based on statistical hypothesis testing. The transmission Raman spectroscopic model had an absolute prediction error of 0.241% w/w for the independent validation set. The method detection limit was estimated at 1.31% w/w. The results demonstrated that transmission Raman spectroscopy is a sensitive tool for polymorphs detection in pharmaceutical tablets.

  1. Observational needs for estimating Alaskan soil carbon stocks under current and future climate

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Vitharana, U. W. A.; Mishra, U.; Jastrow, J. D.

    Representing land surface spatial heterogeneity when designing observation networks is a critical scientific challenge. Here we present a geospatial approach that utilizes the multivariate spatial heterogeneity of soil-forming factors—namely, climate, topography, land cover types, and surficial geology—to identify observation sites to improve soil organic carbon (SOC) stock estimates across the State of Alaska, USA. Standard deviations in existing SOC samples indicated that 657, 870, and 906 randomly distributed pedons would be required to quantify the average SOC stocks for 0–1 m, 0–2 m, and whole-profile depths, respectively, at a confidence interval of 5 kg C m -2. Using the spatialmore » correlation range of existing SOC samples, we identified that 309, 446, and 484 new observation sites are needed to estimate current SOC stocks to 1 m, 2 m, and whole-profile depths, respectively. We also investigated whether the identified sites might change under future climate by using eight decadal (2020–2099) projections of precipitation, temperature, and length of growing season for three representative concentration pathway (RCP 4.5, 6.0, and 8.5) scenarios of the Intergovernmental Panel on Climate Change. These analyses determined that 12 to 41 additional sites (906 + 12 to 41; depending upon the emission scenarios) would be needed to capture the impact of future climate on Alaskan whole-profile SOC stocks by 2100. The identified observation sites represent spatially distributed locations across Alaska that captures the multivariate heterogeneity of soil-forming factors under current and future climatic conditions. This information is needed for designing monitoring networks and benchmarking of Earth system model results.« less

  2. Perceived Need Versus Current Spending: Gaps in Providing Foundational Public Health Services in Communities.

    PubMed

    Bekemeier, Betty; Marlowe, Justin; Squires, Linda Sharee; Tebaldi, Jennifer; Park, Seungeun

    Our objective was to estimate the gap between the costs for local health jurisdictions (LHJs) to provide foundational public health services (FPHS) and actual spending on FPHS and to examine factors associated with that gap. We employed resource-based cost estimation methods for this observational study and conducted multivariate analyses with measures derived from secondary administrative data. We used primary data collected from LHJ leaders that depicted 2014 spending and perceived need. We also included secondary administrative data depicting annual 2000-2013 expenditures organized into categories containing key elements of FPHS areas. We included primary data from a representative sample of 10 LHJs in Washington State and secondary data for all 35 LHJs in Washington. Participants were public health practice leaders from each sample LHJ. Our main outcome of interest was the gap identified between current spending and the perceived spending needed to provide FPHS in a jurisdiction. Actual FPHS spending was approximately 65% of spending needed to provide overall FPHS for our sample LHJs, but the size of the gap varied substantially by program. Some gaps also varied widely by LHJ, with spending gaps widest among rural and high poverty communities. Percent poverty and the metropolitan nature of a jurisdiction were factors significantly related to FPHS spending in our multivariate analyses. Actual spending lags far behind local officials' estimates of spending needed to provide FPHS and is likely influenced by local conditions. Major apparent gaps between spending and need, particularly in areas such as costly Business Competencies, underscore the need for cross-cutting capabilities to support public health system responsiveness and for attention to be paid to local conditions.

  3. Statistical analysis of latent generalized correlation matrix estimation in transelliptical distribution.

    PubMed

    Han, Fang; Liu, Han

    2017-02-01

    Correlation matrix plays a key role in many multivariate methods (e.g., graphical model estimation and factor analysis). The current state-of-the-art in estimating large correlation matrices focuses on the use of Pearson's sample correlation matrix. Although Pearson's sample correlation matrix enjoys various good properties under Gaussian models, its not an effective estimator when facing heavy-tail distributions with possible outliers. As a robust alternative, Han and Liu (2013b) advocated the use of a transformed version of the Kendall's tau sample correlation matrix in estimating high dimensional latent generalized correlation matrix under the transelliptical distribution family (or elliptical copula). The transelliptical family assumes that after unspecified marginal monotone transformations, the data follow an elliptical distribution. In this paper, we study the theoretical properties of the Kendall's tau sample correlation matrix and its transformed version proposed in Han and Liu (2013b) for estimating the population Kendall's tau correlation matrix and the latent Pearson's correlation matrix under both spectral and restricted spectral norms. With regard to the spectral norm, we highlight the role of "effective rank" in quantifying the rate of convergence. With regard to the restricted spectral norm, we for the first time present a "sign subgaussian condition" which is sufficient to guarantee that the rank-based correlation matrix estimator attains the optimal rate of convergence. In both cases, we do not need any moment condition.

  4. Application of two tests of multivariate discordancy to fisheries data sets

    USGS Publications Warehouse

    Stapanian, M.A.; Kocovsky, P.M.; Garner, F.C.

    2008-01-01

    The generalized (Mahalanobis) distance and multivariate kurtosis are two powerful tests of multivariate discordancies (outliers). Unlike the generalized distance test, the multivariate kurtosis test has not been applied as a test of discordancy to fisheries data heretofore. We applied both tests, along with published algorithms for identifying suspected causal variable(s) of discordant observations, to two fisheries data sets from Lake Erie: total length, mass, and age from 1,234 burbot, Lota lota; and 22 combinations of unique subsets of 10 morphometrics taken from 119 yellow perch, Perca flavescens. For the burbot data set, the generalized distance test identified six discordant observations and the multivariate kurtosis test identified 24 discordant observations. In contrast with the multivariate tests, the univariate generalized distance test identified no discordancies when applied separately to each variable. Removing discordancies had a substantial effect on length-versus-mass regression equations. For 500-mm burbot, the percent difference in estimated mass after removing discordancies in our study was greater than the percent difference in masses estimated for burbot of the same length in lakes that differed substantially in productivity. The number of discordant yellow perch detected ranged from 0 to 2 with the multivariate generalized distance test and from 6 to 11 with the multivariate kurtosis test. With the kurtosis test, 108 yellow perch (90.7%) were identified as discordant in zero to two combinations, and five (4.2%) were identified as discordant in either all or 21 of the 22 combinations. The relationship among the variables included in each combination determined which variables were identified as causal. The generalized distance test identified between zero and six discordancies when applied separately to each variable. Removing the discordancies found in at least one-half of the combinations (k=5) had a marked effect on a principal components analysis. In particular, the percent of the total variation explained by second and third principal components, which explain shape, increased by 52 and 44% respectively when the discordancies were removed. Multivariate applications of the tests have numerous ecological advantages over univariate applications, including improved management of fish stocks and interpretation of multivariate morphometric data. ?? 2007 Springer Science+Business Media B.V.

  5. Tracking the time-varying cortical connectivity patterns by adaptive multivariate estimators.

    PubMed

    Astolfi, L; Cincotti, F; Mattia, D; De Vico Fallani, F; Tocci, A; Colosimo, A; Salinari, S; Marciani, M G; Hesse, W; Witte, H; Ursino, M; Zavaglia, M; Babiloni, F

    2008-03-01

    The directed transfer function (DTF) and the partial directed coherence (PDC) are frequency-domain estimators that are able to describe interactions between cortical areas in terms of the concept of Granger causality. However, the classical estimation of these methods is based on the multivariate autoregressive modelling (MVAR) of time series, which requires the stationarity of the signals. In this way, transient pathways of information transfer remains hidden. The objective of this study is to test a time-varying multivariate method for the estimation of rapidly changing connectivity relationships between cortical areas of the human brain, based on DTF/PDC and on the use of adaptive MVAR modelling (AMVAR) and to apply it to a set of real high resolution EEG data. This approach will allow the observation of rapidly changing influences between the cortical areas during the execution of a task. The simulation results indicated that time-varying DTF and PDC are able to estimate correctly the imposed connectivity patterns under reasonable operative conditions of signal-to-noise ratio (SNR) ad number of trials. An SNR of five and a number of trials of at least 20 provide a good accuracy in the estimation. After testing the method by the simulation study, we provide an application to the cortical estimations obtained from high resolution EEG data recorded from a group of healthy subject during a combined foot-lips movement and present the time-varying connectivity patterns resulting from the application of both DTF and PDC. Two different cortical networks were detected with the proposed methods, one constant across the task and the other evolving during the preparation of the joint movement.

  6. Advancing the detection of steady-state visual evoked potentials in brain-computer interfaces

    NASA Astrophysics Data System (ADS)

    Abu-Alqumsan, Mohammad; Peer, Angelika

    2016-06-01

    Objective. Spatial filtering has proved to be a powerful pre-processing step in detection of steady-state visual evoked potentials and boosted typical detection rates both in offline analysis and online SSVEP-based brain-computer interface applications. State-of-the-art detection methods and the spatial filters used thereby share many common foundations as they all build upon the second order statistics of the acquired Electroencephalographic (EEG) data, that is, its spatial autocovariance and cross-covariance with what is assumed to be a pure SSVEP response. The present study aims at highlighting the similarities and differences between these methods. Approach. We consider the canonical correlation analysis (CCA) method as a basis for the theoretical and empirical (with real EEG data) analysis of the state-of-the-art detection methods and the spatial filters used thereby. We build upon the findings of this analysis and prior research and propose a new detection method (CVARS) that combines the power of the canonical variates and that of the autoregressive spectral analysis in estimating the signal and noise power levels. Main results. We found that the multivariate synchronization index method and the maximum contrast combination method are variations of the CCA method. All three methods were found to provide relatively unreliable detections in low signal-to-noise ratio (SNR) regimes. CVARS and the minimum energy combination methods were found to provide better estimates for different SNR levels. Significance. Our theoretical and empirical results demonstrate that the proposed CVARS method outperforms other state-of-the-art detection methods when used in an unsupervised fashion. Furthermore, when used in a supervised fashion, a linear classifier learned from a short training session is able to estimate the hidden user intention, including the idle state (when the user is not attending to any stimulus), rapidly, accurately and reliably.

  7. Comparative forensic soil analysis of New Jersey state parks using a combination of simple techniques with multivariate statistics.

    PubMed

    Bonetti, Jennifer; Quarino, Lawrence

    2014-05-01

    This study has shown that the combination of simple techniques with the use of multivariate statistics offers the potential for the comparative analysis of soil samples. Five samples were obtained from each of twelve state parks across New Jersey in both the summer and fall seasons. Each sample was examined using particle-size distribution, pH analysis in both water and 1 M CaCl2 , and a loss on ignition technique. Data from each of the techniques were combined, and principal component analysis (PCA) and canonical discriminant analysis (CDA) were used for multivariate data transformation. Samples from different locations could be visually differentiated from one another using these multivariate plots. Hold-one-out cross-validation analysis showed error rates as low as 3.33%. Ten blind study samples were analyzed resulting in no misclassifications using Mahalanobis distance calculations and visual examinations of multivariate plots. Seasonal variation was minimal between corresponding samples, suggesting potential success in forensic applications. © 2014 American Academy of Forensic Sciences.

  8. Variation In Health Outcomes: The Role Of Spending On Social Services, Public Health, And Health Care, 2000-09.

    PubMed

    Bradley, Elizabeth H; Canavan, Maureen; Rogan, Erika; Talbert-Slagle, Kristina; Ndumele, Chima; Taylor, Lauren; Curry, Leslie A

    2016-05-01

    Although spending rates on health care and social services vary substantially across the states, little is known about the possible association between variation in state-level health outcomes and the allocation of state spending between health care and social services. To estimate that association, we used state-level repeated measures multivariable modeling for the period 2000-09, with region and time fixed effects adjusted for total spending and state demographic and economic characteristics and with one- and two-year lags. We found that states with a higher ratio of social to health spending (calculated as the sum of social service spending and public health spending divided by the sum of Medicare spending and Medicaid spending) had significantly better subsequent health outcomes for the following seven measures: adult obesity; asthma; mentally unhealthy days; days with activity limitations; and mortality rates for lung cancer, acute myocardial infarction, and type 2 diabetes. Our study suggests that broadening the debate beyond what should be spent on health care to include what should be invested in health-not only in health care but also in social services and public health-is warranted. Project HOPE—The People-to-People Health Foundation, Inc.

  9. Robustness of reduced-order multivariable state-space self-tuning controller

    NASA Technical Reports Server (NTRS)

    Yuan, Zhuzhi; Chen, Zengqiang

    1994-01-01

    In this paper, we present a quantitative analysis of the robustness of a reduced-order pole-assignment state-space self-tuning controller for a multivariable adaptive control system whose order of the real process is higher than that of the model used in the controller design. The result of stability analysis shows that, under a specific bounded modelling error, the adaptively controlled closed-loop real system via the reduced-order state-space self-tuner is BIBO stable in the presence of unmodelled dynamics.

  10. Statistical analysis of multivariate atmospheric variables. [cloud cover

    NASA Technical Reports Server (NTRS)

    Tubbs, J. D.

    1979-01-01

    Topics covered include: (1) estimation in discrete multivariate distributions; (2) a procedure to predict cloud cover frequencies in the bivariate case; (3) a program to compute conditional bivariate normal parameters; (4) the transformation of nonnormal multivariate to near-normal; (5) test of fit for the extreme value distribution based upon the generalized minimum chi-square; (6) test of fit for continuous distributions based upon the generalized minimum chi-square; (7) effect of correlated observations on confidence sets based upon chi-square statistics; and (8) generation of random variates from specified distributions.

  11. Interference correction by extracting the information of interference dominant regions: Application to near-infrared spectra

    NASA Astrophysics Data System (ADS)

    Bi, Yiming; Tang, Liang; Shan, Peng; Xie, Qiong; Hu, Yong; Peng, Silong; Tan, Jie; Li, Changwen

    2014-08-01

    Interference such as baseline drift and light scattering can degrade the model predictability in multivariate analysis of near-infrared (NIR) spectra. Usually interference can be represented by an additive and a multiplicative factor. In order to eliminate these interferences, correction parameters are needed to be estimated from spectra. However, the spectra are often mixed of physical light scattering effects and chemical light absorbance effects, making it difficult for parameter estimation. Herein, a novel algorithm was proposed to find a spectral region automatically that the interesting chemical absorbance and noise are low, that is, finding an interference dominant region (IDR). Based on the definition of IDR, a two-step method was proposed to find the optimal IDR and the corresponding correction parameters estimated from IDR. Finally, the correction was performed to the full spectral range using previously obtained parameters for the calibration set and test set, respectively. The method can be applied to multi target systems with one IDR suitable for all targeted analytes. Tested on two benchmark data sets of near-infrared spectra, the performance of the proposed method provided considerable improvement compared with full spectral estimation methods and comparable with other state-of-art methods.

  12. Univariate and multivariate spatial models of health facility utilisation for childhood fevers in an area on the coast of Kenya.

    PubMed

    Ouma, Paul O; Agutu, Nathan O; Snow, Robert W; Noor, Abdisalan M

    2017-09-18

    Precise quantification of health service utilisation is important for the estimation of disease burden and allocation of health resources. Current approaches to mapping health facility utilisation rely on spatial accessibility alone as the predictor. However, other spatially varying social, demographic and economic factors may affect the use of health services. The exclusion of these factors can lead to the inaccurate estimation of health facility utilisation. Here, we compare the accuracy of a univariate spatial model, developed only from estimated travel time, to a multivariate model that also includes relevant social, demographic and economic factors. A theoretical surface of travel time to the nearest public health facility was developed. These were assigned to each child reported to have had fever in the Kenya demographic and health survey of 2014 (KDHS 2014). The relationship of child treatment seeking for fever with travel time, household and individual factors from the KDHS2014 were determined using multilevel mixed modelling. Bayesian information criterion (BIC) and likelihood ratio test (LRT) tests were carried out to measure how selected factors improve parsimony and goodness of fit of the time model. Using the mixed model, a univariate spatial model of health facility utilisation was fitted using travel time as the predictor. The mixed model was also used to compute a multivariate spatial model of utilisation, using travel time and modelled surfaces of selected household and individual factors as predictors. The univariate and multivariate spatial models were then compared using the receiver operating area under the curve (AUC) and a percent correct prediction (PCP) test. The best fitting multivariate model had travel time, household wealth index and number of children in household as the predictors. These factors reduced BIC of the time model from 4008 to 2959, a change which was confirmed by the LRT test. Although there was a high correlation of the two modelled probability surfaces (Adj R 2  = 88%), the multivariate model had better AUC compared to the univariate model; 0.83 versus 0.73 and PCP 0.61 versus 0.45 values. Our study shows that a model that uses travel time, as well as household and individual-level socio-demographic factors, results in a more accurate estimation of use of health facilities for the treatment of childhood fever, compared to one that relies on only travel time.

  13. A General Approach for Estimating Scale Score Reliability for Panel Survey Data

    ERIC Educational Resources Information Center

    Biemer, Paul P.; Christ, Sharon L.; Wiesen, Christopher A.

    2009-01-01

    Scale score measures are ubiquitous in the psychological literature and can be used as both dependent and independent variables in data analysis. Poor reliability of scale score measures leads to inflated standard errors and/or biased estimates, particularly in multivariate analysis. Reliability estimation is usually an integral step to assess…

  14. Methods for estimating confidence intervals in interrupted time series analyses of health interventions.

    PubMed

    Zhang, Fang; Wagner, Anita K; Soumerai, Stephen B; Ross-Degnan, Dennis

    2009-02-01

    Interrupted time series (ITS) is a strong quasi-experimental research design, which is increasingly applied to estimate the effects of health services and policy interventions. We describe and illustrate two methods for estimating confidence intervals (CIs) around absolute and relative changes in outcomes calculated from segmented regression parameter estimates. We used multivariate delta and bootstrapping methods (BMs) to construct CIs around relative changes in level and trend, and around absolute changes in outcome based on segmented linear regression analyses of time series data corrected for autocorrelated errors. Using previously published time series data, we estimated CIs around the effect of prescription alerts for interacting medications with warfarin on the rate of prescriptions per 10,000 warfarin users per month. Both the multivariate delta method (MDM) and the BM produced similar results. BM is preferred for calculating CIs of relative changes in outcomes of time series studies, because it does not require large sample sizes when parameter estimates are obtained correctly from the model. Caution is needed when sample size is small.

  15. Computation of nonlinear least squares estimator and maximum likelihood using principles in matrix calculus

    NASA Astrophysics Data System (ADS)

    Mahaboob, B.; Venkateswarlu, B.; Sankar, J. Ravi; Balasiddamuni, P.

    2017-11-01

    This paper uses matrix calculus techniques to obtain Nonlinear Least Squares Estimator (NLSE), Maximum Likelihood Estimator (MLE) and Linear Pseudo model for nonlinear regression model. David Pollard and Peter Radchenko [1] explained analytic techniques to compute the NLSE. However the present research paper introduces an innovative method to compute the NLSE using principles in multivariate calculus. This study is concerned with very new optimization techniques used to compute MLE and NLSE. Anh [2] derived NLSE and MLE of a heteroscedatistic regression model. Lemcoff [3] discussed a procedure to get linear pseudo model for nonlinear regression model. In this research article a new technique is developed to get the linear pseudo model for nonlinear regression model using multivariate calculus. The linear pseudo model of Edmond Malinvaud [4] has been explained in a very different way in this paper. David Pollard et.al used empirical process techniques to study the asymptotic of the LSE (Least-squares estimation) for the fitting of nonlinear regression function in 2006. In Jae Myung [13] provided a go conceptual for Maximum likelihood estimation in his work “Tutorial on maximum likelihood estimation

  16. Non-parametric directionality analysis - Extension for removal of a single common predictor and application to time series.

    PubMed

    Halliday, David M; Senik, Mohd Harizal; Stevenson, Carl W; Mason, Rob

    2016-08-01

    The ability to infer network structure from multivariate neuronal signals is central to computational neuroscience. Directed network analyses typically use parametric approaches based on auto-regressive (AR) models, where networks are constructed from estimates of AR model parameters. However, the validity of using low order AR models for neurophysiological signals has been questioned. A recent article introduced a non-parametric approach to estimate directionality in bivariate data, non-parametric approaches are free from concerns over model validity. We extend the non-parametric framework to include measures of directed conditional independence, using scalar measures that decompose the overall partial correlation coefficient summatively by direction, and a set of functions that decompose the partial coherence summatively by direction. A time domain partial correlation function allows both time and frequency views of the data to be constructed. The conditional independence estimates are conditioned on a single predictor. The framework is applied to simulated cortical neuron networks and mixtures of Gaussian time series data with known interactions. It is applied to experimental data consisting of local field potential recordings from bilateral hippocampus in anaesthetised rats. The framework offers a non-parametric approach to estimation of directed interactions in multivariate neuronal recordings, and increased flexibility in dealing with both spike train and time series data. The framework offers a novel alternative non-parametric approach to estimate directed interactions in multivariate neuronal recordings, and is applicable to spike train and time series data. Copyright © 2016 Elsevier B.V. All rights reserved.

  17. Nonprescription Antimicrobial Use in a Primary Care Population in the United States

    PubMed Central

    Zoorob, Roger; Nash, Susan; Trautner, Barbara W.

    2016-01-01

    Community antimicrobial resistance rates are high in communities with frequent use of nonprescription antibiotics. Studies addressing nonprescription antibiotic use in the United States have been restricted to Latin American immigrants. We estimated the prevalence of nonprescription antibiotic use in the previous 12 months as well as intended use (intention to use antibiotics without a prescription) and storage of antibiotics and examined patient characteristics associated with nonprescription use in a random sample of adults. We selected private and public primary care clinics that serve ethnically and socioeconomically diverse patients. Within the clinics, we used race/ethnicity-stratified systematic random sampling to choose a random sample of primary care patients. We used a self-administered standardized questionnaire on antibiotic use. Multivariate regression analysis was used to identify independent predictors of nonprescription use. The response rate was 94%. Of 400 respondents, 20 (5%) reported nonprescription use of systemic antibiotics in the last 12 months, 102 (25.4%) reported intended use, and 57 (14.2%) stored antibiotics at home. These rates were similar across race/ethnicity groups. Sources of antibiotics used without prescriptions or stored for future use were stores or pharmacies in the United States, “leftover” antibiotics from previous prescriptions, antibiotics obtained abroad, or antibiotics obtained from a relative or friend. Respiratory symptoms were common reasons for the use of nonprescription antibiotics. In multivariate analyses, public clinic patients, those with less education, and younger patients were more likely to endorse intended use. The problem of nonprescription use is not confined to Latino communities. Community antimicrobial stewardship must include a focus on nonprescription antibiotics. PMID:27401572

  18. Modeling Bivariate Longitudinal Hormone Profiles by Hierarchical State Space Models

    PubMed Central

    Liu, Ziyue; Cappola, Anne R.; Crofford, Leslie J.; Guo, Wensheng

    2013-01-01

    The hypothalamic-pituitary-adrenal (HPA) axis is crucial in coping with stress and maintaining homeostasis. Hormones produced by the HPA axis exhibit both complex univariate longitudinal profiles and complex relationships among different hormones. Consequently, modeling these multivariate longitudinal hormone profiles is a challenging task. In this paper, we propose a bivariate hierarchical state space model, in which each hormone profile is modeled by a hierarchical state space model, with both population-average and subject-specific components. The bivariate model is constructed by concatenating the univariate models based on the hypothesized relationship. Because of the flexible framework of state space form, the resultant models not only can handle complex individual profiles, but also can incorporate complex relationships between two hormones, including both concurrent and feedback relationship. Estimation and inference are based on marginal likelihood and posterior means and variances. Computationally efficient Kalman filtering and smoothing algorithms are used for implementation. Application of the proposed method to a study of chronic fatigue syndrome and fibromyalgia reveals that the relationships between adrenocorticotropic hormone and cortisol in the patient group are weaker than in healthy controls. PMID:24729646

  19. Modeling Bivariate Longitudinal Hormone Profiles by Hierarchical State Space Models.

    PubMed

    Liu, Ziyue; Cappola, Anne R; Crofford, Leslie J; Guo, Wensheng

    2014-01-01

    The hypothalamic-pituitary-adrenal (HPA) axis is crucial in coping with stress and maintaining homeostasis. Hormones produced by the HPA axis exhibit both complex univariate longitudinal profiles and complex relationships among different hormones. Consequently, modeling these multivariate longitudinal hormone profiles is a challenging task. In this paper, we propose a bivariate hierarchical state space model, in which each hormone profile is modeled by a hierarchical state space model, with both population-average and subject-specific components. The bivariate model is constructed by concatenating the univariate models based on the hypothesized relationship. Because of the flexible framework of state space form, the resultant models not only can handle complex individual profiles, but also can incorporate complex relationships between two hormones, including both concurrent and feedback relationship. Estimation and inference are based on marginal likelihood and posterior means and variances. Computationally efficient Kalman filtering and smoothing algorithms are used for implementation. Application of the proposed method to a study of chronic fatigue syndrome and fibromyalgia reveals that the relationships between adrenocorticotropic hormone and cortisol in the patient group are weaker than in healthy controls.

  20. Unreported workers’ compensation claims to the BLS Survey of Occupational Injuries and Illnesses: Establishment factors

    PubMed Central

    Adams, Darrin A.; Bonauto, David K.

    2016-01-01

    Background Studies suggest employers underreport injuries to the Bureau of Labor Statistics Survey of Occupational Injuries and Illnesses (SOII); less is known about reporting differences by establishment characteristics. Methods We linked SOII data to Washington State workers’ compensation claims data, using unemployment insurance data to improve linking accuracy. We used multivariable regression models to estimate incidence ratios (IR) of unreported workers’ compensation claims for establishment characteristics. Results An estimated 70% of workers’ compensation claims were reported in SOII. Claims among state and local government establishments were most likely to be reported. Compared to large manufacturing establishments, unreported claims were most common among small educational services establishments (IR = 2.47, 95%CI: 1.52–4.01) and large construction establishments (IR = 2.05, 95%CI: 1.77–2.37). Conclusions Underreporting of workers’ compensation claims to SOII varies by establishment characteristics, obscuring true differences in work injury incidence. Findings may differ from previous research due to differences in study methods. Am. J. Ind. Med. 59:274–289, 2016. © 2016 The Authors. American Journal of Industrial Medicine Published by Wiley Periodicals, Inc. PMID:26792563

  1. Precipitation estimation in mountainous terrain using multivariate geostatistics. Part II: isohyetal maps

    USGS Publications Warehouse

    Hevesi, Joseph A.; Flint, Alan L.; Istok, Jonathan D.

    1992-01-01

    Values of average annual precipitation (AAP) may be important for hydrologic characterization of a potential high-level nuclear-waste repository site at Yucca Mountain, Nevada. Reliable measurements of AAP are sparse in the vicinity of Yucca Mountain, and estimates of AAP were needed for an isohyetal mapping over a 2600-square-mile watershed containing Yucca Mountain. Estimates were obtained with a multivariate geostatistical model developed using AAP and elevation data from a network of 42 precipitation stations in southern Nevada and southeastern California. An additional 1531 elevations were obtained to improve estimation accuracy. Isohyets representing estimates obtained using univariate geostatistics (kriging) defined a smooth and continuous surface. Isohyets representing estimates obtained using multivariate geostatistics (cokriging) defined an irregular surface that more accurately represented expected local orographic influences on AAP. Cokriging results included a maximum estimate within the study area of 335 mm at an elevation of 7400 ft, an average estimate of 157 mm for the study area, and an average estimate of 172 mm at eight locations in the vicinity of the potential repository site. Kriging estimates tended to be lower in comparison because the increased AAP expected for remote mountainous topography was not adequately represented by the available sample. Regression results between cokriging estimates and elevation were similar to regression results between measured AAP and elevation. The position of the cokriging 250-mm isohyet relative to the boundaries of pinyon pine and juniper woodlands provided indirect evidence of improved estimation accuracy because the cokriging result agreed well with investigations by others concerning the relationship between elevation, vegetation, and climate in the Great Basin. Calculated estimation variances were also mapped and compared to evaluate improvements in estimation accuracy. Cokriging estimation variances were reduced by an average of 54% relative to kriging variances within the study area. Cokriging reduced estimation variances at the potential repository site by 55% relative to kriging. The usefulness of an existing network of stations for measuring AAP within the study area was evaluated using cokriging variances, and twenty additional stations were located for the purpose of improving the accuracy of future isohyetal mappings. Using the expanded network of stations, the maximum cokriging estimation variance within the study area was reduced by 78% relative to the existing network, and the average estimation variance was reduced by 52%.

  2. Old-Age Disability and Wealth among Return Mexican Migrants from the United States

    PubMed Central

    Wong, Rebeca; Gonzalez-Gonzalez, Cesar

    2012-01-01

    Objective To examine the old-age consequences of international migration with a focus on disability and wealth from the perspective of the origin country. Methods Analysis sample includes persons aged 60+ from the Mexican Health and Aging Study, a national survey of older-adults in Mexico in 2001. Univariate methods are used to present a comparative profile of return migrants. Multivariate models are estimated for physical disability and wealth. Results Gender differences are profound. Return migrant women are more likely to be disabled while men are wealthier than comparable older adults in Mexico. Discussion Compared to current older adults, younger cohorts of Mexico-U.S. migrants increasingly include women, and more migrants seem likely to remain in the United States rather than return, thus more research will be needed on the old-age conditions of migrants in both countries. PMID:20876848

  3. Estimating correlation between multivariate longitudinal data in the presence of heterogeneity.

    PubMed

    Gao, Feng; Philip Miller, J; Xiong, Chengjie; Luo, Jingqin; Beiser, Julia A; Chen, Ling; Gordon, Mae O

    2017-08-17

    Estimating correlation coefficients among outcomes is one of the most important analytical tasks in epidemiological and clinical research. Availability of multivariate longitudinal data presents a unique opportunity to assess joint evolution of outcomes over time. Bivariate linear mixed model (BLMM) provides a versatile tool with regard to assessing correlation. However, BLMMs often assume that all individuals are drawn from a single homogenous population where the individual trajectories are distributed smoothly around population average. Using longitudinal mean deviation (MD) and visual acuity (VA) from the Ocular Hypertension Treatment Study (OHTS), we demonstrated strategies to better understand the correlation between multivariate longitudinal data in the presence of potential heterogeneity. Conditional correlation (i.e., marginal correlation given random effects) was calculated to describe how the association between longitudinal outcomes evolved over time within specific subpopulation. The impact of heterogeneity on correlation was also assessed by simulated data. There was a significant positive correlation in both random intercepts (ρ = 0.278, 95% CI: 0.121-0.420) and random slopes (ρ = 0.579, 95% CI: 0.349-0.810) between longitudinal MD and VA, and the strength of correlation constantly increased over time. However, conditional correlation and simulation studies revealed that the correlation was induced primarily by participants with rapid deteriorating MD who only accounted for a small fraction of total samples. Conditional correlation given random effects provides a robust estimate to describe the correlation between multivariate longitudinal data in the presence of unobserved heterogeneity (NCT00000125).

  4. Evaluating the effectiveness of state specific lead-based paint hazard risk reduction laws in preventing recurring incidences of lead poisoning in children.

    PubMed

    Kennedy, Chinaro; Lordo, Robert; Sucosky, Marissa Scalia; Boehm, Rona; Brown, Mary Jean

    2016-01-01

    Despite significant progress made in recent decades in preventing childhood lead poisoning in the United States through the control or elimination of lead sources in the environment, it continues to be an issue in many communities, primarily in low-income communities with a large percentage of deteriorating housing built before the elimination of lead in residential paint. The purpose of this study is to determine whether state laws aimed at preventing childhood lead poisoning are also effective in preventing recurring lead poisoning among children previously poisoned. An evaluation was conducted to determine whether laws in two representative states, Massachusetts and Ohio, have been effective in preventing recurrent lead poisoning among children less than 72 months of age previously poisoned, compared to a representative state (Mississippi) which at the time of the study had yet to develop legislation to prevent childhood lead poisoning. Compared to no legislation, unadjusted estimates showed children less than 72 months old, living in Massachusetts, previously identified as being lead poisoned, were 73% less likely to develop recurrent lead poisoning. However, this statistically significant association did not remain after controlling for other confounding variables. We did not find such a significant association when analyzing data from Ohio. While findings from unadjusted estimates indicated that state lead laws such as those in Massachusetts may be effective at preventing recurrent lead poisoning among young children, small numbers may have attenuated the power to obtain statistical significance during multivariate analysis. Our findings did not provide evidence that state lead laws, such as those in Ohio, were effective in preventing recurrent lead poisoning among young children. Further studies may be needed to confirm these findings. Published by Elsevier GmbH.

  5. Evaluating the effectiveness of state specific lead-based paint hazard risk reduction laws in preventing recurring incidences of lead poisoning in children☆

    PubMed Central

    Kennedy, Chinaro; Lordo, Robert; Sucosky, Marissa Scalia; Boehm, Rona; Brown, Mary Jean

    2016-01-01

    Background Despite significant progress made in recent decades in preventing childhood lead poisoning in the United States through the control or elimination of lead sources in the environment, it continues to be an issue in many communities, primarily in low-income communities with a large percentage of deteriorating housing built before the elimination of lead in residential paint. The purpose of this study is to determine whether state laws aimed at preventing childhood lead poisoning are also effective in preventing recurring lead poisoning among children previously poisoned. Methods An evaluation was conducted to determine whether laws in two representative states, Massachusetts and Ohio, have been effective in preventing recurrent lead poisoning among children less than 72 months of age previously poisoned, compared to a representative state (Mississippi) which at the time of the study had yet to develop legislation to prevent childhood lead poisoning. Results Compared to no legislation, unadjusted estimates showed children less than 72 months old, living in Massachusetts, previously identified as being lead poisoned, were 73% less likely to develop recurrent lead poisoning. However, this statistically significant association did not remain after controlling for other confounding variables. We did not find such a significant association when analyzing data from Ohio. Conclusions While findings from unadjusted estimates indicated that state lead laws such as those in Massachusetts may be effective at preventing recurrent lead poisoning among young children, small numbers may have attenuated the power to obtain statistical significance during multivariate analysis. Our findings did not provide evidence that state lead laws, such as those in Ohio, were effective in preventing recurrent lead poisoning among young children. Further studies may be needed to confirm these findings. PMID:26472219

  6. A Hybrid Index for Characterizing Drought Based on a Nonparametric Kernel Estimator

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Huang, Shengzhi; Huang, Qiang; Leng, Guoyong

    This study develops a nonparametric multivariate drought index, namely, the Nonparametric Multivariate Standardized Drought Index (NMSDI), by considering the variations of both precipitation and streamflow. Building upon previous efforts in constructing Nonparametric Multivariate Drought Index, we use the nonparametric kernel estimator to derive the joint distribution of precipitation and streamflow, thus providing additional insights in drought index development. The proposed NMSDI are applied in the Wei River Basin (WRB), based on which the drought evolution characteristics are investigated. Results indicate: (1) generally, NMSDI captures the drought onset similar to Standardized Precipitation Index (SPI) and drought termination and persistence similar tomore » Standardized Streamflow Index (SSFI). The drought events identified by NMSDI match well with historical drought records in the WRB. The performances are also consistent with that by an existing Multivariate Standardized Drought Index (MSDI) at various timescales, confirming the validity of the newly constructed NMSDI in drought detections (2) An increasing risk of drought has been detected for the past decades, and will be persistent to a certain extent in future in most areas of the WRB; (3) the identified change points of annual NMSDI are mainly concentrated in the early 1970s and middle 1990s, coincident with extensive water use and soil reservation practices. This study highlights the nonparametric multivariable drought index, which can be used for drought detections and predictions efficiently and comprehensively.« less

  7. Receipt of HIV/STD prevention counseling by HIV-infected adults receiving medical care in the United States.

    PubMed

    Mizuno, Yuko; Zhu, Julia; Crepaz, Nicole; Beer, Linda; Purcell, David W; Johnson, Christopher H; Valverde, Eduardo E; Skarbinski, Jacek

    2014-01-28

    Guidelines recommend risk-reduction counseling by HIV providers to all HIV-infected persons. Among HIV-infected adults receiving medical care in the United States, we estimated prevalence of exposure to three types of HIV/sexually transmitted disease (STD) risk-reduction interventions and described the characteristics of persons who received these interventions. Data were from the Medical Monitoring Project (MMP), a supplemental HIV surveillance system designed to produce nationally representative estimates of behavioral and clinical characteristics of HIV-infected adults receiving medical care in the United States. Descriptive analyses were conducted to estimate the exposure to each type of HIV/STD risk-reduction intervention. Bivariate and multivariable analyses were conducted to assess associations between the selected correlates with each exposure variable. About 44% of participants reported a one-on-one conversation with a healthcare provider about HIV/STD prevention, 30% with a prevention program worker, 16% reported participation in a small group risk-reduction intervention, and 52% reported receiving at least one of the three interventions in the past 12 months. Minority race/ethnicity, low income, and risky sexual behavior consistently predicted greater intervention exposure. However, 39% of persons who reported risky sex did not receive any HIV/STD risk-reduction interventions. HIV-infected persons in care with fewer resources or those who engaged in risk behaviors were more likely to receive HIV/STD risk-reduction interventions. However, less than half of HIV-infected persons in care received HIV/STD prevention counseling from their provider, an intervention that has been shown to be effective and is supported by guidelines.

  8. The use of copulas to practical estimation of multivariate stochastic differential equation mixed effects models

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Rupšys, P.

    A system of stochastic differential equations (SDE) with mixed-effects parameters and multivariate normal copula density function were used to develop tree height model for Scots pine trees in Lithuania. A two-step maximum likelihood parameter estimation method is used and computational guidelines are given. After fitting the conditional probability density functions to outside bark diameter at breast height, and total tree height, a bivariate normal copula distribution model was constructed. Predictions from the mixed-effects parameters SDE tree height model calculated during this research were compared to the regression tree height equations. The results are implemented in the symbolic computational language MAPLE.

  9. Stability and Performance Robustness Assessment of Multivariable Control Systems

    DTIC Science & Technology

    1993-04-01

    00- STABILITY AND PERFORMANCE ROBUSTNESS ASSESSMENT OF MULTIVARIABLE CONTROL SYSTEMS Asok Ray , Jenny I. Shen, and Chen-Kuo Weng Mechanical...Office of Naval Research Assessment of Multivariable Control Systems Grant No. N00014-90-J- 1513 6. AUTHOR(S) (Extension) Professor Asok Ray , Dr...20 The Pennsylvania State University University Park, PA 16802 (20 for Professor Asok Ray ) Naval Postgraduate School

  10. A note on a simplified and general approach to simulating from multivariate copula functions

    Treesearch

    Barry K. Goodwin

    2013-01-01

    Copulas have become an important analytic tool for characterizing multivariate distributions and dependence. One is often interested in simulating data from copula estimates. The process can be analytically and computationally complex and usually involves steps that are unique to a given parametric copula. We describe an alternative approach that uses ‘Probability-...

  11. Multivariate time series analysis of neuroscience data: some challenges and opportunities.

    PubMed

    Pourahmadi, Mohsen; Noorbaloochi, Siamak

    2016-04-01

    Neuroimaging data may be viewed as high-dimensional multivariate time series, and analyzed using techniques from regression analysis, time series analysis and spatiotemporal analysis. We discuss issues related to data quality, model specification, estimation, interpretation, dimensionality and causality. Some recent research areas addressing aspects of some recurring challenges are introduced. Copyright © 2015 Elsevier Ltd. All rights reserved.

  12. Paternal smoking and spontaneous abortion: a population-based retrospective cohort study among non-smoking women aged 20-49 years in rural China.

    PubMed

    Wang, Long; Yang, Ying; Liu, Fangchao; Yang, Aimin; Xu, Qin; Wang, Qiaomei; Shen, Haiping; Zhang, Yiping; Yan, Donghai; Peng, Zuoqi; He, Yuan; Wang, Yuanyuan; Xu, Jihong; Zhao, Jun; Zhang, Hongguang; Zhang, Ya; Dai, Qiaoyun; Ma, Xu

    2018-06-11

    To comprehensively evaluate the association of paternal smoking and spontaneous abortion. We conducted a population-based retrospective cohort study among 5 770 691 non-smoking rural Chinese women, along with their husbands, participating in the National Free Pre-Pregnancy Checkups Project, regarding outcome events that occurred in 2010-2016. The main outcome was spontaneous abortion (SA). Multivariable logistic regression was used to estimate OR and 95% CI, and restricted cubic spline was used to estimate the non-linear relationship. The multivariable-adjusted OR of exposure to paternal smoking for SA was 1.17 (95% CI 1.16 to 1.19), compared with women without exposure to paternal smoking; and corresponding OR of exposure to preconception paternal smoking for SA was 1.11 (95% CI 1.08 to 1.14), compared with women without exposure to preconception paternal smoking. The ORs of preconception paternal smoking also increased with increases in paternal smoking (p nonlinear <0.05, almost linearly shaped) and preconception paternal smoking (p nonlinear >0.05). In addition, periconception paternal smoking cessation was associated with an 18% (15%-22%) lower risk of SA. Paternal smoking was associated with SA. The importance of tobacco control, specifically pertaining to paternal smoking, should be emphasised during preconception and pregnancy counselling. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2018. All rights reserved. No commercial use is permitted unless otherwise expressly granted.

  13. The choice of prior distribution for a covariance matrix in multivariate meta-analysis: a simulation study.

    PubMed

    Hurtado Rúa, Sandra M; Mazumdar, Madhu; Strawderman, Robert L

    2015-12-30

    Bayesian meta-analysis is an increasingly important component of clinical research, with multivariate meta-analysis a promising tool for studies with multiple endpoints. Model assumptions, including the choice of priors, are crucial aspects of multivariate Bayesian meta-analysis (MBMA) models. In a given model, two different prior distributions can lead to different inferences about a particular parameter. A simulation study was performed in which the impact of families of prior distributions for the covariance matrix of a multivariate normal random effects MBMA model was analyzed. Inferences about effect sizes were not particularly sensitive to prior choice, but the related covariance estimates were. A few families of prior distributions with small relative biases, tight mean squared errors, and close to nominal coverage for the effect size estimates were identified. Our results demonstrate the need for sensitivity analysis and suggest some guidelines for choosing prior distributions in this class of problems. The MBMA models proposed here are illustrated in a small meta-analysis example from the periodontal field and a medium meta-analysis from the study of stroke. Copyright © 2015 John Wiley & Sons, Ltd. Copyright © 2015 John Wiley & Sons, Ltd.

  14. Dynamics of aquatic ecosystems and models under toxicant stress: State space analysis, covariance structure, and ecological risk

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Johnson, A.R.; Bartell, S.M.

    1988-06-01

    The state of an ecosystem at any time t may be characterized by a multidimensional state vector x(t). Changes in state are represented by the trajectory traced out by x(t) over time. The effects of toxicant stress are summarized by the displacement of a perturbed state vector, x/sub p/(t), relative to an appropriate control, x/sub c/(t). Within a multivariate statistical framework, the response of an ecosystem to perturbation is conveniently quantified by the distance separating x/sub p/(t) from x/sub c/(t) as measured by a Mahalanobis metric. Use of the Mahalanobis metric requires that the covariance matrix associated with the controlmore » state vector be estimated. State space displacement analysis was applied to data on the response of aquatic microcosms and outdoor ponds to alkylphenols. Dose-response relationships were derived using calculated state space separations as integrated measures of the ecological effects of toxicant exposure. Inspection of the data also revealed that the covariance structure varied both with time and with toxicant exposure, suggesting that analysis of such changes might be a useful tool for probing control mechanisms underlying ecosystem dynamics. 90 refs., 53 figs., 9 tabs.« less

  15. Improving the accuracy of hyaluronic acid molecular weight estimation by conventional size exclusion chromatography.

    PubMed

    Shanmuga Doss, Sreeja; Bhatt, Nirav Pravinbhai; Jayaraman, Guhan

    2017-08-15

    There is an unreasonably high variation in the literature reports on molecular weight of hyaluronic acid (HA) estimated using conventional size exclusion chromatography (SEC). This variation is most likely due to errors in estimation. Working with commercially available HA molecular weight standards, this work examines the extent of error in molecular weight estimation due to two factors: use of non-HA based calibration and concentration of sample injected into the SEC column. We develop a multivariate regression correlation to correct for concentration effect. Our analysis showed that, SEC calibration based on non-HA standards like polyethylene oxide and pullulan led to approximately 2 and 10 times overestimation, respectively, when compared to HA-based calibration. Further, we found that injected sample concentration has an effect on molecular weight estimation. Even at 1g/l injected sample concentration, HA molecular weight standards of 0.7 and 1.64MDa showed appreciable underestimation of 11-24%. The multivariate correlation developed was found to reduce error in estimations at 1g/l to <4%. The correlation was also successfully applied to accurately estimate the molecular weight of HA produced by a recombinant Lactococcus lactis fermentation. Copyright © 2017 Elsevier B.V. All rights reserved.

  16. Estimating Contraceptive Prevalence Using Logistics Data for Short-Acting Methods: Analysis Across 30 Countries.

    PubMed

    Cunningham, Marc; Bock, Ariella; Brown, Niquelle; Sacher, Suzy; Hatch, Benjamin; Inglis, Andrew; Aronovich, Dana

    2015-09-01

    Contraceptive prevalence rate (CPR) is a vital indicator used by country governments, international donors, and other stakeholders for measuring progress in family planning programs against country targets and global initiatives as well as for estimating health outcomes. Because of the need for more frequent CPR estimates than population-based surveys currently provide, alternative approaches for estimating CPRs are being explored, including using contraceptive logistics data. Using data from the Demographic and Health Surveys (DHS) in 30 countries, population data from the United States Census Bureau International Database, and logistics data from the Procurement Planning and Monitoring Report (PPMR) and the Pipeline Monitoring and Procurement Planning System (PipeLine), we developed and evaluated 3 models to generate country-level, public-sector contraceptive prevalence estimates for injectable contraceptives, oral contraceptives, and male condoms. Models included: direct estimation through existing couple-years of protection (CYP) conversion factors, bivariate linear regression, and multivariate linear regression. Model evaluation consisted of comparing the referent DHS prevalence rates for each short-acting method with the model-generated prevalence rate using multiple metrics, including mean absolute error and proportion of countries where the modeled prevalence rate for each method was within 1, 2, or 5 percentage points of the DHS referent value. For the methods studied, family planning use estimates from public-sector logistics data were correlated with those from the DHS, validating the quality and accuracy of current public-sector logistics data. Logistics data for oral and injectable contraceptives were significantly associated (P<.05) with the referent DHS values for both bivariate and multivariate models. For condoms, however, that association was only significant for the bivariate model. With the exception of the CYP-based model for condoms, models were able to estimate public-sector prevalence rates for each short-acting method to within 2 percentage points in at least 85% of countries. Public-sector contraceptive logistics data are strongly correlated with public-sector prevalence rates for short-acting methods, demonstrating the quality of current logistics data and their ability to provide relatively accurate prevalence estimates. The models provide a starting point for generating interim estimates of contraceptive use when timely survey data are unavailable. All models except the condoms CYP model performed well; the regression models were most accurate but the CYP model offers the simplest calculation method. Future work extending the research to other modern methods, relating subnational logistics data with prevalence rates, and tracking that relationship over time is needed. © Cunningham et al.

  17. Estimating Contraceptive Prevalence Using Logistics Data for Short-Acting Methods: Analysis Across 30 Countries

    PubMed Central

    Cunningham, Marc; Brown, Niquelle; Sacher, Suzy; Hatch, Benjamin; Inglis, Andrew; Aronovich, Dana

    2015-01-01

    Background: Contraceptive prevalence rate (CPR) is a vital indicator used by country governments, international donors, and other stakeholders for measuring progress in family planning programs against country targets and global initiatives as well as for estimating health outcomes. Because of the need for more frequent CPR estimates than population-based surveys currently provide, alternative approaches for estimating CPRs are being explored, including using contraceptive logistics data. Methods: Using data from the Demographic and Health Surveys (DHS) in 30 countries, population data from the United States Census Bureau International Database, and logistics data from the Procurement Planning and Monitoring Report (PPMR) and the Pipeline Monitoring and Procurement Planning System (PipeLine), we developed and evaluated 3 models to generate country-level, public-sector contraceptive prevalence estimates for injectable contraceptives, oral contraceptives, and male condoms. Models included: direct estimation through existing couple-years of protection (CYP) conversion factors, bivariate linear regression, and multivariate linear regression. Model evaluation consisted of comparing the referent DHS prevalence rates for each short-acting method with the model-generated prevalence rate using multiple metrics, including mean absolute error and proportion of countries where the modeled prevalence rate for each method was within 1, 2, or 5 percentage points of the DHS referent value. Results: For the methods studied, family planning use estimates from public-sector logistics data were correlated with those from the DHS, validating the quality and accuracy of current public-sector logistics data. Logistics data for oral and injectable contraceptives were significantly associated (P<.05) with the referent DHS values for both bivariate and multivariate models. For condoms, however, that association was only significant for the bivariate model. With the exception of the CYP-based model for condoms, models were able to estimate public-sector prevalence rates for each short-acting method to within 2 percentage points in at least 85% of countries. Conclusions: Public-sector contraceptive logistics data are strongly correlated with public-sector prevalence rates for short-acting methods, demonstrating the quality of current logistics data and their ability to provide relatively accurate prevalence estimates. The models provide a starting point for generating interim estimates of contraceptive use when timely survey data are unavailable. All models except the condoms CYP model performed well; the regression models were most accurate but the CYP model offers the simplest calculation method. Future work extending the research to other modern methods, relating subnational logistics data with prevalence rates, and tracking that relationship over time is needed. PMID:26374805

  18. Work-related injuries among commercial janitors in Washington State, comparisons by gender.

    PubMed

    Smith, Caroline K; Anderson, Naomi J

    2017-09-01

    We analyzed workers' compensation (WC) data to identify characteristics related to workers' compensation claim outcomes among janitorial service workers in Washington State. We analyzed WC data from the Washington State Department of Labor & Industries (L&I) State Fund (SF) from January 1, 2003 through December 31, 2013, for janitorial service workers employed in the National Occupational Research Agenda (NORA) Services Sector. We constructed multivariable models to identify factors associated with higher medical costs and increased time lost from work. There were 2,390 janitorial service compensable claims available for analysis. There were significant differences in injury type and other factors by gender, age, and language preference. Linguistic minority status was associated with longer time loss and higher median medical costs. Women were estimated to account for 35% of janitorial service workers but made up 55% of the compensable claims in this study. Janitorial service workers comprise a large vulnerable occupational group in the U.S. workforce. Identifying differences by injury type and potential inequitable outcomes by gender and language is important to ensuring equal treatment in the workers' compensation process. There were significant differences in injury and individual characteristics between men and women in this study. Women had twice the estimated rate of injury to men, and were more likely to require Spanish language materials. Improving communication for training and knowledge about the workers' compensation system appear to be high priorities in this population of injured janitorial service workers. Copyright © 2017 Elsevier Ltd and National Safety Council. All rights reserved.

  19. Vehicle Sprung Mass Estimation for Rough Terrain

    DTIC Science & Technology

    2011-03-01

    distributions are greater than zero. The multivariate polynomials are functions of the Legendre polynomials (Poularikas (1999...developed methods based on polynomial chaos theory and on the maximum likelihood approach to estimate the most likely value of the vehicle sprung...mass. The polynomial chaos estimator is compared to benchmark algorithms including recursive least squares, recursive total least squares, extended

  20. Multivariate estimation of the limit of detection by orthogonal partial least squares in temperature-modulated MOX sensors.

    PubMed

    Burgués, Javier; Marco, Santiago

    2018-08-17

    Metal oxide semiconductor (MOX) sensors are usually temperature-modulated and calibrated with multivariate models such as partial least squares (PLS) to increase the inherent low selectivity of this technology. The multivariate sensor response patterns exhibit heteroscedastic and correlated noise, which suggests that maximum likelihood methods should outperform PLS. One contribution of this paper is the comparison between PLS and maximum likelihood principal components regression (MLPCR) in MOX sensors. PLS is often criticized by the lack of interpretability when the model complexity increases beyond the chemical rank of the problem. This happens in MOX sensors due to cross-sensitivities to interferences, such as temperature or humidity and non-linearity. Additionally, the estimation of fundamental figures of merit, such as the limit of detection (LOD), is still not standardized in multivariate models. Orthogonalization methods, such as orthogonal projection to latent structures (O-PLS), have been successfully applied in other fields to reduce the complexity of PLS models. In this work, we propose a LOD estimation method based on applying the well-accepted univariate LOD formulas to the scores of the first component of an orthogonal PLS model. The resulting LOD is compared to the multivariate LOD range derived from error-propagation. The methodology is applied to data extracted from temperature-modulated MOX sensors (FIS SB-500-12 and Figaro TGS 3870-A04), aiming at the detection of low concentrations of carbon monoxide in the presence of uncontrolled humidity (chemical noise). We found that PLS models were simpler and more accurate than MLPCR models. Average LOD values of 0.79 ppm (FIS) and 1.06 ppm (Figaro) were found using the approach described in this paper. These values were contained within the LOD ranges obtained with the error-propagation approach. The mean LOD increased to 1.13 ppm (FIS) and 1.59 ppm (Figaro) when considering validation samples collected two weeks after calibration, which represents a 43% and 46% degradation, respectively. The orthogonal score-plot was a very convenient tool to visualize MOX sensor data and to validate the LOD estimates. Copyright © 2018 Elsevier B.V. All rights reserved.

  1. Regional regression models of watershed suspended-sediment discharge for the eastern United States

    NASA Astrophysics Data System (ADS)

    Roman, David C.; Vogel, Richard M.; Schwarz, Gregory E.

    2012-11-01

    SummaryEstimates of mean annual watershed sediment discharge, derived from long-term measurements of suspended-sediment concentration and streamflow, often are not available at locations of interest. The goal of this study was to develop multivariate regression models to enable prediction of mean annual suspended-sediment discharge from available basin characteristics useful for most ungaged river locations in the eastern United States. The models are based on long-term mean sediment discharge estimates and explanatory variables obtained from a combined dataset of 1201 US Geological Survey (USGS) stations derived from a SPAtially Referenced Regression on Watershed attributes (SPARROW) study and the Geospatial Attributes of Gages for Evaluating Streamflow (GAGES) database. The resulting regional regression models summarized for major US water resources regions 1-8, exhibited prediction R2 values ranging from 76.9% to 92.7% and corresponding average model prediction errors ranging from 56.5% to 124.3%. Results from cross-validation experiments suggest that a majority of the models will perform similarly to calibration runs. The 36-parameter regional regression models also outperformed a 16-parameter national SPARROW model of suspended-sediment discharge and indicate that mean annual sediment loads in the eastern United States generally correlates with a combination of basin area, land use patterns, seasonal precipitation, soil composition, hydrologic modification, and to a lesser extent, topography.

  2. Regional regression models of watershed suspended-sediment discharge for the eastern United States

    USGS Publications Warehouse

    Roman, David C.; Vogel, Richard M.; Schwarz, Gregory E.

    2012-01-01

    Estimates of mean annual watershed sediment discharge, derived from long-term measurements of suspended-sediment concentration and streamflow, often are not available at locations of interest. The goal of this study was to develop multivariate regression models to enable prediction of mean annual suspended-sediment discharge from available basin characteristics useful for most ungaged river locations in the eastern United States. The models are based on long-term mean sediment discharge estimates and explanatory variables obtained from a combined dataset of 1201 US Geological Survey (USGS) stations derived from a SPAtially Referenced Regression on Watershed attributes (SPARROW) study and the Geospatial Attributes of Gages for Evaluating Streamflow (GAGES) database. The resulting regional regression models summarized for major US water resources regions 1–8, exhibited prediction R2 values ranging from 76.9% to 92.7% and corresponding average model prediction errors ranging from 56.5% to 124.3%. Results from cross-validation experiments suggest that a majority of the models will perform similarly to calibration runs. The 36-parameter regional regression models also outperformed a 16-parameter national SPARROW model of suspended-sediment discharge and indicate that mean annual sediment loads in the eastern United States generally correlates with a combination of basin area, land use patterns, seasonal precipitation, soil composition, hydrologic modification, and to a lesser extent, topography.

  3. The Association between State Policy Environments and Self-Rated Health Disparities for Sexual Minorities in the United States.

    PubMed

    Gonzales, Gilbert; Ehrenfeld, Jesse M

    2018-06-01

    A large body of research has documented disparities in health and access to care for lesbian, gay, and bisexual (LGB) people in the United States. Less research has examined how the level of legal protection afforded to LGB people (the state policy environment) affects health disparities for sexual minorities. This study used data on 14,687 sexual minority adults and 490,071 heterosexual adults from the 2014⁻2016 Behavioral Risk Factor Surveillance System to document differences in health. Unadjusted state-specific prevalence estimates and multivariable logistic regression models were used to compare poor/fair self-rated health by gender, sexual minority status, and state policy environments (comprehensive versus limited protections for LGB people). We found disparities in self-rated health between sexual minority adults and heterosexual adults in most states. On average, sexual minority men in states with limited protections and sexual minority women in states with either comprehensive or limited protections were more likely to report poor/fair self-rated health compared to their heterosexual counterparts. This study adds new findings on the association between state policy environments and self-rated health for sexual minorities and suggests differences in this relationship by gender. The associations and impacts of state-specific policies affecting LGB populations may vary by gender, as well as other intersectional identities.

  4. Retinal vessel diameter and estimated cerebrospinal fluid pressure in arterial hypertension: the Beijing Eye Study.

    PubMed

    Jonas, Jost B; Wang, Ningli; Wang, Shuang; Wang, Ya Xing; You, Qi Sheng; Yang, Diya; Wei, Wen Bin; Xu, Liang

    2014-09-01

    Hypertensive retinal microvascular abnormalities include an increased retinal vein-to-artery diameter ratio. Because central retinal vein pressure depends on cerebrospinal fluid pressure (CSFP), we examined whether the retinal vein-to-artery diameter ratio and other retinal hypertensive signs are associated with CSFP. Participants of the population-based Beijing Eye Study (n = 1,574 subjects) underwent measurement of the temporal inferior and superior retinal artery and vein diameter. CSFP was calculated as 0.44 × body mass index (kg/m(2)) + 0.16 × diastolic blood pressure (mm Hg) - 0.18 × age (years) - 1.91. Larger retinal vein diameters and higher vein-to-artery diameter ratios were significantly associated with higher estimated CSFP (P = 0.001) in multivariable analysis. In contrast, temporal inferior retinal arterial diameter was marginally associated (P = 0.03) with estimated CSFP, and temporal superior artery diameter was not significantly associated (P = 0.10) with estimated CSFP; other microvascular abnormalities, such as arteriovenous crossing signs, were also not significantly associated with estimated CSFP. In a reverse manner, higher estimated CSFP as a dependent variable in the multivariable analysis was associated with wider retinal veins and higher vein-to-artery diameter ratio. In the same model, estimated CSFP was not significantly correlated with retinal artery diameters or other retinal microvascular abnormalities. Correspondingly, arterial hypertension was associated with retinal microvascular abnormalities such as arteriovenous crossing signs (P = 0.003), thinner temporal retinal arteries (P < 0.001), higher CSFP (P < 0.001), and wider retinal veins (P = 0.001) or, as a corollary, with a higher vein-to-artery diameter ratio in multivariable analysis. Wider retinal vein diameters are associated with higher estimated CSFP and vice versa. In arterial hypertension, an increased retinal vein-to-artery diameter ratio depends on elevated CSFP, which is correlated with blood pressure. © American Journal of Hypertension, Ltd 2014. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  5. Geographic Variation in Characteristics of Postpartum Women Using Female Sterilization.

    PubMed

    White, Kari; Potter, Joseph E; Zite, Nikki

    2015-01-01

    Southern states have higher rates of female sterilization compared with other areas of the United States, and the reasons for this are not well understood. We examined whether low-income and racial/ethnic minority women, who were previous targets of coercive practices, disproportionately report using sterilization in the South. We used data from 12 states participating in the Pregnancy Risk Assessment Monitoring System that collected information on women's contraceptive method use between 2006 and 2009. We categorized states according to geographic region: South, Midwest/West, and Northeast. Within each region, we computed the percentage of women using sterilization according to their demographic and obstetric characteristics and estimated multivariable-adjusted prevalence ratios to evaluate whether the same characteristics were associated with sterilization use. The percentage of postpartum women using sterilization ranged from 5.0% to 9.9% in the Northeast, 8.9% to 10.6% in the Midwest/West, and 11.6% to 22.4% in the South. Women in nearly all subgroups in Southern states were more likely to use sterilization than women in the Northeast. After multivariable adjustment, there were no differences in the prevalence of sterilization for Blacks compared with Whites in the Northeast (0.76; 95% CI, 0.55-1.06), Midwest/West (0.91; 95% CI, 0.80-1.04), and South (0.96; 95% CI, 0.85-1.07). Women with Medicaid-paid deliveries (vs. private insurance) had a higher prevalence of sterilization in all regions (p < .05). These findings do not indicate that low-income and racial/ethnic minority women in the South use sterilization at disproportionately higher rates compared with other regions, and suggest that other differences, such as social norms and family planning policies, may contribute to this geographic variation. Copyright © 2015 Jacobs Institute of Women's Health. Published by Elsevier Inc. All rights reserved.

  6. Clinical impact of altered T-cell homeostasis in treated HIV patients enrolled in a large observational cohort.

    PubMed

    Ndumbi, Patricia; Gillis, Jennifer; Raboud, Janet M; Cooper, Curtis; Hogg, Robert S; Montaner, Julio S G; Burchell, Ann N; Loutfy, Mona R; Machouf, Nima; Klein, Marina B; Tsoukas, Chris M

    2013-11-28

    We investigated the probability of transitioning in or out of the CD3⁺ T-cell homeostatic range during antiretroviral therapy, and we assessed the clinical impact of lost T-cell homeostasis (TCH) on AIDS-defining illnesses (ADIs) or death. Within the Canadian Observational Cohort (CANOC), we studied 4463 antiretroviral therapy (ART)-naive HIV-positive patients initiating combination ART (cART) between 2000 and 2010. CD3⁺ trajectories were estimated using a four state Markov model. CD3⁺ T-cel percentage states were classified as follows: very low (<50%), low (50-64%), normal (65-85%), and high (>85%). Covariates associated with transitioning between states were examined. The association between CD3⁺ T-cell percentage states and time to ADI/death from cART initiation was determined using Cox proportional hazards models. A total of 4463 patients were followed for a median of 3 years. Two thousand, five hundred and eight (56%) patients never transitioned from their baseline CD3⁺ T-cell percentage state; 85% of these had normal TCH. In multivariable analysis, individuals with time-updated low CD4⁺ cell count, time-updated detectable viral load, older age, and hepatitis C virus (HCV) coinfection were less likely to maintain TCH. In the multivariable proportional hazards model, both very low and high CD3⁺ T-cell percentages were associated with increased risk of ADI/death [adjusted hazard ratio=1.91 (95% confidence interval, CI: 1.27-2.89) and hazard ratio=1.49 (95% CI: 1.13-1.96), respectively]. Patients with very low or high CD3⁺ T-cell percentages are at risk for ADIs/death. To our knowledge, this is the first study linking altered TCH and morbidity/mortality in cART-treated HIV-positive patients.

  7. Accounting for multiple climate components when estimating climate change exposure and velocity

    USGS Publications Warehouse

    Nadeau, Christopher P.; Fuller, Angela K.

    2015-01-01

    The effect of anthropogenic climate change on organisms will likely be related to climate change exposure and velocity at local and regional scales. However, common methods to estimate climate change exposure and velocity ignore important components of climate that are known to affect the ecology and evolution of organisms.We develop a novel index of climate change (climate overlap) that simultaneously estimates changes in the means, variation and correlation between multiple weather variables. Specifically, we estimate the overlap between multivariate normal probability distributions representing historical and current or projected future climates. We provide methods for estimating the statistical significance of climate overlap values and methods to estimate velocity using climate overlap.We show that climates have changed significantly across 80% of the continental United States in the last 32 years and that much of this change is due to changes in the variation and correlation between weather variables (two statistics that are rarely incorporated into climate change studies). We also show that projected future temperatures are predicted to be locally novel (<1·5% overlap) across most of the global land surface and that exposure is likely to be highest in areas with low historical climate variation. Last, we show that accounting for changes in the variation and correlation between multiple weather variables can dramatically affect velocity estimates; mean velocity estimates in the continental United States were between 3·1 and 19·0 km yr−1when estimated using climate overlap compared to 1·4 km yr−1 when estimated using traditional methods.Our results suggest that accounting for changes in the means, variation and correlation between multiple weather variables can dramatically affect estimates of climate change exposure and velocity. These climate components are known to affect the ecology and evolution of organisms, but are ignored by most measures of climate change. We conclude with a set of future directions and recommend future work to determine which measures of climate change exposure and velocity are most related to biological responses to climate change.

  8. On Restructurable Control System Theory

    NASA Technical Reports Server (NTRS)

    Athans, M.

    1983-01-01

    The state of stochastic system and control theory as it impacts restructurable control issues is addressed. The multivariable characteristics of the control problem are addressed. The failure detection/identification problem is discussed as a multi-hypothesis testing problem. Control strategy reconfiguration, static multivariable controls, static failure hypothesis testing, dynamic multivariable controls, fault-tolerant control theory, dynamic hypothesis testing, generalized likelihood ratio (GLR) methods, and adaptive control are discussed.

  9. A Comparison of Pseudo-Maximum Likelihood and Asymptotically Distribution-Free Dynamic Factor Analysis Parameter Estimation in Fitting Covariance Structure Models to Block-Toeplitz Matrices Representing Single-Subject Multivariate Time-Series.

    ERIC Educational Resources Information Center

    Molenaar, Peter C. M.; Nesselroade, John R.

    1998-01-01

    Pseudo-Maximum Likelihood (p-ML) and Asymptotically Distribution Free (ADF) estimation methods for estimating dynamic factor model parameters within a covariance structure framework were compared through a Monte Carlo simulation. Both methods appear to give consistent model parameter estimates, but only ADF gives standard errors and chi-square…

  10. Lessons Learned from Assimilating Altimeter Data into a Coupled General Circulation Model with the GMAO Augmented Ensemble Kalman Filter

    NASA Technical Reports Server (NTRS)

    Keppenne, Christian; Vernieres, Guillaume; Rienecker, Michele; Jacob, Jossy; Kovach, Robin

    2011-01-01

    Satellite altimetry measurements have provided global, evenly distributed observations of the ocean surface since 1993. However, the difficulties introduced by the presence of model biases and the requirement that data assimilation systems extrapolate the sea surface height (SSH) information to the subsurface in order to estimate the temperature, salinity and currents make it difficult to optimally exploit these measurements. This talk investigates the potential of the altimetry data assimilation once the biases are accounted for with an ad hoc bias estimation scheme. Either steady-state or state-dependent multivariate background-error covariances from an ensemble of model integrations are used to address the problem of extrapolating the information to the sub-surface. The GMAO ocean data assimilation system applied to an ensemble of coupled model instances using the GEOS-5 AGCM coupled to MOM4 is used in the investigation. To model the background error covariances, the system relies on a hybrid ensemble approach in which a small number of dynamically evolved model trajectories is augmented on the one hand with past instances of the state vector along each trajectory and, on the other, with a steady state ensemble of error estimates from a time series of short-term model forecasts. A state-dependent adaptive error-covariance localization and inflation algorithm controls how the SSH information is extrapolated to the sub-surface. A two-step predictor corrector approach is used to assimilate future information. Independent (not-assimilated) temperature and salinity observations from Argo floats are used to validate the assimilation. A two-step projection method in which the system first calculates a SSH increment and then projects this increment vertically onto the temperature, salt and current fields is found to be most effective in reconstructing the sub-surface information. The performance of the system in reconstructing the sub-surface fields is particularly impressive for temperature, but not as satisfactory for salt.

  11. Bayesian switching factor analysis for estimating time-varying functional connectivity in fMRI.

    PubMed

    Taghia, Jalil; Ryali, Srikanth; Chen, Tianwen; Supekar, Kaustubh; Cai, Weidong; Menon, Vinod

    2017-07-15

    There is growing interest in understanding the dynamical properties of functional interactions between distributed brain regions. However, robust estimation of temporal dynamics from functional magnetic resonance imaging (fMRI) data remains challenging due to limitations in extant multivariate methods for modeling time-varying functional interactions between multiple brain areas. Here, we develop a Bayesian generative model for fMRI time-series within the framework of hidden Markov models (HMMs). The model is a dynamic variant of the static factor analysis model (Ghahramani and Beal, 2000). We refer to this model as Bayesian switching factor analysis (BSFA) as it integrates factor analysis into a generative HMM in a unified Bayesian framework. In BSFA, brain dynamic functional networks are represented by latent states which are learnt from the data. Crucially, BSFA is a generative model which estimates the temporal evolution of brain states and transition probabilities between states as a function of time. An attractive feature of BSFA is the automatic determination of the number of latent states via Bayesian model selection arising from penalization of excessively complex models. Key features of BSFA are validated using extensive simulations on carefully designed synthetic data. We further validate BSFA using fingerprint analysis of multisession resting-state fMRI data from the Human Connectome Project (HCP). Our results show that modeling temporal dependencies in the generative model of BSFA results in improved fingerprinting of individual participants. Finally, we apply BSFA to elucidate the dynamic functional organization of the salience, central-executive, and default mode networks-three core neurocognitive systems with central role in cognitive and affective information processing (Menon, 2011). Across two HCP sessions, we demonstrate a high level of dynamic interactions between these networks and determine that the salience network has the highest temporal flexibility among the three networks. Our proposed methods provide a novel and powerful generative model for investigating dynamic brain connectivity. Copyright © 2017 Elsevier Inc. All rights reserved.

  12. Perceived insufficient rest or sleep among veterans: Behavioral Risk Factor Surveillance System 2009.

    PubMed

    Faestel, Paul M; Littell, Christopher T; Vitiello, Michael V; Forsberg, Christopher W; Littman, Alyson J

    2013-06-15

    Sleep problems are of particular concern among the active duty military population as factors such as inconsistent work hours and deployment may compromise adequate sleep and adversely impact performance. However, few prior studies have investigated whether the prevalence of sleep problems differ between Veterans and demographically similar non-Veterans. The purpose of this study is to investigate whether self-reported insufficient rest or sleep varies in relation to Veteran status and to identify high-risk groups of Veterans. This study used data from the 2009 Behavioral Risk Factor Surveillance System (analyzed in 2011), a state based national telephone survey of non-institutionalized US adults. Insufficient rest was assessed in 411,313 adults aged 21 and older, of whom 55,361 were Veterans. Sleep duration was assessed in 6 states (n = 4,936 Veterans and 30,983 non-Veterans). Model-based direct rate adjustment was used to estimate the prevalence of insufficient rest or sleep while controlling for confounding. Multivariable logistic regression was used to estimate odds ratios of insufficient sleep or rest in subgroups of Veterans. After multivariable adjustment, insufficient rest or sleep (22.7% vs. 21.1%, p < 0.001) and short sleep duration (< 7 h/night, 34.9% vs. 31.3%, p = 0.026) were more common among Veterans than non-Veterans. When the Veteran group was further divided among newly transitioned (≤ 12 months) and longer-term Veterans (> 12 months), the overall test for a difference was not statistically significant between groups, mainly because there was little difference in sleep between the two groups of Veterans. High-risk Veteran subgroups included those who were 21-44 years of age (vs. 65-74), women, non-whites, current smokers, obese, unable to work, and those in poor health. This study suggests that Veterans have a high burden of sleep problems and identifies subgroups that should be targeted to receive interventions and enhanced education regarding insufficient sleep.

  13. Time Series Model Identification by Estimating Information.

    DTIC Science & Technology

    1982-11-01

    principle, Applications of Statistics, P. R. Krishnaiah , ed., North-Holland: Amsterdam, 27-41. Anderson, T. W. (1971). The Statistical Analysis of Time Series...E. (1969). Multiple Time Series Modeling, Multivariate Analysis II, edited by P. Krishnaiah , Academic Press: New York, 389-409. Parzen, E. (1981...Newton, H. J. (1980). Multiple Time Series Modeling, II Multivariate Analysis - V, edited by P. Krishnaiah , North Holland: Amsterdam, 181-197. Shibata, R

  14. Assessing the response of area burned to changing climate in western boreal North America using a Multivariate Adaptive Regression Splines (MARS) approach

    Treesearch

    Michael S. Balshi; A. David McGuire; Paul Duffy; Mike Flannigan; John Walsh; Jerry Melillo

    2009-01-01

    We developed temporally and spatially explicit relationships between air temperature and fuel moisture codes derived from the Canadian Fire Weather Index System to estimate annual area burned at 2.5o (latitude x longitude) resolution using a Multivariate Adaptive Regression Spline (MARS) approach across Alaska and Canada. Burned area was...

  15. Transient multivariable sensor evaluation

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Vilim, Richard B.; Heifetz, Alexander

    A method and system for performing transient multivariable sensor evaluation. The method and system includes a computer system for identifying a model form, providing training measurement data, generating a basis vector, monitoring system data from sensor, loading the system data in a non-transient memory, performing an estimation to provide desired data and comparing the system data to the desired data and outputting an alarm for a defective sensor.

  16. Determining the Relationship Between Moral Waivers and Marine Corps Unsuitability Attrition

    DTIC Science & Technology

    2008-03-01

    observed characteristics. However, econometric research indicates that the magnitude of interaction effects estimated via probit or logit models may...1997 to 2005. Multivariate probit models were used to analyze the effects of moral waivers on unsatisfactory service separations. 15. NUMBER OF...files from fiscal years 1997 to 2005. Multivariate probit models were used to analyze the effects of moral waivers on unsatisfactory service

  17. A New Approach of Juvenile Age Estimation using Measurements of the Ilium and Multivariate Adaptive Regression Splines (MARS) Models for Better Age Prediction.

    PubMed

    Corron, Louise; Marchal, François; Condemi, Silvana; Chaumoître, Kathia; Adalian, Pascal

    2017-01-01

    Juvenile age estimation methods used in forensic anthropology generally lack methodological consistency and/or statistical validity. Considering this, a standard approach using nonparametric Multivariate Adaptive Regression Splines (MARS) models were tested to predict age from iliac biometric variables of male and female juveniles from Marseilles, France, aged 0-12 years. Models using unidimensional (length and width) and bidimensional iliac data (module and surface) were constructed on a training sample of 176 individuals and validated on an independent test sample of 68 individuals. Results show that MARS prediction models using iliac width, module and area give overall better and statistically valid age estimates. These models integrate punctual nonlinearities of the relationship between age and osteometric variables. By constructing valid prediction intervals whose size increases with age, MARS models take into account the normal increase of individual variability. MARS models can qualify as a practical and standardized approach for juvenile age estimation. © 2016 American Academy of Forensic Sciences.

  18. A new test of multivariate nonlinear causality

    PubMed Central

    Bai, Zhidong; Jiang, Dandan; Lv, Zhihui; Wong, Wing-Keung; Zheng, Shurong

    2018-01-01

    The multivariate nonlinear Granger causality developed by Bai et al. (2010) (Mathematics and Computers in simulation. 2010; 81: 5-17) plays an important role in detecting the dynamic interrelationships between two groups of variables. Following the idea of Hiemstra-Jones (HJ) test proposed by Hiemstra and Jones (1994) (Journal of Finance. 1994; 49(5): 1639-1664), they attempt to establish a central limit theorem (CLT) of their test statistic by applying the asymptotical property of multivariate U-statistic. However, Bai et al. (2016) (2016; arXiv: 1701.03992) revisit the HJ test and find that the test statistic given by HJ is NOT a function of U-statistics which implies that the CLT neither proposed by Hiemstra and Jones (1994) nor the one extended by Bai et al. (2010) is valid for statistical inference. In this paper, we re-estimate the probabilities and reestablish the CLT of the new test statistic. Numerical simulation shows that our new estimates are consistent and our new test performs decent size and power. PMID:29304085

  19. A new test of multivariate nonlinear causality.

    PubMed

    Bai, Zhidong; Hui, Yongchang; Jiang, Dandan; Lv, Zhihui; Wong, Wing-Keung; Zheng, Shurong

    2018-01-01

    The multivariate nonlinear Granger causality developed by Bai et al. (2010) (Mathematics and Computers in simulation. 2010; 81: 5-17) plays an important role in detecting the dynamic interrelationships between two groups of variables. Following the idea of Hiemstra-Jones (HJ) test proposed by Hiemstra and Jones (1994) (Journal of Finance. 1994; 49(5): 1639-1664), they attempt to establish a central limit theorem (CLT) of their test statistic by applying the asymptotical property of multivariate U-statistic. However, Bai et al. (2016) (2016; arXiv: 1701.03992) revisit the HJ test and find that the test statistic given by HJ is NOT a function of U-statistics which implies that the CLT neither proposed by Hiemstra and Jones (1994) nor the one extended by Bai et al. (2010) is valid for statistical inference. In this paper, we re-estimate the probabilities and reestablish the CLT of the new test statistic. Numerical simulation shows that our new estimates are consistent and our new test performs decent size and power.

  20. Estimating a Preference-Based Index from the Clinical Outcomes in Routine Evaluation–Outcome Measure (CORE-OM)

    PubMed Central

    Brazier, John E.; Rowen, Donna; Barkham, Michael

    2013-01-01

    Background. The Clinical Outcomes in Routine Evaluation–Outcome Measure (CORE-OM) is used to evaluate the effectiveness of psychological therapies in people with common mental disorders. The objective of this study was to estimate a preference-based index for this population using CORE-6D, a health state classification system derived from the CORE-OM consisting of a 5-item emotional component and a physical item, and to demonstrate a novel method for generating states that are not orthogonal. Methods. Rasch analysis was used to identify 11 emotional health states from CORE-6D that were frequently observed in the study population and are, thus, plausible (in contrast, conventional statistical design might generate implausible states). Combined with the 3 response levels of the physical item of CORE-6D, they generate 33 plausible health states, 18 of which were selected for valuation. A valuation survey of 220 members of the public in South Yorkshire, United Kingdom, was undertaken using the time tradeoff (TTO) method. Regression analysis was subsequently used to predict values for all possible states described by CORE-6D. Results. A number of multivariate regression models were built to predict values for the 33 health states of CORE-6D, using the Rasch logit value of the emotional state and the response level of the physical item as independent variables. A cubic model with high predictive value (adjusted R2 = 0.990) was selected to predict TTO values for all 729 CORE-6D health states. Conclusion. The CORE-6D preference-based index will enable the assessment of cost-effectiveness of interventions for people with common mental disorders using existing and prospective CORE-OM data sets. The new method for generating states may be useful for other instruments with highly correlated dimensions. PMID:23178639

  1. Estimation of Cellulose Crystallinity of Lignocelluloses Using Near-IR FT-Raman Spectroscopy and Comparison of the Raman and Segal-WAXS Methods

    Treesearch

    Umesh P. Agarwal; Richard R. Reiner; Sally A. Ralph

    2013-01-01

    Of the recently developed univariate and multivariate near-IR FT-Raman methods for estimating cellulose crystallinity, the former method was applied to a variety of lignocelluloses: softwoods, hardwoods, wood pulps, and agricultural residues/fibers. The effect of autofluorescence on the crystallinity estimation was minimized by solvent extraction or chemical treatment...

  2. The Graphical Display of Simulation Results, with Applications to the Comparison of Robust IRT Estimators of Ability.

    ERIC Educational Resources Information Center

    Thissen, David; Wainer, Howard

    Simulation studies of the performance of (potentially) robust statistical estimation produce large quantities of numbers in the form of performance indices of the various estimators under various conditions. This report presents a multivariate graphical display used to aid in the digestion of the plentiful results in a current study of Item…

  3. Using small area estimation and Lidar-derived variables for multivariate prediction of forest attributes

    Treesearch

    F. Mauro; Vicente Monleon; H. Temesgen

    2015-01-01

    Small area estimation (SAE) techniques have been successfully applied in forest inventories to provide reliable estimates for domains where the sample size is small (i.e. small areas). Previous studies have explored the use of either Area Level or Unit Level Empirical Best Linear Unbiased Predictors (EBLUPs) in a univariate framework, modeling each variable of interest...

  4. Inferring Instantaneous, Multivariate and Nonlinear Sensitivities for the Analysis of Feedback Processes in a Dynamical System: Lorenz Model Case Study

    NASA Technical Reports Server (NTRS)

    Aires, Filipe; Rossow, William B.; Hansen, James E. (Technical Monitor)

    2001-01-01

    A new approach is presented for the analysis of feedback processes in a nonlinear dynamical system by observing its variations. The new methodology consists of statistical estimates of the sensitivities between all pairs of variables in the system based on a neural network modeling of the dynamical system. The model can then be used to estimate the instantaneous, multivariate and nonlinear sensitivities, which are shown to be essential for the analysis of the feedbacks processes involved in the dynamical system. The method is described and tested on synthetic data from the low-order Lorenz circulation model where the correct sensitivities can be evaluated analytically.

  5. Identification of complex metabolic states in critically injured patients using bioinformatic cluster analysis.

    PubMed

    Cohen, Mitchell J; Grossman, Adam D; Morabito, Diane; Knudson, M Margaret; Butte, Atul J; Manley, Geoffrey T

    2010-01-01

    Advances in technology have made extensive monitoring of patient physiology the standard of care in intensive care units (ICUs). While many systems exist to compile these data, there has been no systematic multivariate analysis and categorization across patient physiological data. The sheer volume and complexity of these data make pattern recognition or identification of patient state difficult. Hierarchical cluster analysis allows visualization of high dimensional data and enables pattern recognition and identification of physiologic patient states. We hypothesized that processing of multivariate data using hierarchical clustering techniques would allow identification of otherwise hidden patient physiologic patterns that would be predictive of outcome. Multivariate physiologic and ventilator data were collected continuously using a multimodal bioinformatics system in the surgical ICU at San Francisco General Hospital. These data were incorporated with non-continuous data and stored on a server in the ICU. A hierarchical clustering algorithm grouped each minute of data into 1 of 10 clusters. Clusters were correlated with outcome measures including incidence of infection, multiple organ failure (MOF), and mortality. We identified 10 clusters, which we defined as distinct patient states. While patients transitioned between states, they spent significant amounts of time in each. Clusters were enriched for our outcome measures: 2 of the 10 states were enriched for infection, 6 of 10 were enriched for MOF, and 3 of 10 were enriched for death. Further analysis of correlations between pairs of variables within each cluster reveals significant differences in physiology between clusters. Here we show for the first time the feasibility of clustering physiological measurements to identify clinically relevant patient states after trauma. These results demonstrate that hierarchical clustering techniques can be useful for visualizing complex multivariate data and may provide new insights for the care of critically injured patients.

  6. Statistical analysis of latent generalized correlation matrix estimation in transelliptical distribution

    PubMed Central

    Han, Fang; Liu, Han

    2016-01-01

    Correlation matrix plays a key role in many multivariate methods (e.g., graphical model estimation and factor analysis). The current state-of-the-art in estimating large correlation matrices focuses on the use of Pearson’s sample correlation matrix. Although Pearson’s sample correlation matrix enjoys various good properties under Gaussian models, its not an effective estimator when facing heavy-tail distributions with possible outliers. As a robust alternative, Han and Liu (2013b) advocated the use of a transformed version of the Kendall’s tau sample correlation matrix in estimating high dimensional latent generalized correlation matrix under the transelliptical distribution family (or elliptical copula). The transelliptical family assumes that after unspecified marginal monotone transformations, the data follow an elliptical distribution. In this paper, we study the theoretical properties of the Kendall’s tau sample correlation matrix and its transformed version proposed in Han and Liu (2013b) for estimating the population Kendall’s tau correlation matrix and the latent Pearson’s correlation matrix under both spectral and restricted spectral norms. With regard to the spectral norm, we highlight the role of “effective rank” in quantifying the rate of convergence. With regard to the restricted spectral norm, we for the first time present a “sign subgaussian condition” which is sufficient to guarantee that the rank-based correlation matrix estimator attains the optimal rate of convergence. In both cases, we do not need any moment condition. PMID:28337068

  7. Order-restricted inference for multivariate longitudinal data with applications to the natural history of hearing loss.

    PubMed

    Rosen, Sophia; Davidov, Ori

    2012-07-20

    Multivariate outcomes are often measured longitudinally. For example, in hearing loss studies, hearing thresholds for each subject are measured repeatedly over time at several frequencies. Thus, each patient is associated with a multivariate longitudinal outcome. The multivariate mixed-effects model is a useful tool for the analysis of such data. There are situations in which the parameters of the model are subject to some restrictions or constraints. For example, it is known that hearing thresholds, at every frequency, increase with age. Moreover, this age-related threshold elevation is monotone in frequency, that is, the higher the frequency, the higher, on average, is the rate of threshold elevation. This means that there is a natural ordering among the different frequencies in the rate of hearing loss. In practice, this amounts to imposing a set of constraints on the different frequencies' regression coefficients modeling the mean effect of time and age at entry to the study on hearing thresholds. The aforementioned constraints should be accounted for in the analysis. The result is a multivariate longitudinal model with restricted parameters. We propose estimation and testing procedures for such models. We show that ignoring the constraints may lead to misleading inferences regarding the direction and the magnitude of various effects. Moreover, simulations show that incorporating the constraints substantially improves the mean squared error of the estimates and the power of the tests. We used this methodology to analyze a real hearing loss study. Copyright © 2012 John Wiley & Sons, Ltd.

  8. Up-scaling of multi-variable flood loss models from objects to land use units at the meso-scale

    NASA Astrophysics Data System (ADS)

    Kreibich, Heidi; Schröter, Kai; Merz, Bruno

    2016-05-01

    Flood risk management increasingly relies on risk analyses, including loss modelling. Most of the flood loss models usually applied in standard practice have in common that complex damaging processes are described by simple approaches like stage-damage functions. Novel multi-variable models significantly improve loss estimation on the micro-scale and may also be advantageous for large-scale applications. However, more input parameters also reveal additional uncertainty, even more in upscaling procedures for meso-scale applications, where the parameters need to be estimated on a regional area-wide basis. To gain more knowledge about challenges associated with the up-scaling of multi-variable flood loss models the following approach is applied: Single- and multi-variable micro-scale flood loss models are up-scaled and applied on the meso-scale, namely on basis of ATKIS land-use units. Application and validation is undertaken in 19 municipalities, which were affected during the 2002 flood by the River Mulde in Saxony, Germany by comparison to official loss data provided by the Saxon Relief Bank (SAB).In the meso-scale case study based model validation, most multi-variable models show smaller errors than the uni-variable stage-damage functions. The results show the suitability of the up-scaling approach, and, in accordance with micro-scale validation studies, that multi-variable models are an improvement in flood loss modelling also on the meso-scale. However, uncertainties remain high, stressing the importance of uncertainty quantification. Thus, the development of probabilistic loss models, like BT-FLEMO used in this study, which inherently provide uncertainty information are the way forward.

  9. Estimating the Classification Efficiency of a Test Battery.

    ERIC Educational Resources Information Center

    De Corte, Wilfried

    2000-01-01

    Shows how a theorem proven by H. Brogden (1951, 1959) can be used to estimate the allocation average (a predictor based classification of a test battery) assuming that the predictor intercorrelations and validities are known and that the predictor variables have a joint multivariate normal distribution. (SLD)

  10. Spatial extremes modeling applied to extreme precipitation data in the state of Paraná

    NASA Astrophysics Data System (ADS)

    Olinda, R. A.; Blanchet, J.; dos Santos, C. A. C.; Ozaki, V. A.; Ribeiro, P. J., Jr.

    2014-11-01

    Most of the mathematical models developed for rare events are based on probabilistic models for extremes. Although the tools for statistical modeling of univariate and multivariate extremes are well developed, the extension of these tools to model spatial extremes includes an area of very active research nowadays. A natural approach to such a modeling is the theory of extreme spatial and the max-stable process, characterized by the extension of infinite dimensions of multivariate extreme value theory, and making it possible then to incorporate the existing correlation functions in geostatistics and therefore verify the extremal dependence by means of the extreme coefficient and the Madogram. This work describes the application of such processes in modeling the spatial maximum dependence of maximum monthly rainfall from the state of Paraná, based on historical series observed in weather stations. The proposed models consider the Euclidean space and a transformation referred to as space weather, which may explain the presence of directional effects resulting from synoptic weather patterns. This method is based on the theorem proposed for de Haan and on the models of Smith and Schlather. The isotropic and anisotropic behavior of these models is also verified via Monte Carlo simulation. Estimates are made through pairwise likelihood maximum and the models are compared using the Takeuchi Information Criterion. By modeling the dependence of spatial maxima, applied to maximum monthly rainfall data from the state of Paraná, it was possible to identify directional effects resulting from meteorological phenomena, which, in turn, are important for proper management of risks and environmental disasters in countries with its economy heavily dependent on agribusiness.

  11. Local Health Departments' Promotion of Mental Health Care and Reductions in 30-Day All-Cause Readmission Rates in Maryland.

    PubMed

    Chen, Jie; Novak, Priscilla; Barath, Deanna; Goldman, Howard; Mortensen, Karoline

    2018-02-01

    Individuals affected with mental health conditions, including mood disorders and substance abuse, are at an increased risk of hospital readmission. The objective of this study is to examine whether local health departments' (LHDs) active roles of promoting mental health are associated with reductions in 30-day all-cause readmission rates, a common quality metric. Using datasets linked from multiple sources, including 2012-2013 State Inpatient Databases for the State of Maryland, the National Association of County and City Health Officials Profiles Survey, the Area Health Resource File, and US Census data, we employed multivariate logistic models to examine whether LHDs' active provision of mental health preventive care, mental health services, and health promotion were associated with the likelihood of having any 30-day all-cause readmission. Multivariate logistic regressions showed that LHDs' provision of mental health preventive care, mental health services, and health promotion were negatively associated with the likelihoods of having any 30-day readmission for adults 18-64 years old (odds ratios=0.71-0.82, P<0.001), and adults 65 and above (odds ratios=0.61-0.63, P<0.001, preventive care and services, respectively). These estimated associations were more prominent among individuals with mental illness and/or substance use disorders, African Americans, Medicare, and Medicaid enrollees. Our results suggest that LHDs in Maryland that engage in mental health prevention, promotion, and coordination activities are associated with benefits for residents and for the health care system at large. Additional research is needed to evaluate LHD activities in other states to determine if these results are generalizable.

  12. Characteristics of Adults Who Switched From Cigarette Smoking to E-cigarettes.

    PubMed

    Park, Su Hyun; Duncan, Dustin T; Shahawy, Omar El; Lee, Lily; Shearston, Jenni A; Tamura, Kosuke; Sherman, Scott E; Weitzman, Michael

    2017-11-01

    Because of the rapidly increasing use of electronic cigarettes (e-cigarettes), this study aimed to investigate the individual characteristics and state-level prevalence of U.S. adults who have switched to e-cigarettes from traditional cigarettes. Data from the 2012-2013 and 2013-2014 National Adult Tobacco Surveys were analyzed in 2016. Relative percent change in switching was estimated, and the state-specific prevalence of adults who switched to e-cigarettes from traditional cigarettes was calculated and mapped. Multivariate logistic regression was conducted to examine how switching varied by sociodemographic subgroups and region. Overall, the number of individuals who switched from traditional cigarettes to e-cigarettes increased by approximately 100% over the 1-year interval. Significant increases were found among a number of sociodemographics and regions. Multivariate logistic regression analyses showed that young adults and those living in the South and West were more likely to switch to e-cigarettes, compared to former smokers who did not switch. Compared with current dual users, those with higher education and those who were not single were more likely to switch to e-cigarettes. The state with the highest prevalence of switching was New Mexico (7.3%), whereas Connecticut had the lowest prevalence (0.8 %) among former smokers. There is an increase in the progression from traditional cigarette use to e-cigarette use. Further research is warranted to determine whether this change continues and facilitates cigarette smoking cessation as a possible public health benefit and opportunity to save lives rather than constitutes a potential threat to public health. Copyright © 2017 American Journal of Preventive Medicine. Published by Elsevier Inc. All rights reserved.

  13. Nonlinear Decoupling Control With ANFIS-Based Unmodeled Dynamics Compensation for a Class of Complex Industrial Processes.

    PubMed

    Zhang, Yajun; Chai, Tianyou; Wang, Hong; Wang, Dianhui; Chen, Xinkai

    2018-06-01

    Complex industrial processes are multivariable and generally exhibit strong coupling among their control loops with heavy nonlinear nature. These make it very difficult to obtain an accurate model. As a result, the conventional and data-driven control methods are difficult to apply. Using a twin-tank level control system as an example, a novel multivariable decoupling control algorithm with adaptive neural-fuzzy inference system (ANFIS)-based unmodeled dynamics (UD) compensation is proposed in this paper for a class of complex industrial processes. At first, a nonlinear multivariable decoupling controller with UD compensation is introduced. Different from the existing methods, the decomposition estimation algorithm using ANFIS is employed to estimate the UD, and the desired estimating and decoupling control effects are achieved. Second, the proposed method does not require the complicated switching mechanism which has been commonly used in the literature. This significantly simplifies the obtained decoupling algorithm and its realization. Third, based on some new lemmas and theorems, the conditions on the stability and convergence of the closed-loop system are analyzed to show the uniform boundedness of all the variables. This is then followed by the summary on experimental tests on a heavily coupled nonlinear twin-tank system that demonstrates the effectiveness and the practicability of the proposed method.

  14. Multivariate space-time modelling of multiple air pollutants and their health effects accounting for exposure uncertainty.

    PubMed

    Huang, Guowen; Lee, Duncan; Scott, E Marian

    2018-03-30

    The long-term health effects of air pollution are often estimated using a spatio-temporal ecological areal unit study, but this design leads to the following statistical challenges: (1) how to estimate spatially representative pollution concentrations for each areal unit; (2) how to allow for the uncertainty in these estimated concentrations when estimating their health effects; and (3) how to simultaneously estimate the joint effects of multiple correlated pollutants. This article proposes a novel 2-stage Bayesian hierarchical model for addressing these 3 challenges, with inference based on Markov chain Monte Carlo simulation. The first stage is a multivariate spatio-temporal fusion model for predicting areal level average concentrations of multiple pollutants from both monitored and modelled pollution data. The second stage is a spatio-temporal model for estimating the health impact of multiple correlated pollutants simultaneously, which accounts for the uncertainty in the estimated pollution concentrations. The novel methodology is motivated by a new study of the impact of both particulate matter and nitrogen dioxide concentrations on respiratory hospital admissions in Scotland between 2007 and 2011, and the results suggest that both pollutants exhibit substantial and independent health effects. © 2017 The Authors. Statistics in Medicine Published by John Wiley & Sons Ltd.

  15. Preliminary Multivariable Cost Model for Space Telescopes

    NASA Technical Reports Server (NTRS)

    Stahl, H. Philip

    2010-01-01

    Parametric cost models are routinely used to plan missions, compare concepts and justify technology investments. Previously, the authors published two single variable cost models based on 19 flight missions. The current paper presents the development of a multi-variable space telescopes cost model. The validity of previously published models are tested. Cost estimating relationships which are and are not significant cost drivers are identified. And, interrelationships between variables are explored

  16. Design, evaluation and test of an electronic, multivariable control for the F100 turbofan engine

    NASA Technical Reports Server (NTRS)

    Skira, C. A.; Dehoff, R. L.; Hall, W. E., Jr.

    1980-01-01

    A digital, multivariable control design procedure for the F100 turbofan engine is described. The controller is based on locally linear synthesis techniques using linear, quadratic regulator design methods. The control structure uses an explicit model reference form with proportional and integral feedback near a nominal trajectory. Modeling issues, design procedures for the control law and the estimation of poorly measured variables are presented.

  17. A multivariate analysis of genetic constraints to life history evolution in a wild population of red deer.

    PubMed

    Walling, Craig A; Morrissey, Michael B; Foerster, Katharina; Clutton-Brock, Tim H; Pemberton, Josephine M; Kruuk, Loeske E B

    2014-12-01

    Evolutionary theory predicts that genetic constraints should be widespread, but empirical support for their existence is surprisingly rare. Commonly applied univariate and bivariate approaches to detecting genetic constraints can underestimate their prevalence, with important aspects potentially tractable only within a multivariate framework. However, multivariate genetic analyses of data from natural populations are challenging because of modest sample sizes, incomplete pedigrees, and missing data. Here we present results from a study of a comprehensive set of life history traits (juvenile survival, age at first breeding, annual fecundity, and longevity) for both males and females in a wild, pedigreed, population of red deer (Cervus elaphus). We use factor analytic modeling of the genetic variance-covariance matrix ( G: ) to reduce the dimensionality of the problem and take a multivariate approach to estimating genetic constraints. We consider a range of metrics designed to assess the effect of G: on the deflection of a predicted response to selection away from the direction of fastest adaptation and on the evolvability of the traits. We found limited support for genetic constraint through genetic covariances between traits, both within sex and between sexes. We discuss these results with respect to other recent findings and to the problems of estimating these parameters for natural populations. Copyright © 2014 Walling et al.

  18. A Multivariate Analysis of Genetic Constraints to Life History Evolution in a Wild Population of Red Deer

    PubMed Central

    Walling, Craig A.; Morrissey, Michael B.; Foerster, Katharina; Clutton-Brock, Tim H.; Pemberton, Josephine M.; Kruuk, Loeske E. B.

    2014-01-01

    Evolutionary theory predicts that genetic constraints should be widespread, but empirical support for their existence is surprisingly rare. Commonly applied univariate and bivariate approaches to detecting genetic constraints can underestimate their prevalence, with important aspects potentially tractable only within a multivariate framework. However, multivariate genetic analyses of data from natural populations are challenging because of modest sample sizes, incomplete pedigrees, and missing data. Here we present results from a study of a comprehensive set of life history traits (juvenile survival, age at first breeding, annual fecundity, and longevity) for both males and females in a wild, pedigreed, population of red deer (Cervus elaphus). We use factor analytic modeling of the genetic variance–covariance matrix (G) to reduce the dimensionality of the problem and take a multivariate approach to estimating genetic constraints. We consider a range of metrics designed to assess the effect of G on the deflection of a predicted response to selection away from the direction of fastest adaptation and on the evolvability of the traits. We found limited support for genetic constraint through genetic covariances between traits, both within sex and between sexes. We discuss these results with respect to other recent findings and to the problems of estimating these parameters for natural populations. PMID:25278555

  19. A statistical framework for neuroimaging data analysis based on mutual information estimated via a gaussian copula

    PubMed Central

    Giordano, Bruno L.; Kayser, Christoph; Rousselet, Guillaume A.; Gross, Joachim; Schyns, Philippe G.

    2016-01-01

    Abstract We begin by reviewing the statistical framework of information theory as applicable to neuroimaging data analysis. A major factor hindering wider adoption of this framework in neuroimaging is the difficulty of estimating information theoretic quantities in practice. We present a novel estimation technique that combines the statistical theory of copulas with the closed form solution for the entropy of Gaussian variables. This results in a general, computationally efficient, flexible, and robust multivariate statistical framework that provides effect sizes on a common meaningful scale, allows for unified treatment of discrete, continuous, unidimensional and multidimensional variables, and enables direct comparisons of representations from behavioral and brain responses across any recording modality. We validate the use of this estimate as a statistical test within a neuroimaging context, considering both discrete stimulus classes and continuous stimulus features. We also present examples of analyses facilitated by these developments, including application of multivariate analyses to MEG planar magnetic field gradients, and pairwise temporal interactions in evoked EEG responses. We show the benefit of considering the instantaneous temporal derivative together with the raw values of M/EEG signals as a multivariate response, how we can separately quantify modulations of amplitude and direction for vector quantities, and how we can measure the emergence of novel information over time in evoked responses. Open‐source Matlab and Python code implementing the new methods accompanies this article. Hum Brain Mapp 38:1541–1573, 2017. © 2016 Wiley Periodicals, Inc. PMID:27860095

  20. Comparison of different statistical methods for estimation of extreme sea levels with wave set-up contribution

    NASA Astrophysics Data System (ADS)

    Kergadallan, Xavier; Bernardara, Pietro; Benoit, Michel; Andreewsky, Marc; Weiss, Jérôme

    2013-04-01

    Estimating the probability of occurrence of extreme sea levels is a central issue for the protection of the coast. Return periods of sea level with wave set-up contribution are estimated here in one site : Cherbourg in France in the English Channel. The methodology follows two steps : the first one is computation of joint probability of simultaneous wave height and still sea level, the second one is interpretation of that joint probabilities to assess a sea level for a given return period. Two different approaches were evaluated to compute joint probability of simultaneous wave height and still sea level : the first one is multivariate extreme values distributions of logistic type in which all components of the variables become large simultaneously, the second one is conditional approach for multivariate extreme values in which only one component of the variables have to be large. Two different methods were applied to estimate sea level with wave set-up contribution for a given return period : Monte-Carlo simulation in which estimation is more accurate but needs higher calculation time and classical ocean engineering design contours of type inverse-FORM in which the method is simpler and allows more complex estimation of wave setup part (wave propagation to the coast for example). We compare results from the two different approaches with the two different methods. To be able to use both Monte-Carlo simulation and design contours methods, wave setup is estimated with an simple empirical formula. We show advantages of the conditional approach compared to the multivariate extreme values approach when extreme sea-level occurs when either surge or wave height is large. We discuss the validity of the ocean engineering design contours method which is an alternative when computation of sea levels is too complex to use Monte-Carlo simulation method.

  1. Quality of Acute Care for Patients With Urinary Stones in the United States.

    PubMed

    Scales, Charles D; Bergman, Jonathan; Carter, Stacey; Jack, Gregory; Saigal, Christopher S; Litwin, Mark S

    2015-11-01

    To describe guideline adherence for patients with suspected upper tract stones. We performed a cross-sectional analysis of visits recorded by the National Hospital Ambulatory Medical Care Survey (emergency department [ED] component) in 2007-2010 (most recent data). We assessed adherence to clinical guidelines for diagnostic laboratory testing, imaging, and pharmacologic therapy. Multivariable regression models controlled for important covariates. An estimated 4,956,444 ED visits for patients with suspected kidney stones occurred during the study period. Guideline adherence was highest for diagnostic imaging, with 3,122,229 (63%) visits providing optimal imaging. Complete guideline-based laboratory testing occurred in only 2 of every 5 visits. Pharmacologic therapy to facilitate stone passage was prescribed during only 17% of eligible visits. In multivariable analysis of guideline adherence, we found little variation by patient, provider, or facility characteristics. Guideline-recommended care was absent from a substantial proportion of acute care visits for patients with suspected kidney stones. These failures of care delivery likely increase costs and temporary disability. Targeted interventions to improve guideline adherence should be designed and evaluated to improve care for patients with symptomatic kidney stones. Published by Elsevier Inc.

  2. Quality of Acute Care for Patients with Urinary Stones in the United States

    PubMed Central

    Scales, Charles D.; Bergman, Jonathan; Carter, Stacey; Jack, Gregory; Saigal, Christopher S.; Litwin, Mark S.

    2015-01-01

    Objective To describe guideline adherence for patients with suspected upper tract stones. Methods We performed a cross-sectional analysis of visits recorded by the National Hospital Ambulatory Medical Care Survey (ED component) in 2007–2010 (most recent data). We assessed adherence to clinical guidelines for diagnostic laboratory testing, imaging, and pharmacologic therapy. Multivariable regression models controlled for important covariates. Results An estimated 4,956,444 ED visits for patients with suspected kidney stones occurred during the study period. Guideline adherence was highest for diagnostic imaging, with 3,122,229 (63%) visits providing optimal imaging. Complete guideline-based laboratory testing occurred in only 2 of every 5 visits. Pharmacologic therapy to facilitate stone passage was prescribed during only 17% of eligible visits. In multivariable analysis of guideline adherence, we found little variation by patient, provider or facility characteristics. Conclusions Guideline-recommended care was absent from a substantial proportion of acute care visits for patients with suspected kidney stones. These failures of care delivery likely increase costs and temporary disability. Targeted interventions to improve guideline adherence should be designed and evaluated to improve care for patients with symptomatic kidney stones. PMID:26335495

  3. A comparison of multivariate and univariate time series approaches to modelling and forecasting emergency department demand in Western Australia.

    PubMed

    Aboagye-Sarfo, Patrick; Mai, Qun; Sanfilippo, Frank M; Preen, David B; Stewart, Louise M; Fatovich, Daniel M

    2015-10-01

    To develop multivariate vector-ARMA (VARMA) forecast models for predicting emergency department (ED) demand in Western Australia (WA) and compare them to the benchmark univariate autoregressive moving average (ARMA) and Winters' models. Seven-year monthly WA state-wide public hospital ED presentation data from 2006/07 to 2012/13 were modelled. Graphical and VARMA modelling methods were used for descriptive analysis and model fitting. The VARMA models were compared to the benchmark univariate ARMA and Winters' models to determine their accuracy to predict ED demand. The best models were evaluated by using error correction methods for accuracy. Descriptive analysis of all the dependent variables showed an increasing pattern of ED use with seasonal trends over time. The VARMA models provided a more precise and accurate forecast with smaller confidence intervals and better measures of accuracy in predicting ED demand in WA than the ARMA and Winters' method. VARMA models are a reliable forecasting method to predict ED demand for strategic planning and resource allocation. While the ARMA models are a closely competing alternative, they under-estimated future ED demand. Copyright © 2015 Elsevier Inc. All rights reserved.

  4. Testing the performance of pure spectrum resolution from Raman hyperspectral images of differently manufactured pharmaceutical tablets.

    PubMed

    Vajna, Balázs; Farkas, Attila; Pataki, Hajnalka; Zsigmond, Zsolt; Igricz, Tamás; Marosi, György

    2012-01-27

    Chemical imaging is a rapidly emerging analytical method in pharmaceutical technology. Due to the numerous chemometric solutions available, characterization of pharmaceutical samples with unknown components present has also become possible. This study compares the performance of current state-of-the-art curve resolution methods (multivariate curve resolution-alternating least squares, positive matrix factorization, simplex identification via split augmented Lagrangian and self-modelling mixture analysis) in the estimation of pure component spectra from Raman maps of differently manufactured pharmaceutical tablets. The batches of different technologies differ in the homogeneity level of the active ingredient, thus, the curve resolution methods are tested under different conditions. An empirical approach is shown to determine the number of components present in a sample. The chemometric algorithms are compared regarding the number of detected components, the quality of the resolved spectra and the accuracy of scores (spectral concentrations) compared to those calculated with classical least squares, using the true pure component (reference) spectra. It is demonstrated that using appropriate multivariate methods, Raman chemical imaging can be a useful tool in the non-invasive characterization of unknown (e.g. illegal or counterfeit) pharmaceutical products. Copyright © 2011 Elsevier B.V. All rights reserved.

  5. Predictor-based multivariable closed-loop system identification of the EXTRAP T2R reversed field pinch external plasma response

    NASA Astrophysics Data System (ADS)

    Olofsson, K. Erik J.; Brunsell, Per R.; Rojas, Cristian R.; Drake, James R.; Hjalmarsson, Håkan

    2011-08-01

    The usage of computationally feasible overparametrized and nonregularized system identification signal processing methods is assessed for automated determination of the full reversed-field pinch external plasma response spectrum for the experiment EXTRAP T2R. No assumptions on the geometry of eigenmodes are imposed. The attempted approach consists of high-order autoregressive exogenous estimation followed by Markov block coefficient construction and Hankel matrix singular value decomposition. It is seen that the obtained 'black-box' state-space models indeed can be compared with the commonplace ideal magnetohydrodynamics (MHD) resistive thin-shell model in cylindrical geometry. It is possible to directly map the most unstable autodetected empirical system pole to the corresponding theoretical resistive shell MHD eigenmode.

  6. Analysis of multivariate longitudinal kidney function outcomes using generalized linear mixed models.

    PubMed

    Jaffa, Miran A; Gebregziabher, Mulugeta; Jaffa, Ayad A

    2015-06-14

    Renal transplant patients are mandated to have continuous assessment of their kidney function over time to monitor disease progression determined by changes in blood urea nitrogen (BUN), serum creatinine (Cr), and estimated glomerular filtration rate (eGFR). Multivariate analysis of these outcomes that aims at identifying the differential factors that affect disease progression is of great clinical significance. Thus our study aims at demonstrating the application of different joint modeling approaches with random coefficients on a cohort of renal transplant patients and presenting a comparison of their performance through a pseudo-simulation study. The objective of this comparison is to identify the model with best performance and to determine whether accuracy compensates for complexity in the different multivariate joint models. We propose a novel application of multivariate Generalized Linear Mixed Models (mGLMM) to analyze multiple longitudinal kidney function outcomes collected over 3 years on a cohort of 110 renal transplantation patients. The correlated outcomes BUN, Cr, and eGFR and the effect of various covariates such patient's gender, age and race on these markers was determined holistically using different mGLMMs. The performance of the various mGLMMs that encompass shared random intercept (SHRI), shared random intercept and slope (SHRIS), separate random intercept (SPRI) and separate random intercept and slope (SPRIS) was assessed to identify the one that has the best fit and most accurate estimates. A bootstrap pseudo-simulation study was conducted to gauge the tradeoff between the complexity and accuracy of the models. Accuracy was determined using two measures; the mean of the differences between the estimates of the bootstrapped datasets and the true beta obtained from the application of each model on the renal dataset, and the mean of the square of these differences. The results showed that SPRI provided most accurate estimates and did not exhibit any computational or convergence problem. Higher accuracy was demonstrated when the level of complexity increased from shared random coefficient models to the separate random coefficient alternatives with SPRI showing to have the best fit and most accurate estimates.

  7. Considerations in cross-validation type density smoothing with a look at some data

    NASA Technical Reports Server (NTRS)

    Schuster, E. F.

    1982-01-01

    Experience gained in applying nonparametric maximum likelihood techniques of density estimation to judge the comparative quality of various estimators is reported. Two invariate data sets of one hundered samples (one Cauchy, one natural normal) are considered as well as studies in the multivariate case.

  8. A nonparametric clustering technique which estimates the number of clusters

    NASA Technical Reports Server (NTRS)

    Ramey, D. B.

    1983-01-01

    In applications of cluster analysis, one usually needs to determine the number of clusters, K, and the assignment of observations to each cluster. A clustering technique based on recursive application of a multivariate test of bimodality which automatically estimates both K and the cluster assignments is presented.

  9. Specifying and Refining a Complex Measurement Model.

    ERIC Educational Resources Information Center

    Levy, Roy; Mislevy, Robert J.

    This paper aims to describe a Bayesian approach to modeling and estimating cognitive models both in terms of statistical machinery and actual instrument development. Such a method taps the knowledge of experts to provide initial estimates for the probabilistic relationships among the variables in a multivariate latent variable model and refines…

  10. Expansion of Coverage under the Patient Protection and Affordable Care Act and Primary Care Utilization

    PubMed Central

    Hofer, Adam N; Abraham, Jean Marie; Moscovice, Ira

    2011-01-01

    Context: Provisions of the Patient Protection and Affordable Care Act of 2010 (PPACA) expand Medicaid to all individuals in families earning less than 133 percent of the federal poverty level (FPL) and make available subsidies to uninsured lower-income Americans (133 to 400 percent of FPL) without access to employer-based coverage to purchase insurance in new exchanges. Since primary care physicians typically serve as the point of entry into the health care delivery system, an adequate supply of them is critical to meeting the anticipated increase in demand for medical care resulting from the expansion of coverage. This article provides state-level estimates of the anticipated increases in primary care utilization given the PPACA's provisions for expanded coverage. Methods: Using the Medical Expenditure Panel Survey, this article estimates a multivariate regression model of annual primary care utilization. Using the model estimates and state-level information regarding the number of uninsured, it predicts, by state, the change in primary care visits expected from the expanded coverage. Finally, the article predicts the number of primary care physicians needed to accommodate this change in utilization. Findings: This expanded coverage is predicted to increase by 2019 the number of annual primary care visits between 15.07 million and 24.26 million. Assuming stable levels of physicians’ productivity, between 4,307 and 6,940 additional primary care physicians would be needed to accommodate this increase. Conclusions: The PPACA's health insurance expansion parameters are expected to significantly increase the use of primary care. Two strategies that policymakers may consider are creating stronger financial incentives to attract medical school students to primary care and changing the delivery of care in ways that lead to operational improvements, higher throughput, and better quality of care. PMID:21418313

  11. Obesity Prevalence by Occupation in Washington State, Behavioral Risk Factor Surveillance System

    PubMed Central

    Lu, Dayu; Fan, Z. Joyce

    2014-01-01

    Introduction Data that estimate the prevalence of and risk factors for worker obesity by occupation are generally unavailable and could inform the prioritization of workplace wellness programs. The aims of this study were to estimate the prevalence of obesity by occupation, examine the association of occupational physical activity and a range of health behaviors with obesity, and identify occupations in which workers are at high risk of obesity in Washington State. Methods We conducted descriptive and multivariable analyses among 37,626 employed Washington State respondents using the Behavioral Risk Factor Surveillance System in odd numbered years, from 2003 through 2009. We estimated prevalence and prevalence ratios (PRs) by occupational groups adjusting for demographics, occupational physical activity level, smoking, fruit and vegetable consumption, and leisure-time physical activity (LPTA). Results Overall obesity prevalence was 24.6% (95% confidence interval [CI], 24.0–25.1). Workers in protective services were 2.46 (95% CI, 1.72–3.50) times as likely to be obese as workers in health diagnosing occupations. Compared with their counterparts, workers who consumed adequate amounts of fruits and vegetables and had adequate LTPA were significantly less likely to be obese (PR = 0.91; 95% CI, 0.86–0.97 and PR = 0.63; 95% CI, 0.60–0.67, respectively). Workers with physically demanding occupational physical activity had a lower PR of obesity (PR = 0.83; 95% CI, 0.78–0.88) than those with nonphysically demanding occupational physical activity. Conclusion Obesity prevalence and health risk behaviors vary substantially by occupation. Employers, policy makers, and health promotion practitioners can use our results to target and prioritize workplace obesity prevention and health behavior promotion programs. PMID:24406093

  12. Electronic fetal heart rate monitoring and its relationship to neonatal and infant mortality in the United States.

    PubMed

    Chen, Han-Yang; Chauhan, Suneet P; Ananth, Cande V; Vintzileos, Anthony M; Abuhamad, Alfred Z

    2011-06-01

    To examine the association between electronic fetal heart rate monitoring and neonatal and infant mortality, as well as neonatal morbidity. We used the United States 2004 linked birth and infant death data. Multivariable log-binomial regression models were fitted to estimate risk ratio for association between electronic fetal heart rate monitoring and mortality, while adjusting for potential confounders. In 2004, 89% of singleton pregnancies had electronic fetal heart rate monitoring. Electronic fetal heart rate monitoring was associated with significantly lower infant mortality (adjusted relative risk, 0.75); this was mainly driven by the lower risk of early neonatal mortality (adjusted relative risk, 0.50). In low-risk pregnancies, electronic fetal heart rate monitoring was associated with decreased risk for Apgar scores <4 at 5 minutes (relative risk, 0.54); in high-risk pregnancies, with decreased risk of neonatal seizures (relative risk, 0.65). In the United States, the use of electronic fetal heart rate monitoring was associated with a substantial decrease in early neonatal mortality and morbidity that lowered infant mortality. Copyright © 2011 Mosby, Inc. All rights reserved.

  13. Predictive equations for the estimation of body size in seals and sea lions (Carnivora: Pinnipedia)

    PubMed Central

    Churchill, Morgan; Clementz, Mark T; Kohno, Naoki

    2014-01-01

    Body size plays an important role in pinniped ecology and life history. However, body size data is often absent for historical, archaeological, and fossil specimens. To estimate the body size of pinnipeds (seals, sea lions, and walruses) for today and the past, we used 14 commonly preserved cranial measurements to develop sets of single variable and multivariate predictive equations for pinniped body mass and total length. Principal components analysis (PCA) was used to test whether separate family specific regressions were more appropriate than single predictive equations for Pinnipedia. The influence of phylogeny was tested with phylogenetic independent contrasts (PIC). The accuracy of these regressions was then assessed using a combination of coefficient of determination, percent prediction error, and standard error of estimation. Three different methods of multivariate analysis were examined: bidirectional stepwise model selection using Akaike information criteria; all-subsets model selection using Bayesian information criteria (BIC); and partial least squares regression. The PCA showed clear discrimination between Otariidae (fur seals and sea lions) and Phocidae (earless seals) for the 14 measurements, indicating the need for family-specific regression equations. The PIC analysis found that phylogeny had a minor influence on relationship between morphological variables and body size. The regressions for total length were more accurate than those for body mass, and equations specific to Otariidae were more accurate than those for Phocidae. Of the three multivariate methods, the all-subsets approach required the fewest number of variables to estimate body size accurately. We then used the single variable predictive equations and the all-subsets approach to estimate the body size of two recently extinct pinniped taxa, the Caribbean monk seal (Monachus tropicalis) and the Japanese sea lion (Zalophus japonicus). Body size estimates using single variable regressions generally under or over-estimated body size; however, the all-subset regression produced body size estimates that were close to historically recorded body length for these two species. This indicates that the all-subset regression equations developed in this study can estimate body size accurately. PMID:24916814

  14. Estimating Demand for and Supply of Pediatric Preventive Dental Care for Children and Identifying Dental Care Shortage Areas, Georgia, 2015.

    PubMed

    Cao, Shanshan; Gentili, Monica; Griffin, Paul M; Griffin, Susan O; Harati, Pravara; Johnson, Ben; Serban, Nicoleta; Tomar, Scott

    Demand for dental care is expected to outpace supply through 2025. The objectives of this study were to determine the extent of pediatric dental care shortages in Georgia and to develop a general method for estimation that can be applied to other states. We estimated supply and demand for pediatric preventive dental care for the 159 counties in Georgia in 2015. We compared pediatric preventive dental care shortage areas (where demand exceeded twice the supply) designated by our methods with dental health professional shortage areas designated by the Health Resources & Services Administration. We estimated caries risk from a multivariate analysis of National Health and Nutrition Examination Survey data and national census data. We estimated county-level demand based on the time needed to perform preventive dental care services and the proportion of time that dentists spend on pediatric preventive dental care services from the Medical Expenditure Panel Survey. Pediatric preventive dental care supply exceeded demand in Georgia in 75 counties: the average annual county-level pediatric preventive dental care demand was 16 866 hours, and the supply was 32 969 hours. We identified 41 counties as pediatric dental care shortage areas, 14 of which had not been designated by the Health Resources & Services Administration. Age- and service-specific information on dental care shortage areas could result in more efficient provider staffing and geographic targeting.

  15. Multivariate meta-analysis: potential and promise.

    PubMed

    Jackson, Dan; Riley, Richard; White, Ian R

    2011-09-10

    The multivariate random effects model is a generalization of the standard univariate model. Multivariate meta-analysis is becoming more commonly used and the techniques and related computer software, although continually under development, are now in place. In order to raise awareness of the multivariate methods, and discuss their advantages and disadvantages, we organized a one day 'Multivariate meta-analysis' event at the Royal Statistical Society. In addition to disseminating the most recent developments, we also received an abundance of comments, concerns, insights, critiques and encouragement. This article provides a balanced account of the day's discourse. By giving others the opportunity to respond to our assessment, we hope to ensure that the various view points and opinions are aired before multivariate meta-analysis simply becomes another widely used de facto method without any proper consideration of it by the medical statistics community. We describe the areas of application that multivariate meta-analysis has found, the methods available, the difficulties typically encountered and the arguments for and against the multivariate methods, using four representative but contrasting examples. We conclude that the multivariate methods can be useful, and in particular can provide estimates with better statistical properties, but also that these benefits come at the price of making more assumptions which do not result in better inference in every case. Although there is evidence that multivariate meta-analysis has considerable potential, it must be even more carefully applied than its univariate counterpart in practice. Copyright © 2011 John Wiley & Sons, Ltd.

  16. Multivariate meta-analysis: Potential and promise

    PubMed Central

    Jackson, Dan; Riley, Richard; White, Ian R

    2011-01-01

    The multivariate random effects model is a generalization of the standard univariate model. Multivariate meta-analysis is becoming more commonly used and the techniques and related computer software, although continually under development, are now in place. In order to raise awareness of the multivariate methods, and discuss their advantages and disadvantages, we organized a one day ‘Multivariate meta-analysis’ event at the Royal Statistical Society. In addition to disseminating the most recent developments, we also received an abundance of comments, concerns, insights, critiques and encouragement. This article provides a balanced account of the day's discourse. By giving others the opportunity to respond to our assessment, we hope to ensure that the various view points and opinions are aired before multivariate meta-analysis simply becomes another widely used de facto method without any proper consideration of it by the medical statistics community. We describe the areas of application that multivariate meta-analysis has found, the methods available, the difficulties typically encountered and the arguments for and against the multivariate methods, using four representative but contrasting examples. We conclude that the multivariate methods can be useful, and in particular can provide estimates with better statistical properties, but also that these benefits come at the price of making more assumptions which do not result in better inference in every case. Although there is evidence that multivariate meta-analysis has considerable potential, it must be even more carefully applied than its univariate counterpart in practice. Copyright © 2011 John Wiley & Sons, Ltd. PMID:21268052

  17. Analyzing latent state-trait and multiple-indicator latent growth curve models as multilevel structural equation models

    PubMed Central

    Geiser, Christian; Bishop, Jacob; Lockhart, Ginger; Shiffman, Saul; Grenard, Jerry L.

    2013-01-01

    Latent state-trait (LST) and latent growth curve (LGC) models are frequently used in the analysis of longitudinal data. Although it is well-known that standard single-indicator LGC models can be analyzed within either the structural equation modeling (SEM) or multilevel (ML; hierarchical linear modeling) frameworks, few researchers realize that LST and multivariate LGC models, which use multiple indicators at each time point, can also be specified as ML models. In the present paper, we demonstrate that using the ML-SEM rather than the SL-SEM framework to estimate the parameters of these models can be practical when the study involves (1) a large number of time points, (2) individually-varying times of observation, (3) unequally spaced time intervals, and/or (4) incomplete data. Despite the practical advantages of the ML-SEM approach under these circumstances, there are also some limitations that researchers should consider. We present an application to an ecological momentary assessment study (N = 158 youths with an average of 23.49 observations of positive mood per person) using the software Mplus (Muthén and Muthén, 1998–2012) and discuss advantages and disadvantages of using the ML-SEM approach to estimate the parameters of LST and multiple-indicator LGC models. PMID:24416023

  18. A Novel Continuous Blood Pressure Estimation Approach Based on Data Mining Techniques.

    PubMed

    Miao, Fen; Fu, Nan; Zhang, Yuan-Ting; Ding, Xiao-Rong; Hong, Xi; He, Qingyun; Li, Ye

    2017-11-01

    Continuous blood pressure (BP) estimation using pulse transit time (PTT) is a promising method for unobtrusive BP measurement. However, the accuracy of this approach must be improved for it to be viable for a wide range of applications. This study proposes a novel continuous BP estimation approach that combines data mining techniques with a traditional mechanism-driven model. First, 14 features derived from simultaneous electrocardiogram and photoplethysmogram signals were extracted for beat-to-beat BP estimation. A genetic algorithm-based feature selection method was then used to select BP indicators for each subject. Multivariate linear regression and support vector regression were employed to develop the BP model. The accuracy and robustness of the proposed approach were validated for static, dynamic, and follow-up performance. Experimental results based on 73 subjects showed that the proposed approach exhibited excellent accuracy in static BP estimation, with a correlation coefficient and mean error of 0.852 and -0.001 ± 3.102 mmHg for systolic BP, and 0.790 and -0.004 ± 2.199 mmHg for diastolic BP. Similar performance was observed for dynamic BP estimation. The robustness results indicated that the estimation accuracy was lower by a certain degree one day after model construction but was relatively stable from one day to six months after construction. The proposed approach is superior to the state-of-the-art PTT-based model for an approximately 2-mmHg reduction in the standard derivation at different time intervals, thus providing potentially novel insights for cuffless BP estimation.

  19. Psychosocial risk factors and asthma among adults in Puerto Rico.

    PubMed

    Han, Yueh-Ying; Forno, Erick; Canino, Glorisa; Celedón, Juan C

    2018-05-08

    Asthma and psychosocial stressors are common among Puerto Rican adults living in the United States. We estimated the prevalence of current asthma, and examined potential psychosocial risk factors and current asthma, among adults in Puerto Rico. Cross-sectional study of 3,049 Puerto Ricans aged 18-64 years living in Puerto Rico between May 2014 and June 2016. A structured interview was conducted to obtain information on demographics, lifestyle, mental disorders, and respiratory health. Current asthma was defined as self-reported physician-diagnosed asthma and still having asthma. Multivariable logistic regression was used to examine psychosocial risk factors and current asthma. The estimated prevalence of current asthma was 10.2%. In a multivariable analysis, exposure to violence (odds ratio [OR] for each 1-point increment in a validated scale = 1.14, 95% confidence interval [CI] = 1.07 to 1.21) and a lifetime history of at least one suicide attempt (OR = 3.01, 95% CI = 1.80 to 5.01) were significantly associated with current asthma, independently of major depressive disorder. Moreover, a lifetime history of at least one suicide attempt was associated with co-existing asthma and chronic obstructive pulmonary disease (i.e., asthma-COPD overlap syndrome or ACOS (OR = 9.05, 95% CI = 3.32-24.67). Our findings suggest that asthma is a major health problem among adults in Puerto Rico, with psychosocial risk factors playing a significant role in asthma and ACOS. Addressing chronic stressors and mental illness should be part of comprehensive strategies to reduce asthma burden in this population.

  20. Measles case fatality rate in Bihar, India, 2011-12.

    PubMed

    Murhekar, Manoj V; Ahmad, Mohammad; Shukla, Hemant; Abhishek, Kunwar; Perry, Robert T; Bose, Anindya S; Shimpi, Rahul; Kumar, Arun; Kaliaperumal, Kanagasabai; Sethi, Raman; Selvaraj, Vadivoo; Kamaraj, Pattabi; Routray, Satyabrata; Das, Vidya Nand; Menabde, Nata; Bahl, Sunil

    2014-01-01

    Updated estimates of measles case fatality rates (CFR) are critical for monitoring progress towards measles elimination goals. India accounted for 36% of total measles deaths occurred globally in 2011. We conducted a retrospective cohort study to estimate measles CFR and identify the risk factors for measles death in Bihar-one of the north Indian states historically known for its low vaccination coverage. We systematically selected 16 of the 31 laboratory-confirmed measles outbreaks occurring in Bihar during 1 October 2011 to 30 April 2012. All households of the villages/urban localities affected by these outbreaks were visited to identify measles cases and deaths. We calculated CFR and used multivariate analysis to identify risk factors for measles death. The survey found 3670 measles cases and 28 deaths (CFR: 0.78, 95% confidence interval: 0.47-1.30). CFR was higher among under-five children (1.22%) and children belonging to scheduled castes/tribes (SC/ST, 1.72%). On multivariate analysis, independent risk factors associated with measles death were age <5 years, SC/ST status and non-administration of vitamin A during illness. Outbreaks with longer interval between the occurrence of first case and notification of the outbreak also had a higher rate of deaths. Measles CFR in Bihar was low. To further reduce case fatality, health authorities need to ensure that SC/ST are targeted by the immunization programme and that outbreak investigations target for vitamin A treatment of cases in high risk groups such as SC/ST and young children and ensure regular visits by health-workers in affected villages to administer vitamin A to new cases.

  1. Identifying effective connectivity parameters in simulated fMRI: a direct comparison of switching linear dynamic system, stochastic dynamic causal, and multivariate autoregressive models

    PubMed Central

    Smith, Jason F.; Chen, Kewei; Pillai, Ajay S.; Horwitz, Barry

    2013-01-01

    The number and variety of connectivity estimation methods is likely to continue to grow over the coming decade. Comparisons between methods are necessary to prune this growth to only the most accurate and robust methods. However, the nature of connectivity is elusive with different methods potentially attempting to identify different aspects of connectivity. Commonalities of connectivity definitions across methods upon which base direct comparisons can be difficult to derive. Here, we explicitly define “effective connectivity” using a common set of observation and state equations that are appropriate for three connectivity methods: dynamic causal modeling (DCM), multivariate autoregressive modeling (MAR), and switching linear dynamic systems for fMRI (sLDSf). In addition while deriving this set, we show how many other popular functional and effective connectivity methods are actually simplifications of these equations. We discuss implications of these connections for the practice of using one method to simulate data for another method. After mathematically connecting the three effective connectivity methods, simulated fMRI data with varying numbers of regions and task conditions is generated from the common equation. This simulated data explicitly contains the type of the connectivity that the three models were intended to identify. Each method is applied to the simulated data sets and the accuracy of parameter identification is analyzed. All methods perform above chance levels at identifying correct connectivity parameters. The sLDSf method was superior in parameter estimation accuracy to both DCM and MAR for all types of comparisons. PMID:23717258

  2. Spatio-temporal models of mental processes from fMRI.

    PubMed

    Janoos, Firdaus; Machiraju, Raghu; Singh, Shantanu; Morocz, Istvan Ákos

    2011-07-15

    Understanding the highly complex, spatially distributed and temporally organized phenomena entailed by mental processes using functional MRI is an important research problem in cognitive and clinical neuroscience. Conventional analysis methods focus on the spatial dimension of the data discarding the information about brain function contained in the temporal dimension. This paper presents a fully spatio-temporal multivariate analysis method using a state-space model (SSM) for brain function that yields not only spatial maps of activity but also its temporal structure along with spatially varying estimates of the hemodynamic response. Efficient algorithms for estimating the parameters along with quantitative validations are given. A novel low-dimensional feature-space for representing the data, based on a formal definition of functional similarity, is derived. Quantitative validation of the model and the estimation algorithms is provided with a simulation study. Using a real fMRI study for mental arithmetic, the ability of this neurophysiologically inspired model to represent the spatio-temporal information corresponding to mental processes is demonstrated. Moreover, by comparing the models across multiple subjects, natural patterns in mental processes organized according to different mental abilities are revealed. Copyright © 2011 Elsevier Inc. All rights reserved.

  3. A novel technique for optimal integration of active steering and differential braking with estimation to improve vehicle directional stability.

    PubMed

    Mirzaeinejad, Hossein; Mirzaei, Mehdi; Rafatnia, Sadra

    2018-06-11

    This study deals with the enhancement of directional stability of vehicle which turns with high speeds on various road conditions using integrated active steering and differential braking systems. In this respect, the minimum usage of intentional asymmetric braking force to compensate the drawbacks of active steering control with small reduction of vehicle longitudinal speed is desired. To this aim, a new optimal multivariable controller is analytically developed for integrated steering and braking systems based on the prediction of vehicle nonlinear responses. A fuzzy programming extracted from the nonlinear phase plane analysis is also used for managing the two control inputs in various driving conditions. With the proposed fuzzy programming, the weight factors of the control inputs are automatically tuned and softly changed. In order to simulate a real-world control system, some required information about the system states and parameters which cannot be directly measured, are estimated using the Unscented Kalman Filter (UKF). Finally, simulations studies are carried out using a validated vehicle model to show the effectiveness of the proposed integrated control system in the presence of model uncertainties and estimation errors. Copyright © 2018 ISA. Published by Elsevier Ltd. All rights reserved.

  4. Analysis of Forest Foliage Using a Multivariate Mixture Model

    NASA Technical Reports Server (NTRS)

    Hlavka, C. A.; Peterson, David L.; Johnson, L. F.; Ganapol, B.

    1997-01-01

    Data with wet chemical measurements and near infrared spectra of ground leaf samples were analyzed to test a multivariate regression technique for estimating component spectra which is based on a linear mixture model for absorbance. The resulting unmixed spectra for carbohydrates, lignin, and protein resemble the spectra of extracted plant starches, cellulose, lignin, and protein. The unmixed protein spectrum has prominent absorption spectra at wavelengths which have been associated with nitrogen bonds.

  5. POWERLIB: SAS/IML Software for Computing Power in Multivariate Linear Models

    PubMed Central

    Johnson, Jacqueline L.; Muller, Keith E.; Slaughter, James C.; Gurka, Matthew J.; Gribbin, Matthew J.; Simpson, Sean L.

    2014-01-01

    The POWERLIB SAS/IML software provides convenient power calculations for a wide range of multivariate linear models with Gaussian errors. The software includes the Box, Geisser-Greenhouse, Huynh-Feldt, and uncorrected tests in the “univariate” approach to repeated measures (UNIREP), the Hotelling Lawley Trace, Pillai-Bartlett Trace, and Wilks Lambda tests in “multivariate” approach (MULTIREP), as well as a limited but useful range of mixed models. The familiar univariate linear model with Gaussian errors is an important special case. For estimated covariance, the software provides confidence limits for the resulting estimated power. All power and confidence limits values can be output to a SAS dataset, which can be used to easily produce plots and tables for manuscripts. PMID:25400516

  6. Evaluating the role of admixture in cancer therapy via in vitro drug response and multivariate genome-wide associations

    PubMed Central

    Jack, John; Havener, Tammy M; McLeod, Howard L; Motsinger-Reif, Alison A; Foster, Matthew

    2015-01-01

    Aim: We investigate the role of ethnicity and admixture in drug response across a broad group of chemotherapeutic drugs. Also, we generate hypotheses on the genetic variants driving differential drug response through multivariate genome-wide association studies. Methods: Immortalized lymphoblastoid cell lines from 589 individuals (Hispanic or non-Hispanic/Caucasian) were used to investigate dose-response for 28 chemotherapeutic compounds. Univariate and multivariate statistical models were used to elucidate associations between genetic variants and differential drug response as well as the role of ethnicity in drug potency and efficacy. Results & Conclusion: For many drugs, the variability in drug response appears to correlate with self-reported race and estimates of genetic ancestry. Additionally, multivariate genome-wide association analyses offered interesting hypotheses governing these differential responses. PMID:26314407

  7. Designing a risk-based surveillance program for Mycobacterium avium ssp. paratuberculosis in Norwegian dairy herds using multivariate statistical process control analysis.

    PubMed

    Whist, A C; Liland, K H; Jonsson, M E; Sæbø, S; Sviland, S; Østerås, O; Norström, M; Hopp, P

    2014-11-01

    Surveillance programs for animal diseases are critical to early disease detection and risk estimation and to documenting a population's disease status at a given time. The aim of this study was to describe a risk-based surveillance program for detecting Mycobacterium avium ssp. paratuberculosis (MAP) infection in Norwegian dairy cattle. The included risk factors for detecting MAP were purchase of cattle, combined cattle and goat farming, and location of the cattle farm in counties containing goats with MAP. The risk indicators included production data [culling of animals >3 yr of age, carcass conformation of animals >3 yr of age, milk production decrease in older lactating cows (lactations 3, 4, and 5)], and clinical data (diarrhea, enteritis, or both, in animals >3 yr of age). Except for combined cattle and goat farming and cattle farm location, all data were collected at the cow level and summarized at the herd level. Predefined risk factors and risk indicators were extracted from different national databases and combined in a multivariate statistical process control to obtain a risk assessment for each herd. The ordinary Hotelling's T(2) statistic was applied as a multivariate, standardized measure of difference between the current observed state and the average state of the risk factors for a given herd. To make the analysis more robust and adapt it to the slowly developing nature of MAP, monthly risk calculations were based on data accumulated during a 24-mo period. Monitoring of these variables was performed to identify outliers that may indicate deviance in one or more of the underlying processes. The highest-ranked herds were scattered all over Norway and clustered in high-density dairy cattle farm areas. The resulting rankings of herds are being used in the national surveillance program for MAP in 2014 to increase the sensitivity of the ongoing surveillance program in which 5 fecal samples for bacteriological examination are collected from 25 dairy herds. The use of multivariate statistical process control for selection of herds will be beneficial when a diagnostic test suitable for mass screening is available and validated on the Norwegian cattle population, thus making it possible to increase the number of sampled herds. Copyright © 2014 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.

  8. Simple Penalties on Maximum-Likelihood Estimates of Genetic Parameters to Reduce Sampling Variation

    PubMed Central

    Meyer, Karin

    2016-01-01

    Multivariate estimates of genetic parameters are subject to substantial sampling variation, especially for smaller data sets and more than a few traits. A simple modification of standard, maximum-likelihood procedures for multivariate analyses to estimate genetic covariances is described, which can improve estimates by substantially reducing their sampling variances. This is achieved by maximizing the likelihood subject to a penalty. Borrowing from Bayesian principles, we propose a mild, default penalty—derived assuming a Beta distribution of scale-free functions of the covariance components to be estimated—rather than laboriously attempting to determine the stringency of penalization from the data. An extensive simulation study is presented, demonstrating that such penalties can yield very worthwhile reductions in loss, i.e., the difference from population values, for a wide range of scenarios and without distorting estimates of phenotypic covariances. Moreover, mild default penalties tend not to increase loss in difficult cases and, on average, achieve reductions in loss of similar magnitude to computationally demanding schemes to optimize the degree of penalization. Pertinent details required for the adaptation of standard algorithms to locate the maximum of the likelihood function are outlined. PMID:27317681

  9. What Factors Influence Smoking Prevalence and Smoke Free Policy Enactment across the European Union Member States

    PubMed Central

    Bogdanovica, Ilze; McNeill, Ann; Murray, Rachael; Britton, John

    2011-01-01

    Background Smoking prevention should be a primary public health priority for all governments, and effective preventive policies have been identified for decades. The heterogeneity of smoking prevalence between European Union (EU) Member States therefore reflects, at least in part, a failure by governments to prioritise public health over tobacco industry or possibly other financial interests, and hence potentially government corruption. The aims of this study were to test the hypothesis that smoking prevalence is higher in countries with high levels of public sector corruption, and explore the ecological association between smoking prevalence and a range of other national characteristics in current EU Member States. Methods Ecological data from 27 EU Member States were used to estimate univariate and multivariate correlations between smoking prevalence and the Transparency International Corruption Perceptions Index, and a range of other national characteristics including economic development, social inclusion, quality of life and importance of religion. We also explored the association between the Corruption Perceptions Index and measures of the extent to which smoke-free policies have been enacted and are enforced. Results In univariate analysis, smoking prevalence was significantly higher in countries with higher scores for corruption, material deprivation, and gender inequality; and lower in countries with higher per capita Gross Domestic Product, social spending, life satisfaction and human development scores. In multivariate analysis, only the corruption perception index was independently related to smoking prevalence. Exposure to tobacco smoke in the workplace was also correlated with corruption, independently from smoking prevalence, but not with the measures of national smoke-free policy implementation. Conclusions Corruption appears to be an important risk factor for failure of national tobacco control activity in EU countries, and the extent to which key tobacco control policies have been implemented. Further research is needed to assess the causal relationships involved. PMID:21909375

  10. Genetic ancestry, self-reported race and ethnicity in African Americans and European Americans in the PCaP cohort.

    PubMed

    Sucheston, Lara E; Bensen, Jeannette T; Xu, Zongli; Singh, Prashant K; Preus, Leah; Mohler, James L; Su, L Joseph; Fontham, Elizabeth T H; Ruiz, Bernardo; Smith, Gary J; Taylor, Jack A

    2012-01-01

    Family history and African-American race are important risk factors for both prostate cancer (CaP) incidence and aggressiveness. When studying complex diseases such as CaP that have a heritable component, chances of finding true disease susceptibility alleles can be increased by accounting for genetic ancestry within the population investigated. Race, ethnicity and ancestry were studied in a geographically diverse cohort of men with newly diagnosed CaP. Individual ancestry (IA) was estimated in the population-based North Carolina and Louisiana Prostate Cancer Project (PCaP), a cohort of 2,106 incident CaP cases (2063 with complete ethnicity information) comprising roughly equal numbers of research subjects reporting as Black/African American (AA) or European American/Caucasian/Caucasian American/White (EA) from North Carolina or Louisiana. Mean genome wide individual ancestry estimates of percent African, European and Asian were obtained and tested for differences by state and ethnicity (Cajun and/or Creole and Hispanic/Latino) using multivariate analysis of variance models. Principal components (PC) were compared to assess differences in genetic composition by self-reported race and ethnicity between and within states. Mean individual ancestries differed by state for self-reporting AA (p = 0.03) and EA (p = 0.001). This geographic difference attenuated for AAs who answered "no" to all ethnicity membership questions (non-ethnic research subjects; p = 0.78) but not EA research subjects, p = 0.002. Mean ancestry estimates of self-identified AA Louisiana research subjects for each ethnic group; Cajun only, Creole only and both Cajun and Creole differed significantly from self-identified non-ethnic AA Louisiana research subjects. These ethnicity differences were not seen in those who self-identified as EA. Mean IA differed by race between states, elucidating a potential contributing factor to these differences in AA research participants: self-reported ethnicity. Accurately accounting for genetic admixture in this cohort is essential for future analyses of the genetic and environmental contributions to CaP.

  11. Be the Volume: A Classroom Activity to Visualize Volume Estimation

    ERIC Educational Resources Information Center

    Mikhaylov, Jessica

    2011-01-01

    A hands-on activity can help multivariable calculus students visualize surfaces and understand volume estimation. This activity can be extended to include the concepts of Fubini's Theorem and the visualization of the curves resulting from cross-sections of the surface. This activity uses students as pillars and a sheet or tablecloth for the…

  12. On measuring bird habitat: influence of observer variability and sample size

    Treesearch

    William M. Block; Kimberly A. With; Michael L. Morrison

    1987-01-01

    We studied the effects of observer variability when estimating vegetation characteristics at 75 0.04-ha bird plots. Observer estimates were significantly different for 31 of 49 variables. Multivariate analyses showed significant interobserver differences for five of the seven classes of variables studied. Variable classes included the height, number, and diameter of...

  13. ASCAL: A Microcomputer Program for Estimating Logistic IRT Item Parameters.

    ERIC Educational Resources Information Center

    Vale, C. David; Gialluca, Kathleen A.

    ASCAL is a microcomputer-based program for calibrating items according to the three-parameter logistic model of item response theory. It uses a modified multivariate Newton-Raphson procedure for estimating item parameters. This study evaluated this procedure using Monte Carlo Simulation Techniques. The current version of ASCAL was then compared to…

  14. Estimation and Control for Linear Systems with Additive Cauchy Noise

    DTIC Science & Technology

    2013-12-17

    man & Hall, New York, 1994. [11] J. L. Speyer and W. H. Chung, Stochastic Processes, Estimation, and Control, SIAM, 2008. [12] Nassim N. Taleb ...Gaussian control algorithms. 18 4 References [1] N. N. Taleb . The Black Swan: The Impact of the Highly Improbable...the multivariable system. The estimator was then evaluated numerically for a third-order example. REFERENCES [1] N. N. Taleb , The Black Swan: The

  15. Physician capability to electronically exchange clinical information, 2011.

    PubMed

    Patel, Vaishali; Swain, Matthew J; King, Jennifer; Furukawa, Michael F

    2013-10-01

    To provide national estimates of physician capability to electronically share clinical information with other providers and to describe variation in exchange capability across states and electronic health record (EHR) vendors using the 2011 National Ambulatory Medical Care Survey Electronic Medical Record Supplement. Survey of a nationally representative sample of nonfederal office-based physicians who provide direct patient care. The survey was administered by mail with telephone follow-up and had a 61% weighted response rate. The overall sample consisted of 4326 respondents. We calculated estimates of electronic exchange capability at the national and state levels, and applied multivariate analyses to examine the association between the capability to exchange different types of clinical information and physician and practice characteristics. In 2011, 55% of physicians had computerized capability to send prescriptions electronically; 67% had the capability to view lab results electronically; 42% were able to incorporate lab results into their EHR; 35% were able to send lab orders electronically; and, 31% exchanged patient clinical summaries with other providers. The strongest predictor of exchange capability is adoption of an EHR. However, substantial variation exists across geography and EHR vendors in exchange capability, especially electronic exchange of clinical summaries. In 2011, a majority of office-based physicians could exchange lab and medication data, and approximately one-third could exchange clinical summaries with patients or other providers. EHRs serve as a key mechanism by which physicians can exchange clinical data, though physicians' capability to exchange varies by vendor and by state.

  16. Permutation Tests of Hierarchical Cluster Analyses of Carrion Communities and Their Potential Use in Forensic Entomology.

    PubMed

    van der Ham, Joris L

    2016-05-19

    Forensic entomologists can use carrion communities' ecological succession data to estimate the postmortem interval (PMI). Permutation tests of hierarchical cluster analyses of these data provide a conceptual method to estimate part of the PMI, the post-colonization interval (post-CI). This multivariate approach produces a baseline of statistically distinct clusters that reflect changes in the carrion community composition during the decomposition process. Carrion community samples of unknown post-CIs are compared with these baseline clusters to estimate the post-CI. In this short communication, I use data from previously published studies to demonstrate the conceptual feasibility of this multivariate approach. Analyses of these data produce series of significantly distinct clusters, which represent carrion communities during 1- to 20-day periods of the decomposition process. For 33 carrion community samples, collected over an 11-day period, this approach correctly estimated the post-CI within an average range of 3.1 days. © The Authors 2016. Published by Oxford University Press on behalf of Entomological Society of America. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  17. Estimating the Risk of Tropical Cyclone Characteristics Along the United States Gulf of Mexico Coastline Using Different Statistical Approaches

    NASA Astrophysics Data System (ADS)

    Trepanier, J. C.; Ellis, K.; Jagger, T.; Needham, H.; Yuan, J.

    2017-12-01

    Tropical cyclones, with their high wind speeds, high rainfall totals and deep storm surges, frequently strike the United States Gulf of Mexico coastline influencing millions of people and disrupting off shore economic activities. Events, such as Hurricane Katrina in 2005 and Hurricane Isaac in 2012, can be physically different but still provide detrimental effects due to their locations of influence. There are a wide variety of ways to estimate the risk of occurrence of extreme tropical cyclones. Here, the combined risk of tropical cyclone storm surge and nearshore wind speed using a statistical copula is provided for 22 Gulf of Mexico coastal cities. Of the cities considered, Bay St. Louis, Mississippi has the shortest return period for a tropical cyclone with at least a 50 m s-1 nearshore wind speed and a three meter surge (19.5 years, 17.1-23.5). Additionally, a multivariate regression model is provided estimating the compound effects of tropical cyclone tracks, landfall central pressure, the amount of accumulated precipitation, and storm surge for five locations around Lake Pontchartrain in Louisiana. It is shown the most intense tropical cyclones typically approach from the south and a small change in the amount of rainfall or landfall central pressure leads to a large change in the final storm surge depth. Data are used from the National Hurricane Center, U-Surge, SURGEDAT, and Cooperative Observer Program. The differences in the two statistical approaches are discussed, along with the advantages and limitations to each. The goal of combining the results of the two studies is to gain a better understanding of the most appropriate risk estimation technique for a given area.

  18. Different shades of default mode disturbance in schizophrenia: Subnodal covariance estimation in structure and function.

    PubMed

    Lefort-Besnard, Jérémy; Bassett, Danielle S; Smallwood, Jonathan; Margulies, Daniel S; Derntl, Birgit; Gruber, Oliver; Aleman, Andre; Jardri, Renaud; Varoquaux, Gaël; Thirion, Bertrand; Eickhoff, Simon B; Bzdok, Danilo

    2018-02-01

    Schizophrenia is a devastating mental disease with an apparent disruption in the highly associative default mode network (DMN). Interplay between this canonical network and others probably contributes to goal-directed behavior so its disturbance is a candidate neural fingerprint underlying schizophrenia psychopathology. Previous research has reported both hyperconnectivity and hypoconnectivity within the DMN, and both increased and decreased DMN coupling with the multimodal saliency network (SN) and dorsal attention network (DAN). This study systematically revisited network disruption in patients with schizophrenia using data-derived network atlases and multivariate pattern-learning algorithms in a multisite dataset (n = 325). Resting-state fluctuations in unconstrained brain states were used to estimate functional connectivity, and local volume differences between individuals were used to estimate structural co-occurrence within and between the DMN, SN, and DAN. In brain structure and function, sparse inverse covariance estimates of network coupling were used to characterize healthy participants and patients with schizophrenia, and to identify statistically significant group differences. Evidence did not confirm that the backbone of the DMN was the primary driver of brain dysfunction in schizophrenia. Instead, functional and structural aberrations were frequently located outside of the DMN core, such as in the anterior temporoparietal junction and precuneus. Additionally, functional covariation analyses highlighted dysfunctional DMN-DAN coupling, while structural covariation results highlighted aberrant DMN-SN coupling. Our findings reframe the role of the DMN core and its relation to canonical networks in schizophrenia. We thus underline the importance of large-scale neural interactions as effective biomarkers and indicators of how to tailor psychiatric care to single patients. © 2017 Wiley Periodicals, Inc.

  19. Estimates of complications of medical care in the adult US population.

    PubMed

    Villanueva, E V; Anderson, J N

    2001-01-01

    Total US population estimates of complications of medical care have relied on extrapolations of state-specific estimates. Generalizability is suspect because findings are limited by geographical location or time. We describe the relationship between the annual prevalence of complications of medical care (CM) and socio-demographic characteristics in the adult US population. We used data from the National Health Interview Surveys, annual nationwide surveys of the resident, civilian, noninstitutionalized population of the United States. The main outcome of interest was self-reported conditions from CMs (ICD-9 996-999) and activity limitations that arise from such events. Univariate estimates and multivariably adjusted models accounting for selected socio-demographic characteristics and health status were derived. A total of 618,167 reports of conditions from 313,438 subjects 18 years and older from 1987 to 1994 were examined. In 1987, 830,386 adults reported complications of medical care, increasing by about 40% to 1,174,089 adults in 1994. Based on an extrapolation to the US adult population, rates increased by 25% from 558 to 678 per 100,000 during the same period. One-third reported onset a year prior to the interview; two-thirds visited a doctor six months prior; half experienced limitation in major activities; a quarter reported limitation in personal care activities. In the two weeks preceding the interview, complications of medical care caused an average of 1.72 days of restricted activity, 0.79 days spent in bed, and 0.58 days of work lost. Race modified the age-specific risk of these complications. Complications of medical care impose heavier morbidity than previously considered with some indication that socio-demographic variables modify the risk for injuries.

  20. Deconvolution of mixing time series on a graph

    PubMed Central

    Blocker, Alexander W.; Airoldi, Edoardo M.

    2013-01-01

    In many applications we are interested in making inference on latent time series from indirect measurements, which are often low-dimensional projections resulting from mixing or aggregation. Positron emission tomography, super-resolution, and network traffic monitoring are some examples. Inference in such settings requires solving a sequence of ill-posed inverse problems, yt = Axt, where the projection mechanism provides information on A. We consider problems in which A specifies mixing on a graph of times series that are bursty and sparse. We develop a multilevel state-space model for mixing times series and an efficient approach to inference. A simple model is used to calibrate regularization parameters that lead to efficient inference in the multilevel state-space model. We apply this method to the problem of estimating point-to-point traffic flows on a network from aggregate measurements. Our solution outperforms existing methods for this problem, and our two-stage approach suggests an efficient inference strategy for multilevel models of multivariate time series. PMID:25309135

  1. Individual- and Structural-Level Risk Factors for Suicide Attempts Among Transgender Adults.

    PubMed

    Perez-Brumer, Amaya; Hatzenbuehler, Mark L; Oldenburg, Catherine E; Bockting, Walter

    2015-01-01

    This study assessed individual (ie, internalized transphobia) and structural forms of stigma as risk factors for suicide attempts among transgender adults. Internalized transphobia was assessed through a 26-item scale including four dimensions: pride, passing, alienation, and shame. State-level structural stigma was operationalized as a composite index, including density of same-sex couples; proportion of Gay-Straight Alliances per public high school; 5 policies related to sexual orientation discrimination; and aggregated public opinion toward homosexuality. Multivariable logistic generalized estimating equation models assessed associations of interest among an online sample of transgender adults (N = 1,229) representing 48 states and the District of Columbia. Lower levels of structural stigma were associated with fewer lifetime suicide attempts (AOR 0.96, 95% CI 0.92-0.997), and a higher score on the internalized transphobia scale was associated with greater lifetime suicide attempts (AOR 1.18, 95% CI 1.04-1.33). Addressing stigma at multiple levels is necessary to reduce the vulnerability of suicide attempts among transgender adults.

  2. Remote sensing estimation of the total phosphorus concentration in a large lake using band combinations and regional multivariate statistical modeling techniques.

    PubMed

    Gao, Yongnian; Gao, Junfeng; Yin, Hongbin; Liu, Chuansheng; Xia, Ting; Wang, Jing; Huang, Qi

    2015-03-15

    Remote sensing has been widely used for ater quality monitoring, but most of these monitoring studies have only focused on a few water quality variables, such as chlorophyll-a, turbidity, and total suspended solids, which have typically been considered optically active variables. Remote sensing presents a challenge in estimating the phosphorus concentration in water. The total phosphorus (TP) in lakes has been estimated from remotely sensed observations, primarily using the simple individual band ratio or their natural logarithm and the statistical regression method based on the field TP data and the spectral reflectance. In this study, we investigated the possibility of establishing a spatial modeling scheme to estimate the TP concentration of a large lake from multi-spectral satellite imagery using band combinations and regional multivariate statistical modeling techniques, and we tested the applicability of the spatial modeling scheme. The results showed that HJ-1A CCD multi-spectral satellite imagery can be used to estimate the TP concentration in a lake. The correlation and regression analysis showed a highly significant positive relationship between the TP concentration and certain remotely sensed combination variables. The proposed modeling scheme had a higher accuracy for the TP concentration estimation in the large lake compared with the traditional individual band ratio method and the whole-lake scale regression-modeling scheme. The TP concentration values showed a clear spatial variability and were high in western Lake Chaohu and relatively low in eastern Lake Chaohu. The northernmost portion, the northeastern coastal zone and the southeastern portion of western Lake Chaohu had the highest TP concentrations, and the other regions had the lowest TP concentration values, except for the coastal zone of eastern Lake Chaohu. These results strongly suggested that the proposed modeling scheme, i.e., the band combinations and the regional multivariate statistical modeling techniques, demonstrated advantages for estimating the TP concentration in a large lake and had a strong potential for universal application for the TP concentration estimation in large lake waters worldwide. Copyright © 2014 Elsevier Ltd. All rights reserved.

  3. Multivariate missing data in hydrology - Review and applications

    NASA Astrophysics Data System (ADS)

    Ben Aissia, Mohamed-Aymen; Chebana, Fateh; Ouarda, Taha B. M. J.

    2017-12-01

    Water resources planning and management require complete data sets of a number of hydrological variables, such as flood peaks and volumes. However, hydrologists are often faced with the problem of missing data (MD) in hydrological databases. Several methods are used to deal with the imputation of MD. During the last decade, multivariate approaches have gained popularity in the field of hydrology, especially in hydrological frequency analysis (HFA). However, treating the MD remains neglected in the multivariate HFA literature whereas the focus has been mainly on the modeling component. For a complete analysis and in order to optimize the use of data, MD should also be treated in the multivariate setting prior to modeling and inference. Imputation of MD in the multivariate hydrological framework can have direct implications on the quality of the estimation. Indeed, the dependence between the series represents important additional information that can be included in the imputation process. The objective of the present paper is to highlight the importance of treating MD in multivariate hydrological frequency analysis by reviewing and applying multivariate imputation methods and by comparing univariate and multivariate imputation methods. An application is carried out for multiple flood attributes on three sites in order to evaluate the performance of the different methods based on the leave-one-out procedure. The results indicate that, the performance of imputation methods can be improved by adopting the multivariate setting, compared to mean substitution and interpolation methods, especially when using the copula-based approach.

  4. Impact of the population at risk of diabetes on projections of diabetes burden in the United States: an epidemic on the way

    PubMed Central

    Baker, R.; Koopman, R. J.; Saxena, S.; Diaz, V. A.; Everett, C. J.; Majeed, A.

    2006-01-01

    Aims/hypothesis The aim of this study was to make projections of the future diabetes burden for the adult US population based in part on the prevalence of individuals at high risk of developing diabetes. Materials and methods Models were created from data in the nationally representative National Health and Nutrition Examination Survey (NHANES) II mortality survey (1976–1992), the NHANES III (1988–1994) and the NHANES 1999–2002. Population models for adults (>20 years of age) from NHANES III data were fitted to known diabetes prevalence in the NHANES 1999–2002 before making future projections. We used a multivariable diabetes risk score to estimate the likelihood of diabetes incidence in 10 years. Estimates of future diabetes (diagnosed and undiagnosed) prevalence in 2011, 2021, and 2031 were made under several assumptions. Results Based on the multivariable diabetes risk score, the number of adults at high risk of diabetes was 38.4 million in 1991 and 49.9 million in 2001. The total diabetes burden is anticipated to be 11.5% (25.4 million) in 2011, 13.5% (32.6 million) in 2021, and 14.5% (37.7 million) in 2031. Among individuals aged 30 to 39 years old who are not currently targeted for screening according to age, the prevalence of diabetes is expected to rise from 3.7% in 2001 to 5.2% in 2031. By 2031, 20.2% of adult Hispanic individuals are expected to have diabetes. Conclusions/interpretation The prevalence of diabetes is projected to rise to substantially greater levels than previously estimated. Diabetes prevalence within the Hispanic community is projected to be potentially overwhelming. Electronic supplementary material Supplementary material is available in the online version of this article at http://dx.doi.org/10.1007/s00125-006-0528-5 and is accessible to authorized users. PMID:17119914

  5. Preeclampsia and Long-term Renal Function in Women Who Underwent Kidney Transplantation.

    PubMed

    Vannevel, Valerie; Claes, Kathleen; Baud, David; Vial, Yvan; Golshayan, Delaviz; Yoon, Eugene W; Hodges, Ryan; Le Nepveu, Anne; Kerr, Peter G; Kennedy, Claire; Higgins, Mary; Resch, Elisabeth; Klaritsch, Philipp; Van Mieghem, Tim

    2018-01-01

    Preeclampsia often complicates pregnancies after maternal kidney transplantation. We aimed to assess whether preeclampsia is associated with kidney function decline either during the pregnancy or in the long term. We performed an international multicenter retrospective cohort study. Renal function at conception, pregnancy outcomes, and short- and long-term graft outcomes were collected for women who were pregnant after renal transplantation and had transplant and obstetric care at the participating centers. In women who had multiple pregnancies during the study period, only the last pregnancy was included. Univariate and multivariable analyses were performed. We retrieved pregnancy outcomes and long-term renal outcomes for 52 women. Chronic hypertension was present at baseline in 27%. Mean estimated glomerular filtration rate (GFR) at start of pregnancy was 52.4±17.5 mL/min/1.73 m. Mean estimated GFR at delivery was 47.6±21.6 mL/min/1.73 m, which was significantly lower than at conception (P=.03). Twenty women (38%) developed preeclampsia. In multivariable analysis, women who developed preeclampsia had a 10.7-mL/min/1.73 m higher drop in estimated GFR between conception and delivery than women who did not develop preeclampsia (P=.02). Long-term estimated GFR follow-up was obtained at a median of 5.8 years (range 1.3-27.5 years). Mean estimated GFR at last follow-up was 38±23 mL/kg/1.73 m. Seventeen women (33%) experienced graft loss over the follow-up period. Incidence of graft loss was similar in women with and without preeclampsia in their last pregnancy (30% and 34%, respectively; P=.99). In multivariable analysis, the decrease in estimated GFR between conception and last follow-up was similar in women who experienced preeclampsia during pregnancy and those who did not (difference -2.69 mL/min/1.73 m, P=.65). Preeclampsia commonly complicates pregnancies after renal transplantation but is not associated with long-term renal dysfunction or graft loss.

  6. A mixed model for the relationship between climate and human cranial form.

    PubMed

    Katz, David C; Grote, Mark N; Weaver, Timothy D

    2016-08-01

    We expand upon a multivariate mixed model from quantitative genetics in order to estimate the magnitude of climate effects in a global sample of recent human crania. In humans, genetic distances are correlated with distances based on cranial form, suggesting that population structure influences both genetic and quantitative trait variation. Studies controlling for this structure have demonstrated significant underlying associations of cranial distances with ecological distances derived from climate variables. However, to assess the biological importance of an ecological predictor, estimates of effect size and uncertainty in the original units of measurement are clearly preferable to significance claims based on units of distance. Unfortunately, the magnitudes of ecological effects are difficult to obtain with distance-based methods, while models that produce estimates of effect size generally do not scale to high-dimensional data like cranial shape and form. Using recent innovations that extend quantitative genetics mixed models to highly multivariate observations, we estimate morphological effects associated with a climate predictor for a subset of the Howells craniometric dataset. Several measurements, particularly those associated with cranial vault breadth, show a substantial linear association with climate, and the multivariate model incorporating a climate predictor is preferred in model comparison. Previous studies demonstrated the existence of a relationship between climate and cranial form. The mixed model quantifies this relationship concretely. Evolutionary questions that require population structure and phylogeny to be disentangled from potential drivers of selection may be particularly well addressed by mixed models. Am J Phys Anthropol 160:593-603, 2016. © 2015 Wiley Periodicals, Inc. © 2015 Wiley Periodicals, Inc.

  7. A statistical framework for neuroimaging data analysis based on mutual information estimated via a gaussian copula.

    PubMed

    Ince, Robin A A; Giordano, Bruno L; Kayser, Christoph; Rousselet, Guillaume A; Gross, Joachim; Schyns, Philippe G

    2017-03-01

    We begin by reviewing the statistical framework of information theory as applicable to neuroimaging data analysis. A major factor hindering wider adoption of this framework in neuroimaging is the difficulty of estimating information theoretic quantities in practice. We present a novel estimation technique that combines the statistical theory of copulas with the closed form solution for the entropy of Gaussian variables. This results in a general, computationally efficient, flexible, and robust multivariate statistical framework that provides effect sizes on a common meaningful scale, allows for unified treatment of discrete, continuous, unidimensional and multidimensional variables, and enables direct comparisons of representations from behavioral and brain responses across any recording modality. We validate the use of this estimate as a statistical test within a neuroimaging context, considering both discrete stimulus classes and continuous stimulus features. We also present examples of analyses facilitated by these developments, including application of multivariate analyses to MEG planar magnetic field gradients, and pairwise temporal interactions in evoked EEG responses. We show the benefit of considering the instantaneous temporal derivative together with the raw values of M/EEG signals as a multivariate response, how we can separately quantify modulations of amplitude and direction for vector quantities, and how we can measure the emergence of novel information over time in evoked responses. Open-source Matlab and Python code implementing the new methods accompanies this article. Hum Brain Mapp 38:1541-1573, 2017. © 2016 Wiley Periodicals, Inc. 2016 The Authors Human Brain Mapping Published by Wiley Periodicals, Inc.

  8. Algorithms for System Identification and Source Location.

    NASA Astrophysics Data System (ADS)

    Nehorai, Arye

    This thesis deals with several topics in least squares estimation and applications to source location. It begins with a derivation of a mapping between Wiener theory and Kalman filtering for nonstationary autoregressive moving average (ARMO) processes. Applying time domain analysis, connections are found between time-varying state space realizations and input-output impulse response by matrix fraction description (MFD). Using these connections, the whitening filters are derived by the two approaches, and the Kalman gain is expressed in terms of Wiener theory. Next, fast estimation algorithms are derived in a unified way as special cases of the Conjugate Direction Method. The fast algorithms included are the block Levinson, fast recursive least squares, ladder (or lattice) and fast Cholesky algorithms. The results give a novel derivation and interpretation for all these methods, which are efficient alternatives to available recursive system identification algorithms. Multivariable identification algorithms are usually designed only for left MFD models. In this work, recursive multivariable identification algorithms are derived for right MFD models with diagonal denominator matrices. The algorithms are of prediction error and model reference type. Convergence analysis results obtained by the Ordinary Differential Equation (ODE) method are presented along with simulations. Sources of energy can be located by estimating time differences of arrival (TDOA's) of waves between the receivers. A new method for TDOA estimation is proposed for multiple unknown ARMA sources and additive correlated receiver noise. The method is based on a formula that uses only the receiver cross-spectra and the source poles. Two algorithms are suggested that allow tradeoffs between computational complexity and accuracy. A new time delay model is derived and used to show the applicability of the methods for non -integer TDOA's. Results from simulations illustrate the performance of the algorithms. The last chapter analyzes the response of exact least squares predictors for enhancement of sinusoids with additive colored noise. Using the matrix inversion lemma and the Christoffel-Darboux formula, the frequency response and amplitude gain of the sinusoids are expressed as functions of the signal and noise characteristics. The results generalize the available white noise case.

  9. Obtaining appropriate interval estimates for age when multiple indicators are used: evaluation of an ad-hoc procedure.

    PubMed

    Fieuws, Steffen; Willems, Guy; Larsen-Tangmose, Sara; Lynnerup, Niels; Boldsen, Jesper; Thevissen, Patrick

    2016-03-01

    When an estimate of age is needed, typically multiple indicators are present as found in skeletal or dental information. There exists a vast literature on approaches to estimate age from such multivariate data. Application of Bayes' rule has been proposed to overcome drawbacks of classical regression models but becomes less trivial as soon as the number of indicators increases. Each of the age indicators can lead to a different point estimate ("the most plausible value for age") and a prediction interval ("the range of possible values"). The major challenge in the combination of multiple indicators is not the calculation of a combined point estimate for age but the construction of an appropriate prediction interval. Ignoring the correlation between the age indicators results in intervals being too small. Boldsen et al. (2002) presented an ad-hoc procedure to construct an approximate confidence interval without the need to model the multivariate correlation structure between the indicators. The aim of the present paper is to bring under attention this pragmatic approach and to evaluate its performance in a practical setting. This is all the more needed since recent publications ignore the need for interval estimation. To illustrate and evaluate the method, Köhler et al. (1995) third molar scores are used to estimate the age in a dataset of 3200 male subjects in the juvenile age range.

  10. Quantitative investigation of inappropriate regression model construction and the importance of medical statistics experts in observational medical research: a cross-sectional study.

    PubMed

    Nojima, Masanori; Tokunaga, Mutsumi; Nagamura, Fumitaka

    2018-05-05

    To investigate under what circumstances inappropriate use of 'multivariate analysis' is likely to occur and to identify the population that needs more support with medical statistics. The frequency of inappropriate regression model construction in multivariate analysis and related factors were investigated in observational medical research publications. The inappropriate algorithm of using only variables that were significant in univariate analysis was estimated to occur at 6.4% (95% CI 4.8% to 8.5%). This was observed in 1.1% of the publications with a medical statistics expert (hereinafter 'expert') as the first author, 3.5% if an expert was included as coauthor and in 12.2% if experts were not involved. In the publications where the number of cases was 50 or less and the study did not include experts, inappropriate algorithm usage was observed with a high proportion of 20.2%. The OR of the involvement of experts for this outcome was 0.28 (95% CI 0.15 to 0.53). A further, nation-level, analysis showed that the involvement of experts and the implementation of unfavourable multivariate analysis are associated at the nation-level analysis (R=-0.652). Based on the results of this study, the benefit of participation of medical statistics experts is obvious. Experts should be involved for proper confounding adjustment and interpretation of statistical models. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2018. All rights reserved. No commercial use is permitted unless otherwise expressly granted.

  11. Using cystoscopy to segment bladder tumors with a multivariate approach in different color spaces.

    PubMed

    Freitas, Nuno R; Vieira, Pedro M; Lima, Estevao; Lima, Carlos S

    2017-07-01

    Nowadays the diagnosis of bladder lesions relies upon cystoscopy examination and depends on the interpreter's experience. State of the art of bladder tumor identification are based on 3D reconstruction, using CT images (Virtual Cystoscopy) or images where the structures are exalted with the use of pigmentation, but none uses white light cystoscopy images. An initial attempt to automatically identify tumoral tissue was already developed by the authors and this paper will develop this idea. Traditional cystoscopy images processing has a huge potential to improve early tumor detection and allows a more effective treatment. In this paper is described a multivariate approach to do segmentation of bladder cystoscopy images, that will be used to automatically detect and improve physician diagnose. Each region can be assumed as a normal distribution with specific parameters, leading to the assumption that the distribution of intensities is a Gaussian Mixture Model (GMM). Region of high grade and low grade tumors, usually appears with higher intensity than normal regions. This paper proposes a Maximum a Posteriori (MAP) approach based on pixel intensities read simultaneously in different color channels from RGB, HSV and CIELab color spaces. The Expectation-Maximization (EM) algorithm is used to estimate the best multivariate GMM parameters. Experimental results show that the proposed method does bladder tumor segmentation into two classes in a more efficient way in RGB even in cases where the tumor shape is not well defined. Results also show that the elimination of component L from CIELab color space does not allow definition of the tumor shape.

  12. Univariate and multivariate skewness and kurtosis for measuring nonnormality: Prevalence, influence and estimation.

    PubMed

    Cain, Meghan K; Zhang, Zhiyong; Yuan, Ke-Hai

    2017-10-01

    Nonnormality of univariate data has been extensively examined previously (Blanca et al., Methodology: European Journal of Research Methods for the Behavioral and Social Sciences, 9(2), 78-84, 2013; Miceeri, Psychological Bulletin, 105(1), 156, 1989). However, less is known of the potential nonnormality of multivariate data although multivariate analysis is commonly used in psychological and educational research. Using univariate and multivariate skewness and kurtosis as measures of nonnormality, this study examined 1,567 univariate distriubtions and 254 multivariate distributions collected from authors of articles published in Psychological Science and the American Education Research Journal. We found that 74 % of univariate distributions and 68 % multivariate distributions deviated from normal distributions. In a simulation study using typical values of skewness and kurtosis that we collected, we found that the resulting type I error rates were 17 % in a t-test and 30 % in a factor analysis under some conditions. Hence, we argue that it is time to routinely report skewness and kurtosis along with other summary statistics such as means and variances. To facilitate future report of skewness and kurtosis, we provide a tutorial on how to compute univariate and multivariate skewness and kurtosis by SAS, SPSS, R and a newly developed Web application.

  13. Multivariate approach in popcorn genotypes using the Ward-MLM strategy: morpho-agronomic analysis and incidence of Fusarium spp.

    PubMed

    Kurosawa, R N F; do Amaral Junior, A T; Silva, F H L; Dos Santos, A; Vivas, M; Kamphorst, S H; Pena, G F

    2017-02-08

    The multivariate analyses are useful tools to estimate the genetic variability between accessions. In the breeding programs, the Ward-Modified Location Model (MLM) multivariate method has been a powerful strategy to quantify variability using quantitative and qualitative variables simultaneously. The present study was proposed in view of the dearth of information about popcorn breeding programs under a multivariate approach using the Ward-MLM methodology. The objective of this study was thus to estimate the genetic diversity among 37 genotypes of popcorn aiming to identify divergent groups associated with morpho-agronomic traits and traits related to resistance to Fusarium spp. To this end, 7 qualitative and 17 quantitative variables were analyzed. The experiment was conducted in 2014, at Universidade Estadual do Norte Fluminense, located in Campos dos Goytacazes, RJ, Brazil. The Ward-MLM strategy allowed the identification of four groups as follows: Group I with 10 genotypes, Group II with 11 genotypes, Group III with 9 genotypes, and Group IV with 7 genotypes. Group IV was distant in relation to the other groups, while groups I, II, and III were near. The crosses between genotypes from the other groups with those of group IV allow an exploitation of heterosis. The Ward-MLM strategy provided an appropriate grouping of genotypes; ear weight, ear diameter, and grain yield were the traits that most contributed to the analysis of genetic diversity.

  14. Enhanced ID Pit Sizing Using Multivariate Regression Algorithm

    NASA Astrophysics Data System (ADS)

    Krzywosz, Kenji

    2007-03-01

    EPRI is funding a program to enhance and improve the reliability of inside diameter (ID) pit sizing for balance-of plant heat exchangers, such as condensers and component cooling water heat exchangers. More traditional approaches to ID pit sizing involve the use of frequency-specific amplitude or phase angles. The enhanced multivariate regression algorithm for ID pit depth sizing incorporates three simultaneous input parameters of frequency, amplitude, and phase angle. A set of calibration data sets consisting of machined pits of various rounded and elongated shapes and depths was acquired in the frequency range of 100 kHz to 1 MHz for stainless steel tubing having nominal wall thickness of 0.028 inch. To add noise to the acquired data set, each test sample was rotated and test data acquired at 3, 6, 9, and 12 o'clock positions. The ID pit depths were estimated using a second order and fourth order regression functions by relying on normalized amplitude and phase angle information from multiple frequencies. Due to unique damage morphology associated with the microbiologically-influenced ID pits, it was necessary to modify the elongated calibration standard-based algorithms by relying on the algorithm developed solely from the destructive sectioning results. This paper presents the use of transformed multivariate regression algorithm to estimate ID pit depths and compare the results with the traditional univariate phase angle analysis. Both estimates were then compared with the destructive sectioning results.

  15. A review of the application of propensity score methods yielded increasing use, advantages in specific settings, but not substantially different estimates compared with conventional multivariable methods

    PubMed Central

    Stürmer, Til; Joshi, Manisha; Glynn, Robert J.; Avorn, Jerry; Rothman, Kenneth J.; Schneeweiss, Sebastian

    2006-01-01

    Objective Propensity score analyses attempt to control for confounding in non-experimental studies by adjusting for the likelihood that a given patient is exposed. Such analyses have been proposed to address confounding by indication, but there is little empirical evidence that they achieve better control than conventional multivariate outcome modeling. Study design and methods Using PubMed and Science Citation Index, we assessed the use of propensity scores over time and critically evaluated studies published through 2003. Results Use of propensity scores increased from a total of 8 papers before 1998 to 71 in 2003. Most of the 177 published studies abstracted assessed medications (N=60) or surgical interventions (N=51), mainly in cardiology and cardiac surgery (N=90). Whether PS methods or conventional outcome models were used to control for confounding had little effect on results in those studies in which such comparison was possible. Only 9 out of 69 studies (13%) had an effect estimate that differed by more than 20% from that obtained with a conventional outcome model in all PS analyses presented. Conclusions Publication of results based on propensity score methods has increased dramatically, but there is little evidence that these methods yield substantially different estimates compared with conventional multivariable methods. PMID:16632131

  16. Genetic association between milk yield, stayability, and mastitis in Holstein cows under tropical conditions.

    PubMed

    Irano, Natalia; Bignardi, Annaiza Braga; El Faro, Lenira; Santana, Mário Luiz; Cardoso, Vera Lúcia; Albuquerque, Lucia Galvão

    2014-03-01

    The objective of this study was to estimate genetic parameters for milk yield, stayability, and the occurrence of clinical mastitis in Holstein cows, as well as studying the genetic relationship between them, in order to provide subsidies for the genetic evaluation of these traits. Records from 5,090 Holstein cows with calving varying from 1991 to 2010, were used in the analysis. Two standard multivariate analyses were carried out, one containing the trait of accumulated 305-day milk yields in the first lactation (MY1), stayability (STAY) until the third lactation, and clinical mastitis (CM), as well as the other traits, considering accumulated 305-day milk yields (Y305), STAY, and CM, including the first three lactations as repeated measures for Y305 and CM. The covariance components were obtained by a Bayesian approach. The heritability estimates obtained by multivariate analysis with MY1 were 0.19, 0.28, and 0.13 for MY1, STAY, and CM, respectively, whereas using the multivariate analysis with the Y305, the estimates were 0.19, 0.31, and 0.14, respectively. The genetic correlations between MY1 and STAY, MY1 and CM, and STAY and CM, respectively, were 0.38, 0.12, and -0.49. The genetic correlations between Y305 and STAY, Y305 and CM, and STAY and CM, respectively, were 0.66, -0.25, and -0.52.

  17. Mental, social, and physical well-being in New Hampshire, Oregon, and Washington, 2010 Behavioral Risk Factor Surveillance System: implications for public health research and practice related to Healthy People 2020 foundation health measures on well-being.

    PubMed

    Kobau, Rosemarie; Bann, Carla; Lewis, Megan; Zack, Matthew M; Boardman, Angela M; Boyd, Renee; Lim, Kim C; Holder, Tommy; Hoff, Anastacia Kl; Luncheon, Cecily; Thompson, William; Horner-Johnson, Willi; Lucas, Richard E

    2013-09-24

    Well-being is now accepted as one of four cross-cutting measures in gauging progress for Healthy People 2020. This shift to population indicators of well-being redresses notions of health that have focused on absence of illness (negative health) as a primary or sufficient indicator of positive functioning. The purpose of this study was to estimate mental, social, and physical well-being in three US states using new measures piloted on the 2010 Behavioral Risk Factor Surveillance Survey System (BRFSS). Baseline estimates were provided for states overall, and within states for demographic subgroups, those with chronic health conditions or disabilities, and those with behavioral risk factors. Ten validated questions designed to assess mental (e.g., satisfaction with life, satisfaction with life domains, happiness), physical (e.g., satisfaction with energy level), and social dimensions (e.g., frequency of social support) of well-being were selected with state input for inclusion on BRFSS. 18,622 individuals responded to the BRFSS surveys administered by New Hampshire (N = 3,139), Oregon (N = 2,289), and Washington (N = 13,194). Multivariate adjusted proportions of positive responses to well-being items were examined. After adjustment for confounders, about 67% of adults in these states had high levels of well-being, including >80% reporting experiencing happiness. Most adults were satisfied with their work, neighborhood, and education, but significant differences were seen in subgroups. Well-being differed by demographic characteristics such as marital status, health behaviors, chronic conditions, and disability status, with those who reported a disability and smokers consistently experiencing the worst well-being. Well-being is accepted as one of four cross-cutting measures in gauging progress for Healthy People 2020. Well-being differs by important sociodemographic factors and health conditions (e.g., age, employment, smoking, disability status). These findings provide baseline estimates for the three states to use in gauging improvements in well-being and can serve as a model for other state-level or national surveillance systems. These findings also assist states in identifying vulnerable subgroups who may benefit from potential interventions such as those in the National Prevention Strategy that focus on enhancing well-being where such disparities exist.

  18. Mental, social, and physical well-being in New Hampshire, Oregon, and Washington, 2010 Behavioral Risk Factor Surveillance System: implications for public health research and practice related to Healthy People 2020 foundation health measures on well-being

    PubMed Central

    2013-01-01

    Background Well-being is now accepted as one of four cross-cutting measures in gauging progress for Healthy People 2020. This shift to population indicators of well-being redresses notions of health that have focused on absence of illness (negative health) as a primary or sufficient indicator of positive functioning. The purpose of this study was to estimate mental, social, and physical well-being in three US states using new measures piloted on the 2010 Behavioral Risk Factor Surveillance Survey System (BRFSS). Baseline estimates were provided for states overall, and within states for demographic subgroups, those with chronic health conditions or disabilities, and those with behavioral risk factors. Methods Ten validated questions designed to assess mental (e.g., satisfaction with life, satisfaction with life domains, happiness), physical (e.g., satisfaction with energy level), and social dimensions (e.g., frequency of social support) of well-being were selected with state input for inclusion on BRFSS. 18,622 individuals responded to the BRFSS surveys administered by New Hampshire (N = 3,139), Oregon (N = 2,289), and Washington (N = 13,194). Multivariate adjusted proportions of positive responses to well-being items were examined. Results After adjustment for confounders, about 67% of adults in these states had high levels of well-being, including >80% reporting experiencing happiness. Most adults were satisfied with their work, neighborhood, and education, but significant differences were seen in subgroups. Well-being differed by demographic characteristics such as marital status, health behaviors, chronic conditions, and disability status, with those who reported a disability and smokers consistently experiencing the worst well-being. Conclusions Well-being is accepted as one of four cross-cutting measures in gauging progress for Healthy People 2020. Well-being differs by important sociodemographic factors and health conditions (e.g., age, employment, smoking, disability status). These findings provide baseline estimates for the three states to use in gauging improvements in well-being and can serve as a model for other state-level or national surveillance systems. These findings also assist states in identifying vulnerable subgroups who may benefit from potential interventions such as those in the National Prevention Strategy that focus on enhancing well-being where such disparities exist. PMID:24063647

  19. A Study of Effects of MultiCollinearity in the Multivariable Analysis

    PubMed Central

    Yoo, Wonsuk; Mayberry, Robert; Bae, Sejong; Singh, Karan; (Peter) He, Qinghua; Lillard, James W.

    2015-01-01

    A multivariable analysis is the most popular approach when investigating associations between risk factors and disease. However, efficiency of multivariable analysis highly depends on correlation structure among predictive variables. When the covariates in the model are not independent one another, collinearity/multicollinearity problems arise in the analysis, which leads to biased estimation. This work aims to perform a simulation study with various scenarios of different collinearity structures to investigate the effects of collinearity under various correlation structures amongst predictive and explanatory variables and to compare these results with existing guidelines to decide harmful collinearity. Three correlation scenarios among predictor variables are considered: (1) bivariate collinear structure as the most simple collinearity case, (2) multivariate collinear structure where an explanatory variable is correlated with two other covariates, (3) a more realistic scenario when an independent variable can be expressed by various functions including the other variables. PMID:25664257

  20. A Study of Effects of MultiCollinearity in the Multivariable Analysis.

    PubMed

    Yoo, Wonsuk; Mayberry, Robert; Bae, Sejong; Singh, Karan; Peter He, Qinghua; Lillard, James W

    2014-10-01

    A multivariable analysis is the most popular approach when investigating associations between risk factors and disease. However, efficiency of multivariable analysis highly depends on correlation structure among predictive variables. When the covariates in the model are not independent one another, collinearity/multicollinearity problems arise in the analysis, which leads to biased estimation. This work aims to perform a simulation study with various scenarios of different collinearity structures to investigate the effects of collinearity under various correlation structures amongst predictive and explanatory variables and to compare these results with existing guidelines to decide harmful collinearity. Three correlation scenarios among predictor variables are considered: (1) bivariate collinear structure as the most simple collinearity case, (2) multivariate collinear structure where an explanatory variable is correlated with two other covariates, (3) a more realistic scenario when an independent variable can be expressed by various functions including the other variables.

  1. Load compensation in a lean burn natural gas vehicle

    NASA Astrophysics Data System (ADS)

    Gangopadhyay, Anupam

    A new multivariable PI tuning technique is developed in this research that is primarily developed for regulation purposes. Design guidelines are developed based on closed-loop stability. The new multivariable design is applied in a natural gas vehicle to combine idle and A/F ratio control loops. This results in better recovery during low idle operation of a vehicle under external step torques. A powertrain model of a natural gas engine is developed and validated for steady-state and transient operation. The nonlinear model has three states: engine speed, intake manifold pressure and fuel fraction in the intake manifold. The model includes the effect of fuel partial pressure in the intake manifold filling and emptying dynamics. Due to the inclusion of fuel fraction as a state, fuel flow rate into the cylinders is also accurately modeled. A linear system identification is performed on the nonlinear model. The linear model structure is predicted analytically from the nonlinear model and the coefficients of the predicted transfer function are shown to be functions of key physical parameters in the plant. Simulations of linear system and model parameter identification is shown to converge to the predicted values of the model coefficients. The multivariable controller developed in this research could be designed in an algebraic fashion once the plant model is known. It is thus possible to implement the multivariable PI design in an adaptive fashion combining the controller with identified plant model on-line. This will result in a self-tuning regulator (STR) type controller where the underlying design criteria is the multivariable tuning technique designed in this research.

  2. Estimating Rates of Motor Vehicle Crashes Using Medical Encounter Data: A Feasibility Study

    DTIC Science & Technology

    2015-11-05

    used to develop more detailed predictive risk models as well as strategies for preventing specific types of MVCs. Systematic Review of Evidence... used to estimate rates of accident-related injuries more generally,9 but not with specific reference to MVCs. For the present report, rates of...precise rate estimates based on person-years rather than active duty strength, (e) multivariable effects of specific risk /protective factors after

  3. Perceptions of the state policy environment and adoption of medications in the treatment of substance use disorders.

    PubMed

    Knudsen, Hannah K; Abraham, Amanda J

    2012-01-01

    Despite growing interest in the use of evidence-based treatment practices for treating substance use disorders, adoption of medications by treatment programs remains modest. Drawing on resource dependence and institutional theory, this study examined the relationships between adoption of medications by treatment programs and their perceptions about the state policy environment. Data were collected through mailed surveys and telephone interviews with 250 administrators of publicly funded substance abuse treatment programs in the United States between 2009 and 2010. Multiple imputation and multivariate logistic regression were used to estimate the associations between perceptions of the state policy environment and the odds of adopting at least one medication for the treatment of substance use disorders. A total of 91 (37%) programs reported having prescribed any medication for treatment of a substance use disorder. Programs were significantly more likely to have adopted at least one medication if they perceived greater support for medications by the Single State Agency. The odds of adoption were significantly greater if the program was aware that at least one medication was included on their state's Medicaid formulary and that state-contract funding permitted the purchase of medications. States may play significant roles in promoting the adoption of medications, but adequate dissemination of information about state policies and priorities may be vital to further adoption. Future research should continue to study the relationships between the adoption of medications for treating substance use disorders and the evolving policy environment.

  4. Kinetics of lisinopril intramolecular cyclization in solid phase monitored by Fourier transform infrared microscopy.

    PubMed

    Widjaja, Effendi; Tan, Wei Jian

    2008-08-01

    The solid-state intramolecular cyclization of lisinopril to diketopiperazine was investigated by in situ Fourier transform infrared (FT-IR) microscopy. Using a controllable heating cell, the isothermal transformation was monitored in situ at 147.5, 150, 152.5, 155, and 157.5 degrees C. The collected time-dependent FT-IR spectra at each isothermal temperature were preprocessed and analyzed using a multivariate chemometric approach. The pure component spectra of the observable component (lisinopril and diketopiperazine) were resolved and their time-dependent relative contributions were also determined. Model-free and various model fitting methods were implemented in the kinetic analysis to estimate the activation energy of the intramolecular cyclization reaction. Arrhenius plots indicate that the activation energy is circa 327 kJ/mol.

  5. Estimating Risk of Natural Gas Portfolios by Using GARCH-EVT-Copula Model.

    PubMed

    Tang, Jiechen; Zhou, Chao; Yuan, Xinyu; Sriboonchitta, Songsak

    2015-01-01

    This paper concentrates on estimating the risk of Title Transfer Facility (TTF) Hub natural gas portfolios by using the GARCH-EVT-copula model. We first use the univariate ARMA-GARCH model to model each natural gas return series. Second, the extreme value distribution (EVT) is fitted to the tails of the residuals to model marginal residual distributions. Third, multivariate Gaussian copula and Student t-copula are employed to describe the natural gas portfolio risk dependence structure. Finally, we simulate N portfolios and estimate value at risk (VaR) and conditional value at risk (CVaR). Our empirical results show that, for an equally weighted portfolio of five natural gases, the VaR and CVaR values obtained from the Student t-copula are larger than those obtained from the Gaussian copula. Moreover, when minimizing the portfolio risk, the optimal natural gas portfolio weights are found to be similar across the multivariate Gaussian copula and Student t-copula and different confidence levels.

  6. Robust Multivariable Estimation of the Relevant Information Coming from a Wheel Speed Sensor and an Accelerometer Embedded in a Car under Performance Tests

    PubMed Central

    Hernandez, Wilmar

    2005-01-01

    In the present paper, in order to estimate the response of both a wheel speed sensor and an accelerometer placed in a car under performance tests, robust and optimal multivariable estimation techniques are used. In this case, the disturbances and noises corrupting the relevant information coming from the sensors' outputs are so dangerous that their negative influence on the electrical systems impoverish the general performance of the car. In short, the solution to this problem is a safety related problem that deserves our full attention. Therefore, in order to diminish the negative effects of the disturbances and noises on the car's electrical and electromechanical systems, an optimum observer is used. The experimental results show a satisfactory improvement in the signal-to-noise ratio of the relevant signals and demonstrate the importance of the fusion of several intelligent sensor design techniques when designing the intelligent sensors that today's cars need.

  7. Integrating Growth Variability of the Ilium, Fifth Lumbar Vertebra, and Clavicle with Multivariate Adaptive Regression Splines Models for Subadult Age Estimation.

    PubMed

    Corron, Louise; Marchal, François; Condemi, Silvana; Telmon, Norbert; Chaumoitre, Kathia; Adalian, Pascal

    2018-05-31

    Subadult age estimation should rely on sampling and statistical protocols capturing development variability for more accurate age estimates. In this perspective, measurements were taken on the fifth lumbar vertebrae and/or clavicles of 534 French males and females aged 0-19 years and the ilia of 244 males and females aged 0-12 years. These variables were fitted in nonparametric multivariate adaptive regression splines (MARS) models with 95% prediction intervals (PIs) of age. The models were tested on two independent samples from Marseille and the Luis Lopes reference collection from Lisbon. Models using ilium width and module, maximum clavicle length, and lateral vertebral body heights were more than 92% accurate. Precision was lower for postpubertal individuals. Integrating punctual nonlinearities of the relationship between age and the variables and dynamic prediction intervals incorporated the normal increase in interindividual growth variability (heteroscedasticity of variance) with age for more biologically accurate predictions. © 2018 American Academy of Forensic Sciences.

  8. Estimating Risk of Natural Gas Portfolios by Using GARCH-EVT-Copula Model

    PubMed Central

    Tang, Jiechen; Zhou, Chao; Yuan, Xinyu; Sriboonchitta, Songsak

    2015-01-01

    This paper concentrates on estimating the risk of Title Transfer Facility (TTF) Hub natural gas portfolios by using the GARCH-EVT-copula model. We first use the univariate ARMA-GARCH model to model each natural gas return series. Second, the extreme value distribution (EVT) is fitted to the tails of the residuals to model marginal residual distributions. Third, multivariate Gaussian copula and Student t-copula are employed to describe the natural gas portfolio risk dependence structure. Finally, we simulate N portfolios and estimate value at risk (VaR) and conditional value at risk (CVaR). Our empirical results show that, for an equally weighted portfolio of five natural gases, the VaR and CVaR values obtained from the Student t-copula are larger than those obtained from the Gaussian copula. Moreover, when minimizing the portfolio risk, the optimal natural gas portfolio weights are found to be similar across the multivariate Gaussian copula and Student t-copula and different confidence levels. PMID:26351652

  9. Relative Performance of Rescaling and Resampling Approaches to Model Chi Square and Parameter Standard Error Estimation in Structural Equation Modeling.

    ERIC Educational Resources Information Center

    Nevitt, Johnathan; Hancock, Gregory R.

    Though common structural equation modeling (SEM) methods are predicated upon the assumption of multivariate normality, applied researchers often find themselves with data clearly violating this assumption and without sufficient sample size to use distribution-free estimation methods. Fortunately, promising alternatives are being integrated into…

  10. Sample Size Calculation for Estimating or Testing a Nonzero Squared Multiple Correlation Coefficient

    ERIC Educational Resources Information Center

    Krishnamoorthy, K.; Xia, Yanping

    2008-01-01

    The problems of hypothesis testing and interval estimation of the squared multiple correlation coefficient of a multivariate normal distribution are considered. It is shown that available one-sided tests are uniformly most powerful, and the one-sided confidence intervals are uniformly most accurate. An exact method of calculating sample size to…

  11. Effect of State-mandated Insurance Coverage on Accrual to Community Cancer Clinical Trials

    PubMed Central

    Ellis, Shellie D.; Carpenter, William R.; Minasian, Lori M.; Weiner, Bryan J.

    2012-01-01

    Thirty-five U.S. states and territories have implemented policies requiring insurers to cover patient care costs in the context of cancer clinical trials; however, evidence of the effectiveness of these policies is limited. This study assesses the impact of state insurance mandates on clinical trial accrual among community-based practices participating in the NCI Community Clinical Oncology Program (CCOP), which enrolls approximately one-third of all NCI cancer trial participants. We analyzed CCOP clinical trial enrollment over 17 years in 37 states, 14 of which implemented coverage policies, using fixed effects least squares regression to estimate the effect of state policies on trial accrual among community providers, controlling for state and CCOP differences in capacity to recruit. Of 91 CCOPs active during this time, 28 were directly affected by coverage mandates. Average recruitment per CCOP between 1991 and 2007 was 95.1 participants per year (SD = 55.8). CCOPs in states with a mandate recruited similar numbers of participants compared to states without a mandate. In multivariable analysis, treatment trial accrual among CCOPs in states that had implemented a coverage mandate, was not statistically different than accrual among CCOPs in states that did not implement a coverage mandate (β = 2.95, p = 0.681). State mandates did not appear to confer a benefit in terms of CCOP clinical trial accrual. State policies vary in strength, which may have diluted their effect on accrual. Nonetheless, policy mandates alone may not have a meaningful impact on participation in clinical trials in these states. PMID:22683991

  12. A hybrid PCA-CART-MARS-based prognostic approach of the remaining useful life for aircraft engines.

    PubMed

    Sánchez Lasheras, Fernando; García Nieto, Paulino José; de Cos Juez, Francisco Javier; Mayo Bayón, Ricardo; González Suárez, Victor Manuel

    2015-03-23

    Prognostics is an engineering discipline that predicts the future health of a system. In this research work, a data-driven approach for prognostics is proposed. Indeed, the present paper describes a data-driven hybrid model for the successful prediction of the remaining useful life of aircraft engines. The approach combines the multivariate adaptive regression splines (MARS) technique with the principal component analysis (PCA), dendrograms and classification and regression trees (CARTs). Elements extracted from sensor signals are used to train this hybrid model, representing different levels of health for aircraft engines. In this way, this hybrid algorithm is used to predict the trends of these elements. Based on this fitting, one can determine the future health state of a system and estimate its remaining useful life (RUL) with accuracy. To evaluate the proposed approach, a test was carried out using aircraft engine signals collected from physical sensors (temperature, pressure, speed, fuel flow, etc.). Simulation results show that the PCA-CART-MARS-based approach can forecast faults long before they occur and can predict the RUL. The proposed hybrid model presents as its main advantage the fact that it does not require information about the previous operation states of the input variables of the engine. The performance of this model was compared with those obtained by other benchmark models (multivariate linear regression and artificial neural networks) also applied in recent years for the modeling of remaining useful life. Therefore, the PCA-CART-MARS-based approach is very promising in the field of prognostics of the RUL for aircraft engines.

  13. A Hybrid PCA-CART-MARS-Based Prognostic Approach of the Remaining Useful Life for Aircraft Engines

    PubMed Central

    Lasheras, Fernando Sánchez; Nieto, Paulino José García; de Cos Juez, Francisco Javier; Bayón, Ricardo Mayo; Suárez, Victor Manuel González

    2015-01-01

    Prognostics is an engineering discipline that predicts the future health of a system. In this research work, a data-driven approach for prognostics is proposed. Indeed, the present paper describes a data-driven hybrid model for the successful prediction of the remaining useful life of aircraft engines. The approach combines the multivariate adaptive regression splines (MARS) technique with the principal component analysis (PCA), dendrograms and classification and regression trees (CARTs). Elements extracted from sensor signals are used to train this hybrid model, representing different levels of health for aircraft engines. In this way, this hybrid algorithm is used to predict the trends of these elements. Based on this fitting, one can determine the future health state of a system and estimate its remaining useful life (RUL) with accuracy. To evaluate the proposed approach, a test was carried out using aircraft engine signals collected from physical sensors (temperature, pressure, speed, fuel flow, etc.). Simulation results show that the PCA-CART-MARS-based approach can forecast faults long before they occur and can predict the RUL. The proposed hybrid model presents as its main advantage the fact that it does not require information about the previous operation states of the input variables of the engine. The performance of this model was compared with those obtained by other benchmark models (multivariate linear regression and artificial neural networks) also applied in recent years for the modeling of remaining useful life. Therefore, the PCA-CART-MARS-based approach is very promising in the field of prognostics of the RUL for aircraft engines. PMID:25806876

  14. Contamination Event Detection with Multivariate Time-Series Data in Agricultural Water Monitoring †

    PubMed Central

    Mao, Yingchi; Qi, Hai; Ping, Ping; Li, Xiaofang

    2017-01-01

    Time series data of multiple water quality parameters are obtained from the water sensor networks deployed in the agricultural water supply network. The accurate and efficient detection and warning of contamination events to prevent pollution from spreading is one of the most important issues when pollution occurs. In order to comprehensively reduce the event detection deviation, a spatial–temporal-based event detection approach with multivariate time-series data for water quality monitoring (M-STED) was proposed. The M-STED approach includes three parts. The first part is that M-STED adopts a Rule K algorithm to select backbone nodes as the nodes in the CDS, and forward the sensed data of multiple water parameters. The second part is to determine the state of each backbone node with back propagation neural network models and the sequential Bayesian analysis in the current timestamp. The third part is to establish a spatial model with Bayesian networks to estimate the state of the backbones in the next timestamp and trace the “outlier” node to its neighborhoods to detect a contamination event. The experimental results indicate that the average detection rate is more than 80% with M-STED and the false detection rate is lower than 9%, respectively. The M-STED approach can improve the rate of detection by about 40% and reduce the false alarm rate by about 45%, compared with the event detection with a single water parameter algorithm, S-STED. Moreover, the proposed M-STED can exhibit better performance in terms of detection delay and scalability. PMID:29207535

  15. Support for Marijuana Legalization and Predictors of Intentions to Use Marijuana More Often in Response to Legalization Among U.S. Young Adults.

    PubMed

    Cohn, Amy M; Johnson, Amanda L; Rose, Shyanika W; Rath, Jessica M; Villanti, Andrea C

    2017-01-28

    As of 2015, more than half of U.S. states have legalized, medicalized, or decriminalized marijuana. This study examined the prevalence and correlates of support for marijuana legalization in a national sample of young adults and the intention to use marijuana more frequently if it were legalized. Data were from Wave 7 (weighted N = 3532) of the Truth Initiative Young Adult Cohort, a national sample of men and women aged 18-34. We assessed demographics, past 30-day substance (alcohol, tobacco, marijuana, other drug use), depression and anxiety, social smoking, marijuana harm perceptions (relative to cigarettes), and state-level marijuana policies as correlates of support for marijuana legalization and intentions to use marijuana more often if it were legalized. Multivariable models of correlates of support for legalization and intentions to use marijuana were conducted separately for the full sample and for nonmarijuana users. Weighted estimates showed that 39% of the full sample and 9% of nonmarijuana users supported marijuana legalization. Multivariable models showed that lower marijuana harm perceptions and lifetime and past 30-day tobacco use were common predictors of support for marijuana legalization and intentions to use marijuana among non-users of marijuana. Conclusions/Importance: Over a third of the sample supported marijuana legalization. Tobacco use and perceptions that marijuana is less harmful than cigarettes were robust risk correlates of support for marijuana legalization and intentions to use more frequently among nonusers. Public health campaigns should target these factors to deter marijuana-related harm in susceptible young adults.

  16. Solution of nonlinear multivariable constrained systems using a gradient projection digital algorithm that is insensitive to the initial state

    NASA Technical Reports Server (NTRS)

    Hargrove, A.

    1982-01-01

    Optimal digital control of nonlinear multivariable constrained systems was studied. The optimal controller in the form of an algorithm was improved and refined by reducing running time and storage requirements. A particularly difficult system of nine nonlinear state variable equations was chosen as a test problem for analyzing and improving the controller. Lengthy analysis, modeling, computing and optimization were accomplished. A remote interactive teletype terminal was installed. Analysis requiring computer usage of short duration was accomplished using Tuskegee's VAX 11/750 system.

  17. Comparing interval estimates for small sample ordinal CFA models

    PubMed Central

    Natesan, Prathiba

    2015-01-01

    Robust maximum likelihood (RML) and asymptotically generalized least squares (AGLS) methods have been recommended for fitting ordinal structural equation models. Studies show that some of these methods underestimate standard errors. However, these studies have not investigated the coverage and bias of interval estimates. An estimate with a reasonable standard error could still be severely biased. This can only be known by systematically investigating the interval estimates. The present study compares Bayesian, RML, and AGLS interval estimates of factor correlations in ordinal confirmatory factor analysis models (CFA) for small sample data. Six sample sizes, 3 factor correlations, and 2 factor score distributions (multivariate normal and multivariate mildly skewed) were studied. Two Bayesian prior specifications, informative and relatively less informative were studied. Undercoverage of confidence intervals and underestimation of standard errors was common in non-Bayesian methods. Underestimated standard errors may lead to inflated Type-I error rates. Non-Bayesian intervals were more positive biased than negatively biased, that is, most intervals that did not contain the true value were greater than the true value. Some non-Bayesian methods had non-converging and inadmissible solutions for small samples and non-normal data. Bayesian empirical standard error estimates for informative and relatively less informative priors were closer to the average standard errors of the estimates. The coverage of Bayesian credibility intervals was closer to what was expected with overcoverage in a few cases. Although some Bayesian credibility intervals were wider, they reflected the nature of statistical uncertainty that comes with the data (e.g., small sample). Bayesian point estimates were also more accurate than non-Bayesian estimates. The results illustrate the importance of analyzing coverage and bias of interval estimates, and how ignoring interval estimates can be misleading. Therefore, editors and policymakers should continue to emphasize the inclusion of interval estimates in research. PMID:26579002

  18. Comparing interval estimates for small sample ordinal CFA models.

    PubMed

    Natesan, Prathiba

    2015-01-01

    Robust maximum likelihood (RML) and asymptotically generalized least squares (AGLS) methods have been recommended for fitting ordinal structural equation models. Studies show that some of these methods underestimate standard errors. However, these studies have not investigated the coverage and bias of interval estimates. An estimate with a reasonable standard error could still be severely biased. This can only be known by systematically investigating the interval estimates. The present study compares Bayesian, RML, and AGLS interval estimates of factor correlations in ordinal confirmatory factor analysis models (CFA) for small sample data. Six sample sizes, 3 factor correlations, and 2 factor score distributions (multivariate normal and multivariate mildly skewed) were studied. Two Bayesian prior specifications, informative and relatively less informative were studied. Undercoverage of confidence intervals and underestimation of standard errors was common in non-Bayesian methods. Underestimated standard errors may lead to inflated Type-I error rates. Non-Bayesian intervals were more positive biased than negatively biased, that is, most intervals that did not contain the true value were greater than the true value. Some non-Bayesian methods had non-converging and inadmissible solutions for small samples and non-normal data. Bayesian empirical standard error estimates for informative and relatively less informative priors were closer to the average standard errors of the estimates. The coverage of Bayesian credibility intervals was closer to what was expected with overcoverage in a few cases. Although some Bayesian credibility intervals were wider, they reflected the nature of statistical uncertainty that comes with the data (e.g., small sample). Bayesian point estimates were also more accurate than non-Bayesian estimates. The results illustrate the importance of analyzing coverage and bias of interval estimates, and how ignoring interval estimates can be misleading. Therefore, editors and policymakers should continue to emphasize the inclusion of interval estimates in research.

  19. Regional trends in short-duration precipitation extremes: a flexible multivariate monotone quantile regression approach

    NASA Astrophysics Data System (ADS)

    Cannon, Alex

    2017-04-01

    Estimating historical trends in short-duration rainfall extremes at regional and local scales is challenging due to low signal-to-noise ratios and the limited availability of homogenized observational data. In addition to being of scientific interest, trends in rainfall extremes are of practical importance, as their presence calls into question the stationarity assumptions that underpin traditional engineering and infrastructure design practice. Even with these fundamental challenges, increasingly complex questions are being asked about time series of extremes. For instance, users may not only want to know whether or not rainfall extremes have changed over time, they may also want information on the modulation of trends by large-scale climate modes or on the nonstationarity of trends (e.g., identifying hiatus periods or periods of accelerating positive trends). Efforts have thus been devoted to the development and application of more robust and powerful statistical estimators for regional and local scale trends. While a standard nonparametric method like the regional Mann-Kendall test, which tests for the presence of monotonic trends (i.e., strictly non-decreasing or non-increasing changes), makes fewer assumptions than parametric methods and pools information from stations within a region, it is not designed to visualize detected trends, include information from covariates, or answer questions about the rate of change in trends. As a remedy, monotone quantile regression (MQR) has been developed as a nonparametric alternative that can be used to estimate a common monotonic trend in extremes at multiple stations. Quantile regression makes efficient use of data by directly estimating conditional quantiles based on information from all rainfall data in a region, i.e., without having to precompute the sample quantiles. The MQR method is also flexible and can be used to visualize and analyze the nonlinearity of the detected trend. However, it is fundamentally a univariate technique, and cannot incorporate information from additional covariates, for example ENSO state or physiographic controls on extreme rainfall within a region. Here, the univariate MQR model is extended to allow the use of multiple covariates. Multivariate monotone quantile regression (MMQR) is based on a single hidden-layer feedforward network with the quantile regression error function and partial monotonicity constraints. The MMQR model is demonstrated via Monte Carlo simulations and the estimation and visualization of regional trends in moderate rainfall extremes based on homogenized sub-daily precipitation data at stations in Canada.

  20. Exploring the Dynamics of Dyadic Interactions via Hierarchical Segmentation

    ERIC Educational Resources Information Center

    Hsieh, Fushing; Ferrer, Emilio; Chen, Shu-Chun; Chow, Sy-Miin

    2010-01-01

    In this article we present an exploratory tool for extracting systematic patterns from multivariate data. The technique, hierarchical segmentation (HS), can be used to group multivariate time series into segments with similar discrete-state recurrence patterns and it is not restricted by the stationarity assumption. We use a simulation study to…

  1. Adaptation and application of multivariate AMBI (M-AMBI) in US coastal waters

    EPA Science Inventory

    The multivariate AMBI (M-AMBI) is an extension of the AZTI Marine Biotic Index (AMBI) that has been used extensively in Europe, but not in the United States. In a previous study, we adapted AMBI for use in US coastal waters (US AMBI), but saw biases in salinity and score distribu...

  2. Mapping CHU9D Utility Scores from the PedsQLTM 4.0 SF-15.

    PubMed

    Mpundu-Kaambwa, Christine; Chen, Gang; Russo, Remo; Stevens, Katherine; Petersen, Karin Dam; Ratcliffe, Julie

    2017-04-01

    The Pediatric Quality of Life Inventory™ 4.0 Short Form 15 Generic Core Scales (hereafter the PedsQL) and the Child Health Utility-9 Dimensions (CHU9D) are two generic instruments designed to measure health-related quality of life in children and adolescents in the general population and paediatric patient groups living with specific health conditions. Although the PedsQL is widely used among paediatric patient populations, presently it is not possible to directly use the scores from the instrument to calculate quality-adjusted life-years (QALYs) for application in economic evaluation because it produces summary scores which are not preference-based. This paper examines different econometric mapping techniques for estimating CHU9D utility scores from the PedsQL for the purpose of calculating QALYs for cost-utility analysis. The PedsQL and the CHU9D were completed by a community sample of 755 Australian adolescents aged 15-17 years. Seven regression models were estimated: ordinary least squares estimator, generalised linear model, robust MM estimator, multivariate factorial polynomial estimator, beta-binomial estimator, finite mixture model and multinomial logistic model. The mean absolute error (MAE) and the mean squared error (MSE) were used to assess predictive ability of the models. The MM estimator with stepwise-selected PedsQL dimension scores as explanatory variables had the best predictive accuracy using MAE and the equivalent beta-binomial model had the best predictive accuracy using MSE. Our mapping algorithm facilitates the estimation of health-state utilities for use within economic evaluations where only PedsQL data is available and is suitable for use in community-based adolescents aged 15-17 years. Applicability of the algorithm in younger populations should be assessed in further research.

  3. Multivariate selection and intersexual genetic constraints in a wild bird population.

    PubMed

    Poissant, J; Morrissey, M B; Gosler, A G; Slate, J; Sheldon, B C

    2016-10-01

    When selection differs between the sexes for traits that are genetically correlated between the sexes, there is potential for the effect of selection in one sex to be altered by indirect selection in the other sex, a situation commonly referred to as intralocus sexual conflict (ISC). While potentially common, ISC has rarely been studied in wild populations. Here, we studied ISC over a set of morphological traits (wing length, tarsus length, bill depth and bill length) in a wild population of great tits (Parus major) from Wytham Woods, UK. Specifically, we quantified the microevolutionary impacts of ISC by combining intra- and intersex additive genetic (co)variances and sex-specific selection estimates in a multivariate framework. Large genetic correlations between homologous male and female traits combined with evidence for sex-specific multivariate survival selection suggested that ISC could play an appreciable role in the evolution of this population. Together, multivariate sex-specific selection and additive genetic (co)variance for the traits considered accounted for additive genetic variance in fitness that was uncorrelated between the sexes (cross-sex genetic correlation = -0.003, 95% CI = -0.83, 0.83). Gender load, defined as the reduction in a population's rate of adaptation due to sex-specific effects, was estimated at 50% (95% CI = 13%, 86%). This study provides novel insights into the evolution of sexual dimorphism in wild populations and illustrates how quantitative genetics and selection analyses can be combined in a multivariate framework to quantify the microevolutionary impacts of ISC. © 2016 European Society For Evolutionary Biology. Journal of Evolutionary Biology © 2016 European Society For Evolutionary Biology.

  4. Assessing Multivariate Constraints to Evolution across Ten Long-Term Avian Studies

    PubMed Central

    Teplitsky, Celine; Tarka, Maja; Møller, Anders P.; Nakagawa, Shinichi; Balbontín, Javier; Burke, Terry A.; Doutrelant, Claire; Gregoire, Arnaud; Hansson, Bengt; Hasselquist, Dennis; Gustafsson, Lars; de Lope, Florentino; Marzal, Alfonso; Mills, James A.; Wheelwright, Nathaniel T.; Yarrall, John W.; Charmantier, Anne

    2014-01-01

    Background In a rapidly changing world, it is of fundamental importance to understand processes constraining or facilitating adaptation through microevolution. As different traits of an organism covary, genetic correlations are expected to affect evolutionary trajectories. However, only limited empirical data are available. Methodology/Principal Findings We investigate the extent to which multivariate constraints affect the rate of adaptation, focusing on four morphological traits often shown to harbour large amounts of genetic variance and considered to be subject to limited evolutionary constraints. Our data set includes unique long-term data for seven bird species and a total of 10 populations. We estimate population-specific matrices of genetic correlations and multivariate selection coefficients to predict evolutionary responses to selection. Using Bayesian methods that facilitate the propagation of errors in estimates, we compare (1) the rate of adaptation based on predicted response to selection when including genetic correlations with predictions from models where these genetic correlations were set to zero and (2) the multivariate evolvability in the direction of current selection to the average evolvability in random directions of the phenotypic space. We show that genetic correlations on average decrease the predicted rate of adaptation by 28%. Multivariate evolvability in the direction of current selection was systematically lower than average evolvability in random directions of space. These significant reductions in the rate of adaptation and reduced evolvability were due to a general nonalignment of selection and genetic variance, notably orthogonality of directional selection with the size axis along which most (60%) of the genetic variance is found. Conclusions These results suggest that genetic correlations can impose significant constraints on the evolution of avian morphology in wild populations. This could have important impacts on evolutionary dynamics and hence population persistence in the face of rapid environmental change. PMID:24608111

  5. Seizure-Onset Mapping Based on Time-Variant Multivariate Functional Connectivity Analysis of High-Dimensional Intracranial EEG: A Kalman Filter Approach.

    PubMed

    Lie, Octavian V; van Mierlo, Pieter

    2017-01-01

    The visual interpretation of intracranial EEG (iEEG) is the standard method used in complex epilepsy surgery cases to map the regions of seizure onset targeted for resection. Still, visual iEEG analysis is labor-intensive and biased due to interpreter dependency. Multivariate parametric functional connectivity measures using adaptive autoregressive (AR) modeling of the iEEG signals based on the Kalman filter algorithm have been used successfully to localize the electrographic seizure onsets. Due to their high computational cost, these methods have been applied to a limited number of iEEG time-series (<60). The aim of this study was to test two Kalman filter implementations, a well-known multivariate adaptive AR model (Arnold et al. 1998) and a simplified, computationally efficient derivation of it, for their potential application to connectivity analysis of high-dimensional (up to 192 channels) iEEG data. When used on simulated seizures together with a multivariate connectivity estimator, the partial directed coherence, the two AR models were compared for their ability to reconstitute the designed seizure signal connections from noisy data. Next, focal seizures from iEEG recordings (73-113 channels) in three patients rendered seizure-free after surgery were mapped with the outdegree, a graph-theory index of outward directed connectivity. Simulation results indicated high levels of mapping accuracy for the two models in the presence of low-to-moderate noise cross-correlation. Accordingly, both AR models correctly mapped the real seizure onset to the resection volume. This study supports the possibility of conducting fully data-driven multivariate connectivity estimations on high-dimensional iEEG datasets using the Kalman filter approach.

  6. Effect of latitude on the rate of change in incidence of Lyme disease in the United States

    PubMed Central

    Tuite, Ashleigh R.; Greer, Amy L.

    2013-01-01

    Background Tick-borne illnesses represent an important class of emerging zoonoses, with climate change projected to increase the geographic range within which tick-borne zoonoses might become endemic. We evaluated the impact of latitude on the rate of change in the incidence of Lyme disease in the United States, using publicly available data. Methods We estimated state-level year-on-year incidence rate ratios (IRRs) for Lyme disease for the period 1993 to 2007 using Poisson regression methods. We evaluated between-state heterogeneity in IRRs using a random-effects meta-analytic approach. We identified state-level characteristics associated with increasing incidence using random-effects meta-regression. Results The incidence of Lyme disease in the US increased by about 80% between 1993 and 2007 (IRR per year 1.049, 95% CI [confidence interval] 1.048 to 1.050). There was marked between-state heterogeneity in the average incidence of Lyme disease, ranging from 0.008 per 100 000 person-years in Colorado to 75 per 100 000 in Connecticut, and significant between-state heterogeneity in temporal trends (p < 0.001). In multivariable meta-regression models, increasing incidence showed a linear association with state latitude and population density. These 2 factors explained 27% of the between-state variation in IRRs. No independent association was identified for other state-level characteristics. Interpretation Lyme disease incidence increased in the US as a whole during the study period, but the changes were not uniform. Marked increases were identified in northern-most states, whereas southern states experienced stable or declining rates of Lyme disease. PMID:25077101

  7. Risk factors for prevalent hepatitis C virus-infection among inmates in a state prison system in Mexico

    PubMed Central

    Mosqueda-Gomez, Juan L.; Macias-Hernandez, Alejando; Sierra-Madero, Juan G.; Ahmed, Saifuddin; Beyrer, Chris

    2017-01-01

    Objectives To estimate the prevalence of HCV-infection and identify associated factors among inmates in the State Prison System of Guanajuato in Mexico (Sep-2011 to Feb-2012). Methods Cross-sectional, observational study in 10 prisons in the State of Guanajuato in Mexico (2011–2012). We offered HCV-testing and applied audio computer-assisted self-interviews to all adults imprisoned in the State Prison System. We used a complex survey analysis to estimate the distribution of variables and its corresponding 95% confidence intervals, taking into consideration the expected cluster effect by common characteristics within prisons. Inverse probability weights were applied to correct potential biased estimates arising from non-participation in accrual activities and non-response rates. We fitted multivariate logistic regression models to identify risk-behaviors associated to HCV-infection. Results We included data of 2,519 participating inmates. Prevalence of HCV-infection was 4.9 (95%CI = 3.6–5.9). Most HCV-infected inmates were male (99%). Before being incarcerated, inmates with HCV-infection were more frequently tattooed, used and injected drugs more frequently, and were more likely to share materials for injecting, when compared with those non-infected. During incarceration, HCV-infected inmates got tattoos and used drugs more often than non-infected, including injecting-drugs and sharing materials. Injecting-drug use (OR = 7.6, 95%CI, 2.5–23.4), sharing materials for injecting-drugs (OR = 19.6, 95%CI, 4.7–81.7) and being tattooed at least once before incarceration (OR = 2.1, 95%CI, 1.1–3.9), but not during incarceration, were independently associated to HCV-infection. Conclusions The prevalence of HCV-infection among inmates in the State of Guanajuato in Mexico is considerably higher than in the general population. The most important risk factors for HCV in this inmate population were injecting-drugs and sharing materials for injections before incarceration. High-risk behaviors during imprisonment are very high particularly among those already infected. HCV diagnostic and treatment services, and harm-reduction programs for incarcerated injecting-drug users in Mexico should be integrated to control the HCV epidemic in Mexico. PMID:28654650

  8. Risk factors for prevalent hepatitis C virus-infection among inmates in a state prison system in Mexico.

    PubMed

    Belaunzarán-Zamudio, Pablo F; Mosqueda-Gomez, Juan L; Macias-Hernandez, Alejando; Sierra-Madero, Juan G; Ahmed, Saifuddin; Beyrer, Chris

    2017-01-01

    To estimate the prevalence of HCV-infection and identify associated factors among inmates in the State Prison System of Guanajuato in Mexico (Sep-2011 to Feb-2012). Cross-sectional, observational study in 10 prisons in the State of Guanajuato in Mexico (2011-2012). We offered HCV-testing and applied audio computer-assisted self-interviews to all adults imprisoned in the State Prison System. We used a complex survey analysis to estimate the distribution of variables and its corresponding 95% confidence intervals, taking into consideration the expected cluster effect by common characteristics within prisons. Inverse probability weights were applied to correct potential biased estimates arising from non-participation in accrual activities and non-response rates. We fitted multivariate logistic regression models to identify risk-behaviors associated to HCV-infection. We included data of 2,519 participating inmates. Prevalence of HCV-infection was 4.9 (95%CI = 3.6-5.9). Most HCV-infected inmates were male (99%). Before being incarcerated, inmates with HCV-infection were more frequently tattooed, used and injected drugs more frequently, and were more likely to share materials for injecting, when compared with those non-infected. During incarceration, HCV-infected inmates got tattoos and used drugs more often than non-infected, including injecting-drugs and sharing materials. Injecting-drug use (OR = 7.6, 95%CI, 2.5-23.4), sharing materials for injecting-drugs (OR = 19.6, 95%CI, 4.7-81.7) and being tattooed at least once before incarceration (OR = 2.1, 95%CI, 1.1-3.9), but not during incarceration, were independently associated to HCV-infection. The prevalence of HCV-infection among inmates in the State of Guanajuato in Mexico is considerably higher than in the general population. The most important risk factors for HCV in this inmate population were injecting-drugs and sharing materials for injections before incarceration. High-risk behaviors during imprisonment are very high particularly among those already infected. HCV diagnostic and treatment services, and harm-reduction programs for incarcerated injecting-drug users in Mexico should be integrated to control the HCV epidemic in Mexico.

  9. A gap-filling model for eddy covariance latent heat flux: Estimating evapotranspiration of a subtropical seasonal evergreen broad-leaved forest as an example

    NASA Astrophysics Data System (ADS)

    Chen, Yi-Ying; Chu, Chia-Ren; Li, Ming-Hsu

    2012-10-01

    SummaryIn this paper we present a semi-parametric multivariate gap-filling model for tower-based measurement of latent heat flux (LE). Two statistical techniques, the principal component analysis (PCA) and a nonlinear interpolation approach were integrated into this LE gap-filling model. The PCA was first used to resolve the multicollinearity relationships among various environmental variables, including radiation, soil moisture deficit, leaf area index, wind speed, etc. Two nonlinear interpolation methods, multiple regressions (MRS) and the K-nearest neighbors (KNNs) were examined with random selected flux gaps for both clear sky and nighttime/cloudy data to incorporate into this LE gap-filling model. Experimental results indicated that the KNN interpolation approach is able to provide consistent LE estimations while MRS presents over estimations during nighttime/cloudy. Rather than using empirical regression parameters, the KNN approach resolves the nonlinear relationship between the gap-filled LE flux and principal components with adaptive K values under different atmospheric states. The developed LE gap-filling model (PCA with KNN) works with a RMSE of 2.4 W m-2 (˜0.09 mm day-1) at a weekly time scale by adding 40% artificial flux gaps into original dataset. Annual evapotranspiration at this study site were estimated at 736 mm (1803 MJ) and 728 mm (1785 MJ) for year 2008 and 2009, respectively.

  10. Greater Perceived Age Discrimination in England than the United States: Results from HRS and ELSA

    PubMed Central

    Zaninotto, Paola; Steptoe, Andrew

    2015-01-01

    Objectives. We examined cross-national differences in perceptions of age discrimination in England and the United States. Under the premise that the United States has had age discrimination legislation in place for considerably longer than England, we hypothesized that perceptions of age discrimination would be lower in the United States. Methods. We analyzed data from two nationally representative studies of aging, the U.S. Health and Retirement Study (n = 4,818) and the English Longitudinal Study of Ageing (n = 7,478). Respondents aged 52 years and older who attributed any experiences of discrimination to their age were treated as cases of perceived age discrimination. We used multivariable logistic regression to estimate the odds ratios of experiencing perceived age discrimination in relation to selected sociodemographic factors. Results. Perceptions of age discrimination were significantly higher in England than the United States, with 34.8% of men and women in England reporting age discrimination compared with 29.1% in the United States. Associations between perceived age discrimination and older age and lower levels of household wealth were observed in both countries, but we found differences between England and the United States in the relationship between perceived age discrimination and education. Discussion. Our study revealed that levels of perceived age discrimination are lower in the United States than England and are less socially patterned. This suggests that differing social and political circumstances in the two countries may have an important role to play. PMID:26224759

  11. Perceptions of the State Policy Environment and Adoption of Medications in the Treatment of Substance Use Disorders

    PubMed Central

    Knudsen, Hannah K.; Abraham, Amanda J.

    2012-01-01

    Objective Despite growing interest in the use of evidence-based treatment practices for treating substance use disorders, adoption of medications by treatment programs remains modest. Drawing on resource dependence and institutional theory, this study examined the relationships between adoption of medications by treatment programs and their perceptions about the state policy environment. Methods Data were collected through mailed surveys and telephone interviews with 250 administrators of publicly funded substance abuse treatment programs in the United States between 2009 and 2010. Multiple imputation and multivariate logistic regression were used to estimate the associations between perceptions of the state policy environment and the odds of adopting at least one medication for the treatment of substance use disorders. Results A total of 91 (37%) programs reported having prescribed any medication for treatment of a substance use disorder. Programs were significantly more likely to have adopted at least one medication if they perceived greater support for medications by the Single State Agency. The odds of adoption were significantly greater if the program was aware that at least one medication was included on their state’s Medicaid formulary and that state-contract funding permitted the purchase of medications. Conclusions States may play significant roles in promoting the adoption of medications, but adequate dissemination of information about state policies and priorities may be vital to further adoption. Future research should continue to study the relationships between the adoption of medications for treating substance use disorders and the evolving policy environment. PMID:22227755

  12. Bias correction in the hierarchical likelihood approach to the analysis of multivariate survival data.

    PubMed

    Jeon, Jihyoun; Hsu, Li; Gorfine, Malka

    2012-07-01

    Frailty models are useful for measuring unobserved heterogeneity in risk of failures across clusters, providing cluster-specific risk prediction. In a frailty model, the latent frailties shared by members within a cluster are assumed to act multiplicatively on the hazard function. In order to obtain parameter and frailty variate estimates, we consider the hierarchical likelihood (H-likelihood) approach (Ha, Lee and Song, 2001. Hierarchical-likelihood approach for frailty models. Biometrika 88, 233-243) in which the latent frailties are treated as "parameters" and estimated jointly with other parameters of interest. We find that the H-likelihood estimators perform well when the censoring rate is low, however, they are substantially biased when the censoring rate is moderate to high. In this paper, we propose a simple and easy-to-implement bias correction method for the H-likelihood estimators under a shared frailty model. We also extend the method to a multivariate frailty model, which incorporates complex dependence structure within clusters. We conduct an extensive simulation study and show that the proposed approach performs very well for censoring rates as high as 80%. We also illustrate the method with a breast cancer data set. Since the H-likelihood is the same as the penalized likelihood function, the proposed bias correction method is also applicable to the penalized likelihood estimators.

  13. Multivariate statistical approach to estimate mixing proportions for unknown end members

    USGS Publications Warehouse

    Valder, Joshua F.; Long, Andrew J.; Davis, Arden D.; Kenner, Scott J.

    2012-01-01

    A multivariate statistical method is presented, which includes principal components analysis (PCA) and an end-member mixing model to estimate unknown end-member hydrochemical compositions and the relative mixing proportions of those end members in mixed waters. PCA, together with the Hotelling T2 statistic and a conceptual model of groundwater flow and mixing, was used in selecting samples that best approximate end members, which then were used as initial values in optimization of the end-member mixing model. This method was tested on controlled datasets (i.e., true values of estimates were known a priori) and found effective in estimating these end members and mixing proportions. The controlled datasets included synthetically generated hydrochemical data, synthetically generated mixing proportions, and laboratory analyses of sample mixtures, which were used in an evaluation of the effectiveness of this method for potential use in actual hydrological settings. For three different scenarios tested, correlation coefficients (R2) for linear regression between the estimated and known values ranged from 0.968 to 0.993 for mixing proportions and from 0.839 to 0.998 for end-member compositions. The method also was applied to field data from a study of end-member mixing in groundwater as a field example and partial method validation.

  14. Probabilistic estimates of drought impacts on agricultural production

    NASA Astrophysics Data System (ADS)

    Madadgar, Shahrbanou; AghaKouchak, Amir; Farahmand, Alireza; Davis, Steven J.

    2017-08-01

    Increases in the severity and frequency of drought in a warming climate may negatively impact agricultural production and food security. Unlike previous studies that have estimated agricultural impacts of climate condition using single-crop yield distributions, we develop a multivariate probabilistic model that uses projected climatic conditions (e.g., precipitation amount or soil moisture) throughout a growing season to estimate the probability distribution of crop yields. We demonstrate the model by an analysis of the historical period 1980-2012, including the Millennium Drought in Australia (2001-2009). We find that precipitation and soil moisture deficit in dry growing seasons reduced the average annual yield of the five largest crops in Australia (wheat, broad beans, canola, lupine, and barley) by 25-45% relative to the wet growing seasons. Our model can thus produce region- and crop-specific agricultural sensitivities to climate conditions and variability. Probabilistic estimates of yield may help decision-makers in government and business to quantitatively assess the vulnerability of agriculture to climate variations. We develop a multivariate probabilistic model that uses precipitation to estimate the probability distribution of crop yields. The proposed model shows how the probability distribution of crop yield changes in response to droughts. During Australia's Millennium Drought precipitation and soil moisture deficit reduced the average annual yield of the five largest crops.

  15. Development of the Complex General Linear Model in the Fourier Domain: Application to fMRI Multiple Input-Output Evoked Responses for Single Subjects

    PubMed Central

    Rio, Daniel E.; Rawlings, Robert R.; Woltz, Lawrence A.; Gilman, Jodi; Hommer, Daniel W.

    2013-01-01

    A linear time-invariant model based on statistical time series analysis in the Fourier domain for single subjects is further developed and applied to functional MRI (fMRI) blood-oxygen level-dependent (BOLD) multivariate data. This methodology was originally developed to analyze multiple stimulus input evoked response BOLD data. However, to analyze clinical data generated using a repeated measures experimental design, the model has been extended to handle multivariate time series data and demonstrated on control and alcoholic subjects taken from data previously analyzed in the temporal domain. Analysis of BOLD data is typically carried out in the time domain where the data has a high temporal correlation. These analyses generally employ parametric models of the hemodynamic response function (HRF) where prewhitening of the data is attempted using autoregressive (AR) models for the noise. However, this data can be analyzed in the Fourier domain. Here, assumptions made on the noise structure are less restrictive, and hypothesis tests can be constructed based on voxel-specific nonparametric estimates of the hemodynamic transfer function (HRF in the Fourier domain). This is especially important for experimental designs involving multiple states (either stimulus or drug induced) that may alter the form of the response function. PMID:23840281

  16. Development of the complex general linear model in the Fourier domain: application to fMRI multiple input-output evoked responses for single subjects.

    PubMed

    Rio, Daniel E; Rawlings, Robert R; Woltz, Lawrence A; Gilman, Jodi; Hommer, Daniel W

    2013-01-01

    A linear time-invariant model based on statistical time series analysis in the Fourier domain for single subjects is further developed and applied to functional MRI (fMRI) blood-oxygen level-dependent (BOLD) multivariate data. This methodology was originally developed to analyze multiple stimulus input evoked response BOLD data. However, to analyze clinical data generated using a repeated measures experimental design, the model has been extended to handle multivariate time series data and demonstrated on control and alcoholic subjects taken from data previously analyzed in the temporal domain. Analysis of BOLD data is typically carried out in the time domain where the data has a high temporal correlation. These analyses generally employ parametric models of the hemodynamic response function (HRF) where prewhitening of the data is attempted using autoregressive (AR) models for the noise. However, this data can be analyzed in the Fourier domain. Here, assumptions made on the noise structure are less restrictive, and hypothesis tests can be constructed based on voxel-specific nonparametric estimates of the hemodynamic transfer function (HRF in the Fourier domain). This is especially important for experimental designs involving multiple states (either stimulus or drug induced) that may alter the form of the response function.

  17. Seroprevalence of Helicobacter pylori in Hispanics living in Puerto Rico: A population-based study.

    PubMed

    González-Pons, María; Soto-Salgado, Marievelisse; Sevilla, Javier; Márquez-Lespier, Juan M; Morgan, Douglas; Pérez, Cynthia M; Cruz-Correa, Marcia

    2018-02-01

    Helicobacter pylori is an important etiologic factor for peptic ulcers and gastric cancer, one of the top ten leading causes of cancer death in Puerto Rico. However, the prevalence of H. pylori infections in this population was previously unknown. The aim of this study was to examine the seroprevalence of H. pylori and its associated risk factors in Puerto Rico. A cross-sectional study was designed using an existing population-based biorepository. Seropositivity was determined using the Premier ™ H. pylori immunoassay. Helicobacter pylori seroprevalence was estimated with 95% confidence using marginal standardization following logistic regression. To assess the risk factors associated with H. pylori seropositivity, a multivariable log-binomial model was fitted to estimate the prevalence ratio (PR) and its 95% confidence interval (95% CI). A total of 528 population-based serum samples were analyzed. The mean age of the study population was 41 ± 12 years, of whom 55.3% were females. The overall seroprevalence of H. pylori was 33.0% (95% CI = 28.3%-38.1%). Increasing age and having <12 years of education were significantly (P < .05) associated with H. pylori seropositivity in the multivariable model; however, residing in counties with low population density reached marginal significance (P = .085). We report that H. pylori infection is common among Hispanics living in Puerto Rico. The H. pylori seroprevalence observed in Puerto Rico is similar to the seroprevalence reported in the overall population of the United States. The association between H. pylori seroprevalence and the risk factors analyzed offers insight into the epidemiology of gastric cancer in Puerto Rico and warrants further investigation. © 2017 The Authors. Helicobacter Published by John Wiley & Sons Ltd.

  18. Measles Case Fatality Rate in Bihar, India, 2011–12

    PubMed Central

    Murhekar, Manoj V.; Ahmad, Mohammad; Shukla, Hemant; Abhishek, Kunwar; Perry, Robert T.; Bose, Anindya S.; Shimpi, Rahul; Kumar, Arun; Kaliaperumal, Kanagasabai; Sethi, Raman; Selvaraj, Vadivoo; Kamaraj, Pattabi; Routray, Satyabrata; Das, Vidya Nand; Menabde, Nata; Bahl, Sunil

    2014-01-01

    Background Updated estimates of measles case fatality rates (CFR) are critical for monitoring progress towards measles elimination goals. India accounted for 36% of total measles deaths occurred globally in 2011. We conducted a retrospective cohort study to estimate measles CFR and identify the risk factors for measles death in Bihar–one of the north Indian states historically known for its low vaccination coverage. Methods We systematically selected 16 of the 31 laboratory-confirmed measles outbreaks occurring in Bihar during 1 October 2011 to 30 April 2012. All households of the villages/urban localities affected by these outbreaks were visited to identify measles cases and deaths. We calculated CFR and used multivariate analysis to identify risk factors for measles death. Results The survey found 3670 measles cases and 28 deaths (CFR: 0.78, 95% confidence interval: 0.47–1.30). CFR was higher among under-five children (1.22%) and children belonging to scheduled castes/tribes (SC/ST, 1.72%). On multivariate analysis, independent risk factors associated with measles death were age <5 years, SC/ST status and non-administration of vitamin A during illness. Outbreaks with longer interval between the occurrence of first case and notification of the outbreak also had a higher rate of deaths. Conclusions Measles CFR in Bihar was low. To further reduce case fatality, health authorities need to ensure that SC/ST are targeted by the immunization programme and that outbreak investigations target for vitamin A treatment of cases in high risk groups such as SC/ST and young children and ensure regular visits by health-workers in affected villages to administer vitamin A to new cases. PMID:24824641

  19. Association between secondhand smoke exposure and hypertension in never smokers: a cross-sectional survey using data from Korean National Health and Nutritional Examination Survey V, 2010-2012.

    PubMed

    Park, Young Sik; Lee, Chang-Hoon; Kim, Yu-Il; Ahn, Chul Min; Kim, Ju Ock; Park, Ju-Heon; Lee, Sang Haak; Kim, Jae Yeol; Chun, Eun Mi; Jung, Tae-Hoon; Yoo, Kwang-Ha

    2018-05-14

    Secondhand smoke (SHS) exposure is associated with cardiovascular disease. This study aims to determine the association between SHS exposure estimated by questionnaire and hypertension in Korean never smokers. Korean National Health and Nutrition Examination Survey (KNHANES) V was conducted from 2010 to 2012. We selected the never smokers aged over 20 years who answered the question about the SHS exposure. SHS exposure in both the home and work place was estimated using a self-reporting questionnaire. We investigated the association between SHS exposure and hypertension by using multivariate analysis. And we evaluated the mean systolic and diastolic blood pressure values according to SHS exposure after adjusting for possible confounding factors. All analyses were stratified by women and men. There were 10 532 (women 8987 and men 1545) never smokers. We divided the subjects into three groups according to the amount of SHS exposure: none-group I, <2 hour/day-group II and ≥2 hour/day-group III. Using multivariate analysis, hypertension was more commonly associated with group III than group I in women (adjusted OR 1.50, 95% CI 1.00 to 2.04, p=0.011). Adjusted mean systolic and diastolic blood pressure values in women who were not taking antihypertensive medication were significantly elevated in group III by 2.3 and 1.7 mm Hg, respectively. SHS exposure is significantly associated with hypertension in women never smokers. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2018. All rights reserved. No commercial use is permitted unless otherwise expressly granted.

  20. Diversification in North American arid lands: niche conservatism, divergence and expansion of habitat explain speciation in the genus Ephedra.

    PubMed

    Loera, Israel; Sosa, Victoria; Ickert-Bond, Stefanie M

    2012-11-01

    A lineage of 12 arid land shrubby species in the gymnosperm genus Ephedra (Gnetales) from North America is used to evaluate the influence of climate on speciation. With a long evolutionary history, and a well documented fossil record this lineage is an ideal model for understanding the process of speciation under a niche conservatism scenario. Using seven DNA molecular markers, Bayesian inference is carried out to uncover sister species and to estimate time of divergence of the lineages. Ecological niche models are generated for four parapatric and sympatric sister species and two analyses of niche evolution are performed, one based on ecological niche models and another using raw data and multivariate analysis. As previous analyses suggest, the diversification of North America Ephedra species may be the result of a recent secondary radiation. Both parapatric and sympatric species diverged mostly in a scenario of climatic niche conservatism. However, we also found strong evidence for niche divergence for one of the sister species pairs (E. californica-E. trifurca). Moreover, the multivariate analysis found environmental differences for some variables between sister species. The estimated divergence time of three pairs of sister species distributed in southwestern North America (E. cutleri-E. aspera, E. californica-E. trifurca and E. torreyana-E. viridis) is inferred to have occurred in the Late Miocene to Pliocene and for the sister species pair E. antisyphilitica-E. coryi distributed in the southern United States and northeastern Mexico, it was inferred from the Pliocene to Pleistocene. The orogenetic and climatic changes documented for these regions related to expansion of arid lands, may have contributed to the diversification in North American Ephedra, rather than adaptations to new climatic conditions. Copyright © 2012 Elsevier Inc. All rights reserved.

  1. Functional connectivity and structural covariance between regions of interest can be measured more accurately using multivariate distance correlation.

    PubMed

    Geerligs, Linda; Cam-Can; Henson, Richard N

    2016-07-15

    Studies of brain-wide functional connectivity or structural covariance typically use measures like the Pearson correlation coefficient, applied to data that have been averaged across voxels within regions of interest (ROIs). However, averaging across voxels may result in biased connectivity estimates when there is inhomogeneity within those ROIs, e.g., sub-regions that exhibit different patterns of functional connectivity or structural covariance. Here, we propose a new measure based on "distance correlation"; a test of multivariate dependence of high dimensional vectors, which allows for both linear and non-linear dependencies. We used simulations to show how distance correlation out-performs Pearson correlation in the face of inhomogeneous ROIs. To evaluate this new measure on real data, we use resting-state fMRI scans and T1 structural scans from 2 sessions on each of 214 participants from the Cambridge Centre for Ageing & Neuroscience (Cam-CAN) project. Pearson correlation and distance correlation showed similar average connectivity patterns, for both functional connectivity and structural covariance. Nevertheless, distance correlation was shown to be 1) more reliable across sessions, 2) more similar across participants, and 3) more robust to different sets of ROIs. Moreover, we found that the similarity between functional connectivity and structural covariance estimates was higher for distance correlation compared to Pearson correlation. We also explored the relative effects of different preprocessing options and motion artefacts on functional connectivity. Because distance correlation is easy to implement and fast to compute, it is a promising alternative to Pearson correlations for investigating ROI-based brain-wide connectivity patterns, for functional as well as structural data. Copyright © 2016 The Authors. Published by Elsevier Inc. All rights reserved.

  2. Gender, position of authority, and the risk of depression and post-traumatic stress disorder among a national sample of U.S. Reserve Component Personnel

    PubMed Central

    Cohen, Gregory H.; Sampson, Laura A.; Fink, David S.; Wang, Jing; Russell, Dale; Gifford, Robert; Fullerton, Carol; Ursano, Robert; Galea, Sandro

    2016-01-01

    BACKGROUND Recent United States military operations in Iraq and Afghanistan have seen dramatic increases in the proportion of women serving, and the breadth of their occupational roles. General population studies suggest that women, compared to men, and persons with lower, as compared to higher, social position may be at greater risk of post-traumatic stress disorder (PTSD) and depression. However, these relations remain unclear in military populations. Accordingly, we aimed to estimate the effects of (1) gender, (2) military authority (i.e., rank) and (3) the interaction of gender and military authority upon: (a) risk of most-recent-deployment-related PTSD, and (b) risk of depression since most-recent-deployment. METHODS Using a nationally representative sample of 1024 previously deployed Reserve Component personnel surveyed in 2010, we constructed multivariable logistic regression models to estimate effects of interest. RESULTS Weighted multivariable logistic regression models demonstrated no statistically significant associations between gender or authority, and either PTSD or depression. Interaction models demonstrated multiplicative statistical interaction between gender and authority for PTSD (beta= −2.37;p=0.01), and depression (beta=-1.21; p=0.057). Predicted probabilities of PTSD and depression, respectively, were lowest in male officers (0.06, 0.09), followed by male enlisted (0.07, 0.14), female enlisted (0.07, 0.15), and female officers (0.30, 0.25). CONCLUSIONS Female officers in the Reserve Component may be at greatest risk for PTSD and depression following deployment, relative to their male and enlisted counterparts, and this relation is not explained by deployment trauma exposure. Future studies may fruitfully examine whether social support, family responsibilities peri-deployment, or contradictory class status may explain these findings. PMID:26899583

  3. Interhemispheric Effective and Functional Cortical Connectivity Signatures of Spina Bifida Are Consistent with Callosal Anomaly

    PubMed Central

    Malekpour, Sheida; Li, Zhimin; Cheung, Bing Leung Patrick; Castillo, Eduardo M.; Papanicolaou, Andrew C.; Kramer, Larry A.; Fletcher, Jack M.

    2012-01-01

    Abstract The impact of the posterior callosal anomalies associated with spina bifida on interhemispheric cortical connectivity is studied using a method for estimating cortical multivariable autoregressive models from scalp magnetoencephalography data. Interhemispheric effective and functional connectivity, measured using conditional Granger causality and coherence, respectively, is determined for the anterior and posterior cortical regions in a population of five spina bifida and five control subjects during a resting eyes-closed state. The estimated connectivity is shown to be consistent over the randomly selected subsets of the data for each subject. The posterior interhemispheric effective and functional connectivity and cortical power are significantly lower in the spina bifida group, a result that is consistent with posterior callosal anomalies. The anterior interhemispheric effective and functional connectivity are elevated in the spina bifida group, a result that may reflect compensatory mechanisms. In contrast, the intrahemispheric effective connectivity is comparable in the two groups. The differences between the spina bifida and control groups are most significant in the θ and α bands. PMID:22571349

  4. Analyzing whether countries are equally efficient at improving longevity for men and women.

    PubMed

    Barthold, Douglas; Nandi, Arijit; Mendoza Rodríguez, José M; Heymann, Jody

    2014-11-01

    We examined the efficiency of country-specific health care spending in improving life expectancies for men and women. We estimated efficiencies of health care spending for 27 Organisation for Economic Co-operation and Development (OECD) countries during the period 1991 to 2007 using multivariable regression models, including country fixed-effects and controlling for time-varying levels of national social expenditures, economic development, and health behaviors. Findings indicated robust differences in health-spending efficiency. A 1% annual increase in health expenditures was associated with percent changes in life expectancy ranging from 0.020 in the United States (95% confidence interval [CI] = 0.008, 0.032) to 0.121 in Germany (95% CI = 0.099, 0.143). Health-spending increases were associated with greater life expectancy improvements for men than for women in nearly every OECD country. This is the first study to our knowledge to estimate the effect of country-specific health expenditures on life expectancies of men and women. Future work understanding the determinants of these differences has the potential to improve the overall efficiency and equity of national health systems.

  5. Electroencephalography signatures of attention-deficit/hyperactivity disorder: clinical utility.

    PubMed

    Alba, Guzmán; Pereda, Ernesto; Mañas, Soledad; Méndez, Leopoldo D; González, Almudena; González, Julián J

    2015-01-01

    The techniques and the most important results on the use of electroencephalography (EEG) to extract different measures are reviewed in this work, which can be clinically useful to study subjects with attention-deficit/hyperactivity disorder (ADHD). First, we discuss briefly and in simple terms the EEG analysis and processing techniques most used in the context of ADHD. We review techniques that both analyze individual EEG channels (univariate measures) and study the statistical interdependence between different EEG channels (multivariate measures), the so-called functional brain connectivity. Among the former ones, we review the classical indices of absolute and relative spectral power and estimations of the complexity of the channels, such as the approximate entropy and the Lempel-Ziv complexity. Among the latter ones, we focus on the magnitude square coherence and on different measures based on the concept of generalized synchronization and its estimation in the state space. Second, from a historical point of view, we present the most important results achieved with these techniques and their clinical utility (sensitivity, specificity, and accuracy) to diagnose ADHD. Finally, we propose future research lines based on these results.

  6. Offline handwritten word recognition using MQDF-HMMs

    NASA Astrophysics Data System (ADS)

    Ramachandrula, Sitaram; Hambarde, Mangesh; Patial, Ajay; Sahoo, Dushyant; Kochar, Shaivi

    2015-01-01

    We propose an improved HMM formulation for offline handwriting recognition (HWR). The main contribution of this work is using modified quadratic discriminant function (MQDF) [1] within HMM framework. In an MQDF-HMM the state observation likelihood is calculated by a weighted combination of MQDF likelihoods of individual Gaussians of GMM (Gaussian Mixture Model). The quadratic discriminant function (QDF) of a multivariate Gaussian can be rewritten by avoiding the inverse of covariance matrix by using the Eigen values and Eigen vectors of it. The MQDF is derived from QDF by substituting few of badly estimated lower-most Eigen values by an appropriate constant. The estimation errors of non-dominant Eigen vectors and Eigen values of covariance matrix for which the training data is insufficient can be controlled by this approach. MQDF has been successfully shown to improve the character recognition performance [1]. The usage of MQDF in HMM improves the computation, storage and modeling power of HMM when there is limited training data. We have got encouraging results on offline handwritten character (NIST database) and word recognition in English using MQDF HMMs.

  7. Examining Factors Associated with Heavy Episodic Drinking Among College Undergraduates

    PubMed Central

    Scholly, Kristen; Katz, Alan R.; Kehl, Lisa

    2014-01-01

    Heavy episodic drinking among college students is a serious health concern. The purpose of this study was to identify factors associated with heavy episodic drinking behaviors amongst a predominately Asian undergraduate college student population in the United States. A survey measuring alcohol use behaviors was completed by a random sample of 18-24 year old undergraduates during April, 2011. A multivariate logistic regression analysis was conducted to determine factors associated with students’ heavy episodic drinking behavior. Independent factors associated with heavy episodic drinking included living on campus, ethnicity, perceived drinking behavior among peers, and a belief that alcohol is a central part of one’s social life. Heavy episodic drinking was also associated with poor academic performance. Campus-wide educational strategies to reduce heavy episodic drinking among college undergraduates should incorporate accurate information regarding alcohol use norms to correct students’ perceived over estimation of their peers alcohol consumption rates and the under estimation of students protective alcohol use behaviors. These efforts should focus in on-campus residence halls where a higher occurrence of heavy episodic drinking is often found. PMID:26973931

  8. Comparison of causality analysis on simultaneously measured fMRI and NIRS signals during motor tasks.

    PubMed

    Anwar, Abdul Rauf; Muthalib, Makii; Perrey, Stephane; Galka, Andreas; Granert, Oliver; Wolff, Stephan; Deuschl, Guenther; Raethjen, Jan; Heute, Ulrich; Muthuraman, Muthuraman

    2013-01-01

    Brain activity can be measured using different modalities. Since most of the modalities tend to complement each other, it seems promising to measure them simultaneously. In to be presented research, the data recorded from Functional Magnetic Resonance Imaging (fMRI) and Near Infrared Spectroscopy (NIRS), simultaneously, are subjected to causality analysis using time-resolved partial directed coherence (tPDC). Time-resolved partial directed coherence uses the principle of state space modelling to estimate Multivariate Autoregressive (MVAR) coefficients. This method is useful to visualize both frequency and time dynamics of causality between the time series. Afterwards, causality results from different modalities are compared by estimating the Spearman correlation. In to be presented study, we used directionality vectors to analyze correlation, rather than actual signal vectors. Results show that causality analysis of the fMRI correlates more closely to causality results of oxy-NIRS as compared to deoxy-NIRS in case of a finger sequencing task. However, in case of simple finger tapping, no clear difference between oxy-fMRI and deoxy-fMRI correlation is identified.

  9. Conditional survival estimates improve over time for patients with advanced melanoma: results from a population-based analysis.

    PubMed

    Xing, Yan; Chang, George J; Hu, Chung-Yuan; Askew, Robert L; Ross, Merrick I; Gershenwald, Jeffrey E; Lee, Jeffrey E; Mansfield, Paul F; Lucci, Anthony; Cormier, Janice N

    2010-05-01

    Conditional survival (CS) has emerged as a clinically relevant measure of prognosis for cancer survivors. The objective of this analysis was to provide melanoma-specific CS estimates to help clinicians promote more informed patient decision making. Patients with melanoma and at least 5 years of follow-up were identified from the Surveillance Epidemiology and End Results registry (1988-2000). By using the methods of Kaplan and Meier, stage-specific, 5-year CS estimates were independently calculated for survivors for each year after diagnosis. Stage-specific multivariate Cox regression models including baseline survivor functions were used to calculate adjusted melanoma-specific CS for different subgroups of patients further stratified by age, gender, race, marital status, anatomic tumor location, and tumor histology. Five-year CS estimates for patients with stage I disease remained constant at 97% annually, while for patients with stages II, III, and IV disease, 5-year CS estimates from time 0 (diagnosis) to 5 years improved from 72% to 86%, 51% to 87%, and 19% to 84%, respectively. Multivariate CS analysis revealed that differences in stages II through IV CS based on age, gender, and race decreased over time. Five-year melanoma-specific CS estimates improve dramatically over time for survivors with advanced stages of disease. These prognostic data are critical to patients for both treatment and nontreatment related life decisions. (c) 2010 American Cancer Society.

  10. Estimating brain connectivity when few data points are available: Perspectives and limitations.

    PubMed

    Antonacci, Yuri; Toppi, Jlenia; Caschera, Stefano; Anzolin, Alessandra; Mattia, Donatella; Astolfi, Laura

    2017-07-01

    Methods based on the use of multivariate autoregressive modeling (MVAR) have proved to be an accurate and flexible tool for the estimation of brain functional connectivity. The multivariate approach, however, implies the use of a model whose complexity (in terms of number of parameters) increases quadratically with the number of signals included in the problem. This can often lead to an underdetermined problem and to the condition of multicollinearity. The aim of this paper is to introduce and test an approach based on Ridge Regression combined with a modified version of the statistics usually adopted for these methods, to broaden the estimation of brain connectivity to those conditions in which current methods fail, due to the lack of enough data points. We tested the performances of this new approach, in comparison with the classical approach based on ordinary least squares (OLS), by means of a simulation study implementing different ground-truth networks, under different network sizes and different levels of data points. Simulation results showed that the new approach provides better performances, in terms of accuracy of the parameters estimation and false positives/false negatives rates, in all conditions related to a low data points/model dimension ratio, and may thus be exploited to estimate and validate estimated patterns at single-trial level or when short time data segments are available.

  11. Analysis models for the estimation of oceanic fields

    NASA Technical Reports Server (NTRS)

    Carter, E. F.; Robinson, A. R.

    1987-01-01

    A general model for statistically optimal estimates is presented for dealing with scalar, vector and multivariate datasets. The method deals with anisotropic fields and treats space and time dependence equivalently. Problems addressed include the analysis, or the production of synoptic time series of regularly gridded fields from irregular and gappy datasets, and the estimate of fields by compositing observations from several different instruments and sampling schemes. Technical issues are discussed, including the convergence of statistical estimates, the choice of representation of the correlations, the influential domain of an observation, and the efficiency of numerical computations.

  12. Self-tuning multivariable pole placement control of a multizone crystal growth furnace

    NASA Technical Reports Server (NTRS)

    Batur, C.; Sharpless, R. B.; Duval, W. M. B.; Rosenthal, B. N.

    1992-01-01

    This paper presents the design and implementation of a multivariable self-tuning temperature controller for the control of lead bromide crystal growth. The crystal grows inside a multizone transparent furnace. There are eight interacting heating zones shaping the axial temperature distribution inside the furnace. A multi-input, multi-output furnace model is identified on-line by a recursive least squares estimation algorithm. A multivariable pole placement controller based on this model is derived and implemented. Comparison between single-input, single-output and multi-input, multi-output self-tuning controllers demonstrates that the zone-to-zone interactions can be minimized better by a multi-input, multi-output controller design. This directly affects the quality of crystal grown.

  13. Estimated Number of Preterm Births and Low Birth Weight Children Born in the United States Due to Maternal Binge Drinking

    PubMed Central

    Truong, Khoa D; Reifsnider, Odette S; Mayorga, Maria E; Spitler, Hugh

    2013-01-01

    Objective To estimate the aggregate burden of maternal binge drinking on preterm birth (PTB) and low birth weight (LBW) across American sociodemographic groups in 2008. Methods A simulation model was developed to estimate the number of PTB and LBW cases due to maternal binge drinking. Data inputs for the model included number of births and rates of preterm and LBW from the National Center for Health Statistics; female population by childbearing age groups from the U.S. Census; increased relative risks of preterm and LBW deliveries due to maternal binge drinking extracted from the literature; and adjusted prevalence of binge drinking among pregnant women estimated in a multivariate logistic regression model using Behavioral Risk Factor Surveillance System survey. Results The most conservative estimates attributed maternal binge drinking to 8,701 (95% CI: 7,804–9,598) PTBs (1.75% of all PTBs) and 5,627 (95% CI 5,121–6,133) LBW deliveries in 2008, with 3,708 (95% CI: 3,375–4,041) cases of both PTB and LBW. The estimated rate of PTB due to maternal binge drinking was 1.57% among all PTBs to White women, 0.69% among Black women, 3.31% among Hispanic women, and 2.35% among other races. Compared to other age groups, women ages 40–44 had the highest adjusted binge drinking rate and highest PTB rate due to maternal binge drinking (4.33%). Conclusion Maternal binge drinking contributed significantly to PTB and LBW differentially across sociodemographic groups. PMID:22711260

  14. A multivariate method for estimating mortality rates among children under 5 years from health and social indicators in Iraq.

    PubMed

    Garfield, R; Leu, C S

    2000-06-01

    Many reports on Iraq suggest that a rise in rates of death and disease have occurred since the Gulf War of January/February 1991 and the economic sanctions that followed it. Four preliminary models, based on unadjusted projections, were developed. A logistic regression model was then developed on the basis of six social variables in Iraq and comparable information from countries in the State of the World's Children report. Missing data were estimated for this model by a multiple imputation procedure. The final model depends on three socio-medical indicators: adult literacy, nutritional stunting of children under 5 years, and access to piped water. The model successfully predicted both the mortality rate in 1990, under stable conditions, and in 1991, following the Gulf War. For 1996, after 5 years of sanctions and prior to receipt of humanitarian food via the oil for food programme, this model shows mortality among children under 5 to have reached an estimated 87 per 1000, a rate last experienced more than 30 years ago. Accurate and timely estimates of mortality levels in developing countries are costly and require considerable methodological expertise. A rapid estimation technique like the one developed here may be a useful tool for quick and efficient estimation of mortality rates among under 5 year olds in countries where good mortality data are not routinely available. This is especially true for countries with complex humanitarian emergencies where information on mortality changes can guide interventions and the social stability to use standard demographic methods does not exist.

  15. Snow multivariable data assimilation for hydrological predictions in Alpine sites

    NASA Astrophysics Data System (ADS)

    Piazzi, Gaia; Thirel, Guillaume; Campo, Lorenzo; Gabellani, Simone; Stevenin, Hervè

    2017-04-01

    Snowpack dynamics (snow accumulation and ablation) strongly impacts on hydrological processes in Alpine areas. During the winter season the presence of snow cover (snow accumulation) reduces the drainage in the basin with a resulting lower watershed time of concentration in case of possible rainfall events. Moreover, the release of the significant water volume stored in winter (snowmelt) considerably contributes to the total discharge during the melting period. Therefore when modeling hydrological processes in snow-dominated catchments the quality of predictions deeply depends on how the model succeeds in catching snowpack dynamics. The integration of a hydrological model with a snow module allows improving predictions of river discharges. Besides the well-known modeling limitations (uncertainty in parameterizations; possible errors affecting both meteorological forcing data and initial conditions; approximations in boundary conditions), there are physical factors that make an exhaustive reconstruction of snow dynamics complicated: snow intermittence in space and time, stratification and slow phenomena like metamorphism processes, uncertainty in snowfall evaluation, wind transportation, etc. Data Assimilation (DA) techniques provide an objective methodology to combine several independent snow-related data sources (model simulations, ground-based measurements and remote sensed observations) in order to obtain the most likely estimate of snowpack state. This study presents SMASH (Snow Multidata Assimilation System for Hydrology), a multi-layer snow dynamic model strengthened by a multivariable DA framework for hydrological purposes. The model is physically based on mass and energy balances and can be used to reproduce the main physical processes occurring within the snowpack: accumulation, density dynamics, melting, sublimation, radiative balance, heat and mass exchanges. The model is driven by observed forcing meteorological data (air temperature, wind velocity, relative air humidity, precipitation and incident solar radiation) to provide a complete estimate of snowpack state. The implementation of a DA scheme enables to assimilate simultaneously ground-based observations of different snow-related variables (snow depth, snow density, surface temperature and albedo). SMASH performances are evaluated by using observed data supplied by meteorological stations located in three experimental Alpine sites: Col de Porte (1325 m, France); Torgnon (2160 m, Italy); Weissfluhjoch (2540 m, Switzerland). A comparison analysis between the resulting performaces of Particle Filter and Ensemble Kalman Filter schemes is shown.

  16. On the multivariate total least-squares approach to empirical coordinate transformations. Three algorithms

    NASA Astrophysics Data System (ADS)

    Schaffrin, Burkhard; Felus, Yaron A.

    2008-06-01

    The multivariate total least-squares (MTLS) approach aims at estimating a matrix of parameters, Ξ, from a linear model ( Y- E Y = ( X- E X ) · Ξ) that includes an observation matrix, Y, another observation matrix, X, and matrices of randomly distributed errors, E Y and E X . Two special cases of the MTLS approach include the standard multivariate least-squares approach where only the observation matrix, Y, is perturbed by random errors and, on the other hand, the data least-squares approach where only the coefficient matrix X is affected by random errors. In a previous contribution, the authors derived an iterative algorithm to solve the MTLS problem by using the nonlinear Euler-Lagrange conditions. In this contribution, new lemmas are developed to analyze the iterative algorithm, modify it, and compare it with a new ‘closed form’ solution that is based on the singular-value decomposition. For an application, the total least-squares approach is used to estimate the affine transformation parameters that convert cadastral data from the old to the new Israeli datum. Technical aspects of this approach, such as scaling the data and fixing the columns in the coefficient matrix are investigated. This case study illuminates the issue of “symmetry” in the treatment of two sets of coordinates for identical point fields, a topic that had already been emphasized by Teunissen (1989, Festschrift to Torben Krarup, Geodetic Institute Bull no. 58, Copenhagen, Denmark, pp 335-342). The differences between the standard least-squares and the TLS approach are analyzed in terms of the estimated variance component and a first-order approximation of the dispersion matrix of the estimated parameters.

  17. Probabilistic flood damage modelling at the meso-scale

    NASA Astrophysics Data System (ADS)

    Kreibich, Heidi; Botto, Anna; Schröter, Kai; Merz, Bruno

    2014-05-01

    Decisions on flood risk management and adaptation are usually based on risk analyses. Such analyses are associated with significant uncertainty, even more if changes in risk due to global change are expected. Although uncertainty analysis and probabilistic approaches have received increased attention during the last years, they are still not standard practice for flood risk assessments. Most damage models have in common that complex damaging processes are described by simple, deterministic approaches like stage-damage functions. Novel probabilistic, multi-variate flood damage models have been developed and validated on the micro-scale using a data-mining approach, namely bagging decision trees (Merz et al. 2013). In this presentation we show how the model BT-FLEMO (Bagging decision Tree based Flood Loss Estimation MOdel) can be applied on the meso-scale, namely on the basis of ATKIS land-use units. The model is applied in 19 municipalities which were affected during the 2002 flood by the River Mulde in Saxony, Germany. The application of BT-FLEMO provides a probability distribution of estimated damage to residential buildings per municipality. Validation is undertaken on the one hand via a comparison with eight other damage models including stage-damage functions as well as multi-variate models. On the other hand the results are compared with official damage data provided by the Saxon Relief Bank (SAB). The results show, that uncertainties of damage estimation remain high. Thus, the significant advantage of this probabilistic flood loss estimation model BT-FLEMO is that it inherently provides quantitative information about the uncertainty of the prediction. Reference: Merz, B.; Kreibich, H.; Lall, U. (2013): Multi-variate flood damage assessment: a tree-based data-mining approach. NHESS, 13(1), 53-64.

  18. Using genetic algorithms to optimize k-Nearest Neighbors configurations for use with airborne laser scanning data

    Treesearch

    Ronald E. McRoberts; Grant M. Domke; Qi Chen; Erik Næsset; Terje Gobakken

    2016-01-01

    The relatively small sampling intensities used by national forest inventories are often insufficient to produce the desired precision for estimates of population parameters unless the estimation process is augmented with auxiliary information, usually in the form of remotely sensed data. The k-Nearest Neighbors (k-NN) technique is a non-parametric,multivariate approach...

  19. Estimating areal means and variances of forest attributes using the k-Nearest Neighbors technique and satellite imagery

    Treesearch

    Ronald E. McRoberts; Erkki O. Tomppo; Andrew O. Finley; Heikkinen Juha

    2007-01-01

    The k-Nearest Neighbor (k-NN) technique has become extremely popular for a variety of forest inventory mapping and estimation applications. Much of this popularity may be attributed to the non-parametric, multivariate features of the technique, its intuitiveness, and its ease of use. When used with satellite imagery and forest...

  20. A constrained multinomial Probit route choice model in the metro network: Formulation, estimation and application

    PubMed Central

    Zhang, Yongsheng; Wei, Heng; Zheng, Kangning

    2017-01-01

    Considering that metro network expansion brings us with more alternative routes, it is attractive to integrate the impacts of routes set and the interdependency among alternative routes on route choice probability into route choice modeling. Therefore, the formulation, estimation and application of a constrained multinomial probit (CMNP) route choice model in the metro network are carried out in this paper. The utility function is formulated as three components: the compensatory component is a function of influencing factors; the non-compensatory component measures the impacts of routes set on utility; following a multivariate normal distribution, the covariance of error component is structured into three parts, representing the correlation among routes, the transfer variance of route, and the unobserved variance respectively. Considering multidimensional integrals of the multivariate normal probability density function, the CMNP model is rewritten as Hierarchical Bayes formula and M-H sampling algorithm based Monte Carlo Markov Chain approach is constructed to estimate all parameters. Based on Guangzhou Metro data, reliable estimation results are gained. Furthermore, the proposed CMNP model also shows a good forecasting performance for the route choice probabilities calculation and a good application performance for transfer flow volume prediction. PMID:28591188

  1. Misspecification of Cox regression models with composite endpoints

    PubMed Central

    Wu, Longyang; Cook, Richard J

    2012-01-01

    Researchers routinely adopt composite endpoints in multicenter randomized trials designed to evaluate the effect of experimental interventions in cardiovascular disease, diabetes, and cancer. Despite their widespread use, relatively little attention has been paid to the statistical properties of estimators of treatment effect based on composite endpoints. We consider this here in the context of multivariate models for time to event data in which copula functions link marginal distributions with a proportional hazards structure. We then examine the asymptotic and empirical properties of the estimator of treatment effect arising from a Cox regression model for the time to the first event. We point out that even when the treatment effect is the same for the component events, the limiting value of the estimator based on the composite endpoint is usually inconsistent for this common value. We find that in this context the limiting value is determined by the degree of association between the events, the stochastic ordering of events, and the censoring distribution. Within the framework adopted, marginal methods for the analysis of multivariate failure time data yield consistent estimators of treatment effect and are therefore preferred. We illustrate the methods by application to a recent asthma study. Copyright © 2012 John Wiley & Sons, Ltd. PMID:22736519

  2. Drunk driving detection based on classification of multivariate time series.

    PubMed

    Li, Zhenlong; Jin, Xue; Zhao, Xiaohua

    2015-09-01

    This paper addresses the problem of detecting drunk driving based on classification of multivariate time series. First, driving performance measures were collected from a test in a driving simulator located in the Traffic Research Center, Beijing University of Technology. Lateral position and steering angle were used to detect drunk driving. Second, multivariate time series analysis was performed to extract the features. A piecewise linear representation was used to represent multivariate time series. A bottom-up algorithm was then employed to separate multivariate time series. The slope and time interval of each segment were extracted as the features for classification. Third, a support vector machine classifier was used to classify driver's state into two classes (normal or drunk) according to the extracted features. The proposed approach achieved an accuracy of 80.0%. Drunk driving detection based on the analysis of multivariate time series is feasible and effective. The approach has implications for drunk driving detection. Copyright © 2015 Elsevier Ltd and National Safety Council. All rights reserved.

  3. Monthly variation of United States pediatric headache emergency department visits.

    PubMed

    Kedia, Sita; Ginde, Adit A; Grubenhoff, Joseph A; Kempe, Allison; Hershey, Andrew D; Powers, Scott W

    2014-05-01

    The objective of this article is to determine the monthly variation of emergency department (ED) visits for pediatric headache. We hypothesized youth have increased headache-related ED visits in the months associated with school attendance. Using a United States representative sample of ED visits in the National Hospital Ambulatory Medical Care Survey from 1997 to 2009, we estimated number of visits associated with ICD-9 codes related to headache, migraine, status migrainosus, or tension-type headache in 5- to 18-year-olds. Age-stratified multivariate models are presented for month of visit (July as reference). There was a national estimate of 250,000 ED visits annually related to headache (2.1% of total visits) in 5- to 18-year-olds. In 5- to 11-year-olds, the adjusted rate of headache-related visits was lower in April (OR 0.42, 95% CI 0.20, 0.88). In 12- to 18-year-olds, there were higher rates in January (OR 1.92, 95% CI 1.16, 3.14) and September (OR 1.64, 95% CI 1.06, 2.55). In adolescents we found higher ED utilization in January and September, the same months associated with school return from vacation for a majority of children nationally. No significant reduction in the summer suggests that school itself is not the issue, but rather changes in daily lifestyle and transitions.

  4. Trait and state anxiety across academic evaluative contexts: development and validation of the MTEA-12 and MSEA-12 scales.

    PubMed

    Sotardi, Valerie A

    2018-05-01

    Educational measures of anxiety focus heavily on students' experiences with tests yet overlook other assessment contexts. In this research, two brief multiscale questionnaires were developed and validated to measure trait evaluation anxiety (MTEA-12) and state evaluation anxiety (MSEA-12) for use in various assessment contexts in non-clinical, educational settings. The research included a cross-sectional analysis of self-report data using authentic assessment settings in which evaluation anxiety was measured. Instruments were tested using a validation sample of 241 first-year university students in New Zealand. Scale development included component structures for state and trait scales based on existing theoretical frameworks. Analyses using confirmatory factor analysis and descriptive statistics indicate that the scales are reliable and structurally valid. Multivariate general linear modeling using subscales from the MTEA-12, MSEA-12, and student grades suggest adequate criterion-related validity. Initial predictive validity in which one relevant MTEA-12 factor explained between 21% and 54% of the variance in three MSEA-12 factors. Results document MTEA-12 and MSEA-12 as reliable measures of trait and state dimensions of evaluation anxiety for test and writing contexts. Initial estimates suggest the scales as having promising validity, and recommendations for further validation are outlined.

  5. Control-group feature normalization for multivariate pattern analysis of structural MRI data using the support vector machine.

    PubMed

    Linn, Kristin A; Gaonkar, Bilwaj; Satterthwaite, Theodore D; Doshi, Jimit; Davatzikos, Christos; Shinohara, Russell T

    2016-05-15

    Normalization of feature vector values is a common practice in machine learning. Generally, each feature value is standardized to the unit hypercube or by normalizing to zero mean and unit variance. Classification decisions based on support vector machines (SVMs) or by other methods are sensitive to the specific normalization used on the features. In the context of multivariate pattern analysis using neuroimaging data, standardization effectively up- and down-weights features based on their individual variability. Since the standard approach uses the entire data set to guide the normalization, it utilizes the total variability of these features. This total variation is inevitably dependent on the amount of marginal separation between groups. Thus, such a normalization may attenuate the separability of the data in high dimensional space. In this work we propose an alternate approach that uses an estimate of the control-group standard deviation to normalize features before training. We study our proposed approach in the context of group classification using structural MRI data. We show that control-based normalization leads to better reproducibility of estimated multivariate disease patterns and improves the classifier performance in many cases. Copyright © 2016 Elsevier Inc. All rights reserved.

  6. Estimating the ratio of multivariate recurrent event rates with application to a blood transfusion study.

    PubMed

    Ning, Jing; Rahbar, Mohammad H; Choi, Sangbum; Piao, Jin; Hong, Chuan; Del Junco, Deborah J; Rahbar, Elaheh; Fox, Erin E; Holcomb, John B; Wang, Mei-Cheng

    2017-08-01

    In comparative effectiveness studies of multicomponent, sequential interventions like blood product transfusion (plasma, platelets, red blood cells) for trauma and critical care patients, the timing and dynamics of treatment relative to the fragility of a patient's condition is often overlooked and underappreciated. While many hospitals have established massive transfusion protocols to ensure that physiologically optimal combinations of blood products are rapidly available, the period of time required to achieve a specified massive transfusion standard (e.g. a 1:1 or 1:2 ratio of plasma or platelets:red blood cells) has been ignored. To account for the time-varying characteristics of transfusions, we use semiparametric rate models for multivariate recurrent events to estimate blood product ratios. We use latent variables to account for multiple sources of informative censoring (early surgical or endovascular hemorrhage control procedures or death). The major advantage is that the distributions of latent variables and the dependence structure between the multivariate recurrent events and informative censoring need not be specified. Thus, our approach is robust to complex model assumptions. We establish asymptotic properties and evaluate finite sample performance through simulations, and apply the method to data from the PRospective Observational Multicenter Major Trauma Transfusion study.

  7. [Analysis of variance of repeated data measured by water maze with SPSS].

    PubMed

    Qiu, Hong; Jin, Guo-qin; Jin, Ru-feng; Zhao, Wei-kang

    2007-01-01

    To introduce the method of analyzing repeated data measured by water maze with SPSS 11.0, and offer a reference statistical method to clinical and basic medicine researchers who take the design of repeated measures. Using repeated measures and multivariate analysis of variance (ANOVA) process of the general linear model in SPSS and giving comparison among different groups and different measure time pairwise. Firstly, Mauchly's test of sphericity should be used to judge whether there were relations among the repeatedly measured data. If any (P

  8. Using multivariate generalizability theory to assess the effect of content stratification on the reliability of a performance assessment.

    PubMed

    Keller, Lisa A; Clauser, Brian E; Swanson, David B

    2010-12-01

    In recent years, demand for performance assessments has continued to grow. However, performance assessments are notorious for lower reliability, and in particular, low reliability resulting from task specificity. Since reliability analyses typically treat the performance tasks as randomly sampled from an infinite universe of tasks, these estimates of reliability may not be accurate. For tests built according to a table of specifications, tasks are randomly sampled from different strata (content domains, skill areas, etc.). If these strata remain fixed in the test construction process, ignoring this stratification in the reliability analysis results in an underestimate of "parallel forms" reliability, and an overestimate of the person-by-task component. This research explores the effect of representing and misrepresenting the stratification appropriately in estimation of reliability and the standard error of measurement. Both multivariate and univariate generalizability studies are reported. Results indicate that the proper specification of the analytic design is essential in yielding the proper information both about the generalizability of the assessment and the standard error of measurement. Further, illustrative D studies present the effect under a variety of situations and test designs. Additional benefits of multivariate generalizability theory in test design and evaluation are also discussed.

  9. Phylogenetic Factor Analysis.

    PubMed

    Tolkoff, Max R; Alfaro, Michael E; Baele, Guy; Lemey, Philippe; Suchard, Marc A

    2018-05-01

    Phylogenetic comparative methods explore the relationships between quantitative traits adjusting for shared evolutionary history. This adjustment often occurs through a Brownian diffusion process along the branches of the phylogeny that generates model residuals or the traits themselves. For high-dimensional traits, inferring all pair-wise correlations within the multivariate diffusion is limiting. To circumvent this problem, we propose phylogenetic factor analysis (PFA) that assumes a small unknown number of independent evolutionary factors arise along the phylogeny and these factors generate clusters of dependent traits. Set in a Bayesian framework, PFA provides measures of uncertainty on the factor number and groupings, combines both continuous and discrete traits, integrates over missing measurements and incorporates phylogenetic uncertainty with the help of molecular sequences. We develop Gibbs samplers based on dynamic programming to estimate the PFA posterior distribution, over 3-fold faster than for multivariate diffusion and a further order-of-magnitude more efficiently in the presence of latent traits. We further propose a novel marginal likelihood estimator for previously impractical models with discrete data and find that PFA also provides a better fit than multivariate diffusion in evolutionary questions in columbine flower development, placental reproduction transitions and triggerfish fin morphometry.

  10. Insights on multivariate updates of physical and biogeochemical ocean variables using an Ensemble Kalman Filter and an idealized model of upwelling

    NASA Astrophysics Data System (ADS)

    Yu, Liuqian; Fennel, Katja; Bertino, Laurent; Gharamti, Mohamad El; Thompson, Keith R.

    2018-06-01

    Effective data assimilation methods for incorporating observations into marine biogeochemical models are required to improve hindcasts, nowcasts and forecasts of the ocean's biogeochemical state. Recent assimilation efforts have shown that updating model physics alone can degrade biogeochemical fields while only updating biogeochemical variables may not improve a model's predictive skill when the physical fields are inaccurate. Here we systematically investigate whether multivariate updates of physical and biogeochemical model states are superior to only updating either physical or biogeochemical variables. We conducted a series of twin experiments in an idealized ocean channel that experiences wind-driven upwelling. The forecast model was forced with biased wind stress and perturbed biogeochemical model parameters compared to the model run representing the "truth". Taking advantage of the multivariate nature of the deterministic Ensemble Kalman Filter (DEnKF), we assimilated different combinations of synthetic physical (sea surface height, sea surface temperature and temperature profiles) and biogeochemical (surface chlorophyll and nitrate profiles) observations. We show that when biogeochemical and physical properties are highly correlated (e.g., thermocline and nutricline), multivariate updates of both are essential for improving model skill and can be accomplished by assimilating either physical (e.g., temperature profiles) or biogeochemical (e.g., nutrient profiles) observations. In our idealized domain, the improvement is largely due to a better representation of nutrient upwelling, which results in a more accurate nutrient input into the euphotic zone. In contrast, assimilating surface chlorophyll improves the model state only slightly, because surface chlorophyll contains little information about the vertical density structure. We also show that a degradation of the correlation between observed subsurface temperature and nutrient fields, which has been an issue in several previous assimilation studies, can be reduced by multivariate updates of physical and biogeochemical fields.

  11. Simultaneous calibration of ensemble river flow predictions over an entire range of lead times

    NASA Astrophysics Data System (ADS)

    Hemri, S.; Fundel, F.; Zappa, M.

    2013-10-01

    Probabilistic estimates of future water levels and river discharge are usually simulated with hydrologic models using ensemble weather forecasts as main inputs. As hydrologic models are imperfect and the meteorological ensembles tend to be biased and underdispersed, the ensemble forecasts for river runoff typically are biased and underdispersed, too. Thus, in order to achieve both reliable and sharp predictions statistical postprocessing is required. In this work Bayesian model averaging (BMA) is applied to statistically postprocess ensemble runoff raw forecasts for a catchment in Switzerland, at lead times ranging from 1 to 240 h. The raw forecasts have been obtained using deterministic and ensemble forcing meteorological models with different forecast lead time ranges. First, BMA is applied based on mixtures of univariate normal distributions, subject to the assumption of independence between distinct lead times. Then, the independence assumption is relaxed in order to estimate multivariate runoff forecasts over the entire range of lead times simultaneously, based on a BMA version that uses multivariate normal distributions. Since river runoff is a highly skewed variable, Box-Cox transformations are applied in order to achieve approximate normality. Both univariate and multivariate BMA approaches are able to generate well calibrated probabilistic forecasts that are considerably sharper than climatological forecasts. Additionally, multivariate BMA provides a promising approach for incorporating temporal dependencies into the postprocessed forecasts. Its major advantage against univariate BMA is an increase in reliability when the forecast system is changing due to model availability.

  12. Multivariable control altitude demonstration on the F100 turbofan engine

    NASA Technical Reports Server (NTRS)

    Lehtinen, B.; Dehoff, R. L.; Hackney, R. D.

    1979-01-01

    The F100 Multivariable control synthesis (MVCS) program, was aimed at demonstrating the benefits of LGR synthesis theory in the design of a multivariable engine control system for operation throughout the flight envelope. The advantages of such procedures include: (1) enhanced performance from cross-coupled controls, (2) maximum use of engine variable geometry, and (3) a systematic design procedure that can be applied efficiently to new engine systems. The control system designed, under the MVCS program, for the Pratt & Whitney F100 turbofan engine is described. Basic components of the control include: (1) a reference value generator for deriving a desired equilibrium state and an approximate control vector, (2) a transition model to produce compatible reference point trajectories during gross transients, (3) gain schedules for producing feedback terms appropriate to the flight condition, and (4) integral switching logic to produce acceptable steady-state performance without engine operating limit exceedance.

  13. Analyzing developmental processes on an individual level using nonstationary time series modeling.

    PubMed

    Molenaar, Peter C M; Sinclair, Katerina O; Rovine, Michael J; Ram, Nilam; Corneal, Sherry E

    2009-01-01

    Individuals change over time, often in complex ways. Generally, studies of change over time have combined individuals into groups for analysis, which is inappropriate in most, if not all, studies of development. The authors explain how to identify appropriate levels of analysis (individual vs. group) and demonstrate how to estimate changes in developmental processes over time using a multivariate nonstationary time series model. They apply this model to describe the changing relationships between a biological son and father and a stepson and stepfather at the individual level. The authors also explain how to use an extended Kalman filter with iteration and smoothing estimator to capture how dynamics change over time. Finally, they suggest further applications of the multivariate nonstationary time series model and detail the next steps in the development of statistical models used to analyze individual-level data.

  14. Estimation of Subpixel Snow-Covered Area by Nonparametric Regression Splines

    NASA Astrophysics Data System (ADS)

    Kuter, S.; Akyürek, Z.; Weber, G.-W.

    2016-10-01

    Measurement of the areal extent of snow cover with high accuracy plays an important role in hydrological and climate modeling. Remotely-sensed data acquired by earth-observing satellites offer great advantages for timely monitoring of snow cover. However, the main obstacle is the tradeoff between temporal and spatial resolution of satellite imageries. Soft or subpixel classification of low or moderate resolution satellite images is a preferred technique to overcome this problem. The most frequently employed snow cover fraction methods applied on Moderate Resolution Imaging Spectroradiometer (MODIS) data have evolved from spectral unmixing and empirical Normalized Difference Snow Index (NDSI) methods to latest machine learning-based artificial neural networks (ANNs). This study demonstrates the implementation of subpixel snow-covered area estimation based on the state-of-the-art nonparametric spline regression method, namely, Multivariate Adaptive Regression Splines (MARS). MARS models were trained by using MODIS top of atmospheric reflectance values of bands 1-7 as predictor variables. Reference percentage snow cover maps were generated from higher spatial resolution Landsat ETM+ binary snow cover maps. A multilayer feed-forward ANN with one hidden layer trained with backpropagation was also employed to estimate the percentage snow-covered area on the same data set. The results indicated that the developed MARS model performed better than th

  15. Modelling Long Term Disability following Injury: Comparison of Three Approaches for Handling Multiple Injuries

    PubMed Central

    Gabbe, Belinda J.; Harrison, James E.; Lyons, Ronan A.; Jolley, Damien

    2011-01-01

    Background Injury is a leading cause of the global burden of disease (GBD). Estimates of non-fatal injury burden have been limited by a paucity of empirical outcomes data. This study aimed to (i) establish the 12-month disability associated with each GBD 2010 injury health state, and (ii) compare approaches to modelling the impact of multiple injury health states on disability as measured by the Glasgow Outcome Scale – Extended (GOS-E). Methods 12-month functional outcomes for 11,337 survivors to hospital discharge were drawn from the Victorian State Trauma Registry and the Victorian Orthopaedic Trauma Outcomes Registry. ICD-10 diagnosis codes were mapped to the GBD 2010 injury health states. Cases with a GOS-E score >6 were defined as “recovered.” A split dataset approach was used. Cases were randomly assigned to development or test datasets. Probability of recovery for each health state was calculated using the development dataset. Three logistic regression models were evaluated: a) additive, multivariable; b) “worst injury;” and c) multiplicative. Models were adjusted for age and comorbidity and investigated for discrimination and calibration. Findings A single injury health state was recorded for 46% of cases (1–16 health states per case). The additive (C-statistic 0.70, 95% CI: 0.69, 0.71) and “worst injury” (C-statistic 0.70; 95% CI: 0.68, 0.71) models demonstrated higher discrimination than the multiplicative (C-statistic 0.68; 95% CI: 0.67, 0.70) model. The additive and “worst injury” models demonstrated acceptable calibration. Conclusions The majority of patients survived with persisting disability at 12-months, highlighting the importance of improving estimates of non-fatal injury burden. Additive and “worst” injury models performed similarly. GBD 2010 injury states were moderately predictive of recovery 1-year post-injury. Further evaluation using additional measures of health status and functioning and comparison with the GBD 2010 disability weights will be needed to optimise injury states for future GBD studies. PMID:21984951

  16. Costs of colorectal cancer in different states of the disease.

    PubMed

    Färkkilä, Niilo; Torvinen, Saku; Sintonen, Harri; Saarto, Tiina; Järvinen, Heikki; Hänninen, Juha; Taari, Kimmo; Roine, Risto P

    2015-04-01

    This cross-sectional study estimates the resource use and costs among prevalent colorectal cancer (CRC) patients in different states of the disease. Altogether 508 Finnish CRC patients (aged 26-96; colon cancer 56%; female 47%) answered a questionnaire enquiring about informal care, work capacity, and demographic factors. Furthermore, data on direct medical resource use and productivity costs were obtained from registries. Patients were divided into five mutually exclusive groups based on the disease state and the time from diagnosis: primary treatments (the first six months after the diagnosis), rehabilitation, remission, metastatic disease, and palliative care. The costs were calculated for a six-month period. Multivariate modeling was performed to find the cost drivers. The costs were highest during the primary treatment state and the advanced disease states. The total costs for the cross-sectional six-month period were €22 200 in the primary treatment state, €2106 in the rehabilitation state, €2812 in the remission state, €20 540 in the metastatic state, and €21 146 in the palliative state. Most of the costs were direct medical costs. The informal care cost was highest per patient in the palliative care state, amounting to 33% of the total costs. The productivity costs varied between disease states, constituting 19-40% of the total costs, and were highest in the primary treatment state. The first six months after the diagnosis of CRC are resource intensive, but compared with the metastatic disease state, which lasts on average for 2-3 years, the costs are rather modest. Informal care constitutes a remarkable share of the total costs, especially in the palliative state. These results form a basis for the evaluation of the cost effectiveness of new treatments when allocating resources in CRC treatment.

  17. State variation in HIV/AIDS health outcomes: the effect of spending on social services and public health.

    PubMed

    Talbert-Slagle, Kristina M; Canavan, Maureen E; Rogan, Erika M; Curry, Leslie A; Bradley, Elizabeth H

    2016-02-20

    Despite considerable advances in the prevention and treatment of HIV/AIDS, the burden of new infections of HIV and AIDS varies substantially across the country. Previous studies have demonstrated associations between increased healthcare spending and better HIV/AIDS outcomes; however, less is known about the association between spending on social services and public health spending and HIV/AIDS outcomes. We sought to examine the association between state-level spending on social services and public health and HIV/AIDS case rates and AIDS deaths across the United States. We conducted a retrospective, longitudinal study of the 50 U.S. states over 2000-2009 using a dataset of HIV/AIDS case rates and AIDS deaths per 100 000 people matched with a unique dataset of state-level spending on social services and public health per person in poverty. We estimated multivariable regression models for each HIV/AIDS outcome as a function of the social service and public health spending 1 and 5 years earlier in the state, adjusted for the log of state GDP per capita, regional and time fixed effects, Medicaid spending as % of GDP, and socio-demographic, economic, and health resource factors. States with higher spending on social services and public health per person in poverty had significantly lower HIV and AIDS case rates and fewer AIDS deaths, both 1 and 5 years post expenditure (P ≤ 0.05). Our findings suggest that spending on social services and public health may provide a leverage point for state policymakers to reduce HIV/AIDS case rates and AIDS deaths in their state.

  18. Risk factors for age-related macular degeneration: findings from the Andhra Pradesh eye disease study in South India.

    PubMed

    Krishnaiah, Sannapaneni; Das, Taraprasad; Nirmalan, Praveen K; Nutheti, Rishita; Shamanna, Bindiganavale R; Rao, Gullapalli N; Thomas, Ravi

    2005-12-01

    To assess prevalence, potential risk factors, and population attributable risk percentage (PAR%) for age-related macular degeneration (AMD) in the Indian state of Andhra Pradesh. A population-based study, using a stratified, random, cluster, systematic sampling strategy, was conducted in the state of Andhra Pradesh in India from 1996 to 2000. Participants from 94 clusters in one urban and three rural areas representative of the population of Andhra Pradesh underwent a detailed interview and a detailed dilated ocular evaluation by trained professionals. In this report, the authors present the prevalence estimates of AMD and examine the association of AMD with potential risk factors in persons aged 40 to 102 years (n = 3723). AMD was defined according to the international classification and grading system. Standard bivariate and multivariate analyses were performed to identify the potential risk factors for AMD. PAR% was calculated by Levin's formula. AMD was present in 71 subjects--an age-gender-area-adjusted prevalence of 1.82% (95% confidence interval [CI], 1.39%-2.25%). Risk factors that were significant in bivariate analyses were considered for multivariate logistic regression analysis. Multivariate analysis showed that the adjusted prevalence of AMD was significantly higher in those 60 years of age or older (odds ratio [OR], 3.55; 95% CI, 1.61-7.82) and history of prior cigar smoking (OR, 3.29; 95%CI, 1.42-7.57). Presence of cortical cataract and prior cataract surgery were significantly associated with increased prevalence of AMD (adjusted OR, 2.87; 95% CI, 1.57-5.26 and 3.79; 95% CI, 2.1-6.78), respectively. The prevalence of AMD was significantly lower in light alcohol drinkers (adjusted OR, 0.38; 95% CI, 0.19-0.76) compared with nondrinkers. The PAR% for hypertension and heavy cigar smoking was 10% and 14%, respectively, in this population. The prevalence of AMD in this south Indian population is similar to those reported in other developed countries. Abstinence from smoking may reduce the risk of AMD in this population.

  19. Estimated number of preterm births and low birth weight children born in the United States due to maternal binge drinking.

    PubMed

    Truong, Khoa D; Reifsnider, Odette S; Mayorga, Maria E; Spitler, Hugh

    2013-05-01

    The objective of this study was to estimate the aggregate burden of maternal binge drinking on preterm birth (PTB) and low birth weight (LBW) across American sociodemographic groups in 2008. To estimate the aggregate burden of maternal binge drinking on preterm birth (PTB) and low birth weight (LBW) across American sociodemographic groups in 2008. A simulation model was developed to estimate the number of PTB and LBW cases due to maternal binge drinking. Data inputs for the model included number of births and rates of preterm and LBW from the National Center for Health Statistics; female population by childbearing age groups from the U.S. Census; increased relative risks of preterm and LBW deliveries due to maternal binge drinking extracted from the literature; and adjusted prevalence of binge drinking among pregnant women estimated in a multivariate logistic regression model using Behavioral Risk Factor Surveillance System survey. The most conservative estimates attributed maternal binge drinking to 8,701 (95% CI: 7,804-9,598) PTBs (1.75% of all PTBs) and 5,627 (95% CI 5,121-6,133) LBW deliveries in 2008, with 3,708 (95% CI: 3,375-4,041) cases of both PTB and LBW. The estimated rate of PTB due to maternal binge drinking was 1.57% among all PTBs to White women, 0.69% among Black women, 3.31% among Hispanic women, and 2.35% among other races. Compared to other age groups, women ages 40-44 had the highest adjusted binge drinking rate and highest PTB rate due to maternal binge drinking (4.33%). Maternal binge drinking contributed significantly to PTB and LBW differentially across sociodemographic groups.

  20. Mapping as a visual health communication tool: promises and dilemmas.

    PubMed

    Parrott, Roxanne; Hopfer, Suellen; Ghetian, Christie; Lengerich, Eugene

    2007-01-01

    In the era of evidence-based public health promotion and planning, the use of maps as a form of evidence to communicate about the multiple determinants of cancer is on the rise. Geographic information systems and mapping technologies make future proliferation of this strategy likely. Yet disease maps as a communication form remain largely unexamined. This content analysis considers the presence of multivariate information, credibility cues, and the communication function of publicly accessible maps for cancer control activities. Thirty-six state comprehensive cancer control plans were publicly available in July 2005 and were reviewed for the presence of maps. Fourteen of the 36 state cancer plans (39%) contained map images (N = 59 static maps). A continuum of map inter activity was observed, with 10 states having interactive mapping tools available to query and map cancer information. Four states had both cancer plans with map images and interactive mapping tools available to the public on their Web sites. Of the 14 state cancer plans that depicted map images, two displayed multivariate data in a single map. Nine of the 10 states with interactive mapping capability offered the option to display multivariate health risk messages. The most frequent content category mapped was cancer incidence and mortality, with stage at diagnosis infrequently available. The most frequent communication function served by the maps reviewed was redundancy, as maps repeated information contained in textual forms. The social and ethical implications for communicating about cancer through the use of visual geographic representations are discussed.

  1. Racial and ethnic disparities in maternal morbidity and obstetric care.

    PubMed

    Grobman, William A; Bailit, Jennifer L; Rice, Madeline Murguia; Wapner, Ronald J; Reddy, Uma M; Varner, Michael W; Thorp, John M; Leveno, Kenneth J; Caritis, Steve N; Iams, Jay D; Tita, Alan T N; Saade, George; Rouse, Dwight J; Blackwell, Sean C; Tolosa, Jorge E; VanDorsten, J Peter

    2015-06-01

    To evaluate whether racial and ethnic disparities exist in obstetric care and adverse outcomes. We analyzed data from a cohort of women who delivered at 25 hospitals across the United States over a 3-year period. Race and ethnicity was categorized as non-Hispanic white, non-Hispanic black, Hispanic, or Asian. Associations between race and ethnicity and severe postpartum hemorrhage, peripartum infection, and severe perineal laceration at spontaneous vaginal delivery as well as between race and ethnicity and obstetric care (eg, episiotomy) relevant to the adverse outcomes were estimated by univariable analysis and multivariable logistic regression. Of 115,502 studied women, 95% were classified by one of the race and ethnicity categories. Non-Hispanic white women were significantly less likely to experience severe postpartum hemorrhage (1.6% non-Hispanic white compared with 3.0% non-Hispanic black compared with 3.1% Hispanic compared with 2.2% Asian) and peripartum infection (4.1% non-Hispanic white compared with 4.9% non-Hispanic black compared with 6.4% Hispanic compared with 6.2% Asian) than others (P<.001 for both). Severe perineal laceration at spontaneous vaginal delivery was significantly more likely in Asian women (2.5% non-Hispanic white compared with 1.2% non-Hispanic black compared with 1.5% Hispanic compared with 5.5% Asian; P<.001). These disparities persisted in multivariable analysis. Many types of obstetric care examined also were significantly different according to race and ethnicity in both univariable and multivariable analysis. There were no significant interactions between race and ethnicity and hospital of delivery. Racial and ethnic disparities exist for multiple adverse obstetric outcomes and types of obstetric care and do not appear to be explained by differences in patient characteristics or by delivery hospital. II.

  2. Racial and Ethnic Disparities in Maternal Morbidity and Obstetric Care

    PubMed Central

    Grobman, William A.; Bailit, Jennifer L.; Rice, Madeline Murguia; Wapner, Ronald J.; Reddy, Uma M.; Varner, Michael W.; Thorp, John M.; Leveno, Kenneth J.; Caritis, Steve N.; Iams, Jay D.; Tita, Alan T. N.; Saade, George; Rouse, Dwight J.; Blackwell, Sean C.; Tolosa, Jorge E.; VanDorsten, J. Peter

    2015-01-01

    Objective To evaluate whether racial and ethnic disparities exist in obstetric care and adverse outcomes. Methods We analyzed data from a cohort of women who delivered at 25 hospitals across the United States over a 3-year period. Race and ethnicity was categorized as Non-Hispanic white, Non-Hispanic black, Hispanic, or Asian. Associations between race and ethnicity and severe postpartum hemorrhage (PPH), peripartum infection, and severe perineal laceration at spontaneous vaginal delivery, as well as between race and ethnicity and obstetric care (eg, episiotomy) relevant to the adverse outcomes, were estimated by univariable analysis and multivariable logistic regression. Results Of 115,502 studied women, 95% were classified by one of the race and ethnicity categories. Non-Hispanic white women were significantly less likely to experience severe PPH (1.6% non-Hispanic white vs. 3.0% Non-Hispanic black vs. 3.1% Hispanic vs. 2.2%Asian) and peripartum infection (4.1% non-Hispanic white vs. 4.9% Non-Hispanic black vs. 6.4% Hispanic vs. 6.2% Asian) than others (P < 0.001 for both). Severe perineal laceration at spontaneous vaginal delivery was significantly more likely in Asian women (2.5% non-Hispanic white vs. 1.2% Non-Hispanic black vs. 1.5% Hispanic vs. 5.5% Asian) P< 0.001). These disparities persisted in multivariable analysis. Many types of obstetric care examined also were significantly different according to race and ethnicity in both univariable and multivariable analysis. There were no significant interactions between race and ethnicity and hospital of delivery. Conclusion Racial and ethnic disparities exist for multiple adverse obstetric outcomes and types of obstetric care, and do not appear to be explained by differences in patient characteristics or by delivery hospital. PMID:26000518

  3. State Pupil Transportation Funding: Equity and Efficiency.

    ERIC Educational Resources Information Center

    Zeitlin, Laurie S.

    1990-01-01

    Explores the influences state departments of education have on the cost and quality of pupil transportation. Evaluates the following state funding methodologies: (1) actual costs incurred; (2) a flat rate per unit; or (3) a multivariate calculation in providing service efficiently and equitably between districts. (MLF)

  4. Immigration Status, Visa Types, and Body Weight Among New Immigrants in the United States.

    PubMed

    Yeh, Ming-Chin; Parikh, Nina S; Megliola, Alison E; Kelvin, Elizabeth A

    2018-03-01

    To investigate the relationship between immigration-related factors and body mass index (BMI) among immigrants. Secondary analyses of cross-sectional survey data. The New Immigrant Survey (NIS-2003) contains data from in-person or telephone interviews between May and November 2003, with a probability sample of immigrants granted legal permanent residency in the United States. A total of 8573 US immigrants. The NIS-2003 provided data on sociobehavioral domains, including migration history, education, employment, marital history, language, and health-related behaviors. The visa classifications are as follows: (1) family reunification, (2) employment, (3) diversity, (4) refugee, and (5) legalization. Nested multivariable linear regression analysis was used to estimate the independent relationships between BMI and the variables of interest. Overall, 32.6% of participants were overweight and 11.3% were obese (mean BMI = 25). Participants who were admitted to the United States with employment, refugee, or legalization visas compared with those who came with family reunion visas had a significantly higher BMI ( P < .001, P < .001, P < .01, respectively). Duration in the United States predicted BMI, with those immigrants in the United States longer having a higher BMI ( P < .001). Our findings suggest that immigrants who obtain particular visa categorizations and immigration status might have a higher risk of being overweight or obese. Immigrants need to be targeted along with the rest of the US population for weight management interventions.

  5. Semiparametric Estimation of the Impacts of Longitudinal Interventions on Adolescent Obesity using Targeted Maximum-Likelihood: Accessible Estimation with the ltmle Package

    PubMed Central

    Decker, Anna L.; Hubbard, Alan; Crespi, Catherine M.; Seto, Edmund Y.W.; Wang, May C.

    2015-01-01

    While child and adolescent obesity is a serious public health concern, few studies have utilized parameters based on the causal inference literature to examine the potential impacts of early intervention. The purpose of this analysis was to estimate the causal effects of early interventions to improve physical activity and diet during adolescence on body mass index (BMI), a measure of adiposity, using improved techniques. The most widespread statistical method in studies of child and adolescent obesity is multi-variable regression, with the parameter of interest being the coefficient on the variable of interest. This approach does not appropriately adjust for time-dependent confounding, and the modeling assumptions may not always be met. An alternative parameter to estimate is one motivated by the causal inference literature, which can be interpreted as the mean change in the outcome under interventions to set the exposure of interest. The underlying data-generating distribution, upon which the estimator is based, can be estimated via a parametric or semi-parametric approach. Using data from the National Heart, Lung, and Blood Institute Growth and Health Study, a 10-year prospective cohort study of adolescent girls, we estimated the longitudinal impact of physical activity and diet interventions on 10-year BMI z-scores via a parameter motivated by the causal inference literature, using both parametric and semi-parametric estimation approaches. The parameters of interest were estimated with a recently released R package, ltmle, for estimating means based upon general longitudinal treatment regimes. We found that early, sustained intervention on total calories had a greater impact than a physical activity intervention or non-sustained interventions. Multivariable linear regression yielded inflated effect estimates compared to estimates based on targeted maximum-likelihood estimation and data-adaptive super learning. Our analysis demonstrates that sophisticated, optimal semiparametric estimation of longitudinal treatment-specific means via ltmle provides an incredibly powerful, yet easy-to-use tool, removing impediments for putting theory into practice. PMID:26046009

  6. Multivariate pattern classification reveals autonomic and experiential representations of discrete emotions.

    PubMed

    Kragel, Philip A; Labar, Kevin S

    2013-08-01

    Defining the structural organization of emotions is a central unresolved question in affective science. In particular, the extent to which autonomic nervous system activity signifies distinct affective states remains controversial. Most prior research on this topic has used univariate statistical approaches in attempts to classify emotions from psychophysiological data. In the present study, electrodermal, cardiac, respiratory, and gastric activity, as well as self-report measures were taken from healthy subjects during the experience of fear, anger, sadness, surprise, contentment, and amusement in response to film and music clips. Information pertaining to affective states present in these response patterns was analyzed using multivariate pattern classification techniques. Overall accuracy for classifying distinct affective states was 58.0% for autonomic measures and 88.2% for self-report measures, both of which were significantly above chance. Further, examining the error distribution of classifiers revealed that the dimensions of valence and arousal selectively contributed to decoding emotional states from self-report, whereas a categorical configuration of affective space was evident in both self-report and autonomic measures. Taken together, these findings extend recent multivariate approaches to study emotion and indicate that pattern classification tools may improve upon univariate approaches to reveal the underlying structure of emotional experience and physiological expression. PsycINFO Database Record (c) 2013 APA, all rights reserved.

  7. Multivariate Pattern Classification Reveals Autonomic and Experiential Representations of Discrete Emotions

    PubMed Central

    Kragel, Philip A.; LaBar, Kevin S.

    2013-01-01

    Defining the structural organization of emotions is a central unresolved question in affective science. In particular, the extent to which autonomic nervous system activity signifies distinct affective states remains controversial. Most prior research on this topic has used univariate statistical approaches in attempts to classify emotions from psychophysiological data. In the present study, electrodermal, cardiac, respiratory, and gastric activity, as well as self-report measures were taken from healthy subjects during the experience of fear, anger, sadness, surprise, contentment, and amusement in response to film and music clips. Information pertaining to affective states present in these response patterns was analyzed using multivariate pattern classification techniques. Overall accuracy for classifying distinct affective states was 58.0% for autonomic measures and 88.2% for self-report measures, both of which were significantly above chance. Further, examining the error distribution of classifiers revealed that the dimensions of valence and arousal selectively contributed to decoding emotional states from self-report, whereas a categorical configuration of affective space was evident in both self-report and autonomic measures. Taken together, these findings extend recent multivariate approaches to study emotion and indicate that pattern classification tools may improve upon univariate approaches to reveal the underlying structure of emotional experience and physiological expression. PMID:23527508

  8. [The impact of social and hygienic lifestyle factors on health status of students].

    PubMed

    Sakharova, O B; Kiku, P F; Gorborukova, T V

    2012-01-01

    The complex estimation of the impact of socio-hygienic lifestyle factors on the health of students has been performed. In the work the data of sociological analysis (questionnaire), the methods of multivariate statistics (correlation, regression analysis, method of correlation pleiades by P. V. Terentiev) were used. Among the analyzed components the average monthly income was found to make the greatest contribution of the health state and physical capacity of the studied contingent of students. The influence of this factor is most pronounced in a group of students with an average wealth. The quality of nutrition and the mode of life depend on the level of material well-being of students. Students with a deficiency or excess body weight are more susceptible to the effects of such lifestyle factors such as nutrition, physical activity, bad habits and prosperity.

  9. Analytical methods in multivariate highway safety exposure data estimation

    DOT National Transportation Integrated Search

    1984-01-01

    Three general analytical techniques which may be of use in : extending, enhancing, and combining highway accident exposure data are : discussed. The techniques are log-linear modelling, iterative propor : tional fitting and the expectation maximizati...

  10. Multivariate frequency domain analysis of protein dynamics

    NASA Astrophysics Data System (ADS)

    Matsunaga, Yasuhiro; Fuchigami, Sotaro; Kidera, Akinori

    2009-03-01

    Multivariate frequency domain analysis (MFDA) is proposed to characterize collective vibrational dynamics of protein obtained by a molecular dynamics (MD) simulation. MFDA performs principal component analysis (PCA) for a bandpass filtered multivariate time series using the multitaper method of spectral estimation. By applying MFDA to MD trajectories of bovine pancreatic trypsin inhibitor, we determined the collective vibrational modes in the frequency domain, which were identified by their vibrational frequencies and eigenvectors. At near zero temperature, the vibrational modes determined by MFDA agreed well with those calculated by normal mode analysis. At 300 K, the vibrational modes exhibited characteristic features that were considerably different from the principal modes of the static distribution given by the standard PCA. The influences of aqueous environments were discussed based on two different sets of vibrational modes, one derived from a MD simulation in water and the other from a simulation in vacuum. Using the varimax rotation, an algorithm of the multivariate statistical analysis, the representative orthogonal set of eigenmodes was determined at each vibrational frequency.

  11. Calibrated Multivariate Regression with Application to Neural Semantic Basis Discovery.

    PubMed

    Liu, Han; Wang, Lie; Zhao, Tuo

    2015-08-01

    We propose a calibrated multivariate regression method named CMR for fitting high dimensional multivariate regression models. Compared with existing methods, CMR calibrates regularization for each regression task with respect to its noise level so that it simultaneously attains improved finite-sample performance and tuning insensitiveness. Theoretically, we provide sufficient conditions under which CMR achieves the optimal rate of convergence in parameter estimation. Computationally, we propose an efficient smoothed proximal gradient algorithm with a worst-case numerical rate of convergence O (1/ ϵ ), where ϵ is a pre-specified accuracy of the objective function value. We conduct thorough numerical simulations to illustrate that CMR consistently outperforms other high dimensional multivariate regression methods. We also apply CMR to solve a brain activity prediction problem and find that it is as competitive as a handcrafted model created by human experts. The R package camel implementing the proposed method is available on the Comprehensive R Archive Network http://cran.r-project.org/web/packages/camel/.

  12. Multivariate longitudinal data analysis with censored and intermittent missing responses.

    PubMed

    Lin, Tsung-I; Lachos, Victor H; Wang, Wan-Lun

    2018-05-08

    The multivariate linear mixed model (MLMM) has emerged as an important analytical tool for longitudinal data with multiple outcomes. However, the analysis of multivariate longitudinal data could be complicated by the presence of censored measurements because of a detection limit of the assay in combination with unavoidable missing values arising when subjects miss some of their scheduled visits intermittently. This paper presents a generalization of the MLMM approach, called the MLMM-CM, for a joint analysis of the multivariate longitudinal data with censored and intermittent missing responses. A computationally feasible expectation maximization-based procedure is developed to carry out maximum likelihood estimation within the MLMM-CM framework. Moreover, the asymptotic standard errors of fixed effects are explicitly obtained via the information-based method. We illustrate our methodology by using simulated data and a case study from an AIDS clinical trial. Experimental results reveal that the proposed method is able to provide more satisfactory performance as compared with the traditional MLMM approach. Copyright © 2018 John Wiley & Sons, Ltd.

  13. Multivariate meta-analysis for non-linear and other multi-parameter associations

    PubMed Central

    Gasparrini, A; Armstrong, B; Kenward, M G

    2012-01-01

    In this paper, we formalize the application of multivariate meta-analysis and meta-regression to synthesize estimates of multi-parameter associations obtained from different studies. This modelling approach extends the standard two-stage analysis used to combine results across different sub-groups or populations. The most straightforward application is for the meta-analysis of non-linear relationships, described for example by regression coefficients of splines or other functions, but the methodology easily generalizes to any setting where complex associations are described by multiple correlated parameters. The modelling framework of multivariate meta-analysis is implemented in the package mvmeta within the statistical environment R. As an illustrative example, we propose a two-stage analysis for investigating the non-linear exposure–response relationship between temperature and non-accidental mortality using time-series data from multiple cities. Multivariate meta-analysis represents a useful analytical tool for studying complex associations through a two-stage procedure. Copyright © 2012 John Wiley & Sons, Ltd. PMID:22807043

  14. Factors related to choosing an academic career track among spine fellowship applicants.

    PubMed

    Park, Daniel K; Rhee, John M; Wu, Baohua; Easley, Kirk

    2013-03-01

    Retrospective review. To identify factors associated with the likelihood of spine surgery fellowship applicants choosing an academic job upon fellowship completion. Training academic spine surgeons is an important goal of many spine fellowships. However, there are no established criteria associated with academic job choice to guide selection committees. Two hundred three consecutive applications of candidates who were granted an interview to a single spine surgical fellowship from 2005 to 2010 were analyzed. Factors investigated included the following: membership in honor societies; number of publications, presentations, and book chapters; age; completion of an additional degree; completion of a research fellowship; teaching experience; marital status; graduation from a top-20 school; attendance in a residency with a spine fellowship; and comments made in personal statements and letters of recommendation. The job taken upon graduation from fellowship was determined. The χ2 test or Fisher exact test was used to estimate the strength of the association between the covariates and response. Significant variables were selected for further multivariate analysis. The following were significantly associated in a univariable analysis with academia: 5 or more national presentations; completion of a research fellowship; attendance in a top-20 medical school; stated desire in the personal statement to become an academic surgeon; and letters of reference stating likelihood of pursuing academics on hiring the applicant. When significant variables were selected for multivariable analysis, completion of a research fellowship, graduation from a top-20 medical school, and stated desire in the personal statement to become an academic surgeon were most strongly associated with choice of academia. Although job choice is multifactorial, the present study demonstrates that there are objective factors listed on spine fellowship applications associated with a significantly higher likelihood of academic job choice. Analyzing these factors may help selection committees evaluate spine fellowship applicants consistent with the academic missions of their programs.

  15. Arm structure in normal spiral galaxies, 1: Multivariate data for 492 galaxies

    NASA Technical Reports Server (NTRS)

    Magri, Christopher

    1994-01-01

    Multivariate data have been collected as part of an effort to develop a new classification system for spiral galaxies, one which is not necessarily based on subjective morphological properties. A sample of 492 moderately bright northern Sa and Sc spirals was chosen for future statistical analysis. New observations were made at 20 and 21 cm; the latter data are described in detail here. Infrared Astronomy Satellite (IRAS) fluxes were obtained from archival data. Finally, new estimates of arm pattern radomness and of local environmental harshness were compiled for most sample objects.

  16. Regression Models For Multivariate Count Data

    PubMed Central

    Zhang, Yiwen; Zhou, Hua; Zhou, Jin; Sun, Wei

    2016-01-01

    Data with multivariate count responses frequently occur in modern applications. The commonly used multinomial-logit model is limiting due to its restrictive mean-variance structure. For instance, analyzing count data from the recent RNA-seq technology by the multinomial-logit model leads to serious errors in hypothesis testing. The ubiquity of over-dispersion and complicated correlation structures among multivariate counts calls for more flexible regression models. In this article, we study some generalized linear models that incorporate various correlation structures among the counts. Current literature lacks a treatment of these models, partly due to the fact that they do not belong to the natural exponential family. We study the estimation, testing, and variable selection for these models in a unifying framework. The regression models are compared on both synthetic and real RNA-seq data. PMID:28348500

  17. Regression Models For Multivariate Count Data.

    PubMed

    Zhang, Yiwen; Zhou, Hua; Zhou, Jin; Sun, Wei

    2017-01-01

    Data with multivariate count responses frequently occur in modern applications. The commonly used multinomial-logit model is limiting due to its restrictive mean-variance structure. For instance, analyzing count data from the recent RNA-seq technology by the multinomial-logit model leads to serious errors in hypothesis testing. The ubiquity of over-dispersion and complicated correlation structures among multivariate counts calls for more flexible regression models. In this article, we study some generalized linear models that incorporate various correlation structures among the counts. Current literature lacks a treatment of these models, partly due to the fact that they do not belong to the natural exponential family. We study the estimation, testing, and variable selection for these models in a unifying framework. The regression models are compared on both synthetic and real RNA-seq data.

  18. Multivariate flood risk assessment: reinsurance perspective

    NASA Astrophysics Data System (ADS)

    Ghizzoni, Tatiana; Ellenrieder, Tobias

    2013-04-01

    For insurance and re-insurance purposes the knowledge of the spatial characteristics of fluvial flooding is fundamental. The probability of simultaneous flooding at different locations during one event and the associated severity and losses have to be estimated in order to assess premiums and for accumulation control (Probable Maximum Losses calculation). Therefore, the identification of a statistical model able to describe the multivariate joint distribution of flood events in multiple location is necessary. In this context, copulas can be viewed as alternative tools for dealing with multivariate simulations as they allow to formalize dependence structures of random vectors. An application of copula function for flood scenario generation is presented for Australia (Queensland, New South Wales and Victoria) where 100.000 possible flood scenarios covering approximately 15.000 years were simulated.

  19. Gain-scheduling multivariable LPV control of an irrigation canal system.

    PubMed

    Bolea, Yolanda; Puig, Vicenç

    2016-07-01

    The purpose of this paper is to present a multivariable linear parameter varying (LPV) controller with a gain scheduling Smith Predictor (SP) scheme applicable to open-flow canal systems. This LPV controller based on SP is designed taking into account the uncertainty in the estimation of delay and the variation of plant parameters according to the operating point. This new methodology can be applied to a class of delay systems that can be represented by a set of models that can be factorized into a rational multivariable model in series with left/right diagonal (multiple) delays, such as, the case of irrigation canals. A multiple pool canal system is used to test and validate the proposed control approach. Copyright © 2016 ISA. Published by Elsevier Ltd. All rights reserved.

  20. Hybrid least squares multivariate spectral analysis methods

    DOEpatents

    Haaland, David M.

    2004-03-23

    A set of hybrid least squares multivariate spectral analysis methods in which spectral shapes of components or effects not present in the original calibration step are added in a following prediction or calibration step to improve the accuracy of the estimation of the amount of the original components in the sampled mixture. The hybrid method herein means a combination of an initial calibration step with subsequent analysis by an inverse multivariate analysis method. A spectral shape herein means normally the spectral shape of a non-calibrated chemical component in the sample mixture but can also mean the spectral shapes of other sources of spectral variation, including temperature drift, shifts between spectrometers, spectrometer drift, etc. The shape can be continuous, discontinuous, or even discrete points illustrative of the particular effect.

Top