Sample records for structured additive regression

  1. A review of the performance and structural considerations of paraffin wax hybrid rocket fuels with additives

    NASA Astrophysics Data System (ADS)

    Veale, Kirsty; Adali, Sarp; Pitot, Jean; Brooks, Michael

    2017-12-01

    Paraffin wax as a hybrid rocket fuel has not been comprehensively characterised, especially regarding the structural feasibility of the material in launch applications. Preliminary structural testing has shown paraffin wax to be a brittle, low strength material, and at risk of failure under launch loading conditions. Structural enhancing additives have been identified, but their effect on motor performance has not always been considered, nor has any standard method of testing been identified between research institutes. A review of existing regression rate measurement techniques on paraffin wax based fuels and the results obtained with various additives are collated and discussed in this paper. The review includes 2D slab motors that enable visualisation of liquefying fuel droplet entrainment and the effect of an increased viscosity on the droplet entrainment mechanism, which can occur with the addition of structural enhancing polymers. An increased viscosity has been shown to reduce the regression rate of liquefying fuels. Viscosity increasing additives that have been tested include EVA and LDPE. Both these additives increase the structural properties of paraffin wax, where the elongation and UTS are improved. Other additives, such as metal hydrides, aluminium and boron generally offer improvements on the regression rate. However, very little consideration has been given to the structural effects these additives have on the wax grain. A 40% aluminised grain, for example, offers a slight increase in the UTS but reduces the elongation of paraffin wax. Geometrically accurate lab-scale motors have also been used to determine the regression rate properties of various additives in paraffin wax. A concise review of all available regression rate testing techniques and results on paraffin wax based hybrid propellants, as well as existing structural testing data, is presented in this paper.

  2. Structured functional additive regression in reproducing kernel Hilbert spaces.

    PubMed

    Zhu, Hongxiao; Yao, Fang; Zhang, Hao Helen

    2014-06-01

    Functional additive models (FAMs) provide a flexible yet simple framework for regressions involving functional predictors. The utilization of data-driven basis in an additive rather than linear structure naturally extends the classical functional linear model. However, the critical issue of selecting nonlinear additive components has been less studied. In this work, we propose a new regularization framework for the structure estimation in the context of Reproducing Kernel Hilbert Spaces. The proposed approach takes advantage of the functional principal components which greatly facilitates the implementation and the theoretical analysis. The selection and estimation are achieved by penalized least squares using a penalty which encourages the sparse structure of the additive components. Theoretical properties such as the rate of convergence are investigated. The empirical performance is demonstrated through simulation studies and a real data application.

  3. Structured functional additive regression in reproducing kernel Hilbert spaces

    PubMed Central

    Zhu, Hongxiao; Yao, Fang; Zhang, Hao Helen

    2013-01-01

    Summary Functional additive models (FAMs) provide a flexible yet simple framework for regressions involving functional predictors. The utilization of data-driven basis in an additive rather than linear structure naturally extends the classical functional linear model. However, the critical issue of selecting nonlinear additive components has been less studied. In this work, we propose a new regularization framework for the structure estimation in the context of Reproducing Kernel Hilbert Spaces. The proposed approach takes advantage of the functional principal components which greatly facilitates the implementation and the theoretical analysis. The selection and estimation are achieved by penalized least squares using a penalty which encourages the sparse structure of the additive components. Theoretical properties such as the rate of convergence are investigated. The empirical performance is demonstrated through simulation studies and a real data application. PMID:25013362

  4. Boosting structured additive quantile regression for longitudinal childhood obesity data.

    PubMed

    Fenske, Nora; Fahrmeir, Ludwig; Hothorn, Torsten; Rzehak, Peter; Höhle, Michael

    2013-07-25

    Childhood obesity and the investigation of its risk factors has become an important public health issue. Our work is based on and motivated by a German longitudinal study including 2,226 children with up to ten measurements on their body mass index (BMI) and risk factors from birth to the age of 10 years. We introduce boosting of structured additive quantile regression as a novel distribution-free approach for longitudinal quantile regression. The quantile-specific predictors of our model include conventional linear population effects, smooth nonlinear functional effects, varying-coefficient terms, and individual-specific effects, such as intercepts and slopes. Estimation is based on boosting, a computer intensive inference method for highly complex models. We propose a component-wise functional gradient descent boosting algorithm that allows for penalized estimation of the large variety of different effects, particularly leading to individual-specific effects shrunken toward zero. This concept allows us to flexibly estimate the nonlinear age curves of upper quantiles of the BMI distribution, both on population and on individual-specific level, adjusted for further risk factors and to detect age-varying effects of categorical risk factors. Our model approach can be regarded as the quantile regression analog of Gaussian additive mixed models (or structured additive mean regression models), and we compare both model classes with respect to our obesity data.

  5. Random regression analyses using B-splines functions to model growth from birth to adult age in Canchim cattle.

    PubMed

    Baldi, F; Alencar, M M; Albuquerque, L G

    2010-12-01

    The objective of this work was to estimate covariance functions using random regression models on B-splines functions of animal age, for weights from birth to adult age in Canchim cattle. Data comprised 49,011 records on 2435 females. The model of analysis included fixed effects of contemporary groups, age of dam as quadratic covariable and the population mean trend taken into account by a cubic regression on orthogonal polynomials of animal age. Residual variances were modelled through a step function with four classes. The direct and maternal additive genetic effects, and animal and maternal permanent environmental effects were included as random effects in the model. A total of seventeen analyses, considering linear, quadratic and cubic B-splines functions and up to seven knots, were carried out. B-spline functions of the same order were considered for all random effects. Random regression models on B-splines functions were compared to a random regression model on Legendre polynomials and with a multitrait model. Results from different models of analyses were compared using the REML form of the Akaike Information criterion and Schwarz' Bayesian Information criterion. In addition, the variance components and genetic parameters estimated for each random regression model were also used as criteria to choose the most adequate model to describe the covariance structure of the data. A model fitting quadratic B-splines, with four knots or three segments for direct additive genetic effect and animal permanent environmental effect and two knots for maternal additive genetic effect and maternal permanent environmental effect, was the most adequate to describe the covariance structure of the data. Random regression models using B-spline functions as base functions fitted the data better than Legendre polynomials, especially at mature ages, but higher number of parameters need to be estimated with B-splines functions. © 2010 Blackwell Verlag GmbH.

  6. Factor Scores, Structure Coefficients, and Communality Coefficients

    ERIC Educational Resources Information Center

    Goodwyn, Fara

    2012-01-01

    This paper presents heuristic explanations of factor scores, structure coefficients, and communality coefficients. Common misconceptions regarding these topics are clarified. In addition, (a) the regression (b) Bartlett, (c) Anderson-Rubin, and (d) Thompson methods for calculating factor scores are reviewed. Syntax necessary to execute all four…

  7. Predicting the occurrence of wildfires with binary structured additive regression models.

    PubMed

    Ríos-Pena, Laura; Kneib, Thomas; Cadarso-Suárez, Carmen; Marey-Pérez, Manuel

    2017-02-01

    Wildfires are one of the main environmental problems facing societies today, and in the case of Galicia (north-west Spain), they are the main cause of forest destruction. This paper used binary structured additive regression (STAR) for modelling the occurrence of wildfires in Galicia. Binary STAR models are a recent contribution to the classical logistic regression and binary generalized additive models. Their main advantage lies in their flexibility for modelling non-linear effects, while simultaneously incorporating spatial and temporal variables directly, thereby making it possible to reveal possible relationships among the variables considered. The results showed that the occurrence of wildfires depends on many covariates which display variable behaviour across space and time, and which largely determine the likelihood of ignition of a fire. The joint possibility of working on spatial scales with a resolution of 1 × 1 km cells and mapping predictions in a colour range makes STAR models a useful tool for plotting and predicting wildfire occurrence. Lastly, it will facilitate the development of fire behaviour models, which can be invaluable when it comes to drawing up fire-prevention and firefighting plans. Copyright © 2016 Elsevier Ltd. All rights reserved.

  8. Using Structured Additive Regression Models to Estimate Risk Factors of Malaria: Analysis of 2010 Malawi Malaria Indicator Survey Data

    PubMed Central

    Chirombo, James; Lowe, Rachel; Kazembe, Lawrence

    2014-01-01

    Background After years of implementing Roll Back Malaria (RBM) interventions, the changing landscape of malaria in terms of risk factors and spatial pattern has not been fully investigated. This paper uses the 2010 malaria indicator survey data to investigate if known malaria risk factors remain relevant after many years of interventions. Methods We adopted a structured additive logistic regression model that allowed for spatial correlation, to more realistically estimate malaria risk factors. Our model included child and household level covariates, as well as climatic and environmental factors. Continuous variables were modelled by assuming second order random walk priors, while spatial correlation was specified as a Markov random field prior, with fixed effects assigned diffuse priors. Inference was fully Bayesian resulting in an under five malaria risk map for Malawi. Results Malaria risk increased with increasing age of the child. With respect to socio-economic factors, the greater the household wealth, the lower the malaria prevalence. A general decline in malaria risk was observed as altitude increased. Minimum temperatures and average total rainfall in the three months preceding the survey did not show a strong association with disease risk. Conclusions The structured additive regression model offered a flexible extension to standard regression models by enabling simultaneous modelling of possible nonlinear effects of continuous covariates, spatial correlation and heterogeneity, while estimating usual fixed effects of categorical and continuous observed variables. Our results confirmed that malaria epidemiology is a complex interaction of biotic and abiotic factors, both at the individual, household and community level and that risk factors are still relevant many years after extensive implementation of RBM activities. PMID:24991915

  9. Using structured additive regression models to estimate risk factors of malaria: analysis of 2010 Malawi malaria indicator survey data.

    PubMed

    Chirombo, James; Lowe, Rachel; Kazembe, Lawrence

    2014-01-01

    After years of implementing Roll Back Malaria (RBM) interventions, the changing landscape of malaria in terms of risk factors and spatial pattern has not been fully investigated. This paper uses the 2010 malaria indicator survey data to investigate if known malaria risk factors remain relevant after many years of interventions. We adopted a structured additive logistic regression model that allowed for spatial correlation, to more realistically estimate malaria risk factors. Our model included child and household level covariates, as well as climatic and environmental factors. Continuous variables were modelled by assuming second order random walk priors, while spatial correlation was specified as a Markov random field prior, with fixed effects assigned diffuse priors. Inference was fully Bayesian resulting in an under five malaria risk map for Malawi. Malaria risk increased with increasing age of the child. With respect to socio-economic factors, the greater the household wealth, the lower the malaria prevalence. A general decline in malaria risk was observed as altitude increased. Minimum temperatures and average total rainfall in the three months preceding the survey did not show a strong association with disease risk. The structured additive regression model offered a flexible extension to standard regression models by enabling simultaneous modelling of possible nonlinear effects of continuous covariates, spatial correlation and heterogeneity, while estimating usual fixed effects of categorical and continuous observed variables. Our results confirmed that malaria epidemiology is a complex interaction of biotic and abiotic factors, both at the individual, household and community level and that risk factors are still relevant many years after extensive implementation of RBM activities.

  10. Application of nonlinear least-squares regression to ground-water flow modeling, west-central Florida

    USGS Publications Warehouse

    Yobbi, D.K.

    2000-01-01

    A nonlinear least-squares regression technique for estimation of ground-water flow model parameters was applied to an existing model of the regional aquifer system underlying west-central Florida. The regression technique minimizes the differences between measured and simulated water levels. Regression statistics, including parameter sensitivities and correlations, were calculated for reported parameter values in the existing model. Optimal parameter values for selected hydrologic variables of interest are estimated by nonlinear regression. Optimal estimates of parameter values are about 140 times greater than and about 0.01 times less than reported values. Independently estimating all parameters by nonlinear regression was impossible, given the existing zonation structure and number of observations, because of parameter insensitivity and correlation. Although the model yields parameter values similar to those estimated by other methods and reproduces the measured water levels reasonably accurately, a simpler parameter structure should be considered. Some possible ways of improving model calibration are to: (1) modify the defined parameter-zonation structure by omitting and/or combining parameters to be estimated; (2) carefully eliminate observation data based on evidence that they are likely to be biased; (3) collect additional water-level data; (4) assign values to insensitive parameters, and (5) estimate the most sensitive parameters first, then, using the optimized values for these parameters, estimate the entire data set.

  11. Spatial quantile regression using INLA with applications to childhood overweight in Malawi.

    PubMed

    Mtambo, Owen P L; Masangwi, Salule J; Kazembe, Lawrence N M

    2015-04-01

    Analyses of childhood overweight have mainly used mean regression. However, using quantile regression is more appropriate as it provides flexibility to analyse the determinants of overweight corresponding to quantiles of interest. The main objective of this study was to fit a Bayesian additive quantile regression model with structured spatial effects for childhood overweight in Malawi using the 2010 Malawi DHS data. Inference was fully Bayesian using R-INLA package. The significant determinants of childhood overweight ranged from socio-demographic factors such as type of residence to child and maternal factors such as child age and maternal BMI. We observed significant positive structured spatial effects on childhood overweight in some districts of Malawi. We recommended that the childhood malnutrition policy makers should consider timely interventions based on risk factors as identified in this paper including spatial targets of interventions. Copyright © 2015 Elsevier Ltd. All rights reserved.

  12. High regression rate hybrid rocket fuel grains with helical port structures

    NASA Astrophysics Data System (ADS)

    Walker, Sean D.

    Hybrid rockets are popular in the aerospace industry due to their storage safety, simplicity, and controllability during rocket motor burn. However, they produce fuel regression rates typically 25% lower than solid fuel motors of the same thrust level. These lowered regression rates produce unacceptably high oxidizer-to-fuel (O/F) ratios that produce a potential for motor instability, nozzle erosion, and reduced motor duty cycles. To achieve O/F ratios that produce acceptable combustion characteristics, traditional cylindrical fuel ports are fabricated with very long length-to-diameter ratios to increase the total burning area. These high aspect ratios produce further reduced fuel regression rate and thrust levels, poor volumetric efficiency, and a potential for lateral structural loading issues during high thrust burns. In place of traditional cylindrical fuel ports, it is proposed that by researching the effects of centrifugal flow patterns introduced by embedded helical fuel port structures, a significant increase in fuel regression rates can be observed. The benefits of increasing volumetric efficiencies by lengthening the internal flow path will also be observed. The mechanisms of this increased fuel regression rate are driven by enhancing surface skin friction and reducing the effect of boundary layer "blowing" to enhance convective heat transfer to the fuel surface. Preliminary results using additive manufacturing to fabricate hybrid rocket fuel grains from acrylonitrile-butadiene-styrene (ABS) with embedded helical fuel port structures have been obtained, with burn-rate amplifications up to 3.0x than that of cylindrical fuel ports.

  13. Multilevel covariance regression with correlated random effects in the mean and variance structure.

    PubMed

    Quintero, Adrian; Lesaffre, Emmanuel

    2017-09-01

    Multivariate regression methods generally assume a constant covariance matrix for the observations. In case a heteroscedastic model is needed, the parametric and nonparametric covariance regression approaches can be restrictive in the literature. We propose a multilevel regression model for the mean and covariance structure, including random intercepts in both components and allowing for correlation between them. The implied conditional covariance function can be different across clusters as a result of the random effect in the variance structure. In addition, allowing for correlation between the random intercepts in the mean and covariance makes the model convenient for skewedly distributed responses. Furthermore, it permits us to analyse directly the relation between the mean response level and the variability in each cluster. Parameter estimation is carried out via Gibbs sampling. We compare the performance of our model to other covariance modelling approaches in a simulation study. Finally, the proposed model is applied to the RN4CAST dataset to identify the variables that impact burnout of nurses in Belgium. © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  14. Structured Additive Quantile Regression for Assessing the Determinants of Childhood Anemia in Rwanda.

    PubMed

    Habyarimana, Faustin; Zewotir, Temesgen; Ramroop, Shaun

    2017-06-17

    Childhood anemia is among the most significant health problems faced by public health departments in developing countries. This study aims at assessing the determinants and possible spatial effects associated with childhood anemia in Rwanda. The 2014/2015 Rwanda Demographic and Health Survey (RDHS) data was used. The analysis was done using the structured spatial additive quantile regression model. The findings of this study revealed that the child's age; the duration of breastfeeding; gender of the child; the nutritional status of the child (whether underweight and/or wasting); whether the child had a fever; had a cough in the two weeks prior to the survey or not; whether the child received vitamin A supplementation in the six weeks before the survey or not; the household wealth index; literacy of the mother; mother's anemia status; mother's age at the birth are all significant factors associated with childhood anemia in Rwanda. Furthermore, significant structured spatial location effects on childhood anemia was found.

  15. Image interpolation via regularized local linear regression.

    PubMed

    Liu, Xianming; Zhao, Debin; Xiong, Ruiqin; Ma, Siwei; Gao, Wen; Sun, Huifang

    2011-12-01

    The linear regression model is a very attractive tool to design effective image interpolation schemes. Some regression-based image interpolation algorithms have been proposed in the literature, in which the objective functions are optimized by ordinary least squares (OLS). However, it is shown that interpolation with OLS may have some undesirable properties from a robustness point of view: even small amounts of outliers can dramatically affect the estimates. To address these issues, in this paper we propose a novel image interpolation algorithm based on regularized local linear regression (RLLR). Starting with the linear regression model where we replace the OLS error norm with the moving least squares (MLS) error norm leads to a robust estimator of local image structure. To keep the solution stable and avoid overfitting, we incorporate the l(2)-norm as the estimator complexity penalty. Moreover, motivated by recent progress on manifold-based semi-supervised learning, we explicitly consider the intrinsic manifold structure by making use of both measured and unmeasured data points. Specifically, our framework incorporates the geometric structure of the marginal probability distribution induced by unmeasured samples as an additional local smoothness preserving constraint. The optimal model parameters can be obtained with a closed-form solution by solving a convex optimization problem. Experimental results on benchmark test images demonstrate that the proposed method achieves very competitive performance with the state-of-the-art interpolation algorithms, especially in image edge structure preservation. © 2011 IEEE

  16. Feature Grouping and Selection Over an Undirected Graph.

    PubMed

    Yang, Sen; Yuan, Lei; Lai, Ying-Cheng; Shen, Xiaotong; Wonka, Peter; Ye, Jieping

    2012-01-01

    High-dimensional regression/classification continues to be an important and challenging problem, especially when features are highly correlated. Feature selection, combined with additional structure information on the features has been considered to be promising in promoting regression/classification performance. Graph-guided fused lasso (GFlasso) has recently been proposed to facilitate feature selection and graph structure exploitation, when features exhibit certain graph structures. However, the formulation in GFlasso relies on pairwise sample correlations to perform feature grouping, which could introduce additional estimation bias. In this paper, we propose three new feature grouping and selection methods to resolve this issue. The first method employs a convex function to penalize the pairwise l ∞ norm of connected regression/classification coefficients, achieving simultaneous feature grouping and selection. The second method improves the first one by utilizing a non-convex function to reduce the estimation bias. The third one is the extension of the second method using a truncated l 1 regularization to further reduce the estimation bias. The proposed methods combine feature grouping and feature selection to enhance estimation accuracy. We employ the alternating direction method of multipliers (ADMM) and difference of convex functions (DC) programming to solve the proposed formulations. Our experimental results on synthetic data and two real datasets demonstrate the effectiveness of the proposed methods.

  17. A Selective Review of Group Selection in High-Dimensional Models

    PubMed Central

    Huang, Jian; Breheny, Patrick; Ma, Shuangge

    2013-01-01

    Grouping structures arise naturally in many statistical modeling problems. Several methods have been proposed for variable selection that respect grouping structure in variables. Examples include the group LASSO and several concave group selection methods. In this article, we give a selective review of group selection concerning methodological developments, theoretical properties and computational algorithms. We pay particular attention to group selection methods involving concave penalties. We address both group selection and bi-level selection methods. We describe several applications of these methods in nonparametric additive models, semiparametric regression, seemingly unrelated regressions, genomic data analysis and genome wide association studies. We also highlight some issues that require further study. PMID:24174707

  18. Bayesian structured additive regression modeling of epidemic data: application to cholera

    PubMed Central

    2012-01-01

    Background A significant interest in spatial epidemiology lies in identifying associated risk factors which enhances the risk of infection. Most studies, however, make no, or limited use of the spatial structure of the data, as well as possible nonlinear effects of the risk factors. Methods We develop a Bayesian Structured Additive Regression model for cholera epidemic data. Model estimation and inference is based on fully Bayesian approach via Markov Chain Monte Carlo (MCMC) simulations. The model is applied to cholera epidemic data in the Kumasi Metropolis, Ghana. Proximity to refuse dumps, density of refuse dumps, and proximity to potential cholera reservoirs were modeled as continuous functions; presence of slum settlers and population density were modeled as fixed effects, whereas spatial references to the communities were modeled as structured and unstructured spatial effects. Results We observe that the risk of cholera is associated with slum settlements and high population density. The risk of cholera is equal and lower for communities with fewer refuse dumps, but variable and higher for communities with more refuse dumps. The risk is also lower for communities distant from refuse dumps and potential cholera reservoirs. The results also indicate distinct spatial variation in the risk of cholera infection. Conclusion The study highlights the usefulness of Bayesian semi-parametric regression model analyzing public health data. These findings could serve as novel information to help health planners and policy makers in making effective decisions to control or prevent cholera epidemics. PMID:22866662

  19. Parametric correlation functions to model the structure of permanent environmental (co)variances in milk yield random regression models.

    PubMed

    Bignardi, A B; El Faro, L; Cardoso, V L; Machado, P F; Albuquerque, L G

    2009-09-01

    The objective of the present study was to estimate milk yield genetic parameters applying random regression models and parametric correlation functions combined with a variance function to model animal permanent environmental effects. A total of 152,145 test-day milk yields from 7,317 first lactations of Holstein cows belonging to herds located in the southeastern region of Brazil were analyzed. Test-day milk yields were divided into 44 weekly classes of days in milk. Contemporary groups were defined by herd-test-day comprising a total of 2,539 classes. The model included direct additive genetic, permanent environmental, and residual random effects. The following fixed effects were considered: contemporary group, age of cow at calving (linear and quadratic regressions), and the population average lactation curve modeled by fourth-order orthogonal Legendre polynomial. Additive genetic effects were modeled by random regression on orthogonal Legendre polynomials of days in milk, whereas permanent environmental effects were estimated using a stationary or nonstationary parametric correlation function combined with a variance function of different orders. The structure of residual variances was modeled using a step function containing 6 variance classes. The genetic parameter estimates obtained with the model using a stationary correlation function associated with a variance function to model permanent environmental effects were similar to those obtained with models employing orthogonal Legendre polynomials for the same effect. A model using a sixth-order polynomial for additive effects and a stationary parametric correlation function associated with a seventh-order variance function to model permanent environmental effects would be sufficient for data fitting.

  20. Non-homogeneous hybrid rocket fuel for enhanced regression rates utilizing partial entrainment

    NASA Astrophysics Data System (ADS)

    Boronowsky, Kenny

    A concept was developed and tested to enhance the performance and regression rate of hydroxyl terminated polybutadiene (HTPB), a commonly used hybrid rocket fuel. By adding small nodules of paraffin into the HTPB fuel, a non-homogeneous mixture was created resulting in increased regression rates. The goal was to develop a fuel with a simplified single core geometry and a tailorable regression rate. The new fuel would benefit from the structural stability of HTPB yet not suffer from the large void fraction representative of typical HTPB core geometries. Regression rates were compared between traditional HTPB single core grains, 85% HTPB mixed with 15% (by weight) paraffin cores, 70% HTPB mixed with 30% paraffin cores, and plain paraffin single core grains. Each fuel combination was tested at oxidizer flow rates, ranging from 0.9 - 3.3 g/s of gaseous oxygen, in a small scale hybrid test rocket and average regression rates were measured. While large uncertainties were present in the experimental setup, the overall data showed that the regression rate was enhanced as paraffin concentration increased. While further testing would be required at larger scales of interest, the trends are encouraging. Inclusion of paraffin nodules in the HTPB grain may produce a greater advantage than other more noxious additives in current use. In addition, it may lead to safer rocket motors with higher integrated thrust due to the decreased void fraction.

  1. Group Additivity Determination for Oxygenates, Oxonium Ions, and Oxygen-Containing Carbenium Ions

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Dellon, Lauren D.; Sung, Chun-Yi; Robichaud, David J.

    Bio-oil produced from biomass fast pyrolysis often requires catalytic upgrading to remove oxygen and acidic species over zeolite catalysts. The elementary reactions in the mechanism for this process involve carbenium and oxonium ions. In order to develop a detailed kinetic model for the catalytic upgrading of biomass, rate constants are required for these elementary reactions. The parameters in the Arrhenius equation can be related to thermodynamic properties through structure-reactivity relationships, such as the Evans-Polanyi relationship. For this relationship, enthalpies of formation of each species are required, which can be reasonably estimated using group additivity. However, the literature previously lacked groupmore » additivity values for oxygenates, oxonium ions, and oxygen-containing carbenium ions. In this work, 71 group additivity values for these types of groups were regressed, 65 of which had not been reported previously and six of which were newly estimated based on regression in the context of the 65 new groups. Heats of formation based on atomization enthalpy calculations for a set of reference molecules and isodesmic reactions for a small set of larger species for which experimental data was available were used to demonstrate the accuracy of the Gaussian-4 quantum mechanical method in estimating enthalpies of formation for species involving the moieties of interest. Isodesmic reactions for a total of 195 species were constructed from the reference molecules to calculate enthalpies of formation that were used to regress the group additivity values. The results showed an average deviation of 1.95 kcal/mol between the values calculated from Gaussian-4 and isodesmic reactions versus those calculated from the group additivity values that were newly regressed. Importantly, the new groups enhance the database for group additivity values, especially those involving oxonium ions.« less

  2. Family structure and the treatment of childhood asthma.

    PubMed

    Chen, Alex Y; Escarce, José J

    2008-02-01

    Family structure is known to influence children's behavioral, educational, and cognitive outcomes, and recent studies suggest that family structure affects children's access to health care as well. However, no study has addressed whether family structure is associated with the care children receive for particular conditions or with their physical health outcomes. To assess the effects of family structure on the treatment and outcomes of children with asthma. Our data sources were the 1996-2003 Medical Expenditure Panel Survey (MEPS) and the 2003 National Survey of Children's Health (NSCH). The study samples consisted of children 2-17 years of age with asthma who lived in single-mother or 2-parent families. We assessed the effect of number of parents and number of other children in the household on office visits for asthma and use of asthma medications using negative binomial regression, and we assessed the effect of family structure on the severity of asthma symptoms using binary and ordinal logistic regression. Our regression models adjusted for sociodemographic characteristics, parental experience in child-rearing and in caring for an asthmatic child and, when appropriate, measures of children's health status. Asthmatic children in single-mother families had fewer office visits for asthma and filled fewer prescriptions for controller medications than children with 2 parents. In addition, children living in families with 3 or more other children had fewer office visits and filled fewer prescriptions for reliever and controller medications than children living with no other children. Children from single-mother families had more health difficulties from asthma than children with 2 parents, and children living with 2 or more other children were more likely to have an asthma attack in the past 12 months than children living with no other children. For children with asthma, living with a single mother and the presence of additional children in the household are associated with less treatment for asthma and worse asthma outcomes.

  3. Nitrate removal in stream ecosystems measured by 15N addition experiments: Total uptake

    USGS Publications Warehouse

    Hall, R.O.; Tank, J.L.; Sobota, D.J.; Mulholland, P.J.; O'Brien, J. M.; Dodds, W.K.; Webster, J.R.; Valett, H.M.; Poole, G.C.; Peterson, B.J.; Meyer, J.L.; McDowell, W.H.; Johnson, S.L.; Hamilton, S.K.; Grimm, N. B.; Gregory, S.V.; Dahm, Clifford N.; Cooper, L.W.; Ashkenas, L.R.; Thomas, S.M.; Sheibley, R.W.; Potter, J.D.; Niederlehner, B.R.; Johnson, L.T.; Helton, A.M.; Crenshaw, C.M.; Burgin, A.J.; Bernot, M.J.; Beaulieu, J.J.; Arangob, C.P.

    2009-01-01

    We measured uptake length of 15NO-3 in 72 streams in eight regions across the United States and Puerto Rico to develop quantitative predictive models on controls of NO-3 uptake length. As part of the Lotic Intersite Nitrogen eXperiment II project, we chose nine streams in each region corresponding to natural (reference), suburban-urban, and agricultural land uses. Study streams spanned a range of human land use to maximize variation in NO-3 concentration, geomorphology, and metabolism. We tested a causal model predicting controls on NO-3 uptake length using structural equation modeling. The model included concomitant measurements of ecosystem metabolism, hydraulic parameters, and nitrogen concentration. We compared this structural equation model to multiple regression models which included additional biotic, catchment, and riparian variables. The structural equation model explained 79% of the variation in log uptake length (S Wtot). Uptake length increased with specific discharge (Q/w) and increasing NO-3 concentrations, showing a loss in removal efficiency in streams with high NO-3 concentration. Uptake lengths shortened with increasing gross primary production, suggesting autotrophic assimilation dominated NO-3 removal. The fraction of catchment area as agriculture and suburban-urban land use weakly predicted NO-3 uptake in bivariate regression, and did improve prediction in a set of multiple regression models. Adding land use to the structural equation model showed that land use indirectly affected NO-3 uptake lengths via directly increasing both gross primary production and NO-3 concentration. Gross primary production shortened SWtot, while increasing NO-3 lengthened SWtot resulting in no net effect of land use on NO- 3 removal. ?? 2009.

  4. An example of complex modelling in dentistry using Markov chain Monte Carlo (MCMC) simulation.

    PubMed

    Helfenstein, Ulrich; Menghini, Giorgio; Steiner, Marcel; Murati, Francesca

    2002-09-01

    In the usual regression setting one regression line is computed for a whole data set. In a more complex situation, each person may be observed for example at several points in time and thus a regression line might be calculated for each person. Additional complexities, such as various forms of errors in covariables may make a straightforward statistical evaluation difficult or even impossible. During recent years methods have been developed allowing convenient analysis of problems where the data and the corresponding models show these and many other forms of complexity. The methodology makes use of a Bayesian approach and Markov chain Monte Carlo (MCMC) simulations. The methods allow the construction of increasingly elaborate models by building them up from local sub-models. The essential structure of the models can be represented visually by directed acyclic graphs (DAG). This attractive property allows communication and discussion of the essential structure and the substantial meaning of a complex model without needing algebra. After presentation of the statistical methods an example from dentistry is presented in order to demonstrate their application and use. The dataset of the example had a complex structure; each of a set of children was followed up over several years. The number of new fillings in permanent teeth had been recorded at several ages. The dependent variables were markedly different from the normal distribution and could not be transformed to normality. In addition, explanatory variables were assumed to be measured with different forms of error. Illustration of how the corresponding models can be estimated conveniently via MCMC simulation, in particular, 'Gibbs sampling', using the freely available software BUGS is presented. In addition, how the measurement error may influence the estimates of the corresponding coefficients is explored. It is demonstrated that the effect of the independent variable on the dependent variable may be markedly underestimated if the measurement error is not taken into account ('regression dilution bias'). Markov chain Monte Carlo methods may be of great value to dentists in allowing analysis of data sets which exhibit a wide range of different forms of complexity.

  5. The discovery of indicator variables for QSAR using inductive logic programming

    NASA Astrophysics Data System (ADS)

    King, Ross D.; Srinivasan, Ashwin

    1997-11-01

    A central problem in forming accurate regression equations in QSAR studies isthe selection of appropriate descriptors for the compounds under study. Wedescribe a novel procedure for using inductive logic programming (ILP) todiscover new indicator variables (attributes) for QSAR problems, and show thatthese improve the accuracy of the derived regression equations. ILP techniqueshave previously been shown to work well on drug design problems where thereis a large structural component or where clear comprehensible rules arerequired. However, ILP techniques have had the disadvantage of only being ableto make qualitative predictions (e.g. active, inactive) and not to predictreal numbers (regression). We unify ILP and linear regression techniques togive a QSAR method that has the strength of ILP at describing stericstructure, with the familiarity and power of linear regression. We evaluatedthe utility of this new QSAR technique by examining the prediction ofbiological activity with and without the addition of new structural indicatorvariables formed by ILP. In three out of five datasets examined the additionof ILP variables produced statistically better results (P < 0.01) over theoriginal description. The new ILP variables did not increase the overallcomplexity of the derived QSAR equations and added insight into possiblemechanisms of action. We conclude that ILP can aid in the process of drugdesign.

  6. Application of Boosting Regression Trees to Preliminary Cost Estimation in Building Construction Projects

    PubMed Central

    2015-01-01

    Among the recent data mining techniques available, the boosting approach has attracted a great deal of attention because of its effective learning algorithm and strong boundaries in terms of its generalization performance. However, the boosting approach has yet to be used in regression problems within the construction domain, including cost estimations, but has been actively utilized in other domains. Therefore, a boosting regression tree (BRT) is applied to cost estimations at the early stage of a construction project to examine the applicability of the boosting approach to a regression problem within the construction domain. To evaluate the performance of the BRT model, its performance was compared with that of a neural network (NN) model, which has been proven to have a high performance in cost estimation domains. The BRT model has shown results similar to those of NN model using 234 actual cost datasets of a building construction project. In addition, the BRT model can provide additional information such as the importance plot and structure model, which can support estimators in comprehending the decision making process. Consequently, the boosting approach has potential applicability in preliminary cost estimations in a building construction project. PMID:26339227

  7. Application of Boosting Regression Trees to Preliminary Cost Estimation in Building Construction Projects.

    PubMed

    Shin, Yoonseok

    2015-01-01

    Among the recent data mining techniques available, the boosting approach has attracted a great deal of attention because of its effective learning algorithm and strong boundaries in terms of its generalization performance. However, the boosting approach has yet to be used in regression problems within the construction domain, including cost estimations, but has been actively utilized in other domains. Therefore, a boosting regression tree (BRT) is applied to cost estimations at the early stage of a construction project to examine the applicability of the boosting approach to a regression problem within the construction domain. To evaluate the performance of the BRT model, its performance was compared with that of a neural network (NN) model, which has been proven to have a high performance in cost estimation domains. The BRT model has shown results similar to those of NN model using 234 actual cost datasets of a building construction project. In addition, the BRT model can provide additional information such as the importance plot and structure model, which can support estimators in comprehending the decision making process. Consequently, the boosting approach has potential applicability in preliminary cost estimations in a building construction project.

  8. Influence of professional preparation and class structure on sexuality topics taught in middle and high schools.

    PubMed

    Rhodes, Darson L; Kirchofer, Gregg; Hammig, Bart J; Ogletree, Roberta J

    2013-05-01

    This study examined the impact of professional preparation and class structure on sexuality topics taught and use of practice-based instructional strategies in US middle and high school health classes. Data from the classroom-level file of the 2006 School Health Policies and Programs were used. A series of multivariable logistic regression models were employed to determine if sexuality content taught was dependent on professional preparation and /or class structure (HE only versus HE/another subject combined). Additional multivariable logistic regression models were employed to determine if use of practice-based instructional strategies was dependent upon professional preparation and/or class structure. Years of teaching health topics and size of the school district were included as covariates in the multivariable logistic regression models. Findings indicated professionally prepared health educators were significantly more likely to teach 7 of the 13 sexuality topics as compared to nonprofessionally prepared health educators. There was no statistically significant difference in the instructional strategies used by professionally prepared and nonprofessionally prepared health educators. Exclusively health education classes versus combined classes were significantly more likely to have included 6 of the 13 topics and to have incorporated practice-based instructional strategies in the curricula. This study indicated professional preparation and class structure impacted sexuality content taught. Class structure also impacted whether opportunities for students to practice skills were made available. Results support the need for continued advocacy for professionally prepared health educators and health only courses. © 2013, American School Health Association.

  9. Nonparametric regression applied to quantitative structure-activity relationships

    PubMed

    Constans; Hirst

    2000-03-01

    Several nonparametric regressors have been applied to modeling quantitative structure-activity relationship (QSAR) data. The simplest regressor, the Nadaraya-Watson, was assessed in a genuine multivariate setting. Other regressors, the local linear and the shifted Nadaraya-Watson, were implemented within additive models--a computationally more expedient approach, better suited for low-density designs. Performances were benchmarked against the nonlinear method of smoothing splines. A linear reference point was provided by multilinear regression (MLR). Variable selection was explored using systematic combinations of different variables and combinations of principal components. For the data set examined, 47 inhibitors of dopamine beta-hydroxylase, the additive nonparametric regressors have greater predictive accuracy (as measured by the mean absolute error of the predictions or the Pearson correlation in cross-validation trails) than MLR. The use of principal components did not improve the performance of the nonparametric regressors over use of the original descriptors, since the original descriptors are not strongly correlated. It remains to be seen if the nonparametric regressors can be successfully coupled with better variable selection and dimensionality reduction in the context of high-dimensional QSARs.

  10. Investigating bias in squared regression structure coefficients

    PubMed Central

    Nimon, Kim F.; Zientek, Linda R.; Thompson, Bruce

    2015-01-01

    The importance of structure coefficients and analogs of regression weights for analysis within the general linear model (GLM) has been well-documented. The purpose of this study was to investigate bias in squared structure coefficients in the context of multiple regression and to determine if a formula that had been shown to correct for bias in squared Pearson correlation coefficients and coefficients of determination could be used to correct for bias in squared regression structure coefficients. Using data from a Monte Carlo simulation, this study found that squared regression structure coefficients corrected with Pratt's formula produced less biased estimates and might be more accurate and stable estimates of population squared regression structure coefficients than estimates with no such corrections. While our findings are in line with prior literature that identified multicollinearity as a predictor of bias in squared regression structure coefficients but not coefficients of determination, the findings from this study are unique in that the level of predictive power, number of predictors, and sample size were also observed to contribute bias in squared regression structure coefficients. PMID:26217273

  11. Estimating the prevalence of 26 health-related indicators at neighbourhood level in the Netherlands using structured additive regression.

    PubMed

    van de Kassteele, Jan; Zwakhals, Laurens; Breugelmans, Oscar; Ameling, Caroline; van den Brink, Carolien

    2017-07-01

    Local policy makers increasingly need information on health-related indicators at smaller geographic levels like districts or neighbourhoods. Although more large data sources have become available, direct estimates of the prevalence of a health-related indicator cannot be produced for neighbourhoods for which only small samples or no samples are available. Small area estimation provides a solution, but unit-level models for binary-valued outcomes that can handle both non-linear effects of the predictors and spatially correlated random effects in a unified framework are rarely encountered. We used data on 26 binary-valued health-related indicators collected on 387,195 persons in the Netherlands. We associated the health-related indicators at the individual level with a set of 12 predictors obtained from national registry data. We formulated a structured additive regression model for small area estimation. The model captured potential non-linear relations between the predictors and the outcome through additive terms in a functional form using penalized splines and included a term that accounted for spatially correlated heterogeneity between neighbourhoods. The registry data were used to predict individual outcomes which in turn are aggregated into higher geographical levels, i.e. neighbourhoods. We validated our method by comparing the estimated prevalences with observed prevalences at the individual level and by comparing the estimated prevalences with direct estimates obtained by weighting methods at municipality level. We estimated the prevalence of the 26 health-related indicators for 415 municipalities, 2599 districts and 11,432 neighbourhoods in the Netherlands. We illustrate our method on overweight data and show that there are distinct geographic patterns in the overweight prevalence. Calibration plots show that the estimated prevalences agree very well with observed prevalences at the individual level. The estimated prevalences agree reasonably well with the direct estimates at the municipal level. Structured additive regression is a useful tool to provide small area estimates in a unified framework. We are able to produce valid nationwide small area estimates of 26 health-related indicators at neighbourhood level in the Netherlands. The results can be used for local policy makers to make appropriate health policy decisions.

  12. Left atrial accessory appendages, diverticula, and left-sided septal pouch in multi-slice computed tomography. Association with atrial fibrillation and cerebrovascular accidents.

    PubMed

    Hołda, Mateusz K; Koziej, Mateusz; Wszołek, Karolina; Pawlik, Wiesław; Krawczyk-Ożóg, Agata; Sorysz, Danuta; Łoboda, Piotr; Kuźma, Katarzyna; Kuniewicz, Marcin; Lelakowski, Jacek; Dudek, Dariusz; Klimek-Piotrowska, Wiesława

    2017-10-01

    The aim of this study is to provide a morphometric description of the left-sided septal pouch (LSSP), left atrial accessory appendages, and diverticula using cardiac multi-slice computed tomography (MSCT) and to compare results between patient subgroups. Two hundred and ninety four patients (42.9% females) with a mean of 69.4±13.1years of age were investigated using MSCT. The presence of the LSSP, left atrial accessory appendages, and diverticula was evaluated. Multiple logistic regression analysis was performed to check whether the presence of additional left atrial structures is associated with increased risk of atrial fibrillation and cerebrovascular accidents. At least one additional left atrial structure was present in 51.7% of patients. A single LSSP, left atrial diverticulum, and accessory appendage were present in 35.7%, 16.0%, and 4.1% of patients, respectively. After adjusting for other risk factors via multiple logistic regression, patients with LSSP are more likely to have atrial fibrillation (OR=2.00, 95% CI=1.14-3.48, p=0.01). The presence of a LSSP was found to be associated with an increased risk of transient ischemic attack using multiple logistic regression analysis after adjustment for other risk factors (OR=3.88, 95% CI=1.10-13.69, p=0.03). In conclusion LSSPs, accessory appendages, and diverticula are highly prevalent anatomic structures within the left atrium, which could be easily identified by MSCT. The presence of LSSP is associated with increased risk for atrial fibrillation and transient ischemic attack. Copyright © 2017 Elsevier B.V. All rights reserved.

  13. Doubly Robust Additive Hazards Models to Estimate Effects of a Continuous Exposure on Survival.

    PubMed

    Wang, Yan; Lee, Mihye; Liu, Pengfei; Shi, Liuhua; Yu, Zhi; Abu Awad, Yara; Zanobetti, Antonella; Schwartz, Joel D

    2017-11-01

    The effect of an exposure on survival can be biased when the regression model is misspecified. Hazard difference is easier to use in risk assessment than hazard ratio and has a clearer interpretation in the assessment of effect modifications. We proposed two doubly robust additive hazards models to estimate the causal hazard difference of a continuous exposure on survival. The first model is an inverse probability-weighted additive hazards regression. The second model is an extension of the doubly robust estimator for binary exposures by categorizing the continuous exposure. We compared these with the marginal structural model and outcome regression with correct and incorrect model specifications using simulations. We applied doubly robust additive hazard models to the estimation of hazard difference of long-term exposure to PM2.5 (particulate matter with an aerodynamic diameter less than or equal to 2.5 microns) on survival using a large cohort of 13 million older adults residing in seven states of the Southeastern United States. We showed that the proposed approaches are doubly robust. We found that each 1 μg m increase in annual PM2.5 exposure was associated with a causal hazard difference in mortality of 8.0 × 10 (95% confidence interval 7.4 × 10, 8.7 × 10), which was modified by age, medical history, socioeconomic status, and urbanicity. The overall hazard difference translates to approximately 5.5 (5.1, 6.0) thousand deaths per year in the study population. The proposed approaches improve the robustness of the additive hazards model and produce a novel additive causal estimate of PM2.5 on survival and several additive effect modifications, including social inequality.

  14. Exploring students' patterns of reasoning

    NASA Astrophysics Data System (ADS)

    Matloob Haghanikar, Mojgan

    As part of a collaborative study of the science preparation of elementary school teachers, we investigated the quality of students' reasoning and explored the relationship between sophistication of reasoning and the degree to which the courses were considered inquiry oriented. To probe students' reasoning, we developed open-ended written content questions with the distinguishing feature of applying recently learned concepts in a new context. We devised a protocol for developing written content questions that provided a common structure for probing and classifying students' sophistication level of reasoning. In designing our protocol, we considered several distinct criteria, and classified students' responses based on their performance for each criterion. First, we classified concepts into three types: Descriptive, Hypothetical, and Theoretical and categorized the abstraction levels of the responses in terms of the types of concepts and the inter-relationship between the concepts. Second, we devised a rubric based on Bloom's revised taxonomy with seven traits (both knowledge types and cognitive processes) and a defined set of criteria to evaluate each trait. Along with analyzing students' reasoning, we visited universities and observed the courses in which the students were enrolled. We used the Reformed Teaching Observation Protocol (RTOP) to rank the courses with respect to characteristics that are valued for the inquiry courses. We conducted logistic regression for a sample of 18courses with about 900 students and reported the results for performing logistic regression to estimate the relationship between traits of reasoning and RTOP score. In addition, we analyzed conceptual structure of students' responses, based on conceptual classification schemes, and clustered students' responses into six categories. We derived regression model, to estimate the relationship between the sophistication of the categories of conceptual structure and RTOP scores. However, the outcome variable with six categories required a more complicated regression model, known as multinomial logistic regression, generalized from binary logistic regression. With the large amount of collected data, we found that the likelihood of the higher cognitive processes were in favor of classes with higher measures on inquiry. However, the usage of more abstract concepts with higher order conceptual structures was less prevalent in higher RTOP courses.

  15. Support vector regression-guided unravelling: antioxidant capacity and quantitative structure-activity relationship predict reduction and promotion effects of flavonoids on acrylamide formation

    PubMed Central

    Huang, Mengmeng; Wei, Yan; Wang, Jun; Zhang, Yu

    2016-01-01

    We used the support vector regression (SVR) approach to predict and unravel reduction/promotion effect of characteristic flavonoids on the acrylamide formation under a low-moisture Maillard reaction system. Results demonstrated the reduction/promotion effects by flavonoids at addition levels of 1–10000 μmol/L. The maximal inhibition rates (51.7%, 68.8% and 26.1%) and promote rates (57.7%, 178.8% and 27.5%) caused by flavones, flavonols and isoflavones were observed at addition levels of 100 μmol/L and 10000 μmol/L, respectively. The reduction/promotion effects were closely related to the change of trolox equivalent antioxidant capacity (ΔTEAC) and well predicted by triple ΔTEAC measurements via SVR models (R: 0.633–0.900). Flavonols exhibit stronger effects on the acrylamide formation than flavones and isoflavones as well as their O-glycosides derivatives, which may be attributed to the number and position of phenolic and 3-enolic hydroxyls. The reduction/promotion effects were well predicted by using optimized quantitative structure-activity relationship (QSAR) descriptors and SVR models (R: 0.926–0.994). Compared to artificial neural network and multi-linear regression models, SVR models exhibited better fitting performance for both TEAC-dependent and QSAR descriptor-dependent predicting work. These observations demonstrated that the SVR models are competent for predicting our understanding on the future use of natural antioxidants for decreasing the acrylamide formation. PMID:27586851

  16. Support vector regression-guided unravelling: antioxidant capacity and quantitative structure-activity relationship predict reduction and promotion effects of flavonoids on acrylamide formation

    NASA Astrophysics Data System (ADS)

    Huang, Mengmeng; Wei, Yan; Wang, Jun; Zhang, Yu

    2016-09-01

    We used the support vector regression (SVR) approach to predict and unravel reduction/promotion effect of characteristic flavonoids on the acrylamide formation under a low-moisture Maillard reaction system. Results demonstrated the reduction/promotion effects by flavonoids at addition levels of 1-10000 μmol/L. The maximal inhibition rates (51.7%, 68.8% and 26.1%) and promote rates (57.7%, 178.8% and 27.5%) caused by flavones, flavonols and isoflavones were observed at addition levels of 100 μmol/L and 10000 μmol/L, respectively. The reduction/promotion effects were closely related to the change of trolox equivalent antioxidant capacity (ΔTEAC) and well predicted by triple ΔTEAC measurements via SVR models (R: 0.633-0.900). Flavonols exhibit stronger effects on the acrylamide formation than flavones and isoflavones as well as their O-glycosides derivatives, which may be attributed to the number and position of phenolic and 3-enolic hydroxyls. The reduction/promotion effects were well predicted by using optimized quantitative structure-activity relationship (QSAR) descriptors and SVR models (R: 0.926-0.994). Compared to artificial neural network and multi-linear regression models, SVR models exhibited better fitting performance for both TEAC-dependent and QSAR descriptor-dependent predicting work. These observations demonstrated that the SVR models are competent for predicting our understanding on the future use of natural antioxidants for decreasing the acrylamide formation.

  17. QSRR modeling for diverse drugs using different feature selection methods coupled with linear and nonlinear regressions.

    PubMed

    Goodarzi, Mohammad; Jensen, Richard; Vander Heyden, Yvan

    2012-12-01

    A Quantitative Structure-Retention Relationship (QSRR) is proposed to estimate the chromatographic retention of 83 diverse drugs on a Unisphere poly butadiene (PBD) column, using isocratic elutions at pH 11.7. Previous work has generated QSRR models for them using Classification And Regression Trees (CART). In this work, Ant Colony Optimization is used as a feature selection method to find the best molecular descriptors from a large pool. In addition, several other selection methods have been applied, such as Genetic Algorithms, Stepwise Regression and the Relief method, not only to evaluate Ant Colony Optimization as a feature selection method but also to investigate its ability to find the important descriptors in QSRR. Multiple Linear Regression (MLR) and Support Vector Machines (SVMs) were applied as linear and nonlinear regression methods, respectively, giving excellent correlation between the experimental, i.e. extrapolated to a mobile phase consisting of pure water, and predicted logarithms of the retention factors of the drugs (logk(w)). The overall best model was the SVM one built using descriptors selected by ACO. Copyright © 2012 Elsevier B.V. All rights reserved.

  18. Bivariate least squares linear regression: Towards a unified analytic formalism. I. Functional models

    NASA Astrophysics Data System (ADS)

    Caimmi, R.

    2011-08-01

    Concerning bivariate least squares linear regression, the classical approach pursued for functional models in earlier attempts ( York, 1966, 1969) is reviewed using a new formalism in terms of deviation (matrix) traces which, for unweighted data, reduce to usual quantities leaving aside an unessential (but dimensional) multiplicative factor. Within the framework of classical error models, the dependent variable relates to the independent variable according to the usual additive model. The classes of linear models considered are regression lines in the general case of correlated errors in X and in Y for weighted data, and in the opposite limiting situations of (i) uncorrelated errors in X and in Y, and (ii) completely correlated errors in X and in Y. The special case of (C) generalized orthogonal regression is considered in detail together with well known subcases, namely: (Y) errors in X negligible (ideally null) with respect to errors in Y; (X) errors in Y negligible (ideally null) with respect to errors in X; (O) genuine orthogonal regression; (R) reduced major-axis regression. In the limit of unweighted data, the results determined for functional models are compared with their counterparts related to extreme structural models i.e. the instrumental scatter is negligible (ideally null) with respect to the intrinsic scatter ( Isobe et al., 1990; Feigelson and Babu, 1992). While regression line slope and intercept estimators for functional and structural models necessarily coincide, the contrary holds for related variance estimators even if the residuals obey a Gaussian distribution, with the exception of Y models. An example of astronomical application is considered, concerning the [O/H]-[Fe/H] empirical relations deduced from five samples related to different stars and/or different methods of oxygen abundance determination. For selected samples and assigned methods, different regression models yield consistent results within the errors (∓ σ) for both heteroscedastic and homoscedastic data. Conversely, samples related to different methods produce discrepant results, due to the presence of (still undetected) systematic errors, which implies no definitive statement can be made at present. A comparison is also made between different expressions of regression line slope and intercept variance estimators, where fractional discrepancies are found to be not exceeding a few percent, which grows up to about 20% in the presence of large dispersion data. An extension of the formalism to structural models is left to a forthcoming paper.

  19. Peroxiredoxin 2 regulates PGF2α-induced corpus luteum regression in mice by inhibiting ROS-dependent JNK activation.

    PubMed

    Park, Sun-Ji; Kim, Jung-Hak; Kim, Tae-Shin; Lee, Sang-Rae; Park, Jeen-Woo; Lee, Seunghoon; Kim, Jin-Man; Lee, Dong-Seok

    2017-07-01

    Luteal regression is a natural and necessary event to regulate the reproductive process in all mammals. Prostaglandin F2α (PGF2α) is the main factor that causes functional and structural regression of the corpus luteum (CL). It is well known that PGF2α-mediated ROS generation is closely involved in luteal regression. Peroxiredoxin 2 (Prx2) as an antioxidant enzyme plays a protective role against oxidative stress-induced cell death. However, the effect of Prx2 on PGF2α-induced luteal regression has not been reported. Here, we investigated the role of Prx2 in functional and structural CL regression induced by PGF2α-mediated ROS using Prx2-deficient (-/-) mice. We found that PGF2α-induced ROS generation was significantly higher in Prx2-/- MEF cells compared with that in wild-type (WT) cells, which induced apoptosis by activating JNK-mediated apoptotic signaling pathway. Also, PGF2α treatment in the CL derived from Prx2-/- mice promoted the reduction of steroidogenic enzyme expression and the activation of JNK and caspase3. Compared to WT mice, serum progesterone levels and luteal expression of steroidogenic enzymes decreased more rapidly whereas JNK and caspase3 activations were significantly increased in Prx2-/- mice injected with PGF2α. However, the impaired steroidogenesis and PGF2α-induced JNK-dependent apoptosis were rescued by the addition of the antioxidant N-acetyl-L-cysteine (NAC). This is the first study to demonstrate that Prx2 deficiency ultimately accelerated the PGF2α-induced luteal regression through activation of the ROS-dependent JNK pathway. These findings suggest that Prx2 plays a crucial role in preventing accelerated luteal regression via inhibition of the ROS/JNK pathway. Copyright © 2017 Elsevier Inc. All rights reserved.

  20. Multiple-Instance Regression with Structured Data

    NASA Technical Reports Server (NTRS)

    Wagstaff, Kiri L.; Lane, Terran; Roper, Alex

    2008-01-01

    We present a multiple-instance regression algorithm that models internal bag structure to identify the items most relevant to the bag labels. Multiple-instance regression (MIR) operates on a set of bags with real-valued labels, each containing a set of unlabeled items, in which the relevance of each item to its bag label is unknown. The goal is to predict the labels of new bags from their contents. Unlike previous MIR methods, MI-ClusterRegress can operate on bags that are structured in that they contain items drawn from a number of distinct (but unknown) distributions. MI-ClusterRegress simultaneously learns a model of the bag's internal structure, the relevance of each item, and a regression model that accurately predicts labels for new bags. We evaluated this approach on the challenging MIR problem of crop yield prediction from remote sensing data. MI-ClusterRegress provided predictions that were more accurate than those obtained with non-multiple-instance approaches or MIR methods that do not model the bag structure.

  1. Predicting tree species presence and basal area in Utah: A comparison of stochastic gradient boosting, generalized additive models, and tree-based methods

    Treesearch

    Gretchen G. Moisen; Elizabeth A. Freeman; Jock A. Blackard; Tracey S. Frescino; Niklaus E. Zimmermann; Thomas C. Edwards

    2006-01-01

    Many efforts are underway to produce broad-scale forest attribute maps by modelling forest class and structure variables collected in forest inventories as functions of satellite-based and biophysical information. Typically, variants of classification and regression trees implemented in Rulequest's© See5 and Cubist (for binary and continuous responses,...

  2. Use of Forest Inventory and Analysis information in wildlife habitat modeling: a process for linking multiple scales

    Treesearch

    Thomas C. Edwards; Gretchen G. Moisen; Tracey S. Frescino; Joshua L. Lawler

    2002-01-01

    We describe our collective efforts to develop and apply methods for using FIA data to model forest resources and wildlife habitat. Our work demonstrates how flexible regression techniques, such as generalized additive models, can be linked with spatially explicit environmental information for the mapping of forest type and structure. We illustrate how these maps of...

  3. The annealing helicase and branch migration activities of Drosophila HARP.

    PubMed

    Kassavetis, George A; Kadonaga, James T

    2014-01-01

    HARP (SMARCAL1, MARCAL1) is an annealing helicase that functions in the repair and restart of damaged DNA replication forks through its DNA branch migration and replication fork regression activities. HARP is conserved among metazoans. HARP from invertebrates differs by the absence of one of the two HARP-specific domain repeats found in vertebrates. The annealing helicase and branch migration activity of invertebrate HARP has not been documented. We found that HARP from Drosophila melanogaster retains the annealing helicase activity of human HARP, the ability to disrupt D-loops and to branch migrate Holliday junctions, but fails to regress model DNA replication fork structures. A comparison of human and Drosophila HARP on additional substrates revealed that both HARPs are competent in branch migrating a bidirectional replication bubble composed of either DNA:DNA or RNA:DNA hybrid. Human, but not Drosophila, HARP is also capable of regressing a replication fork structure containing a highly stable poly rG:dC hybrid. Persistent RNA:DNA hybrids in vivo can lead to replication fork arrest and genome instability. The ability of HARP to strand transfer hybrids may signify a hybrid removal function for this enzyme, in vivo.

  4. A structured sparse regression method for estimating isoform expression level from multi-sample RNA-seq data.

    PubMed

    Zhang, L; Liu, X J

    2016-06-03

    With the rapid development of next-generation high-throughput sequencing technology, RNA-seq has become a standard and important technique for transcriptome analysis. For multi-sample RNA-seq data, the existing expression estimation methods usually deal with each single-RNA-seq sample, and ignore that the read distributions are consistent across multiple samples. In the current study, we propose a structured sparse regression method, SSRSeq, to estimate isoform expression using multi-sample RNA-seq data. SSRSeq uses a non-parameter model to capture the general tendency of non-uniformity read distribution for all genes across multiple samples. Additionally, our method adds a structured sparse regularization, which not only incorporates the sparse specificity between a gene and its corresponding isoform expression levels, but also reduces the effects of noisy reads, especially for lowly expressed genes and isoforms. Four real datasets were used to evaluate our method on isoform expression estimation. Compared with other popular methods, SSRSeq reduced the variance between multiple samples, and produced more accurate isoform expression estimations, and thus more meaningful biological interpretations.

  5. Predicting β-Turns in Protein Using Kernel Logistic Regression

    PubMed Central

    Elbashir, Murtada Khalafallah; Sheng, Yu; Wang, Jianxin; Wu, FangXiang; Li, Min

    2013-01-01

    A β-turn is a secondary protein structure type that plays a significant role in protein configuration and function. On average 25% of amino acids in protein structures are located in β-turns. It is very important to develope an accurate and efficient method for β-turns prediction. Most of the current successful β-turns prediction methods use support vector machines (SVMs) or neural networks (NNs). The kernel logistic regression (KLR) is a powerful classification technique that has been applied successfully in many classification problems. However, it is often not found in β-turns classification, mainly because it is computationally expensive. In this paper, we used KLR to obtain sparse β-turns prediction in short evolution time. Secondary structure information and position-specific scoring matrices (PSSMs) are utilized as input features. We achieved Q total of 80.7% and MCC of 50% on BT426 dataset. These results show that KLR method with the right algorithm can yield performance equivalent to or even better than NNs and SVMs in β-turns prediction. In addition, KLR yields probabilistic outcome and has a well-defined extension to multiclass case. PMID:23509793

  6. Predicting β-turns in protein using kernel logistic regression.

    PubMed

    Elbashir, Murtada Khalafallah; Sheng, Yu; Wang, Jianxin; Wu, Fangxiang; Li, Min

    2013-01-01

    A β-turn is a secondary protein structure type that plays a significant role in protein configuration and function. On average 25% of amino acids in protein structures are located in β-turns. It is very important to develope an accurate and efficient method for β-turns prediction. Most of the current successful β-turns prediction methods use support vector machines (SVMs) or neural networks (NNs). The kernel logistic regression (KLR) is a powerful classification technique that has been applied successfully in many classification problems. However, it is often not found in β-turns classification, mainly because it is computationally expensive. In this paper, we used KLR to obtain sparse β-turns prediction in short evolution time. Secondary structure information and position-specific scoring matrices (PSSMs) are utilized as input features. We achieved Q total of 80.7% and MCC of 50% on BT426 dataset. These results show that KLR method with the right algorithm can yield performance equivalent to or even better than NNs and SVMs in β-turns prediction. In addition, KLR yields probabilistic outcome and has a well-defined extension to multiclass case.

  7. Estimating standard errors in feature network models.

    PubMed

    Frank, Laurence E; Heiser, Willem J

    2007-05-01

    Feature network models are graphical structures that represent proximity data in a discrete space while using the same formalism that is the basis of least squares methods employed in multidimensional scaling. Existing methods to derive a network model from empirical data only give the best-fitting network and yield no standard errors for the parameter estimates. The additivity properties of networks make it possible to consider the model as a univariate (multiple) linear regression problem with positivity restrictions on the parameters. In the present study, both theoretical and empirical standard errors are obtained for the constrained regression parameters of a network model with known features. The performance of both types of standard error is evaluated using Monte Carlo techniques.

  8. Methods for estimating population density in data-limited areas: evaluating regression and tree-based models in Peru.

    PubMed

    Anderson, Weston; Guikema, Seth; Zaitchik, Ben; Pan, William

    2014-01-01

    Obtaining accurate small area estimates of population is essential for policy and health planning but is often difficult in countries with limited data. In lieu of available population data, small area estimate models draw information from previous time periods or from similar areas. This study focuses on model-based methods for estimating population when no direct samples are available in the area of interest. To explore the efficacy of tree-based models for estimating population density, we compare six different model structures including Random Forest and Bayesian Additive Regression Trees. Results demonstrate that without information from prior time periods, non-parametric tree-based models produced more accurate predictions than did conventional regression methods. Improving estimates of population density in non-sampled areas is important for regions with incomplete census data and has implications for economic, health and development policies.

  9. Methods for Estimating Population Density in Data-Limited Areas: Evaluating Regression and Tree-Based Models in Peru

    PubMed Central

    Anderson, Weston; Guikema, Seth; Zaitchik, Ben; Pan, William

    2014-01-01

    Obtaining accurate small area estimates of population is essential for policy and health planning but is often difficult in countries with limited data. In lieu of available population data, small area estimate models draw information from previous time periods or from similar areas. This study focuses on model-based methods for estimating population when no direct samples are available in the area of interest. To explore the efficacy of tree-based models for estimating population density, we compare six different model structures including Random Forest and Bayesian Additive Regression Trees. Results demonstrate that without information from prior time periods, non-parametric tree-based models produced more accurate predictions than did conventional regression methods. Improving estimates of population density in non-sampled areas is important for regions with incomplete census data and has implications for economic, health and development policies. PMID:24992657

  10. Formulation and Testing of Paraffin-Based Solid Fuels Containing Energetic Additives for Hybrid Rockets

    NASA Technical Reports Server (NTRS)

    Larson, Daniel B.; Boyer, Eric; Wachs,Trevor; Kuo, Kenneth K.; Story, George

    2012-01-01

    Many approaches have been considered in an effort to improve the regression rate of solid fuels for hybrid rocket applications. One promising method is to use a fuel with a fast burning rate such as paraffin wax; however, additional performance increases to the fuel regression rate are necessary to make the fuel a viable candidate to replace current launch propulsion systems. The addition of energetic and/or nano-sized particles is one way to increase mass-burning rates of the solid fuels and increase the overall performance of the hybrid rocket motor.1,2 Several paraffin-based fuel grains with various energetic additives (e.g., lithium aluminum hydride (LiAlH4) have been cast in an attempt to improve regression rates. There are two major advantages to introducing LiAlH4 additive into the solid fuel matrix: 1) the increased characteristic velocity, 2) decreased dependency of Isp on oxidizer-to-fuel ratio. The testing and characterization of these solid-fuel grains have shown that continued work is necessary to eliminate unburned/unreacted fuel in downstream sections of the test apparatus.3 Changes to the fuel matrix include higher melting point wax and smaller energetic additive particles. The reduction in particle size through various methods can result in more homogeneous grain structure. The higher melting point wax can serve to reduce the melt-layer thickness, allowing the LiAlH4 particles to react closer to the burning surface, thus increasing the heat feedback rate and fuel regression rate. In addition to the formulation of LiAlH4 and paraffin wax solid-fuel grains, liquid additives of triethylaluminum and diisobutylaluminum hydride will be included in this study. Another promising fuel formulation consideration is to incorporate a small percentage of RDX as an additive to paraffin. A novel casting technique will be used by dissolving RDX in a solvent to crystallize the energetic additive. After dissolving the RDX in a solvent chosen for its compatibility with both paraffin and RDX, the mixture will be combined with the melted paraffin. With the melting point of the paraffin far below the decomposition temperature of the RDX, the solvent will be boiled off, leaving the crystallized RDX embedded in the paraffin. At low percentages of RDX additive and with crystallized RDX surrounded by paraffin, the fuel grains will remain inert, maintaining a key benefit of hybrids in the safety of the solid fuel.

  11. Estimating interaction on an additive scale between continuous determinants in a logistic regression model.

    PubMed

    Knol, Mirjam J; van der Tweel, Ingeborg; Grobbee, Diederick E; Numans, Mattijs E; Geerlings, Mirjam I

    2007-10-01

    To determine the presence of interaction in epidemiologic research, typically a product term is added to the regression model. In linear regression, the regression coefficient of the product term reflects interaction as departure from additivity. However, in logistic regression it refers to interaction as departure from multiplicativity. Rothman has argued that interaction estimated as departure from additivity better reflects biologic interaction. So far, literature on estimating interaction on an additive scale using logistic regression only focused on dichotomous determinants. The objective of the present study was to provide the methods to estimate interaction between continuous determinants and to illustrate these methods with a clinical example. and results From the existing literature we derived the formulas to quantify interaction as departure from additivity between one continuous and one dichotomous determinant and between two continuous determinants using logistic regression. Bootstrapping was used to calculate the corresponding confidence intervals. To illustrate the theory with an empirical example, data from the Utrecht Health Project were used, with age and body mass index as risk factors for elevated diastolic blood pressure. The methods and formulas presented in this article are intended to assist epidemiologists to calculate interaction on an additive scale between two variables on a certain outcome. The proposed methods are included in a spreadsheet which is freely available at: http://www.juliuscenter.nl/additive-interaction.xls.

  12. Forest biomass, canopy structure, and species composition relationships with multipolarization L-band synthetic aperture radar data

    NASA Technical Reports Server (NTRS)

    Sader, Steven A.

    1987-01-01

    The effect of forest biomass, canopy structure, and species composition on L-band synthetic aperature radar data at 44 southern Mississippi bottomland hardwood and pine-hardwood forest sites was investigated. Cross-polarization mean digital values for pine forests were significantly correlated with green weight biomass and stand structure. Multiple linear regression with five forest structure variables provided a better integrated measure of canopy roughness and produced highly significant correlation coefficients for hardwood forests using HV/VV ratio only. Differences in biomass levels and canopy structure, including branching patterns and vertical canopy stratification, were important sources of volume scatter affecting multipolarization radar data. Standardized correction techniques and calibration of aircraft data, in addition to development of canopy models, are recommended for future investigations of forest biomass and structure using synthetic aperture radar.

  13. Complex regression Doppler optical coherence tomography

    NASA Astrophysics Data System (ADS)

    Elahi, Sahar; Gu, Shi; Thrane, Lars; Rollins, Andrew M.; Jenkins, Michael W.

    2018-04-01

    We introduce a new method to measure Doppler shifts more accurately and extend the dynamic range of Doppler optical coherence tomography (OCT). The two-point estimate of the conventional Doppler method is replaced with a regression that is applied to high-density B-scans in polar coordinates. We built a high-speed OCT system using a 1.68-MHz Fourier domain mode locked laser to acquire high-density B-scans (16,000 A-lines) at high enough frame rates (˜100 fps) to accurately capture the dynamics of the beating embryonic heart. Flow phantom experiments confirm that the complex regression lowers the minimum detectable velocity from 12.25 mm / s to 374 μm / s, whereas the maximum velocity of 400 mm / s is measured without phase wrapping. Complex regression Doppler OCT also demonstrates higher accuracy and precision compared with the conventional method, particularly when signal-to-noise ratio is low. The extended dynamic range allows monitoring of blood flow over several stages of development in embryos without adjusting the imaging parameters. In addition, applying complex averaging recovers hidden features in structural images.

  14. Can additive measures add to an intersectional understanding? Experiences of gay and ethnic discrimination among HIV-positive Latino gay men.

    PubMed

    Reisen, Carol A; Brooks, Kelly D; Zea, Maria Cecilia; Poppen, Paul J; Bianchi, Fernanda T

    2013-04-01

    The current study investigated a methodological question of whether traditional, additive, quantitative data can be used to address intersectional issues, and illustrated such an approach with a sample of 301 HIV-positive, Latino gay men in the United States. Participants were surveyed using A-CASI. Hierarchical logistic set regression investigated the role of sets of variables reflecting demographic characteristics, gender nonconformity, and gay and ethnic discrimination in relation to depression and gay collective identity. Results showed the discrimination set was related to depression and to gay collective identity, as was gender nonconformity. Follow-up logistic regression showed that both types of discrimination were associated with greater depression, but gender nonconformity was not. Gay discrimination and gender nonconformity were positively associated with gay collective identity, whereas ethnic discrimination was negatively associated. Results are discussed in terms of the use of traditional quantitative data as a potential means of understanding intersectional issues, as well as of contributing to knowledge about individuals facing multiple structural inequalities.

  15. A Permutation Approach for Selecting the Penalty Parameter in Penalized Model Selection

    PubMed Central

    Sabourin, Jeremy A; Valdar, William; Nobel, Andrew B

    2015-01-01

    Summary We describe a simple, computationally effcient, permutation-based procedure for selecting the penalty parameter in LASSO penalized regression. The procedure, permutation selection, is intended for applications where variable selection is the primary focus, and can be applied in a variety of structural settings, including that of generalized linear models. We briefly discuss connections between permutation selection and existing theory for the LASSO. In addition, we present a simulation study and an analysis of real biomedical data sets in which permutation selection is compared with selection based on the following: cross-validation (CV), the Bayesian information criterion (BIC), Scaled Sparse Linear Regression, and a selection method based on recently developed testing procedures for the LASSO. PMID:26243050

  16. Error Covariance Penalized Regression: A novel multivariate model combining penalized regression with multivariate error structure.

    PubMed

    Allegrini, Franco; Braga, Jez W B; Moreira, Alessandro C O; Olivieri, Alejandro C

    2018-06-29

    A new multivariate regression model, named Error Covariance Penalized Regression (ECPR) is presented. Following a penalized regression strategy, the proposed model incorporates information about the measurement error structure of the system, using the error covariance matrix (ECM) as a penalization term. Results are reported from both simulations and experimental data based on replicate mid and near infrared (MIR and NIR) spectral measurements. The results for ECPR are better under non-iid conditions when compared with traditional first-order multivariate methods such as ridge regression (RR), principal component regression (PCR) and partial least-squares regression (PLS). Copyright © 2018 Elsevier B.V. All rights reserved.

  17. Towards molecular design using 2D-molecular contour maps obtained from PLS regression coefficients

    NASA Astrophysics Data System (ADS)

    Borges, Cleber N.; Barigye, Stephen J.; Freitas, Matheus P.

    2017-12-01

    The multivariate image analysis descriptors used in quantitative structure-activity relationships are direct representations of chemical structures as they are simply numerical decodifications of pixels forming the 2D chemical images. These MDs have found great utility in the modeling of diverse properties of organic molecules. Given the multicollinearity and high dimensionality of the data matrices generated with the MIA-QSAR approach, modeling techniques that involve the projection of the data space onto orthogonal components e.g. Partial Least Squares (PLS) have been generally used. However, the chemical interpretation of the PLS-based MIA-QSAR models, in terms of the structural moieties affecting the modeled bioactivity has not been straightforward. This work describes the 2D-contour maps based on the PLS regression coefficients, as a means of assessing the relevance of single MIA predictors to the response variable, and thus allowing for the structural, electronic and physicochemical interpretation of the MIA-QSAR models. A sample study to demonstrate the utility of the 2D-contour maps to design novel drug-like molecules is performed using a dataset of some anti-HIV-1 2-amino-6-arylsulfonylbenzonitriles and derivatives, and the inferences obtained are consistent with other reports in the literature. In addition, the different schemes for encoding atomic properties in molecules are discussed and evaluated.

  18. Development of high performance hybrid rocket fuels

    NASA Astrophysics Data System (ADS)

    Zaseck, Christopher R.

    In this document I discuss paraffin fuel combustion and investigate the effects of additives on paraffin entrainment and regression. In general, hybrid rockets offer an economical and safe alternative to standard liquid and solid rockets. However, slow polymeric fuel regression and low combustion efficiency have limited the commercial use of hybrid rockets. Paraffin is a fast burning fuel that has received significant attention in the 2000's and 2010's as a replacement for standard fuels. Paraffin regresses three to four times faster than polymeric fuels due to the entrainment of a surface melt layer. However, further regression rate enhancement over the base paraffin fuel is necessary for widespread hybrid rocket adoption. I use a small scale opposed flow burner to investigate the effect of additives on the combustion of paraffin. Standard additives such as aluminum combust above the flame zone where sufficient oxidizer levels are present. As a result no heat is generated below the flame itself. In small scale opposed burner experiments the effect of limited heat feedback is apparent. Aluminum in particular does not improve the regression of paraffin in the opposed burner. The lack of heat feedback from additive combustion limits the applicability of the opposed burner. In turn, the results obtained in the opposed burner with metal additive loaded hybrid fuels do not match results from hybrid rocket experiments. In addition, nano-scale aluminum increases melt layer viscosity and greatly slows the regression of paraffin in the opposed flow burner. However, the reactive additives improve the regression rate of paraffin in the opposed burner where standard metals do not. At 5 wt.% mechanically activated titanium and carbon (Ti-C) improves the regression rate of paraffin by 47% in the opposed burner. The mechanically activated Ti C likely reacts in or near the melt layer and provides heat feedback below the flame region that results in faster opposed burner regression. In order to examine paraffin/additive combustion in a motor environment, I conducted experiments on well characterized aluminum based additives. In particular, I investigate the influence of aluminum, unpassivated aluminum, milled aluminum/polytetrafluoroethylene (PTFE), and aluminum hydride on the performance of paraffin fuels for hybrid rocket propulsion. I use an optically accessible combustor to examine the performance of the fuel mixtures in terms of characteristic velocity efficiency and regression rate. Each combustor test consumes a 12.7 cm long, 1.9 cm diameter fuel strand under 160 kg/m 2s of oxygen at up to 1.4 MPa. The experimental results indicate that the addition of 5 wt.% 30 mum or 80 nm aluminum to paraffin increases the regression rate by approximately 15% compared to neat paraffin grains. At higher aluminum concentrations and nano-scale particles sizes, the increased melt layer viscosity causes slower regression. Alane and Al/PTFE at 12.5 wt.% increase the regression of paraffin by 21% and 32% respectively. Finally, an aging study indicates that paraffin can protect air and moisture sensitive particles from oxidation. The opposed burner and aluminum/paraffin hybrid rocket experiments show that additives can alter bulk fuel properties, such as viscosity, that regulate entrainment. The general effect of melt layer properties on the entrainment and regression rate of paraffin is not well understood. Improved understanding of how solid additives affect the properties and regression of paraffin is essential to maximize performance. In this document I investigate the effect of melt layer properties on paraffin regression using inert additives. Tests are performed in the optical cylindrical combustor at ˜1 MPa under a gaseous oxygen mass flux of ˜160 kg/m2s. The experiments indicate that the regression rate is proportional to mu0.08rho 0.38kappa0.82. In addition, I explore how to predict fuel viscosity, thermal conductivity, and density prior to testing. Mechanically activated Ti-C and Al/PTFE are examined in the optical combustor. I examine the effect of the reactivity by altering the mill time for the Ti-C and Al/PTFE particles. Mechanical activation of both Ti-C and Al/PTFE improve the regression rate of paraffin more than the unmilled additives. At 12.5 wt.% Al/PTFE milled for 40 minutes regresses 12% faster than the unmilled fuel. Similarly, at 12.5 wt.% 7.5 minute milled Ti C regresses 7% faster than unmilled Ti-C. The reactive particles increase heat transfer to the fuel surface and improve regression. The composition of the combustion products are examined using a particle catcher system in conjunction with visible light and electron microscopy. The exhaust products indicate that the mechanical activation of the Al/PTFE particles cause microexplosions that reduce exhaust particle size. However, the composition of the mechanically activated Al/PTFE products do not indicate more complete combustion. In addition, the mechanically activated and unmilled Ti-C showed no difference in exhaust products.

  19. Family structure and child health outcomes in the United States.

    PubMed

    Bass, Loretta E; Warehime, M Nicole

    2011-01-01

    We use categorical and logistic regression models to investigate the extent that family structure affects children’s health outcomes at age five (i.e., child’s type of health insurance coverage, the use of a routine medical doctor, and report of being in excellent health) using a sample of 4,898 children from the "Fragile Families and Child Well-Being Study." We find that children with married biological parents are most likely to have private health insurance compared with each of three other relationship statuses. With each additional child in the home, a child is less likely to have private insurance compared with no insurance and Medicaid insurance. Children with cohabiting biological parents are less likely to have a routine doctor compared with children of married biological parents, yet having additional children in the household is not associated with having a routine doctor. Children with biological parents who are not romantically involved and those with additional children in the household are less likely to be in excellent health, all else being equal.

  20. Determinants of performance failure in the nursing home industry☆

    PubMed Central

    Zinn, Jacqueline; Mor, Vincent; Feng, Zhanlian; Intrator, Orna

    2013-01-01

    This study investigates the determinants of performance failure in U.S. nursing homes. The sample consisted of 91,168 surveys from 10,901 facilities included in the Online Survey Certification and Reporting system from 1996 to 2005. Failed performance was defined as termination from the Medicare and Medicaid programs. Determinants of performance failure were identified as core structural change (ownership change), peripheral change (related diversification), prior financial and quality of care performance, size and environmental shock (Medicaid case mix reimbursement and prospective payment system introduction). Additional control variables that could contribute to the likelihood of performance failure were included in a cross-sectional time series generalized estimating equation logistic regression model. Our results support the contention, derived from structural inertia theory, that where in an organization’s structure change occurs determines whether it is adaptive or disruptive. In addition, while poor prior financial and quality performance and the introduction of case mix reimbursement increases the risk of failure, larger size is protective, decreasing the likelihood of performance failure. PMID:19128865

  1. Determinants of performance failure in the nursing home industry.

    PubMed

    Zinn, Jacqueline; Mor, Vincent; Feng, Zhanlian; Intrator, Orna

    2009-03-01

    This study investigates the determinants of performance failure in U.S. nursing homes. The sample consisted of 91,168 surveys from 10,901 facilities included in the Online Survey Certification and Reporting system from 1996 to 2005. Failed performance was defined as termination from the Medicare and Medicaid programs. Determinants of performance failure were identified as core structural change (ownership change), peripheral change (related diversification), prior financial and quality of care performance, size and environmental shock (Medicaid case mix reimbursement and prospective payment system introduction). Additional control variables that could contribute to the likelihood of performance failure were included in a cross-sectional time series generalized estimating equation logistic regression model. Our results support the contention, derived from structural inertia theory, that where in an organization's structure change occurs determines whether it is adaptive or disruptive. In addition, while poor prior financial and quality performance and the introduction of case mix reimbursement increases the risk of failure, larger size is protective, decreasing the likelihood of performance failure.

  2. Efficient Online Learning Algorithms Based on LSTM Neural Networks.

    PubMed

    Ergen, Tolga; Kozat, Suleyman Serdar

    2017-09-13

    We investigate online nonlinear regression and introduce novel regression structures based on the long short term memory (LSTM) networks. For the introduced structures, we also provide highly efficient and effective online training methods. To train these novel LSTM-based structures, we put the underlying architecture in a state space form and introduce highly efficient and effective particle filtering (PF)-based updates. We also provide stochastic gradient descent and extended Kalman filter-based updates. Our PF-based training method guarantees convergence to the optimal parameter estimation in the mean square error sense provided that we have a sufficient number of particles and satisfy certain technical conditions. More importantly, we achieve this performance with a computational complexity in the order of the first-order gradient-based methods by controlling the number of particles. Since our approach is generic, we also introduce a gated recurrent unit (GRU)-based approach by directly replacing the LSTM architecture with the GRU architecture, where we demonstrate the superiority of our LSTM-based approach in the sequential prediction task via different real life data sets. In addition, the experimental results illustrate significant performance improvements achieved by the introduced algorithms with respect to the conventional methods over several different benchmark real life data sets.

  3. Predicting residue-wise contact orders in proteins by support vector regression.

    PubMed

    Song, Jiangning; Burrage, Kevin

    2006-10-03

    The residue-wise contact order (RWCO) describes the sequence separations between the residues of interest and its contacting residues in a protein sequence. It is a new kind of one-dimensional protein structure that represents the extent of long-range contacts and is considered as a generalization of contact order. Together with secondary structure, accessible surface area, the B factor, and contact number, RWCO provides comprehensive and indispensable important information to reconstructing the protein three-dimensional structure from a set of one-dimensional structural properties. Accurately predicting RWCO values could have many important applications in protein three-dimensional structure prediction and protein folding rate prediction, and give deep insights into protein sequence-structure relationships. We developed a novel approach to predict residue-wise contact order values in proteins based on support vector regression (SVR), starting from primary amino acid sequences. We explored seven different sequence encoding schemes to examine their effects on the prediction performance, including local sequence in the form of PSI-BLAST profiles, local sequence plus amino acid composition, local sequence plus molecular weight, local sequence plus secondary structure predicted by PSIPRED, local sequence plus molecular weight and amino acid composition, local sequence plus molecular weight and predicted secondary structure, and local sequence plus molecular weight, amino acid composition and predicted secondary structure. When using local sequences with multiple sequence alignments in the form of PSI-BLAST profiles, we could predict the RWCO distribution with a Pearson correlation coefficient (CC) between the predicted and observed RWCO values of 0.55, and root mean square error (RMSE) of 0.82, based on a well-defined dataset with 680 protein sequences. Moreover, by incorporating global features such as molecular weight and amino acid composition we could further improve the prediction performance with the CC to 0.57 and an RMSE of 0.79. In addition, combining the predicted secondary structure by PSIPRED was found to significantly improve the prediction performance and could yield the best prediction accuracy with a CC of 0.60 and RMSE of 0.78, which provided at least comparable performance compared with the other existing methods. The SVR method shows a prediction performance competitive with or at least comparable to the previously developed linear regression-based methods for predicting RWCO values. In contrast to support vector classification (SVC), SVR is very good at estimating the raw value profiles of the samples. The successful application of the SVR approach in this study reinforces the fact that support vector regression is a powerful tool in extracting the protein sequence-structure relationship and in estimating the protein structural profiles from amino acid sequences.

  4. Simultaneous Force Regression and Movement Classification of Fingers via Surface EMG within a Unified Bayesian Framework.

    PubMed

    Baldacchino, Tara; Jacobs, William R; Anderson, Sean R; Worden, Keith; Rowson, Jennifer

    2018-01-01

    This contribution presents a novel methodology for myolectric-based control using surface electromyographic (sEMG) signals recorded during finger movements. A multivariate Bayesian mixture of experts (MoE) model is introduced which provides a powerful method for modeling force regression at the fingertips, while also performing finger movement classification as a by-product of the modeling algorithm. Bayesian inference of the model allows uncertainties to be naturally incorporated into the model structure. This method is tested using data from the publicly released NinaPro database which consists of sEMG recordings for 6 degree-of-freedom force activations for 40 intact subjects. The results demonstrate that the MoE model achieves similar performance compared to the benchmark set by the authors of NinaPro for finger force regression. Additionally, inherent to the Bayesian framework is the inclusion of uncertainty in the model parameters, naturally providing confidence bounds on the force regression predictions. Furthermore, the integrated clustering step allows a detailed investigation into classification of the finger movements, without incurring any extra computational effort. Subsequently, a systematic approach to assessing the importance of the number of electrodes needed for accurate control is performed via sensitivity analysis techniques. A slight degradation in regression performance is observed for a reduced number of electrodes, while classification performance is unaffected.

  5. On the use of log-transformation vs. nonlinear regression for analyzing biological power laws.

    PubMed

    Xiao, Xiao; White, Ethan P; Hooten, Mevin B; Durham, Susan L

    2011-10-01

    Power-law relationships are among the most well-studied functional relationships in biology. Recently the common practice of fitting power laws using linear regression (LR) on log-transformed data has been criticized, calling into question the conclusions of hundreds of studies. It has been suggested that nonlinear regression (NLR) is preferable, but no rigorous comparison of these two methods has been conducted. Using Monte Carlo simulations, we demonstrate that the error distribution determines which method performs better, with NLR better characterizing data with additive, homoscedastic, normal error and LR better characterizing data with multiplicative, heteroscedastic, lognormal error. Analysis of 471 biological power laws shows that both forms of error occur in nature. While previous analyses based on log-transformation appear to be generally valid, future analyses should choose methods based on a combination of biological plausibility and analysis of the error distribution. We provide detailed guidelines and associated computer code for doing so, including a model averaging approach for cases where the error structure is uncertain.

  6. Fundamental Phenomena on Fuel Decomposition and Boundary-Layer Combustion Precesses with Applications to Hybrid Rocket Motors. Part 1; Experimental Investigation

    NASA Technical Reports Server (NTRS)

    Kuo, Kenneth K.; Lu, Yeu-Cherng; Chiaverini, Martin J.; Johnson, David K.; Serin, Nadir; Risha, Grant A.; Merkle, Charles L.; Venkateswaran, Sankaran

    1996-01-01

    This final report summarizes the major findings on the subject of 'Fundamental Phenomena on Fuel Decomposition and Boundary-Layer Combustion Processes with Applications to Hybrid Rocket Motors', performed from 1 April 1994 to 30 June 1996. Both experimental results from Task 1 and theoretical/numerical results from Task 2 are reported here in two parts. Part 1 covers the experimental work performed and describes the test facility setup, data reduction techniques employed, and results of the test firings, including effects of operating conditions and fuel additives on solid fuel regression rate and thermal profiles of the condensed phase. Part 2 concerns the theoretical/numerical work. It covers physical modeling of the combustion processes including gas/surface coupling, and radiation effect on regression rate. The numerical solution of the flowfield structure and condensed phase regression behavior are presented. Experimental data from the test firings were used for numerical model validation.

  7. [The Influence of Subjective Health Status, Post-Traumatic Growth, and Social Support on Successful Aging in Middle-Aged Women].

    PubMed

    Lee, Seung Hee; Jang, Hyung Suk; Yang, Young Hee

    2016-10-01

    This study was done to investigate factors influencing successful aging in middle-aged women. A convenience sample of 103 middle-aged women was selected from the community. Data were collected using a structured questionnaire and analyzed using descriptive statistics, two-sample t-test, one-way ANOVA, Kruskal Wallis test, Pearson correlations, Spearman correlations and multiple regression analysis with the SPSS/WIN 22.0 program. Results of regression analysis showed that significant factors influencing successful aging were post-traumatic growth and social support. This regression model explained 48% of the variance in successful aging. Findings show that the concept 'post-traumatic growth' is an important factor influencing successful aging in middle-aged women. In addition, social support from friends/co-workers had greater influence on successful aging than social support from family. Thus, we need to consider the positive impact of post-traumatic growth and increase the chances of social participation in a successful aging program for middle-aged women.

  8. Sperm Retrieval in Patients with Klinefelter Syndrome: A Skewed Regression Model Analysis.

    PubMed

    Chehrazi, Mohammad; Rahimiforoushani, Abbas; Sabbaghian, Marjan; Nourijelyani, Keramat; Sadighi Gilani, Mohammad Ali; Hoseini, Mostafa; Vesali, Samira; Yaseri, Mehdi; Alizadeh, Ahad; Mohammad, Kazem; Samani, Reza Omani

    2017-01-01

    The most common chromosomal abnormality due to non-obstructive azoospermia (NOA) is Klinefelter syndrome (KS) which occurs in 1-1.72 out of 500-1000 male infants. The probability of retrieving sperm as the outcome could be asymmetrically different between patients with and without KS, therefore logistic regression analysis is not a well-qualified test for this type of data. This study has been designed to evaluate skewed regression model analysis for data collected from microsurgical testicular sperm extraction (micro-TESE) among azoospermic patients with and without non-mosaic KS syndrome. This cohort study compared the micro-TESE outcome between 134 men with classic KS and 537 men with NOA and normal karyotype who were referred to Royan Institute between 2009 and 2011. In addition to our main outcome, which was sperm retrieval, we also used logistic and skewed regression analyses to compare the following demographic and hormonal factors: age, level of follicle stimulating hormone (FSH), luteinizing hormone (LH), and testosterone between the two groups. A comparison of the micro-TESE between the KS and control groups showed a success rate of 28.4% (38/134) for the KS group and 22.2% (119/537) for the control group. In the KS group, a significantly difference (P<0.001) existed between testosterone levels for the successful sperm retrieval group (3.4 ± 0.48 mg/mL) compared to the unsuccessful sperm retrieval group (2.33 ± 0.23 mg/mL). The index for quasi Akaike information criterion (QAIC) had a goodness of fit of 74 for the skewed model which was lower than logistic regression (QAIC=85). According to the results, skewed regression is more efficient in estimating sperm retrieval success when the data from patients with KS are analyzed. This finding should be investigated by conducting additional studies with different data structures.

  9. Methods for estimating the magnitude and frequency of floods for urban and small, rural streams in Georgia, South Carolina, and North Carolina, 2011

    USGS Publications Warehouse

    Feaster, Toby D.; Gotvald, Anthony J.; Weaver, J. Curtis

    2014-01-01

    Reliable estimates of the magnitude and frequency of floods are essential for the design of transportation and water-conveyance structures, flood-insurance studies, and flood-plain management. Such estimates are particularly important in densely populated urban areas. In order to increase the number of streamflow-gaging stations (streamgages) available for analysis, expand the geographical coverage that would allow for application of regional regression equations across State boundaries, and build on a previous flood-frequency investigation of rural U.S Geological Survey streamgages in the Southeast United States, a multistate approach was used to update methods for determining the magnitude and frequency of floods in urban and small, rural streams that are not substantially affected by regulation or tidal fluctuations in Georgia, South Carolina, and North Carolina. The at-site flood-frequency analysis of annual peak-flow data for urban and small, rural streams (through September 30, 2011) included 116 urban streamgages and 32 small, rural streamgages, defined in this report as basins draining less than 1 square mile. The regional regression analysis included annual peak-flow data from an additional 338 rural streamgages previously included in U.S. Geological Survey flood-frequency reports and 2 additional rural streamgages in North Carolina that were not included in the previous Southeast rural flood-frequency investigation for a total of 488 streamgages included in the urban and small, rural regression analysis. The at-site flood-frequency analyses for the urban and small, rural streamgages included the expected moments algorithm, which is a modification of the Bulletin 17B log-Pearson type III method for fitting the statistical distribution to the logarithms of the annual peak flows. Where applicable, the flood-frequency analysis also included low-outlier and historic information. Additionally, the application of a generalized Grubbs-Becks test allowed for the detection of multiple potentially influential low outliers. Streamgage basin characteristics were determined using geographical information system techniques. Initial ordinary least squares regression simulations reduced the number of basin characteristics on the basis of such factors as statistical significance, coefficient of determination, Mallow’s Cp statistic, and ease of measurement of the explanatory variable. Application of generalized least squares regression techniques produced final predictive (regression) equations for estimating the 50-, 20-, 10-, 4-, 2-, 1-, 0.5-, and 0.2-percent annual exceedance probability flows for urban and small, rural ungaged basins for three hydrologic regions (HR1, Piedmont–Ridge and Valley; HR3, Sand Hills; and HR4, Coastal Plain), which previously had been defined from exploratory regression analysis in the Southeast rural flood-frequency investigation. Because of the limited availability of urban streamgages in the Coastal Plain of Georgia, South Carolina, and North Carolina, additional urban streamgages in Florida and New Jersey were used in the regression analysis for this region. Including the urban streamgages in New Jersey allowed for the expansion of the applicability of the predictive equations in the Coastal Plain from 3.5 to 53.5 square miles. Average standard error of prediction for the predictive equations, which is a measure of the average accuracy of the regression equations when predicting flood estimates for ungaged sites, range from 25.0 percent for the 10-percent annual exceedance probability regression equation for the Piedmont–Ridge and Valley region to 73.3 percent for the 0.2-percent annual exceedance probability regression equation for the Sand Hills region.

  10. Homogeneity Pursuit

    PubMed Central

    Ke, Tracy; Fan, Jianqing; Wu, Yichao

    2014-01-01

    This paper explores the homogeneity of coefficients in high-dimensional regression, which extends the sparsity concept and is more general and suitable for many applications. Homogeneity arises when regression coefficients corresponding to neighboring geographical regions or a similar cluster of covariates are expected to be approximately the same. Sparsity corresponds to a special case of homogeneity with a large cluster of known atom zero. In this article, we propose a new method called clustering algorithm in regression via data-driven segmentation (CARDS) to explore homogeneity. New mathematics are provided on the gain that can be achieved by exploring homogeneity. Statistical properties of two versions of CARDS are analyzed. In particular, the asymptotic normality of our proposed CARDS estimator is established, which reveals better estimation accuracy for homogeneous parameters than that without homogeneity exploration. When our methods are combined with sparsity exploration, further efficiency can be achieved beyond the exploration of sparsity alone. This provides additional insights into the power of exploring low-dimensional structures in high-dimensional regression: homogeneity and sparsity. Our results also shed lights on the properties of the fussed Lasso. The newly developed method is further illustrated by simulation studies and applications to real data. Supplementary materials for this article are available online. PMID:26085701

  11. Multiscale regression modeling in mouse supraspinatus tendons reveals that dynamic processes act as mediators in structure-function relationships.

    PubMed

    Connizzo, Brianne K; Adams, Sheila M; Adams, Thomas H; Jawad, Abbas F; Birk, David E; Soslowsky, Louis J

    2016-06-14

    Recent advances in technology have allowed for the measurement of dynamic processes (re-alignment, crimp, deformation, sliding), but only a limited number of studies have investigated their relationship with mechanical properties. The overall objective of this study was to investigate the role of composition, structure, and the dynamic response to load in predicting tendon mechanical properties in a multi-level fashion mimicking native hierarchical collagen structure. Multiple linear regression models were investigated to determine the relationships between composition/structure, dynamic processes, and mechanical properties. Mediation was then used to determine if dynamic processes mediated structure-function relationships. Dynamic processes were strong predictors of mechanical properties. These predictions were location-dependent, with the insertion site utilizing all four dynamic responses and the midsubstance responding primarily with fibril deformation and sliding. In addition, dynamic processes were moderately predicted by composition and structure in a regionally-dependent manner. Finally, dynamic processes were partial mediators of the relationship between composition/structure and mechanical function, and results suggested that mediation is likely shared between multiple dynamic processes. In conclusion, the mechanical properties at the midsubstance of the tendon are controlled primarily by fibril structure and this region responds to load via fibril deformation and sliding. Conversely, the mechanical function at the insertion site is controlled by many other important parameters and the region responds to load via all four dynamic mechanisms. Overall, this study presents a strong foundation on which to design future experimental and modeling efforts in order to fully understand the complex structure-function relationships present in tendon. Copyright © 2016 Elsevier Ltd. All rights reserved.

  12. The role of recognition and interest in physics identity development

    NASA Astrophysics Data System (ADS)

    Lock, Robynne

    2016-03-01

    While the number of students earning bachelor's degrees in physics has increased in recent years, this number has only recently surpassed the peak value of the 1960s. Additionally, the percentage of women earning bachelor's degrees in physics has stagnated for the past 10 years and may even be declining. We use a physics identity framework consisting of three dimensions to understand how students make their initial career decisions at the end of high school and the beginning of college. The three dimensions consist of recognition (perception that teachers, parents, and peers see the student as a ``physics person''), interest (desire to learn more about physics), and performance/competence (perception of abilities to complete physics related tasks and to understand physics). Using data from the Sustainability and Gender in Engineering survey administered to a nationally representative sample of college students, we built a regression model to determine which identity dimensions have the largest effect on physics career choice and a structural equation model to understand how the identity dimensions are related. Additionally, we used regression models to identify teaching strategies that predict each identity dimension.

  13. Using CART to Identify Thresholds and Hierarchies in the Determinants of Funding Decisions.

    PubMed

    Schilling, Chris; Mortimer, Duncan; Dalziel, Kim

    2017-02-01

    There is much interest in understanding decision-making processes that determine funding outcomes for health interventions. We use classification and regression trees (CART) to identify cost-effectiveness thresholds and hierarchies in the determinants of funding decisions. The hierarchical structure of CART is suited to analyzing complex conditional and nonlinear relationships. Our analysis uncovered hierarchies where interventions were grouped according to their type and objective. Cost-effectiveness thresholds varied markedly depending on which group the intervention belonged to: lifestyle-type interventions with a prevention objective had an incremental cost-effectiveness threshold of $2356, suggesting that such interventions need to be close to cost saving or dominant to be funded. For lifestyle-type interventions with a treatment objective, the threshold was much higher at $37,024. Lower down the tree, intervention attributes such as the level of patient contribution and the eligibility for government reimbursement influenced the likelihood of funding within groups of similar interventions. Comparison between our CART models and previously published results demonstrated concurrence with standard regression techniques while providing additional insights regarding the role of the funding environment and the structure of decision-maker preferences.

  14. Random regression analyses using B-spline functions to model growth of Nellore cattle.

    PubMed

    Boligon, A A; Mercadante, M E Z; Lôbo, R B; Baldi, F; Albuquerque, L G

    2012-02-01

    The objective of this study was to estimate (co)variance components using random regression on B-spline functions to weight records obtained from birth to adulthood. A total of 82 064 weight records of 8145 females obtained from the data bank of the Nellore Breeding Program (PMGRN/Nellore Brazil) which started in 1987, were used. The models included direct additive and maternal genetic effects and animal and maternal permanent environmental effects as random. Contemporary group and dam age at calving (linear and quadratic effect) were included as fixed effects, and orthogonal Legendre polynomials of age (cubic regression) were considered as random covariate. The random effects were modeled using B-spline functions considering linear, quadratic and cubic polynomials for each individual segment. Residual variances were grouped in five age classes. Direct additive genetic and animal permanent environmental effects were modeled using up to seven knots (six segments). A single segment with two knots at the end points of the curve was used for the estimation of maternal genetic and maternal permanent environmental effects. A total of 15 models were studied, with the number of parameters ranging from 17 to 81. The models that used B-splines were compared with multi-trait analyses with nine weight traits and to a random regression model that used orthogonal Legendre polynomials. A model fitting quadratic B-splines, with four knots or three segments for direct additive genetic effect and animal permanent environmental effect and two knots for maternal additive genetic effect and maternal permanent environmental effect, was the most appropriate and parsimonious model to describe the covariance structure of the data. Selection for higher weight, such as at young ages, should be performed taking into account an increase in mature cow weight. Particularly, this is important in most of Nellore beef cattle production systems, where the cow herd is maintained on range conditions. There is limited modification of the growth curve of Nellore cattle with respect to the aim of selecting them for rapid growth at young ages while maintaining constant adult weight.

  15. ANALYSIS OF OIL-BEARING CRETACEOUS SANDSTONE HYDROCARBON RESERVOIRS, EXCLUSIVE OF THE DAKOTA SANDSTONE, ON THE JICARILLA APACHE INDIAN RESERVATION, NEW MEXICO

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Jennie Ridgley

    2000-01-21

    An additional 450 wells were added to the structural database; there are now 2550 wells in the database with corrected tops on the Juana Lopez, base of the Bridge Creek Limestone, and datum. This completes the structural data base compilation. Fifteen oil and five gas fields from the Mancos-ElVado interval were evaluated with respect to the newly defined sequence stratigraphic model for this interval. The five gas fields are located away from the structural margins of the deep part of the San Juan Basin. All the fields have characteristics of basin-centered gas and can be considered as continuous gas accumulationsmore » as recently defined by the U.S. Geological Survey. Oil production occurs in thinly interbedded sandstone and shale or in discrete sandstone bodies. Production is both from transgressive and regressive strata as redefined in this study. Oil production is both stratigraphically and structurally controlled with production occurring along the Chaco slope or in steeply west-dipping rocks along the east margin of the basin. The ElVado Sandstone of subsurface usage is redefined to encompass a narrower interval; it appears to be more time correlative with the Dalton Sandstone. Thus, it was deposited as part of a regressive sequence, in contrast to the underlying rock units which were deposited during transgression.« less

  16. Time-varying effect moderation using the structural nested mean model: estimation using inverse-weighted regression with residuals

    PubMed Central

    Almirall, Daniel; Griffin, Beth Ann; McCaffrey, Daniel F.; Ramchand, Rajeev; Yuen, Robert A.; Murphy, Susan A.

    2014-01-01

    This article considers the problem of examining time-varying causal effect moderation using observational, longitudinal data in which treatment, candidate moderators, and possible confounders are time varying. The structural nested mean model (SNMM) is used to specify the moderated time-varying causal effects of interest in a conditional mean model for a continuous response given time-varying treatments and moderators. We present an easy-to-use estimator of the SNMM that combines an existing regression-with-residuals (RR) approach with an inverse-probability-of-treatment weighting (IPTW) strategy. The RR approach has been shown to identify the moderated time-varying causal effects if the time-varying moderators are also the sole time-varying confounders. The proposed IPTW+RR approach provides estimators of the moderated time-varying causal effects in the SNMM in the presence of an additional, auxiliary set of known and measured time-varying confounders. We use a small simulation experiment to compare IPTW+RR versus the traditional regression approach and to compare small and large sample properties of asymptotic versus bootstrap estimators of the standard errors for the IPTW+RR approach. This article clarifies the distinction between time-varying moderators and time-varying confounders. We illustrate the methodology in a case study to assess if time-varying substance use moderates treatment effects on future substance use. PMID:23873437

  17. Inter-class sparsity based discriminative least square regression.

    PubMed

    Wen, Jie; Xu, Yong; Li, Zuoyong; Ma, Zhongli; Xu, Yuanrong

    2018-06-01

    Least square regression is a very popular supervised classification method. However, two main issues greatly limit its performance. The first one is that it only focuses on fitting the input features to the corresponding output labels while ignoring the correlations among samples. The second one is that the used label matrix, i.e., zero-one label matrix is inappropriate for classification. To solve these problems and improve the performance, this paper presents a novel method, i.e., inter-class sparsity based discriminative least square regression (ICS_DLSR), for multi-class classification. Different from other methods, the proposed method pursues that the transformed samples have a common sparsity structure in each class. For this goal, an inter-class sparsity constraint is introduced to the least square regression model such that the margins of samples from the same class can be greatly reduced while those of samples from different classes can be enlarged. In addition, an error term with row-sparsity constraint is introduced to relax the strict zero-one label matrix, which allows the method to be more flexible in learning the discriminative transformation matrix. These factors encourage the method to learn a more compact and discriminative transformation for regression and thus has the potential to perform better than other methods. Extensive experimental results show that the proposed method achieves the best performance in comparison with other methods for multi-class classification. Copyright © 2018 Elsevier Ltd. All rights reserved.

  18. Increases in soil aggregation following phosphorus additions in a tropical premontane forest are not driven by root and arbuscular mycorrhizal fungal abundances

    NASA Astrophysics Data System (ADS)

    Camenzind, Tessa; Papathanasiou, Helena; Foerster, Antje; Dietrich, Karla; Hertel, Dietrich; Homeier, Juergen; Oelmann, Yvonne; Olsson, Pål Axel; Suárez, Juan; Rillig, Matthias

    2015-12-01

    Tropical ecosystems have an important role in global change scenarios, in part because they serve as a large terrestrial carbon pool. Carbon protection is mediated by soil aggregation processes, whereby biotic and abiotic factors influence the formation and stability of aggregates. Nutrient additions may affect soil structure indirectly by simultaneous shifts in biotic factors, mainly roots and fungal hyphae, but also via impacts on abiotic soil properties. Here, we tested the hypothesis that soil aggregation will be affected by nutrient additions primarily via changes in arbuscular mycorrhizal fungal (AMF) hyphae and root length in a pristine tropical forest system. Therefore, the percentage of water-stable macroaggregates (> 250µm) (WSA) and the soil mean weight diameter (MWD) was analyzed, as well as nutrient contents, pH, root length and AMF abundance. Phosphorus additions significantly increased the amount of WSA, which was consistent across two different sampling times. Despite a positive effect of phosphorus additions on extraradical AMF biomass, no relationship between WSA and extra-radical AMF nor roots was revealed by regression analyses, contrary to the proposed hypothesis. These findings emphasize the importance of analyzing soil structure in understudied tropical systems, since it might be affected by increasing nutrient deposition expected in the future.

  19. The Impact of Social Structures on Deviant Behaviors: The Study of 402 High Risk Street Drug Users in Iran.

    PubMed

    Mehrabi, Maryam; Eskandarieh, Sharareh; Khodadost, Mahmoud; Sadeghi, Maneli; Nikfarjam, Ali; Hajebi, Ahmad

    2016-01-01

    This study is a sociological analysis of the three dimensions of social structure including institutional, relational, and embodied structures that have an impact on the individuals' deviant behaviors in the society. The authors used a mix method to analyze the qualitative and quantitative data of 402 high risk abandoned substance users in 2008 in Tehran, capital city of Iran. The leading reasons of substance use were categorized into four fundamental themes as follows: stress, deviant social networks, and low social capital and weak social support sources. In addition, the epidemiology model of regression analysis provides a brief explanation to assess the association between the demographical and etiological variables, and the drug users' deviant behaviors. In sum, substance use is discussed as a deviant behavior pattern which stems from a comorbidity of weak social structures.

  20. Computationally efficient algorithm for Gaussian Process regression in case of structured samples

    NASA Astrophysics Data System (ADS)

    Belyaev, M.; Burnaev, E.; Kapushev, Y.

    2016-04-01

    Surrogate modeling is widely used in many engineering problems. Data sets often have Cartesian product structure (for instance factorial design of experiments with missing points). In such case the size of the data set can be very large. Therefore, one of the most popular algorithms for approximation-Gaussian Process regression-can be hardly applied due to its computational complexity. In this paper a computationally efficient approach for constructing Gaussian Process regression in case of data sets with Cartesian product structure is presented. Efficiency is achieved by using a special structure of the data set and operations with tensors. Proposed algorithm has low computational as well as memory complexity compared to existing algorithms. In this work we also introduce a regularization procedure allowing to take into account anisotropy of the data set and avoid degeneracy of regression model.

  1. Accounting for measurement error in log regression models with applications to accelerated testing.

    PubMed

    Richardson, Robert; Tolley, H Dennis; Evenson, William E; Lunt, Barry M

    2018-01-01

    In regression settings, parameter estimates will be biased when the explanatory variables are measured with error. This bias can significantly affect modeling goals. In particular, accelerated lifetime testing involves an extrapolation of the fitted model, and a small amount of bias in parameter estimates may result in a significant increase in the bias of the extrapolated predictions. Additionally, bias may arise when the stochastic component of a log regression model is assumed to be multiplicative when the actual underlying stochastic component is additive. To account for these possible sources of bias, a log regression model with measurement error and additive error is approximated by a weighted regression model which can be estimated using Iteratively Re-weighted Least Squares. Using the reduced Eyring equation in an accelerated testing setting, the model is compared to previously accepted approaches to modeling accelerated testing data with both simulations and real data.

  2. [Application of SAS macro to evaluated multiplicative and additive interaction in logistic and Cox regression in clinical practices].

    PubMed

    Nie, Z Q; Ou, Y Q; Zhuang, J; Qu, Y J; Mai, J Z; Chen, J M; Liu, X Q

    2016-05-01

    Conditional logistic regression analysis and unconditional logistic regression analysis are commonly used in case control study, but Cox proportional hazard model is often used in survival data analysis. Most literature only refer to main effect model, however, generalized linear model differs from general linear model, and the interaction was composed of multiplicative interaction and additive interaction. The former is only statistical significant, but the latter has biological significance. In this paper, macros was written by using SAS 9.4 and the contrast ratio, attributable proportion due to interaction and synergy index were calculated while calculating the items of logistic and Cox regression interactions, and the confidence intervals of Wald, delta and profile likelihood were used to evaluate additive interaction for the reference in big data analysis in clinical epidemiology and in analysis of genetic multiplicative and additive interactions.

  3. Boosted structured additive regression for Escherichia coli fed-batch fermentation modeling.

    PubMed

    Melcher, Michael; Scharl, Theresa; Luchner, Markus; Striedner, Gerald; Leisch, Friedrich

    2017-02-01

    The quality of biopharmaceuticals and patients' safety are of highest priority and there are tremendous efforts to replace empirical production process designs by knowledge-based approaches. Main challenge in this context is that real-time access to process variables related to product quality and quantity is severely limited. To date comprehensive on- and offline monitoring platforms are used to generate process data sets that allow for development of mechanistic and/or data driven models for real-time prediction of these important quantities. Ultimate goal is to implement model based feed-back control loops that facilitate online control of product quality. In this contribution, we explore structured additive regression (STAR) models in combination with boosting as a variable selection tool for modeling the cell dry mass, product concentration, and optical density on the basis of online available process variables and two-dimensional fluorescence spectroscopic data. STAR models are powerful extensions of linear models allowing for inclusion of smooth effects or interactions between predictors. Boosting constructs the final model in a stepwise manner and provides a variable importance measure via predictor selection frequencies. Our results show that the cell dry mass can be modeled with a relative error of about ±3%, the optical density with ±6%, the soluble protein with ±16%, and the insoluble product with an accuracy of ±12%. Biotechnol. Bioeng. 2017;114: 321-334. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.

  4. On the use of log-transformation vs. nonlinear regression for analyzing biological power laws

    USGS Publications Warehouse

    Xiao, X.; White, E.P.; Hooten, M.B.; Durham, S.L.

    2011-01-01

    Power-law relationships are among the most well-studied functional relationships in biology. Recently the common practice of fitting power laws using linear regression (LR) on log-transformed data has been criticized, calling into question the conclusions of hundreds of studies. It has been suggested that nonlinear regression (NLR) is preferable, but no rigorous comparison of these two methods has been conducted. Using Monte Carlo simulations, we demonstrate that the error distribution determines which method performs better, with NLR better characterizing data with additive, homoscedastic, normal error and LR better characterizing data with multiplicative, heteroscedastic, lognormal error. Analysis of 471 biological power laws shows that both forms of error occur in nature. While previous analyses based on log-transformation appear to be generally valid, future analyses should choose methods based on a combination of biological plausibility and analysis of the error distribution. We provide detailed guidelines and associated computer code for doing so, including a model averaging approach for cases where the error structure is uncertain. ?? 2011 by the Ecological Society of America.

  5. SAR405838: An optimized inhibitor of MDM2-p53 interaction that induces complete and durable tumor regression

    DOE PAGES

    Wang, Shaomeng; Sun, Wei; Zhao, Yujun; ...

    2014-08-21

    Blocking the MDM2-p53 protein-protein interaction has long been considered to offer a broad cancer therapeutic strategy, despite the potential risks of selecting tumors harboring p53 mutations that escape MDM2 control. In this study, we report a novel small molecule inhibitor of the MDM2-p53 interaction, SAR405838 (MI-77301) that has been advanced into Phase I clinical trials. SAR405838 binds to MDM2 with K i = 0.88 nM and has high specificity over other proteins. A co-crystal structure of the SAR405838:MDM2 complex shows that in addition to mimicking three key p53 amino acid residues, the inhibitor captures additional interactions not observed in themore » p53-MDM2 complex and induces refolding of the short, unstructured MDM2 N-terminal region to achieve its high affinity. SAR405838 effectively activates wild-type p53 in vitro and in xenograft tumor tissue of leukemia and solid tumors, leading to p53-dependent cell cycle arrest and/or apoptosis. At well-tolerated dose schedules, SAR405838 achieves either durable tumor regression or complete tumor growth inhibition in mouse xenograft models of SJSA-1 osteosarcoma, RS4;11 acute leukemia, LNCaP prostate cancer and HCT-116 colon cancer. Remarkably, a single oral dose of SAR405838 is sufficient to achieve complete tumor regression in the SJSA-1 model. Mechanistically, robust transcriptional up-regulation of PUMA induced by SAR405838 results in strong apoptosis in tumor tissue, leading to complete tumor regression. Lastly, our findings provide a preclinical basis upon which to evaluate SAR405838 as a therapeutic agent in patients whose tumors retain wild-type p53.« less

  6. Is Susceptibility to Prenatal Methylmercury Exposure from Fish Consumption Non-Homogeneous? Tree-Structured Analysis for the Seychelles Child Development Study

    PubMed Central

    Huang, Li-Shan; Myers, Gary J.; Davidson, Philip W.; Cox, Christopher; Xiao, Fenyuan; Thurston, Sally W.; Cernichiari, Elsa; Shamlaye, Conrad F.; Sloane-Reeves, Jean; Georger, Lesley; Clarkson, Thomas W.

    2007-01-01

    Studies of the association between prenatal methylmercury exposure from maternal fish consumption during pregnancy and neurodevelopmental test scores in the Seychelles Child Development Study have found no consistent pattern of associations through age nine years. The analyses for the most recent nine-year data examined the population effects of prenatal exposure, but did not address the possibility of non-homogeneous susceptibility. This paper presents a regression tree approach: covariate effects are treated nonlinearly and non-additively and non-homogeneous effects of prenatal methylmercury exposure are permitted among the covariate clusters identified by the regression tree. The approach allows us to address whether children in the lower or higher ends of the developmental spectrum differ in susceptibility to subtle exposure effects. Of twenty-one endpoints available at age nine years, we chose the Weschler Full Scale IQ and its associated covariates to construct the regression tree. The prenatal mercury effect in each of the nine resulting clusters was assessed linearly and non-homogeneously. In addition we reanalyzed five other nine-year endpoints that in the linear analysis has a two-tailed p-value <0.2 for the effect of prenatal exposure. In this analysis, motor proficiency and activity level improved significantly with increasing MeHg for 53% of the children who had an average home environment. Motor proficiency significantly decreased with increasing prenatal MeHg exposure in 7% of the children whose home environment was below average. The regression tree results support previous analyses of outcomes in this cohort. However, this analysis raises the intriguing possibility that an effect may be non-homogeneous among children with different backgrounds and IQ levels. PMID:17942158

  7. Is susceptibility to prenatal methylmercury exposure from fish consumption non-homogeneous? Tree-structured analysis for the Seychelles Child Development Study.

    PubMed

    Huang, Li-Shan; Myers, Gary J; Davidson, Philip W; Cox, Christopher; Xiao, Fenyuan; Thurston, Sally W; Cernichiari, Elsa; Shamlaye, Conrad F; Sloane-Reeves, Jean; Georger, Lesley; Clarkson, Thomas W

    2007-11-01

    Studies of the association between prenatal methylmercury exposure from maternal fish consumption during pregnancy and neurodevelopmental test scores in the Seychelles Child Development Study have found no consistent pattern of associations through age 9 years. The analyses for the most recent 9-year data examined the population effects of prenatal exposure, but did not address the possibility of non-homogeneous susceptibility. This paper presents a regression tree approach: covariate effects are treated non-linearly and non-additively and non-homogeneous effects of prenatal methylmercury exposure are permitted among the covariate clusters identified by the regression tree. The approach allows us to address whether children in the lower or higher ends of the developmental spectrum differ in susceptibility to subtle exposure effects. Of 21 endpoints available at age 9 years, we chose the Weschler Full Scale IQ and its associated covariates to construct the regression tree. The prenatal mercury effect in each of the nine resulting clusters was assessed linearly and non-homogeneously. In addition we reanalyzed five other 9-year endpoints that in the linear analysis had a two-tailed p-value <0.2 for the effect of prenatal exposure. In this analysis, motor proficiency and activity level improved significantly with increasing MeHg for 53% of the children who had an average home environment. Motor proficiency significantly decreased with increasing prenatal MeHg exposure in 7% of the children whose home environment was below average. The regression tree results support previous analyses of outcomes in this cohort. However, this analysis raises the intriguing possibility that an effect may be non-homogeneous among children with different backgrounds and IQ levels.

  8. A Simulation-Based Comparison of Several Stochastic Linear Regression Methods in the Presence of Outliers.

    ERIC Educational Resources Information Center

    Rule, David L.

    Several regression methods were examined within the framework of weighted structural regression (WSR), comparing their regression weight stability and score estimation accuracy in the presence of outlier contamination. The methods compared are: (1) ordinary least squares; (2) WSR ridge regression; (3) minimum risk regression; (4) minimum risk 2;…

  9. Using Marginal Structural Modeling to Estimate the Cumulative Impact of an Unconditional Tax Credit on Self-Rated Health.

    PubMed

    Pega, Frank; Blakely, Tony; Glymour, M Maria; Carter, Kristie N; Kawachi, Ichiro

    2016-02-15

    In previous studies, researchers estimated short-term relationships between financial credits and health outcomes using conventional regression analyses, but they did not account for time-varying confounders affected by prior treatment (CAPTs) or the credits' cumulative impacts over time. In this study, we examined the association between total number of years of receiving New Zealand's Family Tax Credit (FTC) and self-rated health (SRH) in 6,900 working-age parents using 7 waves of New Zealand longitudinal data (2002-2009). We conducted conventional linear regression analyses, both unadjusted and adjusted for time-invariant and time-varying confounders measured at baseline, and fitted marginal structural models (MSMs) that more fully adjusted for confounders, including CAPTs. Of all participants, 5.1%-6.8% received the FTC for 1-3 years and 1.8%-3.6% for 4-7 years. In unadjusted and adjusted conventional regression analyses, each additional year of receiving the FTC was associated with 0.033 (95% confidence interval (CI): -0.047, -0.019) and 0.026 (95% CI: -0.041, -0.010) units worse SRH (on a 5-unit scale). In the MSMs, the average causal treatment effect also reflected a small decrease in SRH (unstabilized weights: β = -0.039 unit, 95% CI: -0.058, -0.020; stabilized weights: β = -0.031 unit, 95% CI: -0.050, -0.007). Cumulatively receiving the FTC marginally reduced SRH. Conventional regression analyses and MSMs produced similar estimates, suggesting little bias from CAPTs. © The Author 2016. Published by Oxford University Press on behalf of the Johns Hopkins Bloomberg School of Public Health. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  10. Fish assemblage structure and habitat associations in a large western river system

    USGS Publications Warehouse

    Smith, C.D.; Quist, Michael C.; Hardy, R. S.

    2016-01-01

    Longitudinal gradients of fish assemblage and habitat structure were investigated in the Kootenai River of northern Idaho. A total of 43 500-m river reaches was sampled repeatedly with several techniques (boat-mounted electrofishing, hoop nets and benthic trawls) in the summers of 2012 and 2013. Differences in habitat and fish assemblage structure were apparent along the longitudinal gradient of the Kootenai River. Habitat characteristics (e.g. depth, substrate composition and water velocity) were related to fish assemblage structure in three different geomorphic river sections. Upper river sections were characterized by native salmonids (e.g. mountain whitefish Prosopium williamsoni), whereas native cyprinids (peamouth Mylocheilus caurinus, northern pikeminnow Ptychocheilus oregonensis) and non-native fishes (pumpkinseed Lepomis gibbosus, yellow perch Perca flavescens) were common in the downstream section. Overall, a general pattern of species addition from upstream to downstream sections was discovered and is likely related to increased habitat complexity and additions of non-native species in downstream sections. Assemblage structure of the upper sections were similar, but were both dissimilar to the lower section of the Kootenai River. Species-specific hurdle regressions indicated the relationships among habitat characteristics and the predicted probability of occurrence and relative abundance varied by species. Understanding fish assemblage structure in relation to habitat could improve conservation efforts of rare fishes and improve management of coldwater river systems.

  11. Functional Additive Mixed Models

    PubMed Central

    Scheipl, Fabian; Staicu, Ana-Maria; Greven, Sonja

    2014-01-01

    We propose an extensive framework for additive regression models for correlated functional responses, allowing for multiple partially nested or crossed functional random effects with flexible correlation structures for, e.g., spatial, temporal, or longitudinal functional data. Additionally, our framework includes linear and nonlinear effects of functional and scalar covariates that may vary smoothly over the index of the functional response. It accommodates densely or sparsely observed functional responses and predictors which may be observed with additional error and includes both spline-based and functional principal component-based terms. Estimation and inference in this framework is based on standard additive mixed models, allowing us to take advantage of established methods and robust, flexible algorithms. We provide easy-to-use open source software in the pffr() function for the R-package refund. Simulations show that the proposed method recovers relevant effects reliably, handles small sample sizes well and also scales to larger data sets. Applications with spatially and longitudinally observed functional data demonstrate the flexibility in modeling and interpretability of results of our approach. PMID:26347592

  12. Functional Additive Mixed Models.

    PubMed

    Scheipl, Fabian; Staicu, Ana-Maria; Greven, Sonja

    2015-04-01

    We propose an extensive framework for additive regression models for correlated functional responses, allowing for multiple partially nested or crossed functional random effects with flexible correlation structures for, e.g., spatial, temporal, or longitudinal functional data. Additionally, our framework includes linear and nonlinear effects of functional and scalar covariates that may vary smoothly over the index of the functional response. It accommodates densely or sparsely observed functional responses and predictors which may be observed with additional error and includes both spline-based and functional principal component-based terms. Estimation and inference in this framework is based on standard additive mixed models, allowing us to take advantage of established methods and robust, flexible algorithms. We provide easy-to-use open source software in the pffr() function for the R-package refund. Simulations show that the proposed method recovers relevant effects reliably, handles small sample sizes well and also scales to larger data sets. Applications with spatially and longitudinally observed functional data demonstrate the flexibility in modeling and interpretability of results of our approach.

  13. SCI model structure determination program (OSR) user's guide. [optimal subset regression

    NASA Technical Reports Server (NTRS)

    1979-01-01

    The computer program, OSR (Optimal Subset Regression) which estimates models for rotorcraft body and rotor force and moment coefficients is described. The technique used is based on the subset regression algorithm. Given time histories of aerodynamic coefficients, aerodynamic variables, and control inputs, the program computes correlation between various time histories. The model structure determination is based on these correlations. Inputs and outputs of the program are given.

  14. ToxiM: A Toxicity Prediction Tool for Small Molecules Developed Using Machine Learning and Chemoinformatics Approaches.

    PubMed

    Sharma, Ashok K; Srivastava, Gopal N; Roy, Ankita; Sharma, Vineet K

    2017-01-01

    The experimental methods for the prediction of molecular toxicity are tedious and time-consuming tasks. Thus, the computational approaches could be used to develop alternative methods for toxicity prediction. We have developed a tool for the prediction of molecular toxicity along with the aqueous solubility and permeability of any molecule/metabolite. Using a comprehensive and curated set of toxin molecules as a training set, the different chemical and structural based features such as descriptors and fingerprints were exploited for feature selection, optimization and development of machine learning based classification and regression models. The compositional differences in the distribution of atoms were apparent between toxins and non-toxins, and hence, the molecular features were used for the classification and regression. On 10-fold cross-validation, the descriptor-based, fingerprint-based and hybrid-based classification models showed similar accuracy (93%) and Matthews's correlation coefficient (0.84). The performances of all the three models were comparable (Matthews's correlation coefficient = 0.84-0.87) on the blind dataset. In addition, the regression-based models using descriptors as input features were also compared and evaluated on the blind dataset. Random forest based regression model for the prediction of solubility performed better ( R 2 = 0.84) than the multi-linear regression (MLR) and partial least square regression (PLSR) models, whereas, the partial least squares based regression model for the prediction of permeability (caco-2) performed better ( R 2 = 0.68) in comparison to the random forest and MLR based regression models. The performance of final classification and regression models was evaluated using the two validation datasets including the known toxins and commonly used constituents of health products, which attests to its accuracy. The ToxiM web server would be a highly useful and reliable tool for the prediction of toxicity, solubility, and permeability of small molecules.

  15. ToxiM: A Toxicity Prediction Tool for Small Molecules Developed Using Machine Learning and Chemoinformatics Approaches

    PubMed Central

    Sharma, Ashok K.; Srivastava, Gopal N.; Roy, Ankita; Sharma, Vineet K.

    2017-01-01

    The experimental methods for the prediction of molecular toxicity are tedious and time-consuming tasks. Thus, the computational approaches could be used to develop alternative methods for toxicity prediction. We have developed a tool for the prediction of molecular toxicity along with the aqueous solubility and permeability of any molecule/metabolite. Using a comprehensive and curated set of toxin molecules as a training set, the different chemical and structural based features such as descriptors and fingerprints were exploited for feature selection, optimization and development of machine learning based classification and regression models. The compositional differences in the distribution of atoms were apparent between toxins and non-toxins, and hence, the molecular features were used for the classification and regression. On 10-fold cross-validation, the descriptor-based, fingerprint-based and hybrid-based classification models showed similar accuracy (93%) and Matthews's correlation coefficient (0.84). The performances of all the three models were comparable (Matthews's correlation coefficient = 0.84–0.87) on the blind dataset. In addition, the regression-based models using descriptors as input features were also compared and evaluated on the blind dataset. Random forest based regression model for the prediction of solubility performed better (R2 = 0.84) than the multi-linear regression (MLR) and partial least square regression (PLSR) models, whereas, the partial least squares based regression model for the prediction of permeability (caco-2) performed better (R2 = 0.68) in comparison to the random forest and MLR based regression models. The performance of final classification and regression models was evaluated using the two validation datasets including the known toxins and commonly used constituents of health products, which attests to its accuracy. The ToxiM web server would be a highly useful and reliable tool for the prediction of toxicity, solubility, and permeability of small molecules. PMID:29249969

  16. Improved explanation of human intelligence using cortical features with second order moments and regression.

    PubMed

    Park, Hyunjin; Yang, Jin-ju; Seo, Jongbum; Choi, Yu-yong; Lee, Kun-ho; Lee, Jong-min

    2014-04-01

    Cortical features derived from magnetic resonance imaging (MRI) provide important information to account for human intelligence. Cortical thickness, surface area, sulcal depth, and mean curvature were considered to explain human intelligence. One region of interest (ROI) of a cortical structure consisting of thousands of vertices contained thousands of measurements, and typically, one mean value (first order moment), was used to represent a chosen ROI, which led to a potentially significant loss of information. We proposed a technological improvement to account for human intelligence in which a second moment (variance) in addition to the mean value was adopted to represent a chosen ROI, so that the loss of information would be less severe. Two computed moments for the chosen ROIs were analyzed with partial least squares regression (PLSR). Cortical features for 78 adults were measured and analyzed in conjunction with the full-scale intelligence quotient (FSIQ). Our results showed that 45% of the variance of the FSIQ could be explained using the combination of four cortical features using two moments per chosen ROI. Our results showed improvement over using a mean value for each ROI, which explained 37% of the variance of FSIQ using the same set of cortical measurements. Our results suggest that using additional second order moments is potentially better than using mean values of chosen ROIs for regression analysis to account for human intelligence. Copyright © 2014 Elsevier Ltd. All rights reserved.

  17. On the relation between personality and job performance of airline pilots.

    PubMed

    Hormann, H J; Maschke, P

    1996-01-01

    The validity of a personality questionnaire for the prediction of job success of airline pilots is compared to validities of a simulator checkflight and of flying experience data. During selection, 274 pilots applying for employment with a European charter airline were examined with a multidimensional personality questionnaire (Temperature Structure Scales; TSS). Additionally, the applicants were graded in a simulator checkflight. On the basis of training records, the pilots were classified as performing at standard or below standard after about 3 years of employment in the hiring company. In a multiple-regression model, this dichotomous criterion for job success can be predicted with 73.8% accuracy through the simulator checkflight and flying experience prior to employment. By adding the personality questionnaire to the regression equation, the number of correct classifications increases to 79.3%. On average, successful pilots score substantially higher on interpersonal scales and lower on emotional scales of the TSS.

  18. Effects of export concentration on CO2 emissions in developed countries: an empirical analysis.

    PubMed

    Apergis, Nicholas; Can, Muhlis; Gozgor, Giray; Lau, Chi Keung Marco

    2018-03-08

    This paper provides the evidence on the short- and the long-run effects of the export product concentration on the level of CO 2 emissions in 19 developed (high-income) economies, spanning the period 1962-2010. To this end, the paper makes use of the nonlinear panel unit root and cointegration tests with multiple endogenous structural breaks. It also considers the mean group estimations, the autoregressive distributed lag model, and the panel quantile regression estimations. The findings illustrate that the environmental Kuznets curve (EKC) hypothesis is valid in the panel dataset of 19 developed economies. In addition, it documents that a higher level of the product concentration of exports leads to lower CO 2 emissions. The results from the panel quantile regressions also indicate that the effect of the export product concentration upon the per capita CO 2 emissions is relatively high at the higher quantiles.

  19. A tandem regression-outlier analysis of a ligand cellular system for key structural modifications around ligand binding.

    PubMed

    Lin, Ying-Ting

    2013-04-30

    A tandem technique of hard equipment is often used for the chemical analysis of a single cell to first isolate and then detect the wanted identities. The first part is the separation of wanted chemicals from the bulk of a cell; the second part is the actual detection of the important identities. To identify the key structural modifications around ligand binding, the present study aims to develop a counterpart of tandem technique for cheminformatics. A statistical regression and its outliers act as a computational technique for separation. A PPARγ (peroxisome proliferator-activated receptor gamma) agonist cellular system was subjected to such an investigation. Results show that this tandem regression-outlier analysis, or the prioritization of the context equations tagged with features of the outliers, is an effective regression technique of cheminformatics to detect key structural modifications, as well as their tendency of impact to ligand binding. The key structural modifications around ligand binding are effectively extracted or characterized out of cellular reactions. This is because molecular binding is the paramount factor in such ligand cellular system and key structural modifications around ligand binding are expected to create outliers. Therefore, such outliers can be captured by this tandem regression-outlier analysis.

  20. [Effects of glycerol on the spectral properties of sodium caseinate].

    PubMed

    Li, Yan; Chang, Fen-fen; Gao, Huan-yuan; Cao, Qing; Jin, Li-e

    2015-01-01

    Although the immigration of water molecule, and diffusion and traversing of oxygen can be prevented by the edible film prepared through sodium caseinate, which plays a good protection role for the food, the strong hydrophilicity makes its watertightness and mechanical properties become inferior. Because the toughness and water resistance of SC films can be enhanced by glycerol (G) as an additive, it is necessary to elucidate the interaction between G and SC through the spectral characteristics such as fluorescence spectra, infrared spectra and UV spectra. The results show that the fluorescence intensity of SC decreases due to the addition of G. The binding constant obtained by the double logarithmic regression curve analysis is 1. 127 x 10(3) L . mol-1 and the number of binding sites reaches 1. 161. It indicates that the weak chemical bond is primary between G and SC molecules; From IR the absorption peaks of SC are almost the same before and after adding G. However, there is a certain difference among their absorption intensities. It reveals that the secondary structure of SC is affected, β folding length decreases, α helix, random coil structure, β angle structure increases, and the intermolecular hydrogen bond is strengthened; From UV the peptide bond structure of SC is not changed after the addition of G, but the polymer with larger molecular weight, which is formed by non-covalent bond, makes the peak intensity decrease. The research gives the mode of G and SC from the molecular level.

  1. Breaking the solid ground of common sense: undoing "structure" with Michael Balint.

    PubMed

    Bonomi, Carlo

    2003-09-01

    Balint's great merit was to question what, in the classical perspective, was assumed as a prerequisite for analysis and thus located beyond analysis: the maturity of the ego. A fundamental premise of his work was Ferenczi's distrust for the structural model, which praised the maturity of the ego and its verbal, social, and adaptive abilities. Ferenczi's view of ego maturation as a trauma derivative was strikingly different from the theories of all other psychoanalytic schools and seems to be responsible for Balint's understanding of regression as a sort of inverted process that enables the undoing of the sheltering structures of the mature mind. Balint's understanding of the relation between mature ego and regression diverged not only from the ego psychologists, who emphasized the idea of therapeutic alliance, but also from most of the authors who embraced the object-relational view, like Klein (who considered regression a manifestation of the patient's craving for oral gratification), Fairbairn (who gave up the notion of regression), and Guntrip (who viewed regression as a schizoid phenomenon related to the ego weakness). According to Balint, the clinical appearance of a regression would "depend also on the way the regression is recognized, is accepted, and is responded to by the analyst." In this respect, his position was close to Winnicott's reformulation of the therapeutic action. Yet, the work of Balint reflects the persuasion that the progressive fluidification of the solid structure could be enabled only by the analyst's capacity for becoming himself or herself [unsolid].

  2. Factors associated with local public health agency participation in obesity prevention in southern States.

    PubMed

    Hatala, Jeffrey J; Fields, Tina T

    2015-05-01

    Obesity rates in the southern US states are higher than in other states. Historically, large-scale community-based interventions in the United States have not proven successful. With local public health agencies (LPHAs) tasked with prevention, their role in obesity prevention is important, yet little research exists regarding what predicts the participation of LPHAs. Cross-sectional data from the 2008 National Association of City and County Health Officials profile study and two public health conceptual frameworks were used to assess structural and environmental predictors of LPHA participation in obesity prevention. The predictors were compared between southern and nonsouthern states. Univariate and weighted logistic regressions were performed. Analysis revealed that more LPHAs in southern states were engaged in nearly all of the 10 essential public health functions related to obesity prevention compared with nonsouthern states. Presence of community-based organizations and staffing levels were the only significant variables in two of the six logistic regression models. This study provides insights into the success rates of the obesity prevention efforts of LPHAs in southern and nonsouthern states. Future research is needed to understand why and how certain structural elements and any additional factors influence LPHA participation in obesity prevention.

  3. On the of neural modeling of some dynamic parameters of earthquakes and fire safety in high-rise construction

    NASA Astrophysics Data System (ADS)

    Haritonova, Larisa

    2018-03-01

    The recent change in the correlation of the number of man-made and natural catastrophes is presented in the paper. Some recommendations are proposed to increase the firefighting efficiency in the high-rise buildings. The article analyzes the methodology of modeling seismic effects. The prospectivity of applying the neural modeling and artificial neural networks to analyze a such dynamic parameters of the earthquake foci as the value of dislocation (or the average rupture slip) is shown. The following two input signals were used: the power class and the number of earthquakes. The regression analysis has been carried out for the predicted results and the target outputs. The equations of the regression for the outputs and target are presented in the work as well as the correlation coefficients in training, validation, testing, and the total (All) for the network structure 2-5-5-1for the average rupture slip. The application of the results obtained in the article for the seismic design for the newly constructed buildings and structures and the given recommendations will provide the additional protection from fire and earthquake risks, reduction of their negative economic and environmental consequences.

  4. Two-level structural sparsity regularization for identifying lattices and defects in noisy images

    DOE PAGES

    Li, Xin; Belianinov, Alex; Dyck, Ondrej E.; ...

    2018-03-09

    Here, this paper presents a regularized regression model with a two-level structural sparsity penalty applied to locate individual atoms in a noisy scanning transmission electron microscopy image (STEM). In crystals, the locations of atoms is symmetric, condensed into a few lattice groups. Therefore, by identifying the underlying lattice in a given image, individual atoms can be accurately located. We propose to formulate the identification of the lattice groups as a sparse group selection problem. Furthermore, real atomic scale images contain defects and vacancies, so atomic identification based solely on a lattice group may result in false positives and false negatives.more » To minimize error, model includes an individual sparsity regularization in addition to the group sparsity for a within-group selection, which results in a regression model with a two-level sparsity regularization. We propose a modification of the group orthogonal matching pursuit (gOMP) algorithm with a thresholding step to solve the atom finding problem. The convergence and statistical analyses of the proposed algorithm are presented. The proposed algorithm is also evaluated through numerical experiments with simulated images. The applicability of the algorithm on determination of atom structures and identification of imaging distortions and atomic defects was demonstrated using three real STEM images. In conclusion, we believe this is an important step toward automatic phase identification and assignment with the advent of genomic databases for materials.« less

  5. Two-level structural sparsity regularization for identifying lattices and defects in noisy images

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Li, Xin; Belianinov, Alex; Dyck, Ondrej E.

    Here, this paper presents a regularized regression model with a two-level structural sparsity penalty applied to locate individual atoms in a noisy scanning transmission electron microscopy image (STEM). In crystals, the locations of atoms is symmetric, condensed into a few lattice groups. Therefore, by identifying the underlying lattice in a given image, individual atoms can be accurately located. We propose to formulate the identification of the lattice groups as a sparse group selection problem. Furthermore, real atomic scale images contain defects and vacancies, so atomic identification based solely on a lattice group may result in false positives and false negatives.more » To minimize error, model includes an individual sparsity regularization in addition to the group sparsity for a within-group selection, which results in a regression model with a two-level sparsity regularization. We propose a modification of the group orthogonal matching pursuit (gOMP) algorithm with a thresholding step to solve the atom finding problem. The convergence and statistical analyses of the proposed algorithm are presented. The proposed algorithm is also evaluated through numerical experiments with simulated images. The applicability of the algorithm on determination of atom structures and identification of imaging distortions and atomic defects was demonstrated using three real STEM images. In conclusion, we believe this is an important step toward automatic phase identification and assignment with the advent of genomic databases for materials.« less

  6. Additivity of nonlinear biomass equations

    Treesearch

    Bernard R. Parresol

    2001-01-01

    Two procedures that guarantee the property of additivity among the components of tree biomass and total tree biomass utilizing nonlinear functions are developed. Procedure 1 is a simple combination approach, and procedure 2 is based on nonlinear joint-generalized regression (nonlinear seemingly unrelated regressions) with parameter restrictions. Statistical theory is...

  7. 'The effect of inulin addition on structural and textural properties of extruded products under several extrusion conditions': The effect of inulin addition on structural and textural properties of rice flour extrudates.

    PubMed

    Tsokolar-Tsikopoulos, Konstantinos C; Katsavou, Ioanna D; Krokida, Magdalini K

    2015-10-01

    The growing consumer demand for healthy snacks has turned the interest of industry and research in the development of new ready-to-eat products, enriched with dietary fibers. Inulin is a soluble fiber with a neutral taste that promotes the good function of the intestine. Rice flour extrudates were produced under various extrusion temperatures, screw speeds, feed moisture concentrations and inulin replacement levels. The objective of this study was to investigate the effect of the material characteristics and the extrusion conditions on the structural and textural properties of the extrudates. Simple mathematical models were used for properties correlation with process conditions and through regression analysis it was revealed that there is a significant effect of extrusion temperature, screw speed, feed moisture content and inulin concentration on the final properties. Both density and maximum stress increased when moisture content and inulin concentration increased, while they decreased when extrusion temperature and screw speed increased. These results were also strengthened by scanning electron microscopy. The highest expansion ratio was presented when decreasing all process conditions apart from screw speed.

  8. Impact of the variation in dynamic vehicle load on flexible pavement responses

    NASA Astrophysics Data System (ADS)

    Ahsanuzzaman, Md

    The purpose of this research was to evaluate the dynamic variation in asphalt pavement critical responses due to dynamic tire load variations. An attempt was also made to develop generalized regression equations to predict the dynamic response variation in flexible pavement under various dynamic load conditions. The study used an extensive database of computed pavement response histories for five different types of sites (smooth, rough, medium rough, very rough and severely rough), two different asphalt pavement structures (thin and thick) at two temperatures (70 °F and 104 °F), subjected to a tandem axle dual tire at three speeds 25, 37 and 50 mph (40, 60 and 80 km/h). All pavement responses were determined using the 3D-Move Analysis program (Version 1.2) developed by University of Nevada, Reno. A new term called Dynamic Response Coefficient (DRC) was introduced in this study to address the variation in critical pavement responses due to dynamic loads as traditionally measured by the Dynamic Load Coefficient (DLC). While DLC represents the additional varying component of the tire load, DRC represents the additional varying component of the response value (standard deviation divided by mean response). In this study, DRC was compared with DLC for five different sites based on the roughness condition of the sites. Previous studies showed that DLC varies with vehicle speed and suspension types, and assumes a constant value for the whole pavement structure (lateral and vertical directions). On the other hand, in this study, DRC was found to be significantly varied with the asphalt pavement and function of pavement structure, road roughness conditions, temperatures, vehicle speeds, suspension types, and locations of the point of interest in the pavement. A major contribution of the study is that the variation of pavement responses due to dynamic load in a flexible pavement system can be predicted with generalized regression equations. Fitting parameters (R2) in the rage of 0.60 to 0.87 were observed the DRC predictive equations. In addition, verification of those generalized equations was evaluated using different sets of asphalt pavement structures and pavement materials. The differences between calculated and predicted values were found to be within +/-20% for the maximum tensile strain and +/-30% for the maximum compressive strain in the asphalt layer.

  9. Experimental investigation of fuel regression rate in a HTPB based lab-scale hybrid rocket motor

    NASA Astrophysics Data System (ADS)

    Li, Xintian; Tian, Hui; Yu, Nanjia; Cai, Guobiao

    2014-12-01

    The fuel regression rate is an important parameter in the design process of the hybrid rocket motor. Additives in the solid fuel may have influences on the fuel regression rate, which will affect the internal ballistics of the motor. A series of firing experiments have been conducted on lab-scale hybrid rocket motors with 98% hydrogen peroxide (H2O2) oxidizer and hydroxyl terminated polybutadiene (HTPB) based fuels in this paper. An innovative fuel regression rate analysis method is established to diminish the errors caused by start and tailing stages in a short time firing test. The effects of the metal Mg, Al, aromatic hydrocarbon anthracene (C14H10), and carbon black (C) on the fuel regression rate are investigated. The fuel regression rate formulas of different fuel components are fitted according to the experiment data. The results indicate that the influence of C14H10 on the fuel regression rate of HTPB is not evident. However, the metal additives in the HTPB fuel can increase the fuel regression rate significantly.

  10. The consequences of ignoring measurement invariance for path coefficients in structural equation models

    PubMed Central

    Guenole, Nigel; Brown, Anna

    2014-01-01

    We report a Monte Carlo study examining the effects of two strategies for handling measurement non-invariance – modeling and ignoring non-invariant items – on structural regression coefficients between latent variables measured with item response theory models for categorical indicators. These strategies were examined across four levels and three types of non-invariance – non-invariant loadings, non-invariant thresholds, and combined non-invariance on loadings and thresholds – in simple, partial, mediated and moderated regression models where the non-invariant latent variable occupied predictor, mediator, and criterion positions in the structural regression models. When non-invariance is ignored in the latent predictor, the focal group regression parameters are biased in the opposite direction to the difference in loadings and thresholds relative to the referent group (i.e., lower loadings and thresholds for the focal group lead to overestimated regression parameters). With criterion non-invariance, the focal group regression parameters are biased in the same direction as the difference in loadings and thresholds relative to the referent group. While unacceptable levels of parameter bias were confined to the focal group, bias occurred at considerably lower levels of ignored non-invariance than was previously recognized in referent and focal groups. PMID:25278911

  11. Linear regression crash prediction models : issues and proposed solutions.

    DOT National Transportation Integrated Search

    2010-05-01

    The paper develops a linear regression model approach that can be applied to : crash data to predict vehicle crashes. The proposed approach involves novice data aggregation : to satisfy linear regression assumptions; namely error structure normality ...

  12. Theory Can Help Structure Regression Models for Projecting Stream Conditions Under Alternative Land Use Scenarios

    NASA Astrophysics Data System (ADS)

    van Sickle, J.; Baker, J.; Herlihy, A.

    2005-05-01

    We built multiple regression models for Emphemeroptera/ Plecoptera/ Tricoptera (EPT) taxon richness and other indicators of biological condition in streams of the Willamette River Basin, Oregon, USA. The models were used to project the changes in condition that would be expected in all 2-4th order streams of the 30000 sq km basin under alternative scenarios of future land use. In formulating the models, we invoked the theory of limiting factors to express the interactive effects of stream power and watershed land use on EPT richness. The resulting models were parsimonious, and they fit the data in our wedge-shaped scatterplots slightly better than did a naive additive-effects model. Just as theory helped formulate our regression models, the models in turn helped us identify a new research need for the Basin's streams. Our future scenarios project that conversions of agricultural to urban uses may dominate landscape dynamics in the basin over the next 50 years. But our models could not detect any difference between the effects of agricultural and urban development in watersheds on stream biota. This result points to an increased need for understanding how agricultural and urban land uses in the Basin differentially influence stream ecosystems.

  13. Semiparametric regression analysis of interval-censored competing risks data.

    PubMed

    Mao, Lu; Lin, Dan-Yu; Zeng, Donglin

    2017-09-01

    Interval-censored competing risks data arise when each study subject may experience an event or failure from one of several causes and the failure time is not observed directly but rather is known to lie in an interval between two examinations. We formulate the effects of possibly time-varying (external) covariates on the cumulative incidence or sub-distribution function of competing risks (i.e., the marginal probability of failure from a specific cause) through a broad class of semiparametric regression models that captures both proportional and non-proportional hazards structures for the sub-distribution. We allow each subject to have an arbitrary number of examinations and accommodate missing information on the cause of failure. We consider nonparametric maximum likelihood estimation and devise a fast and stable EM-type algorithm for its computation. We then establish the consistency, asymptotic normality, and semiparametric efficiency of the resulting estimators for the regression parameters by appealing to modern empirical process theory. In addition, we show through extensive simulation studies that the proposed methods perform well in realistic situations. Finally, we provide an application to a study on HIV-1 infection with different viral subtypes. © 2017, The International Biometric Society.

  14. Test anxiety and academic performance in chiropractic students.

    PubMed

    Zhang, Niu; Henderson, Charles N R

    2014-01-01

    Objective : We assessed the level of students' test anxiety, and the relationship between test anxiety and academic performance. Methods : We recruited 166 third-quarter students. The Test Anxiety Inventory (TAI) was administered to all participants. Total scores from written examinations and objective structured clinical examinations (OSCEs) were used as response variables. Results : Multiple regression analysis shows that there was a modest, but statistically significant negative correlation between TAI scores and written exam scores, but not OSCE scores. Worry and emotionality were the best predictive models for written exam scores. Mean total anxiety and emotionality scores for females were significantly higher than those for males, but not worry scores. Conclusion : Moderate-to-high test anxiety was observed in 85% of the chiropractic students examined. However, total test anxiety, as measured by the TAI score, was a very weak predictive model for written exam performance. Multiple regression analysis demonstrated that replacing total anxiety (TAI) with worry and emotionality (TAI subscales) produces a much more effective predictive model of written exam performance. Sex, age, highest current academic degree, and ethnicity contributed little additional predictive power in either regression model. Moreover, TAI scores were not found to be statistically significant predictors of physical exam skill performance, as measured by OSCEs.

  15. Predicting the aquatic toxicity mode of action using logistic regression and linear discriminant analysis.

    PubMed

    Ren, Y Y; Zhou, L C; Yang, L; Liu, P Y; Zhao, B W; Liu, H X

    2016-09-01

    The paper highlights the use of the logistic regression (LR) method in the construction of acceptable statistically significant, robust and predictive models for the classification of chemicals according to their aquatic toxic modes of action. Essentials accounting for a reliable model were all considered carefully. The model predictors were selected by stepwise forward discriminant analysis (LDA) from a combined pool of experimental data and chemical structure-based descriptors calculated by the CODESSA and DRAGON software packages. Model predictive ability was validated both internally and externally. The applicability domain was checked by the leverage approach to verify prediction reliability. The obtained models are simple and easy to interpret. In general, LR performs much better than LDA and seems to be more attractive for the prediction of the more toxic compounds, i.e. compounds that exhibit excess toxicity versus non-polar narcotic compounds and more reactive compounds versus less reactive compounds. In addition, model fit and regression diagnostics was done through the influence plot which reflects the hat-values, studentized residuals, and Cook's distance statistics of each sample. Overdispersion was also checked for the LR model. The relationships between the descriptors and the aquatic toxic behaviour of compounds are also discussed.

  16. Numerical analysis of combustion characteristics of hybrid rocket motor with multi-section swirl injection

    NASA Astrophysics Data System (ADS)

    Li, Chengen; Cai, Guobiao; Tian, Hui

    2016-06-01

    This paper is aimed to analyse the combustion characteristics of hybrid rocket motor with multi-section swirl injection by simulating the combustion flow field. Numerical combustion flow field and combustion performance parameters are obtained through three-dimensional numerical simulations based on a steady numerical model proposed in this paper. The hybrid rocket motor adopts 98% hydrogen peroxide and polyethylene as the propellants. Multiple injection sections are set along the axis of the solid fuel grain, and the oxidizer enters the combustion chamber by means of tangential injection via the injector ports in the injection sections. Simulation results indicate that the combustion flow field structure of the hybrid rocket motor could be improved by multi-section swirl injection method. The transformation of the combustion flow field can greatly increase the fuel regression rate and the combustion efficiency. The average fuel regression rate of the motor with multi-section swirl injection is improved by 8.37 times compared with that of the motor with conventional head-end irrotational injection. The combustion efficiency is increased to 95.73%. Besides, the simulation results also indicate that (1) the additional injection sections can increase the fuel regression rate and the combustion efficiency; (2) the upstream offset of the injection sections reduces the combustion efficiency; and (3) the fuel regression rate and the combustion efficiency decrease with the reduction of the number of injector ports in each injection section.

  17. Metal hydride and pyrophoric fuel additives for dicyclopentadiene based hybrid propellants

    NASA Astrophysics Data System (ADS)

    Shark, Steven C.

    The purpose of this study is to investigate the use of reactive energetic fuel additives that have the potential to increase the combustion performance of hybrid rocket propellants in terms of solid fuel regression rate and combustion efficiency. Additives that can augment the combustion flame zone in a hybrid rocket motor by means of increased energy feedback to the fuel grain surface are of great interest. Metal hydrides have large volumetric hydrogen densities, which gives these materials high performance potential as fuel additives in terms of specifc impulse. The excess hydrogen and corresponding base metal may also cause an increase in the hybrid rocket solid fuel regression rate. Pyrophoric additives also have potential to increase the solid fuel regression rate by reacting more readily near the burning fuel surface providing rapid energy feedback. An experimental performance evaluation of metal hydride fuel additives for hybrid rocket motor propulsion systems is examined in this study. Hypergolic ignition droplet tests and an accelerated aging study revealed the protection capabilities of Dicyclopentadiene (DCPD) as a fuel binder, and the ability for unaided ignition. Static hybrid rocket motor experiments were conducted using DCPD as the fuel. Sodium borohydride (NabH4) and aluminum hydride (AlH3) were examined as fuel additives. Ninety percent rocket grade hydrogen peroxide (RGHP) was used as the oxidizer. In this study, the sensitivity of solid fuel regression rate and characteristic velocity (C*) efficiency to total fuel grain port mass flux and particle loading is examined. These results were compared to HTPB combustion performance as a baseline. Chamber pressure histories revealed steady motor operation in most tests, with reduced ignition delays when using NabH4 as a fuel additive. The addition of NabH4 and AlH3 produced up to a 47% and 85% increase in regression rate over neat DCPD, respectively. For all test conditions examined C* efficiency ranges between 80% and 90%. The regression rate and C* efficiency mass flux dependence indicate a shift towards a more diffusion controlled system with metal hydride particle addition. Although these types of energetic particles have potential as high performing fuel additives, they can be in low supply and expensive. An opposed flow burner was investigated as a means to screen and characterize hybrid rocket fuels prior to full scale rocket motor testing. Although this type of configuration has been investigated in the past, no comparison has been made to hybrid rocket motor operation in terms of mass flux. Polymeric fuels and low melt temperature fuels with and without additives were investigated via an opposed flow burner. The effects of laminar and turbulent flow regimes on the convective heat transfer in the opposed flow system was depicted in the regression rate trends of these fuels. Regression rate trends similar to hybrid rocket motor operation were depicted, including the entrainment mechanism for paran fuel. However, there was a shift in overall magnitude of these results. A decrease in regression rate occurred for HTPB loaded with passivated nano-aluminum, due to low resonance time in the reaction zone. Previous results have shown that pyrophoric additives can cause an increase in regression rate in the opposed flow burner configuration. It is proposed that the opposed burner is useful as a screening and characterization tool for some propellant combinations. Gaseous oxygen (GOX) was investigated as an oxidizer for similar fuels evaluated with RGHP. Specifically, combustion performance sensitivity to mass flux and MH particle size was investigated. Similar results to the RGHP experiments were observed for the regression rate tends of HTPB, DPCD, and NabH 4 addition. Kinetically limited regression rate dependence on mass flux was observed at the higher mass flux levels. No major increase in C* efficiency was observed for MH addition. The C* efficiency varied with equivalence ratio by approximately 10 percentage points, which was not observed in the RGHP experiments. A 10 percentage point decrease in C* efficiency was observed with increasing mass flux in the system. This was most likely due to poorly mixed fuel and oxidizer in center of the combustion chamber at the higher mass flux levels. Detailed measurements of the hybrid rocket combustion zone is useful for understanding the mechanisms governing performance, but can be difficult to obtain. Traditional slab burner configurations have proven useful but are operationally limited in pressure and mass flux ranges. A new optical cylindrical combustor (OCC) design is presented that allows surface and flame zone imaging and tracking during hybrid rocket motor operation at appreciable mass flux and pressure levels, > 100 kg/s/m2 and > 0.69 MPa. The flame height and regression rate sensitivity to mass flux and chamber pressure was examined for the same fuels examined in the GOX hybrid rocket motor, with the addition of DCPD fuel loaded with Al and unpassivated mechanically activated Al-PTFE. The regression rate trends were on the same order of magnitude of traditional hybrid rocket motor results. A flame height decrease was observed for increased mass flux. The flame height increased with NabH 4 addition, which is most likely a function of increased blowing at the surface. There was no appreciable flame height sensitivity to NabH4 particle size. There was no relative change in flame height or regression rate between the Al and AL-PTFE addition. The OCC allowed visualization of the hybrid rocket fuel flame zone at mass flux and pressure levels that are not known to be report for traditional slab burner configurations in literature. The OCC proved to be a new useful tool for investigated hybrid rocket propellant combustion characteristics.

  18. Using Multivariate Adaptive Regression Spline and Artificial Neural Network to Simulate Urbanization in Mumbai, India

    NASA Astrophysics Data System (ADS)

    Ahmadlou, M.; Delavar, M. R.; Tayyebi, A.; Shafizadeh-Moghadam, H.

    2015-12-01

    Land use change (LUC) models used for modelling urban growth are different in structure and performance. Local models divide the data into separate subsets and fit distinct models on each of the subsets. Non-parametric models are data driven and usually do not have a fixed model structure or model structure is unknown before the modelling process. On the other hand, global models perform modelling using all the available data. In addition, parametric models have a fixed structure before the modelling process and they are model driven. Since few studies have compared local non-parametric models with global parametric models, this study compares a local non-parametric model called multivariate adaptive regression spline (MARS), and a global parametric model called artificial neural network (ANN) to simulate urbanization in Mumbai, India. Both models determine the relationship between a dependent variable and multiple independent variables. We used receiver operating characteristic (ROC) to compare the power of the both models for simulating urbanization. Landsat images of 1991 (TM) and 2010 (ETM+) were used for modelling the urbanization process. The drivers considered for urbanization in this area were distance to urban areas, urban density, distance to roads, distance to water, distance to forest, distance to railway, distance to central business district, number of agricultural cells in a 7 by 7 neighbourhoods, and slope in 1991. The results showed that the area under the ROC curve for MARS and ANN was 94.77% and 95.36%, respectively. Thus, ANN performed slightly better than MARS to simulate urban areas in Mumbai, India.

  19. More insights into early brain development through statistical analyses of eigen-structural elements of diffusion tensor imaging using multivariate adaptive regression splines

    PubMed Central

    Chen, Yasheng; Zhu, Hongtu; An, Hongyu; Armao, Diane; Shen, Dinggang; Gilmore, John H.; Lin, Weili

    2013-01-01

    The aim of this study was to characterize the maturational changes of the three eigenvalues (λ1 ≥ λ2 ≥ λ3) of diffusion tensor imaging (DTI) during early postnatal life for more insights into early brain development. In order to overcome the limitations of using presumed growth trajectories for regression analysis, we employed Multivariate Adaptive Regression Splines (MARS) to derive data-driven growth trajectories for the three eigenvalues. We further employed Generalized Estimating Equations (GEE) to carry out statistical inferences on the growth trajectories obtained with MARS. With a total of 71 longitudinal datasets acquired from 29 healthy, full-term pediatric subjects, we found that the growth velocities of the three eigenvalues were highly correlated, but significantly different from each other. This paradox suggested the existence of mechanisms coordinating the maturations of the three eigenvalues even though different physiological origins may be responsible for their temporal evolutions. Furthermore, our results revealed the limitations of using the average of λ2 and λ3 as the radial diffusivity in interpreting DTI findings during early brain development because these two eigenvalues had significantly different growth velocities even in central white matter. In addition, based upon the three eigenvalues, we have documented the growth trajectory differences between central and peripheral white matter, between anterior and posterior limbs of internal capsule, and between inferior and superior longitudinal fasciculus. Taken together, we have demonstrated that more insights into early brain maturation can be gained through analyzing eigen-structural elements of DTI. PMID:23455648

  20. Measuring missing heritability: Inferring the contribution of common variants

    PubMed Central

    Golan, David; Lander, Eric S.; Rosset, Saharon

    2014-01-01

    Genome-wide association studies (GWASs), also called common variant association studies (CVASs), have uncovered thousands of genetic variants associated with hundreds of diseases. However, the variants that reach statistical significance typically explain only a small fraction of the heritability. One explanation for the “missing heritability” is that there are many additional disease-associated common variants whose effects are too small to detect with current sample sizes. It therefore is useful to have methods to quantify the heritability due to common variation, without having to identify all causal variants. Recent studies applied restricted maximum likelihood (REML) estimation to case–control studies for diseases. Here, we show that REML considerably underestimates the fraction of heritability due to common variation in this setting. The degree of underestimation increases with the rarity of disease, the heritability of the disease, and the size of the sample. Instead, we develop a general framework for heritability estimation, called phenotype correlation–genotype correlation (PCGC) regression, which generalizes the well-known Haseman–Elston regression method. We show that PCGC regression yields unbiased estimates. Applying PCGC regression to six diseases, we estimate the proportion of the phenotypic variance due to common variants to range from 25% to 56% and the proportion of heritability due to common variants from 41% to 68% (mean 60%). These results suggest that common variants may explain at least half the heritability for many diseases. PCGC regression also is readily applicable to other settings, including analyzing extreme-phenotype studies and adjusting for covariates such as sex, age, and population structure. PMID:25422463

  1. Testing Mediation Using Multiple Regression and Structural Equation Modeling Analyses in Secondary Data

    ERIC Educational Resources Information Center

    Li, Spencer D.

    2011-01-01

    Mediation analysis in child and adolescent development research is possible using large secondary data sets. This article provides an overview of two statistical methods commonly used to test mediated effects in secondary analysis: multiple regression and structural equation modeling (SEM). Two empirical studies are presented to illustrate the…

  2. Prediction of Emergency Department Hospital Admission Based on Natural Language Processing and Neural Networks.

    PubMed

    Zhang, Xingyu; Kim, Joyce; Patzer, Rachel E; Pitts, Stephen R; Patzer, Aaron; Schrager, Justin D

    2017-10-26

    To describe and compare logistic regression and neural network modeling strategies to predict hospital admission or transfer following initial presentation to Emergency Department (ED) triage with and without the addition of natural language processing elements. Using data from the National Hospital Ambulatory Medical Care Survey (NHAMCS), a cross-sectional probability sample of United States EDs from 2012 and 2013 survey years, we developed several predictive models with the outcome being admission to the hospital or transfer vs. discharge home. We included patient characteristics immediately available after the patient has presented to the ED and undergone a triage process. We used this information to construct logistic regression (LR) and multilayer neural network models (MLNN) which included natural language processing (NLP) and principal component analysis from the patient's reason for visit. Ten-fold cross validation was used to test the predictive capacity of each model and receiver operating curves (AUC) were then calculated for each model. Of the 47,200 ED visits from 642 hospitals, 6,335 (13.42%) resulted in hospital admission (or transfer). A total of 48 principal components were extracted by NLP from the reason for visit fields, which explained 75% of the overall variance for hospitalization. In the model including only structured variables, the AUC was 0.824 (95% CI 0.818-0.830) for logistic regression and 0.823 (95% CI 0.817-0.829) for MLNN. Models including only free-text information generated AUC of 0.742 (95% CI 0.731- 0.753) for logistic regression and 0.753 (95% CI 0.742-0.764) for MLNN. When both structured variables and free text variables were included, the AUC reached 0.846 (95% CI 0.839-0.853) for logistic regression and 0.844 (95% CI 0.836-0.852) for MLNN. The predictive accuracy of hospital admission or transfer for patients who presented to ED triage overall was good, and was improved with the inclusion of free text data from a patient's reason for visit regardless of modeling approach. Natural language processing and neural networks that incorporate patient-reported outcome free text may increase predictive accuracy for hospital admission.

  3. A LEAST ABSOLUTE SHRINKAGE AND SELECTION OPERATOR (LASSO) FOR NONLINEAR SYSTEM IDENTIFICATION

    NASA Technical Reports Server (NTRS)

    Kukreja, Sunil L.; Lofberg, Johan; Brenner, Martin J.

    2006-01-01

    Identification of parametric nonlinear models involves estimating unknown parameters and detecting its underlying structure. Structure computation is concerned with selecting a subset of parameters to give a parsimonious description of the system which may afford greater insight into the functionality of the system or a simpler controller design. In this study, a least absolute shrinkage and selection operator (LASSO) technique is investigated for computing efficient model descriptions of nonlinear systems. The LASSO minimises the residual sum of squares by the addition of a 1 penalty term on the parameter vector of the traditional 2 minimisation problem. Its use for structure detection is a natural extension of this constrained minimisation approach to pseudolinear regression problems which produces some model parameters that are exactly zero and, therefore, yields a parsimonious system description. The performance of this LASSO structure detection method was evaluated by using it to estimate the structure of a nonlinear polynomial model. Applicability of the method to more complex systems such as those encountered in aerospace applications was shown by identifying a parsimonious system description of the F/A-18 Active Aeroelastic Wing using flight test data.

  4. Use of random regression to estimate genetic parameters of temperament across an age continuum in a crossbred cattle population.

    PubMed

    Littlejohn, B P; Riley, D G; Welsh, T H; Randel, R D; Willard, S T; Vann, R C

    2018-05-12

    The objective was to estimate genetic parameters of temperament in beef cattle across an age continuum. The population consisted predominantly of Brahman-British crossbred cattle. Temperament was quantified by: 1) pen score (PS), the reaction of a calf to a single experienced evaluator on a scale of 1 to 5 (1 = calm, 5 = excitable); 2) exit velocity (EV), the rate (m/sec) at which a calf traveled 1.83 m upon exiting a squeeze chute; and 3) temperament score (TS), the numerical average of PS and EV. Covariates included days of age and proportion of Bos indicus in the calf and dam. Random regression models included the fixed effects determined from the repeated measures models, except for calf age. Likelihood ratio tests were used to determine the most appropriate random structures. In repeated measures models, the proportion of Bos indicus in the calf was positively related with each calf temperament trait (0.41 ± 0.20, 0.85 ± 0.21, and 0.57 ± 0.18 for PS, EV, and TS, respectively; P < 0.01). There was an effect of contemporary group (combinations of season, year of birth, and management group) and dam age (P < 0.001) in all models. From repeated records analyses, estimates of heritability (h2) were 0.34 ± 0.04, 0.31 ± 0.04, and 0.39 ± 0.04, while estimates of permanent environmental variance as a proportion of the phenotypic variance (c2) were 0.30 ± 0.04, 0.31 ± 0.03, and 0.34 ± 0.04 for PS, EV, and TS, respectively. Quadratic additive genetic random regressions on Legendre polynomials of age were significant for all traits. Quadratic permanent environmental random regressions were significant for PS and TS, but linear permanent environmental random regressions were significant for EV. Random regression results suggested that these components change across the age dimension of these data. There appeared to be an increasing influence of permanent environmental effects and decreasing influence of additive genetic effects corresponding to increasing calf age for EV, and to a lesser extent for TS. Inherited temperament may be overcome by accumulating environmental stimuli with increases in age, especially after weaning.

  5. PARAMETRIC AND NON PARAMETRIC (MARS: MULTIVARIATE ADDITIVE REGRESSION SPLINES) LOGISTIC REGRESSIONS FOR PREDICTION OF A DICHOTOMOUS RESPONSE VARIABLE WITH AN EXAMPLE FOR PRESENCE/ABSENCE OF AMPHIBIANS

    EPA Science Inventory

    The purpose of this report is to provide a reference manual that could be used by investigators for making informed use of logistic regression using two methods (standard logistic regression and MARS). The details for analyses of relationships between a dependent binary response ...

  6. Application of Machine-Learning Models to Predict Tacrolimus Stable Dose in Renal Transplant Recipients

    NASA Astrophysics Data System (ADS)

    Tang, Jie; Liu, Rong; Zhang, Yue-Li; Liu, Mou-Ze; Hu, Yong-Fang; Shao, Ming-Jie; Zhu, Li-Jun; Xin, Hua-Wen; Feng, Gui-Wen; Shang, Wen-Jun; Meng, Xiang-Guang; Zhang, Li-Rong; Ming, Ying-Zi; Zhang, Wei

    2017-02-01

    Tacrolimus has a narrow therapeutic window and considerable variability in clinical use. Our goal was to compare the performance of multiple linear regression (MLR) and eight machine learning techniques in pharmacogenetic algorithm-based prediction of tacrolimus stable dose (TSD) in a large Chinese cohort. A total of 1,045 renal transplant patients were recruited, 80% of which were randomly selected as the “derivation cohort” to develop dose-prediction algorithm, while the remaining 20% constituted the “validation cohort” to test the final selected algorithm. MLR, artificial neural network (ANN), regression tree (RT), multivariate adaptive regression splines (MARS), boosted regression tree (BRT), support vector regression (SVR), random forest regression (RFR), lasso regression (LAR) and Bayesian additive regression trees (BART) were applied and their performances were compared in this work. Among all the machine learning models, RT performed best in both derivation [0.71 (0.67-0.76)] and validation cohorts [0.73 (0.63-0.82)]. In addition, the ideal rate of RT was 4% higher than that of MLR. To our knowledge, this is the first study to use machine learning models to predict TSD, which will further facilitate personalized medicine in tacrolimus administration in the future.

  7. Constructing a self: the role of self-structure and self-certainty in social anxiety.

    PubMed

    Stopa, Lusia; Brown, Mike A; Luke, Michelle A; Hirsch, Colette R

    2010-10-01

    Current cognitive models stress the importance of negative self-perceptions in maintaining social anxiety, but focus predominantly on content rather than structure. Two studies examine the role of self-structure (self-organisation, self-complexity, and self-concept clarity) in social anxiety. In study one, self-organisation and self-concept clarity were correlated with social anxiety, and a step-wise multiple regression showed that after controlling for depression and self-esteem, which explained 35% of the variance in social anxiety scores, self-concept clarity uniquely predicted social anxiety and accounted for an additional 7% of the variance in social anxiety scores in an undergraduate sample (N=95) and the interaction between self-concept clarity and compartmentalisation (an aspect of evaluative self-organisation) at step 3 of the multiple regression accounted for a further 3% of the variance in social anxiety scores. In study two, high (n=26) socially anxious participants demonstrated less self-concept clarity than low socially anxious participants (n=26) on both self-report (used in study one) and on computerised measures of self-consistency and confidence in self-related judgments. The high socially anxious group had more compartmentalised self-organisation than the low anxious group, but there were no differences between the two groups on any of the other measures of self-organisation. Self-complexity did not contribute to social anxiety in either study, although this may have been due to the absence of a stressor. Overall, the results suggest that self-structure has a potentially important role in understanding social anxiety and that self-concept clarity and other aspects of self-structure such as compartmentalisation interact with each other and could be potential maintaining factors in social anxiety. Cognitive therapy for social phobia might influence self-structure, and understanding the role of structural variables in maintenance and treatment could eventually help to improve treatment outcome. Copyright 2010 Elsevier Ltd. All rights reserved.

  8. Constructing a self: The role of self-structure and self-certainty in social anxiety

    PubMed Central

    Stopa, Lusia; Brown, Mike A.; Luke, Michelle A.; Hirsch, Colette R.

    2010-01-01

    Current cognitive models stress the importance of negative self-perceptions in maintaining social anxiety, but focus predominantly on content rather than structure. Two studies examine the role of self-structure (self-organisation, self-complexity, and self-concept clarity) in social anxiety. In study one, self-organisation and self-concept clarity were correlated with social anxiety, and a step-wise multiple regression showed that after controlling for depression and self-esteem, which explained 35% of the variance in social anxiety scores, self-concept clarity uniquely predicted social anxiety and accounted for an additional 7% of the variance in social anxiety scores in an undergraduate sample (N = 95) and the interaction between self-concept clarity and compartmentalisation (an aspect of evaluative self-organisation) at step 3 of the multiple regression accounted for a further 3% of the variance in social anxiety scores. In study two, high (n = 26) socially anxious participants demonstrated less self-concept clarity than low socially anxious participants (n = 26) on both self-report (used in study one) and on computerised measures of self-consistency and confidence in self-related judgments. The high socially anxious group had more compartmentalised self-organisation than the low anxious group, but there were no differences between the two groups on any of the other measures of self-organisation. Self-complexity did not contribute to social anxiety in either study, although this may have been due to the absence of a stressor. Overall, the results suggest that self-structure has a potentially important role in understanding social anxiety and that self-concept clarity and other aspects of self-structure such as compartmentalisation interact with each other and could be potential maintaining factors in social anxiety. Cognitive therapy for social phobia might influence self-structure, and understanding the role of structural variables in maintenance and treatment could eventually help to improve treatment outcome. PMID:20800751

  9. Beyond logistic regression: structural equations modelling for binary variables and its application to investigating unobserved confounders.

    PubMed

    Kupek, Emil

    2006-03-15

    Structural equation modelling (SEM) has been increasingly used in medical statistics for solving a system of related regression equations. However, a great obstacle for its wider use has been its difficulty in handling categorical variables within the framework of generalised linear models. A large data set with a known structure among two related outcomes and three independent variables was generated to investigate the use of Yule's transformation of odds ratio (OR) into Q-metric by (OR-1)/(OR+1) to approximate Pearson's correlation coefficients between binary variables whose covariance structure can be further analysed by SEM. Percent of correctly classified events and non-events was compared with the classification obtained by logistic regression. The performance of SEM based on Q-metric was also checked on a small (N = 100) random sample of the data generated and on a real data set. SEM successfully recovered the generated model structure. SEM of real data suggested a significant influence of a latent confounding variable which would have not been detectable by standard logistic regression. SEM classification performance was broadly similar to that of the logistic regression. The analysis of binary data can be greatly enhanced by Yule's transformation of odds ratios into estimated correlation matrix that can be further analysed by SEM. The interpretation of results is aided by expressing them as odds ratios which are the most frequently used measure of effect in medical statistics.

  10. Random regression models using different functions to model test-day milk yield of Brazilian Holstein cows.

    PubMed

    Bignardi, A B; El Faro, L; Torres Júnior, R A A; Cardoso, V L; Machado, P F; Albuquerque, L G

    2011-10-31

    We analyzed 152,145 test-day records from 7317 first lactations of Holstein cows recorded from 1995 to 2003. Our objective was to model variations in test-day milk yield during the first lactation of Holstein cows by random regression model (RRM), using various functions in order to obtain adequate and parsimonious models for the estimation of genetic parameters. Test-day milk yields were grouped into weekly classes of days in milk, ranging from 1 to 44 weeks. The contemporary groups were defined as herd-test-day. The analyses were performed using a single-trait RRM, including the direct additive, permanent environmental and residual random effects. In addition, contemporary group and linear and quadratic effects of the age of cow at calving were included as fixed effects. The mean trend of milk yield was modeled with a fourth-order orthogonal Legendre polynomial. The additive genetic and permanent environmental covariance functions were estimated by random regression on two parametric functions, Ali and Schaeffer and Wilmink, and on B-spline functions of days in milk. The covariance components and the genetic parameters were estimated by the restricted maximum likelihood method. Results from RRM parametric and B-spline functions were compared to RRM on Legendre polynomials and with a multi-trait analysis, using the same data set. Heritability estimates presented similar trends during mid-lactation (13 to 31 weeks) and between week 37 and the end of lactation, for all RRM. Heritabilities obtained by multi-trait analysis were of a lower magnitude than those estimated by RRM. The RRMs with a higher number of parameters were more useful to describe the genetic variation of test-day milk yield throughout the lactation. RRM using B-spline and Legendre polynomials as base functions appears to be the most adequate to describe the covariance structure of the data.

  11. Spreading properties of cosmetic emollients: Use of synthetic skin surface to elucidate structural effect.

    PubMed

    Douguet, Marine; Picard, Céline; Savary, Géraldine; Merlaud, Fabien; Loubat-Bouleuc, Nathalie; Grisel, Michel

    2017-06-01

    The study focuses on the impact of structural and physicochemical properties of emollients on their spreadability. Fifty-three emollients, among which esters, silicones, vegetable and mineral oils, have been characterized. Their viscosity, surface tension, density and spreadability have been measured. Vitro-skin ® , an artificial skin substitute, was used as an artificial porous substrate to measure spreadability. Two different methods have been selected to characterize spreadability, namely contact angle and spreading value. Dynamic contact angle measurements showed that emollient spreadability is first governed by spontaneous spreading and that, in a second phase, absorption and migration into the porous substrate becomes the driver of the extension of the spreading area. Statistical analysis of physicochemical and spreading value data revealed that viscosity has a major impact on the spreading behavior of emollients whatever their chemical type. A special emphasis was placed on the ester family in which chemical diversity is very wide. The results highlighted a difference between "high viscosity esters" for which viscosity is the main factor impacting spreadability and "low viscosity esters" for which structural variations (mono/diester, saturated/unsaturated chain, linear/branched chain) have to be considered in addition to viscosity. Linear regressions were used to express spreading value as a function of viscosity for each of the four emollient families tested (esters, silicones, vegetable and mineral oils). These regressions allowed the development of reliable predictive models as a powerful tool for formulators to forecast spreadability of emollients. Copyright © 2017 Elsevier B.V. All rights reserved.

  12. Principal Covariates Clusterwise Regression (PCCR): Accounting for Multicollinearity and Population Heterogeneity in Hierarchically Organized Data.

    PubMed

    Wilderjans, Tom Frans; Vande Gaer, Eva; Kiers, Henk A L; Van Mechelen, Iven; Ceulemans, Eva

    2017-03-01

    In the behavioral sciences, many research questions pertain to a regression problem in that one wants to predict a criterion on the basis of a number of predictors. Although in many cases, ordinary least squares regression will suffice, sometimes the prediction problem is more challenging, for three reasons: first, multiple highly collinear predictors can be available, making it difficult to grasp their mutual relations as well as their relations to the criterion. In that case, it may be very useful to reduce the predictors to a few summary variables, on which one regresses the criterion and which at the same time yields insight into the predictor structure. Second, the population under study may consist of a few unknown subgroups that are characterized by different regression models. Third, the obtained data are often hierarchically structured, with for instance, observations being nested into persons or participants within groups or countries. Although some methods have been developed that partially meet these challenges (i.e., principal covariates regression (PCovR), clusterwise regression (CR), and structural equation models), none of these methods adequately deals with all of them simultaneously. To fill this gap, we propose the principal covariates clusterwise regression (PCCR) method, which combines the key idea's behind PCovR (de Jong & Kiers in Chemom Intell Lab Syst 14(1-3):155-164, 1992) and CR (Späth in Computing 22(4):367-373, 1979). The PCCR method is validated by means of a simulation study and by applying it to cross-cultural data regarding satisfaction with life.

  13. USE OF LETHALITY DATA DURING CATEGORICAL REGRESSION MODELING OF ACUTE REFERENCE EXPOSURES

    EPA Science Inventory

    Categorical regression is being considered by the U.S. EPA as an additional tool for derivation of acute reference exposures (AREs) to be used for human health risk assessment for exposure to inhaled chemicals. Categorical regression is used to calculate probability-response fun...

  14. What Is Wrong with ANOVA and Multiple Regression? Analyzing Sentence Reading Times with Hierarchical Linear Models

    ERIC Educational Resources Information Center

    Richter, Tobias

    2006-01-01

    Most reading time studies using naturalistic texts yield data sets characterized by a multilevel structure: Sentences (sentence level) are nested within persons (person level). In contrast to analysis of variance and multiple regression techniques, hierarchical linear models take the multilevel structure of reading time data into account. They…

  15. The influence of intrinsic and extrinsic job values on turnover intention among continuing care assistants in Nova Scotia.

    PubMed

    Dill, Donna M; Keefe, Janice M; McGrath, Daniel S

    2012-01-01

    This article examines the influence that intrinsic and extrinsic job values have on the turnover intention of continuing care assistants (CCAs) who work either in home care or facility-based care in Nova Scotia (n = 188). Factor analysis of job values identified three latent job values structures: "compensation and commitment," "flexibility and opportunity," and "positive work relationships." Using binary logistic regression, we examined the predictive utility of these factors on two indices of turnover intention. Regression results indicate that, in general, job values constructs did not significantly predict turnover intention when controlling for demographics and job characteristics. However, a trend was found for the "positive work relationships" factor in predicting consideration of changing employers. In addition, CCAs who work in facility-based care were significantly more likely to have considered leaving their current employer. With projected increases in the demand for these workers in both home and continuing care, more attention is needed to identify and address factors to reduce turnover intention.

  16. The complicated substrates enhance the microbial diversity and zinc leaching efficiency in sphalerite bioleaching system.

    PubMed

    Xiao, Yunhua; Xu, YongDong; Dong, Weiling; Liang, Yili; Fan, Fenliang; Zhang, Xiaoxia; Zhang, Xian; Niu, Jiaojiao; Ma, Liyuan; She, Siyuan; He, Zhili; Liu, Xueduan; Yin, Huaqun

    2015-12-01

    This study used an artificial enrichment microbial consortium to examine the effects of different substrate conditions on microbial diversity, composition, and function (e.g., zinc leaching efficiency) through adding pyrite (SP group), chalcopyrite (SC group), or both (SPC group) in sphalerite bioleaching systems. 16S rRNA gene sequencing analysis showed that microbial community structures and compositions dramatically changed with additions of pyrite or chalcopyrite during the sphalerite bioleaching process. Shannon diversity index showed a significantly increase in the SP (1.460), SC (1.476), and SPC (1.341) groups compared with control (sphalerite group, 0.624) on day 30, meanwhile, zinc leaching efficiencies were enhanced by about 13.4, 2.9, and 13.2%, respectively. Also, additions of pyrite or chalcopyrite could increase electric potential (ORP) and the concentrations of Fe3+ and H+, which were the main factors shaping microbial community structures by Mantel test analysis. Linear regression analysis showed that ORP, Fe3+ concentration, and pH were significantly correlated to zinc leaching efficiency and microbial diversity. In addition, we found that leaching efficiency showed a positive and significant relationship with microbial diversity. In conclusion, our results showed that the complicated substrates could significantly enhance microbial diversity and activity of function.

  17. Semiexperimental equilibrium structures for building blocks of organic and biological molecules: the B2PLYP route.

    PubMed

    Penocchio, Emanuele; Piccardo, Matteo; Barone, Vincenzo

    2015-10-13

    The B2PLYP double hybrid functional, coupled with the correlation-consistent triple-ζ cc-pVTZ (VTZ) basis set, has been validated in the framework of the semiexperimental (SE) approach for deriving accurate equilibrium structures of molecules containing up to 15 atoms. A systematic comparison between new B2PLYP/VTZ results and several equilibrium SE structures previously determined at other levels, in particular B3LYP/SNSD and CCSD(T) with various basis sets, has put in evidence the accuracy and the remarkable stability of such model chemistry for both equilibrium structures and vibrational corrections. New SE equilibrium structures for phenylacetylene, pyruvic acid, peroxyformic acid, and phenyl radical are discussed and compared with literature data. Particular attention has been devoted to the discussion of systems for which lack of sufficient experimental data prevents a complete SE determination. In order to obtain an accurate equilibrium SE structure for these situations, the so-called templating molecule approach is discussed and generalized with respect to our previous work. Important applications are those involving biological building blocks, like uracil and thiouracil. In addition, for more general situations the linear regression approach has been proposed and validated.

  18. Structured association analysis leads to insight into Saccharomyces cerevisiae gene regulation by finding multiple contributing eQTL hotspots associated with functional gene modules.

    PubMed

    Curtis, Ross E; Kim, Seyoung; Woolford, John L; Xu, Wenjie; Xing, Eric P

    2013-03-21

    Association analysis using genome-wide expression quantitative trait locus (eQTL) data investigates the effect that genetic variation has on cellular pathways and leads to the discovery of candidate regulators. Traditional analysis of eQTL data via pairwise statistical significance tests or linear regression does not leverage the availability of the structural information of the transcriptome, such as presence of gene networks that reveal correlation and potentially regulatory relationships among the study genes. We employ a new eQTL mapping algorithm, GFlasso, which we have previously developed for sparse structured regression, to reanalyze a genome-wide yeast dataset. GFlasso fully takes into account the dependencies among expression traits to suppress false positives and to enhance the signal/noise ratio. Thus, GFlasso leverages the gene-interaction network to discover the pleiotropic effects of genetic loci that perturb the expression level of multiple (rather than individual) genes, which enables us to gain more power in detecting previously neglected signals that are marginally weak but pleiotropically significant. While eQTL hotspots in yeast have been reported previously as genomic regions controlling multiple genes, our analysis reveals additional novel eQTL hotspots and, more interestingly, uncovers groups of multiple contributing eQTL hotspots that affect the expression level of functional gene modules. To our knowledge, our study is the first to report this type of gene regulation stemming from multiple eQTL hotspots. Additionally, we report the results from in-depth bioinformatics analysis for three groups of these eQTL hotspots: ribosome biogenesis, telomere silencing, and retrotransposon biology. We suggest candidate regulators for the functional gene modules that map to each group of hotspots. Not only do we find that many of these candidate regulators contain mutations in the promoter and coding regions of the genes, in the case of the Ribi group, we provide experimental evidence suggesting that the identified candidates do regulate the target genes predicted by GFlasso. Thus, this structured association analysis of a yeast eQTL dataset via GFlasso, coupled with extensive bioinformatics analysis, discovers a novel regulation pattern between multiple eQTL hotspots and functional gene modules. Furthermore, this analysis demonstrates the potential of GFlasso as a powerful computational tool for eQTL studies that exploit the rich structural information among expression traits due to correlation, regulation, or other forms of biological dependencies.

  19. Beyond Multiple Regression: Using Commonality Analysis to Better Understand R[superscript 2] Results

    ERIC Educational Resources Information Center

    Warne, Russell T.

    2011-01-01

    Multiple regression is one of the most common statistical methods used in quantitative educational research. Despite the versatility and easy interpretability of multiple regression, it has some shortcomings in the detection of suppressor variables and for somewhat arbitrarily assigning values to the structure coefficients of correlated…

  20. The Variance Normalization Method of Ridge Regression Analysis.

    ERIC Educational Resources Information Center

    Bulcock, J. W.; And Others

    The testing of contemporary sociological theory often calls for the application of structural-equation models to data which are inherently collinear. It is shown that simple ridge regression, which is commonly used for controlling the instability of ordinary least squares regression estimates in ill-conditioned data sets, is not a legitimate…

  1. Spatio-Temporal Regression Based Clustering of Precipitation Extremes in a Presence of Systematically Missing Covariates

    NASA Astrophysics Data System (ADS)

    Kaiser, Olga; Martius, Olivia; Horenko, Illia

    2017-04-01

    Regression based Generalized Pareto Distribution (GPD) models are often used to describe the dynamics of hydrological threshold excesses relying on the explicit availability of all of the relevant covariates. But, in real application the complete set of relevant covariates might be not available. In this context, it was shown that under weak assumptions the influence coming from systematically missing covariates can be reflected by a nonstationary and nonhomogenous dynamics. We present a data-driven, semiparametric and an adaptive approach for spatio-temporal regression based clustering of threshold excesses in a presence of systematically missing covariates. The nonstationary and nonhomogenous behavior of threshold excesses is describes by a set of local stationary GPD models, where the parameters are expressed as regression models, and a non-parametric spatio-temporal hidden switching process. Exploiting nonparametric Finite Element time-series analysis Methodology (FEM) with Bounded Variation of the model parameters (BV) for resolving the spatio-temporal switching process, the approach goes beyond strong a priori assumptions made is standard latent class models like Mixture Models and Hidden Markov Models. Additionally, the presented FEM-BV-GPD provides a pragmatic description of the corresponding spatial dependence structure by grouping together all locations that exhibit similar behavior of the switching process. The performance of the framework is demonstrated on daily accumulated precipitation series over 17 different locations in Switzerland from 1981 till 2013 - showing that the introduced approach allows for a better description of the historical data.

  2. Adding thin-ideal internalization and impulsiveness to the cognitive-behavioral model of bulimic symptoms.

    PubMed

    Schnitzler, Caroline E; von Ranson, Kristin M; Wallace, Laurel M

    2012-08-01

    This study evaluated the cognitive-behavioral (CB) model of bulimia nervosa and an extension that included two additional maintaining factors - thin-ideal internalization and impulsiveness - in 327 undergraduate women. Participants completed measures of demographics, self-esteem, concern about shape and weight, dieting, bulimic symptoms, thin-ideal internalization, and impulsiveness. Both the original CB model and the extended model provided good fits to the data. Although structural equation modeling analyses suggested that the original CB model was most parsimonious, hierarchical regression analyses indicated that the additional variables accounted for significantly more variance. Additional analyses showed that the model fit could be improved by adding a path from concern about shape and weight, and deleting the path from dieting, to bulimic symptoms. Expanding upon the factors considered in the model may better capture the scope of variables maintaining bulimic symptoms in young women with a range of severity of bulimic symptoms. Copyright © 2012 Elsevier Ltd. All rights reserved.

  3. The importance of molecular structures, endpoints' values, and predictivity parameters in QSAR research: QSAR analysis of a series of estrogen receptor binders.

    PubMed

    Li, Jiazhong; Gramatica, Paola

    2010-11-01

    Quantitative structure-activity relationship (QSAR) methodology aims to explore the relationship between molecular structures and experimental endpoints, producing a model for the prediction of new data; the predictive performance of the model must be checked by external validation. Clearly, the qualities of chemical structure information and experimental endpoints, as well as the statistical parameters used to verify the external predictivity have a strong influence on QSAR model reliability. Here, we emphasize the importance of these three aspects by analyzing our models on estrogen receptor binders (Endocrine disruptor knowledge base (EDKB) database). Endocrine disrupting chemicals, which mimic or antagonize the endogenous hormones such as estrogens, are a hot topic in environmental and toxicological sciences. QSAR shows great values in predicting the estrogenic activity and exploring the interactions between the estrogen receptor and ligands. We have verified our previously published model for additional external validation on new EDKB chemicals. Having found some errors in the used 3D molecular conformations, we redevelop a new model using the same data set with corrected structures, the same method (ordinary least-square regression, OLS) and DRAGON descriptors. The new model, based on some different descriptors, is more predictive on external prediction sets. Three different formulas to calculate correlation coefficient for the external prediction set (Q2 EXT) were compared, and the results indicated that the new proposal of Consonni et al. had more reasonable results, consistent with the conclusions from regression line, Williams plot and root mean square error (RMSE) values. Finally, the importance of reliable endpoints values has been highlighted by comparing the classification assignments of EDKB with those of another estrogen receptor binders database (METI): we found that 16.1% assignments of the common compounds were opposite (20 among 124 common compounds). In order to verify the real assignments for these inconsistent compounds, we predicted these samples, as a blind external set, by our regression models and compared the results with the two databases. The results indicated that most of the predictions were consistent with METI. Furthermore, we built a kNN classification model using the 104 consistent compounds to predict those inconsistent ones, and most of the predictions were also in agreement with METI database.

  4. Network selection, Information filtering and Scalable computation

    NASA Astrophysics Data System (ADS)

    Ye, Changqing

    This dissertation explores two application scenarios of sparsity pursuit method on large scale data sets. The first scenario is classification and regression in analyzing high dimensional structured data, where predictors corresponds to nodes of a given directed graph. This arises in, for instance, identification of disease genes for the Parkinson's diseases from a network of candidate genes. In such a situation, directed graph describes dependencies among the genes, where direction of edges represent certain causal effects. Key to high-dimensional structured classification and regression is how to utilize dependencies among predictors as specified by directions of the graph. In this dissertation, we develop a novel method that fully takes into account such dependencies formulated through certain nonlinear constraints. We apply the proposed method to two applications, feature selection in large margin binary classification and in linear regression. We implement the proposed method through difference convex programming for the cost function and constraints. Finally, theoretical and numerical analyses suggest that the proposed method achieves the desired objectives. An application to disease gene identification is presented. The second application scenario is personalized information filtering which extracts the information specifically relevant to a user, predicting his/her preference over a large number of items, based on the opinions of users who think alike or its content. This problem is cast into the framework of regression and classification, where we introduce novel partial latent models to integrate additional user-specific and content-specific predictors, for higher predictive accuracy. In particular, we factorize a user-over-item preference matrix into a product of two matrices, each representing a user's preference and an item preference by users. Then we propose a likelihood method to seek a sparsest latent factorization, from a class of over-complete factorizations, possibly with a high percentage of missing values. This promotes additional sparsity beyond rank reduction. Computationally, we design methods based on a ``decomposition and combination'' strategy, to break large-scale optimization into many small subproblems to solve in a recursive and parallel manner. On this basis, we implement the proposed methods through multi-platform shared-memory parallel programming, and through Mahout, a library for scalable machine learning and data mining, for mapReduce computation. For example, our methods are scalable to a dataset consisting of three billions of observations on a single machine with sufficient memory, having good timings. Both theoretical and numerical investigations show that the proposed methods exhibit significant improvement in accuracy over state-of-the-art scalable methods.

  5. Effects of functional constraints and opportunism on the functional structure of a vertebrate predator assemblage.

    PubMed

    Farias, Ariel A; Jaksic, Fabian M

    2007-03-01

    1. Within mainstream ecological literature, functional structure has been viewed as resulting from the interplay of species interactions, resource levels and environmental variability. Classical models state that interspecific competition generates species segregation and guild formation in stable saturated environments, whereas opportunism causes species aggregation on abundant resources in variable unsaturated situations. 2. Nevertheless, intrinsic functional constraints may result in species-specific differences in resource-use capabilities. This could force some degree of functional structure without assuming other putative causes. However, the influence of such constraints has rarely been tested, and their relative contribution to observed patterns has not been quantified. 3. We used a multiple null-model approach to quantify the magnitude and direction (non-random aggregation or divergence) of the functional structure of a vertebrate predator assemblage exposed to variable prey abundance over an 18-year period. Observed trends were contrasted with predictions from null-models designed in an orthogonal fashion to account independently for the effects of functional constraints and opportunism. Subsequently, the unexplained variation was regressed against environmental variables to search for evidence of interspecific competition. 4. Overall, null-models accounting for functional constraints showed the best fit to the observed data, and suggested an effect of this factor in modulating predator opportunistic responses. However, regression models on residual variation indicated that such an effect was dependent on both total and relative abundance of principal (small mammals) and alternative (arthropods, birds, reptiles) prey categories. 5. In addition, no clear evidence for interspecific competition was found, but differential delays in predator functional responses could explain some of the unaccounted variation. Thus, we call for caution when interpreting empirical data in the context of classical models assuming synchronous responses of consumers to resource levels.

  6. The alarming problems of confounding equivalence using logistic regression models in the perspective of causal diagrams.

    PubMed

    Yu, Yuanyuan; Li, Hongkai; Sun, Xiaoru; Su, Ping; Wang, Tingting; Liu, Yi; Yuan, Zhongshang; Liu, Yanxun; Xue, Fuzhong

    2017-12-28

    Confounders can produce spurious associations between exposure and outcome in observational studies. For majority of epidemiologists, adjusting for confounders using logistic regression model is their habitual method, though it has some problems in accuracy and precision. It is, therefore, important to highlight the problems of logistic regression and search the alternative method. Four causal diagram models were defined to summarize confounding equivalence. Both theoretical proofs and simulation studies were performed to verify whether conditioning on different confounding equivalence sets had the same bias-reducing potential and then to select the optimum adjusting strategy, in which logistic regression model and inverse probability weighting based marginal structural model (IPW-based-MSM) were compared. The "do-calculus" was used to calculate the true causal effect of exposure on outcome, then the bias and standard error were used to evaluate the performances of different strategies. Adjusting for different sets of confounding equivalence, as judged by identical Markov boundaries, produced different bias-reducing potential in the logistic regression model. For the sets satisfied G-admissibility, adjusting for the set including all the confounders reduced the equivalent bias to the one containing the parent nodes of the outcome, while the bias after adjusting for the parent nodes of exposure was not equivalent to them. In addition, all causal effect estimations through logistic regression were biased, although the estimation after adjusting for the parent nodes of exposure was nearest to the true causal effect. However, conditioning on different confounding equivalence sets had the same bias-reducing potential under IPW-based-MSM. Compared with logistic regression, the IPW-based-MSM could obtain unbiased causal effect estimation when the adjusted confounders satisfied G-admissibility and the optimal strategy was to adjust for the parent nodes of outcome, which obtained the highest precision. All adjustment strategies through logistic regression were biased for causal effect estimation, while IPW-based-MSM could always obtain unbiased estimation when the adjusted set satisfied G-admissibility. Thus, IPW-based-MSM was recommended to adjust for confounders set.

  7. Comparing methods for estimation of heterogeneous treatment effects using observational data from health care databases.

    PubMed

    Wendling, T; Jung, K; Callahan, A; Schuler, A; Shah, N H; Gallego, B

    2018-06-03

    There is growing interest in using routinely collected data from health care databases to study the safety and effectiveness of therapies in "real-world" conditions, as it can provide complementary evidence to that of randomized controlled trials. Causal inference from health care databases is challenging because the data are typically noisy, high dimensional, and most importantly, observational. It requires methods that can estimate heterogeneous treatment effects while controlling for confounding in high dimensions. Bayesian additive regression trees, causal forests, causal boosting, and causal multivariate adaptive regression splines are off-the-shelf methods that have shown good performance for estimation of heterogeneous treatment effects in observational studies of continuous outcomes. However, it is not clear how these methods would perform in health care database studies where outcomes are often binary and rare and data structures are complex. In this study, we evaluate these methods in simulation studies that recapitulate key characteristics of comparative effectiveness studies. We focus on the conditional average effect of a binary treatment on a binary outcome using the conditional risk difference as an estimand. To emulate health care database studies, we propose a simulation design where real covariate and treatment assignment data are used and only outcomes are simulated based on nonparametric models of the real outcomes. We apply this design to 4 published observational studies that used records from 2 major health care databases in the United States. Our results suggest that Bayesian additive regression trees and causal boosting consistently provide low bias in conditional risk difference estimates in the context of health care database studies. Copyright © 2018 John Wiley & Sons, Ltd.

  8. Spatial Assessment of Model Errors from Four Regression Techniques

    Treesearch

    Lianjun Zhang; Jeffrey H. Gove; Jeffrey H. Gove

    2005-01-01

    Fomst modelers have attempted to account for the spatial autocorrelations among trees in growth and yield models by applying alternative regression techniques such as linear mixed models (LMM), generalized additive models (GAM), and geographicalIy weighted regression (GWR). However, the model errors are commonly assessed using average errors across the entire study...

  9. Regression Models For Multivariate Count Data

    PubMed Central

    Zhang, Yiwen; Zhou, Hua; Zhou, Jin; Sun, Wei

    2016-01-01

    Data with multivariate count responses frequently occur in modern applications. The commonly used multinomial-logit model is limiting due to its restrictive mean-variance structure. For instance, analyzing count data from the recent RNA-seq technology by the multinomial-logit model leads to serious errors in hypothesis testing. The ubiquity of over-dispersion and complicated correlation structures among multivariate counts calls for more flexible regression models. In this article, we study some generalized linear models that incorporate various correlation structures among the counts. Current literature lacks a treatment of these models, partly due to the fact that they do not belong to the natural exponential family. We study the estimation, testing, and variable selection for these models in a unifying framework. The regression models are compared on both synthetic and real RNA-seq data. PMID:28348500

  10. Regression Models For Multivariate Count Data.

    PubMed

    Zhang, Yiwen; Zhou, Hua; Zhou, Jin; Sun, Wei

    2017-01-01

    Data with multivariate count responses frequently occur in modern applications. The commonly used multinomial-logit model is limiting due to its restrictive mean-variance structure. For instance, analyzing count data from the recent RNA-seq technology by the multinomial-logit model leads to serious errors in hypothesis testing. The ubiquity of over-dispersion and complicated correlation structures among multivariate counts calls for more flexible regression models. In this article, we study some generalized linear models that incorporate various correlation structures among the counts. Current literature lacks a treatment of these models, partly due to the fact that they do not belong to the natural exponential family. We study the estimation, testing, and variable selection for these models in a unifying framework. The regression models are compared on both synthetic and real RNA-seq data.

  11. Changes of choroidal neovascularization in indocyanine green angiography after intravitreal ranibizumab injection.

    PubMed

    Lee, Ji Eun; Kim, Hyun Woong; Lee, Sang Joon; Lee, Joo Eun

    2015-05-01

    To investigate vascular structural changes of choroidal neovascularization (CNV) followed by intravitreal ranibizumab injections using indocyanine green angiography. A total of 31 patients with exudative age-related macular degeneration and CNV whose structures were identifiable in indocyanine green angiography were included. Ranibizumab was injected into the vitreous cavity once a month for 3 months and then as needed for the next 3 months prospectively. Indocyanine green angiography was performed at baseline, 3, and 6 months. Early to midphase images of the indocyanine green angiography in the details of vascular structure of the CNV were discerned the best were used in the image analysis. Vascular structures of CNV were described as arteriovenular and capillary components, and structural changes were assessed. Arteriovenular components were observed in 29 eyes (94%). Regression of the capillary components was observed in most cases. Although regression of arteriovenular component was noted in 14 eyes (48%), complete resolution was not observed. The eyes were categorized into 3 groups according to CNV structural changes: the regressed (Group R, 10 eyes, 31%), the matured (Group M, 7 eyes, 23%), and the growing (Group G, 14 eyes, 45%). In Group R, there was no regrowth of CNV found at 6 months. In Group M, distinct vascular structures were observed at 3 months and persisted without apparent changes at 6 months. In Group G, growth or reperfusion of capillary components from the persisting arteriovenular components was noted at 6 months. Both capillary and arteriovenular components were regressed during monthly ranibizumab injections. However, CNV regrowth was observed in a group of patients during the as-needed treatment phase.

  12. Interpreting Regression Results: beta Weights and Structure Coefficients are Both Important.

    ERIC Educational Resources Information Center

    Thompson, Bruce

    Various realizations have led to less frequent use of the "OVA" methods (analysis of variance--ANOVA--among others) and to more frequent use of general linear model approaches such as regression. However, too few researchers understand all the various coefficients produced in regression. This paper explains these coefficients and their…

  13. Normalization Regression Estimation With Application to a Nonorthogonal, Nonrecursive Model of School Learning.

    ERIC Educational Resources Information Center

    Bulcock, J. W.; And Others

    Advantages of normalization regression estimation over ridge regression estimation are demonstrated by reference to Bloom's model of school learning. Theoretical concern centered on the structure of scholastic achievement at grade 10 in Canadian high schools. Data on 886 students were randomly sampled from the Carnegie Human Resources Data Bank.…

  14. Regression analysis using dependent Polya trees.

    PubMed

    Schörgendorfer, Angela; Branscum, Adam J

    2013-11-30

    Many commonly used models for linear regression analysis force overly simplistic shape and scale constraints on the residual structure of data. We propose a semiparametric Bayesian model for regression analysis that produces data-driven inference by using a new type of dependent Polya tree prior to model arbitrary residual distributions that are allowed to evolve across increasing levels of an ordinal covariate (e.g., time, in repeated measurement studies). By modeling residual distributions at consecutive covariate levels or time points using separate, but dependent Polya tree priors, distributional information is pooled while allowing for broad pliability to accommodate many types of changing residual distributions. We can use the proposed dependent residual structure in a wide range of regression settings, including fixed-effects and mixed-effects linear and nonlinear models for cross-sectional, prospective, and repeated measurement data. A simulation study illustrates the flexibility of our novel semiparametric regression model to accurately capture evolving residual distributions. In an application to immune development data on immunoglobulin G antibodies in children, our new model outperforms several contemporary semiparametric regression models based on a predictive model selection criterion. Copyright © 2013 John Wiley & Sons, Ltd.

  15. Techniques for estimating peak-streamflow frequency for unregulated streams and streams regulated by small floodwater retarding structures in Oklahoma

    USGS Publications Warehouse

    Tortorelli, Robert L.

    1997-01-01

    Statewide regression equations for Oklahoma were determined for estimating peak discharge and flood frequency for selected recurrence intervals from 2 to 500 years for ungaged sites on natural unregulated streams. The most significant independent variables required to estimate peak-streamflow frequency for natural unregulated streams in Oklahoma are contributing drainage area, main-channel slope, and mean-annual precipitation. The regression equations are applicable for watersheds with drainage areas less than 2,510 square miles that are not affected by regulation from manmade works. Limitations on the use of the regression relations and the reliability of regression estimates for natural unregulated streams are discussed. Log-Pearson Type III analysis information, basin and climatic characteristics, and the peak-stream-flow frequency estimates for 251 gaging stations in Oklahoma and adjacent states are listed. Techniques are presented to make a peak-streamflow frequency estimate for gaged sites on natural unregulated streams and to use this result to estimate a nearby ungaged site on the same stream. For ungaged sites on urban streams, an adjustment of the statewide regression equations for natural unregulated streams can be used to estimate peak-streamflow frequency. For ungaged sites on streams regulated by small floodwater retarding structures, an adjustment of the statewide regression equations for natural unregulated streams can be used to estimate peak-streamflow frequency. The statewide regression equations are adjusted by substituting the drainage area below the floodwater retarding structures, or drainage area that represents the percentage of the unregulated basin, in the contributing drainage area parameter to obtain peak-streamflow frequency estimates.

  16. Field- and Remote Sensing-based Structural Attributes Measured at Multiple Scales Influence the Relationship Between Nitrogen and Reflectance of Forest Canopies

    NASA Astrophysics Data System (ADS)

    Sullivan, F.; Ollinger, S. V.; Palace, M. W.; Ouimette, A.; Sanders-DeMott, R.; Lepine, L. C.

    2017-12-01

    The correlation between near-infrared reflectance and forest canopy nitrogen concentration has been demonstrated at varying scales using a range of optical sensors on airborne and satellite platforms. Although the mechanism underpinning the relationship is unclear, at its basis are biologically-driven functional relationships of multiple plant traits that affect canopy chemistry and structure. The link between near-infrared reflectance and canopy nitrogen has been hypothesized to be partially driven by covariation of canopy nitrogen with canopy structure. In this study, we used a combination of airborne LiDAR data and field measured leaf and canopy chemical and structural traits to explore interrelationships between canopy nitrogen, near-infrared reflectance, and canopy structure on plots at Bartlett Experimental Forest in the White Mountain National Forest, New Hampshire. Over each plot, we developed a 1-meter resolution canopy height profile and a 1-meter resolution canopy height model. From canopy height profiles and canopy height models, we calculated a set of metrics describing the plot-level variability, breadth, depth, and arrangement of LiDAR returns. This combination of metrics was used to describe both vertical and horizontal variation in structure. In addition, we developed and measured several field-based metrics of leaf and canopy structure at the plot scale by directly measuring the canopy or by weighting leaf-level metrics by species leaf area contribution. We assessed relationships between leaf and structural metrics, near-infrared reflectance and canopy nitrogen concentration using multiple linear regression and mixed effects modeling. Consistent with our hypothesis, we found moderately strong links between both near-infrared reflectance and canopy nitrogen concentration with LiDAR-derived structural metrics, and we additionally found that leaf-level metrics scaled to the plot level share an important role in canopy reflectance. We suggest that canopy structure has a governing role in canopy reflectance, reducing maximum potential reflectance as structural complexity increases, and therefore also influences the relationship between canopy nitrogen and NIR reflectance.

  17. The development of reactive fuel grains for pyrophoric relight of in-space hybrid rocket thrusters

    NASA Astrophysics Data System (ADS)

    Steiner, Matthew Wellington

    This study presents and investigates a novel hybrid fuel grain that reacts pyrophorically with gaseous oxidizer to achieve restart of a hybrid rocket motor propulsion system while reducing cost and handling concerns. This reactive fuel grain (RFG) relies on the pyrophoric nature of finely divided metal particles dispersed in a solid dicyclopentadiene (DCPD) binder, which has been shown to encapsulate air-sensitive additives until they are exposed to combustion gases. An RFG is thus effectively inert in open air in the absence of an ignition source, though the particles encapsulated within remain pyrophoric. In practice, this means that an RFG that is ignited in the vacuum of space and then extinguished will expose unoxidized pyrophoric particles, which can be used to generate sufficient heat to relight the propellant when oxidizer is flowed. The experiments outlined in this work aim to develop a suitable pyrophoric material for use in an RFG, demonstrate pyrophoric relight, and characterize performance under conditions relevant to a hybrid rocket thruster. Magnesium, lithium, calcium, and an alloy of titanium, chromium, and manganese (TiCrMn) were investigated to determine suitability of pure metals as RFG additives. Additionally, aluminum hydride (AlH3), lithium aluminum hydride (LiAlH4), lithium borohydride (LiBH4), and magnesium hydride (MgH2) were investigated to determine suitability of metals hydrides as RFG additives or as precursors for pure-metal RFG additives. Pyrophoric metals have been previously investigated as additives for increasing the regression rate of hybrid fuels, but to the author's knowledge, these materials have not been specifically investigated for their ability to ignite a propellant pyrophorically. Commercial research-grade metals were obtained as coarse powders, then ball-milled to attempt to reduce particle size below a critical diameter needed for pyrophoricity. Magnesium hydride was ball-milled and then cycled in a hydride cycling apparatus to attempt to fracture the particles through hydrogen sorption and thermal stresses. These powders were then tested for pyrophoricity with atmospheric and pure concentrations of oxygen. The TiCrMn powder was chosen as the material for evaluation of propellant performance, and was mixed with DCPD in various weight ratios to determine the required additive loading needed for pyrophoricity of the bulk propellant. Weight percentages of 10, 20, 30, and 50 wt.% TiCrMn were used to evaluate relight capability and propellant performance, and weight loadings of 50, 70, and 90 wt.% TiCrMn were used to evaluate approximate maximum loading possible without rendering the propellant structurally unsound. Propellant tests were conducted in an opposed flow burner apparatus for sub-scale regression rate and relight experiments, and an optically accessible cylindrical combustion chamber (OCC) that allows high speed cameras to record the regressing propellant surface during combustion. Gaseous oxygen (GOX) was used as an oxidizer for all tests due to its ready availability and common use as a hybrid rocket oxidizer. Opposed flow burner experiments are an inexpensive means of rapidly testing various propellant formulations at different conditions, whereas OCC tests are useful for obtaining realistic data on how an RFG would likely operate as part of a propulsion system. Relight in the opposed flow burner was attempted by cycling oxygen and nitrogen flows with carefully timed solenoid valves to initiate and extinguish combustion, and to control the slow diffusion of oxygen to the surface of the propellant, which would render the TiCrMn non-pyrophoric. The opposed flow burner experiments did not conclusively demonstrate the pyrophoric relight capability of the RFG propellant due in part to the persistence of hot spots between oxygen and purge nitrogen cycles, as determined by high-speed imaging in the near infrared range. An opposed flow burner apparatus was then constructed within a vacuum chamber assembly thus preventing atmospheric oxygen from diffusing to the propellant surface, but these tests did not demonstrate pyrophoric relight. Future work is proposed to evaluate the effect of pyrophoric particle size in order to determine the role ignition delay of each particle has in the relight capability of RFGs. OCC experiments were conducted at a low and high GOX mass flux of approximately 150 and 300 kg/s/m2, respectively, at a nominal chamber pressure of 150 psia. Four strand compositions were used: pure DCPD, 30 wt.% pyrophoric TiCrMn powder with average particle diameters of approximately 1-10 microns, 30 wt.% oxidized TiCrMn powder with average particle diameters of approximately 1-10 microns, and 30 wt.% TiCrMn powder with average particle diameters of approximately 1-4 mm. Regression rate was measure by weight loss, average web thickness change at three axial locations on the strand, and through time-resolved tracking of the regressing propellant surface via high speed video. While visual observations suggest that the addition of TiCrMn significantly increases regression rate, initial data do not show a significant trend. Additionally, it is observed that the oxidized TiCrMn strands regress at the same rate as those loaded with pyrophoric TiCrMn, suggesting that erosive burning and heat addition of the added metal may be the cause of the observed increase in regression rate. The data are too sparse to make conclusions about the effect of particle size on regression rate, so further tests are recommended to develop a significant data set for the effect of pyrophoricity and particle size on regression rate. The test article was damaged at the end of the regression rate experimental campaign, which precluded the collection of relight data that was planned for strands loaded with 50 wt.% TiCrMn particles with an average diameter of approximately 1-4 mm. Though further tests are needed to demonstrate pyrophoric relight of an RFG, the current work establishes a baseline for RFG performance and suggests that pyrophoric relight is possible by tailoring the particle size of the pyrophoric metal additive to control heat release and ignition delay.

  18. Estimation of genetic effects in the presence of multicollinearity in multibreed beef cattle evaluation.

    PubMed

    Roso, V M; Schenkel, F S; Miller, S P; Schaeffer, L R

    2005-08-01

    Breed additive, dominance, and epistatic loss effects are of concern in the genetic evaluation of a multibreed population. Multiple regression equations used for fitting these effects may show a high degree of multicollinearity among predictor variables. Typically, when strong linear relationships exist, the regression coefficients have large SE and are sensitive to changes in the data file and to the addition or deletion of variables in the model. Generalized ridge regression methods were applied to obtain stable estimates of direct and maternal breed additive, dominance, and epistatic loss effects in the presence of multicollinearity among predictor variables. Preweaning weight gains of beef calves in Ontario, Canada, from 1986 to 1999 were analyzed. The genetic model included fixed direct and maternal breed additive, dominance, and epistatic loss effects, fixed environmental effects of age of the calf, contemporary group, and age of the dam x sex of the calf, random additive direct and maternal genetic effects, and random maternal permanent environment effect. The degree and the nature of the multicollinearity were identified and ridge regression methods were used as an alternative to ordinary least squares (LS). Ridge parameters were obtained using two different objective methods: 1) generalized ridge estimator of Hoerl and Kennard (R1); and 2) bootstrap in combination with cross-validation (R2). Both ridge regression methods outperformed the LS estimator with respect to mean squared error of predictions (MSEP) and variance inflation factors (VIF) computed over 100 bootstrap samples. The MSEP of R1 and R2 were similar, and they were 3% less than the MSEP of LS. The average VIF of LS, R1, and R2 were equal to 26.81, 6.10, and 4.18, respectively. Ridge regression methods were particularly effective in decreasing the multicollinearity involving predictor variables of breed additive effects. Because of a high degree of confounding between estimates of maternal dominance and direct epistatic loss effects, it was not possible to compare the relative importance of these effects with a high level of confidence. The inclusion of epistatic loss effects in the additive-dominance model did not cause noticeable reranking of sires, dams, and calves based on across-breed EBV. More precise estimates of breed effects as a result of this study may result in more stable across-breed estimated breeding values over the years.

  19. Regression Levels of Selected Affective Factors on Science Achievement: A Structural Equation Model with TIMSS 2011 Data

    ERIC Educational Resources Information Center

    Akilli, Mustafa

    2015-01-01

    The aim of this study is to demonstrate the science success regression levels of chosen emotional features of 8th grade students using Structural Equation Model. The study was conducted by the analysis of students' questionnaires and science success in TIMSS 2011 data using SEM. Initially, the factors that are thought to have an effect on science…

  20. Evaluating the efficacy of distributed detention structures to reduce downstream flooding under variable rainfall, antecedent soil, and structural storage conditions

    NASA Astrophysics Data System (ADS)

    Thomas, Nicholas W.; Arenas Amado, Antonio; Schilling, Keith E.; Weber, Larry J.

    2016-10-01

    This research systematically analyzed the influence of antecedent soil wetness, rainfall depth, and the subsequent impact on peak flows in a 45 km2 watershed. Peak flows increased with increasing antecedent wetness and rainfall depth, with the highest peak flows occurring under intense precipitation on wet soils. Flood mitigation structures were included and investigated under full and empty initial storage conditions. Peak flows were reduced at the outlet of the watershed by 3-17%. The highest peak flow reductions occurred in scenarios with dry soil, empty project storage, and low rainfall depths. These analyses showed that with increased rainfall depth, antecedent moisture conditions became increasingly less impactful. Scaling invariance of peak discharges were shown to hold true within this basin and were fit through ordinary least squares regression for each design scenario. Scale-invariance relationships were extrapolated beyond the outlet of the analyzed basin to the point of intersection of with and without structure scenarios. In each scenario extrapolated peak discharge benefits depreciated at a drainage area of approximately 100 km2. The associated drainage area translated to roughly 2 km downstream of the Beaver Creek watershed outlet. This work provides an example of internal watershed benefits of structural flood mitigation efforts, and the impact the may exert outside of the basin. Additionally, the influence of 1.8 million in flood reduction tools was not sufficient to routinely address downstream flood concerns, shedding light on the additional investment required to alter peak flows in large basins.

  1. Psychometric Properties of the Dutch Depression Stigma Scale (DSS) and Associations with Personal and Perceived Stigma in a Depressed and Community Sample

    PubMed Central

    Cuijpers, P.; Griffiths, K. M.; Kleiboer, A. M.

    2016-01-01

    Background Research on depression stigma is needed to gain more insight into the underlying construct and to reduce the level of stigma in the community. However, few validated measurements of depression stigma are available in the Netherlands. Therefore, this study first sought to examine the psychometric properties of the Dutch translation of the Depression Stigma Scale (DSS). Second, we examined which demographic (gender, age, education, partner status) and other variables (anxiety and knowledge of depression) are associated with personal and perceived stigma within these samples. Methods The study population consisted of an adult convenience sample (n = 253) (study 1) and a community adult sample with elevated depressive symptoms (n = 264) (study 2). Factor structure, internal consistency, and validity were assessed. The associations between stigma, demographic variables and anxiety level were examined with regression analyses. Results Confirmatory factor analysis supported the validity and internal consistency of the DSS personal stigma scale. Internal consistency was sufficient (Cronbach’s alpha = .70 (study 1) and .77 (study 2)). The results regarding the perceived stigma scale revealed no clear factor structure. Regression analyses showed that personal stigma was higher in younger people, those with no experience with depression, and those with lower education. Conclusions This study established the validity and internal consistency of the DSS personal scale in the Netherlands, in a community sample and in people with elevated depressive symptoms. However, additional research is needed to examine the factor structure of the DSS perceived scale and its use in other samples. PMID:27500969

  2. Quantitative structure-retention relationship studies using immobilized artificial membrane chromatography I: amended linear solvation energy relationships with the introduction of a molecular electronic factor.

    PubMed

    Li, Jie; Sun, Jin; Cui, Shengmiao; He, Zhonggui

    2006-11-03

    Linear solvation energy relationships (LSERs) amended by the introduction of a molecular electronic factor were employed to establish quantitative structure-retention relationships using immobilized artificial membrane (IAM) chromatography, in particular ionizable solutes. The chromatographic indices, log k(IAM), were determined by HPLC on an IAM.PC.DD2 column for 53 structurally diverse compounds, including neutral, acidic and basic compounds. Unlike neutral compounds, the IAM chromatographic retention of ionizable compounds was affected by their molecular charge state. When the mean net charge per molecule (delta) was introduced into the amended LSER as the sixth variable, the LSER regression coefficient was significantly improved for the test set including ionizable solutes. The delta coefficients of acidic and basic compounds were quite different indicating that the molecular electronic factor had a markedly different impact on the retention of acidic and basic compounds on IAM column. Ionization of acidic compounds containing a carboxylic group tended to impair their retention on IAM, while the ionization of basic compounds did not have such a marked effect. In addition, the extra-interaction with the polar head of phospholipids might cause a certain change in the retention of basic compounds. A comparison of calculated and experimental retention indices suggested that the semi-empirical LSER amended by the addition of a molecular electronic factor was able to reproduce adequately the experimental retention factors of the structurally diverse solutes investigated.

  3. Distributed collaborative probabilistic design of multi-failure structure with fluid-structure interaction using fuzzy neural network of regression

    NASA Astrophysics Data System (ADS)

    Song, Lu-Kai; Wen, Jie; Fei, Cheng-Wei; Bai, Guang-Chen

    2018-05-01

    To improve the computing efficiency and precision of probabilistic design for multi-failure structure, a distributed collaborative probabilistic design method-based fuzzy neural network of regression (FR) (called as DCFRM) is proposed with the integration of distributed collaborative response surface method and fuzzy neural network regression model. The mathematical model of DCFRM is established and the probabilistic design idea with DCFRM is introduced. The probabilistic analysis of turbine blisk involving multi-failure modes (deformation failure, stress failure and strain failure) was investigated by considering fluid-structure interaction with the proposed method. The distribution characteristics, reliability degree, and sensitivity degree of each failure mode and overall failure mode on turbine blisk are obtained, which provides a useful reference for improving the performance and reliability of aeroengine. Through the comparison of methods shows that the DCFRM reshapes the probability of probabilistic analysis for multi-failure structure and improves the computing efficiency while keeping acceptable computational precision. Moreover, the proposed method offers a useful insight for reliability-based design optimization of multi-failure structure and thereby also enriches the theory and method of mechanical reliability design.

  4. A covariance correction that accounts for correlation estimation to improve finite-sample inference with generalized estimating equations: A study on its applicability with structured correlation matrices

    PubMed Central

    Westgate, Philip M.

    2016-01-01

    When generalized estimating equations (GEE) incorporate an unstructured working correlation matrix, the variances of regression parameter estimates can inflate due to the estimation of the correlation parameters. In previous work, an approximation for this inflation that results in a corrected version of the sandwich formula for the covariance matrix of regression parameter estimates was derived. Use of this correction for correlation structure selection also reduces the over-selection of the unstructured working correlation matrix. In this manuscript, we conduct a simulation study to demonstrate that an increase in variances of regression parameter estimates can occur when GEE incorporates structured working correlation matrices as well. Correspondingly, we show the ability of the corrected version of the sandwich formula to improve the validity of inference and correlation structure selection. We also study the relative influences of two popular corrections to a different source of bias in the empirical sandwich covariance estimator. PMID:27818539

  5. A covariance correction that accounts for correlation estimation to improve finite-sample inference with generalized estimating equations: A study on its applicability with structured correlation matrices.

    PubMed

    Westgate, Philip M

    2016-01-01

    When generalized estimating equations (GEE) incorporate an unstructured working correlation matrix, the variances of regression parameter estimates can inflate due to the estimation of the correlation parameters. In previous work, an approximation for this inflation that results in a corrected version of the sandwich formula for the covariance matrix of regression parameter estimates was derived. Use of this correction for correlation structure selection also reduces the over-selection of the unstructured working correlation matrix. In this manuscript, we conduct a simulation study to demonstrate that an increase in variances of regression parameter estimates can occur when GEE incorporates structured working correlation matrices as well. Correspondingly, we show the ability of the corrected version of the sandwich formula to improve the validity of inference and correlation structure selection. We also study the relative influences of two popular corrections to a different source of bias in the empirical sandwich covariance estimator.

  6. Flood quantile estimation at ungauged sites by Bayesian networks

    NASA Astrophysics Data System (ADS)

    Mediero, L.; Santillán, D.; Garrote, L.

    2012-04-01

    Estimating flood quantiles at a site for which no observed measurements are available is essential for water resources planning and management. Ungauged sites have no observations about the magnitude of floods, but some site and basin characteristics are known. The most common technique used is the multiple regression analysis, which relates physical and climatic basin characteristic to flood quantiles. Regression equations are fitted from flood frequency data and basin characteristics at gauged sites. Regression equations are a rigid technique that assumes linear relationships between variables and cannot take the measurement errors into account. In addition, the prediction intervals are estimated in a very simplistic way from the variance of the residuals in the estimated model. Bayesian networks are a probabilistic computational structure taken from the field of Artificial Intelligence, which have been widely and successfully applied to many scientific fields like medicine and informatics, but application to the field of hydrology is recent. Bayesian networks infer the joint probability distribution of several related variables from observations through nodes, which represent random variables, and links, which represent causal dependencies between them. A Bayesian network is more flexible than regression equations, as they capture non-linear relationships between variables. In addition, the probabilistic nature of Bayesian networks allows taking the different sources of estimation uncertainty into account, as they give a probability distribution as result. A homogeneous region in the Tagus Basin was selected as case study. A regression equation was fitted taking the basin area, the annual maximum 24-hour rainfall for a given recurrence interval and the mean height as explanatory variables. Flood quantiles at ungauged sites were estimated by Bayesian networks. Bayesian networks need to be learnt from a huge enough data set. As observational data are reduced, a stochastic generator of synthetic data was developed. Synthetic basin characteristics were randomised, keeping the statistical properties of observed physical and climatic variables in the homogeneous region. The synthetic flood quantiles were stochastically generated taking the regression equation as basis. The learnt Bayesian network was validated by the reliability diagram, the Brier Score and the ROC diagram, which are common measures used in the validation of probabilistic forecasts. Summarising, the flood quantile estimations through Bayesian networks supply information about the prediction uncertainty as a probability distribution function of discharges is given as result. Therefore, the Bayesian network model has application as a decision support for water resources and planning management.

  7. The Collinearity Free and Bias Reduced Regression Estimation Project: The Theory of Normalization Ridge Regression. Report No. 2.

    ERIC Educational Resources Information Center

    Bulcock, J. W.; And Others

    Multicollinearity refers to the presence of highly intercorrelated independent variables in structural equation models, that is, models estimated by using techniques such as least squares regression and maximum likelihood. There is a problem of multicollinearity in both the natural and social sciences where theory formulation and estimation is in…

  8. Application and interpretation of functional data analysis techniques to differential scanning calorimetry data from lupus patients.

    PubMed

    Kendrick, Sarah K; Zheng, Qi; Garbett, Nichola C; Brock, Guy N

    2017-01-01

    DSC is used to determine thermally-induced conformational changes of biomolecules within a blood plasma sample. Recent research has indicated that DSC curves (or thermograms) may have different characteristics based on disease status and, thus, may be useful as a monitoring and diagnostic tool for some diseases. Since thermograms are curves measured over a range of temperature values, they are considered functional data. In this paper we apply functional data analysis techniques to analyze differential scanning calorimetry (DSC) data from individuals from the Lupus Family Registry and Repository (LFRR). The aim was to assess the effect of lupus disease status as well as additional covariates on the thermogram profiles, and use FD analysis methods to create models for classifying lupus vs. control patients on the basis of the thermogram curves. Thermograms were collected for 300 lupus patients and 300 controls without lupus who were matched with diseased individuals based on sex, race, and age. First, functional regression with a functional response (DSC) and categorical predictor (disease status) was used to determine how thermogram curve structure varied according to disease status and other covariates including sex, race, and year of birth. Next, functional logistic regression with disease status as the response and functional principal component analysis (FPCA) scores as the predictors was used to model the effect of thermogram structure on disease status prediction. The prediction accuracy for patients with Osteoarthritis and Rheumatoid Arthritis but without Lupus was also calculated to determine the ability of the classifier to differentiate between Lupus and other diseases. Data were divided 1000 times into separate 2/3 training and 1/3 test data for evaluation of predictions. Finally, derivatives of thermogram curves were included in the models to determine whether they aided in prediction of disease status. Functional regression with thermogram as a functional response and disease status as predictor showed a clear separation in thermogram curve structure between cases and controls. The logistic regression model with FPCA scores as the predictors gave the most accurate results with a mean 79.22% correct classification rate with a mean sensitivity = 79.70%, and specificity = 81.48%. The model correctly classified OA and RA patients without Lupus as controls at a rate of 75.92% on average with a mean sensitivity = 79.70% and specificity = 77.6%. Regression models including FPCA scores for derivative curves did not perform as well, nor did regression models including covariates. Changes in thermograms observed in the disease state likely reflect covalent modifications of plasma proteins or changes in large protein-protein interacting networks resulting in the stabilization of plasma proteins towards thermal denaturation. By relating functional principal components from thermograms to disease status, our Functional Principal Component Analysis model provides results that are more easily interpretable compared to prior studies. Further, the model could also potentially be coupled with other biomarkers to improve diagnostic classification for lupus.

  9. A short generic measure of work stress in the era of globalization: effort-reward imbalance.

    PubMed

    Siegrist, Johannes; Wege, Natalia; Pühlhofer, Frank; Wahrendorf, Morten

    2009-08-01

    We evaluate psychometric properties of a short version of the original effort-reward imbalance (ERI) questionnaire. This measure is of interest in the context of assessing stressful work conditions in the era of economic globalization. In a representative sample of 10,698 employed men and women participating in the longitudinal Socio-Economic Panel (SOEP) in Germany, a short version of the ERI questionnaire was included in the 2006 panel wave. Structural equation modeling and logistic regression analysis were applied. In addition to satisfactory internal consistency of scales, a model representing the theoretical structure of the scales provided the best data fit in a competitive test (RMSEA = 0.059, CAIC = 4124.19). Scoring high on the ERI scales was associated with elevated risks of poor self-rated health. This short version of the ERI questionnaire reveals satisfactory psychometric properties, and can be recommended for further use in research and practice.

  10. Fuzzy classifier based support vector regression framework for Poisson ratio determination

    NASA Astrophysics Data System (ADS)

    Asoodeh, Mojtaba; Bagheripour, Parisa

    2013-09-01

    Poisson ratio is considered as one of the most important rock mechanical properties of hydrocarbon reservoirs. Determination of this parameter through laboratory measurement is time, cost, and labor intensive. Furthermore, laboratory measurements do not provide continuous data along the reservoir intervals. Hence, a fast, accurate, and inexpensive way of determining Poisson ratio which produces continuous data over the whole reservoir interval is desirable. For this purpose, support vector regression (SVR) method based on statistical learning theory (SLT) was employed as a supervised learning algorithm to estimate Poisson ratio from conventional well log data. SVR is capable of accurately extracting the implicit knowledge contained in conventional well logs and converting the gained knowledge into Poisson ratio data. Structural risk minimization (SRM) principle which is embedded in the SVR structure in addition to empirical risk minimization (EMR) principle provides a robust model for finding quantitative formulation between conventional well log data and Poisson ratio. Although satisfying results were obtained from an individual SVR model, it had flaws of overestimation in low Poisson ratios and underestimation in high Poisson ratios. These errors were eliminated through implementation of fuzzy classifier based SVR (FCBSVR). The FCBSVR significantly improved accuracy of the final prediction. This strategy was successfully applied to data from carbonate reservoir rocks of an Iranian Oil Field. Results indicated that SVR predicted Poisson ratio values are in good agreement with measured values.

  11. Multilevel regularized regression for simultaneous taxa selection and network construction with metagenomic count data.

    PubMed

    Liu, Zhenqiu; Sun, Fengzhu; Braun, Jonathan; McGovern, Dermot P B; Piantadosi, Steven

    2015-04-01

    Identifying disease associated taxa and constructing networks for bacteria interactions are two important tasks usually studied separately. In reality, differentiation of disease associated taxa and correlation among taxa may affect each other. One genus can be differentiated because it is highly correlated with another highly differentiated one. In addition, network structures may vary under different clinical conditions. Permutation tests are commonly used to detect differences between networks in distinct phenotypes, and they are time-consuming. In this manuscript, we propose a multilevel regularized regression method to simultaneously identify taxa and construct networks. We also extend the framework to allow construction of a common network and differentiated network together. An efficient algorithm with dual formulation is developed to deal with the large-scale n ≪ m problem with a large number of taxa (m) and a small number of samples (n) efficiently. The proposed method is regularized with a general Lp (p ∈ [0, 2]) penalty and models the effects of taxa abundance differentiation and correlation jointly. We demonstrate that it can identify both true and biologically significant genera and network structures. Software MLRR in MATLAB is available at http://biostatistics.csmc.edu/mlrr/. Supplementary data are available at Bioinformatics online. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  12. Multidimensional Predictors of Fatigue among Octogenarians and Centenarians

    PubMed Central

    Cho, Jinmyoung; Martin, Peter; Margrett, Jennifer; MacDonald, Maurice; Johnson, Mary Ann; Poon, Leonard W.

    2012-01-01

    Background Fatigue is a common and frequently observed complaint among older adults. However, knowledge about the nature and correlates of fatigue in old age is very limited. Objective: This study examined the relationship of functional indicators, psychological and situational factors and fatigue for 210 octogenarians and centenarians from the Georgia Centenarian Study. Methods Three indicators of functional capacity (self-rated health, instrumental activities of daily living, physical activities of daily living), two indicators of psychological well-being (positive and negative affect), two indicators of situational factors (social network and social support), and a multidimensional fatigue scale were used. Blocked multiple regression analyses were computed to examine significant factors related to fatigue. In addition, multi-group analysis in structural equation modeling was used to investigate residential differences (i.e., long-term care facilities vs. private homes) in the relationship between significant factors and fatigue. Results Blocked multiple regression analyses indicated that two indicators of functional capacity, self-rated health and instrumental activities of daily living, both positive and negative affect, and social support were significant predictors of fatigue among oldest-old adults. The multiple group analysis in structural equation modeling revealed a significant difference among oldest-old adults based on residential status. Conclusion The results suggest that we should not consider fatigue as merely an unpleasant physical symptom, but rather adopt a perspective that different factors such as psychosocial aspects can influence fatigue in advanced later life. PMID:22094445

  13. Multidimensional predictors of fatigue among octogenarians and centenarians.

    PubMed

    Cho, Jinmyoung; Martin, Peter; Margrett, Jennifer; MacDonald, Maurice; Johnson, Mary Ann; Poon, Leonard W; Jazwinski, S M; Green, R C; Gearing, M; Woodard, J L; Tenover, J S; Siegler, I C; Rott, C; Rodgers, W L; Hausman, D; Arnold, J; Davey, A

    2012-01-01

    Fatigue is a common and frequently observed complaint among older adults. However, knowledge about the nature and correlates of fatigue in old age is very limited. This study examined the relationship of functional indicators, psychological and situational factors and fatigue for 210 octogenarians and centenarians from the Georgia Centenarian Study. Three indicators of functional capacity (self-rated health, instrumental activities of daily living, physical activities of daily living), two indicators of psychological well-being (positive and negative affect), two indicators of situational factors (social network and social support), and a multidimensional fatigue scale were used. Blocked multiple regression analyses were computed to examine significant factors related to fatigue. In addition, multi-group analysis in structural equation modeling was used to investigate residential differences (i.e., long-term care facilities vs. private homes) in the relationship between significant factors and fatigue. Blocked multiple regression analyses indicated that two indicators of functional capacity, self-rated health and instrumental activities of daily living, both positive and negative affect, and social support were significant predictors of fatigue among oldest-old adults. The multiple group analysis in structural equation modeling revealed a significant difference among oldest-old adults based on residential status. The results suggest that we should not consider fatigue as merely an unpleasant physical symptom, but rather adopt a perspective that different factors such as psychosocial aspects can influence fatigue in advanced later life. Copyright © 2011 S. Karger AG, Basel.

  14. Quality of care and patient satisfaction in hospitals with high concentrations of black patients.

    PubMed

    Brooks-Carthon, J Margo; Kutney-Lee, Ann; Sloane, Douglas M; Cimiotti, Jeannie P; Aiken, Linda H

    2011-09-01

    To examine the influence of nursing-specifically nurse staffing and the nurse work environment-on quality of care and patient satisfaction in hospitals with varying concentrations of Black patients. Cross-sectional secondary analysis of 2006-2007 nurse survey data collected across four states (Florida, Pennsylvania, New Jersey, and California), the Hospital Consumer Assessment of Healthcare Providers and Systems survey, and administrative data. Global analysis of variance and linear regression models were used to examine the association between the concentration of Black patients on quality measures (readiness for discharge, patient or family complaints, health care-associated infections) and patient satisfaction, before and after accounting for nursing and hospital characteristics. Nurses working in hospitals with higher concentrations of Blacks reported poorer confidence in patients' readiness for discharge and more frequent complaints and infections. Patients treated in hospitals with higher concentrations of Blacks were less satisfied with their care. In the fully adjusted regression models for quality and patient satisfaction outcomes, the effects associated with the concentration of Blacks were explained in part by nursing and structural hospital characteristics. This study demonstrates a relationship between nursing, structural hospital characteristics, quality of care, and patient satisfaction in hospitals with high concentrations of Black patients. Consideration of nursing factors, in addition to other important hospital characteristics, is critical to understanding and improving quality of care and patient satisfaction in minority-serving hospitals. © 2011 Sigma Theta Tau International.

  15. Assessing mediation using marginal structural models in the presence of confounding and moderation

    PubMed Central

    Coffman, Donna L.; Zhong, Wei

    2012-01-01

    This paper presents marginal structural models (MSMs) with inverse propensity weighting (IPW) for assessing mediation. Generally, individuals are not randomly assigned to levels of the mediator. Therefore, confounders of the mediator and outcome may exist that limit causal inferences, a goal of mediation analysis. Either regression adjustment or IPW can be used to take confounding into account, but IPW has several advantages. Regression adjustment of even one confounder of the mediator and outcome that has been influenced by treatment results in biased estimates of the direct effect (i.e., the effect of treatment on the outcome that does not go through the mediator). One advantage of IPW is that it can properly adjust for this type of confounding, assuming there are no unmeasured confounders. Further, we illustrate that IPW estimation provides unbiased estimates of all effects when there is a baseline moderator variable that interacts with the treatment, when there is a baseline moderator variable that interacts with the mediator, and when the treatment interacts with the mediator. IPW estimation also provides unbiased estimates of all effects in the presence of non-randomized treatments. In addition, for testing mediation we propose a test of the null hypothesis of no mediation. Finally, we illustrate this approach with an empirical data set in which the mediator is continuous, as is often the case in psychological research. PMID:22905648

  16. Assessing mediation using marginal structural models in the presence of confounding and moderation.

    PubMed

    Coffman, Donna L; Zhong, Wei

    2012-12-01

    This article presents marginal structural models with inverse propensity weighting (IPW) for assessing mediation. Generally, individuals are not randomly assigned to levels of the mediator. Therefore, confounders of the mediator and outcome may exist that limit causal inferences, a goal of mediation analysis. Either regression adjustment or IPW can be used to take confounding into account, but IPW has several advantages. Regression adjustment of even one confounder of the mediator and outcome that has been influenced by treatment results in biased estimates of the direct effect (i.e., the effect of treatment on the outcome that does not go through the mediator). One advantage of IPW is that it can properly adjust for this type of confounding, assuming there are no unmeasured confounders. Further, we illustrate that IPW estimation provides unbiased estimates of all effects when there is a baseline moderator variable that interacts with the treatment, when there is a baseline moderator variable that interacts with the mediator, and when the treatment interacts with the mediator. IPW estimation also provides unbiased estimates of all effects in the presence of nonrandomized treatments. In addition, for testing mediation we propose a test of the null hypothesis of no mediation. Finally, we illustrate this approach with an empirical data set in which the mediator is continuous, as is often the case in psychological research. PsycINFO Database Record (c) 2013 APA, all rights reserved.

  17. CALCINOSIS CIRCUMSCRIPTA IN A COHORT OF RELATED JUVENILE AFRICAN LIONS (PANTHERA LEO).

    PubMed

    Bauer, Kendra L; Sander, Samantha J; Steeil, James C; Walsh, Timothy F; Neiffer, Donald L

    2017-09-01

    Three juvenile, genetically related African lions (Panthera leo) were evaluated for discrete dome-shaped subcutaneous masses present over the proximal lateral metatarsal-tarsal area. The lesions measured 3-8 cm in diameter, were fluctuant to firm, nonulcerated, and attached to underlying structures. On radiographic evaluation, the lesions were characterized by well-circumscribed punctate mineralizations in the soft tissue surrounded by soft tissue swelling without evidence of adjacent bony involvement. On cut surface, the lesions were made of numerous loculi containing 2-5-mm round-to-ovoid, white-to-gray, firm structures interspersed with fibrous tissue and pockets of serosanguinous fluid. Hematology, serum biochemistry, serum thyroid screening (including total thyroxine, total triiodothyronine, free thyroxine, and free triiodothyronine), and serum vitamin D panels (including parathyroid hormone, ionized calcium, and 25-hydroxyvitamin D) were unremarkable. Histopathologic evaluation of the lesions was consistent with calcinosis circumscripta with fibroplasia, chronic inflammation, and seroma formation. An additional two genetically related lions were considered suspect for calcinosis circumscripta based on presentation, exam findings, and similarity to the confirmed cases. All masses self-regressed and were not associated with additional clinical signs other than initial lameness in two cases.

  18. A Classroom Game on a Negative Externality Correcting Tax: Revenue Return, Regressivity, and the Double Dividend

    ERIC Educational Resources Information Center

    Duke, Joshua M.; Sassoon, David M.

    2017-01-01

    The concept of negative externality is central to the teaching of environmental economics, but corrective taxes are almost always regressive. How exactly might governments return externality-correcting tax revenue to overcome regressivity and not alter marginal incentives? In addition, there is a desire to achieve a double dividend in the use of…

  19. Multi-temporal LiDAR and Landsat quantification of fire-induced changes to forest structure

    USGS Publications Warehouse

    McCarley, T. Ryan; Kolden, Crystal A.; Vaillant, Nicole M.; Hudak, Andrew T.; Smith, Alistair M.S.; Wing, Brian M.; Kellogg, Bryce; Kreitler, Jason R.

    2017-01-01

    Measuring post-fire effects at landscape scales is critical to an ecological understanding of wildfire effects. Predominantly this is accomplished with either multi-spectral remote sensing data or through ground-based field sampling plots. While these methods are important, field data is usually limited to opportunistic post-fire observations, and spectral data often lacks validation with specific variables of change. Additional uncertainty remains regarding how best to account for environmental variables influencing fire effects (e.g., weather) for which observational data cannot easily be acquired, and whether pre-fire agents of change such as bark beetle and timber harvest impact model accuracy. This study quantifies wildfire effects by correlating changes in forest structure derived from multi-temporal Light Detection and Ranging (LiDAR) acquisitions to multi-temporal spectral changes captured by the Landsat Thematic Mapper and Operational Land Imager for the 2012 Pole Creek Fire in central Oregon. Spatial regression modeling was assessed as a methodology to account for spatial autocorrelation, and model consistency was quantified across areas impacted by pre-fire mountain pine beetle and timber harvest. The strongest relationship (pseudo-r2 = 0.86, p < 0.0001) was observed between the ratio of shortwave infrared and near infrared reflectance (d74) and LiDAR-derived estimate of canopy cover change. Relationships between percentage of LiDAR returns in forest strata and spectral indices generally increased in strength with strata height. Structural measurements made closer to the ground were not well correlated. The spatial regression approach improved all relationships, demonstrating its utility, but model performance declined across pre-fire agents of change, suggesting that such studies should stratify by pre-fire forest condition. This study establishes that spectral indices such as d74 and dNBR are most sensitive to wildfire-caused structural changes such as reduction in canopy cover and perform best when that structure has not been reduced pre-fire.

  20. Regression equations for calculation of z scores for echocardiographic measurements of left heart structures in healthy Han Chinese children.

    PubMed

    Wang, Shan-Shan; Hong, Wen-Jing; Zhang, Yu-Qi; Chen, Shu-Bao; Huang, Guo-Ying; Zhang, Hong-Yan; Chen, Li-Jun; Wu, Lan-Ping; Shen, Rong; Liu, Yi-Qing; Zhu, Jun-Xue

    2018-06-01

    Clinical decision making in children with heart disease relies on detailed measurements of cardiac structures using two-dimensional and M-mode echocardiography. However, no echocardiographic reference values are available for the Chinese children. We aimed to establish z-score regression equations for left heart structures in a population-based cohort of healthy Chinese Han children. Echocardiography was performed in 545 children with a normal heart. The dimensions of the aortic valve annulus (AVA), aortic sinuses of Valsalva (ASV), sinotubular junction (STJ), ascending aorta (AAO), left atrium (LA), mitral valve annulus (MVA), interventricular septal end-diastolic thickness (IVSd), interventricular septal end-systolic thickness (IVSs), left ventricular end-diastolic diameter (LVIDd), left ventricular end-systolic diameter (LVIDs), left ventricular posterior wall end-diastolic thickness (LVPWd), left ventricular posterior wall end-systolic thickness (LVPWs) were measured. Regression analyses were conducted to relate the measurements of left heart structures to body surface area (BSA). Left ventricular ejection fraction (LVEF) and left ventricular fractional shortening (LVFS) were calculated. Several models were used, and the adjusted R2 values were compared for each model. AVA, ASV, STJ, AAO, LA, MVA, IVSd, IVSs, LVIDd, LVIDs, LVPWd, and LVPWs had a cubic relationship with BSA. LVEF and LVFS fell within a narrow range. Our results provide reference values for z scores and regression equations for left heart structures in Han Chinese children. These data may help make a quick and accurate judgment of the routine clinical measurement of left heart structures in children with heart disease. © 2018 Wiley Periodicals, Inc.

  1. Lidar and Hyperspectral Remote Sensing for the Analysis of Coniferous Biomass Stocks and Fluxes

    NASA Astrophysics Data System (ADS)

    Halligan, K. Q.; Roberts, D. A.

    2006-12-01

    Airborne lidar and hyperspectral data can improve estimates of aboveground carbon stocks and fluxes through their complimentary responses to vegetation structure and biochemistry. While strong relationships have been demonstrated between lidar-estimated vegetation structural parameters and field data, research is needed to explore the portability of these methods across a range of topographic conditions, disturbance histories, vegetation type and climate. Additionally, research is needed to evaluate contributions of hyperspectral data in refining biomass estimates and determination of fluxes. To address these questions we are a conducting study of lidar and hyperspectral remote sensing data across sites including coniferous forests, broadleaf deciduous forests and a tropical rainforest. Here we focus on a single study site, Yellowstone National Park, where tree heights, stem locations, above ground biomass and basal area were mapped using first-return small-footprint lidar data. A new method using lidar intensity data was developed for separating the terrain and vegetation components in lidar data using a two-scale iterative local minima filter. Resulting Digital Terrain Models (DTM) and Digital Canopy Models (DCM) were then processed to retrieve a diversity of vertical and horizontal structure metrics. Univariate linear models were used to estimate individual tree heights while stepwise linear regression was used to estimate aboveground biomass and basal area. Three small-area field datasets were compared for their utility in model building and validation of vegetation structure parameters. All structural parameters were linearly correlated with lidar-derived metrics, with higher accuracies obtained where field and imagery data were precisely collocated . Initial analysis of hyperspectral data suggests that vegetation health metrics including measures of live and dead vegetation and stress indices may provide good indicators of carbon flux by mapping vegetation vigor or senescence. Additionally, the strength of hyperspectral data for vegetation classification suggests these data have additional utility for modeling carbon flux dynamics by allowing more accurate plant functional type mapping.

  2. Regression analysis of informative current status data with the additive hazards model.

    PubMed

    Zhao, Shishun; Hu, Tao; Ma, Ling; Wang, Peijie; Sun, Jianguo

    2015-04-01

    This paper discusses regression analysis of current status failure time data arising from the additive hazards model in the presence of informative censoring. Many methods have been developed for regression analysis of current status data under various regression models if the censoring is noninformative, and also there exists a large literature on parametric analysis of informative current status data in the context of tumorgenicity experiments. In this paper, a semiparametric maximum likelihood estimation procedure is presented and in the method, the copula model is employed to describe the relationship between the failure time of interest and the censoring time. Furthermore, I-splines are used to approximate the nonparametric functions involved and the asymptotic consistency and normality of the proposed estimators are established. A simulation study is conducted and indicates that the proposed approach works well for practical situations. An illustrative example is also provided.

  3. Nonparametric instrumental regression with non-convex constraints

    NASA Astrophysics Data System (ADS)

    Grasmair, M.; Scherzer, O.; Vanhems, A.

    2013-03-01

    This paper considers the nonparametric regression model with an additive error that is dependent on the explanatory variables. As is common in empirical studies in epidemiology and economics, it also supposes that valid instrumental variables are observed. A classical example in microeconomics considers the consumer demand function as a function of the price of goods and the income, both variables often considered as endogenous. In this framework, the economic theory also imposes shape restrictions on the demand function, such as integrability conditions. Motivated by this illustration in microeconomics, we study an estimator of a nonparametric constrained regression function using instrumental variables by means of Tikhonov regularization. We derive rates of convergence for the regularized model both in a deterministic and stochastic setting under the assumption that the true regression function satisfies a projected source condition including, because of the non-convexity of the imposed constraints, an additional smallness condition.

  4. Organic matter variations in transgressive and regressive shales

    USGS Publications Warehouse

    Pasley, M.A.; Gregory, W.A.; Hart, G.F.

    1991-01-01

    Organic matter in the Upper Cretaceous Mancos Shale adjacent to the Tocito Sandstone in the San Juan Basin of New Mexico was characterized using organic petrology and organic geochemistry. Differences in the organic matter found in these regressive and transgressive offshore marine sediments have been documented and assessed within a sequence stratigraphic framework. The regressive Lower Mancos Shale below the Tocito Sandstone contains abundant well preserved phytoclasts and correspondingly low hydrogen indices. Total organic carbon values for the regressive shale are low. Sediments from the transgressive systems tract (Tocito Sandstone and overlying Upper Mancos Shale) contain less terrestrially derived organic matter, more amorphous non-structured protistoclasts, higher hydrogen indices and more total organic carbon. Advanced stages of degradation are characteristic of the phytoclasts found in the transgressive shale. Amorphous material in the transgressive shale fluoresces strongly while that found in the regressive shale is typically non-fluorescent. Data from pyrolysis-gas chromatography confirm these observations. These differences are apparently related to the contrasting depositional styles that were active on the shelf during regression and subsequent transgression. It is suggested that data from organic petrology and organic geochemistry provide greater resolution in sedimentologic and stratigraphic interpretations, particularly when working with basinward, fine-grained sediments. Petroleum source potential for the regressive Lower Mancos Shale below the Tocito Sandstone is poor. Based on abundant fluorescent amorphous material, high hydrogen indices, and high total organic carbon, the transgressive Upper Mancos Shale above the Tocito Sandstone possesses excellent source potential. This suggests that appreciable source potential can be found in offshore, fine-grained sediments of the transgressive systems tract below the condensed section and associated downlap surface. Organic petrology can be used to accurately predict petroleum source potential. The addition of simple fluorescence microscopy greatly enhances this predictive ability because non-generative amorphous material is generally non-fluorescent. Organic petrology must also be used to properly evaluate the utility of Tmax from programmed pyrolysis as a thermal maturity indicator. Organic matter dominated by autochthonous amorphous protistoclasts exhibits lower Tmax values than that which is composed of mostly phytoclasts. ?? 1991.

  5. Changes in quantitative 3D shape features of the optic nerve head associated with age

    NASA Astrophysics Data System (ADS)

    Christopher, Mark; Tang, Li; Fingert, John H.; Scheetz, Todd E.; Abramoff, Michael D.

    2013-02-01

    Optic nerve head (ONH) structure is an important biological feature of the eye used by clinicians to diagnose and monitor progression of diseases such as glaucoma. ONH structure is commonly examined using stereo fundus imaging or optical coherence tomography. Stereo fundus imaging provides stereo views of the ONH that retain 3D information useful for characterizing structure. In order to quantify 3D ONH structure, we applied a stereo correspondence algorithm to a set of stereo fundus images. Using these quantitative 3D ONH structure measurements, eigen structures were derived using principal component analysis from stereo images of 565 subjects from the Ocular Hypertension Treatment Study (OHTS). To evaluate the usefulness of the eigen structures, we explored associations with the demographic variables age, gender, and race. Using regression analysis, the eigen structures were found to have significant (p < 0.05) associations with both age and race after Bonferroni correction. In addition, classifiers were constructed to predict the demographic variables based solely on the eigen structures. These classifiers achieved an area under receiver operating characteristic curve of 0.62 in predicting a binary age variable, 0.52 in predicting gender, and 0.67 in predicting race. The use of objective, quantitative features or eigen structures can reveal hidden relationships between ONH structure and demographics. The use of these features could similarly allow specific aspects of ONH structure to be isolated and associated with the diagnosis of glaucoma, disease progression and outcomes, and genetic factors.

  6. A Comparison between the Use of Beta Weights and Structure Coefficients in Interpreting Regression Results

    ERIC Educational Resources Information Center

    Tong, Fuhui

    2006-01-01

    Background: An extensive body of researches has favored the use of regression over other parametric analyses that are based on OVA. In case of noteworthy regression results, researchers tend to explore magnitude of beta weights for the respective predictors. Purpose: The purpose of this paper is to examine both beta weights and structure…

  7. Study of Rapid-Regression Liquefying Hybrid Rocket Fuels

    NASA Technical Reports Server (NTRS)

    Zilliac, Greg; DeZilwa, Shane; Karabeyoglu, M. Arif; Cantwell, Brian J.; Castellucci, Paul

    2004-01-01

    A report describes experiments directed toward the development of paraffin-based hybrid rocket fuels that burn at regression rates greater than those of conventional hybrid rocket fuels like hydroxyl-terminated butadiene. The basic approach followed in this development is to use materials such that a hydrodynamically unstable liquid layer forms on the melting surface of a burning fuel body. Entrainment of droplets from the liquid/gas interface can substantially increase the rate of fuel mass transfer, leading to surface regression faster than can be achieved using conventional fuels. The higher regression rate eliminates the need for the complex multi-port grain structures of conventional solid rocket fuels, making it possible to obtain acceptable performance from single-port structures. The high-regression-rate fuels contain no toxic or otherwise hazardous components and can be shipped commercially as non-hazardous commodities. Among the experiments performed on these fuels were scale-up tests using gaseous oxygen. The data from these tests were found to agree with data from small-scale, low-pressure and low-mass-flux laboratory tests and to confirm the expectation that these fuels would burn at high regression rates, chamber pressures, and mass fluxes representative of full-scale rocket motors.

  8. Alternatives for using multivariate regression to adjust prospective payment rates

    PubMed Central

    Sheingold, Steven H.

    1990-01-01

    Multivariate regression analysis has been used in structuring three of the adjustments to Medicare's prospective payment rates. Because the indirect-teaching adjustment, the disproportionate-share adjustment, and the adjustment for large cities are responsible for distributing approximately $3 billion in payments each year, the specification of regression models for these adjustments is of critical importance. In this article, the application of regression for adjusting Medicare's prospective rates is discussed, and the implications that differing specifications could have for these adjustments are demonstrated. PMID:10113271

  9. Rank-Optimized Logistic Matrix Regression toward Improved Matrix Data Classification.

    PubMed

    Zhang, Jianguang; Jiang, Jianmin

    2018-02-01

    While existing logistic regression suffers from overfitting and often fails in considering structural information, we propose a novel matrix-based logistic regression to overcome the weakness. In the proposed method, 2D matrices are directly used to learn two groups of parameter vectors along each dimension without vectorization, which allows the proposed method to fully exploit the underlying structural information embedded inside the 2D matrices. Further, we add a joint [Formula: see text]-norm on two parameter matrices, which are organized by aligning each group of parameter vectors in columns. This added co-regularization term has two roles-enhancing the effect of regularization and optimizing the rank during the learning process. With our proposed fast iterative solution, we carried out extensive experiments. The results show that in comparison to both the traditional tensor-based methods and the vector-based regression methods, our proposed solution achieves better performance for matrix data classifications.

  10. Structural Analysis of Women’s Heptathlon

    PubMed Central

    Gassmann, Freya; Fröhlich, Michael; Emrich, Eike

    2016-01-01

    The heptathlon comprises the results of seven single disciplines, assuming an equal influence from each discipline, depending on the measured performance. Data analysis was based on the data recorded for the individual performances of the 10 winning heptathletes in the World Athletics Championships from 1987 to 2013 and the Olympic Games from 1988 to 2012. In addition to descriptive analysis methods, correlations, bivariate and multivariate linear regressions, and panel data regressions were used. The transformation of the performances from seconds, centimeters, and meters into points showed that the individual disciplines do not equally affect the overall competition result. The currently valid conversion formula for the run, jump, and throw disciplines prefers the sprint and jump disciplines but penalizes the athletes performing in the 800 m run, javelin throw, and shotput disciplines. Furthermore, 21% to 48% of the variance of the sum of points can be attributed to the performances in the disciplines of long jump, 200 m sprint, 100 m hurdles, and high jump. To balance the effects of the single disciplines in the heptathlon, the formula to calculate points should be reevaluated. PMID:29910260

  11. Reinforcement of subarachnoid block by epidural volume effect in lower abdominal surgery: A comparison between fentanyl and tramadol for efficacy and block properties

    PubMed Central

    Mohan, Atiharsh; Singh, Preet Mohinder; Malviya, Deepak; Arya, Sunil Kumar; Singh, Dinesh Kumar

    2012-01-01

    Background: Epidural volume extension (EVE) is claimed to increase the block height and decrease the dose requirement for intrathecal drug. However, almost all studies have been done in obstetric population and none actually compares the effect of additional drugs added to epidural volume. Materials and Methods: Seventy-five (ASA I and II) patients scheduled for lower abdominal surgery were randomly divided into three groups. All groups received intrathecal 10 mg bupivacaine; two groups received additional 10 ml of normal saline epidurally with 25 mg tramadol or 25 mg of fentanyl. Groups were than compared for maximal block height, rate of sensory block regression to T10, and motor block regression to Bromage scale of 0. Time to first analgesia and adverse effects were also compared among the three groups. Materials and Methods: Seventy-five (ASA I and II) patients scheduled for lower abdominal surgery were randomly divided into three groups. All groups received intrathecal 10 mg bupivacaine; two groups received additional 10 ml of normal saline epidurally with 25 mg tramadol or 25 mg of fentanyl. Groups were than compared for maximal block height, rate of sensory block regression to T10, and motor block regression to Bromage scale of 0. Time to first analgesia and adverse effects were also compared among the three groups. Results: Groups with EVE had statistically significant higher block height, with a significant faster regression that the control group. However, both fentanyl and tramadol groups were inseparable in respect to motor or sensory block regression. Fentanyl group had maximal time to first analgesia, followed by tramadol and control groups. Hemodynamic alterations were also more common in EVE groups. Conclusion: EVE can increase the block height significantly, but it seems to be limited only to the physical property of additional volume in epidural space and fentanyl or tramadol do not seem to differ in their ability to alter block properties. PMID:25885615

  12. Effects of aluminum and iron nanoparticle additives on composite AP/HTPB solid propellant regression rate

    NASA Astrophysics Data System (ADS)

    Styborski, Jeremy A.

    This project was started in the interest of supplementing existing data on additives to composite solid propellants. The study on the addition of iron and aluminum nanoparticles to composite AP/HTPB propellants was conducted at the Combustion and Energy Systems Laboratory at RPI in the new strand-burner experiment setup. For this study, a large literature review was conducted on history of solid propellant combustion modeling and the empirical results of tests on binders, plasticizers, AP particle size, and additives. The study focused on the addition of nano-scale aluminum and iron in small concentrations to AP/HTPB solid propellants with an average AP particle size of 200 microns. Replacing 1% of the propellant's AP with 40-60 nm aluminum particles produced no change in combustive behavior. The addition of 1% 60-80 nm iron particles produced a significant increase in burn rate, although the increase was lesser at higher pressures. These results are summarized in Table 2. The increase in the burn rate at all pressures due to the addition of iron nanoparticles warranted further study on the effect of concentration of iron. Tests conducted at 10 atm showed that the mean regression rate varied with iron concentration, peaking at 1% and 3%. Regardless of the iron concentration, the regression rate was higher than the baseline AP/HTPB propellants. These results are summarized in Table 3.

  13. A Decomposition of Hospital Profitability: An Application of DuPont Analysis to the US Market.

    PubMed

    Turner, Jason; Broom, Kevin; Elliott, Michael; Lee, Jen-Fu

    2015-01-01

    This paper evaluates the drivers of profitability for a large sample of U.S. hospitals. Following a methodology frequently used by financial analysts, we use a DuPont analysis as a framework to evaluate the quality of earnings. By decomposing returns on equity (ROE) into profit margin, total asset turnover, and capital structure, the DuPont analysis reveals what drives overall profitability. Profit margin, the efficiency with which services are rendered (total asset turnover), and capital structure is calculated for 3,255 U.S. hospitals between 2007 and 2012 using data from the Centers for Medicare & Medicaid Services' Healthcare Cost Report Information System (CMS Form 2552). The sample is then stratified by ownership, size, system affiliation, teaching status, critical access designation, and urban or non-urban location. Those hospital characteristics and interaction terms are then regressed (OLS) against the ROE and the respective DuPont components. Sensitivity to regression methodology is also investigated using a seemingly unrelated regression. When the sample is stratified by hospital characteristics, the results indicate investor-owned hospitals have higher profit margins, higher efficiency, and are substantially more leveraged. Hospitals in systems are found to have higher ROE, margins, and efficiency but are associated with less leverage. In addition, a number of important and significant interactions between teaching status, ownership, location, critical access designation, and inclusion in a system are documented. Many of the significant relationships, most notably not-for-profit ownership, lose significance or are predominately associated with one interaction effect when interaction terms are introduced as explanatory variables. Results are not sensitive to the alternative methodology. The results of the DuPont analysis suggest that although there appears to be convergence in the behavior of NFP and IO hospitals, significant financial differences remain depending on their respective hospital characteristics. Those differences are tempered or exacerbated by location, size, teaching status, system affiliation, and critical access designation. With the exception of cost-based reimbursement for critical access hospitals, emerging payment systems are placing additional financial pressures on hospitals. The financial pressures being applied treat hospitals as a monolithic category and, given the delicate and often negative ROE for many hospitals, the long-term stability of the healthcare facility infrastructure may be negatively impacted.

  14. A Decomposition of Hospital Profitability

    PubMed Central

    Broom, Kevin; Elliott, Michael; Lee, Jen-Fu

    2015-01-01

    Objectives: This paper evaluates the drivers of profitability for a large sample of U.S. hospitals. Following a methodology frequently used by financial analysts, we use a DuPont analysis as a framework to evaluate the quality of earnings. By decomposing returns on equity (ROE) into profit margin, total asset turnover, and capital structure, the DuPont analysis reveals what drives overall profitability. Methods: Profit margin, the efficiency with which services are rendered (total asset turnover), and capital structure is calculated for 3,255 U.S. hospitals between 2007 and 2012 using data from the Centers for Medicare & Medicaid Services’ Healthcare Cost Report Information System (CMS Form 2552). The sample is then stratified by ownership, size, system affiliation, teaching status, critical access designation, and urban or non-urban location. Those hospital characteristics and interaction terms are then regressed (OLS) against the ROE and the respective DuPont components. Sensitivity to regression methodology is also investigated using a seemingly unrelated regression. Results: When the sample is stratified by hospital characteristics, the results indicate investor-owned hospitals have higher profit margins, higher efficiency, and are substantially more leveraged. Hospitals in systems are found to have higher ROE, margins, and efficiency but are associated with less leverage. In addition, a number of important and significant interactions between teaching status, ownership, location, critical access designation, and inclusion in a system are documented. Many of the significant relationships, most notably not-for-profit ownership, lose significance or are predominately associated with one interaction effect when interaction terms are introduced as explanatory variables. Results are not sensitive to the alternative methodology. Conclusion: The results of the DuPont analysis suggest that although there appears to be convergence in the behavior of NFP and IO hospitals, significant financial differences remain depending on their respective hospital characteristics. Those differences are tempered or exacerbated by location, size, teaching status, system affiliation, and critical access designation. With the exception of cost-based reimbursement for critical access hospitals, emerging payment systems are placing additional financial pressures on hospitals. The financial pressures being applied treat hospitals as a monolithic category and, given the delicate and often negative ROE for many hospitals, the long-term stability of the healthcare facility infrastructure may be negatively impacted. PMID:28462258

  15. 4D-Fingerprint Categorical QSAR Models for Skin Sensitization Based on Classification Local Lymph Node Assay Measures

    PubMed Central

    Li, Yi; Tseng, Yufeng J.; Pan, Dahua; Liu, Jianzhong; Kern, Petra S.; Gerberick, G. Frank; Hopfinger, Anton J.

    2008-01-01

    Currently, the only validated methods to identify skin sensitization effects are in vivo models, such as the Local Lymph Node Assay (LLNA) and guinea pig studies. There is a tremendous need, in particular due to novel legislation, to develop animal alternatives, eg. Quantitative Structure-Activity Relationship (QSAR) models. Here, QSAR models for skin sensitization using LLNA data have been constructed. The descriptors used to generate these models are derived from the 4D-molecular similarity paradigm and are referred to as universal 4D-fingerprints. A training set of 132 structurally diverse compounds and a test set of 15 structurally diverse compounds were used in this study. The statistical methodologies used to build the models are logistic regression (LR), and partial least square coupled logistic regression (PLS-LR), which prove to be effective tools for studying skin sensitization measures expressed in the two categorical terms of sensitizer and non-sensitizer. QSAR models with low values of the Hosmer-Lemeshow goodness-of-fit statistic, χHL2, are significant and predictive. For the training set, the cross-validated prediction accuracy of the logistic regression models ranges from 77.3% to 78.0%, while that of PLS-logistic regression models ranges from 87.1% to 89.4%. For the test set, the prediction accuracy of logistic regression models ranges from 80.0%-86.7%, while that of PLS-logistic regression models ranges from 73.3%-80.0%. The QSAR models are made up of 4D-fingerprints related to aromatic atoms, hydrogen bond acceptors and negatively partially charged atoms. PMID:17226934

  16. The effect of service satisfaction and spiritual well-being on the quality of life of patients with schizophrenia.

    PubMed

    Lanfredi, Mariangela; Candini, Valentina; Buizza, Chiara; Ferrari, Clarissa; Boero, Maria E; Giobbio, Gian M; Goldschmidt, Nicoletta; Greppo, Stefania; Iozzino, Laura; Maggi, Paolo; Melegari, Anna; Pasqualetti, Patrizio; Rossi, Giuseppe; de Girolamo, Giovanni

    2014-05-15

    Quality of life (QOL) has been considered an important outcome measure in psychiatric research and determinants of QOL have been widely investigated. We aimed at detecting predictors of QOL at baseline and at testing the longitudinal interrelations of the baseline predictors with QOL scores at a 1-year follow-up in a sample of patients living in Residential Facilities (RFs). Logistic regression models were adopted to evaluate the association between WHOQoL-Bref scores and potential determinants of QOL. In addition, all variables significantly associated with QOL domains in the final logistic regression model were included by using the Structural Equation Modeling (SEM). We included 139 patients with a diagnosis of schizophrenia spectrum. In the final logistic regression model level of activity, social support, age, service satisfaction, spiritual well-being and symptoms' severity were identified as predictors of QOL scores at baseline. Longitudinal analyses carried out by SEM showed that 40% of QOL follow-up variability was explained by QOL at baseline, and significant indirect effects toward QOL at follow-up were found for satisfaction with services and for social support. Rehabilitation plans for people with schizophrenia living in RFs should also consider mediators of change in subjective QOL such as satisfaction with mental health services. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.

  17. Logistic Regression with Multiple Random Effects: A Simulation Study of Estimation Methods and Statistical Packages.

    PubMed

    Kim, Yoonsang; Choi, Young-Ku; Emery, Sherry

    2013-08-01

    Several statistical packages are capable of estimating generalized linear mixed models and these packages provide one or more of three estimation methods: penalized quasi-likelihood, Laplace, and Gauss-Hermite. Many studies have investigated these methods' performance for the mixed-effects logistic regression model. However, the authors focused on models with one or two random effects and assumed a simple covariance structure between them, which may not be realistic. When there are multiple correlated random effects in a model, the computation becomes intensive, and often an algorithm fails to converge. Moreover, in our analysis of smoking status and exposure to anti-tobacco advertisements, we have observed that when a model included multiple random effects, parameter estimates varied considerably from one statistical package to another even when using the same estimation method. This article presents a comprehensive review of the advantages and disadvantages of each estimation method. In addition, we compare the performances of the three methods across statistical packages via simulation, which involves two- and three-level logistic regression models with at least three correlated random effects. We apply our findings to a real dataset. Our results suggest that two packages-SAS GLIMMIX Laplace and SuperMix Gaussian quadrature-perform well in terms of accuracy, precision, convergence rates, and computing speed. We also discuss the strengths and weaknesses of the two packages in regard to sample sizes.

  18. Logistic Regression with Multiple Random Effects: A Simulation Study of Estimation Methods and Statistical Packages

    PubMed Central

    Kim, Yoonsang; Emery, Sherry

    2013-01-01

    Several statistical packages are capable of estimating generalized linear mixed models and these packages provide one or more of three estimation methods: penalized quasi-likelihood, Laplace, and Gauss-Hermite. Many studies have investigated these methods’ performance for the mixed-effects logistic regression model. However, the authors focused on models with one or two random effects and assumed a simple covariance structure between them, which may not be realistic. When there are multiple correlated random effects in a model, the computation becomes intensive, and often an algorithm fails to converge. Moreover, in our analysis of smoking status and exposure to anti-tobacco advertisements, we have observed that when a model included multiple random effects, parameter estimates varied considerably from one statistical package to another even when using the same estimation method. This article presents a comprehensive review of the advantages and disadvantages of each estimation method. In addition, we compare the performances of the three methods across statistical packages via simulation, which involves two- and three-level logistic regression models with at least three correlated random effects. We apply our findings to a real dataset. Our results suggest that two packages—SAS GLIMMIX Laplace and SuperMix Gaussian quadrature—perform well in terms of accuracy, precision, convergence rates, and computing speed. We also discuss the strengths and weaknesses of the two packages in regard to sample sizes. PMID:24288415

  19. Structured sparse linear graph embedding.

    PubMed

    Wang, Haixian

    2012-03-01

    Subspace learning is a core issue in pattern recognition and machine learning. Linear graph embedding (LGE) is a general framework for subspace learning. In this paper, we propose a structured sparse extension to LGE (SSLGE) by introducing a structured sparsity-inducing norm into LGE. Specifically, SSLGE casts the projection bases learning into a regression-type optimization problem, and then the structured sparsity regularization is applied to the regression coefficients. The regularization selects a subset of features and meanwhile encodes high-order information reflecting a priori structure information of the data. The SSLGE technique provides a unified framework for discovering structured sparse subspace. Computationally, by using a variational equality and the Procrustes transformation, SSLGE is efficiently solved with closed-form updates. Experimental results on face image show the effectiveness of the proposed method. Copyright © 2011 Elsevier Ltd. All rights reserved.

  20. MODELING SNAKE MICROHABITAT FROM RADIOTELEMETRY STUDIES USING POLYTOMOUS LOGISTIC REGRESSION

    EPA Science Inventory

    Multivariate analysis of snake microhabitat has historically used techniques that were derived under assumptions of normality and common covariance structure (e.g., discriminant function analysis, MANOVA). In this study, polytomous logistic regression (PLR which does not require ...

  1. Traffic flow forecasting using approximate nearest neighbor nonparametric regression

    DOT National Transportation Integrated Search

    2000-12-01

    The purpose of this research is to enhance nonparametric regression (NPR) for use in real-time systems by first reducing execution time using advanced data structures and imprecise computations and then developing a methodology for applying NPR. Due ...

  2. Partial Least Squares with Structured Output for Modelling the Metabolomics Data Obtained from Complex Experimental Designs: A Study into the Y-Block Coding.

    PubMed

    Xu, Yun; Muhamadali, Howbeer; Sayqal, Ali; Dixon, Neil; Goodacre, Royston

    2016-10-28

    Partial least squares (PLS) is one of the most commonly used supervised modelling approaches for analysing multivariate metabolomics data. PLS is typically employed as either a regression model (PLS-R) or a classification model (PLS-DA). However, in metabolomics studies it is common to investigate multiple, potentially interacting, factors simultaneously following a specific experimental design. Such data often cannot be considered as a "pure" regression or a classification problem. Nevertheless, these data have often still been treated as a regression or classification problem and this could lead to ambiguous results. In this study, we investigated the feasibility of designing a hybrid target matrix Y that better reflects the experimental design than simple regression or binary class membership coding commonly used in PLS modelling. The new design of Y coding was based on the same principle used by structural modelling in machine learning techniques. Two real metabolomics datasets were used as examples to illustrate how the new Y coding can improve the interpretability of the PLS model compared to classic regression/classification coding.

  3. Hierarchical Matching and Regression with Application to Photometric Redshift Estimation

    NASA Astrophysics Data System (ADS)

    Murtagh, Fionn

    2017-06-01

    This work emphasizes that heterogeneity, diversity, discontinuity, and discreteness in data is to be exploited in classification and regression problems. A global a priori model may not be desirable. For data analytics in cosmology, this is motivated by the variety of cosmological objects such as elliptical, spiral, active, and merging galaxies at a wide range of redshifts. Our aim is matching and similarity-based analytics that takes account of discrete relationships in the data. The information structure of the data is represented by a hierarchy or tree where the branch structure, rather than just the proximity, is important. The representation is related to p-adic number theory. The clustering or binning of the data values, related to the precision of the measurements, has a central role in this methodology. If used for regression, our approach is a method of cluster-wise regression, generalizing nearest neighbour regression. Both to exemplify this analytics approach, and to demonstrate computational benefits, we address the well-known photometric redshift or `photo-z' problem, seeking to match Sloan Digital Sky Survey (SDSS) spectroscopic and photometric redshifts.

  4. Saturn's dynamic D ring

    USGS Publications Warehouse

    Hedman, M.M.; Burns, J.A.; Showalter, M.R.; Porco, C.C.; Nicholson, P.D.; Bosh, A.S.; Tiscareno, M.S.; Brown, R.H.; Buratti, B.J.; Baines, K.H.; Clark, R.

    2007-01-01

    The Cassini spacecraft has provided the first clear images of the D ring since the Voyager missions. These observations show that the structure of the D ring has undergone significant changes over the last 25 years. The brightest of the three ringlets seen in the Voyager images (named D72), has transformed from a narrow, <40-km wide ringlet to a much broader and more diffuse 250-km wide feature. In addition, its center of light has shifted inwards by over 200 km relative to other features in the D ring. Cassini also finds that the locations of other narrow features in the D ring and the structure of the diffuse material in the D ring differ from those measured by Voyager. Furthermore, Cassini has detected additional ringlets and structures in the D ring that were not observed by Voyager. These include a sheet of material just interior to the inner edge of the C ring that is only observable at phase angles below about 60??. New photometric and spectroscopic data from the ISS (Imaging Science Subsystem) and VIMS (Visual and Infrared Mapping Spectrometer) instruments onboard Cassini show the D ring contains a variety of different particle populations with typical particle sizes ranging from 1 to 100 microns. High-resolution images reveal fine-scale structures in the D ring that appear to be variable in time and/or longitude. Particularly interesting is a remarkably regular, periodic structure with a wavelength of ??? 30 ?? km extending between orbital radii of 73,200 and 74,000 km. A similar structure was previously observed in 1995 during the occultation of the star GSC5249-01240, at which time it had a wavelength of ??? 60 ?? km. We interpret this structure as a periodic vertical corrugation in the D ring produced by differential nodal regression of an initially inclined ring. We speculate that this structure may have formed in response to an impact with a comet or meteoroid in early 1984. ?? 2006 Elsevier Inc. All rights reserved.

  5. Flood-Frequency Estimates for Streams on Kaua`i, O`ahu, Moloka`i, Maui, and Hawai`i, State of Hawai`i

    USGS Publications Warehouse

    Oki, Delwyn S.; Rosa, Sarah N.; Yeung, Chiu W.

    2010-01-01

    This study provides an updated analysis of the magnitude and frequency of peak stream discharges in Hawai`i. Annual peak-discharge data collected by the U.S. Geological Survey during and before water year 2008 (ending September 30, 2008) at stream-gaging stations were analyzed. The existing generalized-skew value for the State of Hawai`i was retained, although three methods were used to evaluate whether an update was needed. Regional regression equations were developed for peak discharges with 2-, 5-, 10-, 25-, 50-, 100-, and 500-year recurrence intervals for unregulated streams (those for which peak discharges are not affected to a large extent by upstream reservoirs, dams, diversions, or other structures) in areas with less than 20 percent combined medium- and high-intensity development on Kaua`i, O`ahu, Moloka`i, Maui, and Hawai`i. The generalized-least-squares (GLS) regression equations relate peak stream discharge to quantified basin characteristics (for example, drainage-basin area and mean annual rainfall) that were determined using geographic information system (GIS) methods. Each of the islands of Kaua`i,O`ahu, Moloka`i, Maui, and Hawai`i was divided into two regions, generally corresponding to a wet region and a dry region. Unique peak-discharge regression equations were developed for each region. The regression equations developed for this study have standard errors of prediction ranging from 16 to 620 percent. Standard errors of prediction are greatest for regression equations developed for leeward Moloka`i and southern Hawai`i. In general, estimated 100-year peak discharges from this study are lower than those from previous studies, which may reflect the longer periods of record used in this study. Each regression equation is valid within the range of values of the explanatory variables used to develop the equation. The regression equations were developed using peak-discharge data from streams that are mainly unregulated, and they should not be used to estimate peak discharges in regulated streams. Use of a regression equation beyond its limits will produce peak-discharge estimates with unknown error and should therefore be avoided. Improved estimates of the magnitude and frequency of peak discharges in Hawai`i will require continued operation of existing stream-gaging stations and operation of additional gaging stations for areas such as Moloka`i and Hawai`i, where limited stream-gaging data are available.

  6. The ice age cycle and the deglaciations: an application of nonlinear regression modelling

    NASA Astrophysics Data System (ADS)

    Dalgleish, A. N.; Boulton, G. S.; Renshaw, E.

    2000-03-01

    We have applied the nonlinear regression technique known as additivity and variance stabilisation (AVAS) to time series which reflect Earth's climate over the last 600 ka. AVAS estimates a smooth, nonlinear transform for each variable, under the assumption of an additive model. The Earth's orbital parameters and insolation variations have been used as regression variables. Analysis of the contribution of each variable shows that the deglaciations are characterised by periods of increasing obliquity and perihelion approaching the vernal equinox, but not by any systematic change in eccentricity. The magnitude of insolation changes also plays no role. By approximating the transforms we can obtain a future prediction, with a glacial maximum at 60 ka AP, and a subsequent obliquity and precession forced deglaciation.

  7. The relationship between context, structure, and processes with outcomes of 6 regional diabetes networks in Europe.

    PubMed

    Mahdavi, Mahdi; Vissers, Jan; Elkhuizen, Sylvia; van Dijk, Mattees; Vanhala, Antero; Karampli, Eleftheria; Faubel, Raquel; Forte, Paul; Coroian, Elena; van de Klundert, Joris

    2018-01-01

    While health service provisioning for the chronic condition Type 2 Diabetes (T2D) often involves a network of organisations and professionals, most evidence on the relationships between the structures and processes of service provisioning and the outcomes considers single organisations or solo practitioners. Extending Donabedian's Structure-Process-Outcome (SPO) model, we investigate how differences in quality of life, effective coverage of diabetes, and service satisfaction are associated with differences in the structures, processes, and context of T2D services in six regions in Finland, Germany, Greece, Netherlands, Spain, and UK. Data collection consisted of: a) systematic modelling of provider network's structures and processes, and b) a cross-sectional survey of patient reported outcomes and other information. The survey resulted in data from 1459 T2D patients, during 2011-2012. Stepwise linear regression models were used to identify how independent cumulative proportion of variance in quality of life and service satisfaction are related to differences in context, structure and process. The selected context, structure and process variables are based on Donabedian's SPO model, a service quality research instrument (SERVQUAL), and previous organization and professional level evidence. Additional analysis deepens the possible bidirectional relation between outcomes and processes. The regression models explain 44% of variance in service satisfaction, mostly by structure and process variables (such as human resource use and the SERVQUAL dimensions). The models explained 23% of variance in quality of life between the networks, much of which is related to contextual variables. Our results suggest that effectiveness of A1c control is negatively correlated with process variables such as total hours of care provided per year and cost of services per year. While the selected structure and process variables explain much of the variance in service satisfaction, this is less the case for quality of life. Moreover, it appears that the effect of the clinical outcome A1c control on processes is stronger than the other way around, as poorer control seems to relate to more service use, and higher cost. The standardized operational models used in this research prove to form a basis for expanding the network level evidence base for effective T2D service provisioning.

  8. The relationship between context, structure, and processes with outcomes of 6 regional diabetes networks in Europe

    PubMed Central

    Elkhuizen, Sylvia; van Dijk, Mattees; Vanhala, Antero; Karampli, Eleftheria; Faubel, Raquel; Forte, Paul; Coroian, Elena

    2018-01-01

    Background While health service provisioning for the chronic condition Type 2 Diabetes (T2D) often involves a network of organisations and professionals, most evidence on the relationships between the structures and processes of service provisioning and the outcomes considers single organisations or solo practitioners. Extending Donabedian’s Structure-Process-Outcome (SPO) model, we investigate how differences in quality of life, effective coverage of diabetes, and service satisfaction are associated with differences in the structures, processes, and context of T2D services in six regions in Finland, Germany, Greece, Netherlands, Spain, and UK. Methods Data collection consisted of: a) systematic modelling of provider network’s structures and processes, and b) a cross-sectional survey of patient reported outcomes and other information. The survey resulted in data from 1459 T2D patients, during 2011–2012. Stepwise linear regression models were used to identify how independent cumulative proportion of variance in quality of life and service satisfaction are related to differences in context, structure and process. The selected context, structure and process variables are based on Donabedian’s SPO model, a service quality research instrument (SERVQUAL), and previous organization and professional level evidence. Additional analysis deepens the possible bidirectional relation between outcomes and processes. Results The regression models explain 44% of variance in service satisfaction, mostly by structure and process variables (such as human resource use and the SERVQUAL dimensions). The models explained 23% of variance in quality of life between the networks, much of which is related to contextual variables. Our results suggest that effectiveness of A1c control is negatively correlated with process variables such as total hours of care provided per year and cost of services per year. Conclusions While the selected structure and process variables explain much of the variance in service satisfaction, this is less the case for quality of life. Moreover, it appears that the effect of the clinical outcome A1c control on processes is stronger than the other way around, as poorer control seems to relate to more service use, and higher cost. The standardized operational models used in this research prove to form a basis for expanding the network level evidence base for effective T2D service provisioning. PMID:29447220

  9. Structured Trauma-Focused CBT and Unstructured Play/Experiential Techniques in the Treatment of Sexually Abused Children: A Field Study With Practicing Clinicians.

    PubMed

    Allen, Brian; Hoskowitz, Natalie Armstrong

    2017-05-01

    Structured, trauma-focused cognitive-behavioral therapy (CBT) techniques are widely considered an effective intervention for children who experienced sexual abuse. However, unstructured (i.e., nondirective) play/experiential techniques have a longer history of widespread promotion and are preferred by many practicing clinicians. No evidence is available, however, to determine how the integration of these techniques impacts treatment outcome. In this study, community-based clinicians who received training in a structured, trauma-focused cognitive-behavioral intervention administered pretreatment and posttreatment evaluations to 260 sexually abused children presenting with elevated posttraumatic stress. In addition, they completed a questionnaire describing the treatment techniques implemented with each child. Overall, significant improvement was observed for each of the six clinical outcomes. Regression analyses indicated that technique selection was a significant factor in posttreatment outcome for posttraumatic stress, dissociation, anxiety, and anger/aggression. In general, a greater utilization of the structured CBT techniques was related to lower posttreatment scores, whereas a higher frequency of play/experiential techniques was associated with higher posttreatment scores. However, no interaction effects were observed. The implication of these findings for clinical practice and future research are examined.

  10. Supporting employees' work-family needs improves health care quality: Longitudinal evidence from long-term care.

    PubMed

    Okechukwu, Cassandra A; Kelly, Erin L; Bacic, Janine; DePasquale, Nicole; Hurtado, David; Kossek, Ellen; Sembajwe, Grace

    2016-05-01

    We analyzed qualitative and quantitative data from U.S.-based employees in 30 long-term care facilities. Analysis of semi-structured interviews from 154 managers informed quantitative analyses. Quantitative data include 1214 employees' scoring of their supervisors and their organizations on family supportiveness (individual scores and aggregated to facility level), and three outcomes: (1), care quality indicators assessed at facility level (n = 30) and collected monthly for six months after employees' data collection; (2), employees' dichotomous survey response on having additional off-site jobs; and (3), proportion of employees with additional jobs at each facility. Thematic analyses revealed that managers operate within the constraints of an industry that simultaneously: (a) employs low-wage employees with multiple work-family challenges, and (b) has firmly institutionalized goals of prioritizing quality of care and minimizing labor costs. Managers universally described providing work-family support and prioritizing care quality as antithetical to each other. Concerns surfaced that family-supportiveness encouraged employees to work additional jobs off-site, compromising care quality. Multivariable linear regression analysis of facility-level data revealed that higher family-supportive supervision was associated with significant decreases in residents' incidence of all pressure ulcers (-2.62%) and other injuries (-9.79%). Higher family-supportive organizational climate was associated with significant decreases in all falls (-17.94%) and falls with injuries (-7.57%). Managers' concerns about additional jobs were not entirely unwarranted: multivariable logistic regression of employee-level data revealed that among employees with children, having family-supportive supervision was associated with significantly higher likelihood of additional off-site jobs (RR 1.46, 95%CI 1.08-1.99), but family-supportive organizational climate was associated with lower likelihood (RR 0.76, 95%CI 0.59-0.99). However, proportion of workers with additional off-site jobs did not significantly predict care quality at facility levels. Although managers perceived providing work-family support and ensuring high care quality as conflicting goals, results suggest that family-supportiveness is associated with better care quality. Copyright © 2016 Elsevier Ltd. All rights reserved.

  11. Supporting employees’ work-family needs improves health care quality: longitudinal evidence from long-term care

    PubMed Central

    Okechukwu, Cassandra A.; Kelly, Erin L.; Bacic, Janine; DePasquale, Nicole; Hurtado, David; Kossek, Ellen; Sembajwe, Grace

    2016-01-01

    We analyzed qualitative and quantitative data from U.S.-based employees in 30 long-term care facilities. Analysis of semi-structured interviews from 154 managers informed quantitative analyses. Quantitative data include 1,214 employees’ scoring of their supervisors and their organizations on family supportiveness (individual scores and aggregated to facility level), and three outcomes: (1), care quality indicators assessed at facility level (n=30) and collected monthly for six months after employees’ data collection; (2), employees’ dichotomous survey response on having additional off-site jobs; and (3), proportion of employees with additional jobs at each facility. Thematic analyses revealed that managers operate within the constraints of an industry that simultaneously: (a) employs low-wage employees with multiple work-family challenges, and (b) has firmly institutionalized goals of prioritizing quality of care and minimizing labor costs. Managers universally described providing work-family support and prioritizing care quality as antithetical to each other. Concerns surfaced that family-supportiveness encouraged employees to work additional jobs off-site, compromising care quality. Multivariable linear regression analysis of facility-level data revealed that higher family-supportive supervision was associated with significant decreases in residents’ incidence of all pressure ulcers (−2.62%) and other injuries (−9.79%). Higher family-supportive organizational climate was associated with significant decreases in all falls (−17.94%) and falls with injuries (−7.57%). Managers’ concerns about additional jobs were not entirely unwarranted: multivariable logistic regression of employee-level data revealed that among employees with children, having family-supportive supervision was associated with significantly higher likelihood of additional off-site jobs (RR 1.46, 95%CI 1.08-1.99), but family-supportive organizational climate was associated with lower likelihood (RR 0.76, 95%CI 0.59-0.99). However, proportion of workers with additional off-site jobs did not significantly predict care quality at facility levels. Although managers perceived providing work-family support and ensuring high care quality as conflicting goals, results suggest that family-supportiveness is associated with better care quality. PMID:27082022

  12. Factors related to in-house agricultural animal caseloads in US veterinary teaching hospitals.

    PubMed

    Tyler, Jeff W; Miller, Robert B; Constable, Peter D; Hostetler, Douglas E; Lakritz, Jeff; Hardin, David K; Angel, Kenneth L; Wolfe, Dwight F

    2002-01-01

    A retrospective observational study was conducted to determine whether agricultural animal caseloads at veterinary teaching hospitals declined between 1995 and 1998. Thereafter, the effect of organizational and demographic factors on 1998 in-house agricultural animal caseloads was examined. Caseload data were obtained from the American Association of Veterinary Medical Colleges. Demographic and organizational data were obtained by surveys, telephone interviews, and web-based resources. Complete data were available from 25 veterinary colleges, and data from these schools were used in subsequent analyses. In 1998, in-house food animal caseload decreased relative to 1995 in 17 schools and increased relative to 1995 in 8 schools. This trend was not significant (P = .053); however, the power of the test was limited (.50). Mean 1998 caseload was 886 +/- 504. Among schools with a discipline-based organizational structure, annual mean caseload was 464 +/- 220. Among schools with a species-based organizational structure, mean caseload was 1,167 +/- 463. The regression model that best predicted caseload was a forward-stepping model that included only organizational structure as an independent variable. No additional independent variable was significantly associated with caseload.

  13. A bias correction for covariance estimators to improve inference with generalized estimating equations that use an unstructured correlation matrix.

    PubMed

    Westgate, Philip M

    2013-07-20

    Generalized estimating equations (GEEs) are routinely used for the marginal analysis of correlated data. The efficiency of GEE depends on how closely the working covariance structure resembles the true structure, and therefore accurate modeling of the working correlation of the data is important. A popular approach is the use of an unstructured working correlation matrix, as it is not as restrictive as simpler structures such as exchangeable and AR-1 and thus can theoretically improve efficiency. However, because of the potential for having to estimate a large number of correlation parameters, variances of regression parameter estimates can be larger than theoretically expected when utilizing the unstructured working correlation matrix. Therefore, standard error estimates can be negatively biased. To account for this additional finite-sample variability, we derive a bias correction that can be applied to typical estimators of the covariance matrix of parameter estimates. Via simulation and in application to a longitudinal study, we show that our proposed correction improves standard error estimation and statistical inference. Copyright © 2012 John Wiley & Sons, Ltd.

  14. Contribution of the Cholinergic System to Verbal Memory Performance in Mild Cognitive Impairment.

    PubMed

    Peter, Jessica; Lahr, Jacob; Minkova, Lora; Lauer, Eliza; Grothe, Michel J; Teipel, Stefan; Köstering, Lena; Kaller, Christoph P; Heimbach, Bernhard; Hüll, Michael; Normann, Claus; Nissen, Christoph; Reis, Janine; Klöppel, Stefan

    2016-06-18

    Acetylcholine is critically involved in modulating learning and memory function, which both decline in neurodegeneration. It remains unclear to what extent structural and functional changes in the cholinergic system contribute to episodic memory dysfunction in mild cognitive impairment (MCI), in addition to hippocampal degeneration. A better understanding is critical, given that the cholinergic system is the main target of current symptomatic treatment in mild to moderate Alzheimer's disease. We simultaneously assessed the structural and functional integrity of the cholinergic system in 20 patients with MCI and 20 matched healthy controls and examined their effect on verbal episodic memory via multivariate regression analyses. Mediating effects of either cholinergic function or hippocampal volume on the relationship between cholinergic structure and episodic memory were computed. In MCI, a less intact structure and function of the cholinergic system was found. A smaller cholinergic structure was significantly correlated with a functionally more active cholinergic system in patients, but not in controls. This association was not modulated by age or disease severity, arguing against compensational processes. Further analyses indicated that neither functional nor structural changes in the cholinergic system influence verbal episodic memory at the MCI stage. In fact, those associations were fully mediated by hippocampal volume. Although the cholinergic system is structurally and functionally altered in MCI, episodic memory dysfunction results primarily from hippocampal neurodegeneration, which may explain the inefficiency of cholinergic treatment at this disease stage.

  15. Biomacromolecular quantitative structure-activity relationship (BioQSAR): a proof-of-concept study on the modeling, prediction and interpretation of protein-protein binding affinity.

    PubMed

    Zhou, Peng; Wang, Congcong; Tian, Feifei; Ren, Yanrong; Yang, Chao; Huang, Jian

    2013-01-01

    Quantitative structure-activity relationship (QSAR), a regression modeling methodology that establishes statistical correlation between structure feature and apparent behavior for a series of congeneric molecules quantitatively, has been widely used to evaluate the activity, toxicity and property of various small-molecule compounds such as drugs, toxicants and surfactants. However, it is surprising to see that such useful technique has only very limited applications to biomacromolecules, albeit the solved 3D atom-resolution structures of proteins, nucleic acids and their complexes have accumulated rapidly in past decades. Here, we present a proof-of-concept paradigm for the modeling, prediction and interpretation of the binding affinity of 144 sequence-nonredundant, structure-available and affinity-known protein complexes (Kastritis et al. Protein Sci 20:482-491, 2011) using a biomacromolecular QSAR (BioQSAR) scheme. We demonstrate that the modeling performance and predictive power of BioQSAR are comparable to or even better than that of traditional knowledge-based strategies, mechanism-type methods and empirical scoring algorithms, while BioQSAR possesses certain additional features compared to the traditional methods, such as adaptability, interpretability, deep-validation and high-efficiency. The BioQSAR scheme could be readily modified to infer the biological behavior and functions of other biomacromolecules, if their X-ray crystal structures, NMR conformation assemblies or computationally modeled structures are available.

  16. Genetic evaluation and selection response for growth in meat-type quail through random regression models using B-spline functions and Legendre polynomials.

    PubMed

    Mota, L F M; Martins, P G M A; Littiere, T O; Abreu, L R A; Silva, M A; Bonafé, C M

    2018-04-01

    The objective was to estimate (co)variance functions using random regression models (RRM) with Legendre polynomials, B-spline function and multi-trait models aimed at evaluating genetic parameters of growth traits in meat-type quail. A database containing the complete pedigree information of 7000 meat-type quail was utilized. The models included the fixed effects of contemporary group and generation. Direct additive genetic and permanent environmental effects, considered as random, were modeled using B-spline functions considering quadratic and cubic polynomials for each individual segment, and Legendre polynomials for age. Residual variances were grouped in four age classes. Direct additive genetic and permanent environmental effects were modeled using 2 to 4 segments and were modeled by Legendre polynomial with orders of fit ranging from 2 to 4. The model with quadratic B-spline adjustment, using four segments for direct additive genetic and permanent environmental effects, was the most appropriate and parsimonious to describe the covariance structure of the data. The RRM using Legendre polynomials presented an underestimation of the residual variance. Lesser heritability estimates were observed for multi-trait models in comparison with RRM for the evaluated ages. In general, the genetic correlations between measures of BW from hatching to 35 days of age decreased as the range between the evaluated ages increased. Genetic trend for BW was positive and significant along the selection generations. The genetic response to selection for BW in the evaluated ages presented greater values for RRM compared with multi-trait models. In summary, RRM using B-spline functions with four residual variance classes and segments were the best fit for genetic evaluation of growth traits in meat-type quail. In conclusion, RRM should be considered in genetic evaluation of breeding programs.

  17. Key factors associated with oral health-related quality of life (OHRQOL) in Hong Kong Chinese adults with orofacial pain.

    PubMed

    Zheng, Jun; Wong, May C M; Lam, Cindy L K

    2011-08-01

    To investigate key factors associated with oral health-related quality of life (OHRQOL) of Hong Kong Chinese adults with orofacial pain (OFP) symptoms. A cross-sectional study was conducted amongst a random sample of registered patients at a primary medical care teaching clinic in Hong Kong. Patients who were aged 35-70 years and had experienced OFP symptoms in the past 1 month were included. The OHRQOL was assessed by the Chinese version of the Oral Health Impact Profile (OHIP-14). A structured questionnaire on OFP symptoms and characteristics in the past 1 month, the depression and non-specific physical symptoms (NPS) scale in the research diagnostic criteria for temporomandibular disorders (RDC/TMD) questionnaire, and questions about professional treatment and dental attendance were administered before a standard clinical assessment. Negative binomial regression with forward stepwise selection was used to investigate key factors associated with the OHIP-14 additive score. The mean OHIP-14 additive score of the 200 participants was 10.1 (SD 9.4). Regression analysis revealed that five independent factors were significantly associated with higher OHIP-14 additive scores (indicating a poorer OHRQOL): a higher pain scale rating in the past 1 month (p=0.001), OFP clinical classification as musculoligamentous/soft tissue (MST) or dentoalveolar (DA) instead of neurological/vascular (NV) (p<0.001), more frequent dental attendance (p=0.008), moderate/severe RDC/TMD depression (p=0.005) and moderate/severe RDC/TMD NPS with pain (p=0.003). Various factors were associated with OHRQOL and could have implications for the improvement of OHRQOL in people in the community who have OFP symptoms. Copyright © 2011 Elsevier Ltd. All rights reserved.

  18. A pseudo-penalized quasi-likelihood approach to the spatial misalignment problem with non-normal data.

    PubMed

    Lopiano, Kenneth K; Young, Linda J; Gotway, Carol A

    2014-09-01

    Spatially referenced datasets arising from multiple sources are routinely combined to assess relationships among various outcomes and covariates. The geographical units associated with the data, such as the geographical coordinates or areal-level administrative units, are often spatially misaligned, that is, observed at different locations or aggregated over different geographical units. As a result, the covariate is often predicted at the locations where the response is observed. The method used to align disparate datasets must be accounted for when subsequently modeling the aligned data. Here we consider the case where kriging is used to align datasets in point-to-point and point-to-areal misalignment problems when the response variable is non-normally distributed. If the relationship is modeled using generalized linear models, the additional uncertainty induced from using the kriging mean as a covariate introduces a Berkson error structure. In this article, we develop a pseudo-penalized quasi-likelihood algorithm to account for the additional uncertainty when estimating regression parameters and associated measures of uncertainty. The method is applied to a point-to-point example assessing the relationship between low-birth weights and PM2.5 levels after the onset of the largest wildfire in Florida history, the Bugaboo scrub fire. A point-to-areal misalignment problem is presented where the relationship between asthma events in Florida's counties and PM2.5 levels after the onset of the fire is assessed. Finally, the method is evaluated using a simulation study. Our results indicate the method performs well in terms of coverage for 95% confidence intervals and naive methods that ignore the additional uncertainty tend to underestimate the variability associated with parameter estimates. The underestimation is most profound in Poisson regression models. © 2014, The International Biometric Society.

  19. Anterior cingulate cortex surface area relates to behavioral inhibition in adolescents with and without heavy prenatal alcohol exposure

    PubMed Central

    Migliorini, Robyn; Moore, Eileen M.; Glass, Leila; Infante, M. Alejandra; Tapert, Susan F.; Jones, Kenneth Lyons; Mattson, Sarah N.; Riley, Edward P.

    2015-01-01

    Prenatal alcohol exposure is associated with behavioral disinhibition, yet the brain structure correlates of this deficit have not been determined with sufficient detail. We examined the hypothesis that the structure of the anterior cingulate cortex (ACC) relates to inhibition performance in youth with histories of heavy prenatal alcohol exposure (AE, n = 32) and non-exposed controls (CON, n = 21). Adolescents (12–17 years) underwent structural magnetic resonance imaging yielding measures of gray matter volume, surface area, and thickness across four ACC subregions. A subset of subjects were administered the NEPSY-II Inhibition subtest. MANCOVA was utilized to test for group differences in ACC and inhibition performance and multiple linear regression was used to probe ACC-inhibition relationships. ACC surface area was significantly smaller in AE, though this effect was primarily driven by reduced right caudal ACC (rcACC). AE also performed significantly worse on inhibition speed but not on inhibition accuracy. Regression analyses with the rcACC revealed a significant group × ACC interaction. A smaller rcACC surface area was associated with slower inhibition completion time for AE but was not significantly associated with inhibition in CON. After accounting for processing speed, smaller rcACC surface area was associated with worse (i.e., slower) inhibition regardless of group. Examining processing speed independently, a decrease in rcACC surface area was associated with faster processing speed for CON but not significantly associated with processing speed in AE. Results support the theory that caudal ACC may monitor reaction time in addition to inhibition and highlight the possibility of delayed ACC neurodevelopment in prenatal alcohol exposure. PMID:26025509

  20. Anterior cingulate cortex surface area relates to behavioral inhibition in adolescents with and without heavy prenatal alcohol exposure.

    PubMed

    Migliorini, Robyn; Moore, Eileen M; Glass, Leila; Infante, M Alejandra; Tapert, Susan F; Jones, Kenneth Lyons; Mattson, Sarah N; Riley, Edward P

    2015-10-01

    Prenatal alcohol exposure is associated with behavioral disinhibition, yet the brain structure correlates of this deficit have not been determined with sufficient detail. We examined the hypothesis that the structure of the anterior cingulate cortex (ACC) relates to inhibition performance in youth with histories of heavy prenatal alcohol exposure (AE, n = 32) and non-exposed controls (CON, n = 21). Adolescents (12-17 years) underwent structural magnetic resonance imaging yielding measures of gray matter volume, surface area, and thickness across four ACC subregions. A subset of subjects were administered the NEPSY-II Inhibition subtest. MANCOVA was utilized to test for group differences in ACC and inhibition performance and multiple linear regression was used to probe ACC-inhibition relationships. ACC surface area was significantly smaller in AE, though this effect was primarily driven by reduced right caudal ACC (rcACC). AE also performed significantly worse on inhibition speed but not on inhibition accuracy. Regression analyses with the rcACC revealed a significant group × ACC interaction. A smaller rcACC surface area was associated with slower inhibition completion time for AE but was not significantly associated with inhibition in CON. After accounting for processing speed, smaller rcACC surface area was associated with worse (i.e., slower) inhibition regardless of group. Examining processing speed independently, a decrease in rcACC surface area was associated with faster processing speed for CON but not significantly associated with processing speed in AE. Results support the theory that caudal ACC may monitor reaction time in addition to inhibition and highlight the possibility of delayed ACC neurodevelopment in prenatal alcohol exposure. Copyright © 2015 Elsevier B.V. All rights reserved.

  1. Rey's Auditory Verbal Learning Test scores can be predicted from whole brain MRI in Alzheimer's disease.

    PubMed

    Moradi, Elaheh; Hallikainen, Ilona; Hänninen, Tuomo; Tohka, Jussi

    2017-01-01

    Rey's Auditory Verbal Learning Test (RAVLT) is a powerful neuropsychological tool for testing episodic memory, which is widely used for the cognitive assessment in dementia and pre-dementia conditions. Several studies have shown that an impairment in RAVLT scores reflect well the underlying pathology caused by Alzheimer's disease (AD), thus making RAVLT an effective early marker to detect AD in persons with memory complaints. We investigated the association between RAVLT scores (RAVLT Immediate and RAVLT Percent Forgetting) and the structural brain atrophy caused by AD. The aim was to comprehensively study to what extent the RAVLT scores are predictable based on structural magnetic resonance imaging (MRI) data using machine learning approaches as well as to find the most important brain regions for the estimation of RAVLT scores. For this, we built a predictive model to estimate RAVLT scores from gray matter density via elastic net penalized linear regression model. The proposed approach provided highly significant cross-validated correlation between the estimated and observed RAVLT Immediate (R = 0.50) and RAVLT Percent Forgetting (R = 0.43) in a dataset consisting of 806 AD, mild cognitive impairment (MCI) or healthy subjects. In addition, the selected machine learning method provided more accurate estimates of RAVLT scores than the relevance vector regression used earlier for the estimation of RAVLT based on MRI data. The top predictors were medial temporal lobe structures and amygdala for the estimation of RAVLT Immediate and angular gyrus, hippocampus and amygdala for the estimation of RAVLT Percent Forgetting. Further, the conversion of MCI subjects to AD in 3-years could be predicted based on either observed or estimated RAVLT scores with an accuracy comparable to MRI-based biomarkers.

  2. Neural correlates of gait variability in people with multiple sclerosis with fall history.

    PubMed

    Kalron, Alon; Allali, Gilles; Achiron, Anat

    2018-05-28

    Investigate the association between step time variability and related brain structures in accordance with fall status in people with multiple sclerosis (PwMS). The study included 225 PwMS. A whole-brain MRI was performed by a high-resolution 3.0-Telsa MR scanner in addition to volumetric analysis based on 3D T1-weighted images using the FreeSurfer image analysis suite. Step time variability was measured by an electronic walkway. Participants were defined as "fallers" (at least two falls during the previous year) and "non-fallers". One hundred and five PwMS were defined as fallers and had a greater step time variability compared to non-fallers (5.6% (S.D.=3.4) vs. 3.4% (S.D.=1.5); p=0.001). MS fallers exhibited a reduced volume in the left caudate and both cerebellum hemispheres compared to non-fallers. By using a linear regression analysis no association was found between gait variability and related brain structures in the total cohort and non-fallers group. However, the analysis found an association between the left hippocampus and left putamen volumes with step time variability in the faller group; p=0.031, 0.048, respectively, controlling for total cranial volume, walking speed, disability, age and gender. Nevertheless, according to the hierarchical regression model, the contribution of these brain measures to predict gait variability was relatively small compared to walking speed. An association between low left hippocampal, putamen volumes and step time variability was found in PwMS with a history of falls, suggesting brain structural characteristics may be related to falls and increased gait variability in PwMS. This article is protected by copyright. All rights reserved. This article is protected by copyright. All rights reserved.

  3. Differentiating regressed melanoma from regressed lichenoid keratosis.

    PubMed

    Chan, Aegean H; Shulman, Kenneth J; Lee, Bonnie A

    2017-04-01

    Distinguishing regressed lichen planus-like keratosis (LPLK) from regressed melanoma can be difficult on histopathologic examination, potentially resulting in mismanagement of patients. We aimed to identify histopathologic features by which regressed melanoma can be differentiated from regressed LPLK. Twenty actively inflamed LPLK, 12 LPLK with regression and 15 melanomas with regression were compared and evaluated by hematoxylin and eosin staining as well as Melan-A, microphthalmia transcription factor (MiTF) and cytokeratin (AE1/AE3) immunostaining. (1) A total of 40% of regressed melanomas showed complete or near complete loss of melanocytes within the epidermis with Melan-A and MiTF immunostaining, while 8% of regressed LPLK exhibited this finding. (2) Necrotic keratinocytes were seen in the epidermis in 33% regressed melanomas as opposed to all of the regressed LPLK. (3) A dense infiltrate of melanophages in the papillary dermis was seen in 40% of regressed melanomas, a feature not seen in regressed LPLK. In summary, our findings suggest that a complete or near complete loss of melanocytes within the epidermis strongly favors a regressed melanoma over a regressed LPLK. In addition, necrotic epidermal keratinocytes and the presence of a dense band-like distribution of dermal melanophages can be helpful in differentiating these lesions. © 2016 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  4. The Breslow estimator of the nonparametric baseline survivor function in Cox's regression model: some heuristics.

    PubMed

    Hanley, James A

    2008-01-01

    Most survival analysis textbooks explain how the hazard ratio parameters in Cox's life table regression model are estimated. Fewer explain how the components of the nonparametric baseline survivor function are derived. Those that do often relegate the explanation to an "advanced" section and merely present the components as algebraic or iterative solutions to estimating equations. None comment on the structure of these estimators. This note brings out a heuristic representation that may help to de-mystify the structure.

  5. Aeroelastic Model Structure Computation for Envelope Expansion

    NASA Technical Reports Server (NTRS)

    Kukreja, Sunil L.

    2007-01-01

    Structure detection is a procedure for selecting a subset of candidate terms, from a full model description, that best describes the observed output. This is a necessary procedure to compute an efficient system description which may afford greater insight into the functionality of the system or a simpler controller design. Structure computation as a tool for black-box modelling may be of critical importance in the development of robust, parsimonious models for the flight-test community. Moreover, this approach may lead to efficient strategies for rapid envelope expansion which may save significant development time and costs. In this study, a least absolute shrinkage and selection operator (LASSO) technique is investigated for computing efficient model descriptions of nonlinear aeroelastic systems. The LASSO minimises the residual sum of squares by the addition of an l(sub 1) penalty term on the parameter vector of the traditional 2 minimisation problem. Its use for structure detection is a natural extension of this constrained minimisation approach to pseudolinear regression problems which produces some model parameters that are exactly zero and, therefore, yields a parsimonious system description. Applicability of this technique for model structure computation for the F/A-18 Active Aeroelastic Wing using flight test data is shown for several flight conditions (Mach numbers) by identifying a parsimonious system description with a high percent fit for cross-validated data.

  6. Cortico-Cerebellar Structural Connectivity Is Related to Residual Motor Output in Chronic Stroke.

    PubMed

    Schulz, Robert; Frey, Benedikt M; Koch, Philipp; Zimerman, Maximo; Bönstrup, Marlene; Feldheim, Jan; Timmermann, Jan E; Schön, Gerhard; Cheng, Bastian; Thomalla, Götz; Gerloff, Christian; Hummel, Friedhelm C

    2017-01-01

    Functional imaging studies have argued that interactions between cortical motor areas and the cerebellum are relevant for motor output and recovery processes after stroke. However, the impact of the underlying structural connections is poorly understood. To investigate this, diffusion-weighted brain imaging was conducted in 26 well-characterized chronic stroke patients (aged 63 ± 1.9 years, 18 males) with supratentorial ischemic lesions and 26 healthy participants. Probabilistic tractography was used to reconstruct reciprocal cortico-cerebellar tracts and to relate their microstructural integrity to residual motor functioning applying linear regression modeling. The main finding was a significant association between cortico-cerebellar structural connectivity and residual motor function, independent from the level of damage to the cortico-spinal tract. Specifically, white matter integrity of the cerebellar outflow tract, the dentato-thalamo-cortical tract, was positively related to both general motor output and fine motor skills. Additionally, the integrity of the descending cortico-ponto-cerebellar tract contributed to rather fine motor skills. A comparable structure-function relationship was not evident in the controls. The present study provides first tract-related structural data demonstrating a critical importance of distinct cortico-cerebellar connections for motor output after stroke. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  7. Nurse aide decision making in nursing homes: factors affecting empowerment.

    PubMed

    Chaudhuri, Tanni; Yeatts, Dale E; Cready, Cynthia M

    2013-09-01

    To evaluate factors affecting structural empowerment among nurse aides in nursing homes. Structural empowerment can be defined as the actual rather than perceived ability to make autonomous decisions within an organisation. Given the paucity of research on the subject, this study helps to close the gap by identifying factors that affect nurse aide empowerment, that is, decision-making among nurse aides. The data for the study come from self-administered questionnaires distributed to direct-care workers (nurse aides) in 11 nursing homes in a southern state in the USA. Ordinary least square regression models were estimated to analyse the effects of demographic predictors, personal factors (competency, emotional exhaustion and positive attitude) and structural characteristics (coworker and supervisor support, information availability and shared governance) on nurse aide decision-making. Findings suggest race among demographic predictors, emotional exhaustion among personal characteristics, and supervisor support, and shared governance among structural factors, significantly affect nurse aide decision-making. It is important to explore race as one of the central determinants of structural empowerment among nurse aides. In addition, the nature and type of emotional exhaustion that propels decision-making needs to be further examined. The study shows the importance of shared governance and supervisor support for fostering nurse aide empowerment. © 2013 Blackwell Publishing Ltd.

  8. Additive hazards regression and partial likelihood estimation for ecological monitoring data across space.

    PubMed

    Lin, Feng-Chang; Zhu, Jun

    2012-01-01

    We develop continuous-time models for the analysis of environmental or ecological monitoring data such that subjects are observed at multiple monitoring time points across space. Of particular interest are additive hazards regression models where the baseline hazard function can take on flexible forms. We consider time-varying covariates and take into account spatial dependence via autoregression in space and time. We develop statistical inference for the regression coefficients via partial likelihood. Asymptotic properties, including consistency and asymptotic normality, are established for parameter estimates under suitable regularity conditions. Feasible algorithms utilizing existing statistical software packages are developed for computation. We also consider a simpler additive hazards model with homogeneous baseline hazard and develop hypothesis testing for homogeneity. A simulation study demonstrates that the statistical inference using partial likelihood has sound finite-sample properties and offers a viable alternative to maximum likelihood estimation. For illustration, we analyze data from an ecological study that monitors bark beetle colonization of red pines in a plantation of Wisconsin.

  9. The role of defensible space for residential structure protection during wildfires

    USGS Publications Warehouse

    Syphard, Alexandra D.; Brennan, Teresa J.; Keeley, Jon E.

    2014-01-01

    With the potential for worsening fire conditions, discussion is escalating over how to best reduce effects on urban communities. A widely supported strategy is the creation of defensible space immediately surrounding homes and other structures. Although state and local governments publish specific guidelines and requirements, there is little empirical evidence to suggest how much vegetation modification is needed to provide significant benefits. We analysed the role of defensible space by mapping and measuring a suite of variables on modern pre-fire aerial photography for 1000 destroyed and 1000 surviving structures for all fires where homes burned from 2001 to 2010 in San Diego County, CA, USA. Structures were more likely to survive a fire with defensible space immediately adjacent to them. The most effective treatment distance varied between 5 and 20 m (16–58 ft) from the structure, but distances larger than 30 m (100 ft) did not provide additional protection, even for structures located on steep slopes. The most effective actions were reducing woody cover up to 40% immediately adjacent to structures and ensuring that vegetation does not overhang or touch the structure. Multiple-regression models showed landscape-scale factors, including low housing density and distances to major roads, were more important in explaining structure destruction. The best long-term solution will involve a suite of prevention measures that include defensible space as well as building design approach, community education and proactive land use planning that limits exposure to fire.

  10. A comparative study on entrepreneurial attitudes modeled with logistic regression and Bayes nets.

    PubMed

    López Puga, Jorge; García García, Juan

    2012-11-01

    Entrepreneurship research is receiving increasing attention in our context, as entrepreneurs are key social agents involved in economic development. We compare the success of the dichotomic logistic regression model and the Bayes simple classifier to predict entrepreneurship, after manipulating the percentage of missing data and the level of categorization in predictors. A sample of undergraduate university students (N = 1230) completed five scales (motivation, attitude towards business creation, obstacles, deficiencies, and training needs) and we found that each of them predicted different aspects of the tendency to business creation. Additionally, our results show that the receiver operating characteristic (ROC) curve is affected by the rate of missing data in both techniques, but logistic regression seems to be more vulnerable when faced with missing data, whereas Bayes nets underperform slightly when categorization has been manipulated. Our study sheds light on the potential entrepreneur profile and we propose to use Bayesian networks as an additional alternative to overcome the weaknesses of logistic regression when missing data are present in applied research.

  11. Parametric and Nonparametric Statistical Methods for Genomic Selection of Traits with Additive and Epistatic Genetic Architectures

    PubMed Central

    Howard, Réka; Carriquiry, Alicia L.; Beavis, William D.

    2014-01-01

    Parametric and nonparametric methods have been developed for purposes of predicting phenotypes. These methods are based on retrospective analyses of empirical data consisting of genotypic and phenotypic scores. Recent reports have indicated that parametric methods are unable to predict phenotypes of traits with known epistatic genetic architectures. Herein, we review parametric methods including least squares regression, ridge regression, Bayesian ridge regression, least absolute shrinkage and selection operator (LASSO), Bayesian LASSO, best linear unbiased prediction (BLUP), Bayes A, Bayes B, Bayes C, and Bayes Cπ. We also review nonparametric methods including Nadaraya-Watson estimator, reproducing kernel Hilbert space, support vector machine regression, and neural networks. We assess the relative merits of these 14 methods in terms of accuracy and mean squared error (MSE) using simulated genetic architectures consisting of completely additive or two-way epistatic interactions in an F2 population derived from crosses of inbred lines. Each simulated genetic architecture explained either 30% or 70% of the phenotypic variability. The greatest impact on estimates of accuracy and MSE was due to genetic architecture. Parametric methods were unable to predict phenotypic values when the underlying genetic architecture was based entirely on epistasis. Parametric methods were slightly better than nonparametric methods for additive genetic architectures. Distinctions among parametric methods for additive genetic architectures were incremental. Heritability, i.e., proportion of phenotypic variability, had the second greatest impact on estimates of accuracy and MSE. PMID:24727289

  12. MAGMA: Generalized Gene-Set Analysis of GWAS Data

    PubMed Central

    de Leeuw, Christiaan A.; Mooij, Joris M.; Heskes, Tom; Posthuma, Danielle

    2015-01-01

    By aggregating data for complex traits in a biologically meaningful way, gene and gene-set analysis constitute a valuable addition to single-marker analysis. However, although various methods for gene and gene-set analysis currently exist, they generally suffer from a number of issues. Statistical power for most methods is strongly affected by linkage disequilibrium between markers, multi-marker associations are often hard to detect, and the reliance on permutation to compute p-values tends to make the analysis computationally very expensive. To address these issues we have developed MAGMA, a novel tool for gene and gene-set analysis. The gene analysis is based on a multiple regression model, to provide better statistical performance. The gene-set analysis is built as a separate layer around the gene analysis for additional flexibility. This gene-set analysis also uses a regression structure to allow generalization to analysis of continuous properties of genes and simultaneous analysis of multiple gene sets and other gene properties. Simulations and an analysis of Crohn’s Disease data are used to evaluate the performance of MAGMA and to compare it to a number of other gene and gene-set analysis tools. The results show that MAGMA has significantly more power than other tools for both the gene and the gene-set analysis, identifying more genes and gene sets associated with Crohn’s Disease while maintaining a correct type 1 error rate. Moreover, the MAGMA analysis of the Crohn’s Disease data was found to be considerably faster as well. PMID:25885710

  13. MAGMA: generalized gene-set analysis of GWAS data.

    PubMed

    de Leeuw, Christiaan A; Mooij, Joris M; Heskes, Tom; Posthuma, Danielle

    2015-04-01

    By aggregating data for complex traits in a biologically meaningful way, gene and gene-set analysis constitute a valuable addition to single-marker analysis. However, although various methods for gene and gene-set analysis currently exist, they generally suffer from a number of issues. Statistical power for most methods is strongly affected by linkage disequilibrium between markers, multi-marker associations are often hard to detect, and the reliance on permutation to compute p-values tends to make the analysis computationally very expensive. To address these issues we have developed MAGMA, a novel tool for gene and gene-set analysis. The gene analysis is based on a multiple regression model, to provide better statistical performance. The gene-set analysis is built as a separate layer around the gene analysis for additional flexibility. This gene-set analysis also uses a regression structure to allow generalization to analysis of continuous properties of genes and simultaneous analysis of multiple gene sets and other gene properties. Simulations and an analysis of Crohn's Disease data are used to evaluate the performance of MAGMA and to compare it to a number of other gene and gene-set analysis tools. The results show that MAGMA has significantly more power than other tools for both the gene and the gene-set analysis, identifying more genes and gene sets associated with Crohn's Disease while maintaining a correct type 1 error rate. Moreover, the MAGMA analysis of the Crohn's Disease data was found to be considerably faster as well.

  14. Wrong Signs in Regression Coefficients

    NASA Technical Reports Server (NTRS)

    McGee, Holly

    1999-01-01

    When using parametric cost estimation, it is important to note the possibility of the regression coefficients having the wrong sign. A wrong sign is defined as a sign on the regression coefficient opposite to the researcher's intuition and experience. Some possible causes for the wrong sign discussed in this paper are a small range of x's, leverage points, missing variables, multicollinearity, and computational error. Additionally, techniques for determining the cause of the wrong sign are given.

  15. Mass Spectrometry Based Identification of Geometric Isomers during Metabolic Stability Study of a New Cytotoxic Sulfonamide Derivatives Supported by Quantitative Structure-Retention Relationships

    PubMed Central

    Belka, Mariusz; Hewelt-Belka, Weronika; Sławiński, Jarosław; Bączek, Tomasz

    2014-01-01

    A set of 15 new sulphonamide derivatives, presenting antitumor activity have been subjected to a metabolic stability study. The results showed that besides products of biotransformation, some additional peaks occurred in chromatograms. Tandem mass spectrometry revealed the same mass and fragmentation pathway, suggesting that geometric isomerization occurred. Thus, to support this hypothesis, quantitative structure-retention relationships were applied. Human liver microsomes were used as an in vitro model of metabolism. The biotransformation reactions were tracked by liquid chromatography assay and additionally, fragmentation mass spectra were recorded. In silico molecular modeling at a semi-empirical level was conducted as a starting point for molecular descriptor calculations. A quantitative structure-retention relationship model was built applying multiple linear regression based on selected three-dimensional descriptors. The studied compounds revealed high metabolic stability, with a tendency to form hydroxylated biotransformation products. However, significant chemical instability in conditions simulating human body fluids was noticed. According to literature and MS data geometrical isomerization was suggested. The developed in sillico model was able to describe the relationship between the geometry of isomer pairs and their chromatographic retention properties, thus it supported the hypothesis that the observed pairs of peaks are most likely geometric isomers. However, extensive structural investigations are needed to fully identify isomers’ geometry. An effort to describe MS fragmentation pathways of novel chemical structures is often not enough to propose structures of potent metabolites and products of other chemical reactions that can be observed in compound solutions at early drug discovery studies. The results indicate that the relatively non-expensive and not time- and labor-consuming in sillico approach could be a good supportive tool assisting the identification of cis-trans isomers based on retention data. This methodology can be helpful during the structural identification of biotransformation and degradation products of new chemical entities - potential new drugs. PMID:24893169

  16. Comparison of various error functions in predicting the optimum isotherm by linear and non-linear regression analysis for the sorption of basic red 9 by activated carbon.

    PubMed

    Kumar, K Vasanth; Porkodi, K; Rocha, F

    2008-01-15

    A comparison of linear and non-linear regression method in selecting the optimum isotherm was made to the experimental equilibrium data of basic red 9 sorption by activated carbon. The r(2) was used to select the best fit linear theoretical isotherm. In the case of non-linear regression method, six error functions namely coefficient of determination (r(2)), hybrid fractional error function (HYBRID), Marquardt's percent standard deviation (MPSD), the average relative error (ARE), sum of the errors squared (ERRSQ) and sum of the absolute errors (EABS) were used to predict the parameters involved in the two and three parameter isotherms and also to predict the optimum isotherm. Non-linear regression was found to be a better way to obtain the parameters involved in the isotherms and also the optimum isotherm. For two parameter isotherm, MPSD was found to be the best error function in minimizing the error distribution between the experimental equilibrium data and predicted isotherms. In the case of three parameter isotherm, r(2) was found to be the best error function to minimize the error distribution structure between experimental equilibrium data and theoretical isotherms. The present study showed that the size of the error function alone is not a deciding factor to choose the optimum isotherm. In addition to the size of error function, the theory behind the predicted isotherm should be verified with the help of experimental data while selecting the optimum isotherm. A coefficient of non-determination, K(2) was explained and was found to be very useful in identifying the best error function while selecting the optimum isotherm.

  17. Application of stepwise multiple regression techniques to inversion of Nimbus 'IRIS' observations.

    NASA Technical Reports Server (NTRS)

    Ohring, G.

    1972-01-01

    Exploratory studies with Nimbus-3 infrared interferometer-spectrometer (IRIS) data indicate that, in addition to temperature, such meteorological parameters as geopotential heights of pressure surfaces, tropopause pressure, and tropopause temperature can be inferred from the observed spectra with the use of simple regression equations. The technique of screening the IRIS spectral data by means of stepwise regression to obtain the best radiation predictors of meteorological parameters is validated. The simplicity of application of the technique and the simplicity of the derived linear regression equations - which contain only a few terms - suggest usefulness for this approach. Based upon the results obtained, suggestions are made for further development and exploitation of the stepwise regression analysis technique.

  18. Structure-function relationships using spectral-domain optical coherence tomography: comparison with scanning laser polarimetry.

    PubMed

    Aptel, Florent; Sayous, Romain; Fortoul, Vincent; Beccat, Sylvain; Denis, Philippe

    2010-12-01

    To evaluate and compare the regional relationships between visual field sensitivity and retinal nerve fiber layer (RNFL) thickness as measured by spectral-domain optical coherence tomography (OCT) and scanning laser polarimetry. Prospective cross-sectional study. One hundred and twenty eyes of 120 patients (40 with healthy eyes, 40 with suspected glaucoma, and 40 with glaucoma) were tested on Cirrus-OCT, GDx VCC, and standard automated perimetry. Raw data on RNFL thickness were extracted for 256 peripapillary sectors of 1.40625 degrees each for the OCT measurement ellipse and 64 peripapillary sectors of 5.625 degrees each for the GDx VCC measurement ellipse. Correlations between peripapillary RNFL thickness in 6 sectors and visual field sensitivity in the 6 corresponding areas were evaluated using linear and logarithmic regression analysis. Receiver operating curve areas were calculated for each instrument. With spectral-domain OCT, the correlations (r(2)) between RNFL thickness and visual field sensitivity ranged from 0.082 (nasal RNFL and corresponding visual field area, linear regression) to 0.726 (supratemporal RNFL and corresponding visual field area, logarithmic regression). By comparison, with GDx-VCC, the correlations ranged from 0.062 (temporal RNFL and corresponding visual field area, linear regression) to 0.362 (supratemporal RNFL and corresponding visual field area, logarithmic regression). In pairwise comparisons, these structure-function correlations were generally stronger with spectral-domain OCT than with GDx VCC and with logarithmic regression than with linear regression. The largest areas under the receiver operating curve were seen for OCT superior thickness (0.963 ± 0.022; P < .001) in eyes with glaucoma and for OCT average thickness (0.888 ± 0.072; P < .001) in eyes with suspected glaucoma. The structure-function relationship was significantly stronger with spectral-domain OCT than with scanning laser polarimetry, and was better expressed logarithmically than linearly. Measurements with these 2 instruments should not be considered to be interchangeable. Copyright © 2010 Elsevier Inc. All rights reserved.

  19. Prediction models for clustered data: comparison of a random intercept and standard regression model

    PubMed Central

    2013-01-01

    Background When study data are clustered, standard regression analysis is considered inappropriate and analytical techniques for clustered data need to be used. For prediction research in which the interest of predictor effects is on the patient level, random effect regression models are probably preferred over standard regression analysis. It is well known that the random effect parameter estimates and the standard logistic regression parameter estimates are different. Here, we compared random effect and standard logistic regression models for their ability to provide accurate predictions. Methods Using an empirical study on 1642 surgical patients at risk of postoperative nausea and vomiting, who were treated by one of 19 anesthesiologists (clusters), we developed prognostic models either with standard or random intercept logistic regression. External validity of these models was assessed in new patients from other anesthesiologists. We supported our results with simulation studies using intra-class correlation coefficients (ICC) of 5%, 15%, or 30%. Standard performance measures and measures adapted for the clustered data structure were estimated. Results The model developed with random effect analysis showed better discrimination than the standard approach, if the cluster effects were used for risk prediction (standard c-index of 0.69 versus 0.66). In the external validation set, both models showed similar discrimination (standard c-index 0.68 versus 0.67). The simulation study confirmed these results. For datasets with a high ICC (≥15%), model calibration was only adequate in external subjects, if the used performance measure assumed the same data structure as the model development method: standard calibration measures showed good calibration for the standard developed model, calibration measures adapting the clustered data structure showed good calibration for the prediction model with random intercept. Conclusion The models with random intercept discriminate better than the standard model only if the cluster effect is used for predictions. The prediction model with random intercept had good calibration within clusters. PMID:23414436

  20. Prediction models for clustered data: comparison of a random intercept and standard regression model.

    PubMed

    Bouwmeester, Walter; Twisk, Jos W R; Kappen, Teus H; van Klei, Wilton A; Moons, Karel G M; Vergouwe, Yvonne

    2013-02-15

    When study data are clustered, standard regression analysis is considered inappropriate and analytical techniques for clustered data need to be used. For prediction research in which the interest of predictor effects is on the patient level, random effect regression models are probably preferred over standard regression analysis. It is well known that the random effect parameter estimates and the standard logistic regression parameter estimates are different. Here, we compared random effect and standard logistic regression models for their ability to provide accurate predictions. Using an empirical study on 1642 surgical patients at risk of postoperative nausea and vomiting, who were treated by one of 19 anesthesiologists (clusters), we developed prognostic models either with standard or random intercept logistic regression. External validity of these models was assessed in new patients from other anesthesiologists. We supported our results with simulation studies using intra-class correlation coefficients (ICC) of 5%, 15%, or 30%. Standard performance measures and measures adapted for the clustered data structure were estimated. The model developed with random effect analysis showed better discrimination than the standard approach, if the cluster effects were used for risk prediction (standard c-index of 0.69 versus 0.66). In the external validation set, both models showed similar discrimination (standard c-index 0.68 versus 0.67). The simulation study confirmed these results. For datasets with a high ICC (≥15%), model calibration was only adequate in external subjects, if the used performance measure assumed the same data structure as the model development method: standard calibration measures showed good calibration for the standard developed model, calibration measures adapting the clustered data structure showed good calibration for the prediction model with random intercept. The models with random intercept discriminate better than the standard model only if the cluster effect is used for predictions. The prediction model with random intercept had good calibration within clusters.

  1. [Quantitative structure-gas chromatographic retention relationship of polycyclic aromatic sulfur heterocycles using molecular electronegativity-distance vector].

    PubMed

    Li, Zhenghua; Cheng, Fansheng; Xia, Zhining

    2011-01-01

    The chemical structures of 114 polycyclic aromatic sulfur heterocycles (PASHs) have been studied by molecular electronegativity-distance vector (MEDV). The linear relationships between gas chromatographic retention index and the MEDV have been established by a multiple linear regression (MLR) model. The results of variable selection by stepwise multiple regression (SMR) and the powerful predictive abilities of the optimization model appraised by leave-one-out cross-validation showed that the optimization model with the correlation coefficient (R) of 0.994 7 and the cross-validated correlation coefficient (Rcv) of 0.994 0 possessed the best statistical quality. Furthermore, when the 114 PASHs compounds were divided into calibration and test sets in the ratio of 2:1, the statistical analysis showed our models possesses almost equal statistical quality, the very similar regression coefficients and the good robustness. The quantitative structure-retention relationship (QSRR) model established may provide a convenient and powerful method for predicting the gas chromatographic retention of PASHs.

  2. Interaction Models for Functional Regression.

    PubMed

    Usset, Joseph; Staicu, Ana-Maria; Maity, Arnab

    2016-02-01

    A functional regression model with a scalar response and multiple functional predictors is proposed that accommodates two-way interactions in addition to their main effects. The proposed estimation procedure models the main effects using penalized regression splines, and the interaction effect by a tensor product basis. Extensions to generalized linear models and data observed on sparse grids or with measurement error are presented. A hypothesis testing procedure for the functional interaction effect is described. The proposed method can be easily implemented through existing software. Numerical studies show that fitting an additive model in the presence of interaction leads to both poor estimation performance and lost prediction power, while fitting an interaction model where there is in fact no interaction leads to negligible losses. The methodology is illustrated on the AneuRisk65 study data.

  3. Global Efficiency of Structural Networks Mediates Cognitive Control in Mild Cognitive Impairment

    PubMed Central

    Berlot, Rok; Metzler-Baddeley, Claudia; Ikram, M. Arfan; Jones, Derek K.; O’Sullivan, Michael J.

    2016-01-01

    Background: Cognitive control has been linked to both the microstructure of individual tracts and the structure of whole-brain networks, but their relative contributions in health and disease remain unclear. Objective: To determine the contribution of both localized white matter tract damage and disruption of global network architecture to cognitive control, in older age and Mild Cognitive Impairment (MCI). Materials and Methods: Twenty-five patients with MCI and 20 age, sex, and intelligence-matched healthy volunteers were investigated with 3 Tesla structural magnetic resonance imaging (MRI). Cognitive control and episodic memory were evaluated with established tests. Structural network graphs were constructed from diffusion MRI-based whole-brain tractography. Their global measures were calculated using graph theory. Regression models utilized both global network metrics and microstructure of specific connections, known to be critical for each domain, to predict cognitive scores. Results: Global efficiency and the mean clustering coefficient of networks were reduced in MCI. Cognitive control was associated with global network topology. Episodic memory, in contrast, correlated with individual temporal tracts only. Relationships between cognitive control and network topology were attenuated by addition of single tract measures to regression models, consistent with a partial mediation effect. The mediation effect was stronger in MCI than healthy volunteers, explaining 23-36% of the effect of cingulum microstructure on cognitive control performance. Network clustering was a significant mediator in the relationship between tract microstructure and cognitive control in both groups. Conclusion: The status of critical connections and large-scale network topology are both important for maintenance of cognitive control in MCI. Mediation via large-scale networks is more important in patients with MCI than healthy volunteers. This effect is domain-specific, and true for cognitive control but not for episodic memory. Interventions to improve cognitive control will need to address both dysfunction of local circuitry and global network architecture to be maximally effective. PMID:28018208

  4. Using Refined Regression Analysis To Assess The Ecological Services Of Restored Wetlands

    EPA Science Inventory

    A hierarchical approach to regression analysis of wetland water treatment was conducted to determine which factors are the most appropriate for characterizing wetlands of differing structure and function. We used this approach in an effort to identify the types and characteristi...

  5. Functional brain networks reconstruction using group sparsity-regularized learning.

    PubMed

    Zhao, Qinghua; Li, Will X Y; Jiang, Xi; Lv, Jinglei; Lu, Jianfeng; Liu, Tianming

    2018-06-01

    Investigating functional brain networks and patterns using sparse representation of fMRI data has received significant interests in the neuroimaging community. It has been reported that sparse representation is effective in reconstructing concurrent and interactive functional brain networks. To date, most of data-driven network reconstruction approaches rarely take consideration of anatomical structures, which are the substrate of brain function. Furthermore, it has been rarely explored whether structured sparse representation with anatomical guidance could facilitate functional networks reconstruction. To address this problem, in this paper, we propose to reconstruct brain networks utilizing the structure guided group sparse regression (S2GSR) in which 116 anatomical regions from the AAL template, as prior knowledge, are employed to guide the network reconstruction when performing sparse representation of whole-brain fMRI data. Specifically, we extract fMRI signals from standard space aligned with the AAL template. Then by learning a global over-complete dictionary, with the learned dictionary as a set of features (regressors), the group structured regression employs anatomical structures as group information to regress whole brain signals. Finally, the decomposition coefficients matrix is mapped back to the brain volume to represent functional brain networks and patterns. We use the publicly available Human Connectome Project (HCP) Q1 dataset as the test bed, and the experimental results indicate that the proposed anatomically guided structure sparse representation is effective in reconstructing concurrent functional brain networks.

  6. Using structural equation modeling to construct calibration equations relating PM2.5 mass concentration samplers to the federal reference method sampler

    NASA Astrophysics Data System (ADS)

    Bilonick, Richard A.; Connell, Daniel P.; Talbott, Evelyn O.; Rager, Judith R.; Xue, Tao

    2015-02-01

    The objective of this study was to remove systematic bias among fine particulate matter (PM2.5) mass concentration measurements made by different types of samplers used in the Pittsburgh Aerosol Research and Inhalation Epidemiology Study (PARIES). PARIES is a retrospective epidemiology study that aims to provide a comprehensive analysis of the associations between air quality and human health effects in the Pittsburgh, Pennsylvania, region from 1999 to 2008. Calibration was needed in order to minimize the amount of systematic error in PM2.5 exposure estimation as a result of including data from 97 different PM2.5 samplers at 47 monitoring sites. Ordinary regression often has been used for calibrating air quality measurements from pairs of measurement devices; however, this is only appropriate when one of the two devices (the "independent" variable) is free from random error, which is rarely the case. A group of methods known as "errors-in-variables" (e.g., Deming regression, reduced major axis regression) has been developed to handle calibration between two devices when both are subject to random error, but these methods require information on the relative sizes of the random errors for each device, which typically cannot be obtained from the observed data. When data from more than two devices (or repeats of the same device) are available, the additional information is not used to inform the calibration. A more general approach that often has been overlooked is the use of a measurement error structural equation model (SEM) that allows the simultaneous comparison of three or more devices (or repeats). The theoretical underpinnings of all of these approaches to calibration are described, and the pros and cons of each are discussed. In particular, it is shown that both ordinary regression (when used for calibration) and Deming regression are particular examples of SEMs but with substantial deficiencies. To illustrate the use of SEMs, the 7865 daily average PM2.5 mass concentration measurements made by seven collocated samplers at an urban monitoring site in Pittsburgh, Pennsylvania, were used. These samplers, which included three federal reference method (FRM) samplers, three speciation samplers, and a tapered element oscillating microbalance (TEOM), operated at various times during the 10-year PARIES study period. Because TEOM measurements are known to depend on temperature, the constructed SEM provided calibration equations relating the TEOM to the FRM and speciation samplers as a function of ambient temperature. It was shown that TEOM imprecision and TEOM bias (relative to the FRM) both decreased as temperature increased. It also was shown that the temperature dependency for bias was non-linear and followed a sigmoidal (logistic) pattern. The speciation samplers exhibited only small bias relative to the FRM samplers, although the FRM samplers were shown to be substantially more precise than both the TEOM and the speciation samplers. Comparison of the SEM results to pairwise simple linear regression results showed that the regression results can differ substantially from the correctly-derived calibration equations, especially if the less-precise device is used as the independent variable in the regression.

  7. Random regression models using different functions to model milk flow in dairy cows.

    PubMed

    Laureano, M M M; Bignardi, A B; El Faro, L; Cardoso, V L; Tonhati, H; Albuquerque, L G

    2014-09-12

    We analyzed 75,555 test-day milk flow records from 2175 primiparous Holstein cows that calved between 1997 and 2005. Milk flow was obtained by dividing the mean milk yield (kg) of the 3 daily milking by the total milking time (min) and was expressed as kg/min. Milk flow was grouped into 43 weekly classes. The analyses were performed using a single-trait Random Regression Models that included direct additive genetic, permanent environmental, and residual random effects. In addition, the contemporary group and linear and quadratic effects of cow age at calving were included as fixed effects. Fourth-order orthogonal Legendre polynomial of days in milk was used to model the mean trend in milk flow. The additive genetic and permanent environmental covariance functions were estimated using random regression Legendre polynomials and B-spline functions of days in milk. The model using a third-order Legendre polynomial for additive genetic effects and a sixth-order polynomial for permanent environmental effects, which contained 7 residual classes, proved to be the most adequate to describe variations in milk flow, and was also the most parsimonious. The heritability in milk flow estimated by the most parsimonious model was of moderate to high magnitude.

  8. Mapping the Structure-Function Relationship in Glaucoma and Healthy Patients Measured with Spectralis OCT and Humphrey Perimetry

    PubMed Central

    Muñoz–Negrete, Francisco J.; Oblanca, Noelia; Rebolleda, Gema

    2018-01-01

    Purpose To study the structure-function relationship in glaucoma and healthy patients assessed with Spectralis OCT and Humphrey perimetry using new statistical approaches. Materials and Methods Eighty-five eyes were prospectively selected and divided into 2 groups: glaucoma (44) and healthy patients (41). Three different statistical approaches were carried out: (1) factor analysis of the threshold sensitivities (dB) (automated perimetry) and the macular thickness (μm) (Spectralis OCT), subsequently applying Pearson's correlation to the obtained regions, (2) nonparametric regression analysis relating the values in each pair of regions that showed significant correlation, and (3) nonparametric spatial regressions using three models designed for the purpose of this study. Results In the glaucoma group, a map that relates structural and functional damage was drawn. The strongest correlation with visual fields was observed in the peripheral nasal region of both superior and inferior hemigrids (r = 0.602 and r = 0.458, resp.). The estimated functions obtained with the nonparametric regressions provided the mean sensitivity that corresponds to each given macular thickness. These functions allowed for accurate characterization of the structure-function relationship. Conclusions Both maps and point-to-point functions obtained linking structure and function damage contribute to a better understanding of this relationship and may help in the future to improve glaucoma diagnosis. PMID:29850196

  9. Clinical Considerations Regarding Regression in Psychotherapy with Patients with Conversion Disorder.

    PubMed

    Kaplan, Marcia

    Regression is a ubiquitous phenomenon in psychodynamic psychotherapy and psychoanalysis, typically part of a reorganization that leads to progression, at least with respect to recruiting elements in the unconscious to consciousness. Regression in patients with conversion disorder (i.e., pseudo-neurological symptoms without an organic basis) is often itself somatic/physical rather than psychic in nature. Psychotherapists working with these patients must be prepared for confusing or frightening forms of regression that should be expected as part of the therapeutic process. In conversion disorder patients with adequate character structure, this regression, when handled effectively by the psychotherapist, ultimately leads to verbalized thoughts and feelings and a gradually strengthening alternative to physically experienced psychic conflict.

  10. Linear regression models for solvent accessibility prediction in proteins.

    PubMed

    Wagner, Michael; Adamczak, Rafał; Porollo, Aleksey; Meller, Jarosław

    2005-04-01

    The relative solvent accessibility (RSA) of an amino acid residue in a protein structure is a real number that represents the solvent exposed surface area of this residue in relative terms. The problem of predicting the RSA from the primary amino acid sequence can therefore be cast as a regression problem. Nevertheless, RSA prediction has so far typically been cast as a classification problem. Consequently, various machine learning techniques have been used within the classification framework to predict whether a given amino acid exceeds some (arbitrary) RSA threshold and would thus be predicted to be "exposed," as opposed to "buried." We have recently developed novel methods for RSA prediction using nonlinear regression techniques which provide accurate estimates of the real-valued RSA and outperform classification-based approaches with respect to commonly used two-class projections. However, while their performance seems to provide a significant improvement over previously published approaches, these Neural Network (NN) based methods are computationally expensive to train and involve several thousand parameters. In this work, we develop alternative regression models for RSA prediction which are computationally much less expensive, involve orders-of-magnitude fewer parameters, and are still competitive in terms of prediction quality. In particular, we investigate several regression models for RSA prediction using linear L1-support vector regression (SVR) approaches as well as standard linear least squares (LS) regression. Using rigorously derived validation sets of protein structures and extensive cross-validation analysis, we compare the performance of the SVR with that of LS regression and NN-based methods. In particular, we show that the flexibility of the SVR (as encoded by metaparameters such as the error insensitivity and the error penalization terms) can be very beneficial to optimize the prediction accuracy for buried residues. We conclude that the simple and computationally much more efficient linear SVR performs comparably to nonlinear models and thus can be used in order to facilitate further attempts to design more accurate RSA prediction methods, with applications to fold recognition and de novo protein structure prediction methods.

  11. Do clinical data and human papilloma virus genotype influence spontaneous regression in grade I cervical intraepithelial neoplasia?

    PubMed

    Cortés-Alaguero, Caterina; González-Mirasol, Esteban; Morales-Roselló, José; Poblet-Martinez, Enrique

    2017-03-15

    To determine whether medical history, clinical examination and human papilloma virus (HPV) genotype influence spontaneous regression in cervical intraepithelial neoplasia grade I (CIN-I). We retrospectively evaluated 232 women who were histologically diagnosed as have CIN-I by means of Kaplan-Meier curves, the pattern of spontaneous regression according to the medical history, clinical examination, and HPV genotype. Spontaneous regression occurred in most patients and was influenced by the presence of multiple HPV genotypes but not by the HPV genotype itself. In addition, regression frequency was diminished when more than 50% of the cervix surface was affected or when an abnormal cytology was present at the beginning of follow-up. The frequency of regression in CIN-I is high, making long-term follow-up and conservative management advisable. Data from clinical examination and HPV genotyping might help to anticipate which lesions will regress.

  12. Sample size determination for logistic regression on a logit-normal distribution.

    PubMed

    Kim, Seongho; Heath, Elisabeth; Heilbrun, Lance

    2017-06-01

    Although the sample size for simple logistic regression can be readily determined using currently available methods, the sample size calculation for multiple logistic regression requires some additional information, such as the coefficient of determination ([Formula: see text]) of a covariate of interest with other covariates, which is often unavailable in practice. The response variable of logistic regression follows a logit-normal distribution which can be generated from a logistic transformation of a normal distribution. Using this property of logistic regression, we propose new methods of determining the sample size for simple and multiple logistic regressions using a normal transformation of outcome measures. Simulation studies and a motivating example show several advantages of the proposed methods over the existing methods: (i) no need for [Formula: see text] for multiple logistic regression, (ii) available interim or group-sequential designs, and (iii) much smaller required sample size.

  13. Predicting human skin absorption of chemicals: development of a novel quantitative structure activity relationship.

    PubMed

    Luo, Wen; Medrek, Sarah; Misra, Jatin; Nohynek, Gerhard J

    2007-02-01

    The objective of this study was to construct and validate a quantitative structure-activity relationship model for skin absorption. Such models are valuable tools for screening and prioritization in safety and efficacy evaluation, and risk assessment of drugs and chemicals. A database of 340 chemicals with percutaneous absorption was assembled. Two models were derived from the training set consisting 306 chemicals (90/10 random split). In addition to the experimental K(ow) values, over 300 2D and 3D atomic and molecular descriptors were analyzed using MDL's QsarIS computer program. Subsequently, the models were validated using both internal (leave-one-out) and external validation (test set) procedures. Using the stepwise regression analysis, three molecular descriptors were determined to have significant statistical correlation with K(p) (R2 = 0.8225): logK(ow), X0 (quantification of both molecular size and the degree of skeletal branching), and SsssCH (count of aromatic carbon groups). In conclusion, two models to estimate skin absorption were developed. When compared to other skin absorption QSAR models in the literature, our model incorporated more chemicals and explored a large number of descriptors. Additionally, our models are reasonably predictive and have met both internal and external statistical validations.

  14. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Zhu Qin, E-mail: zhuqin@fudan.edu.cn; Peng Xizhe, E-mail: xzpeng@fudan.edu.cn

    This study examines the impacts of population size, population structure, and consumption level on carbon emissions in China from 1978 to 2008. To this end, we expanded the stochastic impacts by regression on population, affluence, and technology model and used the ridge regression method, which overcomes the negative influences of multicollinearity among independent variables under acceptable bias. Results reveal that changes in consumption level and population structure were the major impact factors, not changes in population size. Consumption level and carbon emissions were highly correlated. In terms of population structure, urbanization, population age, and household size had distinct effects onmore » carbon emissions. Urbanization increased carbon emissions, while the effect of age acted primarily through the expansion of the labor force and consequent overall economic growth. Shrinking household size increased residential consumption, resulting in higher carbon emissions. Households, rather than individuals, are a more reasonable explanation for the demographic impact on carbon emissions. Potential social policies for low carbon development are also discussed. - Highlights: Black-Right-Pointing-Pointer We examine the impacts of population change on carbon emissions in China. Black-Right-Pointing-Pointer We expand the STIRPAT model by containing population structure factors in the model. Black-Right-Pointing-Pointer The population structure includes age structure, urbanization level, and household size. Black-Right-Pointing-Pointer The ridge regression method is used to estimate the model with multicollinearity. Black-Right-Pointing-Pointer The population structure plays a more important role compared with the population size.« less

  15. Microwave Spectra of the Two Conformers of PROPENE-3-{d}_1 and a Semiexperimental Equilibrium Structure of Propene

    NASA Astrophysics Data System (ADS)

    Craig, Norman C.; Demaison, J.; Rudolph, Heinz Dieter; Gurusinghe, Ranil M.; Tubergen, Michael; Coudert, L. H.; Szalay, Peter; Császár, Attila

    2017-06-01

    FT microwave spectra have been observed and analyzed for the S (in-plane) and A (out-of-plane) conformers of propene-3-{d}_1 in the 10-22 GHz region. Both conformers display splittings due to deuterium quadrupole coupling; for the latter one only, a 19 MHz splitting due to internal rotation of the partially deuterated methyl group has been observed. In addition to rotational constants, the analysis yielded quadrupole coupling constants and parameters describing the tunneling splitting and its rotational dependence. Improved rotational constants for parent propene and the three ^{13}C_1 species are recently available. Use of vibration-rotation interaction constants computed at the MP2(FC)/cc-pVTZ level gave equilibrium rotational constants for these six species and for fourteen more deuterium isotopologues with diminished accuracy from early literature data. A semiexperimental equilibrium structure, r_e^{SE}, has been determined for propene by fitting fourteen structural parameters to the equilibrium rotational constants. The new r_e^{SE} structure compares well with an ab initio equilibrium structure computed with the all-electron CCSD(T)/cc-pV(Q,T)Z model and with a structure obtained using the mixed regression method with predicates and equilibrium rotational constants. N. C. Craig, P. Groner, A. R. Conrad, R. Gurusinghe, M. J. Tubergen J. Mol. Spectrosc. 248, 1-6 (2016).

  16. Altered voxel-wise gray matter structural brain networks in schizophrenia: Association with brain genetic expression pattern.

    PubMed

    Liu, Feng; Tian, Hongjun; Li, Jie; Li, Shen; Zhuo, Chuanjun

    2018-05-04

    Previous seed- and atlas-based structural covariance/connectivity analyses have demonstrated that patients with schizophrenia is accompanied by aberrant structural connection and abnormal topological organization. However, it remains unclear whether this disruption is present in unbiased whole-brain voxel-wise structural covariance networks (SCNs) and whether brain genetic expression variations are linked with network alterations. In this study, ninety-five patients with schizophrenia and 95 matched healthy controls were recruited and gray matter volumes were extracted from high-resolution structural magnetic resonance imaging scans. Whole-brain voxel-wise gray matter SCNs were constructed at the group level and were further analyzed by using graph theory method. Nonparametric permutation tests were employed for group comparisons. In addition, regression modes along with random effect analysis were utilized to explore the associations between structural network changes and gene expression from the Allen Human Brain Atlas. Compared with healthy controls, the patients with schizophrenia showed significantly increased structural covariance strength (SCS) in the right orbital part of superior frontal gyrus and bilateral middle frontal gyrus, while decreased SCS in the bilateral superior temporal gyrus and precuneus. The altered SCS showed reproducible correlations with the expression profiles of the gene classes involved in therapeutic targets and neurodevelopment. Overall, our findings not only demonstrate that the topological architecture of whole-brain voxel-wise SCNs is impaired in schizophrenia, but also provide evidence for the possible role of therapeutic targets and neurodevelopment-related genes in gray matter structural brain networks in schizophrenia.

  17. Gesture and intonation are “sister systems” of infant communication: Evidence from regression patterns of language development

    PubMed Central

    Snow, David P.

    2016-01-01

    This study investigates infants’ transition from nonverbal to verbal communication using evidence from regression patterns. As an example of regressions, prelinguistic infants learning American Sign Language (ASL) use pointing gestures to communicate. At the onset of single signs, however, these gestures disappear. Petitto (1987) attributed the regression to the children’s discovery that pointing has two functions, namely, deixis and linguistic pronouns. The 1:2 relation (1 form, 2 functions) violates the simple 1:1 pattern that infants are believed to expect. This kind of conflict, Petitto argued, explains the regression. Based on the additional observation that the regression coincided with the boundary between prelinguistic and linguistic communication, Petitto concluded that the prelinguistic and linguistic periods are autonomous. The purpose of the present study was to evaluate the 1:1 model and to determine whether it explains a previously reported regression of intonation in English. Background research showed that gestures and intonation have different forms but the same pragmatic meanings, a 2:1 form-function pattern that plausibly precipitates the regression. The hypothesis of the study was that gestures and intonation are closely related. Moreover, because gestures and intonation change in the opposite direction, the negative correlation between them indicates a robust inverse relationship. To test this prediction, speech samples of 29 infants (8 to 16 months) were analyzed acoustically and compared to parent-report data on several verbal and gestural scales. In support of the hypothesis, gestures alone were inversely correlated with intonation. In addition, the regression model explains nonlinearities stemming from different form-function configurations. However, the results failed to support the claim that regressions linked to early words or signs reflect autonomy. The discussion ends with a focus on the special role of intonation in children’s transition from “prelinguistic” communication to language. PMID:28729753

  18. On Relevance of Codon Usage to Expression of Synthetic and Natural Genes in Escherichia coli

    PubMed Central

    Supek, Fran; Šmuc, Tomislav

    2010-01-01

    A recent investigation concluded that codon bias did not affect expression of green fluorescent protein (GFP) variants in Escherichia coli, while stability of an mRNA secondary structure near the 5′ end played a dominant role. We demonstrate that combining the two variables using regression trees or support vector regression yields a biologically plausible model with better support in the GFP data set and in other experimental data: codon usage is relevant for protein levels if the 5′ mRNA structures are not strong. Natural E. coli genes had weaker 5′ mRNA structures than the examined set of GFP variants and did not exhibit a correlation between the folding free energy of 5′ mRNA structures and protein expression. PMID:20421604

  19. Robust neural network with applications to credit portfolio data analysis.

    PubMed

    Feng, Yijia; Li, Runze; Sudjianto, Agus; Zhang, Yiyun

    2010-01-01

    In this article, we study nonparametric conditional quantile estimation via neural network structure. We proposed an estimation method that combines quantile regression and neural network (robust neural network, RNN). It provides good smoothing performance in the presence of outliers and can be used to construct prediction bands. A Majorization-Minimization (MM) algorithm was developed for optimization. Monte Carlo simulation study is conducted to assess the performance of RNN. Comparison with other nonparametric regression methods (e.g., local linear regression and regression splines) in real data application demonstrate the advantage of the newly proposed procedure.

  20. Modeling sheep pox disease from the 1994-1998 epidemic in Evros Prefecture, Greece.

    PubMed

    Malesios, C; Demiris, N; Abas, Z; Dadousis, K; Koutroumanidis, T

    2014-10-01

    Sheep pox is a highly transmissible disease which can cause serious loss of livestock and can therefore have major economic impact. We present data from sheep pox epidemics which occurred between 1994 and 1998. The data include weekly records of infected farms as well as a number of covariates. We implement Bayesian stochastic regression models which, in addition to various explanatory variables like seasonal and environmental/meteorological factors, also contain serial correlation structure based on variants of the Ornstein-Uhlenbeck process. We take a predictive view in model selection by utilizing deviance-based measures. The results indicate that seasonality and the number of infected farms are important predictors for sheep pox incidence. Copyright © 2014 Elsevier Ltd. All rights reserved.

  1. Early experiences building a software quality prediction model

    NASA Technical Reports Server (NTRS)

    Agresti, W. W.; Evanco, W. M.; Smith, M. C.

    1990-01-01

    Early experiences building a software quality prediction model are discussed. The overall research objective is to establish a capability to project a software system's quality from an analysis of its design. The technical approach is to build multivariate models for estimating reliability and maintainability. Data from 21 Ada subsystems were analyzed to test hypotheses about various design structures leading to failure-prone or unmaintainable systems. Current design variables highlight the interconnectivity and visibility of compilation units. Other model variables provide for the effects of reusability and software changes. Reported results are preliminary because additional project data is being obtained and new hypotheses are being developed and tested. Current multivariate regression models are encouraging, explaining 60 to 80 percent of the variation in error density of the subsystems.

  2. Early Childhood Adversity and Pregnancy Outcomes

    PubMed Central

    Smith, Megan V.; Gotman, Nathan; Yonkers, Kimberly A.

    2016-01-01

    Objectives To examine the association between adverse childhood experiences (ACEs) and pregnancy outcomes; to explore mediators of this association including psychiatric illness and health habits. Methods Exposure to ACEs was determined by the Early Trauma Inventory Self Report Short Form; psychiatric diagnoses were generated by the Composite International Diagnostic Interview administered in a cohort of 2303 pregnant women. Linear regression and structural equation modeling bootstrapping approaches tested for multiple mediators. Results Each additional ACE decreased birth weight by 16.33 g and decreased gestational age by 0.063. Smoking was the strongest mediator of the effect on gestational age. Conclusions ACEs have an enduring effect on maternal reproductive health, as manifested by mothers’ delivery of offspring that were of reduced birth weight and shorter gestational age. PMID:26762511

  3. Aircraft wing weight build-up methodology with modification for materials and construction techniques

    NASA Technical Reports Server (NTRS)

    York, P.; Labell, R. W.

    1980-01-01

    An aircraft wing weight estimating method based on a component buildup technique is described. A simplified analytically derived beam model, modified by a regression analysis, is used to estimate the wing box weight, utilizing a data base of 50 actual airplane wing weights. Factors representing materials and methods of construction were derived and incorporated into the basic wing box equations. Weight penalties to the wing box for fuel, engines, landing gear, stores and fold or pivot are also included. Methods for estimating the weight of additional items (secondary structure, control surfaces) have the option of using details available at the design stage (i.e., wing box area, flap area) or default values based on actual aircraft from the data base.

  4. Local Linear Regression for Data with AR Errors.

    PubMed

    Li, Runze; Li, Yan

    2009-07-01

    In many statistical applications, data are collected over time, and they are likely correlated. In this paper, we investigate how to incorporate the correlation information into the local linear regression. Under the assumption that the error process is an auto-regressive process, a new estimation procedure is proposed for the nonparametric regression by using local linear regression method and the profile least squares techniques. We further propose the SCAD penalized profile least squares method to determine the order of auto-regressive process. Extensive Monte Carlo simulation studies are conducted to examine the finite sample performance of the proposed procedure, and to compare the performance of the proposed procedures with the existing one. From our empirical studies, the newly proposed procedures can dramatically improve the accuracy of naive local linear regression with working-independent error structure. We illustrate the proposed methodology by an analysis of real data set.

  5. Soft sediment deformation structures in the Maastrichtian Ajali Formation Western Flank of Anambra Basin, Southern Nigeria

    NASA Astrophysics Data System (ADS)

    Olabode, Solomon Ojo

    2014-01-01

    Soft sediment deformation structures were recognized in the Maastrichtian shallow marine wave to tide influenced regressive sediments of Ajali Formation in the western flank of Anambra basin, southern Nigerian. The soft sediment deformation structures were in association with cross bedded sands, clay and silt and show different morphological types. Two main types recognised are plastic deformations represented by different types of recumbent folds and injection structure represented by clastic dykes. Other structures in association with the plastic deformation structures include distorted convolute lamination, subsidence lobes, pillars, cusps and sand balls. These structures are interpreted to have been formed by liquefaction and fluidization mechanisms. The driving forces inferred include gravitational instabilities and hydraulic processes. Facies analysis, detailed morphologic study of the soft sediment deformation structures and previous tectonic history of the basin indicate that the main trigger agent for deformation is earthquake shock. The soft sediment deformation structures recognised in the western part of Anambra basin provide a continuous record of the tectonic processes that acted on the regressive Ajali Formation during the Maastrichtian.

  6. Glomerular structural-functional relationship models of diabetic nephropathy are robust in type 1 diabetic patients.

    PubMed

    Mauer, Michael; Caramori, Maria Luiza; Fioretto, Paola; Najafian, Behzad

    2015-06-01

    Studies of structural-functional relationships have improved understanding of the natural history of diabetic nephropathy (DN). However, in order to consider structural end points for clinical trials, the robustness of the resultant models needs to be verified. This study examined whether structural-functional relationship models derived from a large cohort of type 1 diabetic (T1D) patients with a wide range of renal function are robust. The predictability of models derived from multiple regression analysis and piecewise linear regression analysis was also compared. T1D patients (n = 161) with research renal biopsies were divided into two equal groups matched for albumin excretion rate (AER). Models to explain AER and glomerular filtration rate (GFR) by classical DN lesions in one group (T1D-model, or T1D-M) were applied to the other group (T1D-test, or T1D-T) and regression analyses were performed. T1D-M-derived models explained 70 and 63% of AER variance and 32 and 21% of GFR variance in T1D-M and T1D-T, respectively, supporting the substantial robustness of the models. Piecewise linear regression analyses substantially improved predictability of the models with 83% of AER variance and 66% of GFR variance explained by classical DN glomerular lesions alone. These studies demonstrate that DN structural-functional relationship models are robust, and if appropriate models are used, glomerular lesions alone explain a major proportion of AER and GFR variance in T1D patients. © The Author 2014. Published by Oxford University Press on behalf of ERA-EDTA. All rights reserved.

  7. Three-dimensional mapping of soil chemical characteristics at micrometric scale: Statistical prediction by combining 2D SEM-EDX data and 3D X-ray computed micro-tomographic images

    NASA Astrophysics Data System (ADS)

    Hapca, Simona

    2015-04-01

    Many soil properties and functions emerge from interactions of physical, chemical and biological processes at microscopic scales, which can be understood only by integrating techniques that traditionally are developed within separate disciplines. While recent advances in imaging techniques, such as X-ray computed tomography (X-ray CT), offer the possibility to reconstruct the 3D physical structure at fine resolutions, for the distribution of chemicals in soil, existing methods, based on scanning electron microscope (SEM) and energy dispersive X-ray detection (EDX), allow for characterization of the chemical composition only on 2D surfaces. At present, direct 3D measurement techniques are still lacking, sequential sectioning of soils, followed by 2D mapping of chemical elements and interpolation to 3D, being an alternative which is explored in this study. Specifically, we develop an integrated experimental and theoretical framework which combines 3D X-ray CT imaging technique with 2D SEM-EDX and use spatial statistics methods to map the chemical composition of soil in 3D. The procedure involves three stages 1) scanning a resin impregnated soil cube by X-ray CT, followed by precision cutting to produce parallel thin slices, the surfaces of which are scanned by SEM-EDX, 2) alignment of the 2D chemical maps within the internal 3D structure of the soil cube, and 3) development, of spatial statistics methods to predict the chemical composition of 3D soil based on the observed 2D chemical and 3D physical data. Specifically, three statistical models consisting of a regression tree, a regression tree kriging and cokriging model were used to predict the 3D spatial distribution of carbon, silicon, iron and oxygen in soil, these chemical elements showing a good spatial agreement between the X-ray grayscale intensities and the corresponding 2D SEM-EDX data. Due to the spatial correlation between the physical and chemical data, the regression-tree model showed a great potential in predicting chemical composition in particular for iron, which is generally sparsely distributed in soil. For carbon, silicon and oxygen, which are more densely distributed, the additional kriging of the regression tree residuals improved significantly the prediction, whereas prediction based on co-kriging was less consistent across replicates, underperforming regression-tree kriging. The present study shows a great potential in integrating geo-statistical methods with imaging techniques to unveil the 3D chemical structure of soil at very fine scales, the framework being suitable to be further applied to other types of imaging data such as images of biological thin sections for characterization of microbial distribution. Key words: X-ray CT, SEM-EDX, segmentation techniques, spatial correlation, 3D soil images, 2D chemical maps.

  8. Complex networks untangle competitive advantage in Australian football

    NASA Astrophysics Data System (ADS)

    Braham, Calum; Small, Michael

    2018-05-01

    We construct player-based complex network models of Australian football teams for the 2014 Australian Football League season; modelling the passes between players as weighted, directed edges. We show that analysis of these measures can give an insight into the underlying structure and strategy of Australian football teams, quantitatively distinguishing different playing styles. The relationships observed between network properties and match outcomes suggest that successful teams exhibit well-connected passing networks with the passes distributed between all 22 players as evenly as possible. Linear regression models of team scores and match margins show significant improvements in R2 and Bayesian information criterion when network measures are added to models that use conventional measures, demonstrating that network analysis measures contain useful, extra information. Several measures, particularly the mean betweenness centrality, are shown to be useful in predicting the outcomes of future matches, suggesting they measure some aspect of the intrinsic strength of teams. In addition, several local centrality measures are shown to be useful in analysing individual players' differing contributions to the team's structure.

  9. Complex networks untangle competitive advantage in Australian football.

    PubMed

    Braham, Calum; Small, Michael

    2018-05-01

    We construct player-based complex network models of Australian football teams for the 2014 Australian Football League season; modelling the passes between players as weighted, directed edges. We show that analysis of these measures can give an insight into the underlying structure and strategy of Australian football teams, quantitatively distinguishing different playing styles. The relationships observed between network properties and match outcomes suggest that successful teams exhibit well-connected passing networks with the passes distributed between all 22 players as evenly as possible. Linear regression models of team scores and match margins show significant improvements in R 2 and Bayesian information criterion when network measures are added to models that use conventional measures, demonstrating that network analysis measures contain useful, extra information. Several measures, particularly the mean betweenness centrality, are shown to be useful in predicting the outcomes of future matches, suggesting they measure some aspect of the intrinsic strength of teams. In addition, several local centrality measures are shown to be useful in analysing individual players' differing contributions to the team's structure.

  10. Statistical Issues in Galaxy Cluster Cosmology

    NASA Technical Reports Server (NTRS)

    Mantz, Adam

    2013-01-01

    The number and growth of massive galaxy clusters are sensitive probes of cosmological structure formation. Surveys at various wavelengths can detect clusters to high redshift, but the fact that cluster mass is not directly observable complicates matters, requiring us to simultaneously constrain scaling relations of observable signals with mass. The problem can be cast as one of regression, in which the data set is truncated, the (cosmology-dependent) underlying population must be modeled, and strong, complex correlations between measurements often exist. Simulations of cosmological structure formation provide a robust prediction for the number of clusters in the Universe as a function of mass and redshift (the mass function), but they cannot reliably predict the observables used to detect clusters in sky surveys (e.g. X-ray luminosity). Consequently, observers must constrain observable-mass scaling relations using additional data, and use the scaling relation model in conjunction with the mass function to predict the number of clusters as a function of redshift and luminosity.

  11. Statistical Analyses of Raw Material Data for MTM45-1/CF7442A-36% RW: CMH Cure Cycle

    NASA Technical Reports Server (NTRS)

    Coroneos, Rula; Pai, Shantaram, S.; Murthy, Pappu

    2013-01-01

    This report describes statistical characterization of physical properties of the composite material system MTM45-1/CF7442A, which has been tested and is currently being considered for use on spacecraft structures. This composite system is made of 6K plain weave graphite fibers in a highly toughened resin system. This report summarizes the distribution types and statistical details of the tests and the conditions for the experimental data generated. These distributions will be used in multivariate regression analyses to help determine material and design allowables for similar material systems and to establish a procedure for other material systems. Additionally, these distributions will be used in future probabilistic analyses of spacecraft structures. The specific properties that are characterized are the ultimate strength, modulus, and Poisson??s ratio by using a commercially available statistical package. Results are displayed using graphical and semigraphical methods and are included in the accompanying appendixes.

  12. Optical critical dimension metrology for directed self-assembly assisted contact hole shrink

    NASA Astrophysics Data System (ADS)

    Dixit, Dhairya; Green, Avery; Hosler, Erik R.; Kamineni, Vimal; Preil, Moshe E.; Keller, Nick; Race, Joseph; Chun, Jun Sung; O'Sullivan, Michael; Khare, Prasanna; Montgomery, Warren; Diebold, Alain C.

    2016-01-01

    Directed self-assembly (DSA) is a potential patterning solution for future generations of integrated circuits. Its main advantages are high pattern resolution (˜10 nm), high throughput, no requirement of high-resolution mask, and compatibility with standard fab-equipment and processes. The application of Mueller matrix (MM) spectroscopic ellipsometry-based scatterometry to optically characterize DSA patterned contact hole structures fabricated with phase-separated polystyrene-b-polymethylmethacrylate (PS-b-PMMA) is described. A regression-based approach is used to calculate the guide critical dimension (CD), DSA CD, height of the PS column, thicknesses of underlying layers, and contact edge roughness of the post PMMA etch DSA contact hole sample. Scanning electron microscopy and imaging analysis is conducted as a comparative metric for scatterometry. In addition, optical model-based simulations are used to investigate MM elements' sensitivity to various DSA-based contact hole structures, predict sensitivity to dimensional changes, and its limits to characterize DSA-induced defects, such as hole placement inaccuracy, missing vias, and profile inaccuracy of the PMMA cylinder.

  13. Belief, motivational, and ideological correlates of human rights attitudes.

    PubMed

    Crowson, H Michael; DeBacker, Teresa K

    2008-06-01

    Many people believe that an informed and thoughtful citizenry is essential to the maintenance of democratic ideals within the United States and the spread of those ideals abroad. Since the September 11, 2001, terrorist attacks, the evidence that Americans consider issues of human dignity and rights when making judgments about the U.S. government's war on terror has been mixed. In our study, we assessed the relative contributions of ideological, belief, and cognitive-motivational factors to the prediction of human rights and civil liberties attitudes. Individuals scoring high on measures of right-wing authoritarianism (RWA) and the belief that the structure of knowledge is simple were the most likely to support restrictions on human rights and civil liberties as part of the war on terror. In a subsequent regression analysis, individuals scoring higher on personal need for structure or exhibiting lower levels of epistemological belief complexity tended to score higher on RWA. Additionally, men were generally more likely to support restrictions on rights and liberties and to score higher on RWA than were women.

  14. Predictors for return to work for those with occupational respiratory disease: clinical and structural factors.

    PubMed

    Zoeckler, Jeanette M; Cibula, Donald A; Morley, Christopher P; Lax, Michael B

    2013-12-01

    Few occupational researchers have examined "return to work" among patients with work-related respiratory diseases. In addition, prior studies have emphasized individual patient characteristics rather than a more multi-dimensional approach that includes both clinical and structural factors. A retrospective chart review identified patients with occupational respiratory diseases in the Occupational Health Clinical Center, Syracuse, NY between 1991 and 2009. We assessed predictors of work status using an exploratory, sequential mixed methods research design, multinomial (n = 188) and Cox regressions (n = 130). The findings suggest that patients with an increased number of diagnoses, non-union members, and those who took more than a year before clinical presentation had significantly poorer work status outcomes, after adjusting for age, education level, and relevant diagnoses. Efforts to prevent slow return to work after developing occupational respiratory disease should recognize the importance of timely access to occupational health services, disease severity, union membership, and smoking status. © 2013 Wiley Periodicals, Inc.

  15. Importance of spatial autocorrelation in modeling bird distributions at a continental scale

    USGS Publications Warehouse

    Bahn, V.; O'Connor, R.J.; Krohn, W.B.

    2006-01-01

    Spatial autocorrelation in species' distributions has been recognized as inflating the probability of a type I error in hypotheses tests, causing biases in variable selection, and violating the assumption of independence of error terms in models such as correlation or regression. However, it remains unclear whether these problems occur at all spatial resolutions and extents, and under which conditions spatially explicit modeling techniques are superior. Our goal was to determine whether spatial models were superior at large extents and across many different species. In addition, we investigated the importance of purely spatial effects in distribution patterns relative to the variation that could be explained through environmental conditions. We studied distribution patterns of 108 bird species in the conterminous United States using ten years of data from the Breeding Bird Survey. We compared the performance of spatially explicit regression models with non-spatial regression models using Akaike's information criterion. In addition, we partitioned the variance in species distributions into an environmental, a pure spatial and a shared component. The spatially-explicit conditional autoregressive regression models strongly outperformed the ordinary least squares regression models. In addition, partialling out the spatial component underlying the species' distributions showed that an average of 17% of the explained variation could be attributed to purely spatial effects independent of the spatial autocorrelation induced by the underlying environmental variables. We concluded that location in the range and neighborhood play an important role in the distribution of species. Spatially explicit models are expected to yield better predictions especially for mobile species such as birds, even in coarse-grained models with a large extent. ?? Ecography.

  16. Genetic analysis of body weights of individually fed beef bulls in South Africa using random regression models.

    PubMed

    Selapa, N W; Nephawe, K A; Maiwashe, A; Norris, D

    2012-02-08

    The aim of this study was to estimate genetic parameters for body weights of individually fed beef bulls measured at centralized testing stations in South Africa using random regression models. Weekly body weights of Bonsmara bulls (N = 2919) tested between 1999 and 2003 were available for the analyses. The model included a fixed regression of the body weights on fourth-order orthogonal Legendre polynomials of the actual days on test (7, 14, 21, 28, 35, 42, 49, 56, 63, 70, 77, and 84) for starting age and contemporary group effects. Random regressions on fourth-order orthogonal Legendre polynomials of the actual days on test were included for additive genetic effects and additional uncorrelated random effects of the weaning-herd-year and the permanent environment of the animal. Residual effects were assumed to be independently distributed with heterogeneous variance for each test day. Variance ratios for additive genetic, permanent environment and weaning-herd-year for weekly body weights at different test days ranged from 0.26 to 0.29, 0.37 to 0.44 and 0.26 to 0.34, respectively. The weaning-herd-year was found to have a significant effect on the variation of body weights of bulls despite a 28-day adjustment period. Genetic correlations amongst body weights at different test days were high, ranging from 0.89 to 1.00. Heritability estimates were comparable to literature using multivariate models. Therefore, random regression model could be applied in the genetic evaluation of body weight of individually fed beef bulls in South Africa.

  17. The prediction of food additives in the fruit juice based on electronic nose with chemometrics.

    PubMed

    Qiu, Shanshan; Wang, Jun

    2017-09-01

    Food additives are added to products to enhance their taste, and preserve flavor or appearance. While their use should be restricted to achieve a technological benefit, the contents of food additives should be also strictly controlled. In this study, E-nose was applied as an alternative to traditional monitoring technologies for determining two food additives, namely benzoic acid and chitosan. For quantitative monitoring, support vector machine (SVM), random forest (RF), extreme learning machine (ELM) and partial least squares regression (PLSR) were applied to establish regression models between E-nose signals and the amount of food additives in fruit juices. The monitoring models based on ELM and RF reached higher correlation coefficients (R 2 s) and lower root mean square errors (RMSEs) than models based on PLSR and SVM. This work indicates that E-nose combined with RF or ELM can be a cost-effective, easy-to-build and rapid detection system for food additive monitoring. Copyright © 2017 Elsevier Ltd. All rights reserved.

  18. Robust Image Regression Based on the Extended Matrix Variate Power Exponential Distribution of Dependent Noise.

    PubMed

    Luo, Lei; Yang, Jian; Qian, Jianjun; Tai, Ying; Lu, Gui-Fu

    2017-09-01

    Dealing with partial occlusion or illumination is one of the most challenging problems in image representation and classification. In this problem, the characterization of the representation error plays a crucial role. In most current approaches, the error matrix needs to be stretched into a vector and each element is assumed to be independently corrupted. This ignores the dependence between the elements of error. In this paper, it is assumed that the error image caused by partial occlusion or illumination changes is a random matrix variate and follows the extended matrix variate power exponential distribution. This has the heavy tailed regions and can be used to describe a matrix pattern of l×m dimensional observations that are not independent. This paper reveals the essence of the proposed distribution: it actually alleviates the correlations between pixels in an error matrix E and makes E approximately Gaussian. On the basis of this distribution, we derive a Schatten p -norm-based matrix regression model with L q regularization. Alternating direction method of multipliers is applied to solve this model. To get a closed-form solution in each step of the algorithm, two singular value function thresholding operators are introduced. In addition, the extended Schatten p -norm is utilized to characterize the distance between the test samples and classes in the design of the classifier. Extensive experimental results for image reconstruction and classification with structural noise demonstrate that the proposed algorithm works much more robustly than some existing regression-based methods.

  19. Foot Type Biomechanics Part 2: are structure and anthropometrics related to function?

    PubMed

    Mootanah, Rajshree; Song, Jinsup; Lenhoff, Mark W; Hafer, Jocelyn F; Backus, Sherry I; Gagnon, David; Deland, Jonathan T; Hillstrom, Howard J

    2013-03-01

    Many foot pathologies are associated with specific foot types. If foot structure and function are related, measurement of either could assist with differential diagnosis of pedal pathologies. Biomechanical measures of foot structure and function are related in asymptomatic healthy individuals. Sixty-one healthy subjects' left feet were stratified into cavus (n=12), rectus (n=27) and planus (n=22) foot types. Foot structure was assessed by malleolar valgus index, arch height index, and arch height flexibility. Anthropometrics (height and weight), age, and walking speed were measured. Foot function was assessed by center of pressure excursion index, peak plantar pressure, maximum force, and gait pattern parameters. Foot structure and anthropometric variables were entered into stepwise linear regression models to identify predictors of function. Measures of foot structure and anthropometrics explained 10-37% of the model variance (adjusted R(2)) for gait pattern parameters. When walking speed was included, the adjusted R(2) increased to 45-77% but foot structure was no longer a factor. Foot structure and anthropometrics predicted 7-47% of the model variance for plantar pressure and 16-64% for maximum force parameters. All multivariate models were significant (p<0.05), supporting acceptance of the hypothesis. Foot structure and function are related in asymptomatic healthy individuals. The structural parameters employed are basic measurements that do not require ionizing radiation and could be used in a clinical setting. Further research is needed to identify additional predictive parameters (plantar soft tissue characteristics, skeletal alignment, and neuromuscular control) and to include individuals with pathology. Copyright © 2012. Published by Elsevier B.V.

  20. Foot Type Biomechanics Part 2: Are structure and anthropometrics related to function?

    PubMed Central

    Mootanah, Rajshree; Song, Jinsup; Lenhoff, Mark W.; Hafer, Jocelyn F.; Backus, Sherry I.; Gagnon, David; Deland, Jonathan T.; Hillstrom, Howard J.

    2013-01-01

    Background Many foot pathologies are associated with specific foot types. If foot structure and function are related, measurement of either could assist with differential diagnosis of pedal pathologies. Hypothesis Biomechanical measures of foot structure and function are related in asymptomatic healthy individuals. Methods Sixty-one healthy subjects' left feet were stratified into cavus (n = 12), rectus (n = 27) and planus (n = 22) foot types. Foot structure was assessed by malleolar valgus index, arch height index, and arch height flexibility. Anthropometrics (height and weight), age, and walking speed were measured. Foot function was assessed by center of pressure excursion index, peak plantar pressure, maximum force, and gait pattern parameters. Foot structure and anthropometric variables were entered into stepwise linear regression models to identify predictors of function. Results Measures of foot structure and anthropometrics explained 10–37% of the model variance (adjusted R2) for gait pattern parameters. When walking speed was included, the adjusted R2 increased to 45–77% but foot structure was no longer a factor. Foot structure and anthropometrics predicted 7–47% of the model variance for plantar pressure and 16–64% for maximum force parameters. All multivariate models were significant (p < 0.05), supporting acceptance of the hypothesis. Discussion and conclusion Foot structure and function are related in asymptomatic healthy individuals. The structural parameters employed are basic measurements that do not require ionizing radiation and could be used in a clinical setting. Further research is needed to identify additional predictive parameters (plantar soft tissue characteristics, skeletal alignment, and neuromuscular control) and to include individuals with pathology. PMID:23107624

  1. Penalized nonparametric scalar-on-function regression via principal coordinates

    PubMed Central

    Reiss, Philip T.; Miller, David L.; Wu, Pei-Shien; Hua, Wen-Yu

    2016-01-01

    A number of classical approaches to nonparametric regression have recently been extended to the case of functional predictors. This paper introduces a new method of this type, which extends intermediate-rank penalized smoothing to scalar-on-function regression. In the proposed method, which we call principal coordinate ridge regression, one regresses the response on leading principal coordinates defined by a relevant distance among the functional predictors, while applying a ridge penalty. Our publicly available implementation, based on generalized additive modeling software, allows for fast optimal tuning parameter selection and for extensions to multiple functional predictors, exponential family-valued responses, and mixed-effects models. In an application to signature verification data, principal coordinate ridge regression, with dynamic time warping distance used to define the principal coordinates, is shown to outperform a functional generalized linear model. PMID:29217963

  2. Using Robust Variance Estimation to Combine Multiple Regression Estimates with Meta-Analysis

    ERIC Educational Resources Information Center

    Williams, Ryan

    2013-01-01

    The purpose of this study was to explore the use of robust variance estimation for combining commonly specified multiple regression models and for combining sample-dependent focal slope estimates from diversely specified models. The proposed estimator obviates traditionally required information about the covariance structure of the dependent…

  3. SEMIPARAMETRIC QUANTILE REGRESSION WITH HIGH-DIMENSIONAL COVARIATES

    PubMed Central

    Zhu, Liping; Huang, Mian; Li, Runze

    2012-01-01

    This paper is concerned with quantile regression for a semiparametric regression model, in which both the conditional mean and conditional variance function of the response given the covariates admit a single-index structure. This semiparametric regression model enables us to reduce the dimension of the covariates and simultaneously retains the flexibility of nonparametric regression. Under mild conditions, we show that the simple linear quantile regression offers a consistent estimate of the index parameter vector. This is a surprising and interesting result because the single-index model is possibly misspecified under the linear quantile regression. With a root-n consistent estimate of the index vector, one may employ a local polynomial regression technique to estimate the conditional quantile function. This procedure is computationally efficient, which is very appealing in high-dimensional data analysis. We show that the resulting estimator of the quantile function performs asymptotically as efficiently as if the true value of the index vector were known. The methodologies are demonstrated through comprehensive simulation studies and an application to a real dataset. PMID:24501536

  4. [Responses of plant community structure and species composition to warming and N addition in an alpine meadow, northern Tibetan Plateau, China].

    PubMed

    Zong, Ning; Chai, Xi; Shi, Pei Li; Jiang, Jing; Niu, Ben; Zhang, Xian Zhou; He, Yong Tao

    2016-12-01

    Global climate warming and increasing nitrogen (N) deposition, as controversial global environmental issues, may distinctly affect the functions and processes of terrestrial ecosystems. It has been reported that the Qinghai-Tibet Plateau has been experiencing significant warming in recent decades, especially in winter. Previous studies have mainly focused on the effects of warming all the year round; however, few studies have tested the effects of winter warming. To investigate the effects of winter warming and N addition on plant community structure and species composition of alpine meadow, long-term N addition and simulated warming experiment was conducted in alpine meadow from 2010 in Damxung, northern Tibet. The experiment consisted of three warming patterns: Year-round warming (YW), winter warming (WW) and control (NW), crossed respectively with five N gradients: 0, 10, 20, 40, 80 kg N·hm -2 ·a -1 . From 2012 to 2014, both warming and N addition significantly affected the total coverage of plant community. Specifically, YW significantly decreased the total coverage of plant community. Without N addition, WW remarkably reduced the vegetation coverage. However, with N addition, the total vegetation coverage gradually increased with the increase of N level. Warming and N addition had different effects on plants from different functional groups. Warming significantly reduced the plant coverage of grasses and sedges, while N addition significantly enhanced the plant coverage of grasses. Regression analyses showed that the total coverage of plant community was positively related to soil water content in vigorous growth stages, indicating that the decrease in soil water content resulted from warming during dry seasons might be the main reason for the decline of total community coverage. As soil moisture in semi-arid alpine meadow is mainly regulated by rainfalls, our results indicated that changes in spatial and temporal patterns of rainfalls under the future climate change scenarios would dramatically influence the vegetation coverage and species composition. Additionally, the effects of increasing atmospheric N deposition on vegetation community might also depend on the change of rainfall patterns.

  5. Genetic analyses of protein yield in dairy cows applying random regression models with time-dependent and temperature x humidity-dependent covariates.

    PubMed

    Brügemann, K; Gernand, E; von Borstel, U U; König, S

    2011-08-01

    Data used in the present study included 1,095,980 first-lactation test-day records for protein yield of 154,880 Holstein cows housed on 196 large-scale dairy farms in Germany. Data were recorded between 2002 and 2009 and merged with meteorological data from public weather stations. The maximum distance between each farm and its corresponding weather station was 50 km. Hourly temperature-humidity indexes (THI) were calculated using the mean of hourly measurements of dry bulb temperature and relative humidity. On the phenotypic scale, an increase in THI was generally associated with a decrease in daily protein yield. For genetic analyses, a random regression model was applied using time-dependent (d in milk, DIM) and THI-dependent covariates. Additive genetic and permanent environmental effects were fitted with this random regression model and Legendre polynomials of order 3 for DIM and THI. In addition, the fixed curve was modeled with Legendre polynomials of order 3. Heterogeneous residuals were fitted by dividing DIM into 5 classes, and by dividing THI into 4 classes, resulting in 20 different classes. Additive genetic variances for daily protein yield decreased with increasing degrees of heat stress and were lowest at the beginning of lactation and at extreme THI. Due to higher additive genetic variances, slightly higher permanent environment variances, and similar residual variances, heritabilities were highest for low THI in combination with DIM at the end of lactation. Genetic correlations among individual values for THI were generally >0.90. These trends from the complex random regression model were verified by applying relatively simple bivariate animal models for protein yield measured in 2 THI environments; that is, defining a THI value of 60 as a threshold. These high correlations indicate the absence of any substantial genotype × environment interaction for protein yield. However, heritabilities and additive genetic variances from the random regression model tended to be slightly higher in the THI range corresponding to cows' comfort zone. Selecting such superior environments for progeny testing can contribute to an accurate genetic differentiation among selection candidates. Copyright © 2011 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.

  6. Long-term nitrogen addition leads to loss of species richness due to litter accumulation and soil acidification in a temperate steppe.

    PubMed

    Fang, Ying; Xun, Fen; Bai, Wenming; Zhang, Wenhao; Li, Linghao

    2012-01-01

    Although community structure and species richness are known to respond to nitrogen fertilization dramatically, little is known about the mechanisms underlying specific species replacement and richness loss. In an experiment in semiarid temperate steppe of China, manipulative N addition with five treatments was conducted to evaluate the effect of N addition on the community structure and species richness. Species richness and biomass of community in each plot were investigated in a randomly selected quadrat. Root element, available and total phosphorus (AP, TP) in rhizospheric soil, and soil moisture, pH, AP, TP and inorganic N in the soil were measured. The relationship between species richness and the measured factors was analyzed using bivariate correlations and stepwise multiple linear regressions. The two dominant species, a shrub Artemisia frigida and a grass Stipa krylovii, responded differently to N addition such that the former was gradually replaced by the latter. S. krylovii and A. frigida had highly-branched fibrous and un-branched tap root systems, respectively. S. krylovii had higher height than A. frigida in both control and N added plots. These differences may contribute to the observed species replacement. In addition, the analysis on root element and AP contents in rhizospheric soil suggests that different calcium acquisition strategies, and phosphorus and sodium responses of the two species may account for the replacement. Species richness was significantly reduced along the five N addition levels. Our results revealed a significant relationship between species richness and soil pH, litter amount, soil moisture, AP concentration and inorganic N concentration. Our results indicate that litter accumulation and soil acidification accounted for 52.3% and 43.3% of the variation in species richness, respectively. These findings would advance our knowledge on the changes in species richness in semiarid temperate steppe of northern China under N deposition scenario.

  7. Female homicide in Rio Grande do Sul, Brazil.

    PubMed

    Leites, Gabriela Tomedi; Meneghel, Stela Nazareth; Hirakata, Vania Noemi

    2014-01-01

    This study aimed to assess the female homicide rate due to aggression in Rio Grande do Sul, Brazil, using this as a "proxy" of femicide. This was an ecological study which correlated the female homicide rate due to aggression in Rio Grande do Sul, according to the 35 microregions defined by the Brazilian Institute of Geography and Statistics (IBGE), with socioeconomic and demographic variables access and health indicators. Pearson's correlation test was performed with the selected variables. After this, multiple linear regressions were performed with variables with p < 0.20. The standardized average of female homicide rate due to aggression in the period from 2003 to 2007 was 3.1 obits per 100 thousand. After multiple regression analysis, the final model included male mortality due to aggression (p = 0.016), the percentage of hospital admissions for alcohol (p = 0.005) and the proportion of ill-defined deaths (p = 0.015). The model have an explanatory power of 39% (adjusted r2 = 0.391). The results are consistent with other studies and indicate a strong relationship between structural violence in society and violence against women, in addition to a higher incidence of female deaths in places with high alcohol hospitalization.

  8. A flexible count data regression model for risk analysis.

    PubMed

    Guikema, Seth D; Coffelt, Jeremy P; Goffelt, Jeremy P

    2008-02-01

    In many cases, risk and reliability analyses involve estimating the probabilities of discrete events such as hardware failures and occurrences of disease or death. There is often additional information in the form of explanatory variables that can be used to help estimate the likelihood of different numbers of events in the future through the use of an appropriate regression model, such as a generalized linear model. However, existing generalized linear models (GLM) are limited in their ability to handle the types of variance structures often encountered in using count data in risk and reliability analysis. In particular, standard models cannot handle both underdispersed data (variance less than the mean) and overdispersed data (variance greater than the mean) in a single coherent modeling framework. This article presents a new GLM based on a reformulation of the Conway-Maxwell Poisson (COM) distribution that is useful for both underdispersed and overdispersed count data and demonstrates this model by applying it to the assessment of electric power system reliability. The results show that the proposed COM GLM can provide as good of fits to data as the commonly used existing models for overdispered data sets while outperforming these commonly used models for underdispersed data sets.

  9. Using Multiple and Logistic Regression to Estimate the Median WillCost and Probability of Cost and Schedule Overrun for Program Managers

    DTIC Science & Technology

    2017-03-23

    PUBLIC RELEASE; DISTRIBUTION UNLIMITED Using Multiple and Logistic Regression to Estimate the Median Will- Cost and Probability of Cost and... Cost and Probability of Cost and Schedule Overrun for Program Managers Ryan C. Trudelle Follow this and additional works at: https://scholar.afit.edu...afit.edu. Recommended Citation Trudelle, Ryan C., "Using Multiple and Logistic Regression to Estimate the Median Will- Cost and Probability of Cost and

  10. Effects of Diverse Forms of Family Structure on Female and Male Homicide

    ERIC Educational Resources Information Center

    Schwartz, Jennifer

    2006-01-01

    Utilizing 2000 data on 1,618 counties and seemingly unrelated regression, I assess whether family structure effects on homicide vary across family structure measures and gender. There is evidence of robust, multidimensional family structure effects across constructs reflecting the presence of two-parent families: mother/father absence, shortages…

  11. Cancer Research Participation Beliefs and Behaviors of a Southern Black Population: A Quantitative Analysis of the Role of Structural Factors in Cancer Research Participation.

    PubMed

    Farr, Deeonna E; Brandt, Heather M; Comer, Kimberly D; Jackson, Dawnyéa D; Pandya, Kinjal; Friedman, Daniela B; Ureda, John R; Williams, Deloris G; Scott, Dolores B; Green, Wanda; Hébert, James R

    2015-09-01

    Increasing the participation of Blacks in cancer research is a vital component of a strategy to reduce racial inequities in cancer burden. Community-based participatory research (CBPR) is especially well-suited to advancing our knowledge of factors that influence research participation to ultimately address cancer-related health inequities. A paucity of literature focuses on the role of structural factors limiting participation in cancer research. As part of a larger CBPR project, we used survey data from a statewide cancer needs assessment of a Black faith community to examine the influence of structural factors on attitudes toward research and the contributions of both structural and attitudinal factors on whether individuals participate in research. Regression analyses and non-parametric statistics were conducted on data from 727 adult survey respondents. Structural factors, such as having health insurance coverage, experiencing discrimination during health care encounters, and locale, predicted belief in the benefits, but not the risks, of research participation. Positive attitudes toward research predicted intention to participate in cancer research. Significant differences in structural and attitudinal factors were found between cancer research participants and non-participants; however, directionality is confounded by the cross-sectional survey design and causality cannot be determined. This study points to complex interplay of structural and attitudinal factors on research participation as well as need for additional quantitative examinations of the various types of factors that influence research participation in Black communities.

  12. Regression equations for calculation of z scores for echocardiographic measurements of right heart structures in healthy Han Chinese children.

    PubMed

    Wang, Shan-Shan; Zhang, Yu-Qi; Chen, Shu-Bao; Huang, Guo-Ying; Zhang, Hong-Yan; Zhang, Zhi-Fang; Wu, Lan-Ping; Hong, Wen-Jing; Shen, Rong; Liu, Yi-Qing; Zhu, Jun-Xue

    2017-06-01

    Clinical decision making in children with congenital and acquired heart disease relies on measurements of cardiac structures using two-dimensional echocardiography. We aimed to establish z-score regression equations for right heart structures in healthy Chinese Han children. Two-dimensional and M-mode echocardiography was performed in 515 patients. We measured the dimensions of the pulmonary valve annulus (PVA), main pulmonary artery (MPA), left pulmonary artery (LPA), right pulmonary artery (RPA), right ventricular outflow tract at end-diastole (RVOTd) and at end-systole (RVOTs), tricuspid valve annulus (TVA), right ventricular inflow tract at end-diastole (RVIDd) and at end-systole (RVIDs), and right atrium (RA). Regression analyses were conducted to relate the measurements of right heart structures to 4body surface area (BSA). Right ventricular outflow-tract fractional shortening (RVOTFS) was also calculated. Several models were used, and the best model was chosen to establish a z-score calculator. PVA, MPA, LPA, RPA, RVOTd, RVOTs, TVA, RVIDd, RVIDs, and RA (R 2  = 0.786, 0.705, 0.728, 0.701, 0.706, 0.824, 0.804, 0.663, 0.626, and 0.793, respectively) had a cubic polynomial relationship with BSA; specifically, measurement (M) = β0 + β1 × BSA + β2 × BSA 2  + β3 × BSA. 3 RVOTFS (0.28 ± 0.02) fell within a narrow range (0.12-0.51). Our results provide reference values for z scores and regression equations for right heart structures in Han Chinese children. These data may help interpreting the routine clinical measurement of right heart structures in children with congenital or acquired heart disease. © 2016 Wiley Periodicals, Inc. J Clin Ultrasound 45:293-303, 2017. © 2017 Wiley Periodicals, Inc.

  13. Iterative Purification and Effect Size Use with Logistic Regression for Differential Item Functioning Detection

    ERIC Educational Resources Information Center

    French, Brian F.; Maller, Susan J.

    2007-01-01

    Two unresolved implementation issues with logistic regression (LR) for differential item functioning (DIF) detection include ability purification and effect size use. Purification is suggested to control inaccuracies in DIF detection as a result of DIF items in the ability estimate. Additionally, effect size use may be beneficial in controlling…

  14. Minimizing bias in biomass allometry: Model selection and log transformation of data

    Treesearch

    Joseph Mascaro; undefined undefined; Flint Hughes; Amanda Uowolo; Stefan A. Schnitzer

    2011-01-01

    Nonlinear regression is increasingly used to develop allometric equations for forest biomass estimation (i.e., as opposed to the raditional approach of log-transformation followed by linear regression). Most statistical software packages, however, assume additive errors by default, violating a key assumption of allometric theory and possibly producing spurious models....

  15. Network structure and travel time perception.

    PubMed

    Parthasarathi, Pavithra; Levinson, David; Hochmair, Hartwig

    2013-01-01

    The purpose of this research is to test the systematic variation in the perception of travel time among travelers and relate the variation to the underlying street network structure. Travel survey data from the Twin Cities metropolitan area (which includes the cities of Minneapolis and St. Paul) is used for the analysis. Travelers are classified into two groups based on the ratio of perceived and estimated commute travel time. The measures of network structure are estimated using the street network along the identified commute route. T-test comparisons are conducted to identify statistically significant differences in estimated network measures between the two traveler groups. The combined effect of these estimated network measures on travel time is then analyzed using regression models. The results from the t-test and regression analyses confirm the influence of the underlying network structure on the perception of travel time.

  16. A novel simple QSAR model for the prediction of anti-HIV activity using multiple linear regression analysis.

    PubMed

    Afantitis, Antreas; Melagraki, Georgia; Sarimveis, Haralambos; Koutentis, Panayiotis A; Markopoulos, John; Igglessi-Markopoulou, Olga

    2006-08-01

    A quantitative-structure activity relationship was obtained by applying Multiple Linear Regression Analysis to a series of 80 1-[2-hydroxyethoxy-methyl]-6-(phenylthio) thymine (HEPT) derivatives with significant anti-HIV activity. For the selection of the best among 37 different descriptors, the Elimination Selection Stepwise Regression Method (ES-SWR) was utilized. The resulting QSAR model (R (2) (CV) = 0.8160; S (PRESS) = 0.5680) proved to be very accurate both in training and predictive stages.

  17. Chronological age and its impact on associative learning proficiency and brain structure in middle adulthood.

    PubMed

    Diwadkar, Vaibhav A; Bellani, Marcella; Ahmed, Rizwan; Dusi, Nicola; Rambaldelli, Gianluca; Perlini, Cinzia; Marinelli, Veronica; Ramaseshan, Karthik; Ruggeri, Mirella; Bambilla, Paolo

    2016-01-15

    The rate of biological change in middle-adulthood is relatively under-studied. Here, we used behavioral testing in conjunction with structural magnetic resonance imaging to examine the effects of chronological age on associative learning proficiency and on brain regions that previous functional MRI studies have closely related to the domain of associative learning. Participants (n=66) completed a previously established associative learning paradigm, and consented to be scanned using structural magnetic resonance imaging. Age-related effects were investigated both across sub-groups in the sample (younger vs. older) and across the entire sample (using regression approaches). Chronological age had substantial effects on learning proficiency (independent of IQ and Education Level), with older adults showing a decrement compared to younger adults. In addition, decreases in estimated gray matter volume were observed in multiple brain regions including the hippocampus and the dorsal prefrontal cortex, both of which are strongly implicated in associative learning. The results suggest that middle adulthood may be a more dynamic period of life-span change than previously believed. The conjunctive application of narrowly focused tasks, with conjointly acquired structural MRI data may allow us to enrich the search for, and the interpretation of, age-related changes in cross-sectional samples. Copyright © 2015 Elsevier B.V. All rights reserved.

  18. Quantitative structure-retention relationship studies for taxanes including epimers and isomeric metabolites in ultra fast liquid chromatography.

    PubMed

    Dong, Pei-Pei; Ge, Guang-Bo; Zhang, Yan-Yan; Ai, Chun-Zhi; Li, Guo-Hui; Zhu, Liang-Liang; Luan, Hong-Wei; Liu, Xing-Bao; Yang, Ling

    2009-10-16

    Seven pairs of epimers and one pair of isomeric metabolites of taxanes, each pair of which have similar structures but different retention behaviors, together with additional 13 taxanes with different substitutions were chosen to investigate the quantitative structure-retention relationship (QSRR) of taxanes in ultra fast liquid chromatography (UFLC). Monte Carlo variable selection (MCVS) method was adopted to choose descriptors. The selected four descriptors were used to build QSRR model with multi-linear regression (MLR) and artificial neural network (ANN) modeling techniques. Both linear and nonlinear models show good predictive ability, of which ANN model was better with the determination coefficient R(2) for training, validation and test set being 0.9892, 0.9747 and 0.9840, respectively. The results of 100 times' leave-12-out cross validation showed the robustness of this model. All the isomers can be correctly differentiated by this model. According to the selected descriptors, the three dimensional structural information was critical for recognition of epimers. Hydrophobic interaction was the uppermost factor for retention in UFLC. Molecules' polarizability and polarity properties were also closely correlated with retention behaviors. This QSRR model will be useful for separation and identification of taxanes including epimers and metabolites from botanical or biological samples.

  19. A hybrid optimization algorithm to explore atomic configurations of TiO 2 nanoparticles

    DOE PAGES

    Inclan, Eric J.; Geohegan, David B.; Yoon, Mina

    2017-10-17

    Here in this paper we present a hybrid algorithm comprised of differential evolution, coupled with the Broyden–Fletcher–Goldfarb–Shanno quasi-Newton optimization algorithm, for the purpose of identifying a broad range of (meta)stable Ti nO 2n nanoparticles, as an example system, described by Buckingham interatomic potential. The potential and its gradient are modified to be piece-wise continuous to enable use of these continuous-domain, unconstrained algorithms, thereby improving compatibility. To measure computational effectiveness a regression on known structures is used. This approach defines effectiveness as the ability of an algorithm to produce a set of structures whose energy distribution follows the regression as themore » number of Ti nO 2n increases such that the shape of the distribution is consistent with the algorithm’s stated goals. Our calculation demonstrates that the hybrid algorithm finds global minimum configurations more effectively than the differential evolution algorithms, widely employed in the field of materials science. Specifically, the hybrid algorithm is shown to reproduce the global minimum energy structures reported in the literature up to n = 5, and retains good agreement with the regression up to n = 25. For 25 < n < 100, where literature structures are unavailable, the hybrid effectively obtains structures that are in lower energies per TiO 2 unit as the system size increases.« less

  20. Criterion for evaluating the predictive ability of nonlinear regression models without cross-validation.

    PubMed

    Kaneko, Hiromasa; Funatsu, Kimito

    2013-09-23

    We propose predictive performance criteria for nonlinear regression models without cross-validation. The proposed criteria are the determination coefficient and the root-mean-square error for the midpoints between k-nearest-neighbor data points. These criteria can be used to evaluate predictive ability after the regression models are updated, whereas cross-validation cannot be performed in such a situation. The proposed method is effective and helpful in handling big data when cross-validation cannot be applied. By analyzing data from numerical simulations and quantitative structural relationships, we confirm that the proposed criteria enable the predictive ability of the nonlinear regression models to be appropriately quantified.

  1. Modelling the standing timber volume of Baden-Württemberg-A large-scale approach using a fusion of Landsat, airborne LiDAR and National Forest Inventory data

    NASA Astrophysics Data System (ADS)

    Maack, Joachim; Lingenfelder, Marcus; Weinacker, Holger; Koch, Barbara

    2016-07-01

    Remote sensing-based timber volume estimation is key for modelling the regional potential, accessibility and price of lignocellulosic raw material for an emerging bioeconomy. We used a unique wall-to-wall airborne LiDAR dataset and Landsat 7 satellite images in combination with terrestrial inventory data derived from the National Forest Inventory (NFI), and applied generalized additive models (GAM) to estimate spatially explicit timber distribution and volume in forested areas. Since the NFI data showed an underlying structure regarding size and ownership, we additionally constructed a socio-economic predictor to enhance the accuracy of the analysis. Furthermore, we balanced the training dataset with a bootstrap method to achieve unbiased regression weights for interpolating timber volume. Finally, we compared and discussed the model performance of the original approach (r2 = 0.56, NRMSE = 9.65%), the approach with balanced training data (r2 = 0.69, NRMSE = 12.43%) and the final approach with balanced training data and the additional socio-economic predictor (r2 = 0.72, NRMSE = 12.17%). The results demonstrate the usefulness of remote sensing techniques for mapping timber volume for a future lignocellulose-based bioeconomy.

  2. Influence of Family Structure on Health among Youths with Diabetes.

    ERIC Educational Resources Information Center

    Thompson, Sanna J.; Auslander, Wendy F.; White, Neil H.

    2001-01-01

    Discusses the extent to which family structure is significantly associated with health in youth with Type 1 diabetes. Multiple regression analyses demonstrated that family structure remains a significant predictor of youth's health when statistically controlling for race, child's age, family socioeconomic status, and adherence. (BF)

  3. Viability estimation of pepper seeds using time-resolved photothermal signal characterization

    NASA Astrophysics Data System (ADS)

    Kim, Ghiseok; Kim, Geon-Hee; Lohumi, Santosh; Kang, Jum-Soon; Cho, Byoung-Kwan

    2014-11-01

    We used infrared thermal signal measurement system and photothermal signal and image reconstruction techniques for viability estimation of pepper seeds. Photothermal signals from healthy and aged seeds were measured for seven periods (24, 48, 72, 96, 120, 144, and 168 h) using an infrared camera and analyzed by a regression method. The photothermal signals were regressed using a two-term exponential decay curve with two amplitudes and two time variables (lifetime) as regression coefficients. The regression coefficients of the fitted curve showed significant differences for each seed groups, depending on the aging times. In addition, the viability of a single seed was estimated by imaging of its regression coefficient, which was reconstructed from the measured photothermal signals. The time-resolved photothermal characteristics, along with the regression coefficient images, can be used to discriminate the aged or dead pepper seeds from the healthy seeds.

  4. Mechanisms of Optical Regression Following Corneal Laser Refractive Surgery: Epithelial and Stromal Responses

    PubMed Central

    MOSHIRFAR, Majid; DESAUTELS, Jordan D.; WALKER, Brian D.; MURRI, Michael S.; BIRDSONG, Orry C.; HOOPES, Phillip C. Sr

    2018-01-01

    Laser vision correction is a safe and effective method of reducing spectacle dependence. Photorefractive Keratectomy (PRK), Laser In Situ Keratomileusis (LASIK), and Small-Incision Lenticule Extraction (SMILE) can accurately correct myopia, hyperopia, and astigmatism. Although these procedures are nearing optimization in terms of their ability to produce a desired refractive target, the long term cellular responses of the cornea to these procedures can cause patients to regress from the their ideal postoperative refraction. In many cases, refractive regression requires follow up enhancement surgeries, presenting additional risks to patients. Although some risk factors underlying refractive regression have been identified, the exact mechanisms have not been elucidated. It is clear that cellular proliferation events are important mediators of optical regression. This review focused specifically on cellular changes to the corneal epithelium and stroma, which may influence postoperative visual regression following LASIK, PRK, and SMILE procedures. PMID:29644238

  5. A Comparison of Methods for Estimating Quadratic Effects in Nonlinear Structural Equation Models

    ERIC Educational Resources Information Center

    Harring, Jeffrey R.; Weiss, Brandi A.; Hsu, Jui-Chen

    2012-01-01

    Two Monte Carlo simulations were performed to compare methods for estimating and testing hypotheses of quadratic effects in latent variable regression models. The methods considered in the current study were (a) a 2-stage moderated regression approach using latent variable scores, (b) an unconstrained product indicator approach, (c) a latent…

  6. Aeroelastic Model Structure Computation for Envelope Expansion

    NASA Technical Reports Server (NTRS)

    Kukreja, Sunil L.

    2007-01-01

    Structure detection is a procedure for selecting a subset of candidate terms, from a full model description, that best describes the observed output. This is a necessary procedure to compute an efficient system description which may afford greater insight into the functionality of the system or a simpler controller design. Structure computation as a tool for black-box modeling may be of critical importance in the development of robust, parsimonious models for the flight-test community. Moreover, this approach may lead to efficient strategies for rapid envelope expansion that may save significant development time and costs. In this study, a least absolute shrinkage and selection operator (LASSO) technique is investigated for computing efficient model descriptions of non-linear aeroelastic systems. The LASSO minimises the residual sum of squares with the addition of an l(Sub 1) penalty term on the parameter vector of the traditional l(sub 2) minimisation problem. Its use for structure detection is a natural extension of this constrained minimisation approach to pseudo-linear regression problems which produces some model parameters that are exactly zero and, therefore, yields a parsimonious system description. Applicability of this technique for model structure computation for the F/A-18 (McDonnell Douglas, now The Boeing Company, Chicago, Illinois) Active Aeroelastic Wing project using flight test data is shown for several flight conditions (Mach numbers) by identifying a parsimonious system description with a high percent fit for cross-validated data.

  7. Multivariate generalized hidden Markov regression models with random covariates: Physical exercise in an elderly population.

    PubMed

    Punzo, Antonio; Ingrassia, Salvatore; Maruotti, Antonello

    2018-04-22

    A time-varying latent variable model is proposed to jointly analyze multivariate mixed-support longitudinal data. The proposal can be viewed as an extension of hidden Markov regression models with fixed covariates (HMRMFCs), which is the state of the art for modelling longitudinal data, with a special focus on the underlying clustering structure. HMRMFCs are inadequate for applications in which a clustering structure can be identified in the distribution of the covariates, as the clustering is independent from the covariates distribution. Here, hidden Markov regression models with random covariates are introduced by explicitly specifying state-specific distributions for the covariates, with the aim of improving the recovering of the clusters in the data with respect to a fixed covariates paradigm. The hidden Markov regression models with random covariates class is defined focusing on the exponential family, in a generalized linear model framework. Model identifiability conditions are sketched, an expectation-maximization algorithm is outlined for parameter estimation, and various implementation and operational issues are discussed. Properties of the estimators of the regression coefficients, as well as of the hidden path parameters, are evaluated through simulation experiments and compared with those of HMRMFCs. The method is applied to physical activity data. Copyright © 2018 John Wiley & Sons, Ltd.

  8. Multi-Target Regression via Robust Low-Rank Learning.

    PubMed

    Zhen, Xiantong; Yu, Mengyang; He, Xiaofei; Li, Shuo

    2018-02-01

    Multi-target regression has recently regained great popularity due to its capability of simultaneously learning multiple relevant regression tasks and its wide applications in data mining, computer vision and medical image analysis, while great challenges arise from jointly handling inter-target correlations and input-output relationships. In this paper, we propose Multi-layer Multi-target Regression (MMR) which enables simultaneously modeling intrinsic inter-target correlations and nonlinear input-output relationships in a general framework via robust low-rank learning. Specifically, the MMR can explicitly encode inter-target correlations in a structure matrix by matrix elastic nets (MEN); the MMR can work in conjunction with the kernel trick to effectively disentangle highly complex nonlinear input-output relationships; the MMR can be efficiently solved by a new alternating optimization algorithm with guaranteed convergence. The MMR leverages the strength of kernel methods for nonlinear feature learning and the structural advantage of multi-layer learning architectures for inter-target correlation modeling. More importantly, it offers a new multi-layer learning paradigm for multi-target regression which is endowed with high generality, flexibility and expressive ability. Extensive experimental evaluation on 18 diverse real-world datasets demonstrates that our MMR can achieve consistently high performance and outperforms representative state-of-the-art algorithms, which shows its great effectiveness and generality for multivariate prediction.

  9. Estimating parasitic sea lamprey abundance in Lake Huron from heterogenous data sources

    USGS Publications Warehouse

    Young, Robert J.; Jones, Michael L.; Bence, James R.; McDonald, Rodney B.; Mullett, Katherine M.; Bergstedt, Roger A.

    2003-01-01

    The Great Lakes Fishery Commission uses time series of transformer, parasitic, and spawning population estimates to evaluate the effectiveness of its sea lamprey (Petromyzon marinus) control program. This study used an inverse variance weighting method to integrate Lake Huron sea lamprey population estimates derived from two estimation procedures: 1) prediction of the lake-wide spawning population from a regression model based on stream size and, 2) whole-lake mark and recapture estimates. In addition, we used a re-sampling procedure to evaluate the effect of trading off sampling effort between the regression and mark-recapture models. Population estimates derived from the regression model ranged from 132,000 to 377,000 while mark-recapture estimates of marked recently metamorphosed juveniles and parasitic sea lampreys ranged from 536,000 to 634,000 and 484,000 to 1,608,000, respectively. The precision of the estimates varied greatly among estimation procedures and years. The integrated estimate of the mark-recapture and spawner regression procedures ranged from 252,000 to 702,000 transformers. The re-sampling procedure indicated that the regression model is more sensitive to reduction in sampling effort than the mark-recapture model. Reliance on either the regression or mark-recapture model alone could produce misleading estimates of abundance of sea lampreys and the effect of the control program on sea lamprey abundance. These analyses indicate that the precision of the lakewide population estimate can be maximized by re-allocating sampling effort from marking sea lampreys to trapping additional streams.

  10. Ethanolic extract of Artemisia aucheri induces regression of aorta wall fatty streaks in hypercholesterolemic rabbits.

    PubMed

    Asgary, S; Dinani, N Jafari; Madani, H; Mahzouni, P

    2008-05-01

    Artemisia aucheri is a native-growing plant which is widely used in Iranian traditional medicine. This study was designed to evaluate the effects of A. aucheri on regression of atherosclerosis in hypercholesterolemic rabbits. Twenty five rabbits were randomly divided into five groups of five each and treated 3-months as follows: 1: normal diet, 2: hypercholesterolemic diet (HCD), 3 and 4: HCD for 60 days and then normal diet and normal diet + A. aucheri (100 mg x kg(-1) x day(-1)) respectively for an additional 30 days (regression period). In the regression period dietary use of A. aucheri in group 4 significantly decreased total cholesterol, triglyceride and LDL-cholesterol, while HDL-cholesterol was significantly increased. The atherosclerotic area was significantly decreased in this group. Animals, which received only normal diet in the regression period showed no regression but rather progression of atherosclerosis. These findings suggest that A. aucheri may cause regression of atherosclerotic lesions.

  11. Real estate value prediction using multivariate regression models

    NASA Astrophysics Data System (ADS)

    Manjula, R.; Jain, Shubham; Srivastava, Sharad; Rajiv Kher, Pranav

    2017-11-01

    The real estate market is one of the most competitive in terms of pricing and the same tends to vary significantly based on a lot of factors, hence it becomes one of the prime fields to apply the concepts of machine learning to optimize and predict the prices with high accuracy. Therefore in this paper, we present various important features to use while predicting housing prices with good accuracy. We have described regression models, using various features to have lower Residual Sum of Squares error. While using features in a regression model some feature engineering is required for better prediction. Often a set of features (multiple regressions) or polynomial regression (applying a various set of powers in the features) is used for making better model fit. For these models are expected to be susceptible towards over fitting ridge regression is used to reduce it. This paper thus directs to the best application of regression models in addition to other techniques to optimize the result.

  12. Sibship size, birth order, family structure and childhood mental disorders.

    PubMed

    Carballo, Juan J; García-Nieto, Rebeca; Alvarez-García, Raquel; Caro-Cañizares, Irene; López-Castromán, Jorge; Muñoz-Lorenzo, Laura; de Leon-Martinez, Victoria; Baca-García, Enrique

    2013-08-01

    The aim of this study was to determine the role that birth order, sibship size and family structure have as risk factors in the development of common childhood mental disorders. A case-control study design was conducted (N = 16,823). The group under study consisted of all those subjects who had consulted with a psychiatrist/psychologist and had received a clinical diagnosis at public mental health centres within the Region of Madrid (Spain), between 1980 and 2008. A multiple logistic regression was used to explore the independent association with each diagnosis: emotional disorders (ED) with onset specific to childhood, attention deficit hyperactivity disorder (ADHD), conduct disorder (CD), mental retardation (MR), and pervasive developmental disorder (PDD). Birth order and family structure significantly predicted the risk of being diagnosed with ED or ADHD. In addition, sibship size and sex predicted the risk of being diagnosed with a childhood mental disorder. We concluded that being the middle child and living with both biological parents appear to be protective factors against the development of ED or ADHD. Living in large families appears to increase the risk of receiving a CD, MR, or PDD diagnosis. Further research is warranted.

  13. Grandparenting and adolescent adjustment in two-parent biological, lone-parent, and step-families.

    PubMed

    Attar-Schwartz, Shalhevet; Tan, Jo-Pei; Buchanan, Ann; Flouri, Eirini; Griggs, Julia

    2009-02-01

    There is limited research on the links between grandparenting and adolescents' well-being, especially from the perspective of the adolescents. The study examined whether grandparent involvement varied in two-parent biological, lone-parent, and step-families and whether this had a different contribution to the emotional and behavioral adjustment of adolescents across different family structures. The study is based on a sample of 1,515 secondary school students (ages 11-16 years) from England and Wales who completed a structured questionnaire. Findings of hierarchical regression analyses showed that among the whole sample, greater grandparent involvement was associated with fewer emotional problems (p < .01) and with more prosocial behavior (p < .001). In addition, while there were no differences in the level of grandparent involvement across the different family structures, grandparent involvement was more strongly associated with reduced adjustment difficulties among adolescents from lone-parent and step-families than those from two-parent biological families. A possible implication is that the positive role of grandparent involvement in lone-parent and step- families should be more emphasized in family psychology. (PsycINFO Database Record (c) 2009 APA, all rights reserved).

  14. Quantitative structure-activity relationship of the curcumin-related compounds using various regression methods

    NASA Astrophysics Data System (ADS)

    Khazaei, Ardeshir; Sarmasti, Negin; Seyf, Jaber Yousefi

    2016-03-01

    Quantitative structure activity relationship were used to study a series of curcumin-related compounds with inhibitory effect on prostate cancer PC-3 cells, pancreas cancer Panc-1 cells, and colon cancer HT-29 cells. Sphere exclusion method was used to split data set in two categories of train and test set. Multiple linear regression, principal component regression and partial least squares were used as the regression methods. In other hand, to investigate the effect of feature selection methods, stepwise, Genetic algorithm, and simulated annealing were used. In two cases (PC-3 cells and Panc-1 cells), the best models were generated by a combination of multiple linear regression and stepwise (PC-3 cells: r2 = 0.86, q2 = 0.82, pred_r2 = 0.93, and r2m (test) = 0.43, Panc-1 cells: r2 = 0.85, q2 = 0.80, pred_r2 = 0.71, and r2m (test) = 0.68). For the HT-29 cells, principal component regression with stepwise (r2 = 0.69, q2 = 0.62, pred_r2 = 0.54, and r2m (test) = 0.41) is the best method. The QSAR study reveals descriptors which have crucial role in the inhibitory property of curcumin-like compounds. 6ChainCount, T_C_C_1, and T_O_O_7 are the most important descriptors that have the greatest effect. With a specific end goal to design and optimization of novel efficient curcumin-related compounds it is useful to introduce heteroatoms such as nitrogen, oxygen, and sulfur atoms in the chemical structure (reduce the contribution of T_C_C_1 descriptor) and increase the contribution of 6ChainCount and T_O_O_7 descriptors. Models can be useful in the better design of some novel curcumin-related compounds that can be used in the treatment of prostate, pancreas, and colon cancers.

  15. The effect of dual accreditation on family medicine residency programs.

    PubMed

    Mims, Lisa D; Bressler, Lindsey C; Wannamaker, Louise R; Carek, Peter J

    2015-04-01

    In 1985, the American Osteopathic Association (AOA) Board of Trustees agreed to allow residency programs to become dually accredited by the AOA and Accreditation Council for Graduate Medical Education (ACGME). Despite the increase in such programs, there has been minimal research comparing these programs to exclusively ACGME-accredited residencies. This study examines the association between dual accreditation and suggested markers of quality. Standard characteristics such as regional location, program structure (community or university based), postgraduate year one (PGY-1) positions offered, and salary (PGY-1) were obtained for each residency program. In addition, the faculty to resident ratio in the family medicine clinic and the number of half days residents spent in the clinic each week were recorded. Initial Match rates and pass rates of new graduates on the ABFM examination from 2009 to 2013 were also obtained. Variables were analyzed using chi-square and Student's t test. Logistic regression models were then created to predict a program's 5-year aggregate initial Match rate and Board pass rate in the top tertile as compared to the lowest tertile. Dual accreditation was obtained by 117 (27.0%) of programs. Initial analyses revealed associations between dually accredited programs and mean year of initial ACGME program accreditation, regional location, program structure, tracks, and alternative medicine curriculum. When evaluated in logistic regression, dual accreditation status was not associated with Match rates or ABFM pass rates. By examining suggested markers of program quality for dually accredited programs in comparison to ACGME-only accredited programs, this study successfully established both differences and similarities among the two types.

  16. Structural Hippocampal Anomalies in a Schizophrenia Population Correlate with Navigation Performance on a Wayfinding Task

    PubMed Central

    Ledoux, Andrée-Anne; Boyer, Patrice; Phillips, Jennifer L.; Labelle, Alain; Smith, Andra; Bohbot, Véronique D.

    2014-01-01

    Episodic memory, related to the hippocampus, has been found to be impaired in schizophrenia. Further, hippocampal anomalies have also been observed in schizophrenia. This study investigated whether average hippocampal gray matter (GM) would differentiate performance on a hippocampus-dependent memory task in patients with schizophrenia and healthy controls. Twenty-one patients with schizophrenia and 22 control participants were scanned with an MRI while being tested on a wayfinding task in a virtual town (e.g., find the grocery store from the school). Regressions were performed for both groups individually and together using GM and performance on the wayfinding task. Results indicate that controls successfully completed the task more often than patients, took less time, and made fewer errors. Additionally, controls had significantly more hippocampal GM than patients. Poor performance was associated with a GM decrease in the right hippocampus for both groups. Within group regressions found an association between right hippocampi GM and performance in controls and an association between the left hippocampi GM and performance in patients. A second analysis revealed that different anatomical GM regions, known to be associated with the hippocampus, such as the parahippocampal cortex, amygdala, medial, and orbital prefrontal cortices, covaried with the hippocampus in the control group. Interestingly, the cuneus and cingulate gyrus also covaried with the hippocampus in the patient group but the orbital frontal cortex did not, supporting the hypothesis of impaired connectivity between the hippocampus and the frontal cortex in schizophrenia. These results present important implications for creating intervention programs aimed at measuring functional and structural changes in the hippocampus in schizophrenia. PMID:24672451

  17. Correlating Reactivity and Selectivity to Cyclopentadienyl Ligand Properties in Rh(III)-Catalyzed C-H Activation Reactions: An Experimental and Computational Study.

    PubMed

    Piou, Tiffany; Romanov-Michailidis, Fedor; Romanova-Michaelides, Maria; Jackson, Kelvin E; Semakul, Natthawat; Taggart, Trevor D; Newell, Brian S; Rithner, Christopher D; Paton, Robert S; Rovis, Tomislav

    2017-01-25

    Cp X Rh(III)-catalyzed C-H functionalization reactions are a proven method for the efficient assembly of small molecules. However, rationalization of the effects of cyclopentadienyl (Cp X ) ligand structure on reaction rate and selectivity has been viewed as a black box, and a truly systematic study is lacking. Consequently, predicting the outcomes of these reactions is challenging because subtle variations in ligand structure can cause notable changes in reaction behavior. A predictive tool is, nonetheless, of considerable value to the community as it would greatly accelerate reaction development. Designing a data set in which the steric and electronic properties of the Cp X Rh(III) catalysts were systematically varied allowed us to apply multivariate linear regression algorithms to establish correlations between these catalyst-based descriptors and the regio-, diastereoselectivity, and rate of model reactions. This, in turn, led to the development of quantitative predictive models that describe catalyst performance. Our newly described cone angles and Sterimol parameters for Cp X ligands served as highly correlative steric descriptors in the regression models. Through rational design of training and validation sets, key diastereoselectivity outliers were identified. Computations reveal the origins of the outstanding stereoinduction displayed by these outliers. The results are consistent with partial η 5 -η 3 ligand slippage that occurs in the transition state of the selectivity-determining step. In addition to the instructive value of our study, we believe that the insights gained are transposable to other group 9 transition metals and pave the way toward rational design of C-H functionalization catalysts.

  18. Structural hippocampal anomalies in a schizophrenia population correlate with navigation performance on a wayfinding task.

    PubMed

    Ledoux, Andrée-Anne; Boyer, Patrice; Phillips, Jennifer L; Labelle, Alain; Smith, Andra; Bohbot, Véronique D

    2014-01-01

    Episodic memory, related to the hippocampus, has been found to be impaired in schizophrenia. Further, hippocampal anomalies have also been observed in schizophrenia. This study investigated whether average hippocampal gray matter (GM) would differentiate performance on a hippocampus-dependent memory task in patients with schizophrenia and healthy controls. Twenty-one patients with schizophrenia and 22 control participants were scanned with an MRI while being tested on a wayfinding task in a virtual town (e.g., find the grocery store from the school). Regressions were performed for both groups individually and together using GM and performance on the wayfinding task. Results indicate that controls successfully completed the task more often than patients, took less time, and made fewer errors. Additionally, controls had significantly more hippocampal GM than patients. Poor performance was associated with a GM decrease in the right hippocampus for both groups. Within group regressions found an association between right hippocampi GM and performance in controls and an association between the left hippocampi GM and performance in patients. A second analysis revealed that different anatomical GM regions, known to be associated with the hippocampus, such as the parahippocampal cortex, amygdala, medial, and orbital prefrontal cortices, covaried with the hippocampus in the control group. Interestingly, the cuneus and cingulate gyrus also covaried with the hippocampus in the patient group but the orbital frontal cortex did not, supporting the hypothesis of impaired connectivity between the hippocampus and the frontal cortex in schizophrenia. These results present important implications for creating intervention programs aimed at measuring functional and structural changes in the hippocampus in schizophrenia.

  19. The Relationship between Bureaucratic School Structures and Teacher Self-Efficacy

    ERIC Educational Resources Information Center

    Kilinç, Ali Çagatay; Kosar, Serkan; Er, Emre; Ögdem, Zeki

    2016-01-01

    The purpose of this study was to examine the relationship between bureaucratic school structures and teachers' self-efficacy. Participants included 252 teachers from 15 primary schools in Ankara, Turkey. Mean, standard deviation, correlation, and regression analyses were conducted. Results indicated that bureaucratic school structures and teacher…

  20. Tools to Support Interpreting Multiple Regression in the Face of Multicollinearity

    PubMed Central

    Kraha, Amanda; Turner, Heather; Nimon, Kim; Zientek, Linda Reichwein; Henson, Robin K.

    2012-01-01

    While multicollinearity may increase the difficulty of interpreting multiple regression (MR) results, it should not cause undue problems for the knowledgeable researcher. In the current paper, we argue that rather than using one technique to investigate regression results, researchers should consider multiple indices to understand the contributions that predictors make not only to a regression model, but to each other as well. Some of the techniques to interpret MR effects include, but are not limited to, correlation coefficients, beta weights, structure coefficients, all possible subsets regression, commonality coefficients, dominance weights, and relative importance weights. This article will review a set of techniques to interpret MR effects, identify the elements of the data on which the methods focus, and identify statistical software to support such analyses. PMID:22457655

  1. Tools to support interpreting multiple regression in the face of multicollinearity.

    PubMed

    Kraha, Amanda; Turner, Heather; Nimon, Kim; Zientek, Linda Reichwein; Henson, Robin K

    2012-01-01

    While multicollinearity may increase the difficulty of interpreting multiple regression (MR) results, it should not cause undue problems for the knowledgeable researcher. In the current paper, we argue that rather than using one technique to investigate regression results, researchers should consider multiple indices to understand the contributions that predictors make not only to a regression model, but to each other as well. Some of the techniques to interpret MR effects include, but are not limited to, correlation coefficients, beta weights, structure coefficients, all possible subsets regression, commonality coefficients, dominance weights, and relative importance weights. This article will review a set of techniques to interpret MR effects, identify the elements of the data on which the methods focus, and identify statistical software to support such analyses.

  2. Strengthening the Regression Discontinuity Design Using Additional Design Elements: A Within-Study Comparison

    ERIC Educational Resources Information Center

    Wing, Coady; Cook, Thomas D.

    2013-01-01

    The sharp regression discontinuity design (RDD) has three key weaknesses compared to the randomized clinical trial (RCT). It has lower statistical power, it is more dependent on statistical modeling assumptions, and its treatment effect estimates are limited to the narrow subpopulation of cases immediately around the cutoff, which is rarely of…

  3. Further Insight and Additional Inference Methods for Polynomial Regression Applied to the Analysis of Congruence

    ERIC Educational Resources Information Center

    Cohen, Ayala; Nahum-Shani, Inbal; Doveh, Etti

    2010-01-01

    In their seminal paper, Edwards and Parry (1993) presented the polynomial regression as a better alternative to applying difference score in the study of congruence. Although this method is increasingly applied in congruence research, its complexity relative to other methods for assessing congruence (e.g., difference score methods) was one of the…

  4. Statistical summary of selected physical, chemical, and toxicity characteristics and estimates of annual constituent loads in urban stormwater, Maricopa County, Arizona

    USGS Publications Warehouse

    Fossum, Kenneth D.; O'Day, Christie M.; Wilson, Barbara J.; Monical, Jim E.

    2001-01-01

    Stormwater and streamflow in Maricopa County were monitored to (1) describe the physical, chemical, and toxicity characteristics of stormwater from areas having different land uses, (2) describe the physical, chemical, and toxicity characteristics of streamflow from areas that receive urban stormwater, and (3) estimate constituent loads in stormwater. Urban stormwater and streamflow had similar ranges in most constituent concentrations. The mean concentration of dissolved solids in urban stormwater was lower than in streamflow from the Salt River and Indian Bend Wash. Urban stormwater, however, had a greater chemical oxygen demand and higher concentrations of most nutrients. Mean seasonal loads and mean annual loads of 11 constituents and volumes of runoff were estimated for municipalities in the metropolitan Phoenix area, Arizona, by adjusting regional regression equations of loads. This adjustment procedure uses the original regional regression equation and additional explanatory variables that were not included in the original equation. The adjusted equations had standard errors that ranged from 161 to 196 percent. The large standard errors of the prediction result from the large variability of the constituent concentration data used in the regression analysis. Adjustment procedures produced unsatisfactory results for nine of the regressions?suspended solids, dissolved solids, total phosphorus, dissolved phosphorus, total recoverable cadmium, total recoverable copper, total recoverable lead, total recoverable zinc, and storm runoff. These equations had no consistent direction of bias and no other additional explanatory variables correlated with the observed loads. A stepwise-multiple regression or a three-variable regression (total storm rainfall, drainage area, and impervious area) and local data were used to develop local regression equations for these nine constituents. These equations had standard errors from 15 to 183 percent.

  5. Experimental investigation of paraffin-based fuels for hybrid rocket propulsion

    NASA Astrophysics Data System (ADS)

    Galfetti, L.; Merotto, L.; Boiocchi, M.; Maggi, F.; DeLuca, L. T.

    2013-03-01

    Solid fuels for hybrid rockets were characterized in the framework of a research project aimed to develop a new generation of solid fuels, combining at the same time good mechanical and ballistic properties. Original techniques were implemented in order to improve paraffin-based fuels. The first strengthening technique involves the use of a polyurethane foam (PUF); a second technique is based on thermoplastic polymers mixed at molecular level with the paraffin binder. A ballistic characterization of paraffin-based hybrid rocket solid fuels was performed, considering pure wax-based fuels and fuels doped with suitable metal additives. Nano-Al powders and metal hydrides (magnesium hydride (MgH2), lithium aluminum hydride (LiAlH4 )) were used as fillers in paraffin matrices. The results of this investigation show a strong correlation between the measured viscosity of the melted paraffin layer and the regression rate: a decrease of viscosity increases the regression rate. This trend is due to the increasing development of entrainment phenomena, which strongly increase the regression rate. Addition of LiAlH4 (mass fraction 10%) can further increase the regression rate up to 378% with respect to the pure HTPB regression rate, taken as baseline reference fuel. The highest regression rates were found for the Solid Wax (SW) composition, added with 5% MgH2 mass fraction; at 350 kg/(m2s) oxygen mass flux, the measured regression rate, averaged in space and time, was 2.5 mm/s, which is approximately five times higher than that of the pure HTPB composition. Compositions added with nanosized aluminum powders were compared with those added with MgH2, using gel or solid wax.

  6. Civic communities and urban violence.

    PubMed

    Doucet, Jessica M; Lee, Matthew R

    2015-07-01

    Civic communities have a spirit of entrepreneurialism, a locally invested population and an institutional structure fostering civic engagement. Prior research, mainly confined to studying rural communities and fairly large geographic areas, has demonstrated that civic communities have lower rates of violence. The current study analyzes the associations between the components of civic communities and homicide rates for New Orleans neighborhoods (census tracts) in the years following Hurricane Katrina. Results from negative binomial regression models adjusting for spatial autocorrelation reveal that community homicide rates are lower where an entrepreneurial business climate is more pronounced and where there is more local investment. Additionally, an interaction between the availability of civic institutions and resource disadvantage reveals that the protective effects of civic institutions are only evident in disadvantaged communities. Copyright © 2015 Elsevier Inc. All rights reserved.

  7. Quantitative Structure-Activity Relationship of Insecticidal Activity of Benzyl Ether Diamidine Derivatives

    NASA Astrophysics Data System (ADS)

    Zhai, Mengting; Chen, Yan; Li, Jing; Zhou, Jun

    2017-12-01

    The molecular electrongativity distance vector (MEDV-13) was used to describe the molecular structure of benzyl ether diamidine derivatives in this paper, Based on MEDV-13, The three-parameter (M 3, M 15, M 47) QSAR model of insecticidal activity (pIC 50) for 60 benzyl ether diamidine derivatives was constructed by leaps-and-bounds regression (LBR) . The traditional correlation coefficient (R) and the cross-validation correlation coefficient (R CV ) were 0.975 and 0.971, respectively. The robustness of the regression model was validated by Jackknife method, the correlation coefficient R were between 0.971 and 0.983. Meanwhile, the independent variables in the model were tested to be no autocorrelation. The regression results indicate that the model has good robust and predictive capabilities. The research would provide theoretical guidance for the development of new generation of anti African trypanosomiasis drugs with efficiency and low toxicity.

  8. Brain structure correlates of component reading processes: implications for reading disability.

    PubMed

    Phinney, Erin; Pennington, Bruce F; Olson, Richard; Filley, Christopher M; Filipek, Pauline A

    2007-08-01

    Brain structures implicated in developmental dyslexia (reading disability - RD) vary greatly across structural magnetic resonance imaging (MRI) studies due to methodological differences regarding the definition of RD and the exact measurements of a specific brain structure. The current study attempts to resolve some of those methodological concerns by examining brain volume as it relates to components of proposed RD subtypes. We performed individual regression analyses on total cerebral volume, neocortical volume, subcortical volume, 9 neo-cortical structures and 2 sub-cortical structures. These analyses used three dimensions of reading, phonemic ability (PA), orthographic ability, and rapid naming (RN) ability, while accounting for total cerebral volume, age, and performance IQ (PIQ). Primary analyses included membership to a group (poor reader vs. good reader) in the analysis. The result was a significant interaction between PA and reading ability as it predicts total cerebral volume. Analyses revealed that poor readers lacked a relationship between PA and brain size, but that good readers had a significant positive relationship. This pattern of interaction was not present for the other two reading component factors. These findings bring into question the general belief that individuals with RD are at the low end of a reading ability distribution and do not have a unique disorder. Additional analyses revealed only a few significant relationships between brain size and task performance, most notably a positive correlation between orthographic ability and the angular gyrus (AG), as well as a negative correlation between RN ability and the parietal operculum (PO).

  9. Equivalent circuit models for interpreting impedance perturbation spectroscopy data

    NASA Astrophysics Data System (ADS)

    Smith, R. Lowell

    2004-07-01

    As in-situ structural integrity monitoring disciplines mature, there is a growing need to process sensor/actuator data efficiently in real time. Although smaller, faster embedded processors will contribute to this, it is also important to develop straightforward, robust methods to reduce the overall computational burden for practical applications of interest. This paper addresses the use of equivalent circuit modeling techniques for inferring structure attributes monitored using impedance perturbation spectroscopy. In pioneering work about ten years ago significant progress was associated with the development of simple impedance models derived from the piezoelectric equations. Using mathematical modeling tools currently available from research in ultrasonics and impedance spectroscopy is expected to provide additional synergistic benefits. For purposes of structural health monitoring the objective is to use impedance spectroscopy data to infer the physical condition of structures to which small piezoelectric actuators are bonded. Features of interest include stiffness changes, mass loading, and damping or mechanical losses. Equivalent circuit models are typically simple enough to facilitate the development of practical analytical models of the actuator-structure interaction. This type of parametric structure model allows raw impedance/admittance data to be interpreted optimally using standard multiple, nonlinear regression analysis. One potential long-term outcome is the possibility of cataloging measured viscoelastic properties of the mechanical subsystems of interest as simple lists of attributes and their statistical uncertainties, whose evolution can be followed in time. Equivalent circuit models are well suited for addressing calibration and self-consistency issues such as temperature corrections, Poisson mode coupling, and distributed relaxation processes.

  10. Network Structure and Travel Time Perception

    PubMed Central

    Parthasarathi, Pavithra; Levinson, David; Hochmair, Hartwig

    2013-01-01

    The purpose of this research is to test the systematic variation in the perception of travel time among travelers and relate the variation to the underlying street network structure. Travel survey data from the Twin Cities metropolitan area (which includes the cities of Minneapolis and St. Paul) is used for the analysis. Travelers are classified into two groups based on the ratio of perceived and estimated commute travel time. The measures of network structure are estimated using the street network along the identified commute route. T-test comparisons are conducted to identify statistically significant differences in estimated network measures between the two traveler groups. The combined effect of these estimated network measures on travel time is then analyzed using regression models. The results from the t-test and regression analyses confirm the influence of the underlying network structure on the perception of travel time. PMID:24204932

  11. Fusion of sensor geometry into additive strain fields measured with sensing skin

    NASA Astrophysics Data System (ADS)

    Downey, Austin; Sadoughi, Mohammadkazem; Laflamme, Simon; Hu, Chao

    2018-07-01

    Recently, numerous studies have been conducted on flexible skin-like membranes for the cost effective monitoring of large-scale structures. The authors have proposed a large-area electronic consisting of a soft elastomeric capacitor (SEC) that transduces a structure’s strain into a measurable change in capacitance. Arranged in a network configuration, SECs deployed onto the surface of a structure could be used to reconstruct strain maps. Several regression methods have been recently developed with the purpose of reconstructing such maps, but all these algorithms assumed that each SEC-measured strain located at its geometric center. This assumption may not be realistic since an SEC measures the average strain value of the whole area covered by the sensor. One solution is to reduce the size of each SEC, but this would also increase the number of required sensors needed to cover the large-scale structure, therefore increasing the need for the power and data acquisition capabilities. Instead, this study proposes an algorithm that accounts for the sensor’s strain averaging feature by adjusting the strain measurements and constructing a full-field strain map using the kriging interpolation method. The proposed algorithm fuses the geometry of an SEC sensor into the strain map reconstruction in order to adaptively adjust the average kriging-estimated strain of the area monitored by the sensor to the signal. Results show that by considering the sensor geometry, in addition to the sensor signal and location, the proposed strain map adjustment algorithm is capable of producing more accurate full-field strain maps than the traditional spatial interpolation method that considered only signal and location.

  12. Uncertainty analysis of terrestrial net primary productivity and net biome productivity in China during 1901-2005

    NASA Astrophysics Data System (ADS)

    Shao, Junjiong; Zhou, Xuhui; Luo, Yiqi; Zhang, Guodong; Yan, Wei; Li, Jiaxuan; Li, Bo; Dan, Li; Fisher, Joshua B.; Gao, Zhiqiang; He, Yong; Huntzinger, Deborah; Jain, Atul K.; Mao, Jiafu; Meng, Jihua; Michalak, Anna M.; Parazoo, Nicholas C.; Peng, Changhui; Poulter, Benjamin; Schwalm, Christopher R.; Shi, Xiaoying; Sun, Rui; Tao, Fulu; Tian, Hanqin; Wei, Yaxing; Zeng, Ning; Zhu, Qiuan; Zhu, Wenquan

    2016-05-01

    Despite the importance of net primary productivity (NPP) and net biome productivity (NBP), estimates of NPP and NBP for China are highly uncertain. To investigate the main sources of uncertainty, we synthesized model estimates of NPP and NBP for China from published literature and the Multi-scale Synthesis and Terrestrial Model Intercomparison Project (MsTMIP). The literature-based results showed that total NPP and NBP in China were 3.35 ± 1.25 and 0.14 ± 0.094 Pg C yr-1, respectively. Classification and regression tree analysis based on literature data showed that model type was the primary source of the uncertainty, explaining 36% and 64% of the variance in NPP and NBP, respectively. Spatiotemporal scales, land cover conditions, inclusion of the N cycle, and effects of N addition also contributed to the overall uncertainty. Results based on the MsTMIP data suggested that model structures were overwhelmingly important (>90%) for the overall uncertainty compared to simulations with different combinations of time-varying global change factors. The interannual pattern of NPP was similar among diverse studies and increased by 0.012 Pg C yr-1 during 1981-2000. In addition, high uncertainty in China's NPP occurred in areas with high productivity, whereas NBP showed the opposite pattern. Our results suggest that to significantly reduce uncertainty in estimated NPP and NBP, model structures should be substantially tested on the basis of empirical results. To this end, coordinated distributed experiments with multiple global change factors might be a practical approach that can validate specific structures of different models.

  13. Effectiveness of the New Hampshire stream-gaging network in providing regional streamflow information

    USGS Publications Warehouse

    Olson, Scott A.

    2003-01-01

    The stream-gaging network in New Hampshire was analyzed for its effectiveness in providing regional information on peak-flood flow, mean-flow, and low-flow frequency. The data available for analysis were from stream-gaging stations in New Hampshire and selected stations in adjacent States. The principles of generalized-least-squares regression analysis were applied to develop regional regression equations that relate streamflow-frequency characteristics to watershed characteristics. Regression equations were developed for (1) the instantaneous peak flow with a 100-year recurrence interval, (2) the mean-annual flow, and (3) the 7-day, 10-year low flow. Active and discontinued stream-gaging stations with 10 or more years of flow data were used to develop the regression equations. Each stream-gaging station in the network was evaluated and ranked on the basis of how much the data from that station contributed to the cost-weighted sampling-error component of the regression equation. The potential effect of data from proposed and new stream-gaging stations on the sampling error also was evaluated. The stream-gaging network was evaluated for conditions in water year 2000 and for estimated conditions under various network strategies if an additional 5 years and 20 years of streamflow data were collected. The effectiveness of the stream-gaging network in providing regional streamflow information could be improved for all three flow characteristics with the collection of additional flow data, both temporally and spatially. With additional years of data collection, the greatest reduction in the average sampling error of the regional regression equations was found for the peak- and low-flow characteristics. In general, additional data collection at stream-gaging stations with unregulated flow, relatively short-term record (less than 20 years), and drainage areas smaller than 45 square miles contributed the largest cost-weighted reduction to the average sampling error of the regional estimating equations. The results of the network analyses can be used to prioritize the continued operation of active stations, the reactivation of discontinued stations, or the activation of new stations to maximize the regional information content provided by the stream-gaging network. Final decisions regarding altering the New Hampshire stream-gaging network would require the consideration of the many uses of the streamflow data serving local, State, and Federal interests.

  14. Eye Movements Reveal the Influence of Event Structure on Reading Behavior.

    PubMed

    Swets, Benjamin; Kurby, Christopher A

    2016-03-01

    When we read narrative texts such as novels and newspaper articles, we segment information presented in such texts into discrete events, with distinct boundaries between those events. But do our eyes reflect this event structure while reading? This study examines whether eye movements during the reading of discourse reveal how readers respond online to event structure. Participants read narrative passages as we monitored their eye movements. Several measures revealed that event structure predicted eye movements. In two experiments, we found that both early and overall reading times were longer for event boundaries. We also found that regressive saccades were more likely to land on event boundaries, but that readers were less likely to regress out of an event boundary. Experiment 2 also demonstrated that tracking event structure carries a working memory load. Eye movements provide a rich set of online data to test the cognitive reality of event segmentation during reading. Copyright © 2015 Cognitive Science Society, Inc.

  15. Evaluation and prediction of shrub cover in coastal Oregon forests (USA)

    Treesearch

    Becky K. Kerns; Janet L. Ohmann

    2004-01-01

    We used data from regional forest inventories and research programs, coupled with mapped climatic and topographic information, to explore relationships and develop multiple linear regression (MLR) and regression tree models for total and deciduous shrub cover in the Oregon coastal province. Results from both types of models indicate that forest structure variables were...

  16. Detecting DIF in Polytomous Items Using MACS, IRT and Ordinal Logistic Regression

    ERIC Educational Resources Information Center

    Elosua, Paula; Wells, Craig

    2013-01-01

    The purpose of the present study was to compare the Type I error rate and power of two model-based procedures, the mean and covariance structure model (MACS) and the item response theory (IRT), and an observed-score based procedure, ordinal logistic regression, for detecting differential item functioning (DIF) in polytomous items. A simulation…

  17. Weighted linear regression using D2H and D2 as the independent variables

    Treesearch

    Hans T. Schreuder; Michael S. Williams

    1998-01-01

    Several error structures for weighted regression equations used for predicting volume were examined for 2 large data sets of felled and standing loblolly pine trees (Pinus taeda L.). The generally accepted model with variance of error proportional to the value of the covariate squared ( D2H = diameter squared times height or D...

  18. Latent Variable Regression 4-Level Hierarchical Model Using Multisite Multiple-Cohorts Longitudinal Data. CRESST Report 801

    ERIC Educational Resources Information Center

    Choi, Kilchan

    2011-01-01

    This report explores a new latent variable regression 4-level hierarchical model for monitoring school performance over time using multisite multiple-cohorts longitudinal data. This kind of data set has a 4-level hierarchical structure: time-series observation nested within students who are nested within different cohorts of students. These…

  19. Techniques for estimating flood peak discharges for unregulated streams and streams regulated by small floodwater retarding structures in Oklahoma

    USGS Publications Warehouse

    Tortorelli, R.L.; Bergman, D.L.

    1985-01-01

    Statewide regression relations for Oklahoma were determined for estimating peak discharge of floods for selected recurrence intervals from 2 to 500 years. The independent variables required for estimating flood discharge for rural streams are contributing drainage area and mean annual precipitation. Main-channel slope, a variable used in previous reports, was found to contribute very little to the accuracy of the relations and was not used. The regression equations are applicable for watersheds with drainage areas less than 2,500 square miles that are not significantly affected by regulation from manmade works. These relations are presented in graphical form for easy application. Limitations on the use of the regression relations and the reliability of regression estimates for rural unregulated streams are discussed. Basin and climatic characteristics, log-Pearson Type III statistics and the flood-frequency relations for 226 gaging stations in Oklahoma and adjacent states are presented. Regression relations are investigated for estimating flood magnitude and frequency for watersheds affected by regulation from small FRS (floodwater retarding structures) built by the U.S. Soil Conservation Service in their watershed protection and flood prevention program. Gaging-station data from nine FRS regulated sites in Oklahoma and one FRS regulated site in Kansas are used. For sites regulated by FRS, an adjustment of the statewide rural regression relations can be used to estimate flood magnitude and frequency. The statewide regression equations are used by substituting the drainage area below the FRS, or drainage area that represents the percent of the basin unregulated, in the contributing drainage area parameter to obtain flood-frequency estimates. Flood-frequency curves and flow-duration curves are presented for five gaged sites to illustrate the effects of FRS regulation on peak discharge.

  20. Neither fixed nor random: weighted least squares meta-regression.

    PubMed

    Stanley, T D; Doucouliagos, Hristos

    2017-03-01

    Our study revisits and challenges two core conventional meta-regression estimators: the prevalent use of 'mixed-effects' or random-effects meta-regression analysis and the correction of standard errors that defines fixed-effects meta-regression analysis (FE-MRA). We show how and explain why an unrestricted weighted least squares MRA (WLS-MRA) estimator is superior to conventional random-effects (or mixed-effects) meta-regression when there is publication (or small-sample) bias that is as good as FE-MRA in all cases and better than fixed effects in most practical applications. Simulations and statistical theory show that WLS-MRA provides satisfactory estimates of meta-regression coefficients that are practically equivalent to mixed effects or random effects when there is no publication bias. When there is publication selection bias, WLS-MRA always has smaller bias than mixed effects or random effects. In practical applications, an unrestricted WLS meta-regression is likely to give practically equivalent or superior estimates to fixed-effects, random-effects, and mixed-effects meta-regression approaches. However, random-effects meta-regression remains viable and perhaps somewhat preferable if selection for statistical significance (publication bias) can be ruled out and when random, additive normal heterogeneity is known to directly affect the 'true' regression coefficient. Copyright © 2016 John Wiley & Sons, Ltd. Copyright © 2016 John Wiley & Sons, Ltd.

  1. Factor Structure of the Primary Scales of the Inventory of Personality Organization in a Nonclinical Sample Using Exploratory Structural Equation Modeling

    ERIC Educational Resources Information Center

    Ellison, William D.; Levy, Kenneth N.

    2012-01-01

    Using exploratory structural equation modeling and multiple regression, we examined the factor structure and criterion relations of the primary scales of the Inventory of Personality Organization (IPO; Kernberg & Clarkin, 1995) in a nonclinical sample. Participants (N = 1,260) completed the IPO and measures of self-concept clarity, defenses,…

  2. Psychometrics of the AAN Caregiver Driving Safety Questionnaire and contributors to caregiver concern about driving safety in older adults.

    PubMed

    Carvalho, Janessa O; Springate, Beth; Bernier, Rachel A; Davis, Jennifer

    2018-03-01

    ABSTRACTBackground:The American Academy of Neurology (AAN) updated their practice parameters in the evaluation of driving risk in dementia and developed a Caregiver Driving Safety Questionnaire, detailed in their original manuscript (Iverson Gronseth, Reger, Classen, Dubinsky, & Rizzo, 2010). They described four factors associated with decreased driving ability in dementia patients: history of crashes or citations, informant-reported concerns, reduced mileage, and aggressive driving. An informant-reported AAN Caregiver Driving Safety Questionnaire was designed with these elements, and the current study was the first to explore the factor structure of this questionnaire. Additionally, we examined associations between these factors and cognitive and behavioral measures in patients with mild cognitive impairment or early Alzheimer's disease and their informants. Exploratory factor analysis revealed a four-component structure, consistent with the theory behind the AAN scale composition. These four factor scores also were significantly associated with performance on cognitive screening instruments and informant reported behavioral dysfunction. Regressions revealed that behavioral dysfunction predicted caregiver concerns about driving safety beyond objective patient cognitive dysfunction. In this first known quantitative exploration of the scale, our results support continued use of this scale in office driving safety assessments. Additionally, patient behavioral changes predicted caregiver concerns about driving safety over and above cognitive status, which suggests that caregivers may benefit from psychoeducation about cognitive factors that may negatively impact driving safety.

  3. Embedded Sensors for Measuring Surface Regression

    NASA Technical Reports Server (NTRS)

    Gramer, Daniel J.; Taagen, Thomas J.; Vermaak, Anton G.

    2006-01-01

    The development and evaluation of new hybrid and solid rocket motors requires accurate characterization of the propellant surface regression as a function of key operational parameters. These characteristics establish the propellant flow rate and are prime design drivers affecting the propulsion system geometry, size, and overall performance. There is a similar need for the development of advanced ablative materials, and the use of conventional ablatives exposed to new operational environments. The Miniature Surface Regression Sensor (MSRS) was developed to serve these applications. It is designed to be cast or embedded in the material of interest and regresses along with it. During this process, the resistance of the sensor is related to its instantaneous length, allowing the real-time thickness of the host material to be established. The time derivative of this data reveals the instantaneous surface regression rate. The MSRS could also be adapted to perform similar measurements for a variety of other host materials when it is desired to monitor thicknesses and/or regression rate for purposes of safety, operational control, or research. For example, the sensor could be used to monitor the thicknesses of brake linings or racecar tires and indicate when they need to be replaced. At the time of this reporting, over 200 of these sensors have been installed into a variety of host materials. An MSRS can be made in either of two configurations, denoted ladder and continuous (see Figure 1). A ladder MSRS includes two highly electrically conductive legs, across which narrow strips of electrically resistive material are placed at small increments of length. These strips resemble the rungs of a ladder and are electrically equivalent to many tiny resistors connected in parallel. A substrate material provides structural support for the legs and rungs. The instantaneous sensor resistance is read by an external signal conditioner via wires attached to the conductive legs on the non-eroding end of the sensor. The sensor signal can be transmitted from inside a high-pressure chamber to the ambient environment, using commercially available feedthrough connectors. Miniaturized internal recorders or wireless data transmission could also potentially be employed to eliminate the need for producing penetrations in the chamber case. The rungs are designed so that as each successive rung is eroded away, the resistance changes by an amount that yields a readily measurable signal larger than the background noise. (In addition, signal-conditioning techniques are used in processing the resistance readings to mitigate the effect of noise.) Hence, each discrete change of resistance serves to indicate the arrival of the regressing host material front at the known depth of the affected resistor rung. The average rate of regression between two adjacent resistors can be calculated simply as the distance between the resistors divided by the time interval between their resistance jumps. Advanced data reduction techniques have also been developed to establish the instantaneous surface position and regression rate when the regressing front is between rungs.

  4. LiDAR based prediction of forest biomass using hierarchical models with spatially varying coefficients

    USGS Publications Warehouse

    Babcock, Chad; Finley, Andrew O.; Bradford, John B.; Kolka, Randall K.; Birdsey, Richard A.; Ryan, Michael G.

    2015-01-01

    Many studies and production inventory systems have shown the utility of coupling covariates derived from Light Detection and Ranging (LiDAR) data with forest variables measured on georeferenced inventory plots through regression models. The objective of this study was to propose and assess the use of a Bayesian hierarchical modeling framework that accommodates both residual spatial dependence and non-stationarity of model covariates through the introduction of spatial random effects. We explored this objective using four forest inventory datasets that are part of the North American Carbon Program, each comprising point-referenced measures of above-ground forest biomass and discrete LiDAR. For each dataset, we considered at least five regression model specifications of varying complexity. Models were assessed based on goodness of fit criteria and predictive performance using a 10-fold cross-validation procedure. Results showed that the addition of spatial random effects to the regression model intercept improved fit and predictive performance in the presence of substantial residual spatial dependence. Additionally, in some cases, allowing either some or all regression slope parameters to vary spatially, via the addition of spatial random effects, further improved model fit and predictive performance. In other instances, models showed improved fit but decreased predictive performance—indicating over-fitting and underscoring the need for cross-validation to assess predictive ability. The proposed Bayesian modeling framework provided access to pixel-level posterior predictive distributions that were useful for uncertainty mapping, diagnosing spatial extrapolation issues, revealing missing model covariates, and discovering locally significant parameters.

  5. Influence factors and forecast of carbon emission in China: structure adjustment for emission peak

    NASA Astrophysics Data System (ADS)

    Wang, B.; Cui, C. Q.; Li, Z. P.

    2018-02-01

    This paper introduced Principal Component Analysis and Multivariate Linear Regression Model to verify long-term balance relationships between Carbon Emissions and the impact factors. The integrated model of improved PCA and multivariate regression analysis model is attainable to figure out the pattern of carbon emission sources. Main empirical results indicate that among all selected variables, the role of energy consumption scale was largest. GDP and Population follow and also have significant impacts on carbon emission. Industrialization rate and fossil fuel proportion, which is the indicator of reflecting the economic structure and energy structure, have a higher importance than the factor of urbanization rate and the dweller consumption level of urban areas. In this way, some suggestions are put forward for government to achieve the peak of carbon emissions.

  6. Estimation of True Change with Three-Wave Data.

    ERIC Educational Resources Information Center

    Gupta, J. K.; And Others

    1988-01-01

    A multiple regression model for estimating true change of an individual by using information in addition to pretest-posttest data is described. Longitudinal panel or multi-wave data taken between the pretest and the posttest constitute the additional information. (TJH)

  7. Inhibitory saccadic dysfunction is associated with cerebellar injury in multiple sclerosis.

    PubMed

    Kolbe, Scott C; Kilpatrick, Trevor J; Mitchell, Peter J; White, Owen; Egan, Gary F; Fielding, Joanne

    2014-05-01

    Cognitive dysfunction is common in patients with multiple sclerosis (MS). Saccadic eye movement paradigms such as antisaccades (AS) can sensitively interrogate cognitive function, in particular, the executive and attentional processes of response selection and inhibition. Although we have previously demonstrated significant deficits in the generation of AS in MS patients, the neuropathological changes underlying these deficits were not elucidated. In this study, 24 patients with relapsing-remitting MS underwent testing using an AS paradigm. Rank correlation and multiple regression analyses were subsequently used to determine whether AS errors in these patients were associated with: (i) neurological and radiological abnormalities, as measured by standard clinical techniques, (ii) cognitive dysfunction, and (iii) regionally specific cerebral white and gray-matter damage. Although AS error rates in MS patients did not correlate with clinical disability (using the Expanded Disability Status Score), T2 lesion load or brain parenchymal fraction, AS error rate did correlate with performance on the Paced Auditory Serial Addition Task and the Symbol Digit Modalities Test, neuropsychological tests commonly used in MS. Further, voxel-wise regression analyses revealed associations between AS errors and reduced fractional anisotropy throughout most of the cerebellum, and increased mean diffusivity in the cerebellar vermis. Region-wise regression analyses confirmed that AS errors also correlated with gray-matter atrophy in the cerebellum right VI subregion. These results support the use of the AS paradigm as a marker for cognitive dysfunction in MS and implicate structural and microstructural changes to the cerebellum as a contributing mechanism for AS deficits in these patients. Copyright © 2013 Wiley Periodicals, Inc.

  8. Finding the Perfect Match: Factors That Influence Family Medicine Residency Selection.

    PubMed

    Wright, Katherine M; Ryan, Elizabeth R; Gatta, John L; Anderson, Lauren; Clements, Deborah S

    2016-04-01

    Residency program selection is a significant experience for emerging physicians, yet there is limited information about how applicants narrow their list of potential programs. This study examines factors that influence residency program selection among medical students interested in family medicine at the time of application. Medical students with an expressed interest in family medicine were invited to participate in a 37-item, online survey. Students were asked to rate factors that may impact residency selection on a 6-point Likert scale in addition to three open-ended qualitative questions. Mean values were calculated for each survey item and were used to determine a rank order for selection criteria. Logistic regression analysis was performed to identify factors that predict a strong interest in urban, suburban, and rural residency programs. Logistic regression was also used to identify factors that predict a strong interest in academic health center-based residencies, community-based residencies, and community-based residencies with an academic affiliation. A total of 705 medical students from 32 states across the country completed the survey. Location, work/life balance, and program structure (curriculum, schedule) were rated the most important factors for residency selection. Logistic regression analysis was used to refine our understanding of how each factor relates to specific types of residencies. These findings have implications for how to best advise students in selecting a residency, as well as marketing residencies to the right candidates. Refining the recruitment process will ensure a better fit between applicants and potential programs. Limited recruitment resources may be better utilized by focusing on targeted dissemination strategies.

  9. Utility of the comprehensive marijuana motives questionnaire among medical cannabis patients.

    PubMed

    Bohnert, Kipling M; Bonar, Erin E; Arnedt, J Todd; Conroy, Deirdre A; Walton, Maureen A; Ilgen, Mark A

    2018-01-01

    Little is known about motives for cannabis use among the population of adults using cannabis medically. Therefore, we evaluated the performance of the 12 factor, 36-item Comprehensive Marijuana Motives Questionnaire (CMMQ) among a sample of medical cannabis patients. Study participants were adults ages 21years or older with scheduled appointments to obtain new or renewed medical cannabis certification from clinics in one Midwestern state (n=1116). Confirmatory factor analysis was used to evaluate properties of the CMMQ. Multiple regressions were used to estimate associations between motives and cannabis use, physical health functioning, and mental health functioning. Fit indices were acceptable, and factor loadings ranged from 0.57 to 0.94. Based on regression analyses, motives accounted for 7% of the variance in recent cannabis use, and independent of cannabis use, accounted for 5% and 19% of physical and mental health functioning, respectively. Regression analyses also revealed that distinct motives were associated with cannabis use and physical and mental health functioning. Among adults seeking medical cannabis certification, the factor structure of the CMMQ was supported, and consistent with prior studies of adolescents and young adults using cannabis recreationally. Thus, individuals who use cannabis medically may have diverse reasons for use that extend beyond the management of medical symptoms. In addition, coping and sleep-related motives may be particularly salient for this population. Findings support the utility of the CMMQ in future research on medical cannabis use; however, expansion of the scale may be needed to address medical motives for use. Published by Elsevier Ltd.

  10. Marginal regression approach for additive hazards models with clustered current status data.

    PubMed

    Su, Pei-Fang; Chi, Yunchan

    2014-01-15

    Current status data arise naturally from tumorigenicity experiments, epidemiology studies, biomedicine, econometrics and demographic and sociology studies. Moreover, clustered current status data may occur with animals from the same litter in tumorigenicity experiments or with subjects from the same family in epidemiology studies. Because the only information extracted from current status data is whether the survival times are before or after the monitoring or censoring times, the nonparametric maximum likelihood estimator of survival function converges at a rate of n(1/3) to a complicated limiting distribution. Hence, semiparametric regression models such as the additive hazards model have been extended for independent current status data to derive the test statistics, whose distributions converge at a rate of n(1/2) , for testing the regression parameters. However, a straightforward application of these statistical methods to clustered current status data is not appropriate because intracluster correlation needs to be taken into account. Therefore, this paper proposes two estimating functions for estimating the parameters in the additive hazards model for clustered current status data. The comparative results from simulation studies are presented, and the application of the proposed estimating functions to one real data set is illustrated. Copyright © 2013 John Wiley & Sons, Ltd.

  11. The Trait Emotional Intelligence Questionnaire: Internal Structure, Convergent, Criterion, and Incremental Validity in an Italian Sample

    ERIC Educational Resources Information Center

    Andrei, Federica; Smith, Martin M.; Surcinelli, Paola; Baldaro, Bruno; Saklofske, Donald H.

    2016-01-01

    This study investigated the structure and validity of the Italian translation of the Trait Emotional Intelligence Questionnaire. Data were self-reported from 227 participants. Confirmatory factor analysis supported the four-factor structure of the scale. Hierarchical regressions also demonstrated its incremental validity beyond demographics, the…

  12. QSAR, docking and ADMET studies of artemisinin derivatives for antimalarial activity targeting plasmepsin II, a hemoglobin-degrading enzyme from P. falciparum.

    PubMed

    Qidwai, Tabish; Yadav, Dharmendra K; Khan, Feroz; Dhawan, Sangeeta; Bhakuni, R S

    2012-01-01

    This work presents the development of quantitative structure activity relationship (QSAR) model to predict the antimalarial activity of artemisinin derivatives. The structures of the molecules are represented by chemical descriptors that encode topological, geometric, and electronic structure features. Screening through QSAR model suggested that compounds A24, A24a, A53, A54, A62 and A64 possess significant antimalarial activity. Linear model is developed by the multiple linear regression method to link structures to their reported antimalarial activity. The correlation in terms of regression coefficient (r(2)) was 0.90 and prediction accuracy of model in terms of cross validation regression coefficient (rCV(2)) was 0.82. This study indicates that chemical properties viz., atom count (all atoms), connectivity index (order 1, standard), ring count (all rings), shape index (basic kappa, order 2), and solvent accessibility surface area are well correlated with antimalarial activity. The docking study showed high binding affinity of predicted active compounds against antimalarial target Plasmepsins (Plm-II). Further studies for oral bioavailability, ADMET and toxicity risk assessment suggest that compound A24, A24a, A53, A54, A62 and A64 exhibits marked antimalarial activity comparable to standard antimalarial drugs. Later one of the predicted active compound A64 was chemically synthesized, structure elucidated by NMR and in vivo tested in multidrug resistant strain of Plasmodium yoelii nigeriensis infected mice. The experimental results obtained agreed well with the predicted values.

  13. Regression mixture models: Does modeling the covariance between independent variables and latent classes improve the results?

    PubMed Central

    Lamont, Andrea E.; Vermunt, Jeroen K.; Van Horn, M. Lee

    2016-01-01

    Regression mixture models are increasingly used as an exploratory approach to identify heterogeneity in the effects of a predictor on an outcome. In this simulation study, we test the effects of violating an implicit assumption often made in these models – i.e., independent variables in the model are not directly related to latent classes. Results indicated that the major risk of failing to model the relationship between predictor and latent class was an increase in the probability of selecting additional latent classes and biased class proportions. Additionally, this study tests whether regression mixture models can detect a piecewise relationship between a predictor and outcome. Results suggest that these models are able to detect piecewise relations, but only when the relationship between the latent class and the predictor is included in model estimation. We illustrate the implications of making this assumption through a re-analysis of applied data examining heterogeneity in the effects of family resources on academic achievement. We compare previous results (which assumed no relation between independent variables and latent class) to the model where this assumption is lifted. Implications and analytic suggestions for conducting regression mixture based on these findings are noted. PMID:26881956

  14. Incorporation of prior information on parameters into nonlinear regression groundwater flow models: 1. Theory

    USGS Publications Warehouse

    Cooley, Richard L.

    1982-01-01

    Prior information on the parameters of a groundwater flow model can be used to improve parameter estimates obtained from nonlinear regression solution of a modeling problem. Two scales of prior information can be available: (1) prior information having known reliability (that is, bias and random error structure) and (2) prior information consisting of best available estimates of unknown reliability. A regression method that incorporates the second scale of prior information assumes the prior information to be fixed for any particular analysis to produce improved, although biased, parameter estimates. Approximate optimization of two auxiliary parameters of the formulation is used to help minimize the bias, which is almost always much smaller than that resulting from standard ridge regression. It is shown that if both scales of prior information are available, then a combined regression analysis may be made.

  15. Panel regressions to estimate low-flow response to rainfall variability in ungaged basins

    USGS Publications Warehouse

    Bassiouni, Maoya; Vogel, Richard M.; Archfield, Stacey A.

    2016-01-01

    Multicollinearity and omitted-variable bias are major limitations to developing multiple linear regression models to estimate streamflow characteristics in ungaged areas and varying rainfall conditions. Panel regression is used to overcome limitations of traditional regression methods, and obtain reliable model coefficients, in particular to understand the elasticity of streamflow to rainfall. Using annual rainfall and selected basin characteristics at 86 gaged streams in the Hawaiian Islands, regional regression models for three stream classes were developed to estimate the annual low-flow duration discharges. Three panel-regression structures (random effects, fixed effects, and pooled) were compared to traditional regression methods, in which space is substituted for time. Results indicated that panel regression generally was able to reproduce the temporal behavior of streamflow and reduce the standard errors of model coefficients compared to traditional regression, even for models in which the unobserved heterogeneity between streams is significant and the variance inflation factor for rainfall is much greater than 10. This is because both spatial and temporal variability were better characterized in panel regression. In a case study, regional rainfall elasticities estimated from panel regressions were applied to ungaged basins on Maui, using available rainfall projections to estimate plausible changes in surface-water availability and usable stream habitat for native species. The presented panel-regression framework is shown to offer benefits over existing traditional hydrologic regression methods for developing robust regional relations to investigate streamflow response in a changing climate.

  16. Panel regressions to estimate low-flow response to rainfall variability in ungaged basins

    NASA Astrophysics Data System (ADS)

    Bassiouni, Maoya; Vogel, Richard M.; Archfield, Stacey A.

    2016-12-01

    Multicollinearity and omitted-variable bias are major limitations to developing multiple linear regression models to estimate streamflow characteristics in ungaged areas and varying rainfall conditions. Panel regression is used to overcome limitations of traditional regression methods, and obtain reliable model coefficients, in particular to understand the elasticity of streamflow to rainfall. Using annual rainfall and selected basin characteristics at 86 gaged streams in the Hawaiian Islands, regional regression models for three stream classes were developed to estimate the annual low-flow duration discharges. Three panel-regression structures (random effects, fixed effects, and pooled) were compared to traditional regression methods, in which space is substituted for time. Results indicated that panel regression generally was able to reproduce the temporal behavior of streamflow and reduce the standard errors of model coefficients compared to traditional regression, even for models in which the unobserved heterogeneity between streams is significant and the variance inflation factor for rainfall is much greater than 10. This is because both spatial and temporal variability were better characterized in panel regression. In a case study, regional rainfall elasticities estimated from panel regressions were applied to ungaged basins on Maui, using available rainfall projections to estimate plausible changes in surface-water availability and usable stream habitat for native species. The presented panel-regression framework is shown to offer benefits over existing traditional hydrologic regression methods for developing robust regional relations to investigate streamflow response in a changing climate.

  17. Functional Generalized Additive Models.

    PubMed

    McLean, Mathew W; Hooker, Giles; Staicu, Ana-Maria; Scheipl, Fabian; Ruppert, David

    2014-01-01

    We introduce the functional generalized additive model (FGAM), a novel regression model for association studies between a scalar response and a functional predictor. We model the link-transformed mean response as the integral with respect to t of F { X ( t ), t } where F (·,·) is an unknown regression function and X ( t ) is a functional covariate. Rather than having an additive model in a finite number of principal components as in Müller and Yao (2008), our model incorporates the functional predictor directly and thus our model can be viewed as the natural functional extension of generalized additive models. We estimate F (·,·) using tensor-product B-splines with roughness penalties. A pointwise quantile transformation of the functional predictor is also considered to ensure each tensor-product B-spline has observed data on its support. The methods are evaluated using simulated data and their predictive performance is compared with other competing scalar-on-function regression alternatives. We illustrate the usefulness of our approach through an application to brain tractography, where X ( t ) is a signal from diffusion tensor imaging at position, t , along a tract in the brain. In one example, the response is disease-status (case or control) and in a second example, it is the score on a cognitive test. R code for performing the simulations and fitting the FGAM can be found in supplemental materials available online.

  18. Residential distance to major roadways and cardiac structure in African Americans: cross-sectional results from the Jackson Heart Study.

    PubMed

    Weaver, Anne M; Wellenius, Gregory A; Wu, Wen-Chih; Hickson, DeMarc A; Kamalesh, Masoor; Wang, Yi

    2017-03-08

    Heart failure (HF) is a significant source of morbidity and mortality among African Americans. Ambient air pollution, including from traffic, is associated with HF, but the mechanisms remain unknown. The objectives of this study were to estimate the cross-sectional associations between residential distance to major roadways with markers of cardiac structure: left ventricular (LV) mass index, LV end-diastolic diameter, LV end-systolic diameter, and LV hypertrophy among African Americans. We studied baseline participants of the Jackson Heart Study (recruited 2000-2004), a prospective cohort of cardiovascular disease (CVD) among African Americans living in Jackson, Mississippi, USA. All cardiac measures were assessed from echocardiograms. We assessed the associations between residential distance to roads and cardiac structure indicators using multivariable linear regression or multivariable logistic regression, adjusting for potential confounders. Among 4826 participants, residential distance to road was <150 m for 103 participants, 150-299 m for 158, 300-999 for 1156, and ≥1000 m for 3409. Those who lived <150 m from a major road had mean 1.2 mm (95% CI 0.2, 2.1) greater LV diameter at end-systole compared to those who lived ≥1000 m. We did not observe statistically significant associations between distance to roads and LV mass index, LV end-diastolic diameter, or LV hypertrophy. Results did not materially change after additional adjustment for hypertension and diabetes or exclusion of those with CVD at baseline; results strengthened when modeling distance to A1 roads (such as interstate highways) as the exposure of interest. We found that residential distance to roads may be associated with LV end-systolic diameter, a marker of systolic dysfunction, in this cohort of African Americans, suggesting a potential mechanism by which exposure to traffic pollution increases the risk of HF.

  19. Modeling and prediction of peptide drift times in ion mobility spectrometry using sequence-based and structure-based approaches.

    PubMed

    Zhang, Yiming; Jin, Quan; Wang, Shuting; Ren, Ren

    2011-05-01

    The mobile behavior of 1481 peptides in ion mobility spectrometry (IMS), which are generated by protease digestion of the Drosophila melanogaster proteome, is modeled and predicted based on two different types of characterization methods, i.e. sequence-based approach and structure-based approach. In this procedure, the sequence-based approach considers both the amino acid composition of a peptide and the local environment profile of each amino acid in the peptide; the structure-based approach is performed with the CODESSA protocol, which regards a peptide as a common organic compound and generates more than 200 statistically significant variables to characterize the whole structure profile of a peptide molecule. Subsequently, the nonlinear support vector machine (SVM) and Gaussian process (GP) as well as linear partial least squares (PLS) regression is employed to correlate the structural parameters of the characterizations with the IMS drift times of these peptides. The obtained quantitative structure-spectrum relationship (QSSR) models are evaluated rigorously and investigated systematically via both one-deep and two-deep cross-validations as well as the rigorous Monte Carlo cross-validation (MCCV). We also give a comprehensive comparison on the resulting statistics arising from the different combinations of variable types with modeling methods and find that the sequence-based approach can give the QSSR models with better fitting ability and predictive power but worse interpretability than the structure-based approach. In addition, though the QSSR modeling using sequence-based approach is not needed for the preparation of the minimization structures of peptides before the modeling, it would be considerably efficient as compared to that using structure-based approach. Copyright © 2011 Elsevier Ltd. All rights reserved.

  20. Regression Model Term Selection for the Analysis of Strain-Gage Balance Calibration Data

    NASA Technical Reports Server (NTRS)

    Ulbrich, Norbert Manfred; Volden, Thomas R.

    2010-01-01

    The paper discusses the selection of regression model terms for the analysis of wind tunnel strain-gage balance calibration data. Different function class combinations are presented that may be used to analyze calibration data using either a non-iterative or an iterative method. The role of the intercept term in a regression model of calibration data is reviewed. In addition, useful algorithms and metrics originating from linear algebra and statistics are recommended that will help an analyst (i) to identify and avoid both linear and near-linear dependencies between regression model terms and (ii) to make sure that the selected regression model of the calibration data uses only statistically significant terms. Three different tests are suggested that may be used to objectively assess the predictive capability of the final regression model of the calibration data. These tests use both the original data points and regression model independent confirmation points. Finally, data from a simplified manual calibration of the Ames MK40 balance is used to illustrate the application of some of the metrics and tests to a realistic calibration data set.

  1. Does the high–tech industry consistently reduce CO{sub 2} emissions? Results from nonparametric additive regression model

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Xu, Bin; Research Center of Applied Statistics, Jiangxi University of Finance and Economics, Nanchang, Jiangxi 330013; Lin, Boqiang, E-mail: bqlin@xmu.edu.cn

    China is currently the world's largest carbon dioxide (CO{sub 2}) emitter. Moreover, total energy consumption and CO{sub 2} emissions in China will continue to increase due to the rapid growth of industrialization and urbanization. Therefore, vigorously developing the high–tech industry becomes an inevitable choice to reduce CO{sub 2} emissions at the moment or in the future. However, ignoring the existing nonlinear links between economic variables, most scholars use traditional linear models to explore the impact of the high–tech industry on CO{sub 2} emissions from an aggregate perspective. Few studies have focused on nonlinear relationships and regional differences in China. Basedmore » on panel data of 1998–2014, this study uses the nonparametric additive regression model to explore the nonlinear effect of the high–tech industry from a regional perspective. The estimated results show that the residual sum of squares (SSR) of the nonparametric additive regression model in the eastern, central and western regions are 0.693, 0.054 and 0.085 respectively, which are much less those that of the traditional linear regression model (3.158, 4.227 and 7.196). This verifies that the nonparametric additive regression model has a better fitting effect. Specifically, the high–tech industry produces an inverted “U–shaped” nonlinear impact on CO{sub 2} emissions in the eastern region, but a positive “U–shaped” nonlinear effect in the central and western regions. Therefore, the nonlinear impact of the high–tech industry on CO{sub 2} emissions in the three regions should be given adequate attention in developing effective abatement policies. - Highlights: • The nonlinear effect of the high–tech industry on CO{sub 2} emissions was investigated. • The high–tech industry yields an inverted “U–shaped” effect in the eastern region. • The high–tech industry has a positive “U–shaped” nonlinear effect in other regions. • The linear impact of the high–tech industry in the eastern region is the strongest.« less

  2. HIGHLIGHTING DIFFERENCES BETWEEN CONDITIONAL AND UNCONDITIONAL QUANTILE REGRESSION APPROACHES THROUGH AN APPLICATION TO ASSESS MEDICATION ADHERENCE

    PubMed Central

    BORAH, BIJAN J.; BASU, ANIRBAN

    2014-01-01

    The quantile regression (QR) framework provides a pragmatic approach in understanding the differential impacts of covariates along the distribution of an outcome. However, the QR framework that has pervaded the applied economics literature is based on the conditional quantile regression method. It is used to assess the impact of a covariate on a quantile of the outcome conditional on specific values of other covariates. In most cases, conditional quantile regression may generate results that are often not generalizable or interpretable in a policy or population context. In contrast, the unconditional quantile regression method provides more interpretable results as it marginalizes the effect over the distributions of other covariates in the model. In this paper, the differences between these two regression frameworks are highlighted, both conceptually and econometrically. Additionally, using real-world claims data from a large US health insurer, alternative QR frameworks are implemented to assess the differential impacts of covariates along the distribution of medication adherence among elderly patients with Alzheimer’s disease. PMID:23616446

  3. Quantile regression for the statistical analysis of immunological data with many non-detects.

    PubMed

    Eilers, Paul H C; Röder, Esther; Savelkoul, Huub F J; van Wijk, Roy Gerth

    2012-07-07

    Immunological parameters are hard to measure. A well-known problem is the occurrence of values below the detection limit, the non-detects. Non-detects are a nuisance, because classical statistical analyses, like ANOVA and regression, cannot be applied. The more advanced statistical techniques currently available for the analysis of datasets with non-detects can only be used if a small percentage of the data are non-detects. Quantile regression, a generalization of percentiles to regression models, models the median or higher percentiles and tolerates very high numbers of non-detects. We present a non-technical introduction and illustrate it with an implementation to real data from a clinical trial. We show that by using quantile regression, groups can be compared and that meaningful linear trends can be computed, even if more than half of the data consists of non-detects. Quantile regression is a valuable addition to the statistical methods that can be used for the analysis of immunological datasets with non-detects.

  4. Obscure phenomena in statistical analysis of quantitative structure-activity relationships. Part 1: Multicollinearity of physicochemical descriptors.

    PubMed

    Mager, P P; Rothe, H

    1990-10-01

    Multicollinearity of physicochemical descriptors leads to serious consequences in quantitative structure-activity relationship (QSAR) analysis, such as incorrect estimators and test statistics of regression coefficients of the ordinary least-squares (OLS) model applied usually to QSARs. Beside the diagnosis of the known simple collinearity, principal component regression analysis (PCRA) also allows the diagnosis of various types of multicollinearity. Only if the absolute values of PCRA estimators are order statistics that decrease monotonically, the effects of multicollinearity can be circumvented. Otherwise, obscure phenomena may be observed, such as good data recognition but low predictive model power of a QSAR model.

  5. Regularized matrix regression

    PubMed Central

    Zhou, Hua; Li, Lexin

    2014-01-01

    Summary Modern technologies are producing a wealth of data with complex structures. For instance, in two-dimensional digital imaging, flow cytometry and electroencephalography, matrix-type covariates frequently arise when measurements are obtained for each combination of two underlying variables. To address scientific questions arising from those data, new regression methods that take matrices as covariates are needed, and sparsity or other forms of regularization are crucial owing to the ultrahigh dimensionality and complex structure of the matrix data. The popular lasso and related regularization methods hinge on the sparsity of the true signal in terms of the number of its non-zero coefficients. However, for the matrix data, the true signal is often of, or can be well approximated by, a low rank structure. As such, the sparsity is frequently in the form of low rank of the matrix parameters, which may seriously violate the assumption of the classical lasso. We propose a class of regularized matrix regression methods based on spectral regularization. A highly efficient and scalable estimation algorithm is developed, and a degrees-of-freedom formula is derived to facilitate model selection along the regularization path. Superior performance of the method proposed is demonstrated on both synthetic and real examples. PMID:24648830

  6. Avoiding and Correcting Bias in Score-Based Latent Variable Regression with Discrete Manifest Items

    ERIC Educational Resources Information Center

    Lu, Irene R. R.; Thomas, D. Roland

    2008-01-01

    This article considers models involving a single structural equation with latent explanatory and/or latent dependent variables where discrete items are used to measure the latent variables. Our primary focus is the use of scores as proxies for the latent variables and carrying out ordinary least squares (OLS) regression on such scores to estimate…

  7. Predicting surface fuel models and fuel metrics using lidar and CIR imagery in a dense mixed conifer forest

    Treesearch

    Marek K. Jakubowksi; Qinghua Guo; Brandon Collins; Scott Stephens; Maggi Kelly

    2013-01-01

    We compared the ability of several classification and regression algorithms to predict forest stand structure metrics and standard surface fuel models. Our study area spans a dense, topographically complex Sierra Nevada mixed-conifer forest. We used clustering, regression trees, and support vector machine algorithms to analyze high density (average 9 pulses/m

  8. Analytical framework for reconstructing heterogeneous environmental variables from mammal community structure.

    PubMed

    Louys, Julien; Meloro, Carlo; Elton, Sarah; Ditchfield, Peter; Bishop, Laura C

    2015-01-01

    We test the performance of two models that use mammalian communities to reconstruct multivariate palaeoenvironments. While both models exploit the correlation between mammal communities (defined in terms of functional groups) and arboreal heterogeneity, the first uses a multiple multivariate regression of community structure and arboreal heterogeneity, while the second uses a linear regression of the principal components of each ecospace. The success of these methods means the palaeoenvironment of a particular locality can be reconstructed in terms of the proportions of heavy, moderate, light, and absent tree canopy cover. The linear regression is less biased, and more precisely and accurately reconstructs heavy tree canopy cover than the multiple multivariate model. However, the multiple multivariate model performs better than the linear regression for all other canopy cover categories. Both models consistently perform better than randomly generated reconstructions. We apply both models to the palaeocommunity of the Upper Laetolil Beds, Tanzania. Our reconstructions indicate that there was very little heavy tree cover at this site (likely less than 10%), with the palaeo-landscape instead comprising a mixture of light and absent tree cover. These reconstructions help resolve the previous conflicting palaeoecological reconstructions made for this site. Copyright © 2014 Elsevier Ltd. All rights reserved.

  9. Effects of land cover, topography, and built structure on seasonal water quality at multiple spatial scales.

    PubMed

    Pratt, Bethany; Chang, Heejun

    2012-03-30

    The relationship among land cover, topography, built structure and stream water quality in the Portland Metro region of Oregon and Clark County, Washington areas, USA, is analyzed using ordinary least squares (OLS) and geographically weighted (GWR) multiple regression models. Two scales of analysis, a sectional watershed and a buffer, offered a local and a global investigation of the sources of stream pollutants. Model accuracy, measured by R(2) values, fluctuated according to the scale, season, and regression method used. While most wet season water quality parameters are associated with urban land covers, most dry season water quality parameters are related topographic features such as elevation and slope. GWR models, which take into consideration local relations of spatial autocorrelation, had stronger results than OLS regression models. In the multiple regression models, sectioned watershed results were consistently better than the sectioned buffer results, except for dry season pH and stream temperature parameters. This suggests that while riparian land cover does have an effect on water quality, a wider contributing area needs to be included in order to account for distant sources of pollutants. Copyright © 2012 Elsevier B.V. All rights reserved.

  10. Time series regression model for infectious disease and weather.

    PubMed

    Imai, Chisato; Armstrong, Ben; Chalabi, Zaid; Mangtani, Punam; Hashizume, Masahiro

    2015-10-01

    Time series regression has been developed and long used to evaluate the short-term associations of air pollution and weather with mortality or morbidity of non-infectious diseases. The application of the regression approaches from this tradition to infectious diseases, however, is less well explored and raises some new issues. We discuss and present potential solutions for five issues often arising in such analyses: changes in immune population, strong autocorrelations, a wide range of plausible lag structures and association patterns, seasonality adjustments, and large overdispersion. The potential approaches are illustrated with datasets of cholera cases and rainfall from Bangladesh and influenza and temperature in Tokyo. Though this article focuses on the application of the traditional time series regression to infectious diseases and weather factors, we also briefly introduce alternative approaches, including mathematical modeling, wavelet analysis, and autoregressive integrated moving average (ARIMA) models. Modifications proposed to standard time series regression practice include using sums of past cases as proxies for the immune population, and using the logarithm of lagged disease counts to control autocorrelation due to true contagion, both of which are motivated from "susceptible-infectious-recovered" (SIR) models. The complexity of lag structures and association patterns can often be informed by biological mechanisms and explored by using distributed lag non-linear models. For overdispersed models, alternative distribution models such as quasi-Poisson and negative binomial should be considered. Time series regression can be used to investigate dependence of infectious diseases on weather, but may need modifying to allow for features specific to this context. Copyright © 2015 The Authors. Published by Elsevier Inc. All rights reserved.

  11. The social and community opportunities profile social inclusion measure: Structural equivalence and differential item functioning in community mental health residents in Hong Kong and the United Kingdom.

    PubMed

    Huxley, Peter John; Chan, Kara; Chiu, Marcus; Ma, Yanni; Gaze, Sarah; Evans, Sherrill

    2016-03-01

    China's future major health problem will be the management of chronic diseases - of which mental health is a major one. An instrument is needed to measure mental health inclusion outcomes for mental health services in Hong Kong and mainland China as they strive to promote a more inclusive society for their citizens and particular disadvantaged groups. To report on the analysis of structural equivalence and item differentiation in two mentally unhealthy and one healthy sample in the United Kingdom and Hong Kong. The mental health sample in Hong Kong was made up of non-governmental organisation (NGO) referrals meeting the selection/exclusion criteria (being well enough to be interviewed, having a formal psychiatric diagnosis and living in the community). A similar sample in the United Kingdom meeting the same selection criteria was obtained from a community mental health organisation, equivalent to the NGOs in Hong Kong. Exploratory factor analysis and logistic regression were conducted. The single-variable, self-rated 'overall social inclusion' differs significantly between all of the samples, in the way we would expect from previous research, with the healthy population feeling more included than the serious mental illness (SMI) groups. In the exploratory factor analysis, the first two factors explain between a third and half of the variance, and the single variable which enters into all the analyses in the first factor is having friends to visit the home. All the regression models were significant; however, in Hong Kong sample, only one-fifth of the total variance is explained. The structural findings imply that the social and community opportunities profile-Chinese version (SCOPE-C) gives similar results when applied to another culture. As only one-fifth of the variance of 'overall inclusion' was explained in the Hong Kong sample, it may be that the instrument needs to be refined using different or additional items within the structural domains of inclusion. © The Author(s) 2015.

  12. Applying the LANL Statistical Pattern Recognition Paradigm for Structural Health Monitoring to Data from a Surface-Effect Fast Patrol Boat

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Hoon Sohn; Charles Farrar; Norman Hunter

    2001-01-01

    This report summarizes the analysis of fiber-optic strain gauge data obtained from a surface-effect fast patrol boat being studied by the staff at the Norwegian Defense Research Establishment (NDRE) in Norway and the Naval Research Laboratory (NRL) in Washington D.C. Data from two different structural conditions were provided to the staff at Los Alamos National Laboratory. The problem was then approached from a statistical pattern recognition paradigm. This paradigm can be described as a four-part process: (1) operational evaluation, (2) data acquisition & cleansing, (3) feature extraction and data reduction, and (4) statistical model development for feature discrimination. Given thatmore » the first two portions of this paradigm were mostly completed by the NDRE and NRL staff, this study focused on data normalization, feature extraction, and statistical modeling for feature discrimination. The feature extraction process began by looking at relatively simple statistics of the signals and progressed to using the residual errors from auto-regressive (AR) models fit to the measured data as the damage-sensitive features. Data normalization proved to be the most challenging portion of this investigation. A novel approach to data normalization, where the residual errors in the AR model are considered to be an unmeasured input and an auto-regressive model with exogenous inputs (ARX) is then fit to portions of the data exhibiting similar waveforms, was successfully applied to this problem. With this normalization procedure, a clear distinction between the two different structural conditions was obtained. A false-positive study was also run, and the procedure developed herein did not yield any false-positive indications of damage. Finally, the results must be qualified by the fact that this procedure has only been applied to very limited data samples. A more complete analysis of additional data taken under various operational and environmental conditions as well as other structural conditions is necessary before one can definitively state that the procedure is robust enough to be used in practice.« less

  13. The associations between indices of patellofemoral geometry and knee pain and patella cartilage volume: a cross-sectional study

    PubMed Central

    2010-01-01

    Background Whilst patellofemoral pain is one of the most common musculoskeletal disorders presenting to orthopaedic clinics, sports clinics, and general practices, factors contributing to its development in the absence of a defined arthropathy, such as osteoarthritis (OA), are unclear. The aim of this cross-sectional study was to describe the relationships between parameters of patellofemoral geometry (patella inclination, sulcus angle and patella height) and knee pain and patella cartilage volume. Methods 240 community-based adults aged 25-60 years were recruited to take part in a study of obesity and musculoskeletal health. Magnetic resonance imaging (MRI) of the dominant knee was used to determine the lateral condyle-patella angle, sulcus angle, and Insall-Salvati ratio, as well as patella cartilage and bone volumes. Pain was assessed by the Western Ontario and McMaster University Osteoarthritis Index (WOMAC) VA pain subscale. Results Increased lateral condyle-patella angle (increased medial patella inclination) was associated with a reduction in WOMAC pain score (Regression coefficient -1.57, 95% CI -3.05, -0.09) and increased medial patella cartilage volume (Regression coefficient 51.38 mm3, 95% CI 1.68, 101.08 mm3). Higher riding patella as indicated by increased Insall-Salvati ratio was associated with decreased medial patella cartilage volume (Regression coefficient -3187 mm3, 95% CI -5510, -864 mm3). There was a trend for increased lateral patella cartilage volume associated with increased (shallower) sulcus angle (Regression coefficient 43.27 mm3, 95% CI -2.43, 88.98 mm3). Conclusion These results suggest both symptomatic and structural benefits associated with a more medially inclined patella while a high-riding patella may be detrimental to patella cartilage. This provides additional theoretical support for the current use of corrective strategies for patella malalignment that are aimed at medial patella translation, although longitudinal studies will be needed to further substantiate this. PMID:20459700

  14. A Systematic Comparison of Linear Regression-Based Statistical Methods to Assess Exposome-Health Associations.

    PubMed

    Agier, Lydiane; Portengen, Lützen; Chadeau-Hyam, Marc; Basagaña, Xavier; Giorgis-Allemand, Lise; Siroux, Valérie; Robinson, Oliver; Vlaanderen, Jelle; González, Juan R; Nieuwenhuijsen, Mark J; Vineis, Paolo; Vrijheid, Martine; Slama, Rémy; Vermeulen, Roel

    2016-12-01

    The exposome constitutes a promising framework to improve understanding of the effects of environmental exposures on health by explicitly considering multiple testing and avoiding selective reporting. However, exposome studies are challenged by the simultaneous consideration of many correlated exposures. We compared the performances of linear regression-based statistical methods in assessing exposome-health associations. In a simulation study, we generated 237 exposure covariates with a realistic correlation structure and with a health outcome linearly related to 0 to 25 of these covariates. Statistical methods were compared primarily in terms of false discovery proportion (FDP) and sensitivity. On average over all simulation settings, the elastic net and sparse partial least-squares regression showed a sensitivity of 76% and an FDP of 44%; Graphical Unit Evolutionary Stochastic Search (GUESS) and the deletion/substitution/addition (DSA) algorithm revealed a sensitivity of 81% and an FDP of 34%. The environment-wide association study (EWAS) underperformed these methods in terms of FDP (average FDP, 86%) despite a higher sensitivity. Performances decreased considerably when assuming an exposome exposure matrix with high levels of correlation between covariates. Correlation between exposures is a challenge for exposome research, and the statistical methods investigated in this study were limited in their ability to efficiently differentiate true predictors from correlated covariates in a realistic exposome context. Although GUESS and DSA provided a marginally better balance between sensitivity and FDP, they did not outperform the other multivariate methods across all scenarios and properties examined, and computational complexity and flexibility should also be considered when choosing between these methods. Citation: Agier L, Portengen L, Chadeau-Hyam M, Basagaña X, Giorgis-Allemand L, Siroux V, Robinson O, Vlaanderen J, González JR, Nieuwenhuijsen MJ, Vineis P, Vrijheid M, Slama R, Vermeulen R. 2016. A systematic comparison of linear regression-based statistical methods to assess exposome-health associations. Environ Health Perspect 124:1848-1856; http://dx.doi.org/10.1289/EHP172.

  15. Cocoa Farmers’ Compliance with Safety Precautions in Spraying Agrochemicals and Use of Personal Protective Equipment (PPE) in Cameroon

    PubMed Central

    2018-01-01

    The inability of farmers to comply with essential precautions in the course of spraying agrochemicals remains a policy dilemma, especially in developing countries. The objectives of this paper were to assess compliance of cocoa farmers with agrochemical safety measures, analyse the factors explaining involvement of cocoa farmers in the practice of reusing agrochemical containers and wearing of personal protective equipment (PPE). Data were collected with structured questionnaires from 667 cocoa farmers from the Centre and South West regions in Cameroon. Data analyses were carried out with Probit regression and Negative Binomial regression models. The results showed that average cocoa farm sizes were 3.55 ha and 2.82 ha in South West and Centre regions, respectively, and 89.80% and 42.64% complied with manufacturers’ instructions in the use of insecticides. Eating or drinking while spraying insecticides and fungicides was reported by 4.20% and 5.10% of all farmers in the two regions, respectively. However, 37.78% and 57.57% of all farmers wore hand gloves and safety boots while spraying insecticides in the South West and Centre regions of Cameroon, respectively. In addition, 7.80% of all the farmers would wash agrochemical containers and use them at home, while 42.43% would wash and use them on their farms. Probit regression results showed that probability of reusing agrochemical containers was significantly influenced (p < 0.05) by region of residence of cocoa farmers, gender, possession of formal education and farming as primary occupation. The Negative Binomial regression results showed that the log of number PPE worn was significantly influenced (p < 0.10) by region, marital status, attainment of formal education, good health, awareness of manufacturers’ instructions, land area and contact index. It was among others concluded that efforts to train farmers on the need to be familiar with manufacturers’ instructions and use PPE would enhance their safety in the course of spraying agrochemicals. PMID:29438333

  16. Cocoa Farmers' Compliance with Safety Precautions in Spraying Agrochemicals and Use of Personal Protective Equipment (PPE) in Cameroon.

    PubMed

    Oyekale, Abayomi Samuel

    2018-02-13

    The inability of farmers to comply with essential precautions in the course of spraying agrochemicals remains a policy dilemma, especially in developing countries. The objectives of this paper were to assess compliance of cocoa farmers with agrochemical safety measures, analyse the factors explaining involvement of cocoa farmers in the practice of reusing agrochemical containers and wearing of personal protective equipment (PPE). Data were collected with structured questionnaires from 667 cocoa farmers from the Centre and South West regions in Cameroon. Data analyses were carried out with Probit regression and Negative Binomial regression models. The results showed that average cocoa farm sizes were 3.55 ha and 2.82 ha in South West and Centre regions, respectively, and 89.80% and 42.64% complied with manufacturers' instructions in the use of insecticides. Eating or drinking while spraying insecticides and fungicides was reported by 4.20% and 5.10% of all farmers in the two regions, respectively. However, 37.78% and 57.57% of all farmers wore hand gloves and safety boots while spraying insecticides in the South West and Centre regions of Cameroon, respectively. In addition, 7.80% of all the farmers would wash agrochemical containers and use them at home, while 42.43% would wash and use them on their farms. Probit regression results showed that probability of reusing agrochemical containers was significantly influenced ( p < 0.05) by region of residence of cocoa farmers, gender, possession of formal education and farming as primary occupation. The Negative Binomial regression results showed that the log of number PPE worn was significantly influenced ( p < 0.10) by region, marital status, attainment of formal education, good health, awareness of manufacturers' instructions, land area and contact index. It was among others concluded that efforts to train farmers on the need to be familiar with manufacturers' instructions and use PPE would enhance their safety in the course of spraying agrochemicals.

  17. The associations between indices of patellofemoral geometry and knee pain and patella cartilage volume: a cross-sectional study.

    PubMed

    Tanamas, Stephanie K; Teichtahl, Andrew J; Wluka, Anita E; Wang, Yuanyuan; Davies-Tuck, Miranda; Urquhart, Donna M; Jones, Graeme; Cicuttini, Flavia M

    2010-05-10

    Whilst patellofemoral pain is one of the most common musculoskeletal disorders presenting to orthopaedic clinics, sports clinics, and general practices, factors contributing to its development in the absence of a defined arthropathy, such as osteoarthritis (OA), are unclear.The aim of this cross-sectional study was to describe the relationships between parameters of patellofemoral geometry (patella inclination, sulcus angle and patella height) and knee pain and patella cartilage volume. 240 community-based adults aged 25-60 years were recruited to take part in a study of obesity and musculoskeletal health. Magnetic resonance imaging (MRI) of the dominant knee was used to determine the lateral condyle-patella angle, sulcus angle, and Insall-Salvati ratio, as well as patella cartilage and bone volumes. Pain was assessed by the Western Ontario and McMaster University Osteoarthritis Index (WOMAC) VA pain subscale. Increased lateral condyle-patella angle (increased medial patella inclination) was associated with a reduction in WOMAC pain score (Regression coefficient -1.57, 95% CI -3.05, -0.09) and increased medial patella cartilage volume (Regression coefficient 51.38 mm3, 95% CI 1.68, 101.08 mm3). Higher riding patella as indicated by increased Insall-Salvati ratio was associated with decreased medial patella cartilage volume (Regression coefficient -3187 mm3, 95% CI -5510, -864 mm3). There was a trend for increased lateral patella cartilage volume associated with increased (shallower) sulcus angle (Regression coefficient 43.27 mm3, 95% CI -2.43, 88.98 mm3). These results suggest both symptomatic and structural benefits associated with a more medially inclined patella while a high-riding patella may be detrimental to patella cartilage. This provides additional theoretical support for the current use of corrective strategies for patella malalignment that are aimed at medial patella translation, although longitudinal studies will be needed to further substantiate this.

  18. Structural features that predict real-value fluctuations of globular proteins

    PubMed Central

    Jamroz, Michal; Kolinski, Andrzej; Kihara, Daisuke

    2012-01-01

    It is crucial to consider dynamics for understanding the biological function of proteins. We used a large number of molecular dynamics trajectories of non-homologous proteins as references and examined static structural features of proteins that are most relevant to fluctuations. We examined correlation of individual structural features with fluctuations and further investigated effective combinations of features for predicting the real-value of residue fluctuations using the support vector regression. It was found that some structural features have higher correlation than crystallographic B-factors with fluctuations observed in molecular dynamics trajectories. Moreover, support vector regression that uses combinations of static structural features showed accurate prediction of fluctuations with an average Pearson’s correlation coefficient of 0.669 and a root mean square error of 1.04 Å. This correlation coefficient is higher than the one observed for the prediction by the Gaussian network model. An advantage of the developed method over the Gaussian network models is that the former predicts the real-value of fluctuation. The results help improve our understanding of relationships between protein structure and fluctuation. Furthermore, the developed method provides a convienient practial way to predict fluctuations of proteins using easily computed static structural features of proteins. PMID:22328193

  19. Functional mixture regression.

    PubMed

    Yao, Fang; Fu, Yuejiao; Lee, Thomas C M

    2011-04-01

    In functional linear models (FLMs), the relationship between the scalar response and the functional predictor process is often assumed to be identical for all subjects. Motivated by both practical and methodological considerations, we relax this assumption and propose a new class of functional regression models that allow the regression structure to vary for different groups of subjects. By projecting the predictor process onto its eigenspace, the new functional regression model is simplified to a framework that is similar to classical mixture regression models. This leads to the proposed approach named as functional mixture regression (FMR). The estimation of FMR can be readily carried out using existing software implemented for functional principal component analysis and mixture regression. The practical necessity and performance of FMR are illustrated through applications to a longevity analysis of female medflies and a human growth study. Theoretical investigations concerning the consistent estimation and prediction properties of FMR along with simulation experiments illustrating its empirical properties are presented in the supplementary material available at Biostatistics online. Corresponding results demonstrate that the proposed approach could potentially achieve substantial gains over traditional FLMs.

  20. Approximate N-Player Nonzero-Sum Game Solution for an Uncertain Continuous Nonlinear System.

    PubMed

    Johnson, Marcus; Kamalapurkar, Rushikesh; Bhasin, Shubhendu; Dixon, Warren E

    2015-08-01

    An approximate online equilibrium solution is developed for an N -player nonzero-sum game subject to continuous-time nonlinear unknown dynamics and an infinite horizon quadratic cost. A novel actor-critic-identifier structure is used, wherein a robust dynamic neural network is used to asymptotically identify the uncertain system with additive disturbances, and a set of critic and actor NNs are used to approximate the value functions and equilibrium policies, respectively. The weight update laws for the actor neural networks (NNs) are generated using a gradient-descent method, and the critic NNs are generated by least square regression, which are both based on the modified Bellman error that is independent of the system dynamics. A Lyapunov-based stability analysis shows that uniformly ultimately bounded tracking is achieved, and a convergence analysis demonstrates that the approximate control policies converge to a neighborhood of the optimal solutions. The actor, critic, and identifier structures are implemented in real time continuously and simultaneously. Simulations on two and three player games illustrate the performance of the developed method.

  1. The concept of psychological regression: metaphors, mapping, Queen Square, and Tavistock Square.

    PubMed

    Mercer, Jean

    2011-05-01

    The term "regression" refers to events in which an individual changes from his or her present level of maturity and regains mental and behavioral characteristics shown at an earlier point in development. This definition has remained constant for over a century, but the implications of the concept have changed systematically from a perspective in which regression was considered pathological, to a current view in which regression may be seen as a positive step in psychotherapy or as a part of normal development. The concept of regression, famously employed by Sigmund Freud and others in his circle, derived from ideas suggested by Herbert Spencer and by John Hughlings Jackson. By the 1940s and '50s, the regression concept was applied by Winnicott and others in treatment of disturbed children and in adult psychotherapy. In addition, behavioral regression came to be seen as a part of a normal developmental trajectory, with a focus on expectable variability. The present article examines historical changes in the regression concept in terms of mapping to biomedical or other metaphors, in terms of a movement from earlier nativism toward an increased environmentalism in psychology, and with respect to other historical factors such as wartime events. The role of dominant metaphors in shifting perspectives on regression is described.

  2. Fast metabolite identification with Input Output Kernel Regression.

    PubMed

    Brouard, Céline; Shen, Huibin; Dührkop, Kai; d'Alché-Buc, Florence; Böcker, Sebastian; Rousu, Juho

    2016-06-15

    An important problematic of metabolomics is to identify metabolites using tandem mass spectrometry data. Machine learning methods have been proposed recently to solve this problem by predicting molecular fingerprint vectors and matching these fingerprints against existing molecular structure databases. In this work we propose to address the metabolite identification problem using a structured output prediction approach. This type of approach is not limited to vector output space and can handle structured output space such as the molecule space. We use the Input Output Kernel Regression method to learn the mapping between tandem mass spectra and molecular structures. The principle of this method is to encode the similarities in the input (spectra) space and the similarities in the output (molecule) space using two kernel functions. This method approximates the spectra-molecule mapping in two phases. The first phase corresponds to a regression problem from the input space to the feature space associated to the output kernel. The second phase is a preimage problem, consisting in mapping back the predicted output feature vectors to the molecule space. We show that our approach achieves state-of-the-art accuracy in metabolite identification. Moreover, our method has the advantage of decreasing the running times for the training step and the test step by several orders of magnitude over the preceding methods. celine.brouard@aalto.fi Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press.

  3. Ensemble learning of inverse probability weights for marginal structural modeling in large observational datasets.

    PubMed

    Gruber, Susan; Logan, Roger W; Jarrín, Inmaculada; Monge, Susana; Hernán, Miguel A

    2015-01-15

    Inverse probability weights used to fit marginal structural models are typically estimated using logistic regression. However, a data-adaptive procedure may be able to better exploit information available in measured covariates. By combining predictions from multiple algorithms, ensemble learning offers an alternative to logistic regression modeling to further reduce bias in estimated marginal structural model parameters. We describe the application of two ensemble learning approaches to estimating stabilized weights: super learning (SL), an ensemble machine learning approach that relies on V-fold cross validation, and an ensemble learner (EL) that creates a single partition of the data into training and validation sets. Longitudinal data from two multicenter cohort studies in Spain (CoRIS and CoRIS-MD) were analyzed to estimate the mortality hazard ratio for initiation versus no initiation of combined antiretroviral therapy among HIV positive subjects. Both ensemble approaches produced hazard ratio estimates further away from the null, and with tighter confidence intervals, than logistic regression modeling. Computation time for EL was less than half that of SL. We conclude that ensemble learning using a library of diverse candidate algorithms offers an alternative to parametric modeling of inverse probability weights when fitting marginal structural models. With large datasets, EL provides a rich search over the solution space in less time than SL with comparable results. Copyright © 2014 John Wiley & Sons, Ltd.

  4. Finding structure in data using multivariate tree boosting

    PubMed Central

    Miller, Patrick J.; Lubke, Gitta H.; McArtor, Daniel B.; Bergeman, C. S.

    2016-01-01

    Technology and collaboration enable dramatic increases in the size of psychological and psychiatric data collections, but finding structure in these large data sets with many collected variables is challenging. Decision tree ensembles such as random forests (Strobl, Malley, & Tutz, 2009) are a useful tool for finding structure, but are difficult to interpret with multiple outcome variables which are often of interest in psychology. To find and interpret structure in data sets with multiple outcomes and many predictors (possibly exceeding the sample size), we introduce a multivariate extension to a decision tree ensemble method called gradient boosted regression trees (Friedman, 2001). Our extension, multivariate tree boosting, is a method for nonparametric regression that is useful for identifying important predictors, detecting predictors with nonlinear effects and interactions without specification of such effects, and for identifying predictors that cause two or more outcome variables to covary. We provide the R package ‘mvtboost’ to estimate, tune, and interpret the resulting model, which extends the implementation of univariate boosting in the R package ‘gbm’ (Ridgeway et al., 2015) to continuous, multivariate outcomes. To illustrate the approach, we analyze predictors of psychological well-being (Ryff & Keyes, 1995). Simulations verify that our approach identifies predictors with nonlinear effects and achieves high prediction accuracy, exceeding or matching the performance of (penalized) multivariate multiple regression and multivariate decision trees over a wide range of conditions. PMID:27918183

  5. Fast metabolite identification with Input Output Kernel Regression

    PubMed Central

    Brouard, Céline; Shen, Huibin; Dührkop, Kai; d'Alché-Buc, Florence; Böcker, Sebastian; Rousu, Juho

    2016-01-01

    Motivation: An important problematic of metabolomics is to identify metabolites using tandem mass spectrometry data. Machine learning methods have been proposed recently to solve this problem by predicting molecular fingerprint vectors and matching these fingerprints against existing molecular structure databases. In this work we propose to address the metabolite identification problem using a structured output prediction approach. This type of approach is not limited to vector output space and can handle structured output space such as the molecule space. Results: We use the Input Output Kernel Regression method to learn the mapping between tandem mass spectra and molecular structures. The principle of this method is to encode the similarities in the input (spectra) space and the similarities in the output (molecule) space using two kernel functions. This method approximates the spectra-molecule mapping in two phases. The first phase corresponds to a regression problem from the input space to the feature space associated to the output kernel. The second phase is a preimage problem, consisting in mapping back the predicted output feature vectors to the molecule space. We show that our approach achieves state-of-the-art accuracy in metabolite identification. Moreover, our method has the advantage of decreasing the running times for the training step and the test step by several orders of magnitude over the preceding methods. Availability and implementation: Contact: celine.brouard@aalto.fi Supplementary information: Supplementary data are available at Bioinformatics online. PMID:27307628

  6. Ensemble learning of inverse probability weights for marginal structural modeling in large observational datasets

    PubMed Central

    Gruber, Susan; Logan, Roger W.; Jarrín, Inmaculada; Monge, Susana; Hernán, Miguel A.

    2014-01-01

    Inverse probability weights used to fit marginal structural models are typically estimated using logistic regression. However a data-adaptive procedure may be able to better exploit information available in measured covariates. By combining predictions from multiple algorithms, ensemble learning offers an alternative to logistic regression modeling to further reduce bias in estimated marginal structural model parameters. We describe the application of two ensemble learning approaches to estimating stabilized weights: super learning (SL), an ensemble machine learning approach that relies on V -fold cross validation, and an ensemble learner (EL) that creates a single partition of the data into training and validation sets. Longitudinal data from two multicenter cohort studies in Spain (CoRIS and CoRIS-MD) were analyzed to estimate the mortality hazard ratio for initiation versus no initiation of combined antiretroviral therapy among HIV positive subjects. Both ensemble approaches produced hazard ratio estimates further away from the null, and with tighter confidence intervals, than logistic regression modeling. Computation time for EL was less than half that of SL. We conclude that ensemble learning using a library of diverse candidate algorithms offers an alternative to parametric modeling of inverse probability weights when fitting marginal structural models. With large datasets, EL provides a rich search over the solution space in less time than SL with comparable results. PMID:25316152

  7. A Powerful Test for Comparing Multiple Regression Functions.

    PubMed

    Maity, Arnab

    2012-09-01

    In this article, we address the important problem of comparison of two or more population regression functions. Recently, Pardo-Fernández, Van Keilegom and González-Manteiga (2007) developed test statistics for simple nonparametric regression models: Y(ij) = θ(j)(Z(ij)) + σ(j)(Z(ij))∊(ij), based on empirical distributions of the errors in each population j = 1, … , J. In this paper, we propose a test for equality of the θ(j)(·) based on the concept of generalized likelihood ratio type statistics. We also generalize our test for other nonparametric regression setups, e.g, nonparametric logistic regression, where the loglikelihood for population j is any general smooth function [Formula: see text]. We describe a resampling procedure to obtain the critical values of the test. In addition, we present a simulation study to evaluate the performance of the proposed test and compare our results to those in Pardo-Fernández et al. (2007).

  8. Understanding poisson regression.

    PubMed

    Hayat, Matthew J; Higgins, Melinda

    2014-04-01

    Nurse investigators often collect study data in the form of counts. Traditional methods of data analysis have historically approached analysis of count data either as if the count data were continuous and normally distributed or with dichotomization of the counts into the categories of occurred or did not occur. These outdated methods for analyzing count data have been replaced with more appropriate statistical methods that make use of the Poisson probability distribution, which is useful for analyzing count data. The purpose of this article is to provide an overview of the Poisson distribution and its use in Poisson regression. Assumption violations for the standard Poisson regression model are addressed with alternative approaches, including addition of an overdispersion parameter or negative binomial regression. An illustrative example is presented with an application from the ENSPIRE study, and regression modeling of comorbidity data is included for illustrative purposes. Copyright 2014, SLACK Incorporated.

  9. Comparison and Contrast of Two General Functional Regression Modeling Frameworks

    PubMed Central

    Morris, Jeffrey S.

    2017-01-01

    In this article, Greven and Scheipl describe an impressively general framework for performing functional regression that builds upon the generalized additive modeling framework. Over the past number of years, my collaborators and I have also been developing a general framework for functional regression, functional mixed models, which shares many similarities with this framework, but has many differences as well. In this discussion, I compare and contrast these two frameworks, to hopefully illuminate characteristics of each, highlighting their respecitve strengths and weaknesses, and providing recommendations regarding the settings in which each approach might be preferable. PMID:28736502

  10. Comparison and Contrast of Two General Functional Regression Modeling Frameworks.

    PubMed

    Morris, Jeffrey S

    2017-02-01

    In this article, Greven and Scheipl describe an impressively general framework for performing functional regression that builds upon the generalized additive modeling framework. Over the past number of years, my collaborators and I have also been developing a general framework for functional regression, functional mixed models, which shares many similarities with this framework, but has many differences as well. In this discussion, I compare and contrast these two frameworks, to hopefully illuminate characteristics of each, highlighting their respecitve strengths and weaknesses, and providing recommendations regarding the settings in which each approach might be preferable.

  11. Measurement of DSM-5 section II personality disorder constructs using the MMPI-2-RF in clinical and forensic samples.

    PubMed

    Anderson, Jaime L; Sellbom, Martin; Pymont, Carly; Smid, Wineke; De Saeger, Hilde; Kamphuis, Jan H

    2015-09-01

    In the current study, we evaluated the associations between the Minnesota Multiphasic Personality Inventory-2 Restructured Form (MMPI-2-RF; Ben-Porath & Tellegen, 2008) scale scores and the Diagnostic and Statistical Manual of Mental Disorders (5th ed.; DSM-5; American Psychiatric Association, 2013) Section II personality disorder (PD) criterion counts in inpatient and forensic psychiatric samples from The Netherlands using structured clinical interviews to operationalize PDs. The inpatient psychiatric sample included 190 male and female patients and the forensic sample included 162 male psychiatric patients. We conducted correlation and count regression analyses to evaluate the utility of relevant MMPI-2-RF scales in predicting PD criterion count scores. Generally, results from these analyses emerged as conceptually expected and provided evidence that MMPI-2-RF scales can be useful in assessing PDs. At the zero-order level, most hypothesized associations between Section II disorders and MMPI-2-RF scales were supported. Similarly, in the regression analyses, a unique set of predictors emerged for each PD that was generally in line with conceptual expectations. Additionally, the results provided general evidence that PDs can be captured by dimensional psychopathology constructs, which has implications for both DSM-5 Section III specifically and the personality psychopathology literature more broadly. (c) 2015 APA, all rights reserved.

  12. Adjustments to de Leva-anthropometric regression data for the changes in body proportions in elderly humans.

    PubMed

    Ho Hoang, Khai-Long; Mombaur, Katja

    2015-10-15

    Dynamic modeling of the human body is an important tool to investigate the fundamentals of the biomechanics of human movement. To model the human body in terms of a multi-body system, it is necessary to know the anthropometric parameters of the body segments. For young healthy subjects, several data sets exist that are widely used in the research community, e.g. the tables provided by de Leva. None such comprehensive anthropometric parameter sets exist for elderly people. It is, however, well known that body proportions change significantly during aging, e.g. due to degenerative effects in the spine, such that parameters for young people cannot be used for realistically simulating the dynamics of elderly people. In this study, regression equations are derived from the inertial parameters, center of mass positions, and body segment lengths provided by de Leva to be adjustable to the changes in proportion of the body parts of male and female humans due to aging. Additional adjustments are made to the reference points of the parameters for the upper body segments as they are chosen in a more practicable way in the context of creating a multi-body model in a chain structure with the pelvis representing the most proximal segment. Copyright © 2015 Elsevier Ltd. All rights reserved.

  13. Simultaneous Nonrigid Registration, Segmentation, and Tumor Detection in MRI Guided Cervical Cancer Radiation Therapy

    PubMed Central

    Lu, Chao; Chelikani, Sudhakar; Jaffray, David A.; Milosevic, Michael F.; Staib, Lawrence H.; Duncan, James S.

    2013-01-01

    External beam radiation therapy (EBRT) for the treatment of cancer enables accurate placement of radiation dose on the cancerous region. However, the deformation of soft tissue during the course of treatment, such as in cervical cancer, presents significant challenges for the delineation of the target volume and other structures of interest. Furthermore, the presence and regression of pathologies such as tumors may violate registration constraints and cause registration errors. In this paper, automatic segmentation, nonrigid registration and tumor detection in cervical magnetic resonance (MR) data are addressed simultaneously using a unified Bayesian framework. The proposed novel method can generate a tumor probability map while progressively identifying the boundary of an organ of interest based on the achieved nonrigid transformation. The method is able to handle the challenges of significant tumor regression and its effect on surrounding tissues. The new method was compared to various currently existing algorithms on a set of 36 MR data from six patients, each patient has six T2-weighted MR cervical images. The results show that the proposed approach achieves an accuracy comparable to manual segmentation and it significantly outperforms the existing registration algorithms. In addition, the tumor detection result generated by the proposed method has a high agreement with manual delineation by a qualified clinician. PMID:22328178

  14. The association between dietary lignans, phytoestrogen-rich foods, and fiber intake and postmenopausal breast cancer risk: a German case-control study.

    PubMed

    Zaineddin, Aida Karina; Buck, Katharina; Vrieling, Alina; Heinz, Judith; Flesch-Janys, Dieter; Linseisen, Jakob; Chang-Claude, Jenny

    2012-01-01

    Phytoestrogens are structurally similar to estrogens and may affect breast cancer risk by mimicking estrogenic/antiestrogenic properties. In Western societies, whole grains and possibly soy foods are rich sources of phytoestrogens. A population-based case-control study in German postmenopausal women was used to evaluate the association of phytoestrogen-rich foods and dietary lignans with breast cancer risk. Dietary data were collected from 2,884 cases and 5,509 controls using a validated food-frequency questionnaire, which included additional questions phytoestrogen-rich foods. Associations were assessed using conditional logistic regression. All analyses were adjusted for relevant risk and confounding factors. Polytomous logistic regression analysis was performed to evaluate the associations by estrogen receptor (ER) status. High and low consumption of soybeans as well as of sunflower and pumpkin seeds were associated with significantly reduced breast cancer risk compared to no consumption (OR = 0.83, 95% CI = 0.70-0.97; and OR = 0.66, 95% CI = 0.77-0.97, respectively). The observed associations were not differential by ER status. No statistically significant associations were found for dietary intake of plant lignans, fiber, or the calculated enterolignans. Our results provide evidence for a reduced postmenopausal breast cancer risk associated with increased consumption of sunflower and pumpkin seeds and soybeans.

  15. Social networks and well-being: a comparison of older people in Mediterranean and non-Mediterranean countries.

    PubMed

    Litwin, Howard

    2010-09-01

    This study examined whether the social networks of older persons in Mediterranean and non-Mediterranean countries were appreciably different and whether they functioned in similar ways in relation to well-being outcomes. The sample included family household respondents aged 60 years and older from the first wave of the Survey of Health, Ageing and Retirement in Europe in 5 Mediterranean (n = 3,583) and 7 non-Mediterranean (n = 5,471) countries. Region was regressed separately by gender on variables from 4 network domains: structure and interaction, exchange, engagement and relationship quality, and controlling for background and health characteristics. In addition, 2 well-being outcomes-depressive symptoms and perceived income inadequacy-were regressed on the study variables, including regional social network interaction terms. The results revealed differences across the 2 regional settings in each of the realms of social network, above and beyond the differences that exist in background characteristics and health status. The findings also showed that the social network variables had different effects on the well-being outcomes in the respective settings. The findings underscore that the social network phenomenon is contextually bound. The social networks of older people should be seen within their unique regional milieu and in relation to the values and social norms that prevail in different sets of societies.

  16. Advancing paternal age at birth is associated with poorer social functioning earlier and later in life of schizophrenia patients in a founder population.

    PubMed

    Liebenberg, Rudolf; van Heerden, Brigitte; Ehlers, René; Du Plessis, Anna M E; Roos, J Louw

    2016-09-30

    Consistent associations have been found between advanced paternal age and an increased risk of psychiatric disorders, such as schizophrenia, in their offspring. This increase appears to be linear as paternal age increases. The present study investigates the relationship between early deviant behaviour in the first 10 years of life of patients as well as longer term functional outcome and paternal age in sporadic Afrikaner founder population cases of schizophrenia. This might improve our understanding of Paternal Age-Related Schizophrenia (PARS). Follow-up psychiatric diagnoses were confirmed by the Diagnostic Interview for Genetic Studies (DIGS). An early deviant childhood behaviour semi-structured questionnaire and the Specific Level of Functioning Assessment (SLOF) were completed. From the logistic regression models fitted, a significant negative relationship was found between paternal age at birth and social dysfunction as early deviant behaviour. Additionally, regression analysis revealed a significant negative relationship between paternal age at birth and the SLOF for interpersonal relationships later in life. Early social dysfunction may represent a phenotypic trait for PARS. Further research is required to understand the relationship between early social dysfunction and deficits in interpersonal relationships later in life. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.

  17. Development of a data driven process-based model for remote sensing of terrestrial ecosystem productivity, evapotranspiration, and above-ground biomass

    NASA Astrophysics Data System (ADS)

    El Masri, Bassil

    2011-12-01

    Modeling terrestrial ecosystem functions and structure has been a subject of increasing interest because of the importance of the terrestrial carbon cycle in global carbon budget and climate change. In this study, satellite data were used to estimate gross primary production (GPP), evapotranspiration (ET) for two deciduous forests: Morgan Monroe State forest (MMSF) in Indiana and Harvard forest in Massachusetts. Also, above-ground biomass (AGB) was estimated for the MMSF and the Howland forest (mixed forest) in Maine. Surface reflectance and temperature, vegetation indices, soil moisture, tree height and canopy area derived from the Moderate Resolution Imagining Spectroradiometer (MODIS), the Advanced Microwave Scanning Radiometer (AMRS-E), LIDAR, and aerial imagery respectively, were used for this purpose. These variables along with others derived from remotely sensed data were used as inputs variables to process-based models which estimated GPP and ET and to a regression model which estimated AGB. The process-based models were BIOME-BGC and the Penman-Monteith equation. Measured values for the carbon and water fluxes obtained from the Eddy covariance flux tower were compared to the modeled GPP and ET. The data driven methods produced good estimation of GPP and ET with an average root mean square error (RMSE) of 0.17 molC/m2 and 0.40 mm/day, respectively for the MMSF and the Harvard forest. In addition, allometric data for the MMSF were used to develop the regression model relating AGB with stem volume. The performance of the AGB regression model was compared to site measurements using remotely sensed data for the MMSF and the Howland forest where the model AGB RMSE ranged between 2.92--3.30 Kg C/m2. Sensitivity analysis revealed that improvement in maintenance respiration estimation and remotely sensed maximum photosynthetic activity as well as accurate estimate of canopy resistance will result in improved GPP and ET predictions. Moreover, AGB estimates were found to decrease as large grid size is used in rasterizing LIDAR return points. The analysis suggested that this methodology could be used as an operational procedure for monitoring changes in terrestrial ecosystem functions and structure brought by environmental changes.

  18. Structural, Linguistic and Topic Variables in Verbal and Computational Problems in Elementary Mathematics.

    ERIC Educational Resources Information Center

    Beardslee, Edward C.; Jerman, Max E.

    Five structural, four linguistic and twelve topic variables are used in regression analyses on results of a 50-item achievement test. The test items are related to 12 topics from the third-grade mathematics curriculum. The items reflect one of two cases of the structural variable, cognitive level; the two levels are characterized, inductive…

  19. Comparison of mean deviation with AGIS and CIGTS scores in association with structural parameters in glaucomatous eyes.

    PubMed

    Naka, Maiko; Kanamori, Akiyasu; Tatsumi, Yasuko; Fujioka, Miyuki; Nagai-Kusuhara, Azusa; Nakamura, Makoto; Negi, Akira

    2009-01-01

    To evaluate which of the 3 clinically used visual field indices including mean deviation (MD), Advanced Glaucoma Intervention Study (AGIS) score, and Collaborative Initial Glaucoma Treatment Study (CIGTS) score are best in evaluating functional damage of glaucomatous optic neuropathy. In 213 glaucomatous eyes, peripapillary retinal nerve fiber layer thickness (RNFLT) and optic disc configuration were measured with Stratus optical coherence tomography and Heidelberg Retina Tomograph-2, respectively. Visual field was measured with standard automated perimetry 30-2. Correlations of the structural parameters compared with the 3 VF indices using second polynomial regression were calculated. In addition, these correlations were analyzed among eyes of 3 different stages of glaucoma, as defined by MD score (early, MD> or =-6 dB; moderate, -12 dB< or =MD<-6 dB; advanced, MD<-12 dB). Among structure-function relationships in all subjects, the highest correlation determination (R) was MD with RNFLT (=0.298). CIGTS score showed better R than MD or AGIS score with rim area, but these values were not higher than any R with RNFLT. In analyses of 3 groups depending on MD, statistically significant structure-function correlations were observed only in patients with an advanced stage. No clear difference was found among MD and AGIS/CIGTS scores in expressing functional damage of glaucomatous eyes. MD is suggested to be no worse than others in monitoring glaucoma in clinical setting.

  20. Reef flattening effects on total richness and species responses in the Caribbean.

    PubMed

    Newman, Steven P; Meesters, Erik H; Dryden, Charlie S; Williams, Stacey M; Sanchez, Cristina; Mumby, Peter J; Polunin, Nicholas V C

    2015-11-01

    There has been ongoing flattening of Caribbean coral reefs with the loss of habitat having severe implications for these systems. Complexity and its structural components are important to fish species richness and community composition, but little is known about its role for other taxa or species-specific responses. This study reveals the importance of reef habitat complexity and structural components to different taxa of macrofauna, total species richness, and individual coral and fish species in the Caribbean. Species presence and richness of different taxa were visually quantified in one hundred 25-m(2) plots in three marine reserves in the Caribbean. Sampling was evenly distributed across five levels of visually estimated reef complexity, with five structural components also recorded: the number of corals, number of large corals, slope angle, maximum sponge and maximum octocoral height. Taking advantage of natural heterogeneity in structural complexity within a particular coral reef habitat (Orbicella reefs) and discrete environmental envelope, thus minimizing other sources of variability, the relative importance of reef complexity and structural components was quantified for different taxa and individual fish and coral species on Caribbean coral reefs using boosted regression trees (BRTs). Boosted regression tree models performed very well when explaining variability in total (82·3%), coral (80·6%) and fish species richness (77·3%), for which the greatest declines in richness occurred below intermediate reef complexity levels. Complexity accounted for very little of the variability in octocorals, sponges, arthropods, annelids or anemones. BRTs revealed species-specific variability and importance for reef complexity and structural components. Coral and fish species occupancy generally declined at low complexity levels, with the exception of two coral species (Pseudodiploria strigosa and Porites divaricata) and four fish species (Halichoeres bivittatus, H. maculipinna, Malacoctenus triangulatus and Stegastes partitus) more common at lower reef complexity levels. A significant interaction between country and reef complexity revealed a non-additive decline in species richness in areas of low complexity and the reserve in Puerto Rico. Flattening of Caribbean coral reefs will result in substantial species losses, with few winners. Individual structural components have considerable value to different species, and their loss may have profound impacts on population responses of coral and fish due to identity effects of key species, which underpin population richness and resilience and may affect essential ecosystem processes and services. © 2015 The Authors. Journal of Animal Ecology © 2015 British Ecological Society.

  1. Predicting beta-turns in proteins using support vector machines with fractional polynomials

    PubMed Central

    2013-01-01

    Background β-turns are secondary structure type that have essential role in molecular recognition, protein folding, and stability. They are found to be the most common type of non-repetitive structures since 25% of amino acids in protein structures are situated on them. Their prediction is considered to be one of the crucial problems in bioinformatics and molecular biology, which can provide valuable insights and inputs for the fold recognition and drug design. Results We propose an approach that combines support vector machines (SVMs) and logistic regression (LR) in a hybrid prediction method, which we call (H-SVM-LR) to predict β-turns in proteins. Fractional polynomials are used for LR modeling. We utilize position specific scoring matrices (PSSMs) and predicted secondary structure (PSS) as features. Our simulation studies show that H-SVM-LR achieves Qtotal of 82.87%, 82.84%, and 82.32% on the BT426, BT547, and BT823 datasets respectively. These values are the highest among other β-turns prediction methods that are based on PSSMs and secondary structure information. H-SVM-LR also achieves favorable performance in predicting β-turns as measured by the Matthew's correlation coefficient (MCC) on these datasets. Furthermore, H-SVM-LR shows good performance when considering shape strings as additional features. Conclusions In this paper, we present a comprehensive approach for β-turns prediction. Experiments show that our proposed approach achieves better performance compared to other competing prediction methods. PMID:24565438

  2. Predicting beta-turns in proteins using support vector machines with fractional polynomials.

    PubMed

    Elbashir, Murtada; Wang, Jianxin; Wu, Fang-Xiang; Wang, Lusheng

    2013-11-07

    β-turns are secondary structure type that have essential role in molecular recognition, protein folding, and stability. They are found to be the most common type of non-repetitive structures since 25% of amino acids in protein structures are situated on them. Their prediction is considered to be one of the crucial problems in bioinformatics and molecular biology, which can provide valuable insights and inputs for the fold recognition and drug design. We propose an approach that combines support vector machines (SVMs) and logistic regression (LR) in a hybrid prediction method, which we call (H-SVM-LR) to predict β-turns in proteins. Fractional polynomials are used for LR modeling. We utilize position specific scoring matrices (PSSMs) and predicted secondary structure (PSS) as features. Our simulation studies show that H-SVM-LR achieves Qtotal of 82.87%, 82.84%, and 82.32% on the BT426, BT547, and BT823 datasets respectively. These values are the highest among other β-turns prediction methods that are based on PSSMs and secondary structure information. H-SVM-LR also achieves favorable performance in predicting β-turns as measured by the Matthew's correlation coefficient (MCC) on these datasets. Furthermore, H-SVM-LR shows good performance when considering shape strings as additional features. In this paper, we present a comprehensive approach for β-turns prediction. Experiments show that our proposed approach achieves better performance compared to other competing prediction methods.

  3. Structural and cognitive social capital and depression among older adults in two Nordic regions.

    PubMed

    Forsman, A K; Nyqvist, F; Schierenbeck, I; Gustafson, Y; Wahlbeck, K

    2012-01-01

    To study the association between structural and cognitive aspects of social capital and depression among older adults in two Nordic regions. Data were retrieved from a postal survey targeting older adults aged 65, 70, 75 and 80 years (N=6 838, response rate=64%) residing in the Västerbotten region (Sweden), and the Österbotten region (Finland) in 2010. The associations between structural (measured by frequency of social contact with friends and neighbours) and cognitive (measured by experienced trust in friends and neighbours) aspects of social capital and depression (measured by Geriatric Depression Scale, GDS-4) were tested by logistic regression analyses. Both low structural and cognitive social capital as defined in the study showed statistically significant associations with depression in older adults. Only experienced trust in neighbours failed to show significant association with depression. In addition, being single and being 80 years of age indicated a higher risk of depression as defined by GDS-4. The findings underline the connection between adequate levels of both structural and cognitive individual social capital and mental health in later life. They also suggest that the connection differs depending on various network types; the cognitive aspect of relationships between friends was connected to depression, while the connection was not found for neighbours. Further, the oldest age group in the sample (80 years of age) is pointed out as a population especially vulnerable for depression that should not be overlooked in mental health promotion and depression prevention.

  4. Regression techniques for oceanographic parameter retrieval using space-borne microwave radiometry

    NASA Technical Reports Server (NTRS)

    Hofer, R.; Njoku, E. G.

    1981-01-01

    Variations of conventional multiple regression techniques are applied to the problem of remote sensing of oceanographic parameters from space. The techniques are specifically adapted to the scanning multichannel microwave radiometer (SMRR) launched on the Seasat and Nimbus 7 satellites to determine ocean surface temperature, wind speed, and atmospheric water content. The retrievals are studied primarily from a theoretical viewpoint, to illustrate the retrieval error structure, the relative importances of different radiometer channels, and the tradeoffs between spatial resolution and retrieval accuracy. Comparisons between regressions using simulated and actual SMMR data are discussed; they show similar behavior.

  5. The use of cognitive ability measures as explanatory variables in regression analysis.

    PubMed

    Junker, Brian; Schofield, Lynne Steuerle; Taylor, Lowell J

    2012-12-01

    Cognitive ability measures are often taken as explanatory variables in regression analysis, e.g., as a factor affecting a market outcome such as an individual's wage, or a decision such as an individual's education acquisition. Cognitive ability is a latent construct; its true value is unobserved. Nonetheless, researchers often assume that a test score , constructed via standard psychometric practice from individuals' responses to test items, can be safely used in regression analysis. We examine problems that can arise, and suggest that an alternative approach, a "mixed effects structural equations" (MESE) model, may be more appropriate in many circumstances.

  6. Using temporal ICA to selectively remove global noise while preserving global signal in functional MRI data.

    PubMed

    Glasser, Matthew F; Coalson, Timothy S; Bijsterbosch, Janine D; Harrison, Samuel J; Harms, Michael P; Anticevic, Alan; Van Essen, David C; Smith, Stephen M

    2018-06-02

    Temporal fluctuations in functional Magnetic Resonance Imaging (fMRI) have been profitably used to study brain activity and connectivity for over two decades. Unfortunately, fMRI data also contain structured temporal "noise" from a variety of sources, including subject motion, subject physiology, and the MRI equipment. Recently, methods have been developed to automatically and selectively remove spatially specific structured noise from fMRI data using spatial Independent Components Analysis (ICA) and machine learning classifiers. Spatial ICA is particularly effective at removing spatially specific structured noise from high temporal and spatial resolution fMRI data of the type acquired by the Human Connectome Project and similar studies. However, spatial ICA is mathematically, by design, unable to separate spatially widespread "global" structured noise from fMRI data (e.g., blood flow modulations from subject respiration). No methods currently exist to selectively and completely remove global structured noise while retaining the global signal from neural activity. This has left the field in a quandary-to do or not to do global signal regression-given that both choices have substantial downsides. Here we show that temporal ICA can selectively segregate and remove global structured noise while retaining global neural signal in both task-based and resting state fMRI data. We compare the results before and after temporal ICA cleanup to those from global signal regression and show that temporal ICA cleanup removes the global positive biases caused by global physiological noise without inducing the network-specific negative biases of global signal regression. We believe that temporal ICA cleanup provides a "best of both worlds" solution to the global signal and global noise dilemma and that temporal ICA itself unlocks interesting neurobiological insights from fMRI data. Copyright © 2018 Elsevier Inc. All rights reserved.

  7. Structural Abnormalities on Magnetic Resonance Imaging in Patients With Patellofemoral Pain: A Cross-sectional Case-Control Study.

    PubMed

    van der Heijden, Rianne A; de Kanter, Janneke L M; Bierma-Zeinstra, Sita M A; Verhaar, Jan A N; van Veldhoven, Peter L J; Krestin, Gabriel P; Oei, Edwin H G; van Middelkoop, Marienke

    2016-09-01

    Structural abnormalities of the patellofemoral joint might play a role in the pathogenesis of patellofemoral pain (PFP), a common knee problem among young and physically active individuals. No previous study has investigated if PFP is associated with structural abnormalities of the patellofemoral joint using high-resolution magnetic resonance imaging (MRI). To investigate the presence of structural abnormalities of the patellofemoral joint on high-resolution MRI in patients with PFP compared with healthy control subjects. Cross-sectional study; Level of evidence, 3. Patients with PFP and healthy control subjects between 14 and 40 years of age underwent high-resolution 3-T MRI. All images were scored using the Magnetic Resonance Imaging Osteoarthritis Knee Score with the addition of specific patellofemoral features. Associations between PFP and the presence of structural abnormalities were analyzed using logistic regression analyses adjusted for age, body mass index (BMI), sex, and sports participation. A total of 64 patients and 70 control subjects were included in the study. Mean ± SD age was 23.2 ± 6.4 years, mean BMI ± SD was 22.9 ± 3.4 kg/m(2), and 56.7% were female. Full-thickness cartilage loss was not present. Minor patellar cartilage defects, patellar bone marrow lesions, and high signal intensity of the Hoffa fat pad were frequently seen in both patients (23%, 53%, and 58%, respectively) and control subjects (21%, 51%, and 51%, respectively). After adjustment for age, BMI, sex, and sports participation, none of the structural abnormalities were statistically significantly associated with PFP. Structural abnormalities of the patellofemoral joint have been hypothesized as a factor in the pathogenesis of PFP, but the study findings suggest that structural abnormalities of the patellofemoral joint on MRI are not associated with PFP. © 2016 The Author(s).

  8. The effects of organizational commitment and structural empowerment on patient safety culture.

    PubMed

    Horwitz, Sujin K; Horwitz, Irwin B

    2017-03-20

    Purpose The purpose of this paper is to investigate the relationship between patient safety culture and two attitudinal constructs: affective organizational commitment and structural empowerment. In doing so, the main and interaction effects of the two constructs on the perception of patient safety culture were assessed using a cohort of physicians. Design/methodology/approach Affective commitment was measured with the Organizational Commitment Questionnaire, whereas structural empowerment was assessed with the Conditions of Work Effectiveness Questionnaire-II. The abbreviated versions of these surveys were administered to a cohort of 71 post-doctoral medical residents. For the data analysis, hierarchical regression analyses were performed for the main and interaction effects of affective commitment and structural empowerment on the perception of patient safety culture. Findings A total of 63 surveys were analyzed. The results revealed that both affective commitment and structural empowerment were positively related to patient safety culture. A potential interaction effect of the two attitudinal constructs on patient safety culture was tested but no such effect was detected. Research limitations/implications This study suggests that there are potential benefits of promoting affective commitment and structural empowerment for patient safety culture in health care organizations. By identifying the positive associations between the two constructs and patient safety culture, this study provides additional empirical support for Kanter's theoretical tenet that structural and organizational support together helps to shape the perceptions of patient safety culture. Originality/value Despite the wide recognition of employee empowerment and commitment in organizational research, there has still been a paucity of empirical studies specifically assessing their effects on patient safety culture in health care organizations. To the authors' knowledge, this study is the first empirical study to examine the relationship between structural empowerment as proposed by Kanter and the culture of patient safety using physicians.

  9. Additive effects prevail: The response of biota to multiple stressors in an intensively monitored watershed.

    PubMed

    Gieswein, Alexander; Hering, Daniel; Feld, Christian K

    2017-09-01

    Freshwater ecosystems are impacted by a range of stressors arising from diverse human-caused land and water uses. Identifying the relative importance of single stressors and understanding how multiple stressors interact and jointly affect biology is crucial for River Basin Management. This study addressed multiple human-induced stressors and their effects on the aquatic flora and fauna based on data from standard WFD monitoring schemes. For altogether 1095 sites within a mountainous catchment, we used 12 stressor variables covering three different stressor groups: riparian land use, physical habitat quality and nutrient enrichment. Twenty-one biological metrics calculated from taxa lists of three organism groups (fish, benthic invertebrates and aquatic macrophytes) served as response variables. Stressor and response variables were subjected to Boosted Regression Tree (BRT) analysis to identify stressor hierarchy and stressor interactions and subsequently to Generalised Linear Regression Modelling (GLM) to quantify the stressors standardised effect size. Our results show that riverine habitat degradation was the dominant stressor group for the river fauna, notably the bed physical habitat structure. Overall, the explained variation in benthic invertebrate metrics was higher than it was in fish and macrophyte metrics. In particular, general integrative (aggregate) metrics such as % Ephemeroptera, Plecoptera and Trichoptera (EPT) taxa performed better than ecological traits (e.g. % feeding types). Overall, additive stressor effects dominated, while significant and meaningful stressor interactions were generally rare and weak. We concluded that given the type of stressor and ecological response variables addressed in this study, river basin managers do not need to bother much about complex stressor interactions, but can focus on the prevailing stressors according to the hierarchy identified. Copyright © 2017 Elsevier B.V. All rights reserved.

  10. What Matters When Children Play: Influence of Social Cognitive Theory and Perceived Environment on Levels of Physical Activity Among Elementary-Aged Youth

    PubMed Central

    Harmon, Brook E.; Nigg, Claudio R.; Long, Camonia; Amato, Katie; Anwar, Mahabub-Ul; Kutchman, Eve; Anthamatten, Peter; Browning, Raymond C.; Brink, Lois; Hill, James O.

    2014-01-01

    Objectives Social Cognitive Theory (SCT) has often been used as a guide to predict and modify physical activity (PA) behavior. We assessed the ability of commonly investigated SCT variables and perceived school environment variables to predict PA among elementary students. We also examined differences in influences between Hispanic and non-Hispanic students. Design This analysis used baseline data collected from eight schools who participated in a four-year study of a combined school-day curriculum and environmental intervention. Methods Data were collected from 393 students. A 3-step linear regression was used to measure associations between PA level, SCT variables (self-efficacy, social support, enjoyment), and perceived environment variables (schoolyard structures, condition, equipment/supervision). Logistic regression assessed associations between variables and whether students met PA recommendations. Results School and sex explained 6% of the moderate-to-vigorous PA models' variation. SCT variables explained an additional 15% of the models' variation, with much of the model's predictive ability coming from self-efficacy and social support. Sex was more strongly associated with PA level among Hispanic students, while self-efficacy was more strongly associated among non-Hispanic students. Perceived environment variables contributed little to the models. Conclusions Our findings add to the literature on the influences of PA among elementary-aged students. The differences seen in the influence of sex and self-efficacy among non-Hispanic and Hispanic students suggests these are areas where PA interventions could be tailored to improve efficacy. Additional research is needed to understand if different measures of perceived environment or perceptions at different ages may better predict PA. PMID:24772004

  11. Bayesian Inference for the Stereotype Regression Model: Application to a Case-control Study of Prostate Cancer

    PubMed Central

    Ahn, Jaeil; Mukherjee, Bhramar; Banerjee, Mousumi; Cooney, Kathleen A.

    2011-01-01

    Summary The stereotype regression model for categorical outcomes, proposed by Anderson (1984) is nested between the baseline category logits and adjacent category logits model with proportional odds structure. The stereotype model is more parsimonious than the ordinary baseline-category (or multinomial logistic) model due to a product representation of the log odds-ratios in terms of a common parameter corresponding to each predictor and category specific scores. The model could be used for both ordered and unordered outcomes. For ordered outcomes, the stereotype model allows more flexibility than the popular proportional odds model in capturing highly subjective ordinal scaling which does not result from categorization of a single latent variable, but are inherently multidimensional in nature. As pointed out by Greenland (1994), an additional advantage of the stereotype model is that it provides unbiased and valid inference under outcome-stratified sampling as in case-control studies. In addition, for matched case-control studies, the stereotype model is amenable to classical conditional likelihood principle, whereas there is no reduction due to sufficiency under the proportional odds model. In spite of these attractive features, the model has been applied less, as there are issues with maximum likelihood estimation and likelihood based testing approaches due to non-linearity and lack of identifiability of the parameters. We present comprehensive Bayesian inference and model comparison procedure for this class of models as an alternative to the classical frequentist approach. We illustrate our methodology by analyzing data from The Flint Men’s Health Study, a case-control study of prostate cancer in African-American men aged 40 to 79 years. We use clinical staging of prostate cancer in terms of Tumors, Nodes and Metastatsis (TNM) as the categorical response of interest. PMID:19731262

  12. Hawaii StreamStats; a web application for defining drainage-basin characteristics and estimating peak-streamflow statistics

    USGS Publications Warehouse

    Rosa, Sarah N.; Oki, Delwyn S.

    2010-01-01

    Reliable estimates of the magnitude and frequency of floods are necessary for the safe and efficient design of roads, bridges, water-conveyance structures, and flood-control projects and for the management of flood plains and flood-prone areas. StreamStats provides a simple, fast, and reproducible method to define drainage-basin characteristics and estimate the frequency and magnitude of peak discharges in Hawaii?s streams using recently developed regional regression equations. StreamStats allows the user to estimate the magnitude of floods for streams where data from stream-gaging stations do not exist. Existing estimates of the magnitude and frequency of peak discharges in Hawaii can be improved with continued operation of existing stream-gaging stations and installation of additional gaging stations for areas where limited stream-gaging data are available.

  13. Responsive copolymers for enhanced petroleum recovery. Quarterly technical progress report, June 23--September 21, 1994

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    McCormick, C.; Hester, R.

    Summaries are given on the technical progress on three tasks of this project. Monomer and polymer synthesis discusses the preparation of 1(7-aminoheptyloxymethyl)naphthalene and poly(maleic anhydride-alt-ethyl vinyl ether). Task 2, Characterization of molecular structure, discusses terpolymer solution preparation, UV analysis, fluorescence analysis, low angle laser light scattering, and viscometry. The paper discusses the effects of hydrophobic groups, the effect of pH, the effect of electrolyte addition, and photophysical studies. Task 3, Solution properties, describes the factorial experimental design for characterizing polymer solutions by light scattering, the light scattering test model, orthogonal factorial test design, linear regression in coded space, confidence levelmore » for coded space test mode coefficients, coefficients of the real space test model, and surface analysis of the model equations.« less

  14. Survey datasets on women participation in green jobs in the construction industry.

    PubMed

    Afolabi, Adedeji O; Ojelabi, Rapheal A; Tunji-Olayeni, Patience F; Fagbenle, Olabosipo I; Mosaku, Timothy O

    2018-04-01

    The unique qualities of women can make them bearers of solutions towards achieving sustainability and dealing with the dangers attributed to climate change. The attitudinal study utilized a questionnaire instrument to obtain perception of female construction professionals. By using a well-structured questionnaire, data was obtained on women participating in green jobs in the construction Industry. Descriptive statistics is performed on the collected data and presented in tables and mean scores (MS). In addition, inferential statistics of categorical regression was performed on the data to determine the level of influence (beta factor) the identified barriers had on the level of participation in green jobs. Barriers and the socio-economic benefits which can guide policies and actions on attracting, retaining and exploring the capabilities of women in green jobs can be obtained from the survey data when analyzed.

  15. Health care worker exposures to the antibacterial agent triclosan.

    PubMed

    MacIsaac, Julia K; Gerona, Roy R; Blanc, Paul D; Apatira, Latifat; Friesen, Matthew W; Coppolino, Michael; Janssen, Sarah

    2014-08-01

    We sought to quantify absorption of triclosan, a potential endocrine disruptor, in health care workers with occupational exposure to soap containing this chemical. A cross-sectional convenience sample of two groups of 38 health care workers at separate inpatient medical centers: hospital 1 uses 0.3% triclosan soap in all patient care areas; hospital 2 does not use triclosan-containing products. Additional exposure to triclosan-containing personal care products was assessed through a structured questionnaire. Urine triclosan was quantified and the occupational contribution estimated through regression modeling. Occupational exposure accounted for an incremental triclosan burden of 206 ng/mL (P = 0.02), while triclosan-containing toothpaste use was associated with 146 ng/mL higher levels (P < 0.001). Use of triclosan-containing antibacterial soaps in health care settings represents a substantial and potentially biologically relevant source of occupational triclosan exposure.

  16. Messing Up Texas?: A Re-Analysis of the Effects of Executions on Homicides.

    PubMed

    Brandt, Patrick T; Kovandzic, Tomislav V

    2015-01-01

    Executions in Texas from 1994-2005 do not deter homicides, contrary to the results of Land et al. (2009). We find that using different models--based on pre-tests for unit roots that correct for earlier model misspecifications--one cannot reject the null hypothesis that executions do not lead to a change in homicides in Texas over this period. Using additional control variables, we show that variables such as the number of prisoners in Texas may drive the main drop in homicides over this period. Such conclusions however are highly sensitive to model specification decisions, calling into question the assumptions about fixed parameters and constant structural relationships. This means that using dynamic regressions to account for policy changes that may affect homicides need to be done with significant care and attention.

  17. Messing Up Texas?: A Re-Analysis of the Effects of Executions on Homicides

    PubMed Central

    Brandt, Patrick T.; Kovandzic, Tomislav V.

    2015-01-01

    Executions in Texas from 1994–2005 do not deter homicides, contrary to the results of Land et al. (2009). We find that using different models—based on pre-tests for unit roots that correct for earlier model misspecifications—one cannot reject the null hypothesis that executions do not lead to a change in homicides in Texas over this period. Using additional control variables, we show that variables such as the number of prisoners in Texas may drive the main drop in homicides over this period. Such conclusions however are highly sensitive to model specification decisions, calling into question the assumptions about fixed parameters and constant structural relationships. This means that using dynamic regressions to account for policy changes that may affect homicides need to be done with significant care and attention. PMID:26398193

  18. Covariance functions for body weight from birth to maturity in Nellore cows.

    PubMed

    Boligon, A A; Mercadante, M E Z; Forni, S; Lôbo, R B; Albuquerque, L G

    2010-03-01

    The objective of this study was to estimate (co)variance functions using random regression models on Legendre polynomials for the analysis of repeated measures of BW from birth to adult age. A total of 82,064 records from 8,145 females were analyzed. Different models were compared. The models included additive direct and maternal effects, and animal and maternal permanent environmental effects as random terms. Contemporary group and dam age at calving (linear and quadratic effect) were included as fixed effects, and orthogonal Legendre polynomials of animal age (cubic regression) were considered as random covariables. Eight models with polynomials of third to sixth order were used to describe additive direct and maternal effects, and animal and maternal permanent environmental effects. Residual effects were modeled using 1 (i.e., assuming homogeneity of variances across all ages) or 5 age classes. The model with 5 classes was the best to describe the trajectory of residuals along the growth curve. The model including fourth- and sixth-order polynomials for additive direct and animal permanent environmental effects, respectively, and third-order polynomials for maternal genetic and maternal permanent environmental effects were the best. Estimates of (co)variance obtained with the multi-trait and random regression models were similar. Direct heritability estimates obtained with the random regression models followed a trend similar to that obtained with the multi-trait model. The largest estimates of maternal heritability were those of BW taken close to 240 d of age. In general, estimates of correlation between BW from birth to 8 yr of age decreased with increasing distance between ages.

  19. Prevalence and risk factors of depressive disorder in caregivers of patients with head and neck cancer.

    PubMed

    Lee, Yu; Lin, Pao-Yen; Chien, Chih-Yen; Fang, Fu-Min

    2015-02-01

    The purpose of this study is to examine the prevalence and risk factors of depressive disorder in caregivers of patients with head and neck cancer. Study subjects were recruited from a multidisciplinary outpatient clinic for head and neck cancer in a medical center from February to July 2012. Caregivers of patients with head and neck cancer were enrolled and assessed using the Structured Clinical Interview for the DSM-IV, Clinician Version, the Short Form 36 Health Survey, and the Family APGAR index. The main aim of the study was to examine the difference in demographic data and clinical characteristics between the caregivers with and without depressive disorders. In addition, a stepwise forward model of logistic regression was used to test the possible risk factors. One hundred and forty-three caregivers were included in the study. The most prevalent psychiatric disorder was depressive disorder (14.7%), followed by adjustment disorder (13.3%). Nearly one-third of the caregivers had a psychiatric diagnosis. By using logistic regression analysis, it was found that unemployment (odds ratio (OR) = 3.16; 95% CI, 1.04-9.68), lower social functioning (OR = 1.43; 95% CI, 1.18-1.72), and lower educational level (OR = 1.16; 95% CI, 1.01-1.34) were significant risk factors for the depressive disorder. The clinical implication of our results is the value of using the standardized structured interview for early diagnosis of depressive disorder in caregivers of head and neck cancer patients. Early screening and management of depression in these caregivers will raise their quality of life and capability to care patients. Copyright © 2014 John Wiley & Sons, Ltd.

  20. Parametric Study of Shear Strength of Concrete Beams Reinforced with FRP Bars

    NASA Astrophysics Data System (ADS)

    Thomas, Job; Ramadass, S.

    2016-09-01

    Fibre Reinforced Polymer (FRP) bars are being widely used as internal reinforcement in structural elements in the last decade. The corrosion resistance of FRP bars qualifies its use in severe and marine exposure conditions in structures. A total of eight concrete beams longitudinally reinforced with FRP bars were cast and tested over shear span to depth ratio of 0.5 and 1.75. The shear strength test data of 188 beams published in various literatures were also used. The model originally proposed by Indian Standard Code of practice for the prediction of shear strength of concrete beams reinforced with steel bars IS:456 (Plain and reinforced concrete, code of practice, fourth revision. Bureau of Indian Standards, New Delhi, 2000) is considered and a modification to account for the influence of the FRP bars is proposed based on regression analysis. Out of the 196 test data, 110 test data is used for the regression analysis and 86 test data is used for the validation of the model. In addition, the shear strength of 86 test data accounted for the validation is assessed using eleven models proposed by various researchers. The proposed model accounts for compressive strength of concrete ( f ck ), modulus of elasticity of FRP rebar ( E f ), longitudinal reinforcement ratio ( ρ f ), shear span to depth ratio ( a/ d) and size effect of beams. The predicted shear strength of beams using the proposed model and 11 models proposed by other researchers is compared with the corresponding experimental results. The mean of predicted shear strength to the experimental shear strength for the 86 beams accounted for the validation of the proposed model is found to be 0.93. The result of the statistical analysis indicates that the prediction based on the proposed model corroborates with the corresponding experimental data.

  1. Predicting landslide vegetation in patches on landscape gradients in Puerto Rico

    USGS Publications Warehouse

    Myster, R.W.; Thomlinson, J.R.; Larsen, M.C.

    1997-01-01

    We explored the predictive value of common landscape characteristics for landslide vegetative stages in the Luquillo Experimental Forest of Puerto Rico using four different analyses. Maximum likelihood logistic regression showed that aspect, age, and substrate type could be used to predict vegetative structural stage. In addition it showed that the structural complexity of the vegetation was greater in landslides (1) facing the southeast (away from the dominant wind direction of recent hurricanes), (2) that were older, and (3) that had volcaniclastic rather than dioritic substrate. Multiple regression indicated that both elevation and age could be used to predict the current vegetation, and that vegetation complexity was greater both at lower elevation and in older landslides. Pearson product-moment correlation coefficients showed that (1) the presence of volcaniclastic substrate in landslides was negatively correlated with aspect, age, and elevation, (2) that road association and age were positively correlated, and (3) that slope was negatively correlated with area. Finally, principal components analysis showed that landslides were differentiated on axes defined primarily by age, aspect class, and elevation in the positive direction, and by volcaniclastic substrate in the negative direction. Because several statistical techniques indicated that age, aspect, elevation, and substrate were important in determining vegetation complexity on landslides, we conclude that landslide succession is influenced by variation in these landscape traits. In particular, we would expect to find more successional development on landslides which are older, face away from hurricane winds, are at lower elevation, and are on volcaniclastic substrate. Finally, our results lead into a hierarchical conceptual model of succession on landscapes where the biota respond first to either gradients or disturbance depending on their relative severity, and then to more local biotic mechanisms such as dispersal, predation and competition.

  2. Relations between continuous real-time physical properties and discrete water-quality constituents in the Little Arkansas River, south-central Kansas, 1998-2014

    USGS Publications Warehouse

    Rasmussen, Patrick P.; Eslick, Patrick J.; Ziegler, Andrew C.

    2016-08-11

    Water from the Little Arkansas River is used as source water for artificial recharge of the Equus Beds aquifer, one of the primary water-supply sources for the city of Wichita, Kansas. The U.S. Geological Survey has operated two continuous real-time water-quality monitoring stations since 1995 on the Little Arkansas River in Kansas. Regression models were developed to establish relations between discretely sampled constituent concentrations and continuously measured physical properties to compute concentrations of those constituents of interest. Site-specific regression models were originally published in 2000 for the near Halstead and near Sedgwick U.S. Geological Survey streamgaging stations and the site-specific regression models were then updated in 2003. This report updates those regression models using discrete and continuous data collected during May 1998 through August 2014. In addition to the constituents listed in the 2003 update, new regression models were developed for total organic carbon. The real-time computations of water-quality concentrations and loads are available at http://nrtwq.usgs.gov. The water-quality information in this report is important to the city of Wichita because water-quality information allows for real-time quantification and characterization of chemicals of concern (including chloride), in addition to nutrients, sediment, bacteria, and atrazine transported in the Little Arkansas River. The water-quality information in this report aids in the decision making for water treatment before artificial recharge.

  3. The Urban Heat Island Impact in Consideration of Spatial Pattern of Urban Landscape and Structure

    NASA Astrophysics Data System (ADS)

    Kim, J.; Lee, D. K.; Jeong, W.; Sung, S.; Park, J.

    2015-12-01

    Preceding study has established a clear relationship between land surface temperature and area of land covers. However, only few studies have specifically examined the effects of spatial patterns of land covers and urban structure. To examine how much the local climate is affected by the spatial pattern in highly urbanized city, we investigated the correlation between land surface temperature and spatial patterns of land covers. In the analysis of correlation, we categorized urban structure to four different land uses: Apartment residential area, low rise residential area, industrial area and central business district. Through this study, we aims to examine the types of residential structure and land cover pattern for reducing urban heat island and sustainable development. Based on land surface temperature, we investigated the phenomenon of urban heat island through using the data of remote sensing. This study focused on Daegu in Korea. This city, one of the hottest city in Korea has basin form. We used high-resolution land cover data and land surface temperature by using Landsat8 satellite image to examine 100 randomly selected sample sites of 884.15km2 (1)In each land use, we quantified several landscape-levels and class-level landscape metrics for the sample study sites. (2)In addition, we measured the land surface temperature in 3 year hot summer seasons (July to September). Then, we investigated the pattern of land surface temperature for each land use through Ecognition package. (3)We deducted the Pearson correlation coefficients between land surface temperature and each landscape metrics. (4)We analyzed the variance among the four land uses. (5)Using linear regression, we determined land surface temperature model for each land use. (6)Through this analysis, we aims to examine the best pattern of land cover and artificial structure for reducing urban heat island effect in highly urbanized city. The results of linear regression showed that proportional land cover of grass, tree, water and impervious surfaces well explained the temperature in apartment residential areas. In contrast, the changes in the pattern of water, grass, tree and impervious surfaces were the best to determine the temperature in low rise residential area, central business district and industrial area.

  4. Estimating Causal Effects with Ancestral Graph Markov Models

    PubMed Central

    Malinsky, Daniel; Spirtes, Peter

    2017-01-01

    We present an algorithm for estimating bounds on causal effects from observational data which combines graphical model search with simple linear regression. We assume that the underlying system can be represented by a linear structural equation model with no feedback, and we allow for the possibility of latent variables. Under assumptions standard in the causal search literature, we use conditional independence constraints to search for an equivalence class of ancestral graphs. Then, for each model in the equivalence class, we perform the appropriate regression (using causal structure information to determine which covariates to include in the regression) to estimate a set of possible causal effects. Our approach is based on the “IDA” procedure of Maathuis et al. (2009), which assumes that all relevant variables have been measured (i.e., no unmeasured confounders). We generalize their work by relaxing this assumption, which is often violated in applied contexts. We validate the performance of our algorithm on simulated data and demonstrate improved precision over IDA when latent variables are present. PMID:28217244

  5. Spectral-Spatial Shared Linear Regression for Hyperspectral Image Classification.

    PubMed

    Haoliang Yuan; Yuan Yan Tang

    2017-04-01

    Classification of the pixels in hyperspectral image (HSI) is an important task and has been popularly applied in many practical applications. Its major challenge is the high-dimensional small-sized problem. To deal with this problem, lots of subspace learning (SL) methods are developed to reduce the dimension of the pixels while preserving the important discriminant information. Motivated by ridge linear regression (RLR) framework for SL, we propose a spectral-spatial shared linear regression method (SSSLR) for extracting the feature representation. Comparing with RLR, our proposed SSSLR has the following two advantages. First, we utilize a convex set to explore the spatial structure for computing the linear projection matrix. Second, we utilize a shared structure learning model, which is formed by original data space and a hidden feature space, to learn a more discriminant linear projection matrix for classification. To optimize our proposed method, an efficient iterative algorithm is proposed. Experimental results on two popular HSI data sets, i.e., Indian Pines and Salinas demonstrate that our proposed methods outperform many SL methods.

  6. A Dosing Study of Bevacizumab for Retinopathy of Prematurity: Late Recurrences and Additional Treatments.

    PubMed

    Wallace, David K; Dean, Trevano W; Hartnett, Mary Elizabeth; Kong, Lingkun; Smith, Lois E; Hubbard, G Baker; McGregor, Mary Lou; Jordan, Catherine O; Mantagos, Iason S; Bell, Edward F; Kraker, Raymond T

    2018-06-07

    Intravitreal bevacizumab is increasingly used to treat severe retinopathy of prematurity (ROP), but it enters the bloodstream, and there is concern that it may alter development of other organs. Previously we reported short-term outcomes of 61 infants enrolled in a dose de-escalation study, and we report the late recurrences and additional treatments. Masked, multicenter, dose de-escalation study. A total of 61 premature infants with type 1 ROP. If type 1 ROP was bilateral at enrollment, then the study eye was randomly selected. In the study eye, bevacizumab intravitreal injections were given at de-escalating doses of 0.25 mg, 0.125 mg, 0.063 mg, or 0.031 mg; if needed, fellow eyes received 1 dose level higher: 0.625 mg, 0.25 mg, 0.125 mg, or 0.063 mg, respectively. After 4 weeks, additional treatment was at the discretion of the investigator. Early and late ROP recurrences, additional treatments, and structural outcomes after 6 months. Of 61 study eyes, 25 (41%; 95% confidence interval [CI], 29%-54%) received additional treatment: 3 (5%; 95% CI, 1%-14%) for early failure (within 4 weeks), 11 (18%; 95% CI, 9%-30%) for late recurrence of ROP (after 4 weeks), and 11 (18%; 95% CI, 9%-30%) for persistent avascular retina. Re-treatment for early failure or late recurrence occurred in 2 of 11 eyes (18%; 95% CI, 2%-52%) treated with 0.25 mg, 4 of 16 eyes (25%; 95% CI, 7%-52%) treated with 0.125 mg, 8 of 24 eyes (33%; 95% CI, 16%-55%) treated with 0.063 mg, and 0 (0%; 95% CI, 0%-31%) of 10 eyes treated with 0.031 mg. By 6 months corrected age, 56 of 61 study eyes had regression of ROP with normal posterior poles, 1 study eye had developed a Stage 5 retinal detachment, and 4 infants had died of preexisting medical conditions. Retinal structural outcomes are very good after low-dose bevacizumab treatment for ROP, although many eyes received additional treatment. Copyright © 2018 American Academy of Ophthalmology. Published by Elsevier Inc. All rights reserved.

  7. Predicting High Explosive Detonation Velocities from Their Composition and Structure

    DTIC Science & Technology

    1978-09-01

    for a gamut of ideal explosives. The explosives ranged from nitroaromatics, cyclic and linear nitramines, nitrate esters and nitro-nitrato...structure is postulated for a gamut of explosives. Since detonation velocity, DQ, is density dependent, the linear regression plot. Figure 1, of the

  8. Structure-activity relationships between sterols and their thermal stability in oil matrix.

    PubMed

    Hu, Yinzhou; Xu, Junli; Huang, Weisu; Zhao, Yajing; Li, Maiquan; Wang, Mengmeng; Zheng, Lufei; Lu, Baiyi

    2018-08-30

    Structure-activity relationships between 20 sterols and their thermal stabilities were studied in a model oil system. All sterol degradations were found to be consistent with a first-order kinetic model with determination of coefficient (R 2 ) higher than 0.9444. The number of double bonds in the sterol structure was negatively correlated with the thermal stability of sterol, whereas the length of the branch chain was positively correlated with the thermal stability of sterol. A quantitative structure-activity relationship (QSAR) model to predict thermal stability of sterol was developed by using partial least squares regression (PLSR) combined with genetic algorithm (GA). A regression model was built with R 2 of 0.806. Almost all sterol degradation constants can be predicted accurately with R 2 of cross-validation equals to 0.680. Four important variables were selected in optimal QSAR model and the selected variables were observed to be related with information indices, RDF descriptors, and 3D-MoRSE descriptors. Copyright © 2018 Elsevier Ltd. All rights reserved.

  9. Features structure of iron-bearing strata’s of the Bakchar deposit, Western Siberia

    NASA Astrophysics Data System (ADS)

    Asochakova, E. M.

    2017-12-01

    The ore-bearing strata’s of Bakchar deposit have complicated structural-textural heterogeneity and variable mineral composition. This deposit is one of the most promising areas of localization of sedimentary iron ore. The ore-bearing strata’s are composed mainly of sandstones (sometimes with ferruginous pebbles, less often conglomerates), siltstones and clays. The ironstones are classified according to their lithology and geochemistry into three types: goethite-hydrogoethitic oolitic, glauconite-chloritic and transitional (intermediate) type iron ores. The mineral composition includes many different minerals: terrigenous, authigenic and clayey. Ironstones are characterized by elevated concentrations of many rare and valuable metals present in them as trace elements, additionally alloying (Mn, V, Cr, Ti, Zr, Mo, etc.) and harmful impurities (S, As, Cu, Pb, Zn, P). There are prerequisites for the influence of numerous factors, such as prolonged transgression of the sea, swamping of paleo-river deltas, the appearance of a tectonic fracture zone associated with active bottom tectonics and unloading of catagenetic waters, regression and natural ore enrichment due to the re-washing of slightly-iron rocks. These factors are reflected in the structure of the ore-bearing strata in which rhythmic cycles of ore sedimentation with successive changes in them are distinguished by an association of different mineral composition.

  10. Thiol-Disulfide Exchange in Peptides Derived from Human Growth Hormone

    PubMed Central

    Chandrasekhar, Saradha; Epling, Daniel E.; Sophocleous, Andreas M.; Topp, Elizabeth M.

    2014-01-01

    Disulfide bonds stabilize proteins by crosslinking distant regions into a compact three-dimensional structure. They can also participate in hydrolytic and oxidative pathways to form non-native disulfide bonds and other reactive species. Such covalent modifications can contribute to protein aggregation. Here we present experimental data for the mechanism of thiol-disulfide exchange in tryptic peptides derived from human growth hormone in aqueous solution. Reaction kinetics were monitored to investigate the effect of pH (6.0-10.0), temperature (4-50 °C), oxidation suppressants (EDTA and N2 sparging) and peptide secondary structure (amide cyclized vs. open form). The concentrations of free thiol containing peptides, scrambled disulfides and native disulfide-linked peptides generated via thiol-disulfide exchange and oxidation reactions were determined using RP-HPLC and LC-MS. Concentration vs. time data were fitted to a mathematical model using non-linear least squares regression analysis. At all pH values, the model was able to fit the data with R2≥0.95. Excluding oxidation suppressants (EDTA and N2 sparging) resulted in an increase in the formation of scrambled disulfides via oxidative pathways but did not influence the intrinsic rate of thiol-disulfide exchange. In addition, peptide secondary structure was found to influence the rate of thiol-disulfide exchange. PMID:24549831

  11. Analysis of the streamflow-gaging station network in Ohio for effectiveness in providing regional streamflow information

    USGS Publications Warehouse

    Straub, D.E.

    1998-01-01

    The streamflow-gaging station network in Ohio was evaluated for its effectiveness in providing regional streamflow information. The analysis involved application of the principles of generalized least squares regression between streamflow and climatic and basin characteristics. Regression equations were developed for three flow characteristics: (1) the instantaneous peak flow with a 100-year recurrence interval (P100), (2) the mean annual flow (Qa), and (3) the 7-day, 10-year low flow (7Q10). All active and discontinued gaging stations with 5 or more years of unregulated-streamflow data with respect to each flow characteristic were used to develop the regression equations. The gaging-station network was evaluated for the current (1996) condition of the network and estimated conditions of various network strategies if an additional 5 and 20 years of streamflow data were collected. Any active or discontinued gaging station with (1) less than 5 years of unregulated-streamflow record, (2) previously defined basin and climatic characteristics, and (3) the potential for collection of more unregulated-streamflow record were included in the network strategies involving the additional 5 and 20 years of data. The network analysis involved use of the regression equations, in combination with location, period of record, and cost of operation, to determine the contribution of the data for each gaging station to regional streamflow information. The contribution of each gaging station was based on a cost-weighted reduction of the mean square error (average sampling-error variance) associated with each regional estimating equation. All gaging stations included in the network analysis were then ranked according to their contribution to the regional information for each flow characteristic. The predictive ability of the regression equations developed from the gaging station network could be improved for all three flow characteristics with the collection of additional streamflow data. The addition of new gaging stations to the network would result in an even greater improvement of the accuracy of the regional regression equations. Typically, continued data collection at stations with unregulated streamflow for all flow conditions that had less than 11 years of record with drainage areas smaller than 200 square miles contributed the largest cost-weighted reduction to the average sampling-error variance of the regional estimating equations. The results of the network analyses can be used to prioritize the continued operation of active gaging stations or the reactivation of discontinued gaging stations if the objective is to maximize the regional information content in the streamflow-gaging station network.

  12. Refractive regression after laser in situ keratomileusis.

    PubMed

    Yan, Mabel K; Chang, John Sm; Chan, Tommy Cy

    2018-04-26

    Uncorrected refractive errors are a leading cause of visual impairment across the world. In today's society, laser in situ keratomileusis (LASIK) has become the most commonly performed surgical procedure to correct refractive errors. However, regression of the initially achieved refractive correction has been a widely observed phenomenon following LASIK since its inception more than two decades ago. Despite technological advances in laser refractive surgery and various proposed management strategies, post-LASIK regression is still frequently observed and has significant implications for the long-term visual performance and quality of life of patients. This review explores the mechanism of refractive regression after both myopic and hyperopic LASIK, predisposing risk factors and its clinical course. In addition, current preventative strategies and therapies are also reviewed. © 2018 Royal Australian and New Zealand College of Ophthalmologists.

  13. On the design of classifiers for crop inventories

    NASA Technical Reports Server (NTRS)

    Heydorn, R. P.; Takacs, H. C.

    1986-01-01

    Crop proportion estimators that use classifications of satellite data to correct, in an additive way, a given estimate acquired from ground observations are discussed. A linear version of these estimators is optimal, in terms of minimum variance, when the regression of the ground observations onto the satellite observations in linear. When this regression is not linear, but the reverse regression (satellite observations onto ground observations) is linear, the estimator is suboptimal but still has certain appealing variance properties. In this paper expressions are derived for those regressions which relate the intercepts and slopes to conditional classification probabilities. These expressions are then used to discuss the question of classifier designs that can lead to low-variance crop proportion estimates. Variance expressions for these estimates in terms of classifier omission and commission errors are also derived.

  14. Uncertainty analysis of terrestrial net primary productivity and net biome productivity in China during 1901-2005

    DOE PAGES

    Shao, Junjiong; Zhou, Xuhui; Luo, Yiqi; ...

    2016-04-28

    Here, despite the importance of net primary productivity (NPP) and net biome productivity (NBP), estimates of NPP and NBP for China are highly uncertain. To investigate the main sources of uncertainty, we synthesized model estimates of NPP and NBP for China from published literature and the Multi-scale Synthesis and Terrestrial Model Intercomparison Project (MsTMIP). The literature-based results showed that total NPP and NBP in China were 3.35 ± 1.25 and 0.14 ± 0.094 Pg C yr –1, respectively. Classification and regression tree analysis based on literature data showed that model type was the primary source of the uncertainty, explaining 36%more » and 64% of the variance in NPP and NBP, respectively. Spatiotemporal scales, land cover conditions, inclusion of the N cycle, and effects of N addition also contributed to the overall uncertainty. Results based on the MsTMIP data suggested that model structures were overwhelmingly important (>90%) for the overall uncertainty compared to simulations with different combinations of time-varying global change factors. The interannual pattern of NPP was similar among diverse studies and increased by 0.012 Pg C yr –1 during 1981–2000. In addition, high uncertainty in China's NPP occurred in areas with high productivity, whereas NBP showed the opposite pattern. Our results suggest that to significantly reduce uncertainty in estimated NPP and NBP, model structures should be substantially tested on the basis of empirical results. To this end, coordinated distributed experiments with multiple global change factors might be a practical approach that can validate specific structures of different models.« less

  15. Uncertainty analysis of terrestrial net primary productivity and net biome productivity in China during 1901-2005

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Shao, Junjiong; Zhou, Xuhui; Luo, Yiqi

    Here, despite the importance of net primary productivity (NPP) and net biome productivity (NBP), estimates of NPP and NBP for China are highly uncertain. To investigate the main sources of uncertainty, we synthesized model estimates of NPP and NBP for China from published literature and the Multi-scale Synthesis and Terrestrial Model Intercomparison Project (MsTMIP). The literature-based results showed that total NPP and NBP in China were 3.35 ± 1.25 and 0.14 ± 0.094 Pg C yr –1, respectively. Classification and regression tree analysis based on literature data showed that model type was the primary source of the uncertainty, explaining 36%more » and 64% of the variance in NPP and NBP, respectively. Spatiotemporal scales, land cover conditions, inclusion of the N cycle, and effects of N addition also contributed to the overall uncertainty. Results based on the MsTMIP data suggested that model structures were overwhelmingly important (>90%) for the overall uncertainty compared to simulations with different combinations of time-varying global change factors. The interannual pattern of NPP was similar among diverse studies and increased by 0.012 Pg C yr –1 during 1981–2000. In addition, high uncertainty in China's NPP occurred in areas with high productivity, whereas NBP showed the opposite pattern. Our results suggest that to significantly reduce uncertainty in estimated NPP and NBP, model structures should be substantially tested on the basis of empirical results. To this end, coordinated distributed experiments with multiple global change factors might be a practical approach that can validate specific structures of different models.« less

  16. Improving virtual screening predictive accuracy of Human kallikrein 5 inhibitors using machine learning models.

    PubMed

    Fang, Xingang; Bagui, Sikha; Bagui, Subhash

    2017-08-01

    The readily available high throughput screening (HTS) data from the PubChem database provides an opportunity for mining of small molecules in a variety of biological systems using machine learning techniques. From the thousands of available molecular descriptors developed to encode useful chemical information representing the characteristics of molecules, descriptor selection is an essential step in building an optimal quantitative structural-activity relationship (QSAR) model. For the development of a systematic descriptor selection strategy, we need the understanding of the relationship between: (i) the descriptor selection; (ii) the choice of the machine learning model; and (iii) the characteristics of the target bio-molecule. In this work, we employed the Signature descriptor to generate a dataset on the Human kallikrein 5 (hK 5) inhibition confirmatory assay data and compared multiple classification models including logistic regression, support vector machine, random forest and k-nearest neighbor. Under optimal conditions, the logistic regression model provided extremely high overall accuracy (98%) and precision (90%), with good sensitivity (65%) in the cross validation test. In testing the primary HTS screening data with more than 200K molecular structures, the logistic regression model exhibited the capability of eliminating more than 99.9% of the inactive structures. As part of our exploration of the descriptor-model-target relationship, the excellent predictive performance of the combination of the Signature descriptor and the logistic regression model on the assay data of the Human kallikrein 5 (hK 5) target suggested a feasible descriptor/model selection strategy on similar targets. Copyright © 2017 Elsevier Ltd. All rights reserved.

  17. Cost-sensitive AdaBoost algorithm for ordinal regression based on extreme learning machine.

    PubMed

    Riccardi, Annalisa; Fernández-Navarro, Francisco; Carloni, Sante

    2014-10-01

    In this paper, the well known stagewise additive modeling using a multiclass exponential (SAMME) boosting algorithm is extended to address problems where there exists a natural order in the targets using a cost-sensitive approach. The proposed ensemble model uses an extreme learning machine (ELM) model as a base classifier (with the Gaussian kernel and the additional regularization parameter). The closed form of the derived weighted least squares problem is provided, and it is employed to estimate analytically the parameters connecting the hidden layer to the output layer at each iteration of the boosting algorithm. Compared to the state-of-the-art boosting algorithms, in particular those using ELM as base classifier, the suggested technique does not require the generation of a new training dataset at each iteration. The adoption of the weighted least squares formulation of the problem has been presented as an unbiased and alternative approach to the already existing ELM boosting techniques. Moreover, the addition of a cost model for weighting the patterns, according to the order of the targets, enables the classifier to tackle ordinal regression problems further. The proposed method has been validated by an experimental study by comparing it with already existing ensemble methods and ELM techniques for ordinal regression, showing competitive results.

  18. Evaluation of the confusion matrix method in the validation of an automated system for measuring feeding behaviour of cattle.

    PubMed

    Ruuska, Salla; Hämäläinen, Wilhelmiina; Kajava, Sari; Mughal, Mikaela; Matilainen, Pekka; Mononen, Jaakko

    2018-03-01

    The aim of the present study was to evaluate empirically confusion matrices in device validation. We compared the confusion matrix method to linear regression and error indices in the validation of a device measuring feeding behaviour of dairy cattle. In addition, we studied how to extract additional information on classification errors with confusion probabilities. The data consisted of 12 h behaviour measurements from five dairy cows; feeding and other behaviour were detected simultaneously with a device and from video recordings. The resulting 216 000 pairs of classifications were used to construct confusion matrices and calculate performance measures. In addition, hourly durations of each behaviour were calculated and the accuracy of measurements was evaluated with linear regression and error indices. All three validation methods agreed when the behaviour was detected very accurately or inaccurately. Otherwise, in the intermediate cases, the confusion matrix method and error indices produced relatively concordant results, but the linear regression method often disagreed with them. Our study supports the use of confusion matrix analysis in validation since it is robust to any data distribution and type of relationship, it makes a stringent evaluation of validity, and it offers extra information on the type and sources of errors. Copyright © 2018 Elsevier B.V. All rights reserved.

  19. Predicting allergic contact dermatitis: a hierarchical structure activity relationship (SAR) approach to chemical classification using topological and quantum chemical descriptors

    NASA Astrophysics Data System (ADS)

    Basak, Subhash C.; Mills, Denise; Hawkins, Douglas M.

    2008-06-01

    A hierarchical classification study was carried out based on a set of 70 chemicals—35 which produce allergic contact dermatitis (ACD) and 35 which do not. This approach was implemented using a regular ridge regression computer code, followed by conversion of regression output to binary data values. The hierarchical descriptor classes used in the modeling include topostructural (TS), topochemical (TC), and quantum chemical (QC), all of which are based solely on chemical structure. The concordance, sensitivity, and specificity are reported. The model based on the TC descriptors was found to be the best, while the TS model was extremely poor.

  20. Quantitative Structure-Activity Relationships for Organophosphate Enzyme Inhibition (Briefing Charts)

    DTIC Science & Technology

    2011-09-22

    OPs) are a group of pesticides that inhibit enzymes such as acetylcholinesterase. Numerous OP structural variants exist and toxicity data can be...and human toxicity studies especially for OPs lacking experimental data. 15. SUBJECT TERMS QSAR Organophosphates...structure and mechanism of toxicity c) Linking QSAR and OP PBPK/PD 2. Methods a) Physiochemical Descriptors b) Regression Techniques 3. Results a

  1. The interaction of flavonoid-lysozyme and the relationship between molecular structure of flavonoids and their binding activity to lysozyme.

    PubMed

    Yang, Ran; Yu, Lanlan; Zeng, Huajin; Liang, Ruiling; Chen, Xiaolan; Qu, Lingbo

    2012-11-01

    In this work, the interactions of twelve structurally different flavonoids with Lysozyme (Lys) were studied by fluorescence quenching method. The interaction mechanism and binding properties were investigated. It was found that the binding capacities of flavonoids to Lys were highly depend on the number and position of hydrogen, the kind and position of glycosyl. To explore the selectivity of the bindings of flavonoids with Lys, the structure descriptors of the flavonoids were calculated under QSAR software package of Cerius2, the quantitative relationship between the structures of flavonoids and their binding activities to Lys (QSAR) was performed through genetic function approximation (GFA) regression analysis. The QSAR regression equation was K(A) = 37850.460 + 1630.01Dipole +3038.330HD-171.795MR. (r = 0.858, r(CV)(2) = 0.444, F((11,3)) = 7.48), where K(A) is binding constants, Dipole, HD and MR was dipole moment, number of hydrogen-bond donor and molecular refractivity, respectively. The obtained results make us understand better how the molecular structures influencing their binding to protein which may open up new avenues for the design of the most suitable flavonoids derivatives with structure variants.

  2. Regression with Small Data Sets: A Case Study using Code Surrogates in Additive Manufacturing

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Kamath, C.; Fan, Y. J.

    There has been an increasing interest in recent years in the mining of massive data sets whose sizes are measured in terabytes. While it is easy to collect such large data sets in some application domains, there are others where collecting even a single data point can be very expensive, so the resulting data sets have only tens or hundreds of samples. For example, when complex computer simulations are used to understand a scientific phenomenon, we want to run the simulation for many different values of the input parameters and analyze the resulting output. The data set relating the simulationmore » inputs and outputs is typically quite small, especially when each run of the simulation is expensive. However, regression techniques can still be used on such data sets to build an inexpensive \\surrogate" that could provide an approximate output for a given set of inputs. A good surrogate can be very useful in sensitivity analysis, uncertainty analysis, and in designing experiments. In this paper, we compare different regression techniques to determine how well they predict melt-pool characteristics in the problem domain of additive manufacturing. Our analysis indicates that some of the commonly used regression methods do perform quite well even on small data sets.« less

  3. Comparison and validation of statistical methods for predicting power outage durations in the event of hurricanes.

    PubMed

    Nateghi, Roshanak; Guikema, Seth D; Quiring, Steven M

    2011-12-01

    This article compares statistical methods for modeling power outage durations during hurricanes and examines the predictive accuracy of these methods. Being able to make accurate predictions of power outage durations is valuable because the information can be used by utility companies to plan their restoration efforts more efficiently. This information can also help inform customers and public agencies of the expected outage times, enabling better collective response planning, and coordination of restoration efforts for other critical infrastructures that depend on electricity. In the long run, outage duration estimates for future storm scenarios may help utilities and public agencies better allocate risk management resources to balance the disruption from hurricanes with the cost of hardening power systems. We compare the out-of-sample predictive accuracy of five distinct statistical models for estimating power outage duration times caused by Hurricane Ivan in 2004. The methods compared include both regression models (accelerated failure time (AFT) and Cox proportional hazard models (Cox PH)) and data mining techniques (regression trees, Bayesian additive regression trees (BART), and multivariate additive regression splines). We then validate our models against two other hurricanes. Our results indicate that BART yields the best prediction accuracy and that it is possible to predict outage durations with reasonable accuracy. © 2011 Society for Risk Analysis.

  4. Quantile regression via vector generalized additive models.

    PubMed

    Yee, Thomas W

    2004-07-30

    One of the most popular methods for quantile regression is the LMS method of Cole and Green. The method naturally falls within a penalized likelihood framework, and consequently allows for considerable flexible because all three parameters may be modelled by cubic smoothing splines. The model is also very understandable: for a given value of the covariate, the LMS method applies a Box-Cox transformation to the response in order to transform it to standard normality; to obtain the quantiles, an inverse Box-Cox transformation is applied to the quantiles of the standard normal distribution. The purposes of this article are three-fold. Firstly, LMS quantile regression is presented within the framework of the class of vector generalized additive models. This confers a number of advantages such as a unifying theory and estimation process. Secondly, a new LMS method based on the Yeo-Johnson transformation is proposed, which has the advantage that the response is not restricted to be positive. Lastly, this paper describes a software implementation of three LMS quantile regression methods in the S language. This includes the LMS-Yeo-Johnson method, which is estimated efficiently by a new numerical integration scheme. The LMS-Yeo-Johnson method is illustrated by way of a large cross-sectional data set from a New Zealand working population. Copyright 2004 John Wiley & Sons, Ltd.

  5. Estimation of 1RM for knee extension based on the maximal isometric muscle strength and body composition.

    PubMed

    Kanada, Yoshikiyo; Sakurai, Hiroaki; Sugiura, Yoshito; Arai, Tomoaki; Koyama, Soichiro; Tanabe, Shigeo

    2017-11-01

    [Purpose] To create a regression formula in order to estimate 1RM for knee extensors, based on the maximal isometric muscle strength measured using a hand-held dynamometer and data regarding the body composition. [Subjects and Methods] Measurement was performed in 21 healthy males in their twenties to thirties. Single regression analysis was performed, with measurement values representing 1RM and the maximal isometric muscle strength as dependent and independent variables, respectively. Furthermore, multiple regression analysis was performed, with data regarding the body composition incorporated as another independent variable, in addition to the maximal isometric muscle strength. [Results] Through single regression analysis with the maximal isometric muscle strength as an independent variable, the following regression formula was created: 1RM (kg)=0.714 + 0.783 × maximal isometric muscle strength (kgf). On multiple regression analysis, only the total muscle mass was extracted. [Conclusion] A highly accurate regression formula to estimate 1RM was created based on both the maximal isometric muscle strength and body composition. Using a hand-held dynamometer and body composition analyzer, it was possible to measure these items in a short time, and obtain clinically useful results.

  6. Ridge, Lasso and Bayesian additive-dominance genomic models.

    PubMed

    Azevedo, Camila Ferreira; de Resende, Marcos Deon Vilela; E Silva, Fabyano Fonseca; Viana, José Marcelo Soriano; Valente, Magno Sávio Ferreira; Resende, Márcio Fernando Ribeiro; Muñoz, Patricio

    2015-08-25

    A complete approach for genome-wide selection (GWS) involves reliable statistical genetics models and methods. Reports on this topic are common for additive genetic models but not for additive-dominance models. The objective of this paper was (i) to compare the performance of 10 additive-dominance predictive models (including current models and proposed modifications), fitted using Bayesian, Lasso and Ridge regression approaches; and (ii) to decompose genomic heritability and accuracy in terms of three quantitative genetic information sources, namely, linkage disequilibrium (LD), co-segregation (CS) and pedigree relationships or family structure (PR). The simulation study considered two broad sense heritability levels (0.30 and 0.50, associated with narrow sense heritabilities of 0.20 and 0.35, respectively) and two genetic architectures for traits (the first consisting of small gene effects and the second consisting of a mixed inheritance model with five major genes). G-REML/G-BLUP and a modified Bayesian/Lasso (called BayesA*B* or t-BLASSO) method performed best in the prediction of genomic breeding as well as the total genotypic values of individuals in all four scenarios (two heritabilities x two genetic architectures). The BayesA*B*-type method showed a better ability to recover the dominance variance/additive variance ratio. Decomposition of genomic heritability and accuracy revealed the following descending importance order of information: LD, CS and PR not captured by markers, the last two being very close. Amongst the 10 models/methods evaluated, the G-BLUP, BAYESA*B* (-2,8) and BAYESA*B* (4,6) methods presented the best results and were found to be adequate for accurately predicting genomic breeding and total genotypic values as well as for estimating additive and dominance in additive-dominance genomic models.

  7. Application of impact dampers in vibration control of flexible structures

    NASA Technical Reports Server (NTRS)

    Akl, Fred A.; Butt, Aamir S.

    1995-01-01

    Impact dampers belong to the category of passive vibration devices used to attenuate the vibration of discrete and continuous systems. An impact damper generally consists of a mass which is allowed to travel freely between two defined stops. Under the right conditions, the vibration of the structure to which the impact damper is attached will cause the mass of the impact damper to strike the structure. Previous analytical and experimental research work on the effect of impact dampers in attenuating the vibration of discrete and continuous systems have demonstrated their effectiveness. It has been shown in this study that impact dampers can increase the intrinsic damping of a lightly-damped flexible structure. The test structure consists of a slender flexible beam supported by a pin-type support at one end and supported by a linear helical flexible spring at another location. Sinusoidal excitation spanning the first three natural frequencies was applied in the horizontal plane. The orientation of the excitation and the test structure in the horizontal plane minimizes the effect of gravity on the behavior of the test structure. The excitation was applied using a linear sine sweep technique. The span of the test structure, the mass of the impact damper, the distance of travel, and the location of the impact damper along the span of the test structure were varied. The damping ratio are estimated for sixty test configurations. The results show that the impact damper significantly increases the damping ratio of the test structure. Statistical analysis of the results using the method of multiple linear regression indicates that a reasonable fit has been accomplished. It is concluded that additional experimental analysis of flexible structures in microgravity environment is needed in order to achieve a better understanding of the behavior of impact damper under conditions of microgravity. Numerical solution of the behavior of flexible structures equipped with impact dampers is also needed to predict stresses and deformations under operating conditions of microgravity in space applications.

  8. Highlighting differences between conditional and unconditional quantile regression approaches through an application to assess medication adherence.

    PubMed

    Borah, Bijan J; Basu, Anirban

    2013-09-01

    The quantile regression (QR) framework provides a pragmatic approach in understanding the differential impacts of covariates along the distribution of an outcome. However, the QR framework that has pervaded the applied economics literature is based on the conditional quantile regression method. It is used to assess the impact of a covariate on a quantile of the outcome conditional on specific values of other covariates. In most cases, conditional quantile regression may generate results that are often not generalizable or interpretable in a policy or population context. In contrast, the unconditional quantile regression method provides more interpretable results as it marginalizes the effect over the distributions of other covariates in the model. In this paper, the differences between these two regression frameworks are highlighted, both conceptually and econometrically. Additionally, using real-world claims data from a large US health insurer, alternative QR frameworks are implemented to assess the differential impacts of covariates along the distribution of medication adherence among elderly patients with Alzheimer's disease. Copyright © 2013 John Wiley & Sons, Ltd.

  9. Marital status integration and suicide: A meta-analysis and meta-regression.

    PubMed

    Kyung-Sook, Woo; SangSoo, Shin; Sangjin, Shin; Young-Jeon, Shin

    2018-01-01

    Marital status is an index of the phenomenon of social integration within social structures and has long been identified as an important predictor suicide. However, previous meta-analyses have focused only on a particular marital status, or not sufficiently explored moderators. A meta-analysis of observational studies was conducted to explore the relationships between marital status and suicide and to understand the important moderating factors in this association. Electronic databases were searched to identify studies conducted between January 1, 2000 and June 30, 2016. We performed a meta-analysis, subgroup analysis, and meta-regression of 170 suicide risk estimates from 36 publications. Using random effects model with adjustment for covariates, the study found that the suicide risk for non-married versus married was OR = 1.92 (95% CI: 1.75-2.12). The suicide risk was higher for non-married individuals aged <65 years than for those aged ≥65 years, and higher for men than for women. According to the results of stratified analysis by gender, non-married men exhibited a greater risk of suicide than their married counterparts in all sub-analyses, but women aged 65 years or older showed no significant association between marital status and suicide. The suicide risk in divorced individuals was higher than for non-married individuals in both men and women. The meta-regression showed that gender, age, and sample size affected between-study variation. The results of the study indicated that non-married individuals have an aggregate higher suicide risk than married ones. In addition, gender and age were confirmed as important moderating factors in the relationship between marital status and suicide. Copyright © 2017 Elsevier Ltd. All rights reserved.

  10. Non-linear quantitative structure-activity relationship for adenine derivatives as competitive inhibitors of adenosine deaminase

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Sadat Hayatshahi, Sayyed Hamed; Abdolmaleki, Parviz; Safarian, Shahrokh

    2005-12-16

    Logistic regression and artificial neural networks have been developed as two non-linear models to establish quantitative structure-activity relationships between structural descriptors and biochemical activity of adenosine based competitive inhibitors, toward adenosine deaminase. The training set included 24 compounds with known k {sub i} values. The models were trained to solve two-class problems. Unlike the previous work in which multiple linear regression was used, the highest of positive charge on the molecules was recognized to be in close relation with their inhibition activity, while the electric charge on atom N1 of adenosine was found to be a poor descriptor. Consequently, themore » previously developed equation was improved and the newly formed one could predict the class of 91.66% of compounds correctly. Also optimized 2-3-1 and 3-4-1 neural networks could increase this rate to 95.83%.« less

  11. Independent contrasts and PGLS regression estimators are equivalent.

    PubMed

    Blomberg, Simon P; Lefevre, James G; Wells, Jessie A; Waterhouse, Mary

    2012-05-01

    We prove that the slope parameter of the ordinary least squares regression of phylogenetically independent contrasts (PICs) conducted through the origin is identical to the slope parameter of the method of generalized least squares (GLSs) regression under a Brownian motion model of evolution. This equivalence has several implications: 1. Understanding the structure of the linear model for GLS regression provides insight into when and why phylogeny is important in comparative studies. 2. The limitations of the PIC regression analysis are the same as the limitations of the GLS model. In particular, phylogenetic covariance applies only to the response variable in the regression and the explanatory variable should be regarded as fixed. Calculation of PICs for explanatory variables should be treated as a mathematical idiosyncrasy of the PIC regression algorithm. 3. Since the GLS estimator is the best linear unbiased estimator (BLUE), the slope parameter estimated using PICs is also BLUE. 4. If the slope is estimated using different branch lengths for the explanatory and response variables in the PIC algorithm, the estimator is no longer the BLUE, so this is not recommended. Finally, we discuss whether or not and how to accommodate phylogenetic covariance in regression analyses, particularly in relation to the problem of phylogenetic uncertainty. This discussion is from both frequentist and Bayesian perspectives.

  12. Evaluation of the Use of Zero-Augmented Regression Techniques to Model Incidence of Campylobacter Infections in FoodNet.

    PubMed

    Tremblay, Marlène; Crim, Stacy M; Cole, Dana J; Hoekstra, Robert M; Henao, Olga L; Döpfer, Dörte

    2017-10-01

    The Foodborne Diseases Active Surveillance Network (FoodNet) is currently using a negative binomial (NB) regression model to estimate temporal changes in the incidence of Campylobacter infection. FoodNet active surveillance in 483 counties collected data on 40,212 Campylobacter cases between years 2004 and 2011. We explored models that disaggregated these data to allow us to account for demographic, geographic, and seasonal factors when examining changes in incidence of Campylobacter infection. We hypothesized that modeling structural zeros and including demographic variables would increase the fit of FoodNet's Campylobacter incidence regression models. Five different models were compared: NB without demographic covariates, NB with demographic covariates, hurdle NB with covariates in the count component only, hurdle NB with covariates in both zero and count components, and zero-inflated NB with covariates in the count component only. Of the models evaluated, the nonzero-augmented NB model with demographic variables provided the best fit. Results suggest that even though zero inflation was not present at this level, individualizing the level of aggregation and using different model structures and predictors per site might be required to correctly distinguish between structural and observational zeros and account for risk factors that vary geographically.

  13. Extraction of object skeletons in multispectral imagery by the orthogonal regression fitting

    NASA Astrophysics Data System (ADS)

    Palenichka, Roman M.; Zaremba, Marek B.

    2003-03-01

    Accurate and automatic extraction of skeletal shape of objects of interest from satellite images provides an efficient solution to such image analysis tasks as object detection, object identification, and shape description. The problem of skeletal shape extraction can be effectively solved in three basic steps: intensity clustering (i.e. segmentation) of objects, extraction of a structural graph of the object shape, and refinement of structural graph by the orthogonal regression fitting. The objects of interest are segmented from the background by a clustering transformation of primary features (spectral components) with respect to each pixel. The structural graph is composed of connected skeleton vertices and represents the topology of the skeleton. In the general case, it is a quite rough piecewise-linear representation of object skeletons. The positions of skeleton vertices on the image plane are adjusted by means of the orthogonal regression fitting. It consists of changing positions of existing vertices according to the minimum of the mean orthogonal distances and, eventually, adding new vertices in-between if a given accuracy if not yet satisfied. Vertices of initial piecewise-linear skeletons are extracted by using a multi-scale image relevance function. The relevance function is an image local operator that has local maximums at the centers of the objects of interest.

  14. Structured Kernel Subspace Learning for Autonomous Robot Navigation.

    PubMed

    Kim, Eunwoo; Choi, Sungjoon; Oh, Songhwai

    2018-02-14

    This paper considers two important problems for autonomous robot navigation in a dynamic environment, where the goal is to predict pedestrian motion and control a robot with the prediction for safe navigation. While there are several methods for predicting the motion of a pedestrian and controlling a robot to avoid incoming pedestrians, it is still difficult to safely navigate in a dynamic environment due to challenges, such as the varying quality and complexity of training data with unwanted noises. This paper addresses these challenges simultaneously by proposing a robust kernel subspace learning algorithm based on the recent advances in nuclear-norm and l 1 -norm minimization. We model the motion of a pedestrian and the robot controller using Gaussian processes. The proposed method efficiently approximates a kernel matrix used in Gaussian process regression by learning low-rank structured matrix (with symmetric positive semi-definiteness) to find an orthogonal basis, which eliminates the effects of erroneous and inconsistent data. Based on structured kernel subspace learning, we propose a robust motion model and motion controller for safe navigation in dynamic environments. We evaluate the proposed robust kernel learning in various tasks, including regression, motion prediction, and motion control problems, and demonstrate that the proposed learning-based systems are robust against outliers and outperform existing regression and navigation methods.

  15. Abnormal structural luteolysis in ovaries of the senescence accelerated mouse (SAM): expression of Fas ligand/Fas-mediated apoptosis signaling molecules in luteal cells.

    PubMed

    Kiso, Minako; Manabe, Noboru; Komatsu, Kohji; Shimabe, Munetake; Miyamoto, Hajime

    2003-12-01

    Senescence accelerated mouse-prone (SAMP) mice with a shortened life span show accelerated changes in many of the signs of aging and a shorter reproductive life span than SAM-resistant (SAMR) controls. We previously showed that functional regression (progesterone dissimilation) occurs in abnormally accumulated luteal bodies (aaLBs) of SAMP mice, but structural regression of luteal cells in aaLB is inhibited. A deficiency of luteal cell apoptosis causes the abnormal accumulation of LBs in SAMP ovaries. In the present study, to show the abnormality of Fas ligand (FasL)/Fas-mediated apoptosis signal transducing factors in the aaLBs of the SAMP ovaries, we assessed the changes in the expression of FasL, Fas, caspase-8 and caspase-3 mRNAs by reverse transcription-polymerase chain reaction, and in the expression and localization of FasL, Fas and activated caspase-3 proteins by Western blotting and immunohistochemistry, respectively, during the estrus cycle/luteolysis. These mRNAs and proteins were expressed in normal LBs of both SAMP and SAMR ovaries, but not at all or only in trace amounts in aaLBs of SAMP, indicating that structural regression is inhibited by blockage of the expression of these transducing factors in luteal cells of aaLBs in SAMP mice.

  16. Investigating the separate and interactive associations of trauma and depression on brain structure: implications for cognition and aging.

    PubMed

    Karstens, Aimee J; Ajilore, Olusola; Rubin, Leah H; Yang, Shaolin; Zhang, Aifeng; Leow, Alex; Kumar, Anand; Lamar, Melissa

    2017-11-01

    Trauma and depression are associated with brain structural alterations; their combined effects on these outcomes are unclear. We previously reported a negative effect of trauma, independent of depression, on verbal learning and memory; less is known about underlying structural associates. We investigated separate and interactive associations of trauma and depression on brain structure. Adults aged 30-89 (N = 203) evaluated for depression (D+) and trauma history (T+) using structured clinical interviews were divided into 53 D+T+, 42 D+T-, 50 D-T+, and 58 D-T-. Multivariable linear regressions examined the separate and interactive associations of depression and trauma with prefrontal and temporal lobe cortical thickness composites and hippocampal volumes adjusting for age, sex, predicted verbal IQ, comorbid anxiety, and vascular risk. Significant results informed analyses of tract-based structural connectomic measures of efficiency and centrality. Trauma, independent of depression, was associated with greater left prefrontal cortex (PFC) thickness, in particular the medial orbitofrontal cortex and pars orbitalis. A trauma × depression interaction was observed for the right PFC in age-stratified analyses: Older D + T+ had reduced PFC thickness compared with older D - T+ individuals. Regardless of age, trauma was associated with more left medial orbitofrontal cortex efficiency and less pars orbitalis centrality. In the T+ group, left pars orbitalis cortical thickness and centrality negatively correlated with verbal learning. Trauma, independent of depression, associated with altered PFC characteristics, morphologically and in terms of structural network communication and influence. Additionally, findings suggest that there may be a combined effect of trauma and depression in older adults. Copyright © 2017 John Wiley & Sons, Ltd. Copyright © 2017 John Wiley & Sons, Ltd.

  17. The Predictive Effects of Protection Motivation Theory on Intention and Behaviour of Physical Activity in Patients with Type 2 Diabetes.

    PubMed

    Ali Morowatisharifabad, Mohammad; Abdolkarimi, Mahdi; Asadpour, Mohammad; Fathollahi, Mahmood Sheikh; Balaee, Parisa

    2018-04-15

    Theory-based education tailored to target behaviour and group can be effective in promoting physical activity. The purpose of this study was to examine the predictive power of Protection Motivation Theory on intent and behaviour of Physical Activity in Patients with Type 2 Diabetes. This descriptive study was conducted on 250 patients in Rafsanjan, Iran. To examine the scores of protection motivation theory structures, a researcher-made questionnaire was used. Its validity and reliability were confirmed. The level of physical activity was also measured by the International Short - form Physical Activity Inventory. Its validity and reliability were also approved. Data were analysed by statistical tests including correlation coefficient, chi-square, logistic regression and linear regression. The results revealed that there was a significant correlation between all the protection motivation theory constructs and the intention to do physical activity. The results showed that the Theory structures were able to predict 60% of the variance of physical activity intention. The results of logistic regression demonstrated that increase in the score of physical activity intent and self - efficacy increased the chance of higher level of physical activity by 3.4 and 1.5 times, respectively OR = (3.39, 1.54). Considering the ability of protection motivation theory structures to explain the physical activity behaviour, interventional designs are suggested based on the structures of this theory, especially to improve self -efficacy as the most powerful factor in predicting physical activity intention and behaviour.

  18. [Regression analysis to select native-like structures from decoys of antigen-antibody docking].

    PubMed

    Chen, Zhengshan; Chi, Xiangyang; Fan, Pengfei; Zhang, Guanying; Wang, Meirong; Yu, Changming; Chen, Wei

    2018-06-25

    Given the increasing exploitation of antibodies in different contexts such as molecular diagnostics and therapeutics, it would be beneficial to unravel properties of antigen-antibody interaction with modeling of computational protein-protein docking, especially, in the absence of a cocrystal structure. However, obtaining a native-like antigen-antibody structure remains challenging due in part to failing to reliably discriminate accurate from inaccurate structures among tens of thousands of decoys after computational docking with existing scoring function. We hypothesized that some important physicochemical and energetic features could be used to describe antigen-antibody interfaces and identify native-like antigen-antibody structure. We prepared a dataset, a subset of Protein-Protein Docking Benchmark Version 4.0, comprising 37 nonredundant 3D structures of antigen-antibody complexes, and used it to train and test multivariate logistic regression equation which took several important physicochemical and energetic features of decoys as dependent variables. Our results indicate that the ability to identify native-like structures of our method is superior to ZRANK and ZDOCK score for the subset of antigen-antibody complexes. And then, we use our method in workflow of predicting epitope of anti-Ebola glycoprotein monoclonal antibody-4G7 and identify three accurate residues in its epitope.

  19. Effect of Family Structure and Behavioral and Eyesight Problems on Caries Severity in Pupils by Using an Ordinal Logistic Model

    PubMed Central

    JAHANI, Yunes; ESHRAGHIAN, Mohammad Reza; Rahimi FOROUSHANI, Abbas; NOURIJELYANI, Keramat; MOHAMMAD, Kazem; SHAHRAVAN, Arash; ALAM, Mahin

    2013-01-01

    Background: Dental caries is one of the most preventable yet prevalent chronic diseases worldwide. Our objective was to evaluate the effect of family structure and behavioral and eyesight problems as they relate to caries severity in schoolchildren. Methods: This research was carried out on 845 primary schoolchildren aged 9 yr in Kerman, Iran, in 2012. Ten variables, including health records, family structure information and a dmft/DMFT index, were collected. Children were categorized into three groups based on the WHO caries severity classification. Low caries level was defined as dmft/DMFT<2.6, moderate as dmft/DMFT of 2.7–4.4 and high as dmft/DMFT>4.4. The Cochran–Armitage test and ordinal logistic regression were employed for data analysis. Results: Almost half of pupils had moderate or high caries severity. The odds of being in a higher caries severity category in pupils with behavioral problems (OR=2.37, 95% CI: 1.29–4.38) and girls (OR=1.6, 95% CI: 1.22–2.06) were higher than in other categories. In addition, pupils with eyesight problems (OR=0.58, 95% CI: 0.37–0.90) and overweight pupils (OR=0.46, 95% CI: 0.31–0.71) had lower caries severity than others. The effects of parents’ education, birth rank, living with parents and consanguineous relationship between parents were not significant on caries severity (P>0.05). Conclusions: Female pupils with behavioral problems were at a higher risk of caries severity than other pupils. These pupils need to be educated and coached on proper dental care. In addition, overweight pupils and those with eyesight problems had less caries severity than others. Family structure in this study did not have an effect on the severity of dental caries. PMID:26056644

  20. Experimental and computational prediction of glass transition temperature of drugs.

    PubMed

    Alzghoul, Ahmad; Alhalaweh, Amjad; Mahlin, Denny; Bergström, Christel A S

    2014-12-22

    Glass transition temperature (Tg) is an important inherent property of an amorphous solid material which is usually determined experimentally. In this study, the relation between Tg and melting temperature (Tm) was evaluated using a data set of 71 structurally diverse druglike compounds. Further, in silico models for prediction of Tg were developed based on calculated molecular descriptors and linear (multilinear regression, partial least-squares, principal component regression) and nonlinear (neural network, support vector regression) modeling techniques. The models based on Tm predicted Tg with an RMSE of 19.5 K for the test set. Among the five computational models developed herein the support vector regression gave the best result with RMSE of 18.7 K for the test set using only four chemical descriptors. Hence, two different models that predict Tg of drug-like molecules with high accuracy were developed. If Tm is available, a simple linear regression can be used to predict Tg. However, the results also suggest that support vector regression and calculated molecular descriptors can predict Tg with equal accuracy, already before compound synthesis.

  1. Methods for identifying SNP interactions: a review on variations of Logic Regression, Random Forest and Bayesian logistic regression.

    PubMed

    Chen, Carla Chia-Ming; Schwender, Holger; Keith, Jonathan; Nunkesser, Robin; Mengersen, Kerrie; Macrossan, Paula

    2011-01-01

    Due to advancements in computational ability, enhanced technology and a reduction in the price of genotyping, more data are being generated for understanding genetic associations with diseases and disorders. However, with the availability of large data sets comes the inherent challenges of new methods of statistical analysis and modeling. Considering a complex phenotype may be the effect of a combination of multiple loci, various statistical methods have been developed for identifying genetic epistasis effects. Among these methods, logic regression (LR) is an intriguing approach incorporating tree-like structures. Various methods have built on the original LR to improve different aspects of the model. In this study, we review four variations of LR, namely Logic Feature Selection, Monte Carlo Logic Regression, Genetic Programming for Association Studies, and Modified Logic Regression-Gene Expression Programming, and investigate the performance of each method using simulated and real genotype data. We contrast these with another tree-like approach, namely Random Forests, and a Bayesian logistic regression with stochastic search variable selection.

  2. Spectral regression and correlation coefficients of some benzaldimines and salicylaldimines in different solvents

    NASA Astrophysics Data System (ADS)

    Hammud, Hassan H.; Ghannoum, Amer; Masoud, Mamdouh S.

    2006-02-01

    Sixteen Schiff bases obtained from the condensation of benzaldehyde or salicylaldehyde with various amines (aniline, 4-carboxyaniline, phenylhydrazine, 2,4-dinitrophenylhydrazine, ethylenediamine, hydrazine, o-phenylenediamine and 2,6-pyridinediamine) are studied with UV-vis spectroscopy to observe the effect of solvents, substituents and other structural factors on the spectra. The bands involving different electronic transitions are interpreted. Computerized analysis and multiple regression techniques were applied to calculate the regression and correlation coefficients based on the equation that relates peak position λmax to the solvent parameters that depend on the H-bonding ability, refractive index and dielectric constant of solvents.

  3. The use of cognitive ability measures as explanatory variables in regression analysis

    PubMed Central

    Junker, Brian; Schofield, Lynne Steuerle; Taylor, Lowell J

    2015-01-01

    Cognitive ability measures are often taken as explanatory variables in regression analysis, e.g., as a factor affecting a market outcome such as an individual’s wage, or a decision such as an individual’s education acquisition. Cognitive ability is a latent construct; its true value is unobserved. Nonetheless, researchers often assume that a test score, constructed via standard psychometric practice from individuals’ responses to test items, can be safely used in regression analysis. We examine problems that can arise, and suggest that an alternative approach, a “mixed effects structural equations” (MESE) model, may be more appropriate in many circumstances. PMID:26998417

  4. Bayesian Analysis of Nonlinear Structural Equation Models with Nonignorable Missing Data

    ERIC Educational Resources Information Center

    Lee, Sik-Yum

    2006-01-01

    A Bayesian approach is developed for analyzing nonlinear structural equation models with nonignorable missing data. The nonignorable missingness mechanism is specified by a logistic regression model. A hybrid algorithm that combines the Gibbs sampler and the Metropolis-Hastings algorithm is used to produce the joint Bayesian estimates of…

  5. Structural Location and Reputed Influence in State Reading Policy Issue Networks

    ERIC Educational Resources Information Center

    Young, Tamara V.; Lewis, Wayne D.; Sanders, Marla S.

    2010-01-01

    Using data about collaborative relationships among 109 reading policy actors from four states, this study investigated the extent to which social capital, operationalized as spanning structural holes, predicted a policy actor's reputed influence. Regression analysis showed that after controlling for state, centrality, and government entity, having…

  6. Effects of Employing Ridge Regression in Structural Equation Models.

    ERIC Educational Resources Information Center

    McQuitty, Shaun

    1997-01-01

    LISREL 8 invokes a ridge option when maximum likelihood or generalized least squares are used to estimate a structural equation model with a nonpositive definite covariance or correlation matrix. Implications of the ridge option for model fit, parameter estimates, and standard errors are explored through two examples. (SLD)

  7. STRUCTURAL AND AFFECTIVE ASPECTS OF CLASSROOM CLIMATE.

    ERIC Educational Resources Information Center

    WALBERG, HERBERT J.

    USING THE CLASSROOM AS THE UNIT OF ANALYSIS A 25 PERCENT RANDOM SAMPLE OF STUDENTS IN 72 CLASSES FROM ALL PARTS OF THE COUNTRY TOOK THE CLASSROOM CLIMATE QUESTIONNAIRE IN ORDER TO INVESTIGATE THE RELATIONSHIP BETWEEN STRUCTURAL (ORGANIZATIONAL) AND AFFECTIVE (PERSONAL INTERACTION BETWEEN GROUP MEMBERS) DIMENSIONS OF GROUP CLIMATE. REGRESSION AND…

  8. Fertilizer Response Curves for Commercial Southern Forest Species Defined with an Un-Replicated Experimental Design.

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Coleman, Mark; Aubrey, Doug; Coyle, David, R.

    2005-11-01

    There has been recent interest in use of non-replicated regression experimental designs in forestry, as the need for replication in experimental design is burdensome on limited research budgets. We wanted to determine the interacting effects of soil moisture and nutrient availability on the production of various southeastern forest trees (two clones of Populus deltoides, open pollinated Platanus occidentalis, Liquidambar styraciflua and Pinus taeda). Additionally, we required an understanding of the fertilizer response curve. To accomplish both objectives we developed a composite design that includes a core ANOVA approach to consider treatment interactions, with the addition of non-replicated regression plots receivingmore » a range of fertilizer levels for the primary irrigation treatment.« less

  9. An Examination and Comparison of Airline and Navy Pilot Career Earnings

    DTIC Science & Technology

    1986-03-01

    RECEIVED ........ .............. 45 16. AIRLINE PILOT PROBATIONARY WAGES .... ........ 46 17. 1985 FAPA MAXIMUM PILOT WAGE ESTIMATES ..... 53 1 1983...tI% LIN PILOT WAGES REGRESSION EQUATIONS . 5 19. AVERAGE 1983 PILOT WAGES COMPUTED FROM REGRESSION ANALYSIS ...... ............. 56 20. FAPA MAXIMUM...Western N/A 1,200 1,500 Source: FAPA This establishes a wage "base" for pilots. In addition, a pilot who ilys more than average in one month may "bank

  10. Analysis of Binary Adherence Data in the Setting of Polypharmacy: A Comparison of Different Approaches

    PubMed Central

    Esserman, Denise A.; Moore, Charity G.; Roth, Mary T.

    2009-01-01

    Older community dwelling adults often take multiple medications for numerous chronic diseases. Non-adherence to these medications can have a large public health impact. Therefore, the measurement and modeling of medication adherence in the setting of polypharmacy is an important area of research. We apply a variety of different modeling techniques (standard linear regression; weighted linear regression; adjusted linear regression; naïve logistic regression; beta-binomial (BB) regression; generalized estimating equations (GEE)) to binary medication adherence data from a study in a North Carolina based population of older adults, where each medication an individual was taking was classified as adherent or non-adherent. In addition, through simulation we compare these different methods based on Type I error rates, bias, power, empirical 95% coverage, and goodness of fit. We find that estimation and inference using GEE is robust to a wide variety of scenarios and we recommend using this in the setting of polypharmacy when adherence is dichotomously measured for multiple medications per person. PMID:20414358

  11. Restoration of Monotonicity Respecting in Dynamic Regression

    PubMed Central

    Huang, Yijian

    2017-01-01

    Dynamic regression models, including the quantile regression model and Aalen’s additive hazards model, are widely adopted to investigate evolving covariate effects. Yet lack of monotonicity respecting with standard estimation procedures remains an outstanding issue. Advances have recently been made, but none provides a complete resolution. In this article, we propose a novel adaptive interpolation method to restore monotonicity respecting, by successively identifying and then interpolating nearest monotonicity-respecting points of an original estimator. Under mild regularity conditions, the resulting regression coefficient estimator is shown to be asymptotically equivalent to the original. Our numerical studies have demonstrated that the proposed estimator is much more smooth and may have better finite-sample efficiency than the original as well as, when available as only in special cases, other competing monotonicity-respecting estimators. Illustration with a clinical study is provided. PMID:29430068

  12. Magnitude and Frequency of Floods on Nontidal Streams in Delaware

    USGS Publications Warehouse

    Ries, Kernell G.; Dillow, Jonathan J.A.

    2006-01-01

    Reliable estimates of the magnitude and frequency of annual peak flows are required for the economical and safe design of transportation and water-conveyance structures. This report, done in cooperation with the Delaware Department of Transportation (DelDOT) and the Delaware Geological Survey (DGS), presents methods for estimating the magnitude and frequency of floods on nontidal streams in Delaware at locations where streamgaging stations monitor streamflow continuously and at ungaged sites. Methods are presented for estimating the magnitude of floods for return frequencies ranging from 2 through 500 years. These methods are applicable to watersheds exhibiting a full range of urban development conditions. The report also describes StreamStats, a web application that makes it easy to obtain flood-frequency estimates for user-selected locations on Delaware streams. Flood-frequency estimates for ungaged sites are obtained through a process known as regionalization, using statistical regression analysis, where information determined for a group of streamgaging stations within a region forms the basis for estimates for ungaged sites within the region. One hundred and sixteen streamgaging stations in and near Delaware with at least 10 years of non-regulated annual peak-flow data available were used in the regional analysis. Estimates for gaged sites are obtained by combining the station peak-flow statistics (mean, standard deviation, and skew) and peak-flow estimates with regional estimates of skew and flood-frequency magnitudes. Example flood-frequency estimate calculations using the methods presented in the report are given for: (1) ungaged sites, (2) gaged locations, (3) sites upstream or downstream from a gaged location, and (4) sites between gaged locations. Regional regression equations applicable to ungaged sites in the Piedmont and Coastal Plain Physiographic Provinces of Delaware are presented. The equations incorporate drainage area, forest cover, impervious area, basin storage, housing density, soil type A, and mean basin slope as explanatory variables, and have average standard errors of prediction ranging from 28 to 72 percent. Additional regression equations that incorporate drainage area and housing density as explanatory variables are presented for use in defining the effects of urbanization on peak-flow estimates throughout Delaware for the 2-year through 500-year recurrence intervals, along with suggestions for their appropriate use in predicting development-affected peak flows. Additional topics associated with the analyses performed during the study are also discussed, including: (1) the availability and description of more than 30 basin and climatic characteristics considered during the development of the regional regression equations; (2) the treatment of increasing trends in the annual peak-flow series identified at 18 gaged sites, with respect to their relations with maximum 24-hour precipitation and housing density, and their use in the regional analysis; (3) calculation of the 90-percent confidence interval associated with peak-flow estimates from the regional regression equations; and (4) a comparison of flood-frequency estimates at gages used in a previous study, highlighting the effects of various improved analytical techniques.

  13. Random regression models using Legendre orthogonal polynomials to evaluate the milk production of Alpine goats.

    PubMed

    Silva, F G; Torres, R A; Brito, L F; Euclydes, R F; Melo, A L P; Souza, N O; Ribeiro, J I; Rodrigues, M T

    2013-12-11

    The objective of this study was to identify the best random regression model using Legendre orthogonal polynomials to evaluate Alpine goats genetically and to estimate the parameters for test day milk yield. On the test day, we analyzed 20,710 records of milk yield of 667 goats from the Goat Sector of the Universidade Federal de Viçosa. The evaluated models had combinations of distinct fitting orders for polynomials (2-5), random genetic (1-7), and permanent environmental (1-7) fixed curves and a number of classes for residual variance (2, 4, 5, and 6). WOMBAT software was used for all genetic analyses. A random regression model using the best Legendre orthogonal polynomial for genetic evaluation of milk yield on the test day of Alpine goats considered a fixed curve of order 4, curve of genetic additive effects of order 2, curve of permanent environmental effects of order 7, and a minimum of 5 classes of residual variance because it was the most economical model among those that were equivalent to the complete model by the likelihood ratio test. Phenotypic variance and heritability were higher at the end of the lactation period, indicating that the length of lactation has more genetic components in relation to the production peak and persistence. It is very important that the evaluation utilizes the best combination of fixed, genetic additive and permanent environmental regressions, and number of classes of heterogeneous residual variance for genetic evaluation using random regression models, thereby enhancing the precision and accuracy of the estimates of parameters and prediction of genetic values.

  14. Biomass equations for shrub species of Tamualipan thornscrub of North-Eastern Mexico

    Treesearch

    J. Navar; E. Mendez; A. Najera; J. Graciano; V. Dale; B. Parresol

    2004-01-01

    Nine additive allometric equations for computing above-ground, standing biomass were developed for the plant community and for each of 18 single species typical of the Tamaulipan thornscrub of north-eastern Mexico. Equations developed using additive procedures in seemingly unrelated linear regression provided statistical efficiency in total biomass estimates at the...

  15. Developing design methods of concrete mix with microsilica additives for road construction

    NASA Astrophysics Data System (ADS)

    Dmitrienko, Vladimir; Shrivel, Igor; Kokunko, Irina; Pashkova, Olga

    2017-10-01

    Based on the laboratory test results, regression equations having standard cone and concrete strength, to determine the available amount of cement, water and microsilica were obtained. The joint solution of these equations allowed the researchers to develop the algorithm of designing heavy concrete compositions with microsilica additives for road construction.

  16. Effect of grain port length-diameter ratio on combustion performance in hybrid rocket motors

    NASA Astrophysics Data System (ADS)

    Cai, Guobiao; Zhang, Yuanjun; Tian, Hui; Wang, Pengfei; Yu, Nanjia

    2016-11-01

    The objectives of this study are to develop a more accurate regression rate considering the oxidizer mass flow and the fuel grain geometry configuration with numerical and experimental investigations in polyethylene (PE)/90% hydrogen peroxide (HP) hybrid rocket. Firstly, a 2-D axisymmetric CFD model with turbulence, chemistry reaction, solid-gas coupling is built to investigate the combustion chamber internal flow structure. Then a more accurate regression formula is proposed and the combustion efficiency changing with the length-diameter ratio is studied. A series experiments are conducted in various oxidizer mass flow to analyze combustion performance including the regression rate and combustion efficiency. The regression rates are measured by the fuel mass reducing and diameter changing. A new regression rate formula considering the fuel grain configuration is proposed in this paper. The combustion efficiency increases with the length-diameter ratio changing. To improve the performance of a hybrid rocket motor, the port length-diameter ratio is suggested 10-12 in the paper.

  17. Soft-sensing model of temperature for aluminum reduction cell on improved twin support vector regression

    NASA Astrophysics Data System (ADS)

    Li, Tao

    2018-06-01

    The complexity of aluminum electrolysis process leads the temperature for aluminum reduction cells hard to measure directly. However, temperature is the control center of aluminum production. To solve this problem, combining some aluminum plant's practice data, this paper presents a Soft-sensing model of temperature for aluminum electrolysis process on Improved Twin Support Vector Regression (ITSVR). ITSVR eliminates the slow learning speed of Support Vector Regression (SVR) and the over-fit risk of Twin Support Vector Regression (TSVR) by introducing a regularization term into the objective function of TSVR, which ensures the structural risk minimization principle and lower computational complexity. Finally, the model with some other parameters as auxiliary variable, predicts the temperature by ITSVR. The simulation result shows Soft-sensing model based on ITSVR has short time-consuming and better generalization.

  18. Design and testing of digitally manufactured paraffin Acrylonitrile-butadiene-styrene hybrid rocket motors

    NASA Astrophysics Data System (ADS)

    McCulley, Jonathan M.

    This research investigates the application of additive manufacturing techniques for fabricating hybrid rocket fuel grains composed of porous Acrylonitrile-butadiene-styrene impregnated with paraffin wax. The digitally manufactured ABS substrate provides mechanical support for the paraffin fuel material and serves as an additional fuel component. The embedded paraffin provides an enhanced fuel regression rate while having no detrimental effect on the thermodynamic burn properties of the fuel grain. Multiple fuel grains with various ABS-to-Paraffin mass ratios were fabricated and burned with nitrous oxide. Analytical predictions for end-to-end motor performance and fuel regression are compared against static test results. Baseline fuel grain regression calculations use an enthalpy balance energy analysis with the material and thermodynamic properties based on the mean paraffin/ABS mass fractions within the fuel grain. In support of these analytical comparisons, a novel method for propagating the fuel port burn surface was developed. In this modeling approach the fuel cross section grid is modeled as an image with white pixels representing the fuel and black pixels representing empty or burned grid cells.

  19. Evaluating penalized logistic regression models to predict Heat-Related Electric grid stress days

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Bramer, L. M.; Rounds, J.; Burleyson, C. D.

    Understanding the conditions associated with stress on the electricity grid is important in the development of contingency plans for maintaining reliability during periods when the grid is stressed. In this paper, heat-related grid stress and the relationship with weather conditions is examined using data from the eastern United States. Penalized logistic regression models were developed and applied to predict stress on the electric grid using weather data. The inclusion of other weather variables, such as precipitation, in addition to temperature improved model performance. Several candidate models and datasets were examined. A penalized logistic regression model fit at the operation-zone levelmore » was found to provide predictive value and interpretability. Additionally, the importance of different weather variables observed at different time scales were examined. Maximum temperature and precipitation were identified as important across all zones while the importance of other weather variables was zone specific. The methods presented in this work are extensible to other regions and can be used to aid in planning and development of the electrical grid.« less

  20. Evaluating penalized logistic regression models to predict Heat-Related Electric grid stress days

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Bramer, Lisa M.; Rounds, J.; Burleyson, C. D.

    Understanding the conditions associated with stress on the electricity grid is important in the development of contingency plans for maintaining reliability during periods when the grid is stressed. In this paper, heat-related grid stress and the relationship with weather conditions were examined using data from the eastern United States. Penalized logistic regression models were developed and applied to predict stress on the electric grid using weather data. The inclusion of other weather variables, such as precipitation, in addition to temperature improved model performance. Several candidate models and combinations of predictive variables were examined. A penalized logistic regression model which wasmore » fit at the operation-zone level was found to provide predictive value and interpretability. Additionally, the importance of different weather variables observed at various time scales were examined. Maximum temperature and precipitation were identified as important across all zones while the importance of other weather variables was zone specific. In conclusion, the methods presented in this work are extensible to other regions and can be used to aid in planning and development of the electrical grid.« less

  1. Evaluating penalized logistic regression models to predict Heat-Related Electric grid stress days

    DOE PAGES

    Bramer, Lisa M.; Rounds, J.; Burleyson, C. D.; ...

    2017-09-22

    Understanding the conditions associated with stress on the electricity grid is important in the development of contingency plans for maintaining reliability during periods when the grid is stressed. In this paper, heat-related grid stress and the relationship with weather conditions were examined using data from the eastern United States. Penalized logistic regression models were developed and applied to predict stress on the electric grid using weather data. The inclusion of other weather variables, such as precipitation, in addition to temperature improved model performance. Several candidate models and combinations of predictive variables were examined. A penalized logistic regression model which wasmore » fit at the operation-zone level was found to provide predictive value and interpretability. Additionally, the importance of different weather variables observed at various time scales were examined. Maximum temperature and precipitation were identified as important across all zones while the importance of other weather variables was zone specific. In conclusion, the methods presented in this work are extensible to other regions and can be used to aid in planning and development of the electrical grid.« less

  2. Specific factors for prenatal lead exposure in the border area of China.

    PubMed

    Kawata, Kimiko; Li, Yan; Liu, Hao; Zhang, Xiao Qin; Ushijima, Hiroshi

    2006-07-01

    The objectives of this study are to examine the prevalence of increased blood lead concentrations in mothers and their umbilical cords, and to identify risk factors for prenatal lead exposure in Kunming city, Yunnan province, China. The study was conducted at two obstetrics departments, and 100 peripartum women were enrolled. The mean blood lead concentrations of the mothers and the umbilical cords were 67.3microg/l and 53.1microg/l, respectively. In multiple linear regression analysis, maternal occupational exposure, maternal consumption of homemade dehydrated vegetables and maternal habitation period in Kunming city were significantly associated with an increase of umbilical cord blood lead concentration. In addition, logistic regression analysis was used to assess the association of umbilical cord blood lead concentrations that possibly have adverse effects on brain development of newborns with each potential risk factor. Maternal frequent use of tableware with color patterns inside was significantly associated with higher cord blood lead concentration in addition to the three items in the multiple linear regression analysis. These points should be considered as specific recommendations for maternal and fetal lead exposure in this city.

  3. A new biodegradation prediction model specific to petroleum hydrocarbons.

    PubMed

    Howard, Philip; Meylan, William; Aronson, Dallas; Stiteler, William; Tunkel, Jay; Comber, Michael; Parkerton, Thomas F

    2005-08-01

    A new predictive model for determining quantitative primary biodegradation half-lives of individual petroleum hydrocarbons has been developed. This model uses a fragment-based approach similar to that of several other biodegradation models, such as those within the Biodegradation Probability Program (BIOWIN) estimation program. In the present study, a half-life in days is estimated using multiple linear regression against counts of 31 distinct molecular fragments. The model was developed using a data set consisting of 175 compounds with environmentally relevant experimental data that was divided into training and validation sets. The original fragments from the Ministry of International Trade and Industry BIOWIN model were used initially as structural descriptors and additional fragments were then added to better describe the ring systems found in petroleum hydrocarbons and to adjust for nonlinearity within the experimental data. The training and validation sets had r2 values of 0.91 and 0.81, respectively.

  4. Flexible nonlinear estimates of the association between height and mental ability in early life.

    PubMed

    Murasko, Jason E

    2014-01-01

    To estimate associations between early-life mental ability and height/height-growth in contemporary US children. Structured additive regression models are used to flexibly estimate the associations between height and mental ability at approximately 24 months of age. The sample is taken from the Early Childhood Longitudinal Study-Birth Cohort, a national study whose target population was children born in the US during 2001. A nonlinear association is indicated between height and mental ability at approximately 24 months of age. There is an increasing association between height and mental ability below the mean value of height, but a flat association thereafter. Annualized growth shows the same nonlinear association to ability when controlling for baseline length at 9 months. Restricted growth at lower values of the height distribution is associated with lower measured mental ability in contemporary US children during the first years of life. Copyright © 2013 Wiley Periodicals, Inc.

  5. Role strain in couples with and without a child with a chronic illness: associations with marital satisfaction, intimacy, and daily mood.

    PubMed

    Quittner, A L; Opipari, L C; Espelage, D L; Carter, B; Eid, N; Eigen, H

    1998-03-01

    This study examined marital role strain in 33 couples caring for a child with cystic fibrosis (CF) and 33 couples with a healthy child. The relationship between role strain, marital satisfaction, and psychological distress was tested. Couples completed a structured interview, questionnaires, a card sort procedure, and 4 daily diaries assessing activities and mood. Couples in the CF versus comparison group reported greater role strain on measures of role conflict, child-care tasks, and exchanges of affection. They also spent less time in recreational activities, but no reliable group differences were found in marital satisfaction or depression. Regression analyses indicated that role strain was related to marital satisfaction and depression and that recreation time accounted for additional variance. Path analysis suggested that recreation mediated the negative relationship between role strain and distress. The importance of using a contextual, process-oriented approach is discussed.

  6. New horizons in mouse immunoinformatics: reliable in silico prediction of mouse class I histocompatibility major complex peptide binding affinity.

    PubMed

    Hattotuwagama, Channa K; Guan, Pingping; Doytchinova, Irini A; Flower, Darren R

    2004-11-21

    Quantitative structure-activity relationship (QSAR) analysis is a main cornerstone of modern informatic disciplines. Predictive computational models, based on QSAR technology, of peptide-major histocompatibility complex (MHC) binding affinity have now become a vital component of modern day computational immunovaccinology. Historically, such approaches have been built around semi-qualitative, classification methods, but these are now giving way to quantitative regression methods. The additive method, an established immunoinformatics technique for the quantitative prediction of peptide-protein affinity, was used here to identify the sequence dependence of peptide binding specificity for three mouse class I MHC alleles: H2-D(b), H2-K(b) and H2-K(k). As we show, in terms of reliability the resulting models represent a significant advance on existing methods. They can be used for the accurate prediction of T-cell epitopes and are freely available online ( http://www.jenner.ac.uk/MHCPred).

  7. Bayesian Group Bridge for Bi-level Variable Selection.

    PubMed

    Mallick, Himel; Yi, Nengjun

    2017-06-01

    A Bayesian bi-level variable selection method (BAGB: Bayesian Analysis of Group Bridge) is developed for regularized regression and classification. This new development is motivated by grouped data, where generic variables can be divided into multiple groups, with variables in the same group being mechanistically related or statistically correlated. As an alternative to frequentist group variable selection methods, BAGB incorporates structural information among predictors through a group-wise shrinkage prior. Posterior computation proceeds via an efficient MCMC algorithm. In addition to the usual ease-of-interpretation of hierarchical linear models, the Bayesian formulation produces valid standard errors, a feature that is notably absent in the frequentist framework. Empirical evidence of the attractiveness of the method is illustrated by extensive Monte Carlo simulations and real data analysis. Finally, several extensions of this new approach are presented, providing a unified framework for bi-level variable selection in general models with flexible penalties.

  8. Estimating life expectancies for US small areas: a regression framework

    NASA Astrophysics Data System (ADS)

    Congdon, Peter

    2014-01-01

    Analysis of area mortality variations and estimation of area life tables raise methodological questions relevant to assessing spatial clustering, and socioeconomic inequalities in mortality. Existing small area analyses of US life expectancy variation generally adopt ad hoc amalgamations of counties to alleviate potential instability of mortality rates involved in deriving life tables, and use conventional life table analysis which takes no account of correlated mortality for adjacent areas or ages. The alternative strategy here uses structured random effects methods that recognize correlations between adjacent ages and areas, and allows retention of the original county boundaries. This strategy generalizes to include effects of area category (e.g. poverty status, ethnic mix), allowing estimation of life tables according to area category, and providing additional stabilization of estimated life table functions. This approach is used here to estimate stabilized mortality rates, derive life expectancies in US counties, and assess trends in clustering and in inequality according to county poverty category.

  9. Methods for Estimating Magnitude and Frequency of Floods in Rural Basins in the Southeastern United States: South Carolina

    USGS Publications Warehouse

    Feaster, Toby D.; Gotvald, Anthony J.; Weaver, J. Curtis

    2009-01-01

    For more than 50 years, the U.S. Geological Survey (USGS) has been developing regional regression equations that can be used to estimate flood magnitude and frequency at ungaged sites. Flood magnitude relates to the volume of flow that occurs over some period of time and usually is presented in cubic feet per second. Flood frequency relates to the probability of occurrence of a flood; that is, on average, what is the likelihood that a flood with a specified magnitude will occur in any given year (1 percent chance, 10 percent chance, 50 percent chance, and so on). Such flood estimates are needed for the efficient design of bridges, highway embankments, levees, and other structures near streams. In addition, these estimates are needed for the effective planning and management of land and water resources, to protect lives and property in flood-prone areas, and to determine flood-insurance rates.

  10. Healthcare Worker Exposures to the Antibacterial Agent Triclosan

    PubMed Central

    MacIsaac, Julia K.; Gerona, Roy; Blanc, Paul D.; Apatira, Latifat; Friesen, Matthew; Coppolino, Michael; Janssen, Sarah

    2014-01-01

    Objective We sought to quantify absorption of triclosan, a potential endocrine disruptor, in healthcare workers with occupational exposure to soap containing this chemical. Methods A cross-sectional convenience sample of two groups of 38 healthcare workers at separate inpatient medical centers: Hospital One uses 0.3% triclosan soap in all patient care areas; Hospital Two does not use triclosan-containing products. Additional exposure to triclosan-containing personal care products was assessed through a structured questionnaire. Urine triclosan was quantified and the occupational contribution estimated through regression modeling. Results Occupational exposure accounted for an incremental triclosan burden of 206 ng/mL (p=0.02), while triclosan-containing toothpaste use was associated with 146 ng/mL higher levels (p<0.001). Conclusions Use of triclosan-containing antibacterial soaps in healthcare settings represents a substantial and potentially biologically relevant source occupational triclosan exposure. PMID:25099409

  11. Predicting the Retention Behavior of Specific O-Linked Glycopeptides.

    PubMed

    Badgett, Majors J; Boyes, Barry; Orlando, Ron

    2017-09-01

    O -Linked glycosylation is a common post-translational modification that can alter the overall structure, polarity, and function of proteins. Reverse-phase (RP) chromatography is the most common chromatographic approach to analyze O -glycosylated peptides and their unmodified counterparts, even though this approach often does not provide adequate separation of these two species. Hydrophilic interaction liquid chromatography (HILIC) can be a solution to this problem, as the polar glycan interacts with the polar stationary phase and potentially offers the ability to resolve the peptide from its modified form(s). In this paper, HILIC is used to separate peptides with O - N -acetylgalactosamine ( O -GalNAc), O - N -acetylglucosamine ( O -GlcNAc), and O -fucose additions from their native forms, and coefficients representing the extent of hydrophilicity were derived using linear regression analysis as a means to predict the retention times of peptides with these modifications.

  12. Predicting the Retention Behavior of Specific O-Linked Glycopeptides

    PubMed Central

    Badgett, Majors J.; Boyes, Barry; Orlando, Ron

    2017-01-01

    O-Linked glycosylation is a common post-translational modification that can alter the overall structure, polarity, and function of proteins. Reverse-phase (RP) chromatography is the most common chromatographic approach to analyze O-glycosylated peptides and their unmodified counterparts, even though this approach often does not provide adequate separation of these two species. Hydrophilic interaction liquid chromatography (HILIC) can be a solution to this problem, as the polar glycan interacts with the polar stationary phase and potentially offers the ability to resolve the peptide from its modified form(s). In this paper, HILIC is used to separate peptides with O-N-acetylgalactosamine (O-GalNAc), O-N-acetylglucosamine (O-GlcNAc), and O-fucose additions from their native forms, and coefficients representing the extent of hydrophilicity were derived using linear regression analysis as a means to predict the retention times of peptides with these modifications. PMID:28785176

  13. Multi-fidelity machine learning models for accurate bandgap predictions of solids

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Pilania, Ghanshyam; Gubernatis, James E.; Lookman, Turab

    Here, we present a multi-fidelity co-kriging statistical learning framework that combines variable-fidelity quantum mechanical calculations of bandgaps to generate a machine-learned model that enables low-cost accurate predictions of the bandgaps at the highest fidelity level. Additionally, the adopted Gaussian process regression formulation allows us to predict the underlying uncertainties as a measure of our confidence in the predictions. In using a set of 600 elpasolite compounds as an example dataset and using semi-local and hybrid exchange correlation functionals within density functional theory as two levels of fidelities, we demonstrate the excellent learning performance of the method against actual high fidelitymore » quantum mechanical calculations of the bandgaps. The presented statistical learning method is not restricted to bandgaps or electronic structure methods and extends the utility of high throughput property predictions in a significant way.« less

  14. The effect of hospital organizational characteristics on postoperative complications.

    PubMed

    Knight, Margaret

    2013-12-01

    To determine if there is a relationship between the risk of postoperative complications and the nonclinical hospital characteristics of bed size, ownership structure, relative urbanicity, regional location, teaching status, and area income status. This study involved a secondary analysis of 2006 administrative hospital data from a number of U.S. states. This data, gathered annually by the Agency for Healthcare Research and Quality (AHRQ) via the National Inpatient Sample (NIS) Healthcare Utilization Project (HCUP), was analyzed using probit regressions to measure the effects of several nonclinical hospital categories on seven diagnostic groupings. The study model included postoperative complications as well as additional potentially confounding variables. The results showed mixed outcomes for each of the hospital characteristic groupings. Subdividing these groupings to correspond with the HCUP data analysis allowed a greater understanding of how hospital characteristics' may affect postoperative outcomes. Nonclinical hospital characteristics do affect the various postoperative complications, but they do so inconsistently.

  15. Multi-fidelity machine learning models for accurate bandgap predictions of solids

    DOE PAGES

    Pilania, Ghanshyam; Gubernatis, James E.; Lookman, Turab

    2016-12-28

    Here, we present a multi-fidelity co-kriging statistical learning framework that combines variable-fidelity quantum mechanical calculations of bandgaps to generate a machine-learned model that enables low-cost accurate predictions of the bandgaps at the highest fidelity level. Additionally, the adopted Gaussian process regression formulation allows us to predict the underlying uncertainties as a measure of our confidence in the predictions. In using a set of 600 elpasolite compounds as an example dataset and using semi-local and hybrid exchange correlation functionals within density functional theory as two levels of fidelities, we demonstrate the excellent learning performance of the method against actual high fidelitymore » quantum mechanical calculations of the bandgaps. The presented statistical learning method is not restricted to bandgaps or electronic structure methods and extends the utility of high throughput property predictions in a significant way.« less

  16. Reliability and validity of the Marijuana Motives Measure among young adult frequent cannabis users and associations with cannabis dependence.

    PubMed

    Benschop, Annemieke; Liebregts, Nienke; van der Pol, Peggy; Schaap, Rick; Buisman, Renate; van Laar, Margriet; van den Brink, Wim; de Graaf, Ron; Korf, Dirk J

    2015-01-01

    The Marijuana Motives Measure (MMM) has so far been examined mainly in student populations, often with relatively limited involvement in cannabis use. This study evaluated the factor structure of the MMM in a demographically mixed sample of 600 young adult (18-30 years) frequent (≥ 3 days per week) cannabis users in the Netherlands. Analysis confirmed a five-factor solution, denoting coping, enhancement, social, conformity and expansion motives. Additionally, the original MMM was extended with two items (boredom and habit), which formed a distinct, internally consistent sixth factor labelled routine motives. In a multivariable logistic regression analysis, coping and routine motives showed significant associations with 12-month DSM-IV cannabis dependence. The results suggest general reliability and validity of the MMM in a heterogeneous population of experienced cannabis users. Copyright © 2014 Elsevier Ltd. All rights reserved.

  17. Bayesian estimation of seasonal course of canopy leaf area index from hyperspectral satellite data

    NASA Astrophysics Data System (ADS)

    Varvia, Petri; Rautiainen, Miina; Seppänen, Aku

    2018-03-01

    In this paper, Bayesian inversion of a physically-based forest reflectance model is investigated to estimate of boreal forest canopy leaf area index (LAI) from EO-1 Hyperion hyperspectral data. The data consist of multiple forest stands with different species compositions and structures, imaged in three phases of the growing season. The Bayesian estimates of canopy LAI are compared to reference estimates based on a spectral vegetation index. The forest reflectance model contains also other unknown variables in addition to LAI, for example leaf single scattering albedo and understory reflectance. In the Bayesian approach, these variables are estimated simultaneously with LAI. The feasibility and seasonal variation of these estimates is also examined. Credible intervals for the estimates are also calculated and evaluated. The results show that the Bayesian inversion approach is significantly better than using a comparable spectral vegetation index regression.

  18. Why credit risk markets are predestined for exhibiting log-periodic power law structures

    NASA Astrophysics Data System (ADS)

    Wosnitza, Jan Henrik; Leker, Jens

    2014-01-01

    Recent research has established the existence of log-periodic power law (LPPL) patterns in financial institutions’ credit default swap (CDS) spreads. The main purpose of this paper is to clarify why credit risk markets are predestined for exhibiting LPPL structures. To this end, the credit risk prediction of two variants of logistic regression, i.e. polynomial logistic regression (PLR) and kernel logistic regression (KLR), are firstly compared to the standard logistic regression (SLR). In doing so, the question whether the performances of rating systems based on balance sheet ratios can be improved by nonlinear transformations of the explanatory variables is resolved. Building on the result that nonlinear balance sheet ratio transformations hardly improve the SLR’s predictive power in our case, we secondly compare the classification performance of a multivariate SLR to the discriminative powers of probabilities of default derived from three different capital market data, namely bonds, CDSs, and stocks. Benefiting from the prompt inclusion of relevant information, the capital market data in general and CDSs in particular increasingly outperform the SLR while approaching the time of the credit event. Due to the higher classification performances, it seems plausible for creditors to align their investment decisions with capital market-based default indicators, i.e., to imitate the aggregate opinion of the market participants. Since imitation is considered to be the source of LPPL structures in financial time series, it is highly plausible to scan CDS spread developments for LPPL patterns. By establishing LPPL patterns in governmental CDS spread trajectories of some European crisis countries, the LPPL’s application to credit risk markets is extended. This novel piece of evidence further strengthens the claim that credit risk markets are adequate breeding grounds for LPPL patterns.

  19. Effects of electroacupuncture on luteal regression and steroidogenesis in ovarian hyperstimulation syndrome model rat.

    PubMed

    Huang, Xuan; Chen, Li; Xia, You-Bing; Xie, Min; Sun, Qin; Yao, Bing

    2018-03-15

    Electroacupuncture (EA) is an effective and safe therapeutic method widely used for treating clinical diseases. Previously, we found that EA could decrease serum hormones and reduce ovarian size in ovarian hyperstimulation syndrome (OHSS) rat model. Nevertheless, the mechanisms that contribute to these improvements remain unclear. HE staining was used to count the number of corpora lutea (CL) and follicles. Immunohistochemical and ELISA were applied to examine luteal functional and structural regression. Immunoprecipitation was used for analyzing the interaction between NPY (neuropeptide Y) and COX-2; western blotting and qRT-PCR were used to evaluate the expressions of steroidogenic enzymes and PKA/CREB pathway. EA treatment significantly reduced the ovarian weight and the number of CL, also decreased ovarian and serum levels of PGE2 and COX-2 expression; increased ovarian PGF2α levels and PGF2α/PGE2 ratio; decreased PCNA expression and distribution; and increased cyclin regulatory inhibitor p27 expression to have further effect on the luteal formation, and promote luteal functional and structural regression. Moreover, expression of COX-2 in ovaries was possessed interactivity increased expression of NPY. Furthermore, EA treatment lowered the serum hormone levels, inhibited PKA/CREB pathway and decreased the expressions of steroidogenic enzymes. Hence, interaction with COX-2, NPY may affect the levels of PGF2α and PGE2 as well as impact the proliferation of granulosa cells in ovaries, thus further reducing the luteal formation, and promoting luteal structural and functional regression, as well as the ovarian steroidogenesis following EA treatment. EA treatment could be an option for preventing OHSS in ART. Copyright © 2018 Elsevier Inc. All rights reserved.

  20. A robust background regression based score estimation algorithm for hyperspectral anomaly detection

    NASA Astrophysics Data System (ADS)

    Zhao, Rui; Du, Bo; Zhang, Liangpei; Zhang, Lefei

    2016-12-01

    Anomaly detection has become a hot topic in the hyperspectral image analysis and processing fields in recent years. The most important issue for hyperspectral anomaly detection is the background estimation and suppression. Unreasonable or non-robust background estimation usually leads to unsatisfactory anomaly detection results. Furthermore, the inherent nonlinearity of hyperspectral images may cover up the intrinsic data structure in the anomaly detection. In order to implement robust background estimation, as well as to explore the intrinsic data structure of the hyperspectral image, we propose a robust background regression based score estimation algorithm (RBRSE) for hyperspectral anomaly detection. The Robust Background Regression (RBR) is actually a label assignment procedure which segments the hyperspectral data into a robust background dataset and a potential anomaly dataset with an intersection boundary. In the RBR, a kernel expansion technique, which explores the nonlinear structure of the hyperspectral data in a reproducing kernel Hilbert space, is utilized to formulate the data as a density feature representation. A minimum squared loss relationship is constructed between the data density feature and the corresponding assigned labels of the hyperspectral data, to formulate the foundation of the regression. Furthermore, a manifold regularization term which explores the manifold smoothness of the hyperspectral data, and a maximization term of the robust background average density, which suppresses the bias caused by the potential anomalies, are jointly appended in the RBR procedure. After this, a paired-dataset based k-nn score estimation method is undertaken on the robust background and potential anomaly datasets, to implement the detection output. The experimental results show that RBRSE achieves superior ROC curves, AUC values, and background-anomaly separation than some of the other state-of-the-art anomaly detection methods, and is easy to implement in practice.

Top