Boosted regression tree, table, and figure data
Spreadsheets are included here to support the manuscript Boosted Regression Tree Models to Explain Watershed Nutrient Concentrations and Biological Condition. This dataset is associated with the following publication:Golden , H., C. Lane , A. Prues, and E. D'Amico. Boosted Regression Tree Models to Explain Watershed Nutrient Concentrations and Biological Condition. JAWRA. American Water Resources Association, Middleburg, VA, USA, 52(5): 1251-1274, (2016).
NASA Astrophysics Data System (ADS)
Mahaboob, B.; Venkateswarlu, B.; Sankar, J. Ravi; Balasiddamuni, P.
2017-11-01
This paper uses matrix calculus techniques to obtain Nonlinear Least Squares Estimator (NLSE), Maximum Likelihood Estimator (MLE) and Linear Pseudo model for nonlinear regression model. David Pollard and Peter Radchenko [1] explained analytic techniques to compute the NLSE. However the present research paper introduces an innovative method to compute the NLSE using principles in multivariate calculus. This study is concerned with very new optimization techniques used to compute MLE and NLSE. Anh [2] derived NLSE and MLE of a heteroscedatistic regression model. Lemcoff [3] discussed a procedure to get linear pseudo model for nonlinear regression model. In this research article a new technique is developed to get the linear pseudo model for nonlinear regression model using multivariate calculus. The linear pseudo model of Edmond Malinvaud [4] has been explained in a very different way in this paper. David Pollard et.al used empirical process techniques to study the asymptotic of the LSE (Least-squares estimation) for the fitting of nonlinear regression function in 2006. In Jae Myung [13] provided a go conceptual for Maximum likelihood estimation in his work “Tutorial on maximum likelihood estimation
Schilling, K.E.; Wolter, C.F.
2005-01-01
Nineteen variables, including precipitation, soils and geology, land use, and basin morphologic characteristics, were evaluated to develop Iowa regression models to predict total streamflow (Q), base flow (Qb), storm flow (Qs) and base flow percentage (%Qb) in gauged and ungauged watersheds in the state. Discharge records from a set of 33 watersheds across the state for the 1980 to 2000 period were separated into Qb and Qs. Multiple linear regression found that 75.5 percent of long term average Q was explained by rainfall, sand content, and row crop percentage variables, whereas 88.5 percent of Qb was explained by these three variables plus permeability and floodplain area variables. Qs was explained by average rainfall and %Qb was a function of row crop percentage, permeability, and basin slope variables. Regional regression models developed for long term average Q and Qb were adapted to annual rainfall and showed good correlation between measured and predicted values. Combining the regression model for Q with an estimate of mean annual nitrate concentration, a map of potential nitrate loads in the state was produced. Results from this study have important implications for understanding geomorphic and land use controls on streamflow and base flow in Iowa watersheds and similar agriculture dominated watersheds in the glaciated Midwest. (JAWRA) (Copyright ?? 2005).
Mauer, Michael; Caramori, Maria Luiza; Fioretto, Paola; Najafian, Behzad
2015-06-01
Studies of structural-functional relationships have improved understanding of the natural history of diabetic nephropathy (DN). However, in order to consider structural end points for clinical trials, the robustness of the resultant models needs to be verified. This study examined whether structural-functional relationship models derived from a large cohort of type 1 diabetic (T1D) patients with a wide range of renal function are robust. The predictability of models derived from multiple regression analysis and piecewise linear regression analysis was also compared. T1D patients (n = 161) with research renal biopsies were divided into two equal groups matched for albumin excretion rate (AER). Models to explain AER and glomerular filtration rate (GFR) by classical DN lesions in one group (T1D-model, or T1D-M) were applied to the other group (T1D-test, or T1D-T) and regression analyses were performed. T1D-M-derived models explained 70 and 63% of AER variance and 32 and 21% of GFR variance in T1D-M and T1D-T, respectively, supporting the substantial robustness of the models. Piecewise linear regression analyses substantially improved predictability of the models with 83% of AER variance and 66% of GFR variance explained by classical DN glomerular lesions alone. These studies demonstrate that DN structural-functional relationship models are robust, and if appropriate models are used, glomerular lesions alone explain a major proportion of AER and GFR variance in T1D patients. © The Author 2014. Published by Oxford University Press on behalf of ERA-EDTA. All rights reserved.
[How to fit and interpret multilevel models using SPSS].
Pardo, Antonio; Ruiz, Miguel A; San Martín, Rafael
2007-05-01
Hierarchic or multilevel models are used to analyse data when cases belong to known groups and sample units are selected both from the individual level and from the group level. In this work, the multilevel models most commonly discussed in the statistic literature are described, explaining how to fit these models using the SPSS program (any version as of the 11 th ) and how to interpret the outcomes of the analysis. Five particular models are described, fitted, and interpreted: (1) one-way analysis of variance with random effects, (2) regression analysis with means-as-outcomes, (3) one-way analysis of covariance with random effects, (4) regression analysis with random coefficients, and (5) regression analysis with means- and slopes-as-outcomes. All models are explained, trying to make them understandable to researchers in health and behaviour sciences.
Boosted Regression Tree Models to Explain Watershed Nutrient Concentrations and Biological Condition
Boosted regression tree (BRT) models were developed to quantify the nonlinear relationships between landscape variables and nutrient concentrations in a mesoscale mixed land cover watershed during base-flow conditions. Factors that affect instream biological components, based on ...
Hanley, James A
2008-01-01
Most survival analysis textbooks explain how the hazard ratio parameters in Cox's life table regression model are estimated. Fewer explain how the components of the nonparametric baseline survivor function are derived. Those that do often relegate the explanation to an "advanced" section and merely present the components as algebraic or iterative solutions to estimating equations. None comment on the structure of these estimators. This note brings out a heuristic representation that may help to de-mystify the structure.
Commitment to personal values and guilt feelings in dementia caregivers.
Gallego-Alberto, Laura; Losada, Andrés; Márquez-González, María; Romero-Moreno, Rosa; Vara, Carlos
2017-01-01
Caregivers' commitment to personal values is linked to caregivers' well-being, although the effects of personal values on caregivers' guilt have not been explored to date. The goal of this study is to analyze the relationship between caregivers´ commitment to personal values and guilt feelings. Participants were 179 dementia family caregivers. Face-to-face interviews were carried out to describe sociodemographic variables and assess stressors, caregivers' commitment to personal values and guilt feelings. Commitment to values was conceptualized as two factors (commitment to own values and commitment to family values) and 12 specific individual values (e.g. education, family or caregiving role). Hierarchical regressions were performed controlling for sociodemographic variables and stressors, and introducing the two commitment factors (in a first regression) or the commitment to individual/specific values (in a second regression) as predictors of guilt. In terms of the commitment to values factors, the analyzed regression model explained 21% of the variance of guilt feelings. Only the factor commitment to family values contributed significantly to the model, explaining 7% of variance. With regard to the regression analyzing the contribution of specific values to caregivers' guilt, commitment to the caregiving role and with leisure contributed negatively and significantly to the explanation of caregivers' guilt. Commitment to work contributed positively to guilt feelings. The full model explained 30% of guilt feelings variance. The specific values explained 16% of the variance. Our findings suggest that commitment to personal values is a relevant variable to understand guilt feelings in caregivers.
Park, Ji Hyun; Kim, Hyeon-Young; Lee, Hanna; Yun, Eun Kyoung
2015-12-01
This study compares the performance of the logistic regression and decision tree analysis methods for assessing the risk factors for infection in cancer patients undergoing chemotherapy. The subjects were 732 cancer patients who were receiving chemotherapy at K university hospital in Seoul, Korea. The data were collected between March 2011 and February 2013 and were processed for descriptive analysis, logistic regression and decision tree analysis using the IBM SPSS Statistics 19 and Modeler 15.1 programs. The most common risk factors for infection in cancer patients receiving chemotherapy were identified as alkylating agents, vinca alkaloid and underlying diabetes mellitus. The logistic regression explained 66.7% of the variation in the data in terms of sensitivity and 88.9% in terms of specificity. The decision tree analysis accounted for 55.0% of the variation in the data in terms of sensitivity and 89.0% in terms of specificity. As for the overall classification accuracy, the logistic regression explained 88.0% and the decision tree analysis explained 87.2%. The logistic regression analysis showed a higher degree of sensitivity and classification accuracy. Therefore, logistic regression analysis is concluded to be the more effective and useful method for establishing an infection prediction model for patients undergoing chemotherapy. Copyright © 2015 Elsevier Ltd. All rights reserved.
NASA Astrophysics Data System (ADS)
Holburn, E. R.; Bledsoe, B. P.; Poff, N. L.; Cuhaciyan, C. O.
2005-05-01
Using over 300 R/EMAP sites in OR and WA, we examine the relative explanatory power of watershed, valley, and reach scale descriptors in modeling variation in benthic macroinvertebrate indices. Innovative metrics describing flow regime, geomorphic processes, and hydrologic-distance weighted watershed and valley characteristics are used in multiple regression and regression tree modeling to predict EPT richness, % EPT, EPT/C, and % Plecoptera. A nested design using seven ecoregions is employed to evaluate the influence of geographic scale and environmental heterogeneity on the explanatory power of individual and combined scales. Regression tree models are constructed to explain variability while identifying threshold responses and interactions. Cross-validated models demonstrate differences in the explanatory power associated with single-scale and multi-scale models as environmental heterogeneity is varied. Models explaining the greatest variability in biological indices result from multi-scale combinations of physical descriptors. Results also indicate that substantial variation in benthic macroinvertebrate response can be explained with process-based watershed and valley scale metrics derived exclusively from common geospatial data. This study outlines a general framework for identifying key processes driving macroinvertebrate assemblages across a range of scales and establishing the geographic extent at which various levels of physical description best explain biological variability. Such information can guide process-based stratification to avoid spurious comparison of dissimilar stream types in bioassessments and ensure that key environmental gradients are adequately represented in sampling designs.
Hansson, Lisbeth; Khamis, Harry J
2008-12-01
Simulated data sets are used to evaluate conditional and unconditional maximum likelihood estimation in an individual case-control design with continuous covariates when there are different rates of excluded cases and different levels of other design parameters. The effectiveness of the estimation procedures is measured by method bias, variance of the estimators, root mean square error (RMSE) for logistic regression and the percentage of explained variation. Conditional estimation leads to higher RMSE than unconditional estimation in the presence of missing observations, especially for 1:1 matching. The RMSE is higher for the smaller stratum size, especially for the 1:1 matching. The percentage of explained variation appears to be insensitive to missing data, but is generally higher for the conditional estimation than for the unconditional estimation. It is particularly good for the 1:2 matching design. For minimizing RMSE, a high matching ratio is recommended; in this case, conditional and unconditional logistic regression models yield comparable levels of effectiveness. For maximizing the percentage of explained variation, the 1:2 matching design with the conditional logistic regression model is recommended.
NASA Astrophysics Data System (ADS)
Mfumu Kihumba, Antoine; Ndembo Longo, Jean; Vanclooster, Marnik
2016-03-01
A multivariate statistical modelling approach was applied to explain the anthropogenic pressure of nitrate pollution on the Kinshasa groundwater body (Democratic Republic of Congo). Multiple regression and regression tree models were compared and used to identify major environmental factors that control the groundwater nitrate concentration in this region. The analyses were made in terms of physical attributes related to the topography, land use, geology and hydrogeology in the capture zone of different groundwater sampling stations. For the nitrate data, groundwater datasets from two different surveys were used. The statistical models identified the topography, the residential area, the service land (cemetery), and the surface-water land-use classes as major factors explaining nitrate occurrence in the groundwater. Also, groundwater nitrate pollution depends not on one single factor but on the combined influence of factors representing nitrogen loading sources and aquifer susceptibility characteristics. The groundwater nitrate pressure was better predicted with the regression tree model than with the multiple regression model. Furthermore, the results elucidated the sensitivity of the model performance towards the method of delineation of the capture zones. For pollution modelling at the monitoring points, therefore, it is better to identify capture-zone shapes based on a conceptual hydrogeological model rather than to adopt arbitrary circular capture zones.
Weichenthal, Scott; Ryswyk, Keith Van; Goldstein, Alon; Bagg, Scott; Shekkarizfard, Maryam; Hatzopoulou, Marianne
2016-04-01
Existing evidence suggests that ambient ultrafine particles (UFPs) (<0.1µm) may contribute to acute cardiorespiratory morbidity. However, few studies have examined the long-term health effects of these pollutants owing in part to a need for exposure surfaces that can be applied in large population-based studies. To address this need, we developed a land use regression model for UFPs in Montreal, Canada using mobile monitoring data collected from 414 road segments during the summer and winter months between 2011 and 2012. Two different approaches were examined for model development including standard multivariable linear regression and a machine learning approach (kernel-based regularized least squares (KRLS)) that learns the functional form of covariate impacts on ambient UFP concentrations from the data. The final models included parameters for population density, ambient temperature and wind speed, land use parameters (park space and open space), length of local roads and rail, and estimated annual average NOx emissions from traffic. The final multivariable linear regression model explained 62% of the spatial variation in ambient UFP concentrations whereas the KRLS model explained 79% of the variance. The KRLS model performed slightly better than the linear regression model when evaluated using an external dataset (R(2)=0.58 vs. 0.55) or a cross-validation procedure (R(2)=0.67 vs. 0.60). In general, our findings suggest that the KRLS approach may offer modest improvements in predictive performance compared to standard multivariable linear regression models used to estimate spatial variations in ambient UFPs. However, differences in predictive performance were not statistically significant when evaluated using the cross-validation procedure. Crown Copyright © 2015. Published by Elsevier Inc. All rights reserved.
Developing a predictive tropospheric ozone model for Tabriz
NASA Astrophysics Data System (ADS)
Khatibi, Rahman; Naghipour, Leila; Ghorbani, Mohammad A.; Smith, Michael S.; Karimi, Vahid; Farhoudi, Reza; Delafrouz, Hadi; Arvanaghi, Hadi
2013-04-01
Predictive ozone models are becoming indispensable tools by providing a capability for pollution alerts to serve people who are vulnerable to the risks. We have developed a tropospheric ozone prediction capability for Tabriz, Iran, by using the following five modeling strategies: three regression-type methods: Multiple Linear Regression (MLR), Artificial Neural Networks (ANNs), and Gene Expression Programming (GEP); and two auto-regression-type models: Nonlinear Local Prediction (NLP) to implement chaos theory and Auto-Regressive Integrated Moving Average (ARIMA) models. The regression-type modeling strategies explain the data in terms of: temperature, solar radiation, dew point temperature, and wind speed, by regressing present ozone values to their past values. The ozone time series are available at various time intervals, including hourly intervals, from August 2010 to March 2011. The results for MLR, ANN and GEP models are not overly good but those produced by NLP and ARIMA are promising for the establishing a forecasting capability.
Interpreting Regression Results: beta Weights and Structure Coefficients are Both Important.
ERIC Educational Resources Information Center
Thompson, Bruce
Various realizations have led to less frequent use of the "OVA" methods (analysis of variance--ANOVA--among others) and to more frequent use of general linear model approaches such as regression. However, too few researchers understand all the various coefficients produced in regression. This paper explains these coefficients and their…
An empirical model for estimating annual consumption by freshwater fish populations
Liao, H.; Pierce, C.L.; Larscheid, J.G.
2005-01-01
Population consumption is an important process linking predator populations to their prey resources. Simple tools are needed to enable fisheries managers to estimate population consumption. We assembled 74 individual estimates of annual consumption by freshwater fish populations and their mean annual population size, 41 of which also included estimates of mean annual biomass. The data set included 14 freshwater fish species from 10 different bodies of water. From this data set we developed two simple linear regression models predicting annual population consumption. Log-transformed population size explained 94% of the variation in log-transformed annual population consumption. Log-transformed biomass explained 98% of the variation in log-transformed annual population consumption. We quantified the accuracy of our regressions and three alternative consumption models as the mean percent difference from observed (bioenergetics-derived) estimates in a test data set. Predictions from our population-size regression matched observed consumption estimates poorly (mean percent difference = 222%). Predictions from our biomass regression matched observed consumption reasonably well (mean percent difference = 24%). The biomass regression was superior to an alternative model, similar in complexity, and comparable to two alternative models that were more complex and difficult to apply. Our biomass regression model, log10(consumption) = 0.5442 + 0.9962??log10(biomass), will be a useful tool for fishery managers, enabling them to make reasonably accurate annual population consumption predictions from mean annual biomass estimates. ?? Copyright by the American Fisheries Society 2005.
Probability Theory Plus Noise: Descriptive Estimation and Inferential Judgment.
Costello, Fintan; Watts, Paul
2018-01-01
We describe a computational model of two central aspects of people's probabilistic reasoning: descriptive probability estimation and inferential probability judgment. This model assumes that people's reasoning follows standard frequentist probability theory, but it is subject to random noise. This random noise has a regressive effect in descriptive probability estimation, moving probability estimates away from normative probabilities and toward the center of the probability scale. This random noise has an anti-regressive effect in inferential judgement, however. These regressive and anti-regressive effects explain various reliable and systematic biases seen in people's descriptive probability estimation and inferential probability judgment. This model predicts that these contrary effects will tend to cancel out in tasks that involve both descriptive estimation and inferential judgement, leading to unbiased responses in those tasks. We test this model by applying it to one such task, described by Gallistel et al. ). Participants' median responses in this task were unbiased, agreeing with normative probability theory over the full range of responses. Our model captures the pattern of unbiased responses in this task, while simultaneously explaining systematic biases away from normatively correct probabilities seen in other tasks. Copyright © 2018 Cognitive Science Society, Inc.
Winters, Eric R; Petosa, Rick L; Charlton, Thomas E
2003-06-01
To examine whether knowledge of high school students' actions of self-regulation, and perceptions of self-efficacy to overcome exercise barriers, social situation, and outcome expectation will predict non-school related moderate and vigorous physical exercise. High school students enrolled in introductory Physical Education courses completed questionnaires that targeted selected Social Cognitive Theory variables. They also self-reported their typical "leisure-time" exercise participation using a standardized questionnaire. Bivariate correlation statistic and hierarchical regression were conducted on reports of moderate and vigorous exercise frequency. Each predictor variable was significantly associated with measures of moderate and vigorous exercise frequency. All predictor variables were significant in the final regression model used to explain vigorous exercise. After controlling for the effects of gender, the psychosocial variables explained 29% of variance in vigorous exercise frequency. Three of four predictor variables were significant in the final regression equation used to explain moderate exercise. The final regression equation accounted for 11% of variance in moderate exercise frequency. Professionals who attempt to increase the prevalence of physical exercise through educational methods should focus on the psychosocial variables utilized in this study.
NASA Astrophysics Data System (ADS)
Mulyani, Sri; Andriyana, Yudhie; Sudartianto
2017-03-01
Mean regression is a statistical method to explain the relationship between the response variable and the predictor variable based on the central tendency of the data (mean) of the response variable. The parameter estimation in mean regression (with Ordinary Least Square or OLS) generates a problem if we apply it to the data with a symmetric, fat-tailed, or containing outlier. Hence, an alternative method is necessary to be used to that kind of data, for example quantile regression method. The quantile regression is a robust technique to the outlier. This model can explain the relationship between the response variable and the predictor variable, not only on the central tendency of the data (median) but also on various quantile, in order to obtain complete information about that relationship. In this study, a quantile regression is developed with a nonparametric approach such as smoothing spline. Nonparametric approach is used if the prespecification model is difficult to determine, the relation between two variables follow the unknown function. We will apply that proposed method to poverty data. Here, we want to estimate the Percentage of Poor People as the response variable involving the Human Development Index (HDI) as the predictor variable.
Importance of spatial autocorrelation in modeling bird distributions at a continental scale
Bahn, V.; O'Connor, R.J.; Krohn, W.B.
2006-01-01
Spatial autocorrelation in species' distributions has been recognized as inflating the probability of a type I error in hypotheses tests, causing biases in variable selection, and violating the assumption of independence of error terms in models such as correlation or regression. However, it remains unclear whether these problems occur at all spatial resolutions and extents, and under which conditions spatially explicit modeling techniques are superior. Our goal was to determine whether spatial models were superior at large extents and across many different species. In addition, we investigated the importance of purely spatial effects in distribution patterns relative to the variation that could be explained through environmental conditions. We studied distribution patterns of 108 bird species in the conterminous United States using ten years of data from the Breeding Bird Survey. We compared the performance of spatially explicit regression models with non-spatial regression models using Akaike's information criterion. In addition, we partitioned the variance in species distributions into an environmental, a pure spatial and a shared component. The spatially-explicit conditional autoregressive regression models strongly outperformed the ordinary least squares regression models. In addition, partialling out the spatial component underlying the species' distributions showed that an average of 17% of the explained variation could be attributed to purely spatial effects independent of the spatial autocorrelation induced by the underlying environmental variables. We concluded that location in the range and neighborhood play an important role in the distribution of species. Spatially explicit models are expected to yield better predictions especially for mobile species such as birds, even in coarse-grained models with a large extent. ?? Ecography.
NASA Astrophysics Data System (ADS)
Stigter, T. Y.; Ribeiro, L.; Dill, A. M. M. Carvalho
2008-07-01
SummaryFactorial regression models, based on correspondence analysis, are built to explain the high nitrate concentrations in groundwater beneath an agricultural area in the south of Portugal, exceeding 300 mg/l, as a function of chemical variables, electrical conductivity (EC), land use and hydrogeological setting. Two important advantages of the proposed methodology are that qualitative parameters can be involved in the regression analysis and that multicollinearity is avoided. Regression is performed on eigenvectors extracted from the data similarity matrix, the first of which clearly reveals the impact of agricultural practices and hydrogeological setting on the groundwater chemistry of the study area. Significant correlation exists between response variable NO3- and explanatory variables Ca 2+, Cl -, SO42-, depth to water, aquifer media and land use. Substituting Cl - by the EC results in the most accurate regression model for nitrate, when disregarding the four largest outliers (model A). When built solely on land use and hydrogeological setting, the regression model (model B) is less accurate but more interesting from a practical viewpoint, as it is based on easily obtainable data and can be used to predict nitrate concentrations in groundwater in other areas with similar conditions. This is particularly useful for conservative contaminants, where risk and vulnerability assessment methods, based on assumed rather than established correlations, generally produce erroneous results. Another purpose of the models can be to predict the future evolution of nitrate concentrations under influence of changes in land use or fertilization practices, which occur in compliance with policies such as the Nitrates Directive. Model B predicts a 40% decrease in nitrate concentrations in groundwater of the study area, when horticulture is replaced by other land use with much lower fertilization and irrigation rates.
Explaining match outcome in elite Australian Rules football using team performance indicators.
Robertson, Sam; Back, Nicole; Bartlett, Jonathan D
2016-01-01
The relationships between team performance indicators and match outcome have been examined in many team sports, however are limited in Australian Rules football. Using data from the 2013 and 2014 Australian Football League (AFL) regular seasons, this study assessed the ability of commonly reported discrete team performance indicators presented in their relative form (standardised against their opposition for a given match) to explain match outcome (Win/Loss). Logistic regression and decision tree (chi-squared automatic interaction detection (CHAID)) analyses both revealed relative differences between opposing teams for "kicks" and "goal conversion" as the most influential in explaining match outcome, with two models achieving 88.3% and 89.8% classification accuracies, respectively. Models incorporating a smaller performance indicator set displayed a slightly reduced ability to explain match outcome (81.0% and 81.5% for logistic regression and CHAID, respectively). However, both were fit to 2014 data with reduced error in comparison to the full models. Despite performance similarities across the two analysis approaches, the CHAID model revealed multiple winning performance indicator profiles, thereby increasing its comparative feasibility for use in the field. Coaches and analysts may find these results useful in informing strategy and game plan development in Australian Rules football, with the development of team-specific models recommended in future.
Gaussian Process Regression Model in Spatial Logistic Regression
NASA Astrophysics Data System (ADS)
Sofro, A.; Oktaviarina, A.
2018-01-01
Spatial analysis has developed very quickly in the last decade. One of the favorite approaches is based on the neighbourhood of the region. Unfortunately, there are some limitations such as difficulty in prediction. Therefore, we offer Gaussian process regression (GPR) to accommodate the issue. In this paper, we will focus on spatial modeling with GPR for binomial data with logit link function. The performance of the model will be investigated. We will discuss the inference of how to estimate the parameters and hyper-parameters and to predict as well. Furthermore, simulation studies will be explained in the last section.
Snow, David P.
2016-01-01
This study investigates infants’ transition from nonverbal to verbal communication using evidence from regression patterns. As an example of regressions, prelinguistic infants learning American Sign Language (ASL) use pointing gestures to communicate. At the onset of single signs, however, these gestures disappear. Petitto (1987) attributed the regression to the children’s discovery that pointing has two functions, namely, deixis and linguistic pronouns. The 1:2 relation (1 form, 2 functions) violates the simple 1:1 pattern that infants are believed to expect. This kind of conflict, Petitto argued, explains the regression. Based on the additional observation that the regression coincided with the boundary between prelinguistic and linguistic communication, Petitto concluded that the prelinguistic and linguistic periods are autonomous. The purpose of the present study was to evaluate the 1:1 model and to determine whether it explains a previously reported regression of intonation in English. Background research showed that gestures and intonation have different forms but the same pragmatic meanings, a 2:1 form-function pattern that plausibly precipitates the regression. The hypothesis of the study was that gestures and intonation are closely related. Moreover, because gestures and intonation change in the opposite direction, the negative correlation between them indicates a robust inverse relationship. To test this prediction, speech samples of 29 infants (8 to 16 months) were analyzed acoustically and compared to parent-report data on several verbal and gestural scales. In support of the hypothesis, gestures alone were inversely correlated with intonation. In addition, the regression model explains nonlinearities stemming from different form-function configurations. However, the results failed to support the claim that regressions linked to early words or signs reflect autonomy. The discussion ends with a focus on the special role of intonation in children’s transition from “prelinguistic” communication to language. PMID:28729753
Workers' compensation costs among construction workers: a robust regression analysis.
Friedman, Lee S; Forst, Linda S
2009-11-01
Workers' compensation data are an important source for evaluating costs associated with construction injuries. We describe the characteristics of injured construction workers filing claims in Illinois between 2000 and 2005 and the factors associated with compensation costs using a robust regression model. In the final multivariable model, the cumulative percent temporary and permanent disability-measures of severity of injury-explained 38.7% of the variance of cost. Attorney costs explained only 0.3% of the variance of the dependent variable. The model used in this study clearly indicated that percent disability was the most important determinant of cost, although the method and uniformity of percent impairment allocation could be better elucidated. There is a need to integrate analytical methods that are suitable for skewed data when analyzing claim costs.
NASA Technical Reports Server (NTRS)
Stolzer, Alan J.; Halford, Carl
2007-01-01
In a previous study, multiple regression techniques were applied to Flight Operations Quality Assurance-derived data to develop parsimonious model(s) for fuel consumption on the Boeing 757 airplane. The present study examined several data mining algorithms, including neural networks, on the fuel consumption problem and compared them to the multiple regression results obtained earlier. Using regression methods, parsimonious models were obtained that explained approximately 85% of the variation in fuel flow. In general data mining methods were more effective in predicting fuel consumption. Classification and Regression Tree methods reported correlation coefficients of .91 to .92, and General Linear Models and Multilayer Perceptron neural networks reported correlation coefficients of about .99. These data mining models show great promise for use in further examining large FOQA databases for operational and safety improvements.
Regression Analysis of Physician Distribution to Identify Areas of Need: Some Preliminary Findings.
ERIC Educational Resources Information Center
Morgan, Bruce B.; And Others
A regression analysis was conducted of factors that help to explain the variance in physician distribution and which identify those factors that influence the maldistribution of physicians. Models were developed for different geographic areas to determine the most appropriate unit of analysis for the Western Missouri Area Health Education Center…
Statistical Power for a Simultaneous Test of Factorial and Predictive Invariance
ERIC Educational Resources Information Center
Olivera-Aguilar, Margarita; Millsap, Roger E.
2013-01-01
A common finding in studies of differential prediction across groups is that although regression slopes are the same or similar across groups, group differences exist in regression intercepts. Building on earlier work by Birnbaum (1979), Millsap (1998) presented an invariant factor model that would explain such intercept differences as arising due…
ERIC Educational Resources Information Center
Campbell, S. Duke; Greenberg, Barry
The development of a predictive equation capable of explaining a significant percentage of enrollment variability at Florida International University is described. A model utilizing trend analysis and a multiple regression approach to enrollment forecasting was adapted to investigate enrollment dynamics at the university. Four independent…
Students' Self-Regulation for Interaction with Others in Online Learning Environments
ERIC Educational Resources Information Center
Cho, Moon-Heum; Kim, B. Joon
2013-01-01
The purpose of this study was to explore variables explaining students' self-regulation (SR) for interaction with others, specifically peers and instructors, in online learning environments. A total of 407 students participated in the study. With hierarchical regression model (HRM), several variables were regressed on students' SR for interaction…
Kumar, Rajesh; Dogra, Vishal; Rani, Khushbu; Sahu, Kanti
2017-01-01
District level determinants of total fertility rate in Empowered Action Group states of India can help in ongoing population stabilization programs in India. Present study intends to assess the role of district level determinants in predicting total fertility rate among districts of the Empowered Action Group states of India. Data from Annual Health Survey (2011-12) was analysed using STATA and R software packages. Multiple linear regression models were built and evaluated using Akaike Information Criterion. For further understanding, recursive partitioning was used to prepare a regression tree. Female married illiteracy positively associated with total fertility rate and explained more than half (53%) of variance. Under multiple linear regression model, married illiteracy, infant mortality rate, Ante natal care registration, household size, median age of live birth and sex ratio explained 70% of total variance in total fertility rate. In regression tree, female married illiteracy was the root node and splits at 42% determined TFR <= 2.7. The next left side branch was again married illiteracy with splits at 23% to determine TFR <= 2.1. We conclude that female married illiteracy is one of the most important determinants explaining total fertility rate among the districts of an Empowered Action Group states. Focus on female literacy is required to stabilize the population growth in long run.
Henrard, S; Speybroeck, N; Hermans, C
2015-11-01
Haemophilia is a rare genetic haemorrhagic disease characterized by partial or complete deficiency of coagulation factor VIII, for haemophilia A, or IX, for haemophilia B. As in any other medical research domain, the field of haemophilia research is increasingly concerned with finding factors associated with binary or continuous outcomes through multivariable models. Traditional models include multiple logistic regressions, for binary outcomes, and multiple linear regressions for continuous outcomes. Yet these regression models are at times difficult to implement, especially for non-statisticians, and can be difficult to interpret. The present paper sought to didactically explain how, why, and when to use classification and regression tree (CART) analysis for haemophilia research. The CART method is non-parametric and non-linear, based on the repeated partitioning of a sample into subgroups based on a certain criterion. Breiman developed this method in 1984. Classification trees (CTs) are used to analyse categorical outcomes and regression trees (RTs) to analyse continuous ones. The CART methodology has become increasingly popular in the medical field, yet only a few examples of studies using this methodology specifically in haemophilia have to date been published. Two examples using CART analysis and previously published in this field are didactically explained in details. There is increasing interest in using CART analysis in the health domain, primarily due to its ease of implementation, use, and interpretation, thus facilitating medical decision-making. This method should be promoted for analysing continuous or categorical outcomes in haemophilia, when applicable. © 2015 John Wiley & Sons Ltd.
Albuquerque, F S; Peso-Aguiar, M C; Assunção-Albuquerque, M J T; Gálvez, L
2009-08-01
The length-weight relationship and condition factor have been broadly investigated in snails to obtain the index of physical condition of populations and evaluate habitat quality. Herein, our goal was to describe the best predictors that explain Achatina fulica biometrical parameters and well being in a recently introduced population. From November 2001 to November 2002, monthly snail samples were collected in Lauro de Freitas City, Bahia, Brazil. Shell length and total weight were measured in the laboratory and the potential curve and condition factor were calculated. Five environmental variables were considered: temperature range, mean temperature, humidity, precipitation and human density. Multiple regressions were used to generate models including multiple predictors, via model selection approach, and then ranked with AIC criteria. Partial regressions were used to obtain the separated coefficients of determination of climate and human density models. A total of 1.460 individuals were collected, presenting a shell length range between 4.8 to 102.5 mm (mean: 42.18 mm). The relationship between total length and total weight revealed that Achatina fulica presented a negative allometric growth. Simple regression indicated that humidity has a significant influence on A. fulica total length and weight. Temperature range was the main variable that influenced the condition factor. Multiple regressions showed that climatic and human variables explain a small proportion of the variance in shell length and total weight, but may explain up to 55.7% of the condition factor variance. Consequently, we believe that the well being and biometric parameters of A. fulica can be influenced by climatic and human density factors.
Ng, Kar Yong; Awang, Norhashidah
2018-01-06
Frequent haze occurrences in Malaysia have made the management of PM 10 (particulate matter with aerodynamic less than 10 μm) pollution a critical task. This requires knowledge on factors associating with PM 10 variation and good forecast of PM 10 concentrations. Hence, this paper demonstrates the prediction of 1-day-ahead daily average PM 10 concentrations based on predictor variables including meteorological parameters and gaseous pollutants. Three different models were built. They were multiple linear regression (MLR) model with lagged predictor variables (MLR1), MLR model with lagged predictor variables and PM 10 concentrations (MLR2) and regression with time series error (RTSE) model. The findings revealed that humidity, temperature, wind speed, wind direction, carbon monoxide and ozone were the main factors explaining the PM 10 variation in Peninsular Malaysia. Comparison among the three models showed that MLR2 model was on a same level with RTSE model in terms of forecasting accuracy, while MLR1 model was the worst.
We present here the application of PLS regression to predicting surface water total phosphorous, total ammonia and Escherichia coli from landscape metrics. The amount of variability in surface water constituents explained by each model reflects the composition of the contributi...
Forest dynamics to precipitation and temperature in the Gulf of Mexico coastal region.
Li, Tianyu; Meng, Qingmin
2017-05-01
The forest is one of the most significant components of the Gulf of Mexico (GOM) coast. It provides livelihood to inhabitant and is known to be sensitive to climatic fluctuations. This study focuses on examining the impacts of temperature and precipitation variations on coastal forest. Two different regression methods, ordinary least squares (OLS) and geographically weighted regression (GWR), were employed to reveal the relationship between meteorological variables and forest dynamics. OLS regression analysis shows that changes in precipitation and temperature, over a span of 12 months, are responsible for 56% of NDVI variation. The forest, which is not particularly affected by the average monthly precipitation in most months, is observed to be affected by cumulative seasonal and annual precipitation explicitly. Temperature and precipitation almost equally impact on NDVI changes; about 50% of the NDVI variations is explained in OLS modeling, and about 74% of the NDVI variations is explained in GWR modeling. GWR analysis indicated that both precipitation and temperature characterize the spatial heterogeneity patterns of forest dynamics.
Forest dynamics to precipitation and temperature in the Gulf of Mexico coastal region
NASA Astrophysics Data System (ADS)
Li, Tianyu; Meng, Qingmin
2017-05-01
The forest is one of the most significant components of the Gulf of Mexico (GOM) coast. It provides livelihood to inhabitant and is known to be sensitive to climatic fluctuations. This study focuses on examining the impacts of temperature and precipitation variations on coastal forest. Two different regression methods, ordinary least squares (OLS) and geographically weighted regression (GWR), were employed to reveal the relationship between meteorological variables and forest dynamics. OLS regression analysis shows that changes in precipitation and temperature, over a span of 12 months, are responsible for 56% of NDVI variation. The forest, which is not particularly affected by the average monthly precipitation in most months, is observed to be affected by cumulative seasonal and annual precipitation explicitly. Temperature and precipitation almost equally impact on NDVI changes; about 50% of the NDVI variations is explained in OLS modeling, and about 74% of the NDVI variations is explained in GWR modeling. GWR analysis indicated that both precipitation and temperature characterize the spatial heterogeneity patterns of forest dynamics.
Sunkara, Vasu; Hébert, James R.
2015-01-01
BACKGROUND Disparities in cancer screening, incidence, treatment, and survival are worsening globally. The mortality-to-incidence ratio (MIR) has been used previously to evaluate such disparities. METHODS The MIR for colorectal cancer is calculated for all Organisation for Economic Cooperation and Development (OECD) countries using the 2012 GLOBOCAN incidence and mortality statistics. Health system rankings were obtained from the World Health Organization. Two linear regression models were fit with the MIR as the dependent variable and health system ranking as the independent variable; one included all countries and one model had the “divergents” removed. RESULTS The regression model for all countries explained 24% of the total variance in the MIR. Nine countries were found to have regression-calculated MIRs that differed from the actual MIR by >20%. Countries with lower-than-expected MIRs were found to have strong national health systems characterized by formal colorectal cancer screening programs. Conversely, countries with higher-than-expected MIRs lack screening programs. When these divergent points were removed from the data set, the recalculated regression model explained 60% of the total variance in the MIR. CONCLUSIONS The MIR proved useful for identifying disparities in cancer screening and treatment internationally. It has potential as an indicator of the long-term success of cancer surveillance programs and may be extended to other cancer types for these purposes. PMID:25572676
Sunkara, Vasu; Hébert, James R
2015-05-15
Disparities in cancer screening, incidence, treatment, and survival are worsening globally. The mortality-to-incidence ratio (MIR) has been used previously to evaluate such disparities. The MIR for colorectal cancer is calculated for all Organisation for Economic Cooperation and Development (OECD) countries using the 2012 GLOBOCAN incidence and mortality statistics. Health system rankings were obtained from the World Health Organization. Two linear regression models were fit with the MIR as the dependent variable and health system ranking as the independent variable; one included all countries and one model had the "divergents" removed. The regression model for all countries explained 24% of the total variance in the MIR. Nine countries were found to have regression-calculated MIRs that differed from the actual MIR by >20%. Countries with lower-than-expected MIRs were found to have strong national health systems characterized by formal colorectal cancer screening programs. Conversely, countries with higher-than-expected MIRs lack screening programs. When these divergent points were removed from the data set, the recalculated regression model explained 60% of the total variance in the MIR. The MIR proved useful for identifying disparities in cancer screening and treatment internationally. It has potential as an indicator of the long-term success of cancer surveillance programs and may be extended to other cancer types for these purposes. © 2015 American Cancer Society.
Prediction of performance on the RCMP physical ability requirement evaluation.
Stanish, H I; Wood, T M; Campagna, P
1999-08-01
The Royal Canadian Mounted Police use the Physical Ability Requirement Evaluation (PARE) for screening applicants. The purposes of this investigation were to identify those field tests of physical fitness that were associated with PARE performance and determine which most accurately classified successful and unsuccessful PARE performers. The participants were 27 female and 21 male volunteers. Testing included measures of aerobic power, anaerobic power, agility, muscular strength, muscular endurance, and body composition. Multiple regression analysis revealed a three-variable model for males (70-lb bench press, standing long jump, and agility) explaining 79% of the variability in PARE time, whereas a one-variable model (agility) explained 43% of the variability for females. Analysis of the classification accuracy of the males' data was prohibited because 91% of the males passed the PARE. Classification accuracy of the females' data, using logistic regression, produced a two-variable model (agility, 1.5-mile endurance run) with 93% overall classification accuracy.
Kumar, Rajesh; Dogra, Vishal; Rani, Khushbu; Sahu, Kanti
2017-01-01
Background: District level determinants of total fertility rate in Empowered Action Group states of India can help in ongoing population stabilization programs in India. Objective: Present study intends to assess the role of district level determinants in predicting total fertility rate among districts of the Empowered Action Group states of India. Material and Methods: Data from Annual Health Survey (2011-12) was analysed using STATA and R software packages. Multiple linear regression models were built and evaluated using Akaike Information Criterion. For further understanding, recursive partitioning was used to prepare a regression tree. Results: Female married illiteracy positively associated with total fertility rate and explained more than half (53%) of variance. Under multiple linear regression model, married illiteracy, infant mortality rate, Ante natal care registration, household size, median age of live birth and sex ratio explained 70% of total variance in total fertility rate. In regression tree, female married illiteracy was the root node and splits at 42% determined TFR <= 2.7. The next left side branch was again married illiteracy with splits at 23% to determine TFR <= 2.1. Conclusion: We conclude that female married illiteracy is one of the most important determinants explaining total fertility rate among the districts of an Empowered Action Group states. Focus on female literacy is required to stabilize the population growth in long run. PMID:29416999
ERIC Educational Resources Information Center
Woolley, Kristin K.
Many researchers are unfamiliar with suppressor variables and how they operate in multiple regression analyses. This paper describes the role suppressor variables play in a multiple regression model and provides practical examples that explain how they can change research results. A variable that when added as another predictor increases the total…
Lawrence, Stephen J.
2012-01-01
Regression analyses show that E. coli density in samples was strongly related to turbidity, streamflow characteristics, and season at both sites. The regression equation chosen for the Norcross data showed that 78 percent of the variability in E. coli density (in log base 10 units) was explained by the variability in turbidity values (in log base 10 units), streamflow event (dry-weather flow or stormflow), season (cool or warm), and an interaction term that is the cross product of streamflow event and turbidity. The regression equation chosen for the Atlanta data showed that 76 percent of the variability in E. coli density (in log base 10 units) was explained by the variability in turbidity values (in log base 10 units), water temperature, streamflow event, and an interaction term that is the cross product of streamflow event and turbidity. Residual analysis and model confirmation using new data indicated the regression equations selected at both sites predicted E. coli density within the 90 percent prediction intervals of the equations and could be used to predict E. coli density in real time at both sites.
Detection of epistatic effects with logic regression and a classical linear regression model.
Malina, Magdalena; Ickstadt, Katja; Schwender, Holger; Posch, Martin; Bogdan, Małgorzata
2014-02-01
To locate multiple interacting quantitative trait loci (QTL) influencing a trait of interest within experimental populations, usually methods as the Cockerham's model are applied. Within this framework, interactions are understood as the part of the joined effect of several genes which cannot be explained as the sum of their additive effects. However, if a change in the phenotype (as disease) is caused by Boolean combinations of genotypes of several QTLs, this Cockerham's approach is often not capable to identify them properly. To detect such interactions more efficiently, we propose a logic regression framework. Even though with the logic regression approach a larger number of models has to be considered (requiring more stringent multiple testing correction) the efficient representation of higher order logic interactions in logic regression models leads to a significant increase of power to detect such interactions as compared to a Cockerham's approach. The increase in power is demonstrated analytically for a simple two-way interaction model and illustrated in more complex settings with simulation study and real data analysis.
Predicting daily use of urban forest recreation sites
John F. Dwyer
1988-01-01
A multiple linear regression model explains 90% of the variance in daily use of an urban recreation site. Explanatory variables include season, day of the week, and weather. The results offer guides for recreation site planning and management as well as suggestions for improving the model.
NASA Technical Reports Server (NTRS)
Murphy, M. R.; Awe, C. A.
1986-01-01
Six professionally active, retired captains rated the coordination and decisionmaking performances of sixteen aircrews while viewing videotapes of a simulated commercial air transport operation. The scenario featured a required diversion and a probable minimum fuel situation. Seven point Likert-type scales were used in rating variables on the basis of a model of crew coordination and decisionmaking. The variables were based on concepts of, for example, decision difficulty, efficiency, and outcome quality; and leader-subordin ate concepts such as person and task-oriented leader behavior, and competency motivation of subordinate crewmembers. Five-front-end variables of the model were in turn dependent variables for a hierarchical regression procedure. The variance in safety performance was explained 46%, by decision efficiency, command reversal, and decision quality. The variance of decision quality, an alternative substantive dependent variable to safety performance, was explained 60% by decision efficiency and the captain's quality of within-crew communications. The variance of decision efficiency, crew coordination, and command reversal were in turn explained 78%, 80%, and 60% by small numbers of preceding independent variables. A principle component, varimax factor analysis supported the model structure suggested by regression analyses.
Developing and testing a global-scale regression model to quantify mean annual streamflow
NASA Astrophysics Data System (ADS)
Barbarossa, Valerio; Huijbregts, Mark A. J.; Hendriks, A. Jan; Beusen, Arthur H. W.; Clavreul, Julie; King, Henry; Schipper, Aafke M.
2017-01-01
Quantifying mean annual flow of rivers (MAF) at ungauged sites is essential for assessments of global water supply, ecosystem integrity and water footprints. MAF can be quantified with spatially explicit process-based models, which might be overly time-consuming and data-intensive for this purpose, or with empirical regression models that predict MAF based on climate and catchment characteristics. Yet, regression models have mostly been developed at a regional scale and the extent to which they can be extrapolated to other regions is not known. In this study, we developed a global-scale regression model for MAF based on a dataset unprecedented in size, using observations of discharge and catchment characteristics from 1885 catchments worldwide, measuring between 2 and 106 km2. In addition, we compared the performance of the regression model with the predictive ability of the spatially explicit global hydrological model PCR-GLOBWB by comparing results from both models to independent measurements. We obtained a regression model explaining 89% of the variance in MAF based on catchment area and catchment averaged mean annual precipitation and air temperature, slope and elevation. The regression model performed better than PCR-GLOBWB for the prediction of MAF, as root-mean-square error (RMSE) values were lower (0.29-0.38 compared to 0.49-0.57) and the modified index of agreement (d) was higher (0.80-0.83 compared to 0.72-0.75). Our regression model can be applied globally to estimate MAF at any point of the river network, thus providing a feasible alternative to spatially explicit process-based global hydrological models.
Can We Use Regression Modeling to Quantify Mean Annual Streamflow at a Global-Scale?
NASA Astrophysics Data System (ADS)
Barbarossa, V.; Huijbregts, M. A. J.; Hendriks, J. A.; Beusen, A.; Clavreul, J.; King, H.; Schipper, A.
2016-12-01
Quantifying mean annual flow of rivers (MAF) at ungauged sites is essential for a number of applications, including assessments of global water supply, ecosystem integrity and water footprints. MAF can be quantified with spatially explicit process-based models, which might be overly time-consuming and data-intensive for this purpose, or with empirical regression models that predict MAF based on climate and catchment characteristics. Yet, regression models have mostly been developed at a regional scale and the extent to which they can be extrapolated to other regions is not known. In this study, we developed a global-scale regression model for MAF using observations of discharge and catchment characteristics from 1,885 catchments worldwide, ranging from 2 to 106 km2 in size. In addition, we compared the performance of the regression model with the predictive ability of the spatially explicit global hydrological model PCR-GLOBWB [van Beek et al., 2011] by comparing results from both models to independent measurements. We obtained a regression model explaining 89% of the variance in MAF based on catchment area, mean annual precipitation and air temperature, average slope and elevation. The regression model performed better than PCR-GLOBWB for the prediction of MAF, as root-mean-square error values were lower (0.29 - 0.38 compared to 0.49 - 0.57) and the modified index of agreement was higher (0.80 - 0.83 compared to 0.72 - 0.75). Our regression model can be applied globally at any point of the river network, provided that the input parameters are within the range of values employed in the calibration of the model. The performance is reduced for water scarce regions and further research should focus on improving such an aspect for regression-based global hydrological models.
Boosted Regression Tree Models to Explain Watershed ...
Boosted regression tree (BRT) models were developed to quantify the nonlinear relationships between landscape variables and nutrient concentrations in a mesoscale mixed land cover watershed during base-flow conditions. Factors that affect instream biological components, based on the Index of Biotic Integrity (IBI), were also analyzed. Seasonal BRT models at two spatial scales (watershed and riparian buffered area [RBA]) for nitrite-nitrate (NO2-NO3), total Kjeldahl nitrogen, and total phosphorus (TP) and annual models for the IBI score were developed. Two primary factors — location within the watershed (i.e., geographic position, stream order, and distance to a downstream confluence) and percentage of urban land cover (both scales) — emerged as important predictor variables. Latitude and longitude interacted with other factors to explain the variability in summer NO2-NO3 concentrations and IBI scores. BRT results also suggested that location might be associated with indicators of sources (e.g., land cover), runoff potential (e.g., soil and topographic factors), and processes not easily represented by spatial data indicators. Runoff indicators (e.g., Hydrological Soil Group D and Topographic Wetness Indices) explained a substantial portion of the variability in nutrient concentrations as did point sources for TP in the summer months. The results from our BRT approach can help prioritize areas for nutrient management in mixed-use and heavily impacted watershed
Rocha, R R A; Thomaz, S M; Carvalho, P; Gomes, L C
2009-06-01
The need for prediction is widely recognized in limnology. In this study, data from 25 lakes of the Upper Paraná River floodplain were used to build models to predict chlorophyll-a and dissolved oxygen concentrations. Akaike's information criterion (AIC) was used as a criterion for model selection. Models were validated with independent data obtained in the same lakes in 2001. Predictor variables that significantly explained chlorophyll-a concentration were pH, electrical conductivity, total seston (positive correlation) and nitrate (negative correlation). This model explained 52% of chlorophyll variability. Variables that significantly explained dissolved oxygen concentration were pH, lake area and nitrate (all positive correlations); water temperature and electrical conductivity were negatively correlated with oxygen. This model explained 54% of oxygen variability. Validation with independent data showed that both models had the potential to predict algal biomass and dissolved oxygen concentration in these lakes. These findings suggest that multiple regression models are valuable and practical tools for understanding the dynamics of ecosystems and that predictive limnology may still be considered a powerful approach in aquatic ecology.
Tracking and Explaining Credit-Hour Completion
ERIC Educational Resources Information Center
Kwenda, Maxwell Ndigume
2014-01-01
This study highlights factors associated with changes in earned hours for two cohorts of incoming freshmen during their first year. The objectives of this study are twofold: (a) to derive model(s) regressing the cumulative hours earned and differential hours earned on student demographic, socioeconomic, and academic characteristics; and (b) to…
Factor complexity of crash occurrence: An empirical demonstration using boosted regression trees.
Chung, Yi-Shih
2013-12-01
Factor complexity is a characteristic of traffic crashes. This paper proposes a novel method, namely boosted regression trees (BRT), to investigate the complex and nonlinear relationships in high-variance traffic crash data. The Taiwanese 2004-2005 single-vehicle motorcycle crash data are used to demonstrate the utility of BRT. Traditional logistic regression and classification and regression tree (CART) models are also used to compare their estimation results and external validities. Both the in-sample cross-validation and out-of-sample validation results show that an increase in tree complexity provides improved, although declining, classification performance, indicating a limited factor complexity of single-vehicle motorcycle crashes. The effects of crucial variables including geographical, time, and sociodemographic factors explain some fatal crashes. Relatively unique fatal crashes are better approximated by interactive terms, especially combinations of behavioral factors. BRT models generally provide improved transferability than conventional logistic regression and CART models. This study also discusses the implications of the results for devising safety policies. Copyright © 2012 Elsevier Ltd. All rights reserved.
Almalki, Mohammed J; FitzGerald, Gerry; Clark, Michele
2012-09-12
Quality of work life (QWL) has been found to influence the commitment of health professionals, including nurses. However, reliable information on QWL and turnover intention of primary health care (PHC) nurses is limited. The aim of this study was to examine the relationship between QWL and turnover intention of PHC nurses in Saudi Arabia. A cross-sectional survey was used in this study. Data were collected using Brooks' survey of Quality of Nursing Work Life, the Anticipated Turnover Scale and demographic data questions. A total of 508 PHC nurses in the Jazan Region, Saudi Arabia, completed the questionnaire (RR = 87%). Descriptive statistics, t-test, ANOVA, General Linear Model (GLM) univariate analysis, standard multiple regression, and hierarchical multiple regression were applied for analysis using SPSS v17 for Windows. Findings suggested that the respondents were dissatisfied with their work life, with almost 40% indicating a turnover intention from their current PHC centres. Turnover intention was significantly related to QWL. Using standard multiple regression, 26% of the variance in turnover intention was explained by QWL, p < 0.001, with R2 = .263. Further analysis using hierarchical multiple regression found that the total variance explained by the model as a whole (demographics and QWL) was 32.1%, p < 0.001. QWL explained an additional 19% of the variance in turnover intention, after controlling for demographic variables. Creating and maintaining a healthy work life for PHC nurses is very important to improve their work satisfaction, reduce turnover, enhance productivity and improve nursing care outcomes.
2012-01-01
Background Quality of work life (QWL) has been found to influence the commitment of health professionals, including nurses. However, reliable information on QWL and turnover intention of primary health care (PHC) nurses is limited. The aim of this study was to examine the relationship between QWL and turnover intention of PHC nurses in Saudi Arabia. Methods A cross-sectional survey was used in this study. Data were collected using Brooks’ survey of Quality of Nursing Work Life, the Anticipated Turnover Scale and demographic data questions. A total of 508 PHC nurses in the Jazan Region, Saudi Arabia, completed the questionnaire (RR = 87%). Descriptive statistics, t-test, ANOVA, General Linear Model (GLM) univariate analysis, standard multiple regression, and hierarchical multiple regression were applied for analysis using SPSS v17 for Windows. Results Findings suggested that the respondents were dissatisfied with their work life, with almost 40% indicating a turnover intention from their current PHC centres. Turnover intention was significantly related to QWL. Using standard multiple regression, 26% of the variance in turnover intention was explained by QWL, p < 0.001, with R2 = .263. Further analysis using hierarchical multiple regression found that the total variance explained by the model as a whole (demographics and QWL) was 32.1%, p < 0.001. QWL explained an additional 19% of the variance in turnover intention, after controlling for demographic variables. Conclusions Creating and maintaining a healthy work life for PHC nurses is very important to improve their work satisfaction, reduce turnover, enhance productivity and improve nursing care outcomes. PMID:22970764
Boshoff, Magdalena; De Jonge, Maarten; Scheifler, Renaud; Bervoets, Lieven
2014-09-15
The aim of this study was to derive regression-based soil-plant models to predict and compare metal(loid) (i.e. As, Cd, Cu, Pb and Zn) concentrations in plants (grass Agrostis sp./Poa sp. and nettle Urtica dioica L.) among sites with a wide range of metal pollution and a wide variation in soil properties. Regression models were based on the pseudo total (aqua-regia) and exchangeable (0.01 M CaCl2) soil metal concentrations. Plant metal concentrations were best explained by the pseudo total soil metal concentrations in combination with soil properties. The most important soil property that influenced U. dioica metal concentrations was the clay content, while for grass organic matter (OM) and pH affected the As (OM) and Cu and Zn (pH). In this study multiple linear regression models proved functional in predicting metal accumulation in plants on a regional scale. With the proposed models based on the pseudo total metal concentration, the percentage of variation explained for the metals As, Cd, Cu, Pb and Zn were 0.56%, 0.47%, 0.59%, 0.61%, 0.30% in nettle and 0.46%, 0.38%, 0.27%, 0.50%, 0.28% in grass. Copyright © 2014 Elsevier B.V. All rights reserved.
1987-09-01
Edition,. Fail 1986. 33. Neter, John et al. Applied Linear Regression MoceL. Homewood IL: Richard D. Irwin, Incorporated, iJ83. 34. NovicK, David... Linear Regression Models (33) then, for each sample observation (X fh, the method of least squares considers the deviation of Yubms from its expected value...for finding good estimators of b - b5 * In -2raer to explain the procedure, the model Yubms = b0 + b!xfh will be discussed. According to Applied
NASA Technical Reports Server (NTRS)
Dome, G. J.; Fung, A. K.; Moore, R. K.
1977-01-01
Several regression models were tested to explain the wind direction dependence of the 1975 JONSWAP (Joint North Sea Wave Project) scatterometer data. The models consider the radar backscatter as a harmonic function of wind direction. The constant term accounts for the major effect of wind speed and the sinusoidal terms for the effects of direction. The fundamental accounts for the difference in upwind and downwind returns, while the second harmonic explains the upwind-crosswind difference. It is shown that a second harmonic model appears to adequately explain the angular variation. A simple inversion technique, which uses two orthogonal scattering measurements, is also described which eliminates the effect of wind speed and direction. Vertical polarization was shown to be more effective in determining both wind speed and direction than horizontal polarization.
Explaining cross-national differences in marriage, cohabitation, and divorce in Europe, 1990-2000.
Kalmijn, Matthijs
2007-11-01
European countries differ considerably in their marriage patterns. The study presented in this paper describes these differences for the 1990s and attempts to explain them from a macro-level perspective. We find that different indicators of marriage (i.e., marriage rate, age at marriage, divorce rate, and prevalence of unmarried cohabitation) cannot be seen as indicators of an underlying concept such as the 'strength of marriage'. Multivariate ordinary least squares (OLS) regression analyses are estimated with countries as units and panel regression models are estimated in which annual time series for multiple countries are pooled. Using these models, we find that popular explanations of trends in the indicators - explanations that focus on gender roles, secularization, unemployment, and educational expansion - are also important for understanding differences among countries. We also find evidence for the role of historical continuity and societal disintegration in understanding cross-national differences.
Mehra, Lucky K; Cowger, Christina; Gross, Kevin; Ojiambo, Peter S
2016-01-01
Pre-planting factors have been associated with the late-season severity of Stagonospora nodorum blotch (SNB), caused by the fungal pathogen Parastagonospora nodorum, in winter wheat (Triticum aestivum). The relative importance of these factors in the risk of SNB has not been determined and this knowledge can facilitate disease management decisions prior to planting of the wheat crop. In this study, we examined the performance of multiple regression (MR) and three machine learning algorithms namely artificial neural networks, categorical and regression trees, and random forests (RF), in predicting the pre-planting risk of SNB in wheat. Pre-planting factors tested as potential predictor variables were cultivar resistance, latitude, longitude, previous crop, seeding rate, seed treatment, tillage type, and wheat residue. Disease severity assessed at the end of the growing season was used as the response variable. The models were developed using 431 disease cases (unique combinations of predictors) collected from 2012 to 2014 and these cases were randomly divided into training, validation, and test datasets. Models were evaluated based on the regression of observed against predicted severity values of SNB, sensitivity-specificity ROC analysis, and the Kappa statistic. A strong relationship was observed between late-season severity of SNB and specific pre-planting factors in which latitude, longitude, wheat residue, and cultivar resistance were the most important predictors. The MR model explained 33% of variability in the data, while machine learning models explained 47 to 79% of the total variability. Similarly, the MR model correctly classified 74% of the disease cases, while machine learning models correctly classified 81 to 83% of these cases. Results show that the RF algorithm, which explained 79% of the variability within the data, was the most accurate in predicting the risk of SNB, with an accuracy rate of 93%. The RF algorithm could allow early assessment of the risk of SNB, facilitating sound disease management decisions prior to planting of wheat.
Modeling stream network-scale variation in Coho salmon overwinter survival and smolt size
Joseph L. Ebersole; Mike E. Colvin; Parker J. Wigington; Scott G. Leibowitz; Joan P. Baker; Jana E. Compton; Bruce A. Miller; Michael A. Carins; Bruce P. Hansen; Henry R. La Vigne
2009-01-01
We used multiple regression and hierarchical mixed-effects models to examine spatial patterns of overwinter survival and size at smolting in juvenile coho salmon Oncorhynchus kisutch in relation to habitat attributes across an extensive stream network in southwestern Oregon over 3 years. Contributing basin area explained the majority of spatial...
LANDSCAPE METRICS THAT ARE USEFUL FOR EXPLAINING ESTUARINE ECOLOGICAL RESPONSES
We investigated whether land use/cover characteristics of watersheds associated with estuaries exhibit a strong enough signal to make landscape metrics useful for predicting estuarine ecological condition. We used multivariate logistic regression models to discriminate between su...
ERIC Educational Resources Information Center
Bergee, Martin J.; Westfall, Claude R.
2005-01-01
This is the third study in a line of inquiry whose purpose has been to develop a theoretical model of selected extra musical variables' influence on solo and small-ensemble festival ratings. Authors of the second of these (Bergee & McWhirter, 2005) had used binomial logistic regression as the basis for their model-formulation strategy. Their…
Model C Is Feasible for ESEA Title I Evaluation.
ERIC Educational Resources Information Center
Echternacht, Gary
The assertion that Model C is feasible for Elementary Secondary Education Act Title I evaluation, why it is feasible, and reasons why it is so seldom used are explained. Two assumptions must be made to use the special regression model. First, a strict cut-off must be used on the pretest to assign students to Title I and comparison groups. Second,…
First-year growth, recruitment, and maturity of walleyes in western Lake Erie
Madenjian, Charles P.; Tyson, Jeffrey T.; Knight, Roger L.; Kershner, Mark W.; Hansen, Michael J.
1996-01-01
In some lakes, first-year growth of walleyes Stizostedion vitreum has been identified as an important factor governing recruitment of juveniles to the adult population. We developed a regression model for walleye recruitment in western Lake Erie by considering factors such as first-year growth, size of the spawning stock, the rate at which the lake warmed during the spring, and abundance of gizzard shad Dorosoma cepedianum. Gizzard shad abundance during the fall prior to spring walleye spawning explained over 40% of the variation in walleye recruitment. Gizzard shad are relatively high in lipids and are preferred prey for walleyes in Lake Erie. Therefore, the high degree of correlation between shad abundance and subsequent walleye recruitment supported the contention that mature females needed adequate lipid reserves during the winter to spawn the following spring. According to the regression analysis, spring warming rate and size of the parental stock also influenced walleye recruitment. Our regression model explained 92% of the variation in recruitment of age-2 fish into the Lake Erie walleye population from 1981 to 1993. The regression model is potentially valuable as a management tool because it could be used to forecast walleye recruitment to the fishery 2 years in advance. First-year growth was poorly correlated with recruitment, which may reflect the unusually low incidence of walleye cannibalism in western Lake Erie. In contrast, first-year growth was strongly linked to age at maturity.
Cardiac surgery productivity and throughput improvements.
Lehtonen, Juha-Matti; Kujala, Jaakko; Kouri, Juhani; Hippeläinen, Mikko
2007-01-01
The high variability in cardiac surgery length--is one of the main challenges for staff managing productivity. This study aims to evaluate the impact of six interventions on open-heart surgery operating theatre productivity. A discrete operating theatre event simulation model with empirical operation time input data from 2603 patients is used to evaluate the effect that these process interventions have on the surgery output and overtime work. A linear regression model was used to get operation time forecasts for surgery scheduling while it also could be used to explain operation time. A forecasting model based on the linear regression of variables available before the surgery explains 46 per cent operating time variance. The main factors influencing operation length were type of operation, redoing the operation and the head surgeon. Reduction of changeover time between surgeries by inducing anaesthesia outside an operating theatre and by reducing slack time at the end of day after a second surgery have the strongest effects on surgery output and productivity. A more accurate operation time forecast did not have any effect on output, although improved operation time forecast did decrease overtime work. A reduction in the operation time itself is not studied in this article. However, the forecasting model can also be applied to discover which factors are most significant in explaining variation in the length of open-heart surgery. The challenge in scheduling two open-heart surgeries in one day can be partly resolved by increasing the length of the day, decreasing the time between two surgeries or by improving patient scheduling procedures so that two short surgeries can be paired. A linear regression model is created in the paper to increase the accuracy of operation time forecasting and to identify factors that have the most influence on operation time. A simulation model is used to analyse the impact of improved surgical length forecasting and five selected process interventions on productivity in cardiac surgery.
77 FR 3121 - Program Integrity: Gainful Employment-Debt Measures; Correction
Federal Register 2010, 2011, 2012, 2013, 2014
2012-01-23
...On June 13, 2011, the Secretary of Education (Secretary) published a notice of final regulations in the Federal Register for Program Integrity: Gainful Employment--Debt Measures (Gainful Employment--Debt Measures) (76 FR 34386). In the preamble of the final regulations, we used the wrong data to calculate the percent of total variance in institutions' repayment rates that may be explained by race/ethnicity. Our intent was to use the data that included all minority students per institution. However, we mistakenly used the data for a subset of minority students per institution. We have now recalculated the total variance using the data that includes all minority students. Through this document, we correct, in the preamble of the Gainful Employment--Debt Measures final regulations, the errors resulting from this misapplication. We do not change the regression analysis model itself; we are using the same model with the appropriate data. Through this notice we also correct, in the preamble of the Gainful Employment--Debt Measures final regulations, our description of one component of the regression analysis. The preamble referred to use of an institutional variable measuring acceptance rates. This description was incorrect; in fact we used an institutional variable measuring retention rates. Correcting this language does not change the regression analysis model itself or the variance explained by the model. The text of the final regulations remains unchanged.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Nimbalkar, Sachin U.; Wenning, Thomas J.; Guo, Wei
In the United States, manufacturing facilities account for about 32% of total domestic energy consumption in 2014. Robust energy tracking methodologies are critical to understanding energy performance in manufacturing facilities. Due to its simplicity and intuitiveness, the classic energy intensity method (i.e. the ratio of total energy use over total production) is the most widely adopted. However, the classic energy intensity method does not take into account the variation of other relevant parameters (i.e. product type, feed stock type, weather, etc.). Furthermore, the energy intensity method assumes that the facilities’ base energy consumption (energy use at zero production) is zero,more » which rarely holds true. Therefore, it is commonly recommended to utilize regression models rather than the energy intensity approach for tracking improvements at the facility level. Unfortunately, many energy managers have difficulties understanding why regression models are statistically better than utilizing the classic energy intensity method. While anecdotes and qualitative information may convince some, many have major reservations about the accuracy of regression models and whether it is worth the time and effort to gather data and build quality regression models. This paper will explain why regression models are theoretically and quantitatively more accurate for tracking energy performance improvements. Based on the analysis of data from 114 manufacturing plants over 12 years, this paper will present quantitative results on the importance of utilizing regression models over the energy intensity methodology. This paper will also document scenarios where regression models do not have significant relevance over the energy intensity method.« less
Montaño, Daniel E; Tshimanga, Mufuta; Hamilton, Deven T; Gorn, Gerald; Kasprzyk, Danuta
2018-02-01
Slow adult male circumcision uptake is one factor leading some to recommend increased priority for infant male circumcision (IMC) in sub-Saharan African countries. This research, guided by the integrated behavioral model (IBM), was carried out to identify key beliefs that best explain Zimbabwean parents' motivation to have their infant sons circumcised. A quantitative survey, designed from qualitative elicitation study results, was administered to independent representative samples of 800 expectant mothers and 795 expectant fathers in two urban and two rural areas in Zimbabwe. Multiple regression analyses found IMC motivation among fathers was explained by instrumental attitude, descriptive norm and self-efficacy; while motivation among mothers was explained by instrumental attitude, injunctive norm, descriptive norm, self-efficacy, and perceived control. Regression analyses of beliefs underlying IBM constructs found some overlap but many differences in key beliefs explaining IMC motivation among mothers and fathers. We found differences in key beliefs among urban and rural parents. Urban fathers' IMC motivation was explained best by behavioral beliefs, while rural fathers' motivation was explained by both behavioral and efficacy beliefs. Urban mothers' IMC motivation was explained primarily by behavioral and normative beliefs, while rural mothers' motivation was explained mostly by behavioral beliefs. The key beliefs we identified should serve as targets for developing messages to improve demand and maximize parent uptake as IMC programs are rolled out. These targets need to be different among urban and rural expectant mothers and fathers.
Accounting for estimated IQ in neuropsychological test performance with regression-based techniques.
Testa, S Marc; Winicki, Jessica M; Pearlson, Godfrey D; Gordon, Barry; Schretlen, David J
2009-11-01
Regression-based normative techniques account for variability in test performance associated with multiple predictor variables and generate expected scores based on algebraic equations. Using this approach, we show that estimated IQ, based on oral word reading, accounts for 1-9% of the variability beyond that explained by individual differences in age, sex, race, and years of education for most cognitive measures. These results confirm that adding estimated "premorbid" IQ to demographic predictors in multiple regression models can incrementally improve the accuracy with which regression-based norms (RBNs) benchmark expected neuropsychological test performance in healthy adults. It remains to be seen whether the incremental variance in test performance explained by estimated "premorbid" IQ translates to improved diagnostic accuracy in patient samples. We describe these methods, and illustrate the step-by-step application of RBNs with two cases. We also discuss the rationale, assumptions, and caveats of this approach. More broadly, we note that adjusting test scores for age and other characteristics might actually decrease the accuracy with which test performance predicts absolute criteria, such as the ability to drive or live independently.
Barnes, J C; Boutwell, Brian B; Miller, J Mitchell; DeShay, Rashaan A; Beaver, Kevin M; White, Norman
2016-01-01
To examine whether differential exposure to pre- and perinatal risk factors explained differences in levels of self-regulation between children of different races (White, Black, Hispanic, Asian, and Other). Multiple regression models based on data from the Early Childhood Longitudinal Study, Birth Cohort (n ≈ 9,850) were used to analyze the impact of pre- and perinatal risk factors on the development of self-regulation at age 2 years. Racial differences in levels of self-regulation were observed. Racial differences were also observed for 9 of the 12 pre-/perinatal risk factors. Multiple regression analyses revealed that a portion of the racial differences in self-regulation was explained by differential exposure to several of the pre-/perinatal risk factors. Specifically, maternal age at childbirth, gestational timing, and the family's socioeconomic status were significantly related to the child's level of self-regulation. These factors accounted for a statistically significant portion of the racial differences observed in self-regulation. The findings indicate racial differences in self-regulation may be, at least partially, explained by racial differences in exposure to pre- and perinatal risk factors.
Isolating and Examining Sources of Suppression and Multicollinearity in Multiple Linear Regression.
Beckstead, Jason W
2012-03-30
The presence of suppression (and multicollinearity) in multiple regression analysis complicates interpretation of predictor-criterion relationships. The mathematical conditions that produce suppression in regression analysis have received considerable attention in the methodological literature but until now nothing in the way of an analytic strategy to isolate, examine, and remove suppression effects has been offered. In this article such an approach, rooted in confirmatory factor analysis theory and employing matrix algebra, is developed. Suppression is viewed as the result of criterion-irrelevant variance operating among predictors. Decomposition of predictor variables into criterion-relevant and criterion-irrelevant components using structural equation modeling permits derivation of regression weights with the effects of criterion-irrelevant variance omitted. Three examples with data from applied research are used to illustrate the approach: the first assesses child and parent characteristics to explain why some parents of children with obsessive-compulsive disorder accommodate their child's compulsions more so than do others, the second examines various dimensions of personal health to explain individual differences in global quality of life among patients following heart surgery, and the third deals with quantifying the relative importance of various aptitudes for explaining academic performance in a sample of nursing students. The approach is offered as an analytic tool for investigators interested in understanding predictor-criterion relationships when complex patterns of intercorrelation among predictors are present and is shown to augment dominance analysis.
Active commuting patterns at a large, midwestern college campus.
Bopp, Melissa; Kaczynski, Andrew; Wittman, Pamela
2011-01-01
To understand patterns and influences on active commuting (AC) behavior. Students and faculty/staff at a university campus. In April-May 2008, respondents answered an online survey about mode of travel to campus and influences on commuting decisions. Hierarchical regression analyses predicted variance in walking and biking using sets of demographic, psychological, and environmental variables. Of 898 respondents, 55.7% were female, 457 were students (50.4%). Students reported more AC than faculty/staff. For students, the models explained 36.2% and 29.1% of the variance in walking and biking, respectively. Among faculty/staff, the models explained 45% and 25.8% of the variance in walking and biking. For all models, the psychological set explained the greatest amount of variance. With current economic and ecological concerns, AC should be considered a behavior to target for campus health promotion.
DOT National Transportation Integrated Search
1999-11-01
Using a fairly large cross-section/time-series data base, covering all provinces of Norway and all months between January 1973 and December 1994, we estimate non-linear (Box-Cox) regression equations explaining aggregate car ownership, road use, seat...
Notes on power of normality tests of error terms in regression models
DOE Office of Scientific and Technical Information (OSTI.GOV)
Střelec, Luboš
2015-03-10
Normality is one of the basic assumptions in applying statistical procedures. For example in linear regression most of the inferential procedures are based on the assumption of normality, i.e. the disturbance vector is assumed to be normally distributed. Failure to assess non-normality of the error terms may lead to incorrect results of usual statistical inference techniques such as t-test or F-test. Thus, error terms should be normally distributed in order to allow us to make exact inferences. As a consequence, normally distributed stochastic errors are necessary in order to make a not misleading inferences which explains a necessity and importancemore » of robust tests of normality. Therefore, the aim of this contribution is to discuss normality testing of error terms in regression models. In this contribution, we introduce the general RT class of robust tests for normality, and present and discuss the trade-off between power and robustness of selected classical and robust normality tests of error terms in regression models.« less
Logistic Regression: Concept and Application
ERIC Educational Resources Information Center
Cokluk, Omay
2010-01-01
The main focus of logistic regression analysis is classification of individuals in different groups. The aim of the present study is to explain basic concepts and processes of binary logistic regression analysis intended to determine the combination of independent variables which best explain the membership in certain groups called dichotomous…
A primer for biomedical scientists on how to execute model II linear regression analysis.
Ludbrook, John
2012-04-01
1. There are two very different ways of executing linear regression analysis. One is Model I, when the x-values are fixed by the experimenter. The other is Model II, in which the x-values are free to vary and are subject to error. 2. I have received numerous complaints from biomedical scientists that they have great difficulty in executing Model II linear regression analysis. This may explain the results of a Google Scholar search, which showed that the authors of articles in journals of physiology, pharmacology and biochemistry rarely use Model II regression analysis. 3. I repeat my previous arguments in favour of using least products linear regression analysis for Model II regressions. I review three methods for executing ordinary least products (OLP) and weighted least products (WLP) regression analysis: (i) scientific calculator and/or computer spreadsheet; (ii) specific purpose computer programs; and (iii) general purpose computer programs. 4. Using a scientific calculator and/or computer spreadsheet, it is easy to obtain correct values for OLP slope and intercept, but the corresponding 95% confidence intervals (CI) are inaccurate. 5. Using specific purpose computer programs, the freeware computer program smatr gives the correct OLP regression coefficients and obtains 95% CI by bootstrapping. In addition, smatr can be used to compare the slopes of OLP lines. 6. When using general purpose computer programs, I recommend the commercial programs systat and Statistica for those who regularly undertake linear regression analysis and I give step-by-step instructions in the Supplementary Information as to how to use loss functions. © 2011 The Author. Clinical and Experimental Pharmacology and Physiology. © 2011 Blackwell Publishing Asia Pty Ltd.
A New SEYHAN's Approach in Case of Heterogeneity of Regression Slopes in ANCOVA.
Ankarali, Handan; Cangur, Sengul; Ankarali, Seyit
2018-06-01
In this study, when the assumptions of linearity and homogeneity of regression slopes of conventional ANCOVA are not met, a new approach named as SEYHAN has been suggested to use conventional ANCOVA instead of robust or nonlinear ANCOVA. The proposed SEYHAN's approach involves transformation of continuous covariate into categorical structure when the relationship between covariate and dependent variable is nonlinear and the regression slopes are not homogenous. A simulated data set was used to explain SEYHAN's approach. In this approach, we performed conventional ANCOVA in each subgroup which is constituted according to knot values and analysis of variance with two-factor model after MARS method was used for categorization of covariate. The first model is a simpler model than the second model that includes interaction term. Since the model with interaction effect has more subjects, the power of test also increases and the existing significant difference is revealed better. We can say that linearity and homogeneity of regression slopes are not problem for data analysis by conventional linear ANCOVA model by helping this approach. It can be used fast and efficiently for the presence of one or more covariates.
Madaniyazi, Lina; Guo, Yuming; Chen, Renjie; Kan, Haidong; Tong, Shilu
2016-01-01
Estimating the burden of mortality associated with particulates requires knowledge of exposure-response associations. However, the evidence on exposure-response associations is limited in many cities, especially in developing countries. In this study, we predicted associations of particulates smaller than 10 μm in aerodynamic diameter (PM10) with mortality in 73 Chinese cities. The meta-regression model was used to test and quantify which city-specific characteristics contributed significantly to the heterogeneity of PM10-mortality associations for 16 Chinese cities. Then, those city-specific characteristics with statistically significant regression coefficients were treated as independent variables to build multivariate meta-regression models. The model with the best fitness was used to predict PM10-mortality associations in 73 Chinese cities in 2010. Mean temperature, PM10 concentration and green space per capita could best explain the heterogeneity in PM10-mortality associations. Based on city-specific characteristics, we were able to develop multivariate meta-regression models to predict associations between air pollutants and health outcomes reasonably well. Copyright © 2015 Elsevier Ltd. All rights reserved.
Modelling seasonal variations in presentations at a paediatric emergency department.
Takase, Miyuki; Carlin, John
2012-09-01
Overcrowding is a phenomenon commonly observed at emergency departments (EDs) in many hospitals, and negatively impacts patients, healthcare professionals and organisations. Health care organisations are expected to act proactively to cope with a high patient volume by understanding and predicting the patterns of ED presentations. The aim of this study was, therefore, to identify the patterns of patient flow at a paediatric ED in order to assist the management of EDs. Data for ED presentations were collected from the Royal Children's Hospital in Melbourne, Australia, with the time-frame of July 2003 to June 2008. A linear regression analysis with trigonometric functions was used to identify the pattern of patient flow at the ED. The results showed that a logarithm of the daily average ED presentations was increasing exponentially (as explained by 0.004t + 0.00005t2 with t representing time, p<0.001). The model also indicated that there was a yearly oscillation in the frequency of ED presentations, in which lower frequencies were observed in summer and higher frequencies during winter (as explained by -0.046 sin(2(pi)t/12)-0.083 cos(2(pi)t/12), p<0.001). In addition, the variation of the oscillations was increasing over time (as explained by -0.002t*sin(2(pi)t/12)-0.001t*cos(2(pi)t/12), p<0.05). The identified regression model explained a total of 96% of the variance in the pattern of ED presentations. This model can be used to understand the trend of the current patient flow as well as to predict the future flow at the ED. Such an understanding will assist health care managers to prepare resources and environment more effectively to cope with overcrowding.
The Relationship between Social Capital in Hospitals and Physician Job Satisfaction
Ommen, Oliver; Driller, Elke; Köhler, Thorsten; Kowalski, Christoph; Ernstmann, Nicole; Neumann, Melanie; Steffen, Petra; Pfaff, Holger
2009-01-01
Background Job satisfaction in the hospital is an important predictor for many significant management ratios. Acceptance in professional life or high workload are known as important predictors for job satisfaction. The influence of social capital in hospitals on job satisfaction within the health care system, however, remains to be determined. Thus, this article aimed at analysing the relationship between overall job satisfaction of physicians and social capital in hospitals. Methods The results of this study are based upon questionnaires sent by mail to 454 physicians working in the field of patient care in 4 different German hospitals in 2002. 277 clinicians responded to the poll, for a response rate of 61%. Analysis was performed using three linear regression models with physician overall job satisfaction as the dependent variable and age, gender, professional experience, workload, and social capital as independent variables. Results The first regression model explained nearly 9% of the variance of job satisfaction. Whereas job satisfaction increased slightly with age, gender and professional experience were not identified as significant factors to explain the variance. Setting up a second model with the addition of subjectively-perceived workload to the analysis, the explained variance increased to 18% and job satisfaction decreased significantly with increasing workload. The third model including social capital in hospital explained 36% of the variance with social capital, professional experience and workload as significant factors. Conclusion This analysis demonstrated that the social capital of an organisation, in addition to professional experience and workload, represents a significant predictor of overall job satisfaction of physicians working in the field of patient care. Trust, mutual understanding, shared aims, and ethical values are qualities of social capital that unify members of social networks and communities and enable them to act cooperatively. PMID:19445692
Nowell, Lisa H.; Crawford, Charles G.; Gilliom, Robert J.; Nakagaki, Naomi; Stone, Wesley W.; Thelin, Gail; Wolock, David M.
2009-01-01
Empirical regression models were developed for estimating concentrations of dieldrin, total chlordane, and total DDT in whole fish from U.S. streams. Models were based on pesticide concentrations measured in whole fish at 648 stream sites nationwide (1992-2001) as part of the U.S. Geological Survey's National Water Quality Assessment Program. Explanatory variables included fish lipid content, estimates (or surrogates) representing historical agricultural and urban sources, watershed characteristics, and geographic location. Models were developed using Tobit regression methods appropriate for data with censoring. Typically, the models explain approximately 50 to 70% of the variability in pesticide concentrations measured in whole fish. The models were used to predict pesticide concentrations in whole fish for streams nationwide using the U.S. Environmental Protection Agency's River Reach File 1 and to estimate the probability that whole-fish concentrations exceed benchmarks for protection of fish-eating wildlife. Predicted concentrations were highest for dieldrin in the Corn Belt, Texas, and scattered urban areas; for total chlordane in the Corn Belt, Texas, the Southeast, and urbanized Northeast; and for total DDT in the Southeast, Texas, California, and urban areas nationwide. The probability of exceeding wildlife benchmarks for dieldrin and chlordane was predicted to be low for most U.S. streams. The probability of exceeding wildlife benchmarks for total DDT is higher but varies depending on the fish taxon and on the benchmark used. Because the models in the present study are based on fish data collected during the 1990s and organochlorine pesticide residues in the environment continue to decline decades after their uses were discontinued, these models may overestimate present-day pesticide concentrations in fish. ?? 2009 SETAC.
Ocaña-Peinado, Francisco M; Valderrama, Mariano J; Bouzas, Paula R
2013-05-01
The problem of developing a 2-week-on ahead forecast of atmospheric cypress pollen levels is tackled in this paper by developing a principal component multiple regression model involving several climatic variables. The efficacy of the proposed model is validated by means of an application to real data of Cupressaceae pollen concentration in the city of Granada (southeast of Spain). The model was applied to data from 11 consecutive years (1995-2005), with 2006 being used to validate the forecasts. Based on the work of different authors, factors as temperature, humidity, hours of sun and wind speed were incorporated in the model. This methodology explains approximately 75-80% of the variability in the airborne Cupressaceae pollen concentration.
Validation of a Model of Extramusical Influences on Solo and Small-Ensemble Festival Ratings
ERIC Educational Resources Information Center
Bergee, Martin J.
2006-01-01
This is the fourth in a series of studies whose purpose has been to develop a theoretical model of selected extramusical variables' ability to explain solo and small-ensemble festival ratings. Authors of the second and third of these (Bergee & McWhirter, 2005; Bergee & Westfall, 2005) used logistic regression as the basis for their…
ERIC Educational Resources Information Center
Luna, Andrew L.
2007-01-01
This study used two multiple regression analyses to develop an explanatory model to determine which model might best explain faculty salaries. The central purpose of the study was to determine if using a single market ratio variable was a stronger predictor for faculty salaries than the use of dummy variables representing various disciplines.…
Social modernization and the increase in the divorce rate.
Esser, H
1993-03-01
The author develops a micro-model of marital interactions that is used to analyze factors affecting the divorce rate in modern industrialized societies. The core of the model is the concept of production of marital gain and mutual control of this production. "The increase of divorce rates, then, is explained by a steady decrease of institutional and social embeddedness, which helps to solve this kind of an 'assurance game.' The shape of the individual risk is explained by the typical form of change of the 'production functions' of marriages within the first period of adaptation. The inconsistent results concerning womens' labor market participation in linear regression models are explained as a consequence of the (theoretical and statistical) 'interaction' of decreases in embeddedness and increases in external alternatives for women." Comments are included by Karl-Dieter Opp (pp. 278-82) and Ulrich Witt (pp. 283-5). excerpt
Determinants of customer satisfaction with hospitals: a managerial model.
Andaleeb, S S
1998-01-01
States that rapid changes in the environment have exerted significant pressures on hospitals to incorporate patient satisfaction in their strategic stance and quest for market share and long-term viability. This study proposes and tests a five-factor model that explains considerable variation in customer satisfaction with hospitals. These factors include communication with patients, competence of the staff, their demeanour, quality of the facilities, and perceived costs; they also represent strategic concepts that managers can address in their bid to remain competitive. A probability sample was selected and a multiple regression model used to test the hypotheses. The results indicate that all five variables were significant in the model and explained 62 per cent of the variation in the dependent variable. Managerial implications of the proposed model are discussed.
Predictors of physical activity in persons with mental illness: Testing a social cognitive model.
Zechner, Michelle R; Gill, Kenneth J
2016-12-01
This study examined whether the social cognitive theory (SCT) model can be used to explain the variance in physical exercise among persons with serious mental illnesses. A cross-sectional, correlational design was employed. Participants from community mental health centers and supported housing programs (N = 120) completed 9 measures on exercise, social support, self-efficacy, outcome expectations, barriers, and goal-setting. Hierarchical regression tested the relationship between self-report physical activity and SCT determinants while controlling for personal characteristics. The model explained 25% of the variance in exercise. Personal characteristics explained 18% of the variance in physical activity, SCT variables of social support, self-efficacy, outcome expectations, barriers, and goals were entered simultaneously, and they added an r2 change value of .07. Gender (β = -.316, p = .001) and Brief Symptom Inventory Depression subscale (β = -2.08, p < .040) contributed significantly to the prediction of exercise. In a separate stepwise multiple regression, we entered only SCT variables as potential predictors of exercise. Goal-setting was the single significant predictor, F(1, 118) = 13.59, p < .01), r2 = .10. SCT shows promise as an explanatory model of exercise in persons with mental illnesses. Goal-setting practices, self-efficacy, outcome expectations and social support from friends for exercise should be encouraged by psychiatric rehabilitation practitioners. People with more depressive symptoms and women exercise less. More work is needed on theoretical exploration of predictors of exercise. (PsycINFO Database Record (c) 2016 APA, all rights reserved).
Modeling vertebrate diversity in Oregon using satellite imagery
NASA Astrophysics Data System (ADS)
Cablk, Mary Elizabeth
Vertebrate diversity was modeled for the state of Oregon using a parametric approach to regression tree analysis. This exploratory data analysis effectively modeled the non-linear relationships between vertebrate richness and phenology, terrain, and climate. Phenology was derived from time-series NOAA-AVHRR satellite imagery for the year 1992 using two methods: principal component analysis and derivation of EROS data center greenness metrics. These two measures of spatial and temporal vegetation condition incorporated the critical temporal element in this analysis. The first three principal components were shown to contain spatial and temporal information about the landscape and discriminated phenologically distinct regions in Oregon. Principal components 2 and 3, 6 greenness metrics, elevation, slope, aspect, annual precipitation, and annual seasonal temperature difference were investigated as correlates to amphibians, birds, all vertebrates, reptiles, and mammals. Variation explained for each regression tree by taxa were: amphibians (91%), birds (67%), all vertebrates (66%), reptiles (57%), and mammals (55%). Spatial statistics were used to quantify the pattern of each taxa and assess validity of resulting predictions from regression tree models. Regression tree analysis was relatively robust against spatial autocorrelation in the response data and graphical results indicated models were well fit to the data.
Fischer, Thomas; Fischer, Susanne; Himmel, Wolfgang; Kochen, Michael M; Hummers-Pradier, Eva
2008-01-01
The influence of patient characteristics on family practitioners' (FPs') diagnostic decision making has mainly been investigated using indirect methods such as vignettes or questionnaires. Direct observation-borrowed from social and cultural anthropology-may be an alternative method for describing FPs' real-life behavior and may help in gaining insight into how FPs diagnose respiratory tract infections, which are frequent in primary care. To clarify FPs' diagnostic processes when treating patients suffering from symptoms of respiratory tract infection. This direct observation study was performed in 30 family practices using a checklist for patient complaints, history taking, physical examination, and diagnoses. The influence of patients' symptoms and complaints on the FPs' physical examination and diagnosis was calculated by logistic regression analyses. Dummy variables based on combinations of symptoms and complaints were constructed and tested against saturated (full) and backward regression models. In total, 273 patients (median age 37 years, 51% women) were included. The median number of symptoms described was 4 per patient, and most information was provided at the patients' own initiative. Multiple logistic regression analysis showed a strong association between patients' complaints and the physical examination. Frequent diagnoses were upper respiratory tract infection (URTI)/common cold (43%), bronchitis (26%), sinusitis (12%), and tonsillitis (11%). There were no significant statistical differences between "simple heuristic'' models and saturated regression models in the diagnoses of bronchitis, sinusitis, and tonsillitis, indicating that simple heuristics are probably used by the FPs, whereas "URTI/common cold'' was better explained by the full model. FPs tended to make their diagnosis based on a few patient symptoms and a limited physical examination. Simple heuristic models were almost as powerful in explaining most diagnoses as saturated models. Direct observation allowed for the study of decision making under real conditions, yielding both quantitative data and "qualitative'' information about the FPs' performance. It is important for investigators to be aware of the specific disadvantages of the method (e.g., a possible observer effect).
NASA Astrophysics Data System (ADS)
O'Connor, J. E.; Wise, D. R.; Mangano, J.; Jones, K.
2015-12-01
Empirical analyses of suspended sediment and bedload transport gives estimates of sediment flux for western Oregon and northwestern California. The estimates of both bedload and suspended load are from regression models relating measured annual sediment yield to geologic, physiographic, and climatic properties of contributing basins. The best models include generalized geology and either slope or precipitation. The best-fit suspended-sediment model is based on basin geology, precipitation, and area of recent wildfire. It explains 65% of the variance for 68 suspended sediment measurement sites within the model area. Predicted suspended sediment yields range from no yield from the High Cascades geologic province to 200 tonnes/ km2-yr in the northern Oregon Coast Range and 1000 tonnes/km2-yr in recently burned areas of the northern Klamath terrain. Bed-material yield is similarly estimated from a regression model based on 22 sites of measured bed-material transport, mostly from reservoir accumulation analyses but also from several bedload measurement programs. The resulting best-fit regression is based on basin slope and the presence/absence of the Klamath geologic terrane. For the Klamath terrane, bed-material yield is twice that of the other geologic provinces. This model explains more than 80% of the variance of the better-quality measurements. Predicted bed-material yields range up to 350 tonnes/ km2-yr in steep areas of the Klamath terrane. Applying these regressions to small individual watersheds (mean size; 66 km2 for bed-material; 3 km2 for suspended sediment) and cumulating totals down the hydrologic network (but also decreasing the bed-material flux by experimentally determined attrition rates) gives spatially explicit estimates of both bed-material and suspended sediment flux. This enables assessment of several management issues, including the effects of dams on bedload transport, instream gravel mining, habitat formation processes, and water-quality. The combined fluxes can also be compared to long-term rock uplift and cosmogenically determined landscape erosion rates.
Bonetti, Debbie; Johnston, Marie; Clarkson, Jan E; Grimshaw, Jeremy; Pitts, Nigel B; Eccles, Martin; Steen, Nick; Thomas, Ruth; Maclennan, Graeme; Glidewell, Liz; Walker, Anne
2010-04-08
Psychological models are used to understand and predict behaviour in a wide range of settings, but have not been consistently applied to health professional behaviours, and the contribution of differing theories is not clear. This study explored the usefulness of a range of models to predict an evidence-based behaviour -- the placing of fissure sealants. Measures were collected by postal questionnaire from a random sample of general dental practitioners (GDPs) in Scotland. Outcomes were behavioural simulation (scenario decision-making), and behavioural intention. Predictor variables were from the Theory of Planned Behaviour (TPB), Social Cognitive Theory (SCT), Common Sense Self-regulation Model (CS-SRM), Operant Learning Theory (OLT), Implementation Intention (II), Stage Model, and knowledge (a non-theoretical construct). Multiple regression analysis was used to examine the predictive value of each theoretical model individually. Significant constructs from all theories were then entered into a 'cross theory' stepwise regression analysis to investigate their combined predictive value. Behavioural simulation - theory level variance explained was: TPB 31%; SCT 29%; II 7%; OLT 30%. Neither CS-SRM nor stage explained significant variance. In the cross theory analysis, habit (OLT), timeline acute (CS-SRM), and outcome expectancy (SCT) entered the equation, together explaining 38% of the variance. Behavioural intention - theory level variance explained was: TPB 30%; SCT 24%; OLT 58%, CS-SRM 27%. GDPs in the action stage had significantly higher intention to place fissure sealants. In the cross theory analysis, habit (OLT) and attitude (TPB) entered the equation, together explaining 68% of the variance in intention. The study provides evidence that psychological models can be useful in understanding and predicting clinical behaviour. Taking a theory-based approach enables the creation of a replicable methodology for identifying factors that may predict clinical behaviour and so provide possible targets for knowledge translation interventions. Results suggest that more evidence-based behaviour may be achieved by influencing beliefs about the positive outcomes of placing fissure sealants and building a habit of placing them as part of patient management. However a number of conceptual and methodological challenges remain.
Mapping soil textural fractions across a large watershed in north-east Florida.
Lamsal, S; Mishra, U
2010-08-01
Assessment of regional scale soil spatial variation and mapping their distribution is constrained by sparse data which are collected using field surveys that are labor intensive and cost prohibitive. We explored geostatistical (ordinary kriging-OK), regression (Regression Tree-RT), and hybrid methods (RT plus residual Sequential Gaussian Simulation-SGS) to map soil textural fractions across the Santa Fe River Watershed (3585 km(2)) in north-east Florida. Soil samples collected from four depths (L1: 0-30 cm, L2: 30-60 cm, L3: 60-120 cm, and L4: 120-180 cm) at 141 locations were analyzed for soil textural fractions (sand, silt and clay contents), and combined with textural data (15 profiles) assembled under the Florida Soil Characterization program. Textural fractions in L1 and L2 were autocorrelated, and spatially mapped across the watershed. OK performance was poor, which may be attributed to the sparse sampling. RT model structure varied among textural fractions, and the model explained variations ranged from 25% for L1 silt to 61% for L2 clay content. Regression residuals were simulated using SGS, and the average of simulated residuals were used to approximate regression residual distribution map, which were added to regression trend maps. Independent validation of the prediction maps showed that regression models performed slightly better than OK, and regression combined with average of simulated regression residuals improved predictions beyond the regression model. Sand content >90% in both 0-30 and 30-60 cm covered 80.6% of the watershed area. Copyright 2010 Elsevier Ltd. All rights reserved.
Gender differences in body consciousness and substance use among high-risk adolescents.
Black, David Scott; Sussman, Steve; Unger, Jennifer; Pokhrel, Pallav; Sun, Ping
2010-08-01
This study explores the association between private and public body consciousness and past 30-day cigarette, alcohol, marijuana, and hard drug use among adolescents. Self-reported data from alterative high school students in California were analyzed (N = 976) using multilevel regression models to account for student clustering within schools. Separate regression analyses were conducted for males and females. Both cross-sectional baseline data and one-year longitudinal prediction models indicated that body consciousness is associated with specific drug use categories differentially by gender. Findings suggest that body consciousness accounts for additional variance in substance use etiology not explained by previously recognized dispositional variables.
Tøndel, Kristin; Indahl, Ulf G; Gjuvsland, Arne B; Vik, Jon Olav; Hunter, Peter; Omholt, Stig W; Martens, Harald
2011-06-01
Deterministic dynamic models of complex biological systems contain a large number of parameters and state variables, related through nonlinear differential equations with various types of feedback. A metamodel of such a dynamic model is a statistical approximation model that maps variation in parameters and initial conditions (inputs) to variation in features of the trajectories of the state variables (outputs) throughout the entire biologically relevant input space. A sufficiently accurate mapping can be exploited both instrumentally and epistemically. Multivariate regression methodology is a commonly used approach for emulating dynamic models. However, when the input-output relations are highly nonlinear or non-monotone, a standard linear regression approach is prone to give suboptimal results. We therefore hypothesised that a more accurate mapping can be obtained by locally linear or locally polynomial regression. We present here a new method for local regression modelling, Hierarchical Cluster-based PLS regression (HC-PLSR), where fuzzy C-means clustering is used to separate the data set into parts according to the structure of the response surface. We compare the metamodelling performance of HC-PLSR with polynomial partial least squares regression (PLSR) and ordinary least squares (OLS) regression on various systems: six different gene regulatory network models with various types of feedback, a deterministic mathematical model of the mammalian circadian clock and a model of the mouse ventricular myocyte function. Our results indicate that multivariate regression is well suited for emulating dynamic models in systems biology. The hierarchical approach turned out to be superior to both polynomial PLSR and OLS regression in all three test cases. The advantage, in terms of explained variance and prediction accuracy, was largest in systems with highly nonlinear functional relationships and in systems with positive feedback loops. HC-PLSR is a promising approach for metamodelling in systems biology, especially for highly nonlinear or non-monotone parameter to phenotype maps. The algorithm can be flexibly adjusted to suit the complexity of the dynamic model behaviour, inviting automation in the metamodelling of complex systems.
2011-01-01
Background Deterministic dynamic models of complex biological systems contain a large number of parameters and state variables, related through nonlinear differential equations with various types of feedback. A metamodel of such a dynamic model is a statistical approximation model that maps variation in parameters and initial conditions (inputs) to variation in features of the trajectories of the state variables (outputs) throughout the entire biologically relevant input space. A sufficiently accurate mapping can be exploited both instrumentally and epistemically. Multivariate regression methodology is a commonly used approach for emulating dynamic models. However, when the input-output relations are highly nonlinear or non-monotone, a standard linear regression approach is prone to give suboptimal results. We therefore hypothesised that a more accurate mapping can be obtained by locally linear or locally polynomial regression. We present here a new method for local regression modelling, Hierarchical Cluster-based PLS regression (HC-PLSR), where fuzzy C-means clustering is used to separate the data set into parts according to the structure of the response surface. We compare the metamodelling performance of HC-PLSR with polynomial partial least squares regression (PLSR) and ordinary least squares (OLS) regression on various systems: six different gene regulatory network models with various types of feedback, a deterministic mathematical model of the mammalian circadian clock and a model of the mouse ventricular myocyte function. Results Our results indicate that multivariate regression is well suited for emulating dynamic models in systems biology. The hierarchical approach turned out to be superior to both polynomial PLSR and OLS regression in all three test cases. The advantage, in terms of explained variance and prediction accuracy, was largest in systems with highly nonlinear functional relationships and in systems with positive feedback loops. Conclusions HC-PLSR is a promising approach for metamodelling in systems biology, especially for highly nonlinear or non-monotone parameter to phenotype maps. The algorithm can be flexibly adjusted to suit the complexity of the dynamic model behaviour, inviting automation in the metamodelling of complex systems. PMID:21627852
Keogh, Brad; Culliford, David; Guerrero-Ludueña, Richard; Monks, Thomas
2018-05-24
To quantify the effect of intrahospital patient flow on emergency department (ED) performance targets and indicate if the expectations set by the National Health Service (NHS) England 5-year forward review are realistic in returning emergency services to previous performance levels. Linear regression analysis of routinely reported trust activity and performance data using a series of cross-sectional studies. NHS trusts in England submitting routine nationally reported measures to NHS England. 142 acute non-specialist trusts operating in England between 2012 and 2016. The primary outcome measures were proportion of 4-hour waiting time breaches and cancelled elective operations. Univariate and multivariate linear regression models were used to show relationships between the outcome measures and various measures of trust activity including empty day beds, empty night beds, day bed to night bed ratio, ED conversion ratio and delayed transfers of care. Univariate regression results using the outcome of 4-hour breaches showed clear relationships with empty night beds and ED conversion ratio between 2012 and 2016. The day bed to night bed ratio showed an increasing ability to explain variation in performance between 2015 and 2016. Delayed transfers of care showed little evidence of an association. Multivariate model results indicated that the ability of patient flow variables to explain 4-hour target performance had reduced between 2012 and 2016 (19% to 12%), and had increased in explaining cancelled elective operations (7% to 17%). The flow of patients through trusts is shown to influence ED performance; however, performance has become less explainable by intratrust patient flow between 2012 and 2016. Some commonly stated explanatory factors such as delayed transfers of care showed limited evidence of being related. The results indicate some of the measures proposed by NHS England to reduce pressure on EDs may not have the desired impact on returning services to previous performance levels. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2018. All rights reserved. No commercial use is permitted unless otherwise expressly granted.
Effects of Microstructural Parameters on Creep of Nickel-Base Superalloy Single Crystals
NASA Technical Reports Server (NTRS)
MacKay, Rebecca A.; Gabb, Timothy P.; Nathal, Michael V.
2013-01-01
Microstructure-sensitive creep models have been developed for Ni-base superalloy single crystals. Creep rupture testing was conducted on fourteen single crystal alloys at two applied stress levels at each of two temperatures, 982 and 1093 C. The variation in creep lives among the different alloys could be explained with regression models containing relatively few microstructural parameters. At 982 C, gamma-gamma prime lattice mismatch, gamma prime volume fraction, and initial gamma prime size were statistically significant in explaining the creep rupture lives. At 1093 C, only lattice mismatch and gamma prime volume fraction were significant. These models could explain from 84 to 94 percent of the variation in creep lives, depending on test condition. Longer creep lives were associated with alloys having more negative lattice mismatch, lower gamma prime volume fractions, and finer gamma prime sizes. The gamma-gamma prime lattice mismatch exhibited the strongest influence of all the microstructural parameters at both temperatures. Although a majority of the alloys in this study were stable with respect to topologically close packed (TCP) phases, it appeared that up to approximately 2 vol% TCP phase did not affect the 1093 C creep lives under applied stresses that produced lives of approximately 200 to 300 h. In contrast, TCP phase contents of approximately 2 vol% were detrimental at lower applied stresses where creep lives were longer. A regression model was also developed for the as-heat treated initial gamma prime size; this model showed that gamma prime solvus temperature, gamma-gamma prime lattice mismatch, and bulk Re content were all statistically significant.
Warren, Nicholas; Dussetschleger, Jeffrey; Punnett, Laura; Cherniack, Martin G
2015-03-01
In this study, we sought to explain the rapid musculoskeletal symptomatology increase in correction officers (COs). COs are exposed to levels of biomechanical and psychosocial stressors that have strong associations with musculoskeletal disorders (MSDs) in other occupations, possibly contributing to their rapid health deterioration. Baseline survey data from a longitudinal study of COs and manufacturing line workers were used to model musculoskeletal symptom prevalence and intensity in the upper (UE) and lower (LE) extremity. Outcomes were regressed on demographics and biomechanical and psychosocial exposures. COs reported significantly higher prevalence and intensity of LE symptoms compared to the industrial workers. In regression models, job tenure was a primary driver of CO musculoskeletal outcomes. In CO models, a single biomechanical exposure, head and arms in awkward positions, explained variance in both UE and LE prevalence (β of 0.338 and 0.357, respectively), and low decision latitude was associated with increased LE prevalence and intensity (β of 0.229 and 0.233, respectively). Manufacturing models were less explanatory. Examining demographic associations with exposure intensity, we found none to be significant in manufacturing, but in CO models, important psychosocial exposure levels increased with job tenure. Symptom prevalence and intensity increased more rapidly with job tenure in corrections, compared to manufacturing, and were related to both biomechanical and psychosocial exposures. Tenure-related increases in psychosocial exposure levels may help explain the CO symptom increase. Although exposure assessment improvements are proposed, findings suggest focusing on improving the psychosocial work environment to reduce MSD prevalence and intensity in corrections. © 2014, Human Factors and Ergonomics Society.
Estimated long-term outdoor air pollution concentrations in a cohort study
NASA Astrophysics Data System (ADS)
Beelen, Rob; Hoek, Gerard; Fischer, Paul; Brandt, Piet A. van den; Brunekreef, Bert
Several recent studies associated long-term exposure to air pollution with increased mortality. An ongoing cohort study, the Netherlands Cohort Study on Diet and Cancer (NLCS), was used to study the association between long-term exposure to traffic-related air pollution and mortality. Following on a previous exposure assessment study in the NLCS, we improved the exposure assessment methods. Long-term exposure to nitrogen dioxide (NO 2), nitrogen oxide (NO), black smoke (BS), and sulphur dioxide (SO 2) was estimated. Exposure at each home address ( N=21 868) was considered as a function of a regional, an urban and a local component. The regional component was estimated using inverse distance weighed interpolation of measurement data from regional background sites in a national monitoring network. Regression models with urban concentrations as dependent variables, and number of inhabitants in different buffers and land use variables, derived with a Geographic Information System (GIS), as predictor variables were used to estimate the urban component. The local component was assessed using a GIS and a digital road network with linked traffic intensities. Traffic intensity on the nearest road and on the nearest major road, and the sum of traffic intensity in a buffer of 100 m around each home address were assessed. Further, a quantitative estimate of the local component was estimated. The regression models to estimate the urban component explained 67%, 46%, 49% and 35% of the variances of NO 2, NO, BS, and SO 2 concentrations, respectively. Overall regression models which incorporated the regional, urban and local component explained 84%, 44%, 59% and 56% of the variability in concentrations for NO 2, NO, BS and SO 2, respectively. We were able to develop an exposure assessment model using GIS methods and traffic intensities that explained a large part of the variations in outdoor air pollution concentrations.
Nonlinear Constitutive Modeling of Piezoelectric Ceramics
NASA Astrophysics Data System (ADS)
Xu, Jia; Li, Chao; Wang, Haibo; Zhu, Zhiwen
2017-12-01
Nonlinear constitutive modeling of piezoelectric ceramics is discussed in this paper. Van der Pol item is introduced to explain the simple hysteretic curve. Improved nonlinear difference items are used to interpret the hysteresis phenomena of piezoelectric ceramics. The fitting effect of the model on experimental data is proved by the partial least-square regression method. The results show that this method can describe the real curve well. The results of this paper are helpful to piezoelectric ceramics constitutive modeling.
Reflectance of vegetation, soil, and water
NASA Technical Reports Server (NTRS)
Wiegand, C. L. (Principal Investigator)
1974-01-01
The author has identified the following significant results. The Kubelka-Munk model, a regression model, and a combination of these models were used to extract plant, soil, and shadow reflectance components of vegetated surfaces. The combination model was superior to the others; it explained 86% of the variation in band 5 reflectance of corn and sorghum, and 90% of the variation in band 6 reflectance of cotton. A fractional shadow term substantially increased the proportion of the digital count sum of squares explained when plant parameters alone explained 85% or less of the variation. Overall recognition of 94 agricultural fields using simultaneously acquired aircraft and spacecraft MSS data was 61.8 and 62.8%, respectively; recognition of vegetable fields larger than 10 acres and taller than 25 cm, rose to 88.9 and 100% for aircraft and spacecraft, respectively. Agriculture and rangeland, were well discriminated for the entire county but level 2 categories of vegetables, citrus, and idle cropland, except for citrus, were not.
Predicting spatio-temporal failure in large scale observational and micro scale experimental systems
NASA Astrophysics Data System (ADS)
de las Heras, Alejandro; Hu, Yong
2006-10-01
Forecasting has become an essential part of modern thought, but the practical limitations still are manifold. We addressed future rates of change by comparing models that take into account time, and models that focus more on space. Cox regression confirmed that linear change can be safely assumed in the short-term. Spatially explicit Poisson regression, provided a ceiling value for the number of deforestation spots. With several observed and estimated rates, it was decided to forecast using the more robust assumptions. A Markov-chain cellular automaton thus projected 5-year deforestation in the Amazonian Arc of Deforestation, showing that even a stable rate of change would largely deplete the forest area. More generally, resolution and implementation of the existing models could explain many of the modelling difficulties still affecting forecasting.
Elder abuse and socioeconomic inequalities: a multilevel study in 7 European countries.
Fraga, Sílvia; Lindert, Jutta; Barros, Henrique; Torres-González, Francisco; Ioannidi-Kapolou, Elisabeth; Melchiorre, Maria Gabriella; Stankunas, Mindaugas; Soares, Joaquim F
2014-04-01
To compare the prevalence of elder abuse using a multilevel approach that takes into account the characteristics of participants as well as socioeconomic indicators at city and country level. In 2009, the project on abuse of elderly in Europe (ABUEL) was conducted in seven cities (Stuttgart, Germany; Ancona, Italy; Kaunas, Lithuania, Stockholm, Sweden; Porto, Portugal; Granada, Spain; Athens, Greece) comprising 4467 individuals aged 60-84 years. We used a 3-level hierarchical structure of data: 1) characteristics of participants; 2) mean of tertiary education of each city; and 3) country inequality indicator (Gini coefficient). Multilevel logistic regression was used and proportional changes in Intraclass Correlation Coefficient (ICC) were inspected to assert explained variance between models. The prevalence of elder abuse showed large variations across sites. Adding tertiary education to the regression model reduced the country level variance for psychological abuse (ICC=3.4%), with no significant decrease in the explained variance for the other types of abuse. When the Gini coefficient was considered, the highest drop in ICC was observed for financial abuse (from 9.5% to 4.3%). There is a societal and community level dimension that adds information to individual variability in explaining country differences in elder abuse, highlighting underlying socioeconomic inequalities leading to such behavior. Copyright © 2014 Elsevier Inc. All rights reserved.
Pessimism, Trauma, Risky Sex: Covariates of Depression in College Students
ERIC Educational Resources Information Center
Swanholm, Eric; Vosvick, Mark; Chng, Chwee-Lye
2009-01-01
Objective: To explain variance in depression in students (N = 648) using a model incorporating sexual trauma, pessimism, and risky sex. Method: Survey data collected from undergraduate students receiving credit for participation. Results: Controlling for demographics, a hierarchical linear regression analysis [Adjusted R[superscript 2] = 0.34,…
Social Inequality and Labor Force Participation.
ERIC Educational Resources Information Center
King, Jonathan
The labor force participation rates of whites, blacks, and Spanish-Americans, grouped by sex, are explained in a linear regression model fitted with 1970 U. S. Census data on Standard Metropolitan Statistical Area (SMSA). The explanatory variables are: average age, average years of education, vocational training rate, disabled rate, unemployment…
We examined algal metrics as indicators of altered watershed land cover and nutrients to inform their potential use in monitoring programs. Multiple regression models, in which impervious cover explained the most variation, indicated concentrations <0.202 mg/l NO3 and <0.015 mg/l...
The advent of new higher throughput analytical instrumentation has put a strain on interpreting and explaining the results from complex studies. Contemporary human, environmental, and biomonitoring data sets are comprised of tens or hundreds of analytes, multiple repeat measures...
We compared soil chemistry and plant community data at non-agronomic mesic locations that either did or did not contain genetically modified (GM) Agrostis stolonifera. The best two-variable logistic regression model included soil Mn content and A. stolonifera cover and explained...
Measurement error in epidemiologic studies of air pollution based on land-use regression models.
Basagaña, Xavier; Aguilera, Inmaculada; Rivera, Marcela; Agis, David; Foraster, Maria; Marrugat, Jaume; Elosua, Roberto; Künzli, Nino
2013-10-15
Land-use regression (LUR) models are increasingly used to estimate air pollution exposure in epidemiologic studies. These models use air pollution measurements taken at a small set of locations and modeling based on geographical covariates for which data are available at all study participant locations. The process of LUR model development commonly includes a variable selection procedure. When LUR model predictions are used as explanatory variables in a model for a health outcome, measurement error can lead to bias of the regression coefficients and to inflation of their variance. In previous studies dealing with spatial predictions of air pollution, bias was shown to be small while most of the effect of measurement error was on the variance. In this study, we show that in realistic cases where LUR models are applied to health data, bias in health-effect estimates can be substantial. This bias depends on the number of air pollution measurement sites, the number of available predictors for model selection, and the amount of explainable variability in the true exposure. These results should be taken into account when interpreting health effects from studies that used LUR models.
NASA Astrophysics Data System (ADS)
Cattani, Giorgio; Gaeta, Alessandra; di Menno di Bucchianico, Alessandro; de Santis, Antonella; Gaddi, Raffaela; Cusano, Mariacarmela; Ancona, Carla; Badaloni, Chiara; Forastiere, Francesco; Gariazzo, Claudio; Sozzi, Roberto; Inglessis, Marco; Silibello, Camillo; Salvatori, Elisabetta; Manes, Fausto; Cesaroni, Giulia; The Viias Study Group
2017-05-01
The health effects of long-term exposure to ultrafine particles (UFPs) are poorly understood. Data on spatial contrasts in ambient ultrafine particles (UFPs) concentrations are needed with fine resolution. This study aimed to assess the spatial variability of total particle number concentrations (PNC, a proxy for UFPs) in the city of Rome, Italy, using land use regression (LUR) models, and the correspondent exposure of population here living. PNC were measured using condensation particle counters at the building facade of 28 homes throughout the city. Three 7-day monitoring periods were carried out during cold, warm and intermediate seasons. Geographic Information System predictor variables, with buffers of varying size, were evaluated to model spatial variations of PNC. A stepwise forward selection procedure was used to develop a "base" linear regression model according to the European Study of Cohorts for Air Pollution Effects project methodology. Other variables were then included in more enhanced models and their capability of improving model performance was evaluated. Four LUR models were developed. Local variation in UFPs in the study area can be largely explained by the ratio of traffic intensity and distance to the nearest major road. The best model (adjusted R2 = 0.71; root mean square error = ±1,572 particles/cm³, leave one out cross validated R2 = 0.68) was achieved by regressing building and street configuration variables against residual from the "base" model, which added 3% more to the total variance explained. Urban green and population density in a 5,000 m buffer around each home were also relevant predictors. The spatial contrast in ambient PNC across the large conurbation of Rome, was successfully assessed. The average exposure of subjects living in the study area was 16,006 particles/cm³ (SD 2165 particles/cm³, range: 11,075-28,632 particles/cm³). A total of 203,886 subjects (16%) lives in Rome within 50 m from a high traffic road and they experience the highest exposure levels (18,229 particles/cm³). The results will be used to estimate the long-term health effects of ultrafine particle exposure of participants in Rome.
Nixon, R M; Bansback, N; Brennan, A
2007-03-15
Mixed treatment comparison (MTC) is a generalization of meta-analysis. Instead of the same treatment for a disease being tested in a number of studies, a number of different interventions are considered. Meta-regression is also a generalization of meta-analysis where an attempt is made to explain the heterogeneity between the treatment effects in the studies by regressing on study-level covariables. Our focus is where there are several different treatments considered in a number of randomized controlled trials in a specific disease, the same treatment can be applied in several arms within a study, and where differences in efficacy can be explained by differences in the study settings. We develop methods for simultaneously comparing several treatments and adjusting for study-level covariables by combining ideas from MTC and meta-regression. We use a case study from rheumatoid arthritis. We identified relevant trials of biologic verses standard therapy or placebo and extracted the doses, comparators and patient baseline characteristics. Efficacy is measured using the log odds ratio of achieving six-month ACR50 responder status. A random-effects meta-regression model is fitted which adjusts the log odds ratio for study-level prognostic factors. A different random-effect distribution on the log odds ratios is allowed for each different treatment. The odds ratio is found as a function of the prognostic factors for each treatment. The apparent differences in the randomized trials between tumour necrosis factor alpha (TNF- alpha) antagonists are explained by differences in prognostic factors and the analysis suggests that these drugs as a class are not different from each other. Copyright (c) 2006 John Wiley & Sons, Ltd.
Global-scale high-resolution ( 1 km) modelling of mean, maximum and minimum annual streamflow
NASA Astrophysics Data System (ADS)
Barbarossa, Valerio; Huijbregts, Mark; Hendriks, Jan; Beusen, Arthur; Clavreul, Julie; King, Henry; Schipper, Aafke
2017-04-01
Quantifying mean, maximum and minimum annual flow (AF) of rivers at ungauged sites is essential for a number of applications, including assessments of global water supply, ecosystem integrity and water footprints. AF metrics can be quantified with spatially explicit process-based models, which might be overly time-consuming and data-intensive for this purpose, or with empirical regression models that predict AF metrics based on climate and catchment characteristics. Yet, so far, regression models have mostly been developed at a regional scale and the extent to which they can be extrapolated to other regions is not known. We developed global-scale regression models that quantify mean, maximum and minimum AF as function of catchment area and catchment-averaged slope, elevation, and mean, maximum and minimum annual precipitation and air temperature. We then used these models to obtain global 30 arc-seconds (˜ 1 km) maps of mean, maximum and minimum AF for each year from 1960 through 2015, based on a newly developed hydrologically conditioned digital elevation model. We calibrated our regression models based on observations of discharge and catchment characteristics from about 4,000 catchments worldwide, ranging from 100 to 106 km2 in size, and validated them against independent measurements as well as the output of a number of process-based global hydrological models (GHMs). The variance explained by our regression models ranged up to 90% and the performance of the models compared well with the performance of existing GHMs. Yet, our AF maps provide a level of spatial detail that cannot yet be achieved by current GHMs.
Price, Weather, and `Acreage Abandonment' in Western Great Plains Wheat Culture.
NASA Astrophysics Data System (ADS)
Michaels, Patrick J.
1983-07-01
Multivariate analyses of acreage abandonment patterns in the U.S. Great Plains winter wheat region indicate that the major mode of variation is an in-phase oscillation confined to the western half of the overall area, which is also the area with lowest average yields. This is one of the more agroclimatically marginal environments in the United States, with wide interannual fluctuations in both climate and profitability.We developed a multiple regression model to determine the relative roles of weather and expected price in the decision not to harvest. The overall model explained 77% of the spatial and temporal variation in abandonment. The 36.5% of the non-spatial variation was explained by two simple transformations of climatic data from three monthly aggregates-September-October, November-February and March-April. Price factors, expressed as indexed future delivery quotations,were barely significant, with only between 3 and 5% of the non-spatial variation explained, depending upon the model.The model was based upon weather, climate and price data from 1932 through 1975. It was tested by sequentially withholding three-year blocks of data, and using the respecified regression coefficients, along with observed weather and price, to estimate abandonment in the withheld years. Error analyses indicate no loss of model fidelity in the test mode. Also, prediction errors in the 1970-75 period, characterized by widely fluctuating prices, were not different from those in the rest of the model.The overall results suggest that the perceived quality of the crop, as influenced by weather, is a much more important determinant of the abandonment decision than are expected returns based upon price considerations.
NASA Astrophysics Data System (ADS)
Ťupek, Boris; Launiainen, Samuli; Peltoniemi, Mikko; Heikkinen, Jukka; Lehtonen, Aleksi
2016-04-01
Litter decomposition rates of the most process based soil carbon models affected by environmental conditions are linked with soil heterotrophic CO2 emissions and serve for estimating soil carbon sequestration; thus due to the mass balance equation the variation in measured litter inputs and measured heterotrophic soil CO2 effluxes should indicate soil carbon stock changes, needed by soil carbon management for mitigation of anthropogenic CO2 emissions, if sensitivity functions of the applied model suit to the environmental conditions e.g. soil temperature and moisture. We evaluated the response forms of autotrophic and heterotrophic forest floor respiration to soil temperature and moisture in four boreal forest sites of the International Cooperative Programme on Assessment and Monitoring of Air Pollution Effects on Forests (ICP Forests) by a soil trenching experiment during year 2015 in southern Finland. As expected both autotrophic and heterotrophic forest floor respiration components were primarily controlled by soil temperature and exponential regression models generally explained more than 90% of the variance. Soil moisture regression models on average explained less than 10% of the variance and the response forms varied between Gaussian for the autotrophic forest floor respiration component and linear for the heterotrophic forest floor respiration component. Although the percentage of explained variance of soil heterotrophic respiration by the soil moisture was small, the observed reduction of CO2 emissions with higher moisture levels suggested that soil moisture response of soil carbon models not accounting for the reduction due to excessive moisture should be re-evaluated in order to estimate right levels of soil carbon stock changes. Our further study will include evaluation of process based soil carbon models by the annual heterotrophic respiration and soil carbon stocks.
NASA Astrophysics Data System (ADS)
Wilson, Barry T.; Knight, Joseph F.; McRoberts, Ronald E.
2018-03-01
Imagery from the Landsat Program has been used frequently as a source of auxiliary data for modeling land cover, as well as a variety of attributes associated with tree cover. With ready access to all scenes in the archive since 2008 due to the USGS Landsat Data Policy, new approaches to deriving such auxiliary data from dense Landsat time series are required. Several methods have previously been developed for use with finer temporal resolution imagery (e.g. AVHRR and MODIS), including image compositing and harmonic regression using Fourier series. The manuscript presents a study, using Minnesota, USA during the years 2009-2013 as the study area and timeframe. The study examined the relative predictive power of land cover models, in particular those related to tree cover, using predictor variables based solely on composite imagery versus those using estimated harmonic regression coefficients. The study used two common non-parametric modeling approaches (i.e. k-nearest neighbors and random forests) for fitting classification and regression models of multiple attributes measured on USFS Forest Inventory and Analysis plots using all available Landsat imagery for the study area and timeframe. The estimated Fourier coefficients developed by harmonic regression of tasseled cap transformation time series data were shown to be correlated with land cover, including tree cover. Regression models using estimated Fourier coefficients as predictor variables showed a two- to threefold increase in explained variance for a small set of continuous response variables, relative to comparable models using monthly image composites. Similarly, the overall accuracies of classification models using the estimated Fourier coefficients were approximately 10-20 percentage points higher than the models using the image composites, with corresponding individual class accuracies between six and 45 percentage points higher.
Prediction of dimethyl disulfide levels from biosolids using statistical modeling.
Gabriel, Steven A; Vilalai, Sirapong; Arispe, Susanna; Kim, Hyunook; McConnell, Laura L; Torrents, Alba; Peot, Christopher; Ramirez, Mark
2005-01-01
Two statistical models were used to predict the concentration of dimethyl disulfide (DMDS) released from biosolids produced by an advanced wastewater treatment plant (WWTP) located in Washington, DC, USA. The plant concentrates sludge from primary sedimentation basins in gravity thickeners (GT) and sludge from secondary sedimentation basins in dissolved air flotation (DAF) thickeners. The thickened sludge is pumped into blending tanks and then fed into centrifuges for dewatering. The dewatered sludge is then conditioned with lime before trucking out from the plant. DMDS, along with other volatile sulfur and nitrogen-containing chemicals, is known to contribute to biosolids odors. These models identified oxidation/reduction potential (ORP) values of a GT and DAF, the amount of sludge dewatered by centrifuges, and the blend ratio between GT thickened sludge and DAF thickened sludge in blending tanks as control variables. The accuracy of the developed regression models was evaluated by checking the adjusted R2 of the regression as well as the signs of coefficients associated with each variable. In general, both models explained observed DMDS levels in sludge headspace samples. The adjusted R2 value of the regression models 1 and 2 were 0.79 and 0.77, respectively. Coefficients for each regression model also had the correct sign. Using the developed models, plant operators can adjust the controllable variables to proactively decrease this odorant. Therefore, these models are a useful tool in biosolids management at WWTPs.
Intermediate and advanced topics in multilevel logistic regression analysis
Merlo, Juan
2017-01-01
Multilevel data occur frequently in health services, population and public health, and epidemiologic research. In such research, binary outcomes are common. Multilevel logistic regression models allow one to account for the clustering of subjects within clusters of higher‐level units when estimating the effect of subject and cluster characteristics on subject outcomes. A search of the PubMed database demonstrated that the use of multilevel or hierarchical regression models is increasing rapidly. However, our impression is that many analysts simply use multilevel regression models to account for the nuisance of within‐cluster homogeneity that is induced by clustering. In this article, we describe a suite of analyses that can complement the fitting of multilevel logistic regression models. These ancillary analyses permit analysts to estimate the marginal or population‐average effect of covariates measured at the subject and cluster level, in contrast to the within‐cluster or cluster‐specific effects arising from the original multilevel logistic regression model. We describe the interval odds ratio and the proportion of opposed odds ratios, which are summary measures of effect for cluster‐level covariates. We describe the variance partition coefficient and the median odds ratio which are measures of components of variance and heterogeneity in outcomes. These measures allow one to quantify the magnitude of the general contextual effect. We describe an R 2 measure that allows analysts to quantify the proportion of variation explained by different multilevel logistic regression models. We illustrate the application and interpretation of these measures by analyzing mortality in patients hospitalized with a diagnosis of acute myocardial infarction. © 2017 The Authors. Statistics in Medicine published by John Wiley & Sons Ltd. PMID:28543517
Intermediate and advanced topics in multilevel logistic regression analysis.
Austin, Peter C; Merlo, Juan
2017-09-10
Multilevel data occur frequently in health services, population and public health, and epidemiologic research. In such research, binary outcomes are common. Multilevel logistic regression models allow one to account for the clustering of subjects within clusters of higher-level units when estimating the effect of subject and cluster characteristics on subject outcomes. A search of the PubMed database demonstrated that the use of multilevel or hierarchical regression models is increasing rapidly. However, our impression is that many analysts simply use multilevel regression models to account for the nuisance of within-cluster homogeneity that is induced by clustering. In this article, we describe a suite of analyses that can complement the fitting of multilevel logistic regression models. These ancillary analyses permit analysts to estimate the marginal or population-average effect of covariates measured at the subject and cluster level, in contrast to the within-cluster or cluster-specific effects arising from the original multilevel logistic regression model. We describe the interval odds ratio and the proportion of opposed odds ratios, which are summary measures of effect for cluster-level covariates. We describe the variance partition coefficient and the median odds ratio which are measures of components of variance and heterogeneity in outcomes. These measures allow one to quantify the magnitude of the general contextual effect. We describe an R 2 measure that allows analysts to quantify the proportion of variation explained by different multilevel logistic regression models. We illustrate the application and interpretation of these measures by analyzing mortality in patients hospitalized with a diagnosis of acute myocardial infarction. © 2017 The Authors. Statistics in Medicine published by John Wiley & Sons Ltd. © 2017 The Authors. Statistics in Medicine published by John Wiley & Sons Ltd.
Predicting having condoms available among adolescents: the role of personal norm and enjoyment.
Jellema, Ilke J; Abraham, Charles; Schaalma, Herman P; Gebhardt, Winifred A; van Empelen, Pepijn
2013-05-01
Having condoms available has been shown to be an important predictor of condom use. We examined whether or not personal norm and goal enjoyment contribute to predicting having condoms available in the context of cognition specified by the theory of planned behaviour (TPB). Prospective survey study, with a baseline and follow-up measurement (at 3 months). Data were gathered using an online survey. In total 282 adolescents (mean age = 15.6, 74% female adolescents) completed both questionnaires. At baseline, demographics, sexual experience, condom use, TPB variables, descriptive norm, personal norm, and enjoyment towards having condoms available were measured. At T2 (3 months later) having condoms available was measured. Direct and moderating effects of personal norm and goal enjoyment were examined by means of hierarchical linear regression analyses. Regression analyses yielded a direct effect of self-efficacy and personal norm on condom availability. In addition, moderation of the intention-behaviour relation by goal enjoyment added to the variance explained. The final model explained approximately 35% of the variance in condom availability. Personal norm and goal enjoyment add to the predictive utility of a TPB model of having condoms available and may be useful intervention targets. What is already known about this subject? Having condoms available is an important prerequisite for actual condom use. The theory of planned behaviour has successfully been applied to explain condom availability behaviour. The theory of planned behaviour has been criticized for not adequately taking into account affective motivation. What does this study add? Personal norm and goal enjoyment add to the predictive utility of the model. Personal norm explains condom availability directly, enjoyment increases intention enactment. Personal norm and goal enjoyment therefore are useful intervention targets. © 2012 The British Psychological Society.
Mapping urban environmental noise: a land use regression method.
Xie, Dan; Liu, Yi; Chen, Jining
2011-09-01
Forecasting and preventing urban noise pollution are major challenges in urban environmental management. Most existing efforts, including experiment-based models, statistical models, and noise mapping, however, have limited capacity to explain the association between urban growth and corresponding noise change. Therefore, these conventional methods can hardly forecast urban noise at a given outlook of development layout. This paper, for the first time, introduces a land use regression method, which has been applied for simulating urban air quality for a decade, to construct an urban noise model (LUNOS) in Dalian Municipality, Northwest China. The LUNOS model describes noise as a dependent variable of surrounding various land areas via a regressive function. The results suggest that a linear model performs better in fitting monitoring data, and there is no significant difference of the LUNOS's outputs when applied to different spatial scales. As the LUNOS facilitates a better understanding of the association between land use and urban environmental noise in comparison to conventional methods, it can be regarded as a promising tool for noise prediction for planning purposes and aid smart decision-making.
Outsourcing primary health care services--how politicians explain the grounds for their decisions.
Laamanen, Ritva; Simonsen-Rehn, Nina; Suominen, Sakari; Øvretveit, John; Brommels, Mats
2008-12-01
To explore outsourcing of primary health care (PHC) services in four municipalities in Finland with varying amounts and types of outsourcing: a Southern municipality (SM) which contracted all PHC services to a not-for-profit voluntary organization, and Eastern (EM), South-Western (SWM) and Western (WM) municipalities which had contracted out only a few services to profit or public organizations. A mail survey to all municipality politicians (response rate 52%, N=101) in 2004. Data were analyzed using cross-tabulations, Spearman correlation and linear regression analyses. Politicians were willing to outsource PHC services only partially, and many problems relating to outsourcing were reported. Politicians in all municipalities were least likely to outsource preventive services. A multiple linear regression model showed that reported preference to outsource in EM and in SWM was lower than in SM, and also lower among politicians from "leftist" political parties than "rightist" political parties. Perceived difficulties in local health policy issues were related to reduced preference to outsource. The model explained 27% of the variance of the inclination to outsource PHC services. The findings highlight how important it is to take into account local health policy issues when assessing service-provision models.
Odegård, J; Klemetsdal, G; Heringstad, B
2005-04-01
Several selection criteria for reducing incidence of mastitis were developed from a random regression sire model for test-day somatic cell score (SCS). For comparison, sire transmitting abilities were also predicted based on a cross-sectional model for lactation mean SCS. Only first-crop daughters were used in genetic evaluation of SCS, and the different selection criteria were compared based on their correlation with incidence of clinical mastitis in second-crop daughters (measured as mean daughter deviations). Selection criteria were predicted based on both complete and reduced first-crop daughter groups (261 or 65 daughters per sire, respectively). For complete daughter groups, predicted transmitting abilities at around 30 d in milk showed the best predictive ability for incidence of clinical mastitis, closely followed by average predicted transmitting abilities over the entire lactation. Both of these criteria were derived from the random regression model. These selection criteria improved accuracy of selection by approximately 2% relative to a cross-sectional model. However, for reduced daughter groups, the cross-sectional model yielded increased predictive ability compared with the selection criteria based on the random regression model. This result may be explained by the cross-sectional model being more robust, i.e., less sensitive to precision of (co)variance components estimates and effects of data structure.
Mortality rates in OECD countries converged during the period 1990-2010.
Bremberg, Sven G
2017-06-01
Since the scientific revolution of the 18th century, human health has gradually improved, but there is no unifying theory that explains this improvement in health. Studies of macrodeterminants have produced conflicting results. Most studies have analysed health at a given point in time as the outcome; however, the rate of improvement in health might be a more appropriate outcome. Twenty-eight OECD member countries were selected for analysis in the period 1990-2010. The main outcomes studied, in six age groups, were the national rates of decrease in mortality in the period 1990-2010. The effects of seven potential determinants on the rates of decrease in mortality were analysed in linear multiple regression models using least squares, controlling for country-specific history constants, which represent the mortality rate in 1990. The multiple regression analyses started with models that only included mortality rates in 1990 as determinants. These models explained 87% of the intercountry variation in the children aged 1-4 years and 51% in adults aged 55-74 years. When added to the regression equations, the seven determinants did not seem to significantly increase the explanatory power of the equations. The analyses indicated a decrease in mortality in all nations and in all age groups. The development of mortality rates in the different nations demonstrated significant catch-up effects. Therefore an important objective of the national public health sector seems to be to reduce the delay between international research findings and the universal implementation of relevant innovations.
Barr, A J; Dube, B; Hensor, E M A; Kingsbury, S R; Peat, G; Bowes, M A; Conaghan, P G
2014-10-01
Radiographic measures of osteoarthritis (OA) are based upon two dimensional projection images. Active appearance modelling (AAM) of knee magnetic resonance imaging (MRI) enables accurate, 3D quantification of joint structures in large cohorts. This cross-sectional study explored the relationship between clinical characteristics, radiographic measures of OA and 3D bone area (tAB). Clinical data and baseline paired radiographic and MRI data, from the medial compartment of one knee of 2588 participants were obtained from the NIH Osteoarthritis Initiative (OAI). The medial femur (MF) and tibia (MT) tAB were calculated using AAM. 'OA-attributable' tAB (OA-tAB) was calculated using data from regression models of tAB of knees without OA. Associations between OA-tAB and radiographic measures of OA were investigated using linear regression. In univariable analyses, height, weight, and age in female knees without OA explained 43.1%, 32.1% and 0.1% of the MF tAB variance individually and 54.4% when included simultaneously in a multivariable model. Joint space width (JSW), osteophytes and sclerosis explained just 5.3%, 14.9% and 10.1% of the variance of MF OA-tAB individually and 17.4% when combined. Kellgren Lawrence (KL) grade explained approximately 20% of MF OA-tAB individually. Similar results were seen for MT OA-tAB. Height explained the majority of variance in tAB, confirming an allometric relationship between body and joint size. Radiographic measures of OA, derived from a single radiographic projection, accounted for only a small amount of variation in 3D knee OA-tAB. The additional structural information provided by 3D bone area may explain the lack of a substantive relationship with these radiographic OA measures. Copyright © 2014 The Authors. Published by Elsevier Ltd.. All rights reserved.
NASA Astrophysics Data System (ADS)
Chiu, Tina
This dissertation includes three studies that analyze a new set of assessment tasks developed by the Learning Progressions in Middle School Science (LPS) Project. These assessment tasks were designed to measure science content knowledge on the structure of matter domain and scientific argumentation, while following the goals from the Next Generation Science Standards (NGSS). The three studies focus on the evidence available for the success of this design and its implementation, generally labelled as "validity" evidence. I use explanatory item response models (EIRMs) as the overarching framework to investigate these assessment tasks. These models can be useful when gathering validity evidence for assessments as they can help explain student learning and group differences. In the first study, I explore the dimensionality of the LPS assessment by comparing the fit of unidimensional, between-item multidimensional, and Rasch testlet models to see which is most appropriate for this data. By applying multidimensional item response models, multiple relationships can be investigated, and in turn, allow for a more substantive look into the assessment tasks. The second study focuses on person predictors through latent regression and differential item functioning (DIF) models. Latent regression models show the influence of certain person characteristics on item responses, while DIF models test whether one group is differentially affected by specific assessment items, after conditioning on latent ability. Finally, the last study applies the linear logistic test model (LLTM) to investigate whether item features can help explain differences in item difficulties.
ERIC Educational Resources Information Center
Luna, Andrew L.
2007-01-01
The purpose of this study was to determine if a market ratio factor was a better predictor of faculty salaries than the use of k-1 dummy variables representing the various disciplines. This study used two multiple regression analyses to develop an explanatory model to determine which model might best explain faculty salaries. A total of 20 out of…
High-risk regions and outbreak modelling of tularemia in humans.
Desvars-Larrive, A; Liu, X; Hjertqvist, M; Sjöstedt, A; Johansson, A; Rydén, P
2017-02-01
Sweden reports large and variable numbers of human tularemia cases, but the high-risk regions are anecdotally defined and factors explaining annual variations are poorly understood. Here, high-risk regions were identified by spatial cluster analysis on disease surveillance data for 1984-2012. Negative binomial regression with five previously validated predictors (including predicted mosquito abundance and predictors based on local weather data) was used to model the annual number of tularemia cases within the high-risk regions. Seven high-risk regions were identified with annual incidences of 3·8-44 cases/100 000 inhabitants, accounting for 56·4% of the tularemia cases but only 9·3% of Sweden's population. For all high-risk regions, most cases occurred between July and September. The regression models explained the annual variation of tularemia cases within most high-risk regions and discriminated between years with and without outbreaks. In conclusion, tularemia in Sweden is concentrated in a few high-risk regions and shows high annual and seasonal variations. We present reproducible methods for identifying tularemia high-risk regions and modelling tularemia cases within these regions. The results may help health authorities to target populations at risk and lay the foundation for developing an early warning system for outbreaks.
Wakie, Tewodros; Kumar, Sunil; Senay, Gabriel; Takele, Abera; Lencho, Alemu
2016-01-01
A number of studies have reported the presence of wheat septoria leaf blotch (Septoria tritici; SLB) disease in Ethiopia. However, the environmental factors associated with SLB disease, and areas under risk of SLB disease, have not been studied. Here, we tested the hypothesis that environmental variables can adequately explain observed SLB disease severity levels in West Shewa, Central Ethiopia. Specifically, we identified 50 environmental variables and assessed their relationships with SLB disease severity. Geographically referenced disease severity data were obtained from the field, and linear regression and Boosted Regression Trees (BRT) modeling approaches were used for developing spatial models. Moderate-resolution imaging spectroradiometer (MODIS) derived vegetation indices and land surface temperature (LST) variables highly influenced SLB model predictions. Soil and topographic variables did not sufficiently explain observed SLB disease severity variation in this study. Our results show that wheat growing areas in Central Ethiopia, including highly productive districts, are at risk of SLB disease. The study demonstrates the integration of field data with modeling approaches such as BRT for predicting the spatial patterns of severity of a pathogenic wheat disease in Central Ethiopia. Our results can aid Ethiopia's wheat disease monitoring efforts, while our methods can be replicated for testing related hypotheses elsewhere.
Visentin, G; McDermott, A; McParland, S; Berry, D P; Kenny, O A; Brodkorb, A; Fenelon, M A; De Marchi, M
2015-09-01
Rapid, cost-effective monitoring of milk technological traits is a significant challenge for dairy industries specialized in cheese manufacturing. The objective of the present study was to investigate the ability of mid-infrared spectroscopy to predict rennet coagulation time, curd-firming time, curd firmness at 30 and 60min after rennet addition, heat coagulation time, casein micelle size, and pH in cow milk samples, and to quantify associations between these milk technological traits and conventional milk quality traits. Samples (n=713) were collected from 605 cows from multiple herds; the samples represented multiple breeds, stages of lactation, parities, and milking times. Reference analyses were undertaken in accordance with standardized methods, and mid-infrared spectra in the range of 900 to 5,000cm(-1) were available for all samples. Prediction models were developed using partial least squares regression, and prediction accuracy was based on both cross and external validation. The proportion of variance explained by the prediction models in external validation was greatest for pH (71%), followed by rennet coagulation time (55%) and milk heat coagulation time (46%). Models to predict curd firmness 60min from rennet addition and casein micelle size, however, were poor, explaining only 25 and 13%, respectively, of the total variance in each trait within external validation. On average, all prediction models tended to be unbiased. The linear regression coefficient of the reference value on the predicted value varied from 0.17 (casein micelle size regression model) to 0.83 (pH regression model) but all differed from 1. The ratio performance deviation of 1.07 (casein micelle size prediction model) to 1.79 (pH prediction model) for all prediction models in the external validation was <2, suggesting that none of the prediction models could be used for analytical purposes. With the exception of casein micelle size and curd firmness at 60min after rennet addition, the developed prediction models may be useful as a screening method, because the concordance correlation coefficient ranged from 0.63 (heat coagulation time prediction model) to 0.84 (pH prediction model) in the external validation. Copyright © 2015 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
Hyper-Spectral Image Analysis With Partially Latent Regression and Spatial Markov Dependencies
NASA Astrophysics Data System (ADS)
Deleforge, Antoine; Forbes, Florence; Ba, Sileye; Horaud, Radu
2015-09-01
Hyper-spectral data can be analyzed to recover physical properties at large planetary scales. This involves resolving inverse problems which can be addressed within machine learning, with the advantage that, once a relationship between physical parameters and spectra has been established in a data-driven fashion, the learned relationship can be used to estimate physical parameters for new hyper-spectral observations. Within this framework, we propose a spatially-constrained and partially-latent regression method which maps high-dimensional inputs (hyper-spectral images) onto low-dimensional responses (physical parameters such as the local chemical composition of the soil). The proposed regression model comprises two key features. Firstly, it combines a Gaussian mixture of locally-linear mappings (GLLiM) with a partially-latent response model. While the former makes high-dimensional regression tractable, the latter enables to deal with physical parameters that cannot be observed or, more generally, with data contaminated by experimental artifacts that cannot be explained with noise models. Secondly, spatial constraints are introduced in the model through a Markov random field (MRF) prior which provides a spatial structure to the Gaussian-mixture hidden variables. Experiments conducted on a database composed of remotely sensed observations collected from the Mars planet by the Mars Express orbiter demonstrate the effectiveness of the proposed model.
Steinmann, Zoran J N; Venkatesh, Aranya; Hauck, Mara; Schipper, Aafke M; Karuppiah, Ramkumar; Laurenzi, Ian J; Huijbregts, Mark A J
2014-05-06
One of the major challenges in life cycle assessment (LCA) is the availability and quality of data used to develop models and to make appropriate recommendations. Approximations and assumptions are often made if appropriate data are not readily available. However, these proxies may introduce uncertainty into the results. A regression model framework may be employed to assess missing data in LCAs of products and processes. In this study, we develop such a regression-based framework to estimate CO2 emission factors associated with coal power plants in the absence of reported data. Our framework hypothesizes that emissions from coal power plants can be explained by plant-specific factors (predictors) that include steam pressure, total capacity, plant age, fuel type, and gross domestic product (GDP) per capita of the resident nations of those plants. Using reported emission data for 444 plants worldwide, plant level CO2 emission factors were fitted to the selected predictors by a multiple linear regression model and a local linear regression model. The validated models were then applied to 764 coal power plants worldwide, for which no reported data were available. Cumulatively, available reported data and our predictions together account for 74% of the total world's coal-fired power generation capacity.
Kasprzyk, Danuta; Tshimanga, Mufuta; Hamilton, Deven T; Gorn, Gerald J; Montaño, Daniel E
2018-02-01
Male circumcision (MC) significantly reduces HIV acquisition among men, leading WHO/UNAIDS to recommend high HIV and low MC prevalence countries circumcise 80% of adolescents and men age 15-49. Despite significant investment to increase MC capacity only 27% of the goal has been achieved in Zimbabwe. To increase adoption, research to create evidence-based messages is greatly needed. The Integrated Behavioral Model (IBM) was used to investigate factors affecting MC motivation among adolescents. Based on qualitative elicitation study results a survey was designed and administered to a representative sample of 802 adolescent boys aged 13-17 in two urban and two rural areas in Zimbabwe. Multiple regression analysis found all six IBM constructs (2 attitude, 2 social influence, 2 personal agency) significantly explained MC intention (R 2 = 0.55). Stepwise regression analysis of beliefs underlying each IBM belief-based construct found 9 behavioral, 6 injunctive norm, 2 descriptive norm, 5 efficacy, and 8 control beliefs significantly explained MC intention. A final stepwise regression of all the significant IBM construct beliefs identified 12 key beliefs best explaining intention. Similar analyses were carried out with subgroups of adolescents by urban-rural and age. Different sets of behavioral, normative, efficacy, and control beliefs were significant for each sub-group. This study demonstrates the application of theory-driven research to identify evidence-based targets for the design of effective MC messages for interventions to increase adolescents' motivation. Incorporating these findings into communication campaigns is likely to improve demand for MC.
Kennedy, Jeffrey R.; Paretti, Nicholas V.; Veilleux, Andrea G.
2014-01-01
Regression equations, which allow predictions of n-day flood-duration flows for selected annual exceedance probabilities at ungaged sites, were developed using generalized least-squares regression and flood-duration flow frequency estimates at 56 streamgaging stations within a single, relatively uniform physiographic region in the central part of Arizona, between the Colorado Plateau and Basin and Range Province, called the Transition Zone. Drainage area explained most of the variation in the n-day flood-duration annual exceedance probabilities, but mean annual precipitation and mean elevation were also significant variables in the regression models. Standard error of prediction for the regression equations varies from 28 to 53 percent and generally decreases with increasing n-day duration. Outside the Transition Zone there are insufficient streamgaging stations to develop regression equations, but flood-duration flow frequency estimates are presented at select streamgaging stations.
Investigation of Relationship between QBO and Ionospheric Neutral Temperature
NASA Astrophysics Data System (ADS)
Saǧır, Selçuk; Atıcı, Ramazan; Özcan, Osman
2016-07-01
The relationship between Quasi Biennial Oscillation (QBO) measured at 10 hPa altitude and neutral temperature obtained from NRLMSIS-00 model for 90 km altitude of ionosphere is statistically investigated. For this study, multiple-regression model is used. To see effect on neutral temperature of QBO directions, Dummy variables are added to model established. In the results of performed analysis, it is observed that QBO is effected on neutral temperature of ionosphere. It is determined that 57% of variations at neutral temperature can be explainable by QBO. According to the established model, statistical explainable ratio was determined as 1% that it is the highest rate. Also, it is seen that an increase/a decrease of 1 meter per second at QBO give rise to an increase/a decrease of 0,07 K at neutral temperature.
Directional data analysis under the general projected normal distribution
Wang, Fangpo; Gelfand, Alan E.
2013-01-01
The projected normal distribution is an under-utilized model for explaining directional data. In particular, the general version provides flexibility, e.g., asymmetry and possible bimodality along with convenient regression specification. Here, we clarify the properties of this general class. We also develop fully Bayesian hierarchical models for analyzing circular data using this class. We show how they can be fit using MCMC methods with suitable latent variables. We show how posterior inference for distributional features such as the angular mean direction and concentration can be implemented as well as how prediction within the regression setting can be handled. With regard to model comparison, we argue for an out-of-sample approach using both a predictive likelihood scoring loss criterion and a cumulative rank probability score criterion. PMID:24046539
Parental Vaccine Acceptance: A Logistic Regression Model Using Previsit Decisions.
Lee, Sara; Riley-Behringer, Maureen; Rose, Jeanmarie C; Meropol, Sharon B; Lazebnik, Rina
2017-07-01
This study explores how parents' intentions regarding vaccination prior to their children's visit were associated with actual vaccine acceptance. A convenience sample of parents accompanying 6-week-old to 17-year-old children completed a written survey at 2 pediatric practices. Using hierarchical logistic regression, for hospital-based participants (n = 216), vaccine refusal history ( P < .01) and vaccine decision made before the visit ( P < .05) explained 87% of vaccine refusals. In community-based participants (n = 100), vaccine refusal history ( P < .01) explained 81% of refusals. Over 1 in 5 parents changed their minds about vaccination during the visit. Thirty parents who were previous vaccine refusers accepted current vaccines, and 37 who had intended not to vaccinate choose vaccination. Twenty-nine parents without a refusal history declined vaccines, and 32 who did not intend to refuse before the visit declined vaccination. Future research should identify key factors to nudge parent decision making in favor of vaccination.
A refined method for multivariate meta-analysis and meta-regression.
Jackson, Daniel; Riley, Richard D
2014-02-20
Making inferences about the average treatment effect using the random effects model for meta-analysis is problematic in the common situation where there is a small number of studies. This is because estimates of the between-study variance are not precise enough to accurately apply the conventional methods for testing and deriving a confidence interval for the average effect. We have found that a refined method for univariate meta-analysis, which applies a scaling factor to the estimated effects' standard error, provides more accurate inference. We explain how to extend this method to the multivariate scenario and show that our proposal for refined multivariate meta-analysis and meta-regression can provide more accurate inferences than the more conventional approach. We explain how our proposed approach can be implemented using standard output from multivariate meta-analysis software packages and apply our methodology to two real examples. Copyright © 2013 John Wiley & Sons, Ltd.
Zhang, Jingyi; Li, Bin; Chen, Yumin; Chen, Meijie; Fang, Tao; Liu, Yongfeng
2018-06-11
This paper proposes a regression model using the Eigenvector Spatial Filtering (ESF) method to estimate ground PM 2.5 concentrations. Covariates are derived from remotely sensed data including aerosol optical depth, normal differential vegetation index, surface temperature, air pressure, relative humidity, height of planetary boundary layer and digital elevation model. In addition, cultural variables such as factory densities and road densities are also used in the model. With the Yangtze River Delta region as the study area, we constructed ESF-based Regression (ESFR) models at different time scales, using data for the period between December 2015 and November 2016. We found that the ESFR models effectively filtered spatial autocorrelation in the OLS residuals and resulted in increases in the goodness-of-fit metrics as well as reductions in residual standard errors and cross-validation errors, compared to the classic OLS models. The annual ESFR model explained 70% of the variability in PM 2.5 concentrations, 16.7% more than the non-spatial OLS model. With the ESFR models, we performed detail analyses on the spatial and temporal distributions of PM 2.5 concentrations in the study area. The model predictions are lower than ground observations but match the general trend. The experiment shows that ESFR provides a promising approach to PM 2.5 analysis and prediction.
Schleicher, Rosemary L; Sternberg, Maya R; Pfeiffer, Christine M
2013-06-01
Sociodemographic and lifestyle factors exert important influences on nutritional status; however, information on their association with biomarkers of fat-soluble nutrients is limited, particularly in a representative sample of adults. Serum or plasma concentrations of vitamin A, vitamin E, carotenes, xanthophylls, 25-hydroxyvitamin D [25(OH)D], SFAs, MUFAs, PUFAs, and total fatty acids (tFAs) were measured in adults (aged ≥ 20 y) during all or part of NHANES 2003-2006. Simple and multiple linear regression models were used to assess 5 sociodemographic variables (age, sex, race-ethnicity, education, and income) and 5 lifestyle behaviors (smoking, alcohol consumption, BMI, physical activity, and supplement use) and their relation to biomarker concentrations. Adjustment for total serum cholesterol and lipid-altering drug use was added to the full regression model. Adjustment for latitude and season was added to the full model for 25(OH)D. Based on simple linear regression, race-ethnicity, BMI, and supplement use were significantly related to all fat-soluble biomarkers. Sociodemographic variables as a group explained 5-17% of biomarker variability, whereas together, sociodemographic and lifestyle variables explained 22-23% [25(OH)D, vitamin E, xanthophylls], 17% (vitamin A), 15% (MUFAs), 10-11% (SFAs, carotenes, tFAs), and 6% (PUFAs) of biomarker variability. Although lipid adjustment explained additional variability for all biomarkers except for 25(OH)D, it appeared to be largely independent of sociodemographic and lifestyle variables. After adjusting for sociodemographic, lifestyle, and lipid-related variables, major differences in biomarkers were associated with race-ethnicity (from -44 to 57%), smoking (up to -25%), supplement use (up to 21%), and BMI (up to -15%). Latitude and season attenuated some race-ethnicity differences. Of the sociodemographic and lifestyle variables examined, with or without lipid adjustment, most fat-soluble nutrient biomarkers were significantly associated with race-ethnicity.
Cade, Brian S.; Noon, Barry R.; Scherer, Rick D.; Keane, John J.
2017-01-01
Counts of avian fledglings, nestlings, or clutch size that are bounded below by zero and above by some small integer form a discrete random variable distribution that is not approximated well by conventional parametric count distributions such as the Poisson or negative binomial. We developed a logistic quantile regression model to provide estimates of the empirical conditional distribution of a bounded discrete random variable. The logistic quantile regression model requires that counts are randomly jittered to a continuous random variable, logit transformed to bound them between specified lower and upper values, then estimated in conventional linear quantile regression, repeating the 3 steps and averaging estimates. Back-transformation to the original discrete scale relies on the fact that quantiles are equivariant to monotonic transformations. We demonstrate this statistical procedure by modeling 20 years of California Spotted Owl fledgling production (0−3 per territory) on the Lassen National Forest, California, USA, as related to climate, demographic, and landscape habitat characteristics at territories. Spotted Owl fledgling counts increased nonlinearly with decreasing precipitation in the early nesting period, in the winter prior to nesting, and in the prior growing season; with increasing minimum temperatures in the early nesting period; with adult compared to subadult parents; when there was no fledgling production in the prior year; and when percentage of the landscape surrounding nesting sites (202 ha) with trees ≥25 m height increased. Changes in production were primarily driven by changes in the proportion of territories with 2 or 3 fledglings. Average variances of the discrete cumulative distributions of the estimated fledgling counts indicated that temporal changes in climate and parent age class explained 18% of the annual variance in owl fledgling production, which was 34% of the total variance. Prior fledgling production explained as much of the variance in the fledgling counts as climate, parent age class, and landscape habitat predictors. Our logistic quantile regression model can be used for any discrete response variables with fixed upper and lower bounds.
Energy, water and large-scale patterns of reptile and amphibian species richness in Europe
NASA Astrophysics Data System (ADS)
Rodríguez, Miguel Á.; Belmontes, Juan Alfonso; Hawkins, Bradford A.
2005-07-01
We used regression analyses to examine the relationships between reptile and amphibian species richness in Europe and 11 environmental variables related to five hypotheses for geographical patterns of species richness: (1) productivity; (2) ambient energy; (3) water-energy balance, (4) habitat heterogeneity; and (5) climatic variability. For reptiles, annual potential evapotranspiration (PET), a measure of the amount of atmospheric energy, explained 71% of the variance, with variability in log elevation explaining an additional 6%. For amphibians, annual actual evapotranspiration (AET), a measure of the joint availability of energy and water in the environment, and the global vegetation index, an estimate of plant biomass generated through satellite remote sensing, both described similar proportions of the variance (61% and 60%, respectively) and had partially independent effects on richness as indicated by multiple regression. The two-factor environmental models successfully removed most of the statistically detectable spatial autocorrelation in the richness data of both groups. Our results are consistent with reptile and amphibian environmental requirements, where the former depend strongly on solar energy and the latter require both warmth and moisture for reproduction. We conclude that ambient energy explains the reptile richness pattern, whereas for amphibians a combination of water-energy balance and productivity best explain the pattern.
Competing risks models and time-dependent covariates
Barnett, Adrian; Graves, Nick
2008-01-01
New statistical models for analysing survival data in an intensive care unit context have recently been developed. Two models that offer significant advantages over standard survival analyses are competing risks models and multistate models. Wolkewitz and colleagues used a competing risks model to examine survival times for nosocomial pneumonia and mortality. Their model was able to incorporate time-dependent covariates and so examine how risk factors that changed with time affected the chances of infection or death. We briefly explain how an alternative modelling technique (using logistic regression) can more fully exploit time-dependent covariates for this type of data. PMID:18423067
Olea, Pedro P.; Mateo-Tomás, Patricia; de Frutos, Ángel
2010-01-01
Background Hierarchical partitioning (HP) is an analytical method of multiple regression that identifies the most likely causal factors while alleviating multicollinearity problems. Its use is increasing in ecology and conservation by its usefulness for complementing multiple regression analysis. A public-domain software “hier.part package” has been developed for running HP in R software. Its authors highlight a “minor rounding error” for hierarchies constructed from >9 variables, however potential bias by using this module has not yet been examined. Knowing this bias is pivotal because, for example, the ranking obtained in HP is being used as a criterion for establishing priorities of conservation. Methodology/Principal Findings Using numerical simulations and two real examples, we assessed the robustness of this HP module in relation to the order the variables have in the analysis. Results indicated a considerable effect of the variable order on the amount of independent variance explained by predictors for models with >9 explanatory variables. For these models the nominal ranking of importance of the predictors changed with variable order, i.e. predictors declared important by its contribution in explaining the response variable frequently changed to be either most or less important with other variable orders. The probability of changing position of a variable was best explained by the difference in independent explanatory power between that variable and the previous one in the nominal ranking of importance. The lesser is this difference, the more likely is the change of position. Conclusions/Significance HP should be applied with caution when more than 9 explanatory variables are used to know ranking of covariate importance. The explained variance is not a useful parameter to use in models with more than 9 independent variables. The inconsistency in the results obtained by HP should be considered in future studies as well as in those already published. Some recommendations to improve the analysis with this HP module are given. PMID:20657734
Raghavan, S; Zhang, W; Yang, I V; Lange, L A; Lange, E M; Fingerlin, T E; Dabelea, D
2017-12-01
To examine the extent to which offspring obesity-associated genetic risk explains the association between gestational diabetes mellitus and childhood adiposity. We studied 282 children aged 7-12 years who were enrolled in the Exploring Perinatal Outcomes in Children Study. A genetic risk score for BMI was calculated as the count of 91 established BMI-raising risk alleles. Multivariable linear and logistic regression models were used to estimate associations between the offspring genetic risk score and exposure to gestational diabetes and childhood adiposity (BMI and waist circumference), adjusting for clinical and demographic covariates. The contribution of offspring genetic risk to associations between maternal gestational diabetes and childhood outcomes was estimated by comparing the regression coefficients for the gestational diabetes variable in models with and without the genetic risk score. The offspring BMI genetic risk score was associated with childhood BMI (P = 0.006) and waist circumference (P = 0.02), and marginally with gestational diabetes (P = 0.05). Offspring BMI genetic risk did not contribute significantly to associations between gestational diabetes and childhood BMI [7.7% (95% CI -3.3, 18.8)] or waist circumference [5.8% (95% CI -3.1, 14.8); P = 0.2 for both]. Offspring obesity genetic risk does not explain a significant proportion of the association between gestational diabetes exposure and childhood adiposity. The association between gestational diabetes and childhood adiposity is probably explained through alternative pathways, including direct intrauterine effects or a shared postnatal environment. © 2017 Diabetes UK.
Possibility of modifying the growth trajectory in Raeini Cashmere goat.
Ghiasi, Heydar; Mokhtari, M S
2018-03-27
The objective of this study was to investigate the possibility of modifying the growth trajectory in Raeini Cashmere goat breed. In total, 13,193 records on live body weight collected from 4788 Raeini Cashmere goats were used. According to Akanke's information criterion (AIC), the sing-trait random regression model included fourth-order Legendre polynomial for direct and maternal genetic effect; maternal and individual permanent environmental effect was the best model for estimating (co)variance components. The matrices of eigenvectors for (co)variances between random regression coefficients of direct additive genetic were used to calculate eigenfunctions, and different eigenvector indices were also constructed. The obtained results showed that the first eigenvalue explained 79.90% of total genetic variance. Therefore, changing the body weights applying the first eigenfunction will be obtained rapidly. Selection based on the first eigenvector will cause favorable positive genetic gains for all body weight considered from birth to 12 months of age. For modifying the growth trajectory in Raeini Cashmere goat, the selection should be based on the second eigenfunction. The second eigenvalue accounted for 14.41% of total genetic variance for body weights that is low in comparison with genetic variance explained by the first eigenvalue. The complex patterns of genetic change in growth trajectory observed under the third and fourth eigenfunction and low amount of genetic variance explained by the third and fourth eigenvalues.
[Mapping environmental vulnerability from ETM + data in the Yellow River Mouth Area].
Wang, Rui-Yan; Yu, Zhen-Wen; Xia, Yan-Ling; Wang, Xiang-Feng; Zhao, Geng-Xing; Jiang, Shu-Qian
2013-10-01
The environmental vulnerability retrieval is important to support continuing data. The spatial distribution of regional environmental vulnerability was got through remote sensing retrieval. In view of soil and vegetation, the environmental vulnerability evaluation index system was built, and the environmental vulnerability of sampling points was calculated by the AHP-fuzzy method, then the correlation between the sampling points environmental vulnerability and ETM + spectral reflectance ratio including some kinds of conversion data was analyzed to determine the sensitive spectral parameters. Based on that, models of correlation analysis, traditional regression, BP neural network and support vector regression were taken to explain the quantitative relationship between the spectral reflectance and the environmental vulnerability. With this model, the environmental vulnerability distribution was retrieved in the Yellow River Mouth Area. The results showed that the correlation between the environmental vulnerability and the spring NDVI, the September NDVI and the spring brightness was better than others, so they were selected as the sensitive spectral parameters. The model precision result showed that in addition to the support vector model, the other model reached the significant level. While all the multi-variable regression was better than all one-variable regression, and the model accuracy of BP neural network was the best. This study will serve as a reliable theoretical reference for the large spatial scale environmental vulnerability estimation based on remote sensing data.
Linear regression analysis: part 14 of a series on evaluation of scientific publications.
Schneider, Astrid; Hommel, Gerhard; Blettner, Maria
2010-11-01
Regression analysis is an important statistical method for the analysis of medical data. It enables the identification and characterization of relationships among multiple factors. It also enables the identification of prognostically relevant risk factors and the calculation of risk scores for individual prognostication. This article is based on selected textbooks of statistics, a selective review of the literature, and our own experience. After a brief introduction of the uni- and multivariable regression models, illustrative examples are given to explain what the important considerations are before a regression analysis is performed, and how the results should be interpreted. The reader should then be able to judge whether the method has been used correctly and interpret the results appropriately. The performance and interpretation of linear regression analysis are subject to a variety of pitfalls, which are discussed here in detail. The reader is made aware of common errors of interpretation through practical examples. Both the opportunities for applying linear regression analysis and its limitations are presented.
Yue, Xu; Mickley, Loretta J.; Logan, Jennifer A.; Kaplan, Jed O.
2013-01-01
We estimate future wildfire activity over the western United States during the mid-21st century (2046–2065), based on results from 15 climate models following the A1B scenario. We develop fire prediction models by regressing meteorological variables from the current and previous years together with fire indexes onto observed regional area burned. The regressions explain 0.25–0.60 of the variance in observed annual area burned during 1980–2004, depending on the ecoregion. We also parameterize daily area burned with temperature, precipitation, and relative humidity. This approach explains ~0.5 of the variance in observed area burned over forest ecoregions but shows no predictive capability in the semi-arid regions of Nevada and California. By applying the meteorological fields from 15 climate models to our fire prediction models, we quantify the robustness of our wildfire projections at mid-century. We calculate increases of 24–124% in area burned using regressions and 63–169% with the parameterization. Our projections are most robust in the southwestern desert, where all GCMs predict significant (p<0.05) meteorological changes. For forested ecoregions, more GCMs predict significant increases in future area burned with the parameterization than with the regressions, because the latter approach is sensitive to hydrological variables that show large inter-model variability in the climate projections. The parameterization predicts that the fire season lengthens by 23 days in the warmer and drier climate at mid-century. Using a chemical transport model, we find that wildfire emissions will increase summertime surface organic carbon aerosol over the western United States by 46–70% and black carbon by 20–27% at midcentury, relative to the present day. The pollution is most enhanced during extreme episodes: above the 84th percentile of concentrations, OC increases by ~90% and BC by ~50%, while visibility decreases from 130 km to 100 km in 32 Federal Class 1 areas in Rocky Mountains Forest. PMID:24015109
Analysis of Blood Transfusion Data Using Bivariate Zero-Inflated Poisson Model: A Bayesian Approach.
Mohammadi, Tayeb; Kheiri, Soleiman; Sedehi, Morteza
2016-01-01
Recognizing the factors affecting the number of blood donation and blood deferral has a major impact on blood transfusion. There is a positive correlation between the variables "number of blood donation" and "number of blood deferral": as the number of return for donation increases, so does the number of blood deferral. On the other hand, due to the fact that many donors never return to donate, there is an extra zero frequency for both of the above-mentioned variables. In this study, in order to apply the correlation and to explain the frequency of the excessive zero, the bivariate zero-inflated Poisson regression model was used for joint modeling of the number of blood donation and number of blood deferral. The data was analyzed using the Bayesian approach applying noninformative priors at the presence and absence of covariates. Estimating the parameters of the model, that is, correlation, zero-inflation parameter, and regression coefficients, was done through MCMC simulation. Eventually double-Poisson model, bivariate Poisson model, and bivariate zero-inflated Poisson model were fitted on the data and were compared using the deviance information criteria (DIC). The results showed that the bivariate zero-inflated Poisson regression model fitted the data better than the other models.
Analysis of Blood Transfusion Data Using Bivariate Zero-Inflated Poisson Model: A Bayesian Approach
Mohammadi, Tayeb; Sedehi, Morteza
2016-01-01
Recognizing the factors affecting the number of blood donation and blood deferral has a major impact on blood transfusion. There is a positive correlation between the variables “number of blood donation” and “number of blood deferral”: as the number of return for donation increases, so does the number of blood deferral. On the other hand, due to the fact that many donors never return to donate, there is an extra zero frequency for both of the above-mentioned variables. In this study, in order to apply the correlation and to explain the frequency of the excessive zero, the bivariate zero-inflated Poisson regression model was used for joint modeling of the number of blood donation and number of blood deferral. The data was analyzed using the Bayesian approach applying noninformative priors at the presence and absence of covariates. Estimating the parameters of the model, that is, correlation, zero-inflation parameter, and regression coefficients, was done through MCMC simulation. Eventually double-Poisson model, bivariate Poisson model, and bivariate zero-inflated Poisson model were fitted on the data and were compared using the deviance information criteria (DIC). The results showed that the bivariate zero-inflated Poisson regression model fitted the data better than the other models. PMID:27703493
Predicting biological condition in southern California streams
Brown, Larry R.; May, Jason T.; Rehn, Andrew C.; Ode, Peter R.; Waite, Ian R.; Kennen, Jonathan G.
2012-01-01
As understanding of the complex relations among environmental stressors and biological responses improves, a logical next step is predictive modeling of biological condition at unsampled sites. We developed a boosted regression tree (BRT) model of biological condition, as measured by a benthic macroinvertebrate index of biotic integrity (BIBI), for streams in urbanized Southern Coastal California. We also developed a multiple linear regression (MLR) model as a benchmark for comparison with the BRT model. The BRT model explained 66% of the variance in B-IBI, identifying watershed population density and combined percentage agricultural and urban land cover in the riparian buffer as the most important predictors of B-IBI, but with watershed mean precipitation and watershed density of manmade channels also important. The MLR model explained 48% of the variance in B-IBI and included watershed population density and combined percentage agricultural and urban land cover in the riparian buffer. For a verification data set, the BRT model correctly classified 75% of impaired sites (B-IBI < 40) and 78% of unimpaired sites (B-IBI = 40). For the same verification data set, the MLR model correctly classified 69% of impaired sites and 87% of unimpaired sites. The BRT model should not be used to predict B-IBI for specific sites; however, the model can be useful for general applications such as identifying and prioritizing regions for monitoring, remediation or preservation, stratifying new bioassessments according to anticipated biological condition, or assessing the potential for change in stream biological condition based on anticipated changes in population density and development in stream buffers.
Rainfall and streamflow from small tree-covered and fern-covered and burned watersheds in Hawaii
H. W. Anderson; P. D. Duffy; Teruo Yamamoto
1966-01-01
Streamflow from two 30-acre watersheds near Honolulu was studied by using principal components regression analysis. Models using data on monthly, storm, and peak discharges were tested against several variables expressing amount and intensity of rainfall, and against variables expressing antecedent rainfall. Explained variation ranged from 78 to 94 percent. The...
Predictors of Performance in Introductory Finance: Variables within and beyond the Student's Control
ERIC Educational Resources Information Center
Englander, Fred; Wang, Zhaobo; Betz, Kenneth
2015-01-01
This study examined variables that are within and beyond the control of students in explaining variations in performance in an introductory finance course. Regression models were utilized to consider whether the variables within the student's control have a greater impact on course performance relative to the variables beyond the student's…
ERIC Educational Resources Information Center
Morgan, Paul L.; Farkas, George; Cook, Michael; Strassfeld, Natasha M.; Hillemeier, Marianne M.; Pun, Wik Hung; Wang, Yangyang; Schussler, Deborah L.
2018-01-01
We conducted a best-evidence synthesis of 22 studies to examine whether systemic bias explained minority disproportionate overrepresentation in special education. Of the total regression model estimates, only 7/168 (4.2%), 14/208 (6.7%), 2/37 (5.4%), and 6/91 (6.6%) indicated statistically significant overrepresentation for Hispanic, Asian, Native…
NASA Astrophysics Data System (ADS)
Song, Lanlan
2017-04-01
Nitrous oxide is much more potent greenhouse gas than carbon dioxide. However, the estimation of N2O flux is usually clouded with uncertainty, mainly due to high spatial and temporal variations. This hampers the development of general mechanistic models for N2O emission as well, as most previously developed models were empirical or exhibited low predictability with numerous assumptions. In this study, we tested General Regression Neural Networks (GRNN) as an alternative to classic empirical models for simulating N2O emission in riparian zones of Reservoirs. GRNN and nonlinear regression (NLR) were applied to estimate the N2O flux of 1-year observations in riparian zones of Three Gorge Reservoir. NLR resulted in lower prediction power and higher residuals compared to GRNN. Although nonlinear regression model estimated similar average values of N2O, it could not capture the fluctuation patterns accurately. In contrast, GRNN model achieved a fairly high predictability, with an R2 of 0.59 for model validation, 0.77 for model calibration (training), and a low root mean square error (RMSE), indicating a high capacity to simulate the dynamics of N2O flux. According to a sensitivity analysis of the GRNN, nonlinear relationships between input variables and N2O flux were well explained. Our results suggest that the GRNN developed in this study has a greater performance in simulating variations in N2O flux than nonlinear regressions.
Merkel, C; Morabito, A; Sacerdoti, D; Bolognesi, M; Angeli, P; Gatta, A
1998-06-01
The determination of aminopyrine breath test on entry into the study was recently shown to improve the accuracy of prediction of death based on the Child-Pugh classification, but the possible usefulness of serial determinations of both parameters has not been assessed. In the present study, we aimed at evaluating whether serial determinations of aminopyrine breath test and Child-Pugh score improve prognostic accuracy in patients with cirrhosis, compared with determinations obtained only on admission. In 74 patients with liver cirrhosis aminopyrine breath test and Child-Pugh score were obtained upon entry into the study. Patients were followed with sequential aminopyrine breath tests and assessments of the Child-Pugh score every 4-6 months. A total number of 232 determinations were obtained. During follow-up 45 patients died, on average after 12 months of follow-up. Child-Pugh score improved in the beginning of follow-up, and then remained fairly constant; aminopyrine breath test showed no improvement in the beginning of follow-up, but rather a slowly progressive decline. In patients who died, both the Child-Pugh score and the metabolism of aminopyrine were significantly more impaired in the last year preceding death (p < 0.05). Applying Cox's regression model with time-dependent covariates, Child-Pugh score and aminopyrine breath test were independent significant predictors of survival. The model with time-dependent covariates explained the observed survival much better than the model with time-fixed covariates (chi-sq. explained by regression = 31.45 vs 11.97; d.f. = 2; p = 0.0000001 vs 0.003). These data suggest that serial determinations of Child-Pugh score and aminopyrine breath test can be used to efficiently update prognosis of cirrhosis.
Casemix funding for a specialist paediatrics hospital: a hedonic regression approach.
Bridges, J F; Hanson, R M
2000-01-01
This paper inquires into the effects that Diagnosis Related Groups (DRGs) have had on the ability to explain patient-level costs in a specialist paediatrics hospital. Two hedonic models are estimated using 1996/97 New Children's Hospital (NCH) patient level cost data, one with and one without a casemix index (CMI). The results show that the inclusion of a casemix index as an explanatory variable leads to a better accounting of cost. The full hedonic model is then used to simulate a funding model for the 1997/98 NCH cost data. These costs are highly correlated with the actual costs reported for that year. In addition, univariate regression indicates that there has been inflation in costs in the order of 4.8% between the two years. In conclusion, hedonic analysis can provide valuable evidence for the design of funding models that account for casemix.
Prediction of episodic acidification in North-eastern USA: An empirical/mechanistic approach
Davies, T.D.; Tranter, M.; Wigington, P.J.; Eshleman, K.N.; Peters, N.E.; Van Sickle, J.; DeWalle, David R.; Murdoch, Peter S.
1999-01-01
Observations from the US Environmental Protection Agency's Episodic Response Project (ERP) in the North-eastern United States are used to develop an empirical/mechanistic scheme for prediction of the minimum values of acid neutralizing capacity (ANC) during episodes. An acidification episode is defined as a hydrological event during which ANC decreases. The pre-episode ANC is used to index the antecedent condition, and the stream flow increase reflects how much the relative contributions of sources of waters change during the episode. As much as 92% of the total variation in the minimum ANC in individual catchments can be explained (with levels of explanation >70% for nine of the 13 streams) by a multiple linear regression model that includes pre-episode ANC and change in discharge as independent variable. The predictive scheme is demonstrated to be regionally robust, with the regional variance explained ranging from 77 to 83%. The scheme is not successful for each ERP stream, and reasons are suggested for the individual failures. The potential for applying the predictive scheme to other watersheds is demonstrated by testing the model with data from the Panola Mountain Research Watershed in the South-eastern United States, where the variance explained by the model was 74%. The model can also be utilized to assess 'chemically new' and 'chemically old' water sources during acidification episodes.Observations from the US Environmental Protection Agency's Episodic Response Project (ERP) in the Northeastern United States are used to develop an empirical/mechanistic scheme for prediction of the minimum values of acid neutralizing capacity (ANC) during episodes. An acidification episode is defined as a hydrological event during which ANC decreases. The pre-episode ANC is used to index the antecedent condition, and the stream flow increase reflects how much the relative contributions of sources of waters change during the episode. As much as 92% of the total variation in the minimum ANC in individual catchments can be explained (with levels of explanation >70% for nine of the 13 streams) by a multiple linear regression model that includes pre-episode ANC and change in discharge as independent variables. The predictive scheme is demonstrated to be regionally robust, with the regional variance explained ranging from 77 to 83%. The scheme is not successful for each ERP stream, and reasons are suggested for the individual failures. The potential for applying the predictive scheme to other watersheds is demonstrated by testing the model with data from the Panola Mountain Research Watershed in the South-eastern United States, where the variance explained by the model was 74%. The model can also be utilized to assess `chemically new' and `chemically old' water sources during acidification episodes.
Altmann, Vivian; Schumacher-Schuh, Artur F; Rieck, Mariana; Callegari-Jacques, Sidia M; Rieder, Carlos R M; Hutz, Mara H
2016-04-01
Levodopa is first-line treatment of Parkinson's disease motor symptoms but, dose response is highly variable. Therefore, the aim of this study was to determine how much levodopa dose could be explained by biological, pharmacological and genetic factors. A total of 224 Parkinson's disease patients were genotyped for SV2C and SLC6A3 polymorphisms by allelic discrimination assays. Comedication, demographic and clinical data were also assessed. All variables with p < 0.20 were included in a multiple regression analysis for dose prediction. The final model explained 23% of dose variation (F = 11.54; p < 0.000001). Although a good prediction model was obtained, it still needs to be tested in an independent sample to be validated.
Berlinguer, Fiammetta; Madeddu, Manuela; Pasciu, Valeria; Succu, Sara; Spezzigu, Antonio; Satta, Valentina; Mereu, Paolo; Leoni, Giovanni G; Naitana, Salvatore
2009-01-01
Currently, the assessment of sperm function in a raw or processed semen sample is not able to reliably predict sperm ability to withstand freezing and thawing procedures and in vivo fertility and/or assisted reproductive biotechnologies (ART) outcome. The aim of the present study was to investigate which parameters among a battery of analyses could predict subsequent spermatozoa in vitro fertilization ability and hence blastocyst output in a goat model. Ejaculates were obtained by artificial vagina from 3 adult goats (Capra hircus) aged 2 years (A, B and C). In order to assess the predictive value of viability, computer assisted sperm analyzer (CASA) motility parameters and ATP intracellular concentration before and after thawing and of DNA integrity after thawing on subsequent embryo output after an in vitro fertility test, a logistic regression analysis was used. Individual differences in semen parameters were evident for semen viability after thawing and DNA integrity. Results of IVF test showed that spermatozoa collected from A and B lead to higher cleavage rates (0 < 0.01) and blastocysts output (p < 0.05) compared with C. Logistic regression analysis model explained a deviance of 72% (p < 0.0001), directly related with the mean percentage of rapid spermatozoa in fresh semen (p < 0.01), semen viability after thawing (p < 0.01), and with two of the three comet parameters considered, i.e tail DNA percentage and comet length (p < 0.0001). DNA integrity alone had a high predictive value on IVF outcome with frozen/thawed semen (deviance explained: 57%). The model proposed here represents one of the many possible ways to explain differences found in embryo output following IVF with different semen donors and may represent a useful tool to select the most suitable donors for semen cryopreservation. PMID:19900288
Do Lower-Body Dimensions and Body Composition Explain Vertical Jump Ability?
Caia, Johnpaul; Weiss, Lawrence W; Chiu, Loren Z F; Schilling, Brian K; Paquette, Max R; Relyea, George E
2016-11-01
Caia, J, Weiss, LW, Chiu, LZF, Schilling, BK, Paquette, MR, and Relyea, GE. Do lower-body dimensions and body composition explain vertical jump ability? J Strength Cond Res 30(11): 3073-3083, 2016-Vertical jump (VJ) capability is integral to the level of success attained by individuals participating in numerous sport and physical activities. Knowledge of factors related to jump performance may help with talent identification and/or optimizing training prescription. Although myriad variables are likely related to VJ, this study focused on determining if various lower-body dimensions and/or body composition would explain some of the variability in performance. Selected anthropometric dimensions were obtained from 50 university students (25 men and 25 women) on 2 occasions separated by 48 or 72 hours. Estimated body fat percentage (BF%), height, body weight, hip width, pelvic width, bilateral quadriceps angle (Q-angle), and bilateral longitudinal dimensions of the feet, leg, thigh, and lower limb were obtained. Additionally, participants completed countermovement VJs. Analysis showed BF% to have the highest correlation with countermovement VJ displacement (r = -0.76, p < 0.001). When examining lower-body dimensions, right-side Q-angle displayed the strongest association with countermovement VJ displacement (r = -0.58, p < 0.001). Regression analysis revealed that 2 different pairs of variables accounted for the greatest variation (66%) in VJ: (a) BF% and sex and (b) BF% and body weight. Regression models involving BF% and lower-body dimensions explained up to 61% of the variance observed in VJ. Although the variance explained by BF% may be increased by using several lower-body dimensions, either sex identification or body weight explains comparatively more. Therefore, these data suggest that the lower-body dimensions measured herein have limited utility in explaining VJ performance.
NASA Astrophysics Data System (ADS)
Beriro, D. J.; Abrahart, R. J.; Nathanail, C. P.
2012-04-01
Data-driven modelling is most commonly used to develop predictive models that will simulate natural processes. This paper, in contrast, uses Gene Expression Programming (GEP) to construct two alternative models of different pan evaporation estimations by means of symbolic regression: a simulator, a model of a real-world process developed on observed records, and an emulator, an imitator of some other model developed on predicted outputs calculated by that source model. The solutions are compared and contrasted for the purposes of determining whether any substantial differences exist between either option. This analysis will address recent arguments over the impact of using downloaded hydrological modelling datasets originating from different initial sources i.e. observed or calculated. These differences can be easily be overlooked by modellers, resulting in a model of a model developed on estimations derived from deterministic empirical equations and producing exceptionally high goodness-of-fit. This paper uses different lines-of-evidence to evaluate model output and in so doing paves the way for a new protocol in machine learning applications. Transparent modelling tools such as symbolic regression offer huge potential for explaining stochastic processes, however, the basic tenets of data quality and recourse to first principles with regard to problem understanding should not be trivialised. GEP is found to be an effective tool for the prediction of observed and calculated pan evaporation, with results supported by an understanding of the records, and of the natural processes concerned, evaluated using one-at-a-time response function sensitivity analysis. The results show that both architectures and response functions are very similar, implying that previously observed differences in goodness-of-fit can be explained by whether models are applied to observed or calculated data.
Hughes, James P.; Haley, Danielle F.; Frew, Paula M.; Golin, Carol E.; Adimora, Adaora A; Kuo, Irene; Justman, Jessica; Soto-Torres, Lydia; Wang, Jing; Hodder, Sally
2015-01-01
Purpose Reductions in risk behaviors are common following enrollment in HIV prevention studies. We develop methods to quantify the proportion of change in risk behaviors that can be attributed to regression to the mean versus study participation and other factors. Methods A novel model that incorporates both regression to the mean and study participation effects is developed for binary measures. The model is used to estimate the proportion of change in the prevalence of “unprotected sex in the past 6 months” that can be attributed to study participation versus regression to the mean in a longitudinal cohort of women at risk for HIV infection who were recruited from ten US communities with high rates of HIV and poverty. HIV risk behaviors were evaluated using audio computer-assisted self-interviews at baseline and every 6 months for up to 12 months. Results The prevalence of “unprotected sex in the past 6 months” declined from 96% at baseline to 77% at 12 months. However, this change could be almost completely explained by regression to the mean. Conclusions Analyses that examine changes over time in cohorts selected for high or low risk behaviors should account for regression to the mean effects. PMID:25883065
Rice, Nigel; Dixon, Paul; Lloyd, David C E F; Roberts, David
2000-01-01
Objective To develop a weighted capitation formula for setting target allocations for prescribing expenditures for health authorities and primary care groups in England. Design Regression analysis relating prescribing costs to the demographic, morbidity, and mortality composition of practice lists. Setting 8500 general practices in England. Subjects Data from the 1991 census were attributed to practice lists on the basis of the place of residence of the practice population. Main outcome measures Variation in age, sex, and temporary resident originated prescribing units (ASTRO(97)-PUs) adjusted net ingredient cost of general practices in England for 1997-8 modelled for the impact of health and social needs after controlling for differences in supply. Results A needs gradient based on the four variables: permanent sickness, percentage of dependants in no carer households, percentage of students, and percentage of births on practice lists. These, together with supply characteristics, explained 41% of variation in prescribing costs per ASTRO(97)-PU adjusted capita across practices. The latter alone explained about 35% of variation in total costs per head across practices. Conclusions The model has good statistical specification and contains intuitively plausible needs drivers of prescribing expenditure. Together with adjustments made for differences in ASTRO(97)-PUs the model is capable of explaining 62% (35%+0.65% (41%)) of variation in prescribing expenditure at practice level. The results of the study have formed the basis for setting target budgets for 1999-2000 allocations for prescribing expenditure for health authorities and primary care groups. PMID:10650026
Montaño, Daniel E; Kasprzyk, Danuta; Hamilton, Deven T; Tshimanga, Mufuta; Gorn, Gerald
2014-05-01
Male circumcision (MC) reduces HIV acquisition among men, leading WHO/UNAIDS to recommend a goal to circumcise 80 % of men in high HIV prevalence countries. Significant investment to increase MC capacity in priority countries was made, yet only 5 % of the goal has been achieved in Zimbabwe. The integrated behavioral model (IBM) was used as a framework to investigate the factors affecting MC motivation among men in Zimbabwe. A survey instrument was designed based on elicitation study results, and administered to a representative household-based sample of 1,201 men aged 18-30 from two urban and two rural areas in Zimbabwe. Multiple regression analysis found all five IBM constructs significantly explained MC Intention. Nearly all beliefs underlying the IBM constructs were significantly correlated with MC Intention. Stepwise regression analysis of beliefs underlying each construct respectively found that 13 behavioral beliefs, 5 normative beliefs, 4 descriptive norm beliefs, 6 efficacy beliefs, and 10 control beliefs were significant in explaining MC Intention. A final stepwise regression of the five sets of significant IBM construct beliefs identified 14 key beliefs that best explain Intention. Similar analyses were carried out with subgroups of men by urban-rural and age. Different sets of behavioral, normative, efficacy, and control beliefs were significant for each sub-group, suggesting communication messages need to be targeted to be most effective for sub-groups. Implications for the design of effective MC demand creation messages are discussed. This study demonstrates the application of theory-driven research to identify evidence-based targets for intervention messages to increase men's motivation to get circumcised and thereby improve demand for male circumcision.
Li, Siyue; Zhang, Quanfa
2011-06-15
Water samples were collected for determination of dissolved trace metals in 56 sampling sites throughout the upper Han River, China. Multivariate statistical analyses including correlation analysis, stepwise multiple linear regression models, and principal component and factor analysis (PCA/FA) were employed to examine the land use influences on trace metals, and a receptor model of factor analysis-multiple linear regression (FA-MLR) was used for source identification/apportionment of anthropogenic heavy metals in the surface water of the River. Our results revealed that land use was an important factor in water metals in the snow melt flow period and land use in the riparian zone was not a better predictor of metals than land use away from the river. Urbanization in a watershed and vegetation along river networks could better explain metals, and agriculture, regardless of its relative location, however slightly explained metal variables in the upper Han River. FA-MLR analysis identified five source types of metals, and mining, fossil fuel combustion, and vehicle exhaust were the dominant pollutions in the surface waters. The results demonstrated great impacts of human activities on metal concentrations in the subtropical river of China. Copyright © 2011 Elsevier B.V. All rights reserved.
Coleman, Christopher Lance
2016-12-01
The purpose of this descriptive correlational study was to describe predictors of depressive symptoms among N=70 seropositive Botswana men and women residing in Gaborne, Botswana. A demographic questionnaire, the Center for Epidemiologic Studies Depression Scale, (CESD-D), and the SF-36 [Quality of life] were administered. The questionnaires were translated and back translated in Setswana and administered by Batswana men and women. The results of the regression analyses resulted in two calculated models. In the first Model energy/fatigue explained 46% of the variance in depressive symptoms (P=.000), and in the second Model energy/fatigue and role limitations on emotional well-being explained 50% of the variance in depressive symptoms respectively. The study findings underscore the need for mental health services for seropositive Batswana men and women. Copyright © 2016 Elsevier Inc. All rights reserved.
The Effect of QBO on the Total Mass Density
NASA Astrophysics Data System (ADS)
Saǧır, Selçuk; Atıcı, Ramazan
2016-07-01
The relationship between Quasi-Biennial Oscillation (QBO) measured at 10 hPa altitude and total mass density (TMD) values obtained from NRLMSIS-00 model for 90 km altitude of ionosphere known as Mesosphere-Lower Thermosphere (MLT) region is statistically investigated. For this study, multiple-regression model is used. To see the effect on TMD of QBO directions, Dummy variables are also added to model. In the result of calculations, it is observed that QBO is effected on TMD. It is determined that 69% of variations at TMD can be explainable by QBO. It is determined that the explainable ratio is at the rate of 5%. Also, it is seen that an increase/a decrease of 1 meter per second at QBO give rise to an increase/a decrease of 7,2x10-4 g/cm3 at TMD.
Marchandeau, S; Bertagnoli, S; Peralta, B; Boucraut-Baralon, C; Letty, J; Reitz, F
2004-11-06
Serological data on myxoma virus, rabbit haemorrhagic disease (RHD) virus and RHD-like viruses in juvenile rabbits (Oryctolagus cuniculus) trapped in 1995, 1996 and 1997 in two areas of France were analysed. For each disease, the effects of bodyweight, year, month and seropositivity for the other disease were modelled by using logistic regressions. In one area, a model including RHD seropositivity was selected to explain the myxoma virus seropositivity. Models including myxoma virus seropositivity were selected to explain the RHD seropositivity in both areas, and the odds of a rabbit being seropositive to both viruses were 5.1 and 8.4 times higher than the odds of a rabbit being seronegative to myxoma virus and seropositive to RHD. The year and bodyweight had significant effects for myxomatosis in one area and for RHD in both areas.
Spatial interpolation schemes of daily precipitation for hydrologic modeling
Hwang, Y.; Clark, M.R.; Rajagopalan, B.; Leavesley, G.
2012-01-01
Distributed hydrologic models typically require spatial estimates of precipitation interpolated from sparsely located observational points to the specific grid points. We compare and contrast the performance of regression-based statistical methods for the spatial estimation of precipitation in two hydrologically different basins and confirmed that widely used regression-based estimation schemes fail to describe the realistic spatial variability of daily precipitation field. The methods assessed are: (1) inverse distance weighted average; (2) multiple linear regression (MLR); (3) climatological MLR; and (4) locally weighted polynomial regression (LWP). In order to improve the performance of the interpolations, the authors propose a two-step regression technique for effective daily precipitation estimation. In this simple two-step estimation process, precipitation occurrence is first generated via a logistic regression model before estimate the amount of precipitation separately on wet days. This process generated the precipitation occurrence, amount, and spatial correlation effectively. A distributed hydrologic model (PRMS) was used for the impact analysis in daily time step simulation. Multiple simulations suggested noticeable differences between the input alternatives generated by three different interpolation schemes. Differences are shown in overall simulation error against the observations, degree of explained variability, and seasonal volumes. Simulated streamflows also showed different characteristics in mean, maximum, minimum, and peak flows. Given the same parameter optimization technique, LWP input showed least streamflow error in Alapaha basin and CMLR input showed least error (still very close to LWP) in Animas basin. All of the two-step interpolation inputs resulted in lower streamflow error compared to the directly interpolated inputs. ?? 2011 Springer-Verlag.
Hüls, Anke; Frömke, Cornelia; Ickstadt, Katja; Hille, Katja; Hering, Johanna; von Münchhausen, Christiane; Hartmann, Maria; Kreienbrock, Lothar
2017-01-01
Antimicrobial resistance in livestock is a matter of general concern. To develop hygiene measures and methods for resistance prevention and control, epidemiological studies on a population level are needed to detect factors associated with antimicrobial resistance in livestock holdings. In general, regression models are used to describe these relationships between environmental factors and resistance outcome. Besides the study design, the correlation structures of the different outcomes of antibiotic resistance and structural zero measurements on the resistance outcome as well as on the exposure side are challenges for the epidemiological model building process. The use of appropriate regression models that acknowledge these complexities is essential to assure valid epidemiological interpretations. The aims of this paper are (i) to explain the model building process comparing several competing models for count data (negative binomial model, quasi-Poisson model, zero-inflated model, and hurdle model) and (ii) to compare these models using data from a cross-sectional study on antibiotic resistance in animal husbandry. These goals are essential to evaluate which model is most suitable to identify potential prevention measures. The dataset used as an example in our analyses was generated initially to study the prevalence and associated factors for the appearance of cefotaxime-resistant Escherichia coli in 48 German fattening pig farms. For each farm, the outcome was the count of samples with resistant bacteria. There was almost no overdispersion and only moderate evidence of excess zeros in the data. Our analyses show that it is essential to evaluate regression models in studies analyzing the relationship between environmental factors and antibiotic resistances in livestock. After model comparison based on evaluation of model predictions, Akaike information criterion, and Pearson residuals, here the hurdle model was judged to be the most appropriate model. PMID:28620609
Sandborgh, Maria; Johansson, Ann-Christin; Söderlund, Anne
2016-01-01
In the fear-avoidance (FA) model social cognitive constructs could add to explaining the disabling process in whiplash associated disorder (WAD). The aim was to exemplify the possible input from Social Cognitive Theory on the FA model. Specifically the role of functional self-efficacy and perceived responses from a spouse/intimate partner was studied. A cross-sectional and correlational design was used. Data from 64 patients with acute WAD were used. Measures were pain intensity measured with a numerical rating scale, the Pain Disability Index, support, punishing responses, solicitous responses, and distracting responses subscales from the Multidimensional Pain Inventory, the Catastrophizing subscale from the Coping Strategies Questionnaire, the Tampa Scale of Kinesiophobia, and the Self-Efficacy Scale. Bivariate correlational, simple linear regression, and multiple regression analyses were used. In the statistical prediction models high pain intensity indicated high punishing responses, which indicated high catastrophizing. High catastrophizing indicated high fear of movement, which indicated low self-efficacy. Low self-efficacy indicated high disability, which indicated high pain intensity. All independent variables together explained 66.4% of the variance in pain disability, p < 0.001. Results suggest a possible link between one aspect of the social environment, perceived punishing responses from a spouse/intimate partner, pain intensity, and catastrophizing. Further, results support a mediating role of self-efficacy between fear of movement and disability in WAD.
Shanley, J.B.; Kamman, N.C.; Clair, T.A.; Chalmers, A.
2005-01-01
The physical factors controlling total mercury (HgT) and methylmercury (MeHg) concentrations in lakes and streams of northeastern USA were assessed in a regional data set containing 693 HgT and 385 corresponding MeHg concentrations in surface waters. Multiple regression models using watershed characteristics and climatic variables explained 38% or less of the variance in HgT and MeHg. Land cover percentages and soil permeability generally provided modest predictive power. Percent wetlands alone explained 19% of the variance in MeHg in streams at low-flow, and it was the only significant (p < 0.02) predictor for MeHg in lakes, albeit explaining only 7% of the variance. When stream discharge was added as a variable it became the dominant predictor for HgT in streams, improving the model r 2 from 0.19 to 0.38. Stream discharge improved the MeHg model more modestly, from r 2 of 0.25 to 0.33. Methylation efficiency (MeHg/HgT) was modeled well (r 2 of 0.78) when a seasonal term was incorporated (sine wave with annual period). Physical models explained 18% of the variance in fish Hg concentrations in 134 lakes and 55% in 20 reservoirs. Our results highlight the important role of seasonality and short-term hydrologic changes to the delivery of Hg to water bodies. ?? 2005 Springer Science+Business Media, Inc.
NASA Astrophysics Data System (ADS)
Bloomfield, J. P.; Allen, D. J.; Griffiths, K. J.
2009-06-01
SummaryLinear regression methods can be used to quantify geological controls on baseflow index (BFI). This is illustrated using an example from the Thames Basin, UK. Two approaches have been adopted. The areal extents of geological classes based on lithostratigraphic and hydrogeological classification schemes have been correlated with BFI for 44 'natural' catchments from the Thames Basin. When regression models are built using lithostratigraphic classes that include a constant term then the model is shown to have some physical meaning and the relative influence of the different geological classes on BFI can be quantified. For example, the regression constants for two such models, 0.64 and 0.69, are consistent with the mean observed BFI (0.65) for the Thames Basin, and the signs and relative magnitudes of the regression coefficients for each of the lithostratigraphic classes are consistent with the hydrogeology of the Basin. In addition, regression coefficients for the lithostratigraphic classes scale linearly with estimates of log 10 hydraulic conductivity for each lithological class. When a regression is built using a hydrogeological classification scheme with no constant term, the model does not have any physical meaning, but it has a relatively high adjusted R2 value and because of the continuous coverage of the hydrogeological classification scheme, the model can be used for predictive purposes. A model calibrated on the 44 'natural' catchments and using four hydrogeological classes (low-permeability surficial deposits, consolidated aquitards, fractured aquifers and intergranular aquifers) is shown to perform as well as a model based on a hydrology of soil types (BFIHOST) scheme in predicting BFI in the Thames Basin. Validation of this model using 110 other 'variably impacted' catchments in the Basin shows that there is a correlation between modelled and observed BFI. Where the observed BFI is significantly higher than modelled BFI the deviations can be explained by an exogenous factor, catchment urban area. It is inferred that this is may be due influences from sewage discharge, mains leakage, and leakage from septic tanks.
Shifren, Kim; Anzaldi, Kristen
2018-01-01
The investigation of the relation of positive personality characteristics to mental and physical health among Stroke survivors has been a neglected area of research. The purpose of this study was to examine the relationship between optimism, well-being, depressive symptoms, and perceived physical health among Stroke survivors. It was hypothesized that Stroke survivors' optimism would explain variance in their physical health above and beyond the variance explained by demographic variables, diagnostic variables, and mental health. One hundred seventy-six Stroke survivors (97 females, 79 males) completed the Revised Life Orientation Test, the Center for Epidemiological Studies Depression Scale, two items on perceived physical health from the 36-item Short Form of the Medical Outcomes study, and the Identity scale of the Illness Perception Questionnaire. Pearson correlations, hierarchical regression analyses, and the PROCESS approach to determining mediators were used to assess hypothesized relations between variables. Stroke survivors' level of optimism explained additional variance in overall health in regression models controlling for demographic and diagnostic variables, and mental health. Analyses revealed that optimism played a partial mediator role between mental health (well-being, depressive symptoms and total score on CES-D) variables and overall health.
Partitioning sources of variation in vertebrate species richness
Boone, R.B.; Krohn, W.B.
2000-01-01
Aim: To explore biogeographic patterns of terrestrial vertebrates in Maine, USA using techniques that would describe local and spatial correlations with the environment. Location: Maine, USA. Methods: We delineated the ranges within Maine (86,156 km2) of 275 species using literature and expert review. Ranges were combined into species richness maps, and compared to geomorphology, climate, and woody plant distributions. Methods were adapted that compared richness of all vertebrate classes to each environmental correlate, rather than assessing a single explanatory theory. We partitioned variation in species richness into components using tree and multiple linear regression. Methods were used that allowed for useful comparisons between tree and linear regression results. For both methods we partitioned variation into broad-scale (spatially autocorrelated) and fine-scale (spatially uncorrelated) explained and unexplained components. By partitioning variance, and using both tree and linear regression in analyses, we explored the degree of variation in species richness for each vertebrate group that Could be explained by the relative contribution of each environmental variable. Results: In tree regression, climate variation explained richness better (92% of mean deviance explained for all species) than woody plant variation (87%) and geomorphology (86%). Reptiles were highly correlated with environmental variation (93%), followed by mammals, amphibians, and birds (each with 84-82% deviance explained). In multiple linear regression, climate was most closely associated with total vertebrate richness (78%), followed by woody plants (67%) and geomorphology (56%). Again, reptiles were closely correlated with the environment (95%), followed by mammals (73%), amphibians (63%) and birds (57%). Main conclusions: Comparing variation explained using tree and multiple linear regression quantified the importance of nonlinear relationships and local interactions between species richness and environmental variation, identifying the importance of linear relationships between reptiles and the environment, and nonlinear relationships between birds and woody plants, for example. Conservation planners should capture climatic variation in broad-scale designs; temperatures may shift during climate change, but the underlying correlations between the environment and species richness will presumably remain.
Klijs, Bart; Kibele, Eva U B; Ellwardt, Lea; Zuidersma, Marij; Stolk, Ronald P; Wittek, Rafael P M; Mendes de Leon, Carlos M; Smidt, Nynke
2016-08-11
Previous studies are inconclusive on whether poor socioeconomic conditions in the neighborhood are associated with major depressive disorder. Furthermore, conceptual models that relate neighborhood conditions to depressive disorder have not been evaluated using empirical data. In this study, we investigated whether neighborhood income is associated with major depressive episodes. We evaluated three conceptual models. Conceptual model 1: The association between neighborhood income and major depressive episodes is explained by diseases, lifestyle factors, stress and social participation. Conceptual model 2: A low individual income relative to the mean income in the neighborhood is associated with major depressive episodes. Conceptual model 3: A high income of the neighborhood buffers the effect of a low individual income on major depressive disorder. We used adult baseline data from the LifeLines Cohort Study (N = 71,058) linked with data on the participants' neighborhoods from Statistics Netherlands. The current presence of a major depressive episode was assessed using the MINI neuropsychiatric interview. The association between neighborhood income and major depressive episodes was assessed using a mixed effect logistic regression model adjusted for age, sex, marital status, education and individual (equalized) income. This regression model was sequentially adjusted for lifestyle factors, chronic diseases, stress, and social participation to evaluate conceptual model 1. To evaluate conceptual models 2 and 3, an interaction term for neighborhood income*individual income was included. Multivariate regression analysis showed that a low neighborhood income is associated with major depressive episodes (OR (95 % CI): 0.82 (0.73;0.93)). Adjustment for diseases, lifestyle factors, stress, and social participation attenuated this association (ORs (95 % CI): 0.90 (0.79;1.01)). Low individual income was also associated with major depressive episodes (OR (95 % CI): 0.72 (0.68;0.76)). The interaction of individual income*neighborhood income on major depressive episodes was not significant (p = 0.173). Living in a low-income neighborhood is associated with major depressive episodes. Our results suggest that this association is partly explained by chronic diseases, lifestyle factors, stress and poor social participation, and thereby partly confirm conceptual model 1. Our results do not support conceptual model 2 and 3.
Thomson, James R; Kimmerer, Wim J; Brown, Larry R; Newman, Ken B; Mac Nally, Ralph; Bennett, William A; Feyrer, Frederick; Fleishman, Erica
2010-07-01
We examined trends in abundance of four pelagic fish species (delta smelt, longfin smelt, striped bass, and threadfin shad) in the upper San Francisco Estuary, California, USA, over 40 years using Bayesian change point models. Change point models identify times of abrupt or unusual changes in absolute abundance (step changes) or in rates of change in abundance (trend changes). We coupled Bayesian model selection with linear regression splines to identify biotic or abiotic covariates with the strongest associations with abundances of each species. We then refitted change point models conditional on the selected covariates to explore whether those covariates could explain statistical trends or change points in species abundances. We also fitted a multispecies change point model that identified change points common to all species. All models included hierarchical structures to model data uncertainties, including observation errors and missing covariate values. There were step declines in abundances of all four species in the early 2000s, with a likely common decline in 2002. Abiotic variables, including water clarity, position of the 2 per thousand isohaline (X2), and the volume of freshwater exported from the estuary, explained some variation in species' abundances over the time series, but no selected covariates could explain statistically the post-2000 change points for any species.
Yalcin, Semra; Leroux, Shawn James
2018-04-14
Land-cover and climate change are two main drivers of changes in species ranges. Yet, the majority of studies investigating the impacts of global change on biodiversity focus on one global change driver and usually use simulations to project biodiversity responses to future conditions. We conduct an empirical test of the relative and combined effects of land-cover and climate change on species occurrence changes. Specifically, we examine whether observed local colonization and extinctions of North American birds between 1981-1985 and 2001-2005 are correlated with land-cover and climate change and whether bird life history and ecological traits explain interspecific variation in observed occurrence changes. We fit logistic regression models to test the impact of physical land-cover change, changes in net primary productivity, winter precipitation, mean summer temperature, and mean winter temperature on the probability of Ontario breeding bird local colonization and extinction. Models with climate change, land-cover change, and the combination of these two drivers were the top ranked models of local colonization for 30%, 27%, and 29% of species, respectively. Conversely, models with climate change, land-cover change, and the combination of these two drivers were the top ranked models of local extinction for 61%, 7%, and 9% of species, respectively. The quantitative impacts of land-cover and climate change variables also vary among bird species. We then fit linear regression models to test whether the variation in regional colonization and extinction rate could be explained by mean body mass, migratory strategy, and habitat preference of birds. Overall, species traits were weakly correlated with heterogeneity in species occurrence changes. We provide empirical evidence showing that land-cover change, climate change, and the combination of multiple global change drivers can differentially explain observed species local colonization and extinction. © 2018 John Wiley & Sons Ltd.
A crash-prediction model for multilane roads.
Caliendo, Ciro; Guida, Maurizio; Parisi, Alessandra
2007-07-01
Considerable research has been carried out in recent years to establish relationships between crashes and traffic flow, geometric infrastructure characteristics and environmental factors for two-lane rural roads. Crash-prediction models focused on multilane rural roads, however, have rarely been investigated. In addition, most research has paid but little attention to the safety effects of variables such as stopping sight distance and pavement surface characteristics. Moreover, the statistical approaches have generally included Poisson and Negative Binomial regression models, whilst Negative Multinomial regression model has been used to a lesser extent. Finally, as far as the authors are aware, prediction models involving all the above-mentioned factors have still not been developed in Italy for multilane roads, such as motorways. Thus, in this paper crash-prediction models for a four-lane median-divided Italian motorway were set up on the basis of accident data observed during a 5-year monitoring period extending between 1999 and 2003. The Poisson, Negative Binomial and Negative Multinomial regression models, applied separately to tangents and curves, were used to model the frequency of accident occurrence. Model parameters were estimated by the Maximum Likelihood Method, and the Generalized Likelihood Ratio Test was applied to detect the significant variables to be included in the model equation. Goodness-of-fit was measured by means of both the explained fraction of total variation and the explained fraction of systematic variation. The Cumulative Residuals Method was also used to test the adequacy of a regression model throughout the range of each variable. The candidate set of explanatory variables was: length (L), curvature (1/R), annual average daily traffic (AADT), sight distance (SD), side friction coefficient (SFC), longitudinal slope (LS) and the presence of a junction (J). Separate prediction models for total crashes and for fatal and injury crashes only were considered. For curves it is shown that significant variables are L, 1/R and AADT, whereas for tangents they are L, AADT and junctions. The effect of rain precipitation was analysed on the basis of hourly rainfall data and assumptions about drying time. It is shown that a wet pavement significantly increases the number of crashes. The models developed in this paper for Italian motorways appear to be useful for many applications such as the detection of critical factors, the estimation of accident reduction due to infrastructure and pavement improvement, and the predictions of accidents counts when comparing different design options. Thus this research may represent a point of reference for engineers in adjusting or designing multilane roads.
Health Service Access across Racial/Ethnic Groups of Children in the Child Welfare System
ERIC Educational Resources Information Center
Wells, Rebecca; Hillemeier, Marianne M.; Bai, Yu; Belue, Rhonda
2009-01-01
Objective: This study examined health service access among children of different racial/ethnic groups in the child welfare system in an attempt to identify and explain disparities. Methods: Data were from the National Survey of Child and Adolescent Well-Being (NSCAW). N for descriptive statistics = 2,505. N for multiple regression model = 537.…
ERIC Educational Resources Information Center
Haataja, Anne; Ahtola, Annarilla; Poskiparta, Elisa; Salmivalli, Christina
2015-01-01
The present study provides a person-centered view on teachers' adherence to the KiVa antibullying curriculum over a school year. Factor mixture modeling was used to examine how teachers (N = 282) differed in their implementation profiles and multinomial logistic regression was used to identify factors related to these profiles. On the basis of…
Exploring the Ups and Downs of Mathematics Engagement in the Middle Years of School
ERIC Educational Resources Information Center
Martin, Andrew J.; Way, Jennifer; Bobis, Janette; Anderson, Judy
2015-01-01
This study of 1,601 students in the middle years of schooling (Grades 5-8, each student measured twice, 1 year apart) from 200 classrooms in 44 schools sought to identify factors explaining gains and declines in mathematics engagement at key transition points. In multilevel regression modeling, findings showed that compared with Grade 6 students…
ERIC Educational Resources Information Center
Folds, Lea D.; Tanner, C. Kenneth
2014-01-01
The purpose of this study was to analyze the relations among socioeconomic status, highest-level mathematics course, absenteeism, student mobility and measures of work readiness of high school seniors in Georgia. Study participants were 476 high school seniors in one Georgia county. The full regression model explained 27.5% of the variance in…
Case-mix groups for VA hospital-based home care.
Smith, M E; Baker, C R; Branch, L G; Walls, R C; Grimes, R M; Karklins, J M; Kashner, M; Burrage, R; Parks, A; Rogers, P
1992-01-01
The purpose of this study is to group hospital-based home care (HBHC) patients homogeneously by their characteristics with respect to cost of care to develop alternative case mix methods for management and reimbursement (allocation) purposes. Six Veterans Affairs (VA) HBHC programs in Fiscal Year (FY) 1986 that maximized patient, program, and regional variation were selected, all of which agreed to participate. All HBHC patients active in each program on October 1, 1987, in addition to all new admissions through September 30, 1988 (FY88), comprised the sample of 874 unique patients. Statistical methods include the use of classification and regression trees (CART software: Statistical Software; Lafayette, CA), analysis of variance, and multiple linear regression techniques. The resulting algorithm is a three-factor model that explains 20% of the cost variance (R2 = 20%, with a cross validation R2 of 12%). Similar classifications such as the RUG-II, which is utilized for VA nursing home and intermediate care, the VA outpatient resource allocation model, and the RUG-HHC, utilized in some states for reimbursing home health care in the private sector, explained less of the cost variance and, therefore, are less adequate for VA home care resource allocation.
Use of multilevel logistic regression to identify the causes of differential item functioning.
Balluerka, Nekane; Gorostiaga, Arantxa; Gómez-Benito, Juana; Hidalgo, María Dolores
2010-11-01
Given that a key function of tests is to serve as evaluation instruments and for decision making in the fields of psychology and education, the possibility that some of their items may show differential behaviour is a major concern for psychometricians. In recent decades, important progress has been made as regards the efficacy of techniques designed to detect this differential item functioning (DIF). However, the findings are scant when it comes to explaining its causes. The present study addresses this problem from the perspective of multilevel analysis. Starting from a case study in the area of transcultural comparisons, multilevel logistic regression is used: 1) to identify the item characteristics associated with the presence of DIF; 2) to estimate the proportion of variation in the DIF coefficients that is explained by these characteristics; and 3) to evaluate alternative explanations of the DIF by comparing the explanatory power or fit of different sequential models. The comparison of these models confirmed one of the two alternatives (familiarity with the stimulus) and rejected the other (the topic area) as being a cause of differential functioning with respect to the compared groups.
Social determinants of childhood asthma symptoms: an ecological study in urban Latin America.
Fattore, Gisel L; Santos, Carlos A T; Barreto, Mauricio L
2014-04-01
Asthma is an important public health problem in urban Latin America. This study aimed to analyze the role of socioeconomic and environmental factors as potential determinants of asthma symptoms prevalence in children from Latin American (LA) urban centers. We selected 31 LA urban centers with complete data, and an ecological analysis was performed. According to our theoretical framework, the explanatory variables were classified in three levels: distal, intermediate, and proximate. The association between variables in the three levels and prevalence of asthma symptoms was examined by bivariate and multivariate linear regression analysis weighed by sample size. In a second stage, we fitted several linear regression models introducing sequentially the variables according to the predefined hierarchy. In the final hierarchical model Gini Index, crowding, sanitation, variation in infant mortality rates and homicide rates, explained great part of the variance in asthma prevalence between centers (R(2) = 75.0 %). We found a strong association between socioeconomic and environmental variables and prevalence of asthma symptoms in LA urban children, and according to our hierarchical framework and the results found we suggest that social inequalities (measured by the Gini Index) is a central determinant to explain high prevalence of asthma in LA.
Vinken, Kasper; Vogels, Rufin
2017-11-20
In predictive coding theory, the brain is conceptualized as a prediction machine that constantly constructs and updates expectations of the sensory environment [1]. In the context of this theory, Bell et al.[2] recently studied the effect of the probability of task-relevant stimuli on the activity of macaque inferior temporal (IT) neurons and observed a reduced population response to expected faces in face-selective neurons. They concluded that "IT neurons encode long-term, latent probabilistic information about stimulus occurrence", supporting predictive coding. They manipulated expectation by the frequency of face versus fruit stimuli in blocks of trials. With such a design, stimulus repetition is confounded with expectation. As previous studies showed that IT neurons decrease their response with repetition [3], such adaptation (or repetition suppression), instead of expectation suppression as assumed by the authors, could explain their effects. The authors attempted to control for this alternative interpretation with a multiple regression approach. Here we show by using simulation that adaptation can still masquerade as expectation effects reported in [2]. Further, the results from the regression model used for most analyses cannot be trusted, because the model is not uniquely defined. Copyright © 2017 Elsevier Ltd. All rights reserved.
Sources of Variability in Physical Activity Among Inactive People with Multiple Sclerosis.
Uszynski, Marcin K; Herring, Matthew P; Casey, Blathin; Hayes, Sara; Gallagher, Stephen; Motl, Robert W; Coote, Susan
2018-04-01
Evidence supports that physical activity (PA) improves symptoms of multiple sclerosis (MS). Although application of principles from Social Cognitive Theory (SCT) may facilitate positive changes in PA behaviour among people with multiple sclerosis (pwMS), the constructs often explain limited variance in PA. This study investigated the extent to which MS symptoms, including fatigue, depression, and walking limitations combined with the SCT constructs, explained more variance in PA than SCT constructs alone among pwMS. Baseline data, including objectively assessed PA, exercise self-efficacy, goal setting, outcome expectations, 6-min walk test, fatigue and depression, from 65 participants of the Step It Up randomized controlled trial completed in Ireland (2016), were included. Multiple regression models quantified variance explained in PA and independent associations of (1) SCT constructs, (2) symptoms and (3) SCT constructs and symptoms. Model 1 included exercise self-efficacy, exercise goal setting and multidimensional outcomes expectations for exercise and explained ~14% of the variance in PA (R 2 =0.144, p < 0.05). Model 2 included walking limitations, fatigue and depression and explained 20% of the variance in PA (R 2 =0.196, p < 0.01). Model 3 combined models 1 and 2 and explained variance increased to ~29% (R 2 =0.288; p<0.01). In Model 3, exercise self-efficacy (β=0.30, p < 0.05), walking limitations (β=0.32, p < 0.01), fatigue (β = -0.41, p < 0.01) and depression (β = 0.34, p < 0.05) were significantly and independently associated with PA. Findings suggest that relevant MS symptoms improved by PA, including fatigue, depression and walking limitations, and SCT constructs together explained more variance in PA than SCT constructs alone, providing support for targeting both SCT constructs and these symptoms in the multifactorial promotion of PA among pwMS.
Cuffney, T.F.; Kashuba, R.; Qian, S.S.; Alameddine, I.; Cha, Y.K.; Lee, B.; Coles, J.F.; McMahon, G.
2011-01-01
Multilevel hierarchical regression was used to examine regional patterns in the responses of benthic macroinvertebrates and algae to urbanization across 9 metropolitan areas of the conterminous USA. Linear regressions established that responses (intercepts and slopes) to urbanization of invertebrates and algae varied among metropolitan areas. Multilevel hierarchical regression models were able to explain these differences on the basis of region-scale predictors. Regional differences in the type of land cover (agriculture or forest) being converted to urban and climatic factors (precipitation and air temperature) accounted for the differences in the response of macroinvertebrates to urbanization based on ordination scores, total richness, Ephemeroptera, Plecoptera, Trichoptera richness, and average tolerance. Regional differences in climate and antecedent agriculture also accounted for differences in the responses of salt-tolerant diatoms, but differences in the responses of other diatom metrics (% eutraphenic, % sensitive, and % silt tolerant) were best explained by regional differences in soils (mean % clay soils). The effects of urbanization were most readily detected in regions where forest lands were being converted to urban land because agricultural development significantly degraded assemblages before urbanization and made detection of urban effects difficult. The effects of climatic factors (temperature, precipitation) on background conditions (biogeographic differences) and rates of response to urbanization were most apparent after accounting for the effects of agricultural development. The effects of climate and land cover on responses to urbanization provide strong evidence that monitoring, mitigation, and restoration efforts must be tailored for specific regions and that attainment goals (background conditions) may not be possible in regions with high levels of prior disturbance (e.g., agricultural development). ?? 2011 by The North American Benthological Society.
Nonlinear-regression groundwater flow modeling of a deep regional aquifer system
Cooley, Richard L.; Konikow, Leonard F.; Naff, Richard L.
1986-01-01
A nonlinear regression groundwater flow model, based on a Galerkin finite-element discretization, was used to analyze steady state two-dimensional groundwater flow in the areally extensive Madison aquifer in a 75,000 mi2 area of the Northern Great Plains. Regression parameters estimated include intrinsic permeabilities of the main aquifer and separate lineament zones, discharges from eight major springs surrounding the Black Hills, and specified heads on the model boundaries. Aquifer thickness and temperature variations were included as specified functions. The regression model was applied using sequential F testing so that the fewest number and simplest zonation of intrinsic permeabilities, combined with the simplest overall model, were evaluated initially; additional complexities (such as subdivisions of zones and variations in temperature and thickness) were added in stages to evaluate the subsequent degree of improvement in the model results. It was found that only the eight major springs, a single main aquifer intrinsic permeability, two separate lineament intrinsic permeabilities of much smaller values, and temperature variations are warranted by the observed data (hydraulic heads and prior information on some parameters) for inclusion in a model that attempts to explain significant controls on groundwater flow. Addition of thickness variations did not significantly improve model results; however, thickness variations were included in the final model because they are fairly well defined. Effects on the observed head distribution from other features, such as vertical leakage and regional variations in intrinsic permeability, apparently were overshadowed by measurement errors in the observed heads. Estimates of the parameters correspond well to estimates obtained from other independent sources.
Nonlinear-Regression Groundwater Flow Modeling of a Deep Regional Aquifer System
NASA Astrophysics Data System (ADS)
Cooley, Richard L.; Konikow, Leonard F.; Naff, Richard L.
1986-12-01
A nonlinear regression groundwater flow model, based on a Galerkin finite-element discretization, was used to analyze steady state two-dimensional groundwater flow in the areally extensive Madison aquifer in a 75,000 mi2 area of the Northern Great Plains. Regression parameters estimated include intrinsic permeabilities of the main aquifer and separate lineament zones, discharges from eight major springs surrounding the Black Hills, and specified heads on the model boundaries. Aquifer thickness and temperature variations were included as specified functions. The regression model was applied using sequential F testing so that the fewest number and simplest zonation of intrinsic permeabilities, combined with the simplest overall model, were evaluated initially; additional complexities (such as subdivisions of zones and variations in temperature and thickness) were added in stages to evaluate the subsequent degree of improvement in the model results. It was found that only the eight major springs, a single main aquifer intrinsic permeability, two separate lineament intrinsic permeabilities of much smaller values, and temperature variations are warranted by the observed data (hydraulic heads and prior information on some parameters) for inclusion in a model that attempts to explain significant controls on groundwater flow. Addition of thickness variations did not significantly improve model results; however, thickness variations were included in the final model because they are fairly well defined. Effects on the observed head distribution from other features, such as vertical leakage and regional variations in intrinsic permeability, apparently were overshadowed by measurement errors in the observed heads. Estimates of the parameters correspond well to estimates obtained from other independent sources.
Casero-Alonso, V; López-Fidalgo, J; Torsney, B
2017-01-01
Binary response models are used in many real applications. For these models the Fisher information matrix (FIM) is proportional to the FIM of a weighted simple linear regression model. The same is also true when the weight function has a finite integral. Thus, optimal designs for one binary model are also optimal for the corresponding weighted linear regression model. The main objective of this paper is to provide a tool for the construction of MV-optimal designs, minimizing the maximum of the variances of the estimates, for a general design space. MV-optimality is a potentially difficult criterion because of its nondifferentiability at equal variance designs. A methodology for obtaining MV-optimal designs where the design space is a compact interval [a, b] will be given for several standard weight functions. The methodology will allow us to build a user-friendly computer tool based on Mathematica to compute MV-optimal designs. Some illustrative examples will show a representation of MV-optimal designs in the Euclidean plane, taking a and b as the axes. The applet will be explained using two relevant models. In the first one the case of a weighted linear regression model is considered, where the weight function is directly chosen from a typical family. In the second example a binary response model is assumed, where the probability of the outcome is given by a typical probability distribution. Practitioners can use the provided applet to identify the solution and to know the exact support points and design weights. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Assari, Shervin; Lankarani, Maryam Moghani
2015-01-01
Background: This study explored cross-country differences in how multi-morbidity explains the effects of socioeconomic characteristics on self-rated health. Methods: The study borrowed data from the Research on Early Life and Aging Trends and Effects. Participants were 44,530 individuals (age > 65 years) who were sampled from 15 countries (i.e. United States, China, India, Russia, Costa Rica, Puerto Rico, Mexico, Argentina, Barbados, Brazil, Chile, Cuba, Uruguay, Ghana and South Africa). Multi-morbidity was measured as number of chronic medical conditions. In Model I, main effects of socioeconomic factors on self-rated health were calculated using country-specific logistic regressions. In Model II, number of chronic conditions were also added to the models to find changes in coefficients for demographic and socioeconomic factors. Results: In the United States, number of chronic medical conditions explained the effect of income on subjective health. In Puerto Rico, number of chronic medical conditions explained the effect of marital status on subjective health. In Costa Rica, Argentina, Barbados, Cuba, and Uruguay, number of chronic medical conditions explained gender disparities in subjective health. In China, Mexico, Brazil, Russia, Chile, India, Ghana and South Africa, number of chronic medical conditions did not explain the effect of demographic or socioeconomic factors on subjective health. Conclusions: Multi-morbidity explains the effect of demographic and socioeconomic factors on subjective health in some but not other countries. Further research is needed. PMID:26445632
Tay, Cheryl Sihui; Sterzing, Thorsten; Lim, Chen Yen; Ding, Rui; Kong, Pui Wah
2017-05-01
This study examined (a) the strength of four individual footwear perception factors to influence the overall preference of running shoes and (b) whether these perception factors satisfied the nonmulticollinear assumption in a regression model. Running footwear must fulfill multiple functional criteria to satisfy its potential users. Footwear perception factors, such as fit and cushioning, are commonly used to guide shoe design and development, but it is unclear whether running-footwear users are able to differentiate one factor from another. One hundred casual runners assessed four running shoes on a 15-cm visual analogue scale for four footwear perception factors (fit, cushioning, arch support, and stability) as well as for overall preference during a treadmill running protocol. Diagnostic tests showed an absence of multicollinearity between factors, where values for tolerance ranged from .36 to .72, corresponding to variance inflation factors of 2.8 to 1.4. The multiple regression model of these four footwear perception variables accounted for 77.7% to 81.6% of variance in overall preference, with each factor explaining a unique part of the total variance. Casual runners were able to rate each footwear perception factor separately, thus assigning each factor a true potential to improve overall preference for the users. The results also support the use of a multiple regression model of footwear perception factors to predict overall running shoe preference. Regression modeling is a useful tool for running-shoe manufacturers to more precisely evaluate how individual factors contribute to the subjective assessment of running footwear.
Physiology-Based Modeling May Predict Surgical Treatment Outcome for Obstructive Sleep Apnea
Li, Yanru; Ye, Jingying; Han, Demin; Cao, Xin; Ding, Xiu; Zhang, Yuhuan; Xu, Wen; Orr, Jeremy; Jen, Rachel; Sands, Scott; Malhotra, Atul; Owens, Robert
2017-01-01
Study Objectives: To test whether the integration of both anatomical and nonanatomical parameters (ventilatory control, arousal threshold, muscle responsiveness) in a physiology-based model will improve the ability to predict outcomes after upper airway surgery for obstructive sleep apnea (OSA). Methods: In 31 patients who underwent upper airway surgery for OSA, loop gain and arousal threshold were calculated from preoperative polysomnography (PSG). Three models were compared: (1) a multiple regression based on an extensive list of PSG parameters alone; (2) a multivariate regression using PSG parameters plus PSG-derived estimates of loop gain, arousal threshold, and other trait surrogates; (3) a physiological model incorporating selected variables as surrogates of anatomical and nonanatomical traits important for OSA pathogenesis. Results: Although preoperative loop gain was positively correlated with postoperative apnea-hypopnea index (AHI) (P = .008) and arousal threshold was negatively correlated (P = .011), in both model 1 and 2, the only significant variable was preoperative AHI, which explained 42% of the variance in postoperative AHI. In contrast, the physiological model (model 3), which included AHIREM (anatomy term), fraction of events that were hypopnea (arousal term), the ratio of AHIREM and AHINREM (muscle responsiveness term), loop gain, and central/mixed apnea index (control of breathing terms), was able to explain 61% of the variance in postoperative AHI. Conclusions: Although loop gain and arousal threshold are associated with residual AHI after surgery, only preoperative AHI was predictive using multivariate regression modeling. Instead, incorporating selected surrogates of physiological traits on the basis of OSA pathophysiology created a model that has more association with actual residual AHI. Commentary: A commentary on this article appears in this issue on page 1023. Clinical Trial Registration: ClinicalTrials.Gov; Title: The Impact of Sleep Apnea Treatment on Physiology Traits in Chinese Patients With Obstructive Sleep Apnea; Identifier: NCT02696629; URL: https://clinicaltrials.gov/show/NCT02696629 Citation: Li Y, Ye J, Han D, Cao X, Ding X, Zhang Y, Xu W, Orr J, Jen R, Sands S, Malhotra A, Owens R. Physiology-based modeling may predict surgical treatment outcome for obstructive sleep apnea. J Clin Sleep Med. 2017;13(9):1029–1037. PMID:28818154
2013-01-01
Background Developing countries in South Asia, such as Bangladesh, bear a disproportionate burden of diarrhoeal diseases such as Cholera, Typhoid and Paratyphoid. These seem to be aggravated by a number of social and environmental factors such as lack of access to safe drinking water, overcrowdedness and poor hygiene brought about by poverty. Some socioeconomic data can be obtained from census data whilst others are more difficult to elucidate. This study considers a range of both census data and spatial data from other sources, including remote sensing, as potential predictors of typhoid risk. Typhoid data are aggregated from hospital admission records for the period from 2005 to 2009. The spatial and statistical structures of the data are analysed and Principal Axis Factoring is used to reduce the degree of co-linearity in the data. The resulting factors are combined into a Quality of Life index, which in turn is used in a regression model of typhoid occurrence and risk. Results The three Principal Factors used together explain 87% of the variance in the initial candidate predictors, which eminently qualifies them for use as a set of uncorrelated explanatory variables in a linear regression model. Initial regression result using Ordinary Least Squares (OLS) were disappointing, this was explainable by analysis of the spatial autocorrelation inherent in the Principal factors. The use of Geographically Weighted Regression caused a considerable increase in the predictive power of regressions based on these factors. The best prediction, determined by analysis of the Akaike Information Criterion (AIC) was found when the three factors were combined into a quality of life index, using a method previously published by others, and had a coefficient of determination of 73%. Conclusions The typhoid occurrence/risk prediction equation was used to develop the first risk map showing areas of Dhaka Metropolitan Area whose inhabitants are at greater or lesser risk of typhoid infection. This, coupled with seasonal information on typhoid incidence also reported in this paper, has the potential to advise public health professionals on developing prevention strategies such as targeted vaccination. PMID:23497202
Corner, Robert J; Dewan, Ashraf M; Hashizume, Masahiro
2013-03-16
Developing countries in South Asia, such as Bangladesh, bear a disproportionate burden of diarrhoeal diseases such as cholera, typhoid and paratyphoid. These seem to be aggravated by a number of social and environmental factors such as lack of access to safe drinking water, overcrowdedness and poor hygiene brought about by poverty. Some socioeconomic data can be obtained from census data whilst others are more difficult to elucidate. This study considers a range of both census data and spatial data from other sources, including remote sensing, as potential predictors of typhoid risk. Typhoid data are aggregated from hospital admission records for the period from 2005 to 2009. The spatial and statistical structures of the data are analysed and principal axis factoring is used to reduce the degree of co-linearity in the data. The resulting factors are combined into a quality of life index, which in turn is used in a regression model of typhoid occurrence and risk. The three principal factors used together explain 87% of the variance in the initial candidate predictors, which eminently qualifies them for use as a set of uncorrelated explanatory variables in a linear regression model. Initial regression result using ordinary least squares (OLS) were disappointing, this was explainable by analysis of the spatial autocorrelation inherent in the principal factors. The use of geographically weighted regression caused a considerable increase in the predictive power of regressions based on these factors. The best prediction, determined by analysis of the Akaike information criterion (AIC) was found when the three factors were combined into a quality of life index, using a method previously published by others, and had a coefficient of determination of 73%. The typhoid occurrence/risk prediction equation was used to develop the first risk map showing areas of Dhaka metropolitan area whose inhabitants are at greater or lesser risk of typhoid infection. This, coupled with seasonal information on typhoid incidence also reported in this paper, has the potential to advise public health professionals on developing prevention strategies such as targeted vaccination.
Sanchez, Margaux; Ambros, Albert; Milà, Carles; Salmon, Maëlle; Balakrishnan, Kalpana; Sambandam, Sankar; Sreekanth, V; Marshall, Julian D; Tonne, Cathryn
2018-09-01
Land-use regression (LUR) has been used to model local spatial variability of particulate matter in cities of high-income countries. Performance of LUR models is unknown in less urbanized areas of low-/middle-income countries (LMICs) experiencing complex sources of ambient air pollution and which typically have limited land use data. To address these concerns, we developed LUR models using satellite imagery (e.g., vegetation, urbanicity) and manually-collected data from a comprehensive built-environment survey (e.g., roads, industries, non-residential places) for a peri-urban area outside Hyderabad, India. As part of the CHAI (Cardiovascular Health effects of Air pollution in Telangana, India) project, concentrations of fine particulate matter (PM 2.5 ) and black carbon were measured over two seasons at 23 sites. Annual mean (sd) was 34.1 (3.2) μg/m 3 for PM 2.5 and 2.7 (0.5) μg/m 3 for black carbon. The LUR model for annual black carbon explained 78% of total variance and included both local-scale (energy supply places) and regional-scale (roads) predictors. Explained variance was 58% for annual PM 2.5 and the included predictors were only regional (urbanicity, vegetation). During leave-one-out cross-validation and cross-holdout validation, only the black carbon model showed consistent performance. The LUR model for black carbon explained a substantial proportion of the spatial variability that could not be captured by simpler interpolation technique (ordinary kriging). This is the first study to develop a LUR model for ambient concentrations of PM 2.5 and black carbon in a non-urban area of LMICs, supporting the applicability of the LUR approach in such settings. Our results provide insights on the added value of manually-collected built-environment data to improve the performance of LUR models in settings with limited data availability. For both pollutants, LUR models predicted substantial within-village variability, an important feature for future epidemiological studies. Copyright © 2018 The Authors. Published by Elsevier B.V. All rights reserved.
Human impact on sediment fluxes within the Blue Nile and Atbara River basins
NASA Astrophysics Data System (ADS)
Balthazar, Vincent; Vanacker, Veerle; Girma, Atkilt; Poesen, Jean; Golla, Semunesh
2013-01-01
A regional assessment of the spatial variability in sediment yields allows filling the gap between detailed, process-based understanding of erosion at field scale and empirical sediment flux models at global scale. In this paper, we focus on the intrabasin variability in sediment yield within the Blue Nile and Atbara basins as biophysical and anthropogenic factors are presumably acting together to accelerate soil erosion. The Blue Nile and Atbara River systems are characterized by an important spatial variability in sediment fluxes, with area-specific sediment yield (SSY) values ranging between 4 and 4935 t/km2/y. Statistical analyses show that 41% of the observed variation in SSY can be explained by remote sensing proxy data of surface vegetation cover, rainfall intensity, mean annual temperature, and human impact. The comparison of a locally adapted regression model with global predictive sediment flux models indicates that global flux models such as the ART and BQART models are less suited to capture the spatial variability in area-specific sediment yields (SSY), but they are very efficient to predict absolute sediment yields (SY). We developed a modified version of the BQART model that estimates the human influence on sediment yield based on a high resolution composite measure of local human impact (human footprint index) instead of countrywide estimates of GNP/capita. Our modified version of the BQART is able to explain 80% of the observed variation in SY for the Blue Nile and Atbara basins and thereby performs only slightly less than locally adapted regression models.
Salazar, Edwin; Buitrago, Carolina; Molina, Federico; Alzate, Catalina Arango
2015-05-01
Determine the trend in mortality from external causes in pregnant and postpartum women and its relationship to socioeconomic factors. Descriptive study, based on the official registries of deaths reported by the National Statistics Agency, 1998-2010. The trend was analyzed using Poisson regressions. Bivariate correlations and multiple linear regression models were constructed to explore the relationship between mortality and socioeconomic factors: human development index, Gini index, gross domestic product, unsatisfied basic needs, unemployment rate, poverty, extreme poverty, quality of life index, illiteracy rate, and percentage of affiliation to the Social Security System. A total of 2 223 female deaths from external causes were recorded, of which 1 429 occurred during pregnancy and 794 in the postpartum period. The gross mortality rate dropped from 30.7 per 100 000 live births plus fetal deaths in 1998 to 16.7 in 2010. A downward curve with no significant inflection points was shown in the risk of dying from this cause. The multiple linear regression model showed a correlation between mortality and extreme poverty and the illiteracy rate, suggesting that these indicators could explain 89.4% of the change in mortality from external causes in pregnant and postpartum women each year in Colombia. Mortality from external causes in pregnant and postpartum women showed a significant downward trend that may be explained by important socioeconomic changes in the country, including a decrease in extreme poverty and in the illiteracy rate.
Pare, Guillaume; Mao, Shihong; Deng, Wei Q
2016-06-08
Despite considerable efforts, known genetic associations only explain a small fraction of predicted heritability. Regional associations combine information from multiple contiguous genetic variants and can improve variance explained at established association loci. However, regional associations are not easily amenable to estimation using summary association statistics because of sensitivity to linkage disequilibrium (LD). We now propose a novel method, LD Adjusted Regional Genetic Variance (LARGV), to estimate phenotypic variance explained by regional associations using summary statistics while accounting for LD. Our method is asymptotically equivalent to a multiple linear regression model when no interaction or haplotype effects are present. It has several applications, such as ranking of genetic regions according to variance explained or comparison of variance explained by two or more regions. Using height and BMI data from the Health Retirement Study (N = 7,776), we show that most genetic variance lies in a small proportion of the genome and that previously identified linkage peaks have higher than expected regional variance.
Effect of climatological factors on respiratory syncytial virus epidemics
NOYOLA, D. E.; MANDEVILLE, P. B.
2008-01-01
SUMMARY Respiratory syncytial virus (RSV) presents as yearly epidemics in temperate climates. We analysed the association of atmospheric conditions to RSV epidemics in San Luis Potosí, S.L.P., Mexico. The weekly number of RSV detections between October 2002 and May 2006 were correlated to ambient temperature, barometric pressure, relative humidity, vapour tension, dew point, precipitation, and hours of light using time-series and regression analyses. Of the variation in RSV cases, 49·8% was explained by the study variables. Of the explained variation in RSV cases, 32·5% was explained by the study week and 17·3% was explained by meteorological variables (average daily temperature, maximum daily temperature, temperature at 08:00 hours, and relative humidity at 08:00 hours). We concluded that atmospheric conditions, particularly temperature, partly explain the year to year variability in RSV activity. Identification of additional factors that affect RSV seasonality may help develop a model to predict the onset of RSV epidemics. PMID:18177520
Pare, Guillaume; Mao, Shihong; Deng, Wei Q.
2016-01-01
Despite considerable efforts, known genetic associations only explain a small fraction of predicted heritability. Regional associations combine information from multiple contiguous genetic variants and can improve variance explained at established association loci. However, regional associations are not easily amenable to estimation using summary association statistics because of sensitivity to linkage disequilibrium (LD). We now propose a novel method, LD Adjusted Regional Genetic Variance (LARGV), to estimate phenotypic variance explained by regional associations using summary statistics while accounting for LD. Our method is asymptotically equivalent to a multiple linear regression model when no interaction or haplotype effects are present. It has several applications, such as ranking of genetic regions according to variance explained or comparison of variance explained by two or more regions. Using height and BMI data from the Health Retirement Study (N = 7,776), we show that most genetic variance lies in a small proportion of the genome and that previously identified linkage peaks have higher than expected regional variance. PMID:27273519
A case-mix classification system for explaining healthcare costs using administrative data in Italy.
Corti, Maria Chiara; Avossa, Francesco; Schievano, Elena; Gallina, Pietro; Ferroni, Eliana; Alba, Natalia; Dotto, Matilde; Basso, Cristina; Netti, Silvia Tiozzo; Fedeli, Ugo; Mantoan, Domenico
2018-03-04
The Italian National Health Service (NHS) provides universal coverage to all citizens, granting primary and hospital care with a copayment system for outpatient and drug services. Financing of Local Health Trusts (LHTs) is based on a capitation system adjusted only for age, gender and area of residence. We applied a risk-adjustment system (Johns Hopkins Adjusted Clinical Groups System, ACG® System) in order to explain health care costs using routinely collected administrative data in the Veneto Region (North-eastern Italy). All residents in the Veneto Region were included in the study. The ACG system was applied to classify the regional population based on the following information sources for the year 2015: Hospital Discharges, Emergency Room visits, Chronic disease registry for copayment exemptions, ambulatory visits, medications, the Home care database, and drug prescriptions. Simple linear regressions were used to contrast an age-gender model to models incorporating more comprehensive risk measures aimed at predicting health care costs. A simple age-gender model explained only 8% of the variance of 2015 total costs. Adding diagnoses-related variables provided a 23% increase, while pharmacy based variables provided an additional 17% increase in explained variance. The adjusted R-squared of the comprehensive model was 6 times that of the simple age-gender model. ACG System provides substantial improvement in predicting health care costs when compared to simple age-gender adjustments. Aging itself is not the main determinant of the increase of health care costs, which is better explained by the accumulation of chronic conditions and the resulting multimorbidity. Copyright © 2018. Published by Elsevier B.V.
Ramsey, Elijah W.; Rangoonwala, Amina; Jones, Cathleen E.
2015-01-01
Empirical relationships between field-derived Leaf Area Index (LAI) and Leaf Angle Distribution (LAD) and polarimetric synthetic aperture radar (PolSAR) based biophysical indicators were created and applied to map S. alterniflora marsh canopy structure. PolSAR and field data were collected near concurrently in the summers of 2010, 2011, and 2012 in coastal marshes, and PolSAR data alone were acquired in 2009. Regression analyses showed that LAI correspondence with the PolSAR biophysical indicator variables equaled or exceeded those of vegetation water content (VWC) correspondences. In the final six regressor model, the ratio HV/VV explained 49% of the total 77% explained LAI variance, and the HH-VV coherence and phase information accounted for the remainder. HV/HH dominated the two regressor LAD relationship, and spatial heterogeneity and backscatter mechanism followed by coherence information dominated the final three regressor model that explained 74% of the LAD variance. Regression results applied to 2009 through 2012 PolSAR images showed substantial changes in marsh LAI and LAD. Although the direct cause was not substantiated, following a release of freshwater in response to the 2010 Deepwater Horizon oil spill, the fairly uniform interior marsh structure of 2009 was more vertical and dense shortly after the oil spill cessation. After 2010, marsh structure generally progressed back toward the 2009 uniformity; however, the trend was more disjointed in oil impact marshes.
Lunt, Mark
2015-07-01
In the first article in this series we explored the use of linear regression to predict an outcome variable from a number of predictive factors. It assumed that the predictive factors were measured on an interval scale. However, this article shows how categorical variables can also be included in a linear regression model, enabling predictions to be made separately for different groups and allowing for testing the hypothesis that the outcome differs between groups. The use of interaction terms to measure whether the effect of a particular predictor variable differs between groups is also explained. An alternative approach to testing the difference between groups of the effect of a given predictor, which consists of measuring the effect in each group separately and seeing whether the statistical significance differs between the groups, is shown to be misleading. © The Author 2013. Published by Oxford University Press on behalf of the British Society for Rheumatology. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Factor regression for interpreting genotype-environment interaction in bread-wheat trials.
Baril, C P
1992-05-01
The French INRA wheat (Triticum aestivum L. em Thell.) breeding program is based on multilocation trials to produce high-yielding, adapted lines for a wide range of environments. Differential genotypic responses to variable environment conditions limit the accuracy of yield estimations. Factor regression was used to partition the genotype-environment (GE) interaction into four biologically interpretable terms. Yield data were analyzed from 34 wheat genotypes grown in four environments using 12 auxiliary agronomic traits as genotypic and environmental covariates. Most of the GE interaction (91%) was explained by the combination of only three traits: 1,000-kernel weight, lodging susceptibility and spike length. These traits are easily measured in breeding programs, therefore factor regression model can provide a convenient and useful prediction method of yield.
Feng, Zhujing; Schilling, Keith E; Chan, Kung-Sik
2013-06-01
Nitrate-nitrogen concentrations in rivers represent challenges for water supplies that use surface water sources. Nitrate concentrations are often modeled using time-series approaches, but previous efforts have typically relied on monthly time steps. In this study, we developed a dynamic regression model of daily nitrate concentrations in the Raccoon River, Iowa, that incorporated contemporaneous and lags of precipitation and discharge occurring at several locations around the basin. Results suggested that 95 % of the variation in daily nitrate concentrations measured at the outlet of a large agricultural watershed can be explained by time-series patterns of precipitation and discharge occurring in the basin. Discharge was found to be a more important regression variable than precipitation in our model but both regression parameters were strongly correlated with nitrate concentrations. The time-series model was consistent with known patterns of nitrate behavior in the watershed, successfully identifying contemporaneous dilution mechanisms from higher relief and urban areas of the basin while incorporating the delayed contribution of nitrate from tile-drained regions in a lagged response. The first difference of the model errors were modeled as an AR(16) process and suggest that daily nitrate concentration changes remain temporally correlated for more than 2 weeks although temporal correlation was stronger in the first few days before tapering off. Consequently, daily nitrate concentrations are non-stationary, i.e. of strong memory. Using time-series models to reliably forecast daily nitrate concentrations in a river based on patterns of precipitation and discharge occurring in its basin may be of great interest to water suppliers.
NASA Astrophysics Data System (ADS)
Zhao, Ren-Yang; Magun, Andreas; Schanda, Erwin
1990-12-01
Results are reported from a correlation analysis for 57 microwave impulsive bursts observed at six frequencies. A regression line between the peak frequency and the corresponding rise time of microwave impulsive bursts is obtained, with a correlation coefficient of -0.43. This can be explained in the frame of a thermal model. The magnetic field decrease with height has to be much slower than in a dipole field in order to explain the weak dependence of f(p) on t(r). This decrease of magnetic field with height in burst sources is based on the relationship between f(p) and t(r) found by assuming a thermal flare model with a collisionless conduction front.
Smyczynska, Joanna; Hilczer, Maciej; Smyczynska, Urszula; Stawerska, Renata; Tadeusiewicz, Ryszard; Lewinski, Andrzej
2015-01-01
The leading method for prediction of growth hormone (GH) therapy effectiveness are multiple linear regression (MLR) models. Best of our knowledge, we are the first to apply artificial neural networks (ANN) to solve this problem. For ANN there is no necessity to assume the functions linking independent and dependent variables. The aim of study is to compare ANN and MLR models of GH therapy effectiveness. Analysis comprised the data of 245 GH-deficient children (170 boys) treated with GH up to final height (FH). Independent variables included: patients' height, pre-treatment height velocity, chronological age, bone age, gender, pubertal status, parental heights, GH peak in 2 stimulation tests, IGF-I concentration. The output variable was FH. For testing dataset, MLR model predicted FH SDS with average error (RMSE) 0.64 SD, explaining 34.3% of its variability; ANN model derived on the same pre-processed data predicted FH SDS with RMSE 0.60 SD, explaining 42.0% of its variability; ANN model derived on raw data predicted FH with RMSE 3.9 cm (0.63 SD), explaining 78.7% of its variability. ANN seem to be valuable tool in prediction of GH treatment effectiveness, especially since they can be applied to raw clinical data.
The effect of latitude on photoperiodic control of gonadal maturation, regression and molt in birds.
Dawson, Alistair
2013-09-01
Photoperiod is the major cue used by birds to time breeding seasons and molt. However, the annual cycle in photoperiod changes with latitude. Within species, for temperate and high latitude species, gonadal maturation and breeding start earlier at lower latitudes but regression and molt both occur at similar times at different latitudes. Earlier gonadal maturation can be explained simply by the fact that considerable maturation occurs before the equinox when photoperiod is longer at lower latitudes - genetic differences between populations are not necessary to explain earlier breeding at lower latitudes. Gonadal regression is caused either by absolute photorefractoriness or, in some species with long breeding seasons, relative photorefractoriness. In either case, the timing of regression and molt cannot be explained by absolute prevailing photoperiod or rate of change in photoperiod - birds appear to be using more subtle cues from the pattern of change in photoperiod. However, there may be no difference between absolute and relative photorefractory species in how they utilise the annual cycle in photoperiod to time regression. Copyright © 2013 Elsevier Inc. All rights reserved.
A refined method for multivariate meta-analysis and meta-regression
Jackson, Daniel; Riley, Richard D
2014-01-01
Making inferences about the average treatment effect using the random effects model for meta-analysis is problematic in the common situation where there is a small number of studies. This is because estimates of the between-study variance are not precise enough to accurately apply the conventional methods for testing and deriving a confidence interval for the average effect. We have found that a refined method for univariate meta-analysis, which applies a scaling factor to the estimated effects’ standard error, provides more accurate inference. We explain how to extend this method to the multivariate scenario and show that our proposal for refined multivariate meta-analysis and meta-regression can provide more accurate inferences than the more conventional approach. We explain how our proposed approach can be implemented using standard output from multivariate meta-analysis software packages and apply our methodology to two real examples. © 2013 The Authors. Statistics in Medicine published by John Wiley & Sons, Ltd. PMID:23996351
Reddy, Bhargava K; Delen, Dursun; Agrawal, Rupesh K
2018-01-01
Crohn's disease is among the chronic inflammatory bowel diseases that impact the gastrointestinal tract. Understanding and predicting the severity of inflammation in real-time settings is critical to disease management. Extant literature has primarily focused on studies that are conducted in clinical trial settings to investigate the impact of a drug treatment on the remission status of the disease. This research proposes an analytics methodology where three different types of prediction models are developed to predict and to explain the severity of inflammation in patients diagnosed with Crohn's disease. The results show that machine-learning-based analytic methods such as gradient boosting machines can predict the inflammation severity with a very high accuracy (area under the curve = 92.82%), followed by regularized regression and logistic regression. According to the findings, a combination of baseline laboratory parameters, patient demographic characteristics, and disease location are among the strongest predictors of inflammation severity in Crohn's disease patients.
Purewal, Rebecca; Fisher, Peter L
2018-02-01
Anxiety and depression are highly prevalent in people with diabetes (PwD). The most widely used psychological model to explain anxiety and depression in PwD is the Common-Sense Model, which gives a central role to illness perceptions. The Self-Regulatory Executive Function (S-REF) model proposes metacognitive beliefs are key to understanding the development and maintenance of emotional disorders. To test the potential utility of the S-REF model in PwD, the study explored if metacognitive beliefs explained additional variance in anxiety and depression after controlling for demographic and illness perceptions. 614 adults with either Type 1 (n = 335) or Type 2 (n = 279) diabetes participated in a cross sectional online survey. All participants completed questionnaires on anxiety, depression, illness perceptions and metacognitive beliefs. Regression analyses showed that metacognitive beliefs were associated with anxiety and depression in PwD and explained additional variance in both anxiety and depression after controlling for demographics and illness perceptions. This is the first study to demonstrate that metacognitive beliefs are associated with anxiety and depression in PwD. The clinical implications of the study are illustrated. Crown Copyright © 2017. Published by Elsevier B.V. All rights reserved.
Combustion performance and scale effect from N2O/HTPB hybrid rocket motor simulations
NASA Astrophysics Data System (ADS)
Shan, Fanli; Hou, Lingyun; Piao, Ying
2013-04-01
HRM code for the simulation of N2O/HTPB hybrid rocket motor operation and scale effect analysis has been developed. This code can be used to calculate motor thrust and distributions of physical properties inside the combustion chamber and nozzle during the operational phase by solving the unsteady Navier-Stokes equations using a corrected compressible difference scheme and a two-step, five species combustion model. A dynamic fuel surface regression technique and a two-step calculation method together with the gas-solid coupling are applied in the calculation of fuel regression and the determination of combustion chamber wall profile as fuel regresses. Both the calculated motor thrust from start-up to shut-down mode and the combustion chamber wall profile after motor operation are in good agreements with experimental data. The fuel regression rate equation and the relation between fuel regression rate and axial distance have been derived. Analysis of results suggests improvements in combustion performance to the current hybrid rocket motor design and explains scale effects in the variation of fuel regression rate with combustion chamber diameter.
González Costa, J J; Reigosa, M J; Matías, J M; Covelo, E F
2017-09-01
The aim of this study was to model the sorption and retention of Cd, Cu, Ni, Pb and Zn in soils. To that extent, the sorption and retention of these metals were studied and the soil characterization was performed separately. Multiple stepwise regression was used to produce multivariate models with linear techniques and with support vector machines, all of which included 15 explanatory variables characterizing soils. When the R-squared values are represented, two different groups are noticed. Cr, Cu and Pb sorption and retention show a higher R-squared; the most explanatory variables being humified organic matter, Al oxides and, in some cases, cation-exchange capacity (CEC). The other group of metals (Cd, Ni and Zn) shows a lower R-squared, and clays are the most explanatory variables, including a percentage of vermiculite and slime. In some cases, quartz, plagioclase or hematite percentages also show some explanatory capacity. Support Vector Machine (SVM) regression shows that the different models are not as regular as in multiple regression in terms of number of variables, the regression for nickel adsorption being the one with the highest number of variables in its optimal model. On the other hand, there are cases where the most explanatory variables are the same for two metals, as it happens with Cd and Cr adsorption. A similar adsorption mechanism is thus postulated. These patterns of the introduction of variables in the model allow us to create explainability sequences. Those which are the most similar to the selectivity sequences obtained by Covelo (2005) are Mn oxides in multiple regression and change capacity in SVM. Among all the variables, the only one that is explanatory for all the metals after applying the maximum parsimony principle is the percentage of sand in the retention process. In the competitive model arising from the aforementioned sequences, the most intense competitiveness for the adsorption and retention of different metals appears between Cr and Cd, Cu and Zn in multiple regression; and between Cr and Cd in SVM regression. Copyright © 2017 Elsevier B.V. All rights reserved.
Genetic prediction of type 2 diabetes using deep neural network.
Kim, J; Kim, J; Kwak, M J; Bajaj, M
2018-04-01
Type 2 diabetes (T2DM) has strong heritability but genetic models to explain heritability have been challenging. We tested deep neural network (DNN) to predict T2DM using the nested case-control study of Nurses' Health Study (3326 females, 45.6% T2DM) and Health Professionals Follow-up Study (2502 males, 46.5% T2DM). We selected 96, 214, 399, and 678 single-nucleotide polymorphism (SNPs) through Fisher's exact test and L1-penalized logistic regression. We split each dataset randomly in 4:1 to train prediction models and test their performance. DNN and logistic regressions showed better area under the curve (AUC) of ROC curves than the clinical model when 399 or more SNPs included. DNN was superior than logistic regressions in AUC with 399 or more SNPs in male and 678 SNPs in female. Addition of clinical factors consistently increased AUC of DNN but failed to improve logistic regressions with 214 or more SNPs. In conclusion, we show that DNN can be a versatile tool to predict T2DM incorporating large numbers of SNPs and clinical information. Limitations include a relatively small number of the subjects mostly of European ethnicity. Further studies are warranted to confirm and improve performance of genetic prediction models using DNN in different ethnic groups. © 2017 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Ribaroff, G A; Wastnedge, E; Drake, A J; Sharpe, R M; Chambers, T J G
2017-06-01
Animal models of maternal high fat diet (HFD) demonstrate perturbed offspring metabolism although the effects differ markedly between models. We assessed studies investigating metabolic parameters in the offspring of HFD fed mothers to identify factors explaining these inter-study differences. A total of 171 papers were identified, which provided data from 6047 offspring. Data were extracted regarding body weight, adiposity, glucose homeostasis and lipidaemia. Information regarding the macronutrient content of diet, species, time point of exposure and gestational weight gain were collected and utilized in meta-regression models to explore predictive factors. Publication bias was assessed using Egger's regression test. Maternal HFD exposure did not affect offspring birthweight but increased weaning weight, final bodyweight, adiposity, triglyceridaemia, cholesterolaemia and insulinaemia in both female and male offspring. Hyperglycaemia was found in female offspring only. Meta-regression analysis identified lactational HFD exposure as a key moderator. The fat content of the diet did not correlate with any outcomes. There was evidence of significant publication bias for all outcomes except birthweight. Maternal HFD exposure was associated with perturbed metabolism in offspring but between studies was not accounted for by dietary constituents, species, strain or maternal gestational weight gain. Specific weaknesses in experimental design predispose many of the results to bias. © 2017 The Authors. Obesity Reviews published by John Wiley & Sons Ltd on behalf of World Obesity Federation.
Roh, Soonhee; Burnette, Catherine E; Lee, Kyoung Hag; Lee, Yeon-Shim; Martin, James I; Lawler, Michael J
2017-01-01
American Indian (AI) older adults are vulnerable to mental health disparities, yet very little is known about the factors associated with help-seeking for mental health services among them. The purpose of this study was to investigate the utility of Andersen's Behavioral Model in explaining AI older adults' help-seeking attitudes toward professional mental health services. Hierarchical regression analysis was used to examine predisposing, enabling, and need variables as predictors of help-seeking attitudes toward mental health services in a sample of 233 AI older adults from the Midwest. The model was found to have limited utility in the context of older AI help-seeking attitudes, as the proportion of explained variance was low. Gender, perceived stigma, social support, and physical health were significant predictors, whereas age, perceived mental health, and health insurance were not. © The Author(s) 2014.
Exploring public databases to characterize urban flood risks in Amsterdam
NASA Astrophysics Data System (ADS)
Gaitan, Santiago; ten Veldhuis, Marie-claire; van de Giesen, Nick
2015-04-01
Cities worldwide are challenged by increasing urban flood risks. Precise and realistic measures are required to decide upon investment to reduce their impacts. Obvious flooding factors affecting flood risk include sewer systems performance and urban topography. However, currently implemented sewer and topographic models do not provide realistic predictions of local flooding occurrence during heavy rain events. Assessing other factors such as spatially distributed rainfall and socioeconomic characteristics may help to explain probability and impacts of urban flooding. Several public databases were analyzed: complaints about flooding made by citizens, rainfall depths (15 min and 100 Ha spatio-temporal resolution), grids describing number of inhabitants, income, and housing price (1Ha and 25Ha resolution); and buildings age. Data analysis was done using Python and GIS programming, and included spatial indexing of data, cluster analysis, and multivariate regression on the complaints. Complaints were used as a proxy to characterize flooding impacts. The cluster analysis, run for all the variables except the complaints, grouped part of the grid-cells of central Amsterdam into a highly differentiated group, covering 10% of the analyzed area, and accounting for 25% of registered complaints. The configuration of the analyzed variables in central Amsterdam coincides with a high complaint count. Remaining complaints were evenly dispersed along other groups. An adjusted R2 of 0.38 in the multivariate regression suggests that explaining power can improve if additional variables are considered. While rainfall intensity explained 4% of the incidence of complaints, population density and building age significantly explained around 20% each. Data mining of public databases proved to be a valuable tool to identify factors explaining variability in occurrence of urban pluvial flooding, though additional variables must be considered to fully explain flood risk variability.
Analyzing industrial energy use through ordinary least squares regression models
NASA Astrophysics Data System (ADS)
Golden, Allyson Katherine
Extensive research has been performed using regression analysis and calibrated simulations to create baseline energy consumption models for residential buildings and commercial institutions. However, few attempts have been made to discuss the applicability of these methodologies to establish baseline energy consumption models for industrial manufacturing facilities. In the few studies of industrial facilities, the presented linear change-point and degree-day regression analyses illustrate ideal cases. It follows that there is a need in the established literature to discuss the methodologies and to determine their applicability for establishing baseline energy consumption models of industrial manufacturing facilities. The thesis determines the effectiveness of simple inverse linear statistical regression models when establishing baseline energy consumption models for industrial manufacturing facilities. Ordinary least squares change-point and degree-day regression methods are used to create baseline energy consumption models for nine different case studies of industrial manufacturing facilities located in the southeastern United States. The influence of ambient dry-bulb temperature and production on total facility energy consumption is observed. The energy consumption behavior of industrial manufacturing facilities is only sometimes sufficiently explained by temperature, production, or a combination of the two variables. This thesis also provides methods for generating baseline energy models that are straightforward and accessible to anyone in the industrial manufacturing community. The methods outlined in this thesis may be easily replicated by anyone that possesses basic spreadsheet software and general knowledge of the relationship between energy consumption and weather, production, or other influential variables. With the help of simple inverse linear regression models, industrial manufacturing facilities may better understand their energy consumption and production behavior, and identify opportunities for energy and cost savings. This thesis study also utilizes change-point and degree-day baseline energy models to disaggregate facility annual energy consumption into separate industrial end-user categories. The baseline energy model provides a suitable and economical alternative to sub-metering individual manufacturing equipment. One case study describes the conjoined use of baseline energy models and facility information gathered during a one-day onsite visit to perform an end-point energy analysis of an injection molding facility conducted by the Alabama Industrial Assessment Center. Applying baseline regression model results to the end-point energy analysis allowed the AIAC to better approximate the annual energy consumption of the facility's HVAC system.
Applying the Expectancy-Value Model to understand health values.
Zhang, Xu-Hao; Xie, Feng; Wee, Hwee-Lin; Thumboo, Julian; Li, Shu-Chuen
2008-03-01
Expectancy-Value Model (EVM) is the most structured model in psychology to predict attitudes by measuring attitudinal attributes (AAs) and relevant external variables. Because health value could be categorized as attitude, we aimed to apply EVM to explore its usefulness in explaining variances in health values and investigate underlying factors. Focus group discussion was carried out to identify the most common and significant AAs toward 5 different health states (coded as 11111, 11121, 21221, 32323, and 33333 in EuroQol Five-Dimension (EQ-5D) descriptive system). AAs were measured in a sum of multiplications of subjective probability (expectancy) and perceived value of attributes with 7-point Likert scales. Health values were measured using visual analog scales (VAS, range 0-1). External variables (age, sex, ethnicity, education, housing, marital status, and concurrent chronic diseases) were also incorporated into survey questionnaire distributed by convenience sampling among eligible respondents. Univariate analyses were used to identify external variables causing significant differences in VAS. Multiple linear regression model (MLR) and hierarchical regression model were used to investigate the explanatory power of AAs and possible significant external variable(s) separately or in combination, for each individual health state and a mixed scenario of five states, respectively. Four AAs were identified, namely, "worsening your quality of life in terms of health" (WQoL), "adding a burden to your family" (BTF), "making you less independent" (MLI) and "unable to work or study" (UWS). Data were analyzed based on 232 respondents (mean [SD] age: 27.7 [15.07] years, 49.1% female). Health values varied significantly across 5 health states, ranging from 0.12 (33333) to 0.97 (11111). With no significant external variables identified, EVM explained up to 62% of the variances in health values across 5 health states. The explanatory power of 4 AAs were found to be between 13% and 28% in separate MLR models (P < 0.05). When data were analyzed for each health state, variances in health values became small and explanatory power of EVM was reduced to a range between 8% and 23%. EVM was useful in explaining variances of health values and predicting important factors. Its power to explain small variances might be restricted due to limitations of 7-point Likert scale to measure AAs accurately. With further improvement and validation of a compatible continuous scale for more accurate measurement, EVM is expected to explain health values to a larger extent.
Solvency supervision based on a total balance sheet approach
NASA Astrophysics Data System (ADS)
Pitselis, Georgios
2009-11-01
In this paper we investigate the adequacy of the own funds a company requires in order to remain healthy and avoid insolvency. Two methods are applied here; the quantile regression method and the method of mixed effects models. Quantile regression is capable of providing a more complete statistical analysis of the stochastic relationship among random variables than least squares estimation. The estimated mixed effects line can be considered as an internal industry equation (norm), which explains a systematic relation between a dependent variable (such as own funds) with independent variables (e.g. financial characteristics, such as assets, provisions, etc.). The above two methods are implemented with two data sets.
ERIC Educational Resources Information Center
Floyd, Randy G.; McGrew, Kevin S.; Evans, Jeffrey J.
2008-01-01
This study examined the relative contributions of measures of Cattell-Horn-Carroll (CHC) cognitive abilities in explaining writing achievement. Drawing from samples that covered the age range of 7 to 18 years, simultaneous multiple regression was used to regress scores from the Woodcock-Johnson III (WJ III; Woodcock, McGrew, & Mather, 2001) that…
Use and misuse of motor-vehicle crash death rates in assessing highway-safety performance.
O'Neill, Brian; Kyrychenko, Sergey Y
2006-12-01
The objectives of the article are to assess the extent to which comparisons of motor-vehicle crash death rates can be used to determine the effectiveness of highway-safety policies over time in a country or to compare policy effectiveness across countries. Motor-vehicle crash death rates per mile traveled in the 50 U.S. states from 1980 to 2003 are used to show the influence on these rates of factors independent of highway-safety interventions. Multiple regression models relating state death rates to various measures related to urbanization and demographics are used. The analyses demonstrate strong relationships between state death rates and urbanization and demographics. Almost 60% of the variability among the state death rates can be explained by the independent variables in the multiple regression models. When the death rates for passenger vehicle occupants (i.e., excluding motorcycle, pedestrian, and other deaths) are used in the regression models, almost 70% of the variability in the rates can be explained by urbanization and demographics. The analyses presented in the article demonstrate that motor-vehicle crash death rates are strongly influenced by factors unrelated to highway-safety countermeasures. Overall death rates should not be used as a basis for judging the effectiveness (or ineffectiveness) of specific highway-safety countermeasures or to assess overall highway-safety policies, especially across jurisdictions. There can be no substitute for the use of carefully designed scientific evaluations of highway-safety interventions that use outcome measures directly related to the intervention; e.g., motorcyclist deaths should be used to assess the effectiveness of motorcycle helmet laws. While this may seem obvious, there are numerous examples in the literature of death rates from all crashes being used to assess the effectiveness of interventions aimed at specific subsets of crashes.
Predicting academic performance of medical students: the first three years.
Höschl, C; Kozený, J
1997-06-01
The purpose of this exploratory study was to identify a cluster of variables that would most economically explain variations in the grade point averages of medical students during the first 3 years of study. Data were derived from a study of 92 students admitted to the 3rd Faculty of Medicine in 1992-1993 academic year and who were still in the medical school at the end of the sixth semester (third year). Stepwise regression analysis was used to build models for predicting log-transformed changes in grade point average after six semesters of study-at the end of the first, second, and third years. Predictor variables were chosen from four domains: 1) high school grade point averages in physics, mathematics, and the Czech language over 4 years of study, 2) results of admission tests in biology, chemistry, and physics, 3) admission committee's assessment of the applicant's ability to reproduce a text, motivation to study medicine, and social maturity, and 4) scores on the sentimentality and attachment scales of the Tridimensional Personality Questionnaire. The regression model, which included performance in high school physics, results of the admission test in physics, assessment of the applicant's motivation to study medicine, and attachment scale score, accounted for 32% of the change in grade point average over six semesters of study. The regression models using the first-, second-, and third-year grade point averages as the dependent variables showed slightly decreasing amounts of explained variance toward the end of the third year of study and within domains, changing the structure of predictor variables. The results suggest that variables chosen from the assessment domains of high school performance, written entrance examination, admission interview, and personality traits may be significant predictors of academic success during the first 3 years of medical study.
The Roles of IL-6, IL-10, and IL-1RA in Obesity and Insulin Resistance in African-Americans
Doumatey, Ayo; Huang, Hanxia; Zhou, Jie; Chen, Guanjie; Shriner, Daniel; Adeyemo, Adebowale
2011-01-01
Objective: The aim of the study was to investigate the associations between IL-1 receptor antagonist (IL-1RA), IL-6, IL-10, measures of obesity, and insulin resistance in African-Americans. Research Design and Methods: Nondiabetic participants (n = 1025) of the Howard University Family Study were investigated for associations between serum IL (IL-1RA, IL-6, IL-10), measures of obesity, and insulin resistance, with adjustment for age and sex. Measures of obesity included body mass index, waist circumference, hip circumference, waist-to-hip ratio, and percent fat mass. Insulin resistance was assessed using the homeostasis model assessment of insulin resistance (HOMA-IR). Data were analyzed with R statistical software using linear regression and likelihood ratio tests. Results: IL-1RA and IL-6 were associated with measures of obesity and insulin resistance, explaining 4–12.7% of the variance observed (P values < 0.001). IL-1RA was bimodally distributed and therefore was analyzed based on grouping those with low vs. high IL-1RA levels. High IL-1RA explained up to 20 and 12% of the variance in measures of obesity and HOMA-IR, respectively. Among the IL, only high IL-1RA improved the fit of models regressing HOMA-IR on measures of obesity. In contrast, all measures of obesity improved the fit of models regressing HOMA-IR on IL. IL-10 was not associated with obesity measures or HOMA-IR. Conclusions: High IL-1RA levels and obesity measures are associated with HOMA-IR in this population-based sample of African-Americans. The results suggest that obesity and increased levels of IL-1RA both contribute to the development of insulin resistance. PMID:21956416
Determinants of single family residential water use across scales in four western US cities.
Chang, Heejun; Bonnette, Matthew Ryan; Stoker, Philip; Crow-Miller, Britt; Wentz, Elizabeth
2017-10-15
A growing body of literature examines urban water sustainability with increasing evidence that locally-based physical and social spatial interactions contribute to water use. These studies however are based on single-city analysis and often fail to consider whether these interactions occur more generally. We examine a multi-city comparison using a common set of spatially-explicit water, socioeconomic, and biophysical data. We investigate the relative importance of variables for explaining the variations of single family residential (SFR) water uses at Census Block Group (CBG) and Census Tract (CT) scales in four representative western US cities - Austin, Phoenix, Portland, and Salt Lake City, - which cover a wide range of climate and development density. We used both ordinary least squares regression and spatial error regression models to identify the influence of spatial dependence on water use patterns. Our results show that older downtown areas show lower water use than newer suburban areas in all four cities. Tax assessed value and building age are the main determinants of SFR water use across the four cities regardless of the scale. Impervious surface area becomes an important variable for summer water use in all cities, and it is important in all seasons for arid environments such as Phoenix. CT level analysis shows better model predictability than CBG analysis. In all cities, seasons, and spatial scales, spatial error regression models better explain the variations of SFR water use. Such a spatially-varying relationship of urban water consumption provides additional evidence for the need to integrate urban land use planning and municipal water planning. Copyright © 2017 Elsevier B.V. All rights reserved.
Hughes, James P; Haley, Danielle F; Frew, Paula M; Golin, Carol E; Adimora, Adaora A; Kuo, Irene; Justman, Jessica; Soto-Torres, Lydia; Wang, Jing; Hodder, Sally
2015-06-01
Reductions in risk behaviors are common following enrollment in human immunodeficiency virus (HIV) prevention studies. We develop methods to quantify the proportion of change in risk behaviors that can be attributed to regression to the mean versus study participation and other factors. A novel model that incorporates both regression to the mean and study participation effects is developed for binary measures. The model is used to estimate the proportion of change in the prevalence of "unprotected sex in the past 6 months" that can be attributed to study participation versus regression to the mean in a longitudinal cohort of women at risk for HIV infection who were recruited from ten U.S. communities with high rates of HIV and poverty. HIV risk behaviors were evaluated using audio computer-assisted self-interviews at baseline and every 6 months for up to 12 months. The prevalence of "unprotected sex in the past 6 months" declined from 96% at baseline to 77% at 12 months. However, this change could be almost completely explained by regression to the mean. Analyses that examine changes over time in cohorts selected for high- or low- risk behaviors should account for regression to the mean effects. Copyright © 2015 Elsevier Inc. All rights reserved.
Weissert, L F; Salmond, J A; Miskell, G; Alavi-Shoshtari, M; Williams, D E
2018-04-01
Land use regression (LUR) analysis has become a key method to explain air pollutant concentrations at unmeasured sites at city or country scales, but little is known about the applicability of LUR at microscales. We present a microscale LUR model developed for a heavy trafficked section of road in Auckland, New Zealand. We also test the within-city transferability of LUR models developed at different spatial scales (local scale and city scale). Nitrogen dioxide (NO 2 ) was measured during summer at 40 sites and a LUR model was developed based on standard criteria. The results showed that LUR models are able to capture the microscale variability with the model explaining 66% of the variability in NO 2 concentrations. Predictor variables identified at this scale were street width, distance to major road, presence of awnings and number of bus stops, with the latter three also being important determinants at the local scale. This highlights the importance of street and building configurations for individual exposure at the street level. However, within-city transferability was limited with the number of bus stops being the only significant predictor variable at all spatial scales and locations tested, indicating the strong influence of diesel emissions related to bus traffic. These findings show that air quality monitoring is necessary at a high spatial density within cities in capturing small-scale variability in NO 2 concentrations at the street level and assessing individual exposure to traffic related air pollutants. Copyright © 2017. Published by Elsevier B.V.
Explaining Match Outcome During The Men’s Basketball Tournament at The Olympic Games
Leicht, Anthony S.; Gómez, Miguel A.; Woods, Carl T.
2017-01-01
In preparation for the Olympics, there is a limited opportunity for coaches and athletes to interact regularly with team performance indicators providing important guidance to coaches for enhanced match success at the elite level. This study examined the relationship between match outcome and team performance indicators during men’s basketball tournaments at the Olympic Games. Twelve team performance indicators were collated from all men’s teams and matches during the basketball tournament of the 2004-2016 Olympic Games (n = 156). Linear and non-linear analyses examined the relationship between match outcome and team performance indicator characteristics; namely, binary logistic regression and a conditional interference (CI) classification tree. The most parsimonious logistic regression model retained ‘assists’, ‘defensive rebounds’, ‘field-goal percentage’, ‘fouls’, ‘fouls against’, ‘steals’ and ‘turnovers’ (delta AIC <0.01; Akaike weight = 0.28) with a classification accuracy of 85.5%. Conversely, four performance indicators were retained with the CI classification tree with an average classification accuracy of 81.4%. However, it was the combination of ‘field-goal percentage’ and ‘defensive rebounds’ that provided the greatest probability of winning (93.2%). Match outcome during the men’s basketball tournaments at the Olympic Games was identified by a unique combination of performance indicators. Despite the average model accuracy being marginally higher for the logistic regression analysis, the CI classification tree offered a greater practical utility for coaches through its resolution of non-linear phenomena to guide team success. Key points A unique combination of team performance indicators explained 93.2% of winning observations in men’s basketball at the Olympics. Monitoring of these team performance indicators may provide coaches with the capability to devise multiple game plans or strategies to enhance their likelihood of winning. Incorporation of machine learning techniques with team performance indicators may provide a valuable and strategic approach to explain patterns within multivariate datasets in sport science. PMID:29238245
Wilson, Andrew N; Dollman, James
2007-06-01
Understanding factors that influence physical activity levels of adolescents can assist the design of more effective interventions. Social support is a consistent correlate of youth physical activity but few studies have examined this in different cultural settings. Male adolescents (n=180, age=13.58+/-0.97 years) from a metropolitan single sex private school participated in this study. Habitual physical activity was estimated using the 3-day physical activity recall (3dPAR), and aspects of social support to be physically active using a specifically designed questionnaire. Comparisons were made between Anglo-Australians (n=118), whose parents were both born in Australia, and Vietnamese-Australians (n=62), whose parents were both born in Vietnam. There was a trend towards higher physical activity among Anglo-Australians, particularly on weekends. Anglo-Australians reported significantly more parental and peer support across most items pertaining to these constructs. Among the whole sample, social support variables explained 5-12% of the total explained variance in physical activity, with items pertaining to father and best friend support emerging as the strongest and most consistent predictors in multiple regression models. Among Anglo-Australians, the prediction models were relatively weak, explaining 0-9% of the total explained variance in physical activity. Prediction models for physical activity among Vietnamese-Australians were much stronger, explaining 11-32% of the total explained variance, with father's support variables contributing consistently to these models. The strong paternal influence on physical activity among Vietnamese-Australians needs to be confirmed in more diverse population groups, but results from this study suggest that interventions promoting physical activity among adolescent boys need to take into account cultural background as a moderator of widely reported social influences.
Chahine, Teresa; Schultz, Bradley D.; Zartarian, Valerie G.; Xue, Jianping; Subramanian, SV; Levy, Jonathan I.
2011-01-01
Community-based cumulative risk assessment requires characterization of exposures to multiple chemical and non-chemical stressors, with consideration of how the non-chemical stressors may influence risks from chemical stressors. Residential radon provides an interesting case example, given its large attributable risk, effect modification due to smoking, and significant variability in radon concentrations and smoking patterns. In spite of this fact, no study to date has estimated geographic and sociodemographic patterns of both radon and smoking in a manner that would allow for inclusion of radon in community-based cumulative risk assessment. In this study, we apply multi-level regression models to explain variability in radon based on housing characteristics and geological variables, and construct a regression model predicting housing characteristics using U.S. Census data. Multi-level regression models of smoking based on predictors common to the housing model allow us to link the exposures. We estimate county-average lifetime lung cancer risks from radon ranging from 0.15 to 1.8 in 100, with high-risk clusters in areas and for subpopulations with high predicted radon and smoking rates. Our findings demonstrate the viability of screening-level assessment to characterize patterns of lung cancer risk from radon, with an approach that can be generalized to multiple chemical and non-chemical stressors. PMID:22016710
Liu, Tianyin; Wong, Gloria Hy; Luo, Hao; Tang, Jennifer Ym; Xu, Jiaqi; Choy, Jacky Cp; Lum, Terry Ys
2017-05-02
Intact cognition is a key determinant of quality of life. Here, we investigated the relative contribution of age and physical frailty to global and everyday cognition in older adults. Data came from 1396 community-dwelling, healthy Chinese older adults aged 65 or above. We measured their global cognition using the Cantonese Chinese Montreal Cognitive Assessment, everyday cognition with the short Chinese Lawton Instrumental Activities Daily Living scale, and physical frailty using the Fatigue, Resistance, Ambulation, Illness, and Loss of Weight Scale and grip strength. Multiple regression analysis was used to evaluate the comparative roles of age and physical frailty. In the global cognition model, age explained 12% and physical frailty explained 8% of the unique variance. This pattern was only evident in women, while the reverse (physical frailty explains a greater extent of variance) was evident in men. In the everyday cognition model, physical frailty explained 18% and chronological age explained 9% of the unique variance, with similar results across both genders. Physical frailty is a stronger indicator than age for everyday cognition in both genders and for global cognition in men. Our findings suggest that there are alternative indexes of cognitive aging than chronological age.
Microhabitat and Climatic Niche Change Explain Patterns of Diversification among Frog Families.
Moen, Daniel S; Wiens, John J
2017-07-01
A major goal of ecology and evolutionary biology is to explain patterns of species richness among clades. Differences in rates of net diversification (speciation minus extinction over time) may often explain these patterns, but the factors that drive variation in diversification rates remain uncertain. Three important candidates are climatic niche position (e.g., whether clades are primarily temperate or tropical), rates of climatic niche change among species within clades, and microhabitat (e.g., aquatic, terrestrial, arboreal). The first two factors have been tested separately in several studies, but the relative importance of all three is largely unknown. Here we explore the correlates of diversification among families of frogs, which collectively represent ∼88% of amphibian species. We assemble and analyze data on phylogeny, climate, and microhabitat for thousands of species. We find that the best-fitting phylogenetic multiple regression model includes all three types of variables: microhabitat, rates of climatic niche change, and climatic niche position. This model explains 67% of the variation in diversification rates among frog families, with arboreal microhabitat explaining ∼31%, niche rates ∼25%, and climatic niche position ∼11%. Surprisingly, we show that microhabitat can have a much stronger influence on diversification than climatic niche position or rates of climatic niche change.
Inequalities in oral health: Understanding the contributions of education and income.
Farmer, Julie; Phillips, Rebecca C; Singhal, Sonica; Quiñonez, Carlos
2017-09-14
To quantify the extent to which income and education explain gradients in oral health outcomes. Using data from the Canadian Community Health Survey (CCHS 2003), binary logistic regression models were constructed to examine the relationship between income and education on self-reported oral health (SROH) and chewing difficulties (CD) while controlling for age, sex, ethnicity, employment status and dental insurance coverage. The relative index of inequality (RII) was utilized to quantify the extent to which income and education explain gradients in poor SROH and CD. Income and education gradients were present for SROH and CD. From fully adjusted models, income inequalities were greater for CD (RIIinc = 2.85) than for SROH (RIIinc = 2.75), with no substantial difference in education inequalities between the two. Income explained 37.4% and 42.4% of the education gradient in SROH and CD respectively, whereas education explained 45.2% and 6.1% of income gradients in SROH and CD respectively. Education appears to play a larger role than income when explaining inequalities in SROH; however, it is the opposite for CD. In this sample of the Canadian adult population, income explained over one third of the education gradient in SROH and CDs, whereas the contribution of education to income gradients varied by choice of self-reported outcome. Results call for stakeholders to improve affordability of dental care in order to reduce inequalities in the Canadian population.
Searching for a two-factor model of marriage duration: commentary on Gottman and Levenson.
DeKay, Michael L; Greeno, Catherine G; Houck, Patricia R
2002-01-01
Gottman and Levenson (2002) report a number of post hoc ordinary least squares regressions to "predict" the length of marriage, given that divorce has occurred. We argue that the type of statistical model they use is inappropriate for answering clinically relevant questions about the causes and timing of divorce, and present several reasons why an alternative family of models called duration models would be more appropriate. The distribution of marriage length is not bimodal, as Gottman and Levenson suggest, and their search for a two-factor model for explaining marriage length is misguided. Their regression models omit many variables known to affect marriage length, and instead use variables that were pre-screened for their predictive ability. Their final model is based on data for only 15 cases, including one unusual case that has undue influence on the results. For these and other technical reasons presented in the text, we believe that Gottman and Levenson's results are not replicable, and that they should not be used to guide interventions for couples in clinical settings.
The interaction between stratospheric monthly mean regional winds and sporadic-E
NASA Astrophysics Data System (ADS)
Çetin, Kenan; Özcan, Osman; Korlaelçi, Serhat
2017-03-01
In the present study, a statistical investigation is carried out to explore whether there is a relationship between the critical frequency (foEs) of the sporadic-E layer that is occasionally seen on the E region of the ionosphere and the quasi-biennial oscillation (QBO) that flows in the east-west direction in the equatorial stratosphere. Multiple regression model as a statistical tool was used to determine the relationship between variables. In this model, the stationarity of the variables (foEs and QBO) was firstly analyzed for each station (Cocos Island, Gibilmanna, Niue Island, and Tahiti). Then, a co-integration test was made to determine the existence of a long-term relationship between QBO and foEs. After verifying the presence of a long-term relationship between the variables, the magnitude of the relationship between variables was further determined using the multiple regression model. As a result, it is concluded that the variations in foEs were explainable with QBO measured at 10 hPa altitude at the rate of 69%, 94%, 79%, and 58% for Cocos Island, Gibilmanna, Niue Island, and Tahiti stations, respectively. It is observed that the variations in foEs were explainable with QBO measured at 70 hPa altitude at the rate of 66%, 69%, 53%, and 47% for Cocos Island, Gibilmanna, Niue Island, and Tahiti stations, respectively.
New methods of testing nonlinear hypothesis using iterative NLLS estimator
NASA Astrophysics Data System (ADS)
Mahaboob, B.; Venkateswarlu, B.; Mokeshrayalu, G.; Balasiddamuni, P.
2017-11-01
This research paper discusses the method of testing nonlinear hypothesis using iterative Nonlinear Least Squares (NLLS) estimator. Takeshi Amemiya [1] explained this method. However in the present research paper, a modified Wald test statistic due to Engle, Robert [6] is proposed to test the nonlinear hypothesis using iterative NLLS estimator. An alternative method for testing nonlinear hypothesis using iterative NLLS estimator based on nonlinear hypothesis using iterative NLLS estimator based on nonlinear studentized residuals has been proposed. In this research article an innovative method of testing nonlinear hypothesis using iterative restricted NLLS estimator is derived. Pesaran and Deaton [10] explained the methods of testing nonlinear hypothesis. This paper uses asymptotic properties of nonlinear least squares estimator proposed by Jenrich [8]. The main purpose of this paper is to provide very innovative methods of testing nonlinear hypothesis using iterative NLLS estimator, iterative NLLS estimator based on nonlinear studentized residuals and iterative restricted NLLS estimator. Eakambaram et al. [12] discussed least absolute deviation estimations versus nonlinear regression model with heteroscedastic errors and also they studied the problem of heteroscedasticity with reference to nonlinear regression models with suitable illustration. William Grene [13] examined the interaction effect in nonlinear models disused by Ai and Norton [14] and suggested ways to examine the effects that do not involve statistical testing. Peter [15] provided guidelines for identifying composite hypothesis and addressing the probability of false rejection for multiple hypotheses.
Bryan, Venise D; Lindo, Jascinth; Anderson-Johnson, Pauline; Weaver, Steve
2015-01-01
Faculty members are viewed as nurturers within the academic setting and may be able to influence students' behaviors through the formation of positive interpersonal relationships. Faculty members' attributes that best facilitated positive interpersonal relationships according to Carl Rogers' Person-Centered Model was studied. Students (n = 192) enrolled in a 3-year undergraduate nursing program in urban Jamaica were randomly selected to participate in this descriptive cross-sectional study. A 38-item questionnaire on interpersonal relationships with nursing faculty and students' perceptions of their teachers was utilized to collect data. Factor analysis was used to create factors of realness, prizing, and empathetic understanding. Multiple linear regression analysis on the interaction of the 3 factors and interpersonal relationship scores was performed while controlling for nursing students' study year and age. One hundred sixty-five students (mean age: 23.18 ± 4.51years; 99% female) responded. The regression model explained over 46% of the variance. Realness (β = 0.50, P < .001) was the only significant predictor of the interpersonal relationship scores assigned by the nursing students. Of the total number of respondents, 99 students (60%) reported satisfaction with the interpersonal relationships shared with faculty. Nursing students' perception of faculty members' realness appeared to be the most significant attribute in fostering positive interpersonal relationships. Copyright © 2015 Elsevier Inc. All rights reserved.
Chau, Kénora; Kabuth, Bernard; Chau, Nearkasen
2016-11-01
The risk of suicide behaviors in immigrant adolescents varies across countries and remains partly understood. We conducted a study in France to examine immigrant adolescents' likelihood of experiencing suicide ideation in the last 12 months (SI) and lifetime suicide attempts (SA) compared with their native counterparts, and the contribution of socioeconomic factors and school, behavior, and health-related difficulties. Questionnaires were completed by 1559 middle-school adolescents from north-eastern France including various risk factors, SI, SA, and their first occurrence over adolescent's life course (except SI). Data were analyzed using logistic regression models for SI and Cox regression models for SA (retaining only school, behavior, and health-related difficulties that started before SA). Immigrant adolescents had a two-time higher risk of SI and SA than their native counterparts. Using nested models, the excess SI risk was highly explained by socioeconomic factors (27%) and additional school, behavior, and health-related difficulties (24%) but remained significant. The excess SA risk was more highly explained by these issues (40% and 85%, respectively) and became non-significant. These findings demonstrate the risk patterns of SI and SA and the prominent confounding roles of socioeconomic factors and school, behavior, and health-related difficulties. They may be provided to policy makers, schools, carers, and various organizations interested in immigrant, adolescent, and suicide-behavior problems.
Applicability of Cameriere's and Drusini's age estimation methods to a sample of Turkish adults.
Hatice, Boyacioglu Dogru; Nihal, Avcu; Nursel, Akkaya; Humeyra Ozge, Yilanci; Goksuluk, Dincer
2017-10-01
The aim of this study was to investigate the applicability of Drusini's and Cameriere's methods to a sample of Turkish people. Panoramic images of 200 individuals were allocated into two groups as study and test groups and examined by two observers. Tooth coronal indexes (TCI), which is the ratio between coronal pulp cavity height and crown height, were calculated in the mandibular first and second premolars and molars. Pulp/tooth area ratios (ARs) were calculated in the maxillary and mandibular canine teeth. Study group measurements were used to derive a regression model. Test group measurements were used to evaluate the accuracy of the regression model. Pearson's correlation coefficients and regression analysis were used. The correlations between TCIs and age were -0.230, -0.301, -0.344 and -0.257 for mandibular first premolar, second premolar, first molar and second molar, respectively. Those for the maxillary canine (MX) and mandibular canine (MN) ARs were -0.716 and -0.514, respectively. The MX ARs were used to build the linear regression model that explained 51.2% of the total variation, with a standard error of 9.23 years. The mean error of the estimates in test group was 8 years and age of 64% of the individuals were estimated with an error of <±10 years which is acceptable in forensic age prediction. The low correlation coefficients between age and TCI indicate that Drusini's method was not applicable to the estimation of age in a Turkish population. Using Cameriere's method, we derived a regression model.
Palomo, M J; Quintanilla, R; Izquierdo, M D; Mogas, T; Paramio, M T
2016-12-01
This work analyses the changes that caprine spermatozoa undergo during in vitro fertilization (IVF) of in vitro matured prepubertal goat oocytes and their relationship with IVF outcome, in order to obtain an effective model that allows prediction of in vitro fertility on the basis of semen assessment. The evolution of several sperm parameters (motility, viability and acrosomal integrity) during IVF and their relationship with three IVF outcome criteria (total penetration, normal penetration and cleavage rates) were studied in a total of 56 IVF replicates. Moderate correlation coefficients between some sperm parameters and IVF outcome were observed. In addition, stepwise multiple regression analyses were conducted that considered three grouping of sperm parameters as potential explanatory variables of the three IVF outcome criteria. The proportion of IVF outcome variation that can be explained by the fitted models ranged from 0.62 to 0.86, depending upon the trait analysed and the variables considered. Seven out of 32 sperm parameters were selected as partial covariates in at least one of the nine multiple regression models. Among these, progressive sperm motility assessed immediately after swim-up, the percentage of dead sperm with intact acrosome and the incidence of acrosome reaction both determined just before the gamete co-culture, and finally the proportion of viable spermatozoa at 17 h post-insemination were the most frequently selected sperm parameters. Nevertheless, the predictive ability of these models must be confirmed in a larger sample size experiment.
Buonocore, Jonathan J; Dong, Xinyi; Spengler, John D; Fu, Joshua S; Levy, Jonathan I
2014-07-01
We estimated PM2.5-related public health impacts/ton emitted of primary PM2.5, SO2, and NOx for a set of power plants in the Mid-Atlantic and Lower Great Lakes regions of the United States, selected to include varying emission profiles and broad geographic representation. We then developed a regression model explaining variability in impacts per ton emitted using the population distributions around each plant. We linked outputs from the Community Multiscale Air Quality (CMAQ) model v 4.7.1 with census data and concentration-response functions for PM2.5-related mortality, and monetized health estimates using the value-of-statistical-life. The median impacts for the final set of plants were $130,000/ton for primary PM2.5 (range: $22,000-230,000), $28,000/ton for SO2 (range: $19,000-33,000), and $16,000/ton for NOx (range: $7100-26,000). Impacts of NOx were a median of 34% (range: 20%-75%) from ammonium nitrate and 66% (range: 25%-79%) from ammonium sulfate. The latter pathway is likely from NOx enhancing atmospheric oxidative capacity and amplifying sulfate formation, and is often excluded. Our regression models explained most of the variation in impact/ton estimates using basic population covariates, and can aid in estimating impacts averted from interventions such as pollution controls, alternative energy installations, or demand-side management. Copyright © 2014 Elsevier Ltd. All rights reserved.
Harris, Jocelyn E; MacDermid, Joy C; Roth, James
2005-01-01
Background Distal radius fractures are common injuries that have an increasing impact on health across the lifespan. The purpose of this study was to identify health impacts in body structure/function, activity, and participation at baseline and follow-up, to determine whether they support the ICF model of health. Methods This is a prospective cohort study of 790 individuals who were assessed at 1 week, 3 months, and 1 year post injury. The Patient Rated Wrist Evaluation (PRWE), The Wrist Outcome Measure (WOM), and the Medical Outcome Survey Short-Form (SF-36) were used to measure impairment, activity, participation, and health. Multiple regression was used to develop explanatory models of health outcome. Results Regression analysis showed that the PRWE explained between 13% (one week) and 33% (three months) of the SF-36 Physical Component Summary Scores with pain, activities and participation subscales showing dominant effects at different stages of recovery. PRWE scores were less related to Mental Component Summary Scores, 10% (three months) and 8% (one year). Wrist impairment scores were less powerful predictors of health status than the PRWE. Conclusion The ICF is an informative model for examining distal radius fracture. Difficulty in the domains of activity and participation were able to explain a significant portion of physical health. Post-fracture rehabilitation and outcome assessments should extend beyond physical impairment to insure comprehensive treatment to individuals with distal radius fracture. PMID:16288664
Giménez-Espert, María Del Carmen; Prado-Gascó, Vicente Javier
2018-03-01
To analyse link between empathy and emotional intelligence as a predictor of nurses' attitudes towards communication while comparing the contribution of emotional aspects and attitudinal elements on potential behaviour. Nurses' attitudes towards communication, empathy and emotional intelligence are key skills for nurses involved in patient care. There are currently no studies analysing this link, and its investigation is needed because attitudes may influence communication behaviours. Correlational study. To attain this goal, self-reported instruments (attitudes towards communication of nurses, trait emotional intelligence (Trait Emotional Meta-Mood Scale) and Jefferson Scale of Nursing Empathy (Jefferson Scale Nursing Empathy) were collected from 460 nurses between September 2015-February 2016. Two different analytical methodologies were used: traditional regression models and fuzzy-set qualitative comparative analysis models. The results of the regression model suggest that cognitive dimensions of attitude are a significant and positive predictor of the behavioural dimension. The perspective-taking dimension of empathy and the emotional-clarity dimension of emotional intelligence were significant positive predictors of the dimensions of attitudes towards communication, except for the affective dimension (for which the association was negative). The results of the fuzzy-set qualitative comparative analysis models confirm that the combination of high levels of cognitive dimension of attitudes, perspective-taking and emotional clarity explained high levels of the behavioural dimension of attitude. Empathy and emotional intelligence are predictors of nurses' attitudes towards communication, and the cognitive dimension of attitude is a good predictor of the behavioural dimension of attitudes towards communication of nurses in both regression models and fuzzy-set qualitative comparative analysis. In general, the fuzzy-set qualitative comparative analysis models appear to be better predictors than the regression models are. To evaluate current practices, establish intervention strategies and evaluate their effectiveness. The evaluation of these variables and their relationships are important in creating a satisfied and sustainable workforce and improving quality of care and patient health. © 2018 John Wiley & Sons Ltd.
NASA Astrophysics Data System (ADS)
Arcenegui, V.; Morugán, A.; García-Orenes, F.; Zornoza, R.; Mataix-Solera, J.; Navarro, M. A.; Guerrero, C.; Mataix-Beneyto, J.
2009-04-01
The use of treated wastewater for the irrigation of agricultural soils is an alternative to utilizing better-quality water, especially in semiarid regions where water shortage is a very serious problem. However, this practise can modify the soil equilibrium and affect its quality. In this work two soil quality indices (models) are used to evaluate the effects of long-term irrigation with treated wastewater in soil. The models were developed studying different soil properties in undisturbed forest soils in SE Spain, and the relationships between soil parameters were established using multiple linear regressions. Model 1, that explained 92% of the variance in soil organic carbon (SOC) showed that the SOC can be calculated by the linear combination of 6 physical, chemical and biochemical properties (acid phosphatase, water holding capacity (WHC), electrical conductivity (EC), available phosphorus (P), cation exchange capacity (CEC) and aggregate stability (AS)). Model 2 explains 89% of the SOC variance, which can be calculated by means of 7 chemical and biochemical properties (urease, phosphatase, and
NASA Astrophysics Data System (ADS)
Martínez-Fernández, J.; Chuvieco, E.; Koutsias, N.
2013-02-01
Humans are responsible for most forest fires in Europe, but anthropogenic factors behind these events are still poorly understood. We tried to identify the driving factors of human-caused fire occurrence in Spain by applying two different statistical approaches. Firstly, assuming stationary processes for the whole country, we created models based on multiple linear regression and binary logistic regression to find factors associated with fire density and fire presence, respectively. Secondly, we used geographically weighted regression (GWR) to better understand and explore the local and regional variations of those factors behind human-caused fire occurrence. The number of human-caused fires occurring within a 25-yr period (1983-2007) was computed for each of the 7638 Spanish mainland municipalities, creating a binary variable (fire/no fire) to develop logistic models, and a continuous variable (fire density) to build standard linear regression models. A total of 383 657 fires were registered in the study dataset. The binary logistic model, which estimates the probability of having/not having a fire, successfully classified 76.4% of the total observations, while the ordinary least squares (OLS) regression model explained 53% of the variation of the fire density patterns (adjusted R2 = 0.53). Both approaches confirmed, in addition to forest and climatic variables, the importance of variables related with agrarian activities, land abandonment, rural population exodus and developmental processes as underlying factors of fire occurrence. For the GWR approach, the explanatory power of the GW linear model for fire density using an adaptive bandwidth increased from 53% to 67%, while for the GW logistic model the correctly classified observations improved only slightly, from 76.4% to 78.4%, but significantly according to the corrected Akaike Information Criterion (AICc), from 3451.19 to 3321.19. The results from GWR indicated a significant spatial variation in the local parameter estimates for all the variables and an important reduction of the autocorrelation in the residuals of the GW linear model. Despite the fitting improvement of local models, GW regression, more than an alternative to "global" or traditional regression modelling, seems to be a valuable complement to explore the non-stationary relationships between the response variable and the explanatory variables. The synergy of global and local modelling provides insights into fire management and policy and helps further our understanding of the fire problem over large areas while at the same time recognizing its local character.
Chouchane, Hatem; Krol, Maarten S; Hoekstra, Arjen Y
2018-02-01
Growing water demands put increasing pressure on local water resources, especially in water-short countries. Virtual water trade can play a key role in filling the gap between local demand and supply of water-intensive commodities. This study aims to analyse the dynamics in virtual water trade of Tunisia in relation to environmental and socio-economic factors such as GDP, irrigated land, precipitation, population and water scarcity. The water footprint of crop production is estimated using AquaCrop for six crops over the period 1981-2010. Net virtual water import (NVWI) is quantified at yearly basis. Regression models are used to investigate dynamics in NVWI in relation to the selected factors. The results show that NVWI during the study period for the selected crops is not influenced by blue water scarcity. NVWI correlates in two alternative models to either population and precipitation (model I) or to GDP and irrigated area (model II). The models are better in explaining NVWI of staple crops (wheat, barley, potatoes) than NVWI of cash crops (dates, olives, tomatoes). Using model I, we are able to explain both trends and inter-annual variability for rain-fed crops. Model II performs better for irrigated crops and is able to explain trends significantly; no significant relation is found, however, with variables hypothesized to represent inter-annual variability. Copyright © 2017 Elsevier B.V. All rights reserved.
NASA Astrophysics Data System (ADS)
Hellack, Bryan; Sugiri, Dorothea; Schins, Roel P. F.; Schikowski, Tamara; Krämer, Ursula; Kuhlbusch, Thomas A. J.; Hoffmann, Barbara
2017-12-01
While land use regression models (LUR) are commonly used, e.g. for the prediction of spatially variable air pollutant mass concentrations, they are scarcely used for predicting the oxidative potential (OP), a suggested unifying predictor of health effects. Therefore a LUR model was developed to examine if long-term OP of fine particulate exposure can be reasonably predicted by LUR modeling and whether it is related to health effects in a study region comprised of urban and rural areas. Four 14-day sampling periods over 1 year at 40 sites in the western Ruhr Area and adjacent northern rural area, Germany, in 2002/2003 were conducted and annual Nitrogen Dioxide (NO2), fine particles (PM2.5), and OP were calculated. LUR models were developed to estimate spatially-resolved annual OP, NO2 and PM2.5 concentrations. The model performance was checked by leave-one-out cross validation (LOOCV) and cox regression was used to analyze the association of modeled residential OP and NO2 with incident type 2 diabetes mellitus (T2DM) in 1784 elderly women during a mean follow-up of 16 years (baseline 1985-1994). The measured OP and NO2 concentrations were moderately correlated (rSpearman 0.57). The LUR models explained 62% and 92% of the OP and NO2 variance (adjusted LOOCV R2 57% and 90%). PM10 emission from combustion in a 5000 m buffer was the most important predictor for OP and NO2. Modeled pollutants were highly correlated (rSpearman 0.87). Model quality for OP was sensitive to the inclusion of a single influential measurement site. For PM2.5 mass only an insufficient model with a low explained variance of 22% (adjusted R2) was developed so no health effects analyses were conducted with estimated PM2.5. Increases in OP and NO2 were associated with an increase in risk of T2DM by a hazard ratio of 1.38 (95% CI 1.06-1.80) and 1.39 (95% CI 1.07-1.81) per interquartile range of OP and NO2, respectively. We conclude that spatially-resolved OP can be predicted by LUR modeling, but future work is needed to investigate the possibility to increase OP model quality with refined predictors.
NASA Astrophysics Data System (ADS)
Hadley, Brian Christopher
This dissertation assessed remotely sensed data and geospatial modeling technique(s) to map the spatial distribution of total above-ground biomass present on the surface of the Savannah River National Laboratory's (SRNL) Mixed Waste Management Facility (MWMF) hazardous waste landfill. Ordinary least squares (OLS) regression, regression kriging, and tree-structured regression were employed to model the empirical relationship between in-situ measured Bahia (Paspalum notatum Flugge) and Centipede [Eremochloa ophiuroides (Munro) Hack.] grass biomass against an assortment of explanatory variables extracted from fine spatial resolution passive optical and LIDAR remotely sensed data. Explanatory variables included: (1) discrete channels of visible, near-infrared (NIR), and short-wave infrared (SWIR) reflectance, (2) spectral vegetation indices (SVI), (3) spectral mixture analysis (SMA) modeled fractions, (4) narrow-band derivative-based vegetation indices, and (5) LIDAR derived topographic variables (i.e. elevation, slope, and aspect). Results showed that a linear combination of the first- (1DZ_DGVI), second- (2DZ_DGVI), and third-derivative of green vegetation indices (3DZ_DGVI) calculated from hyperspectral data recorded over the 400--960 nm wavelengths of the electromagnetic spectrum explained the largest percentage of statistical variation (R2 = 0.5184) in the total above-ground biomass measurements. In general, the topographic variables did not correlate well with the MWMF biomass data, accounting for less than five percent of the statistical variation. It was concluded that tree-structured regression represented the optimum geospatial modeling technique due to a combination of model performance and efficiency/flexibility factors.
Connections between survey calibration estimators and semiparametric models for incomplete data
Lumley, Thomas; Shaw, Pamela A.; Dai, James Y.
2012-01-01
Survey calibration (or generalized raking) estimators are a standard approach to the use of auxiliary information in survey sampling, improving on the simple Horvitz–Thompson estimator. In this paper we relate the survey calibration estimators to the semiparametric incomplete-data estimators of Robins and coworkers, and to adjustment for baseline variables in a randomized trial. The development based on calibration estimators explains the ‘estimated weights’ paradox and provides useful heuristics for constructing practical estimators. We present some examples of using calibration to gain precision without making additional modelling assumptions in a variety of regression models. PMID:23833390
NASA Astrophysics Data System (ADS)
Ren-Yang, Zhao; Magun, Andreas; Schanda, Erwin
1990-12-01
In the present paper we report the results of a correlation analysis for 57 microwave impulsive bursts observed at six frequencies in which we have obtained a regression line between the peak frequency and the corresponding rise time of microwave impulsive bursts: {ie361-01} (with a correlation coefficient of - 0.43). This can be explained in the frame of a thermal model. The magnetic field decrease with height has to be much slower than in a dipole field in order to explain the weak dependence of f p on t r . This decrease of magnetic field with height in burst sources is based on the relationship between f p and t r found by assuming a thermal flare model with a collisionless conduction front.
De Cock, Rozane; Vangeel, Jolien; Klein, Annabelle; Minotte, Pascal; Rosas, Omar; Meerkerk, Gert-Jan
2014-03-01
A representative sample (n=1,000) of the Belgian population aged 18 years and older filled out an online questionnaire on their Internet use in general and their use of social networking sites (SNS) in particular. We measured total time spent on the Internet, time spent on SNS, number of SNS profiles, gender, age, schooling level, income, job occupation, and leisure activities, and we integrated several psychological scales such as the Quick Big Five and the Mastery Scale. Hierarchical multiple regression modeling shows that gender and age explain an important part of the compulsive SNS score (5%) as well as psychological scales (20%), but attitude toward school (additional 3%) and income (2.5%) also add to explained variance in predictive models of compulsive SNS use.
Weichenthal, Scott; Van Ryswyk, Keith; Goldstein, Alon; Shekarrizfard, Maryam; Hatzopoulou, Marianne
2016-01-01
Exposure models are needed to evaluate the chronic health effects of ambient ultrafine particles (<0.1 μm) (UFPs). We developed a land use regression model for ambient UFPs in Toronto, Canada using mobile monitoring data collected during summer/winter 2010-2011. In total, 405 road segments were included in the analysis. The final model explained 67% of the spatial variation in mean UFPs and included terms for the logarithm of distances to highways, major roads, the central business district, Pearson airport, and bus routes as well as variables for the number of on-street trees, parks, open space, and the length of bus routes within a 100 m buffer. There was no systematic difference between measured and predicted values when the model was evaluated in an external dataset, although the R(2) value decreased (R(2) = 50%). This model will be used to evaluate the chronic health effects of UFPs using population-based cohorts in the Toronto area. Crown Copyright © 2015. Published by Elsevier Ltd. All rights reserved.
Gulliver, John; Morley, David; Dunster, Chrissi; McCrea, Adrienne; van Nunen, Erik; Tsai, Ming-Yi; Probst-Hensch, Nicoltae; Eeftens, Marloes; Imboden, Medea; Ducret-Stich, Regina; Naccarati, Alessio; Galassi, Claudia; Ranzi, Andrea; Nieuwenhuijsen, Mark; Curto, Ariadna; Donaire-Gonzalez, David; Cirach, Marta; Vermeulen, Roel; Vineis, Paolo; Hoek, Gerard; Kelly, Frank J
2018-01-01
Oxidative potential (OP) of particulate matter (PM) is proposed as a biologically-relevant exposure metric for studies of air pollution and health. We aimed to evaluate the spatial variability of the OP of measured PM 2.5 using ascorbate (AA) and (reduced) glutathione (GSH), and develop land use regression (LUR) models to explain this spatial variability. We estimated annual average values (m -3 ) of OP AA and OP GSH for five areas (Basel, CH; Catalonia, ES; London-Oxford, UK (no OP GSH ); the Netherlands; and Turin, IT) using PM 2.5 filters. OP AA and OP GSH LUR models were developed using all monitoring sites, separately for each area and combined-areas. The same variables were then used in repeated sub-sampling of monitoring sites to test sensitivity of variable selection; new variables were offered where variables were excluded (p > .1). On average, measurements of OP AA and OP GSH were moderately correlated (maximum Pearson's maximum Pearson's R = = .7) with PM 2.5 and other metrics (PM 2.5 absorbance, NO 2 , Cu, Fe). HOV (hold-out validation) R 2 for OP AA models was .21, .58, .45, .53, and .13 for Basel, Catalonia, London-Oxford, the Netherlands and Turin respectively. For OP GSH , the only model achieving at least moderate performance was for the Netherlands (R 2 = .31). Combined models for OP AA and OP GSH were largely explained by study area with weak local predictors of intra-area contrasts; we therefore do not endorse them for use in epidemiologic studies. Given the moderate correlation of OP AA with other pollutants, the three reasonably performing LUR models for OP AA could be used independently of other pollutant metrics in epidemiological studies. Copyright © 2017 Elsevier Inc. All rights reserved.
European Wintertime Windstorms and its Links to Large-Scale Variability Modes
NASA Astrophysics Data System (ADS)
Befort, D. J.; Wild, S.; Walz, M. A.; Knight, J. R.; Lockwood, J. F.; Thornton, H. E.; Hermanson, L.; Bett, P.; Weisheimer, A.; Leckebusch, G. C.
2017-12-01
Winter storms associated with extreme wind speeds and heavy precipitation are the most costly natural hazard in several European countries. Improved understanding and seasonal forecast skill of winter storms will thus help society, policy-makers and (re-) insurance industry to be better prepared for such events. We firstly assess the ability to represent extra-tropical windstorms over the Northern Hemisphere of three seasonal forecast ensemble suites: ECMWF System3, ECMWF System4 and GloSea5. Our results show significant skill for inter-annual variability of windstorm frequency over parts of Europe in two of these forecast suites (ECMWF-S4 and GloSea5) indicating the potential use of current seasonal forecast systems. In a regression model we further derive windstorm variability using the forecasted NAO from the seasonal model suites thus estimating the suitability of the NAO as the only predictor. We find that the NAO as the main large-scale mode over Europe can explain some of the achieved skill and is therefore an important source of variability in the seasonal models. However, our results show that the regression model fails to reproduce the skill level of the directly forecast windstorm frequency over large areas of central Europe. This suggests that the seasonal models also capture other sources of variability/predictability of windstorms than the NAO. In order to investigate which other large-scale variability modes steer the interannual variability of windstorms we develop a statistical model using a Poisson GLM. We find that the Scandinavian Pattern (SCA) in fact explains a larger amount of variability for Central Europe during the 20th century than the NAO. This statistical model is able to skilfully reproduce the interannual variability of windstorm frequency especially for the British Isles and Central Europe with correlations up to 0.8.
NASA Astrophysics Data System (ADS)
Dons, Evi; Van Poppel, Martine; Kochan, Bruno; Wets, Geert; Int Panis, Luc
2013-08-01
Land use regression (LUR) modeling is a statistical technique used to determine exposure to air pollutants in epidemiological studies. Time-activity diaries can be combined with LUR models, enabling detailed exposure estimation and limiting exposure misclassification, both in shorter and longer time lags. In this study, the traffic related air pollutant black carbon was measured with μ-aethalometers on a 5-min time base at 63 locations in Flanders, Belgium. The measurements show that hourly concentrations vary between different locations, but also over the day. Furthermore the diurnal pattern is different for street and background locations. This suggests that annual LUR models are not sufficient to capture all the variation. Hourly LUR models for black carbon are developed using different strategies: by means of dummy variables, with dynamic dependent variables and/or with dynamic and static independent variables. The LUR model with 48 dummies (weekday hours and weekend hours) performs not as good as the annual model (explained variance of 0.44 compared to 0.77 in the annual model). The dataset with hourly concentrations of black carbon can be used to recalibrate the annual model, resulting in many of the original explaining variables losing their statistical significance, and certain variables having the wrong direction of effect. Building new independent hourly models, with static or dynamic covariates, is proposed as the best solution to solve these issues. R2 values for hourly LUR models are mostly smaller than the R2 of the annual model, ranging from 0.07 to 0.8. Between 6 a.m. and 10 p.m. on weekdays the R2 approximates the annual model R2. Even though models of consecutive hours are developed independently, similar variables turn out to be significant. Using dynamic covariates instead of static covariates, i.e. hourly traffic intensities and hourly population densities, did not significantly improve the models' performance.
Validation of ACG Case-mix for equitable resource allocation in Swedish primary health care.
Zielinski, Andrzej; Kronogård, Maria; Lenhoff, Håkan; Halling, Anders
2009-09-18
Adequate resource allocation is an important factor to ensure equity in health care. Previous reimbursement models have been based on age, gender and socioeconomic factors. An explanatory model based on individual need of primary health care (PHC) has not yet been used in Sweden to allocate resources. The aim of this study was to examine to what extent the ACG case-mix system could explain concurrent costs in Swedish PHC. Diagnoses were obtained from electronic PHC records of inhabitants in Blekinge County (approx. 150,000) listed with public PHC (approx. 120,000) for three consecutive years, 2004-2006. The inhabitants were then classified into six different resource utilization bands (RUB) using the ACG case-mix system. The mean costs for primary health care were calculated for each RUB and year. Using linear regression models and log-cost as dependent variable the adjusted R2 was calculated in the unadjusted model (gender) and in consecutive models where age, listing with specific PHC and RUB were added. In an additional model the ACG groups were added. Gender, age and listing with specific PHC explained 14.48-14.88% of the variance in individual costs for PHC. By also adding information on level of co-morbidity, as measured by the ACG case-mix system, to specific PHC the adjusted R2 increased to 60.89-63.41%. The ACG case-mix system explains patient costs in primary care to a high degree. Age and gender are important explanatory factors, but most of the variance in concurrent patient costs was explained by the ACG case-mix system.
Placebo influences on dyskinesia in Parkinson's disease.
Goetz, Christopher G; Laska, Eugene; Hicking, Christine; Damier, Philippe; Müller, Thomas; Nutt, John; Warren Olanow, C; Rascol, Olivier; Russ, Hermann
2008-04-15
Clinical features that are prognostic indicators of placebo response among dyskinetic Parkinson's disease patients were determined. Placebo-associated improvements occur in Parkinsonism, but responses in dyskinesia have not been studied. Placebo data from two multicenter studies with identical design comparing sarizotan to placebo for treating dyskinesia were accessed. Sarizotan (2 mg/day) failed to improve dyskinesia compared with placebo, but both treatments improved dyskinesia compared with baseline. Stepwise regression identified baseline characteristics that influenced dyskinesia response to placebo, and these factors were entered into a logistic regression model to quantify their influence on placebo-related dyskinesia improvements and worsening. Because placebo-associated improvements in Parkinsonism have been attributed to heightened dopaminergic activity, we also examined the association between changes in Parkinsonism and dyskinesia. Four hundred eighty-four subjects received placebo treatment; 178 met criteria for placebo-associated dyskinesia improvement and 37 for dyskinesia worsening. Older age, lower baseline Parkinsonism score, and lower total daily levodopa doses were associated with placebo-associated improvement, whereas lower baseline dyskinesia score was associated with placebo-associated worsening. Placebo-associated dyskinesia changes were not correlated with Parkinsonism changes, and all effects in the sarizotan group were statistically explained by the placebo-effect regression model. Dyskinesias are affected by placebo treatment. The absence of correlation between placebo-induced changes in dyskinesia and Parkinsonism argues against a dopaminergic activation mechanism to explain placebo-associated improvements in dyskinesia. The magnitude and variance of placebo-related changes and the factors that influence them can be helpful in the design of future clinical trials of antidyskinetic agents. 2007 Movement Disorder Society
Placebo Influences on Dyskinesia in Parkinson's Disease
Goetz, Christopher G.; Laska, Eugene; Hicking, Christine; Damier, Philippe; Müller, Thomas; Nutt, John; Olanow, C. Warren; Rascol, Olivier; Russ, Hermann
2009-01-01
Clinical features that are prognostic indicators of placebo response among dyskinetic Parkinson's disease patients were determined. Placebo-associated improvements occur in Parkinsonism, but responses in dyskinesia have not been studied. Placebo data from two multicenter studies with identical design comparing sarizotan to placebo for treating dyskinesia were accessed. Sarizotan (2 mg/day) failed to improve dyskinesia compared with placebo, but both treatments improved dyskinesia compared with baseline. Stepwise regression identified baseline characteristics that influenced dyskinesia response to placebo, and these factors were entered into a logistic regression model to quantify their influence on placebo-related dyskinesia improvements and worsening. Because placebo-associated improvements in Parkinsonism have been attributed to heightened dopaminergic activity, we also examined the association between changes in Parkinsonism and dyskinesia. Four hundred eighty-four subjects received placebo treatment; 178 met criteria for placebo-associated dyskinesia improvement and 37 for dyskinesia worsening. Older age, lower baseline Parkinsonism score, and lower total daily levodopa doses were associated with placebo-associated improvement, whereas lower baseline dyskinesia score was associated with placebo-associated worsening. Placebo-associated dyskinesia changes were not correlated with Parkinsonism changes, and all effects in the sarizotan group were statistically explained by the placebo-effect regression model. Dyskinesias are affected by placebo treatment. The absence of correlation between placebo-induced changes in dyskinesia and Parkinsonism argues against a dopaminergic activation mechanism to explain placebo-associated improvements in dyskinesia. The magnitude and variance of placebo-related changes and the factors that influence them can be helpful in the design of future clinical trials of antidyskinetic agents. PMID:18175337
SNPs selection using support vector regression and genetic algorithms in GWAS
2014-01-01
Introduction This paper proposes a new methodology to simultaneously select the most relevant SNPs markers for the characterization of any measurable phenotype described by a continuous variable using Support Vector Regression with Pearson Universal kernel as fitness function of a binary genetic algorithm. The proposed methodology is multi-attribute towards considering several markers simultaneously to explain the phenotype and is based jointly on statistical tools, machine learning and computational intelligence. Results The suggested method has shown potential in the simulated database 1, with additive effects only, and real database. In this simulated database, with a total of 1,000 markers, and 7 with major effect on the phenotype and the other 993 SNPs representing the noise, the method identified 21 markers. Of this total, 5 are relevant SNPs between the 7 but 16 are false positives. In real database, initially with 50,752 SNPs, we have reduced to 3,073 markers, increasing the accuracy of the model. In the simulated database 2, with additive effects and interactions (epistasis), the proposed method matched to the methodology most commonly used in GWAS. Conclusions The method suggested in this paper demonstrates the effectiveness in explaining the real phenotype (PTA for milk), because with the application of the wrapper based on genetic algorithm and Support Vector Regression with Pearson Universal, many redundant markers were eliminated, increasing the prediction and accuracy of the model on the real database without quality control filters. The PUK demonstrated that it can replicate the performance of linear and RBF kernels. PMID:25573332
ERIC Educational Resources Information Center
Beauchamp, Guy
2005-01-01
A study to present specific hypothesis that satisfactorily explain the boiling point of a number of molecules, CH[subscript w]F[subscript x]Cl[subscript y]Br[subscript z] having similar structure, and then analyze the model with the help of multiple linear regression (MLR), a data analysis tool. The MLR analysis was useful in selecting the…
The effect of gender equality on happiness: Statistical modeling and analysis.
Qian, Ge
2017-02-01
In this article, the researcher presents linear regression models that describe how five gender-equality indexes affect individuals' perceptions of well-being and happiness, controlling for their economic income and weighed by the population of the countries that contribute to the models. The logical premise of this study is that gender equality is not only favorable for women, but it is also conducive to fostering the greatest level of happiness for all people. The researcher believes that most findings confirmed this assumption and the opinion of John Stuart Mill regarding gender equality, while two exceptions are explained by social quality theory and the male breadwinner model.
The association between height and birth order: evidence from 652,518 Swedish men.
Myrskylä, Mikko; Silventoinen, Karri; Jelenkovic, Aline; Tynelius, Per; Rasmussen, Finn
2013-07-01
Birth order is associated with outcomes such as birth weight and adult socioeconomic position (SEP), but little is known about the association with adult height. This potential birth order-height association is important because height predicts health, and because the association may help explain population-level height trends. We studied the birth order-height association and whether it varies by family characteristics or birth cohort. We used the Swedish Military Conscription Register to analyse adult height among 652,518 men born in 1951-1983 using fixed effects regression models that compare brothers and account for genetic and social factors shared by brothers. We stratified the analysis by family size, parental SEP and birth cohort. We compared models with and without birth weight and birth length controls. Unadjusted analyses showed no differences between the first two birth orders but in the fixed effects regression, birth orders 2, 3 and 4 were associated with 0.4, 0.7 and 0.8 cm (p<0.001 for each) shorter height than birth order 1, respectively. The associations were similar in large and small and high-SEP and low-SEP families, but were attenuated in recent cohorts. Birth characteristics did not explain these associations. Birth order is an important determinant of height. The height difference between birth orders 3 and 1 is larger than the population-level height increase achieved over 10 years. The attenuation of the effect over cohorts may reflect improvements in living standards. Decreases in family size may explain some of the secular-height increases in countries with decreasing fertility.
Hifinger, Monika; Putrik, Polina; Ramiro, Sofia; Keszei, András P; Hmamouchi, Ihsane; Dougados, Maxime; Gossec, Laure; Boonen, Annelies
2016-04-01
To investigate the relationship between country of residence and fatigue in RA, and to explore which country characteristics are related to fatigue. Data from the multinational COMORA study were analysed. Contribution of country of residence to level of fatigue [0-10 on visual analogue scale (VAS)] and presence of severe fatigue (VAS ⩾ 5) was explored in multivariable linear or logistic regression models including first socio-demographics and objective disease outcomes (M1), and then also subjective outcomes (M2). Next, country of residence was replaced by country characteristics: gross domestic product (GDP), human development index (HDI), latitude (as indicator of climate), language and income inequality index (gini-index). Model fit (R(2)) for linear models was compared. A total of 3920 patients from 17 countries were included, mean age 56 years (s.d. 13), 82% females. Mean fatigue across countries ranged from 1.86 (s.d. 2.46) to 4.99 (s.d. 2.64) and proportion of severe fatigue from 14% (Venezuela) to 65% (Egypt). Objective disease outcomes did not explain much of the variation in fatigue ([Formula: see text] = 0.12), while subjective outcomes had a strong negative impact and partly explained the variation in fatigue ([Formula: see text]= 0.27). Country of residence had a significant additional effect (increasing model fit to [Formula: see text] = 0.20 and [Formula: see text] = 0.36, respectively). Remarkably, higher GDP and better HDI were associated with higher fatigue, and explained a large part of the country effect. Logistic regression confirmed the limited contribution of objective outcomes and the relevant contribution of country of residence. Country of residence has an important influence on fatigue. Paradoxically, patients from wealthier countries had higher fatigue. © The Author 2015. Published by Oxford University Press on behalf of the British Society for Rheumatology. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Landscape Variation in Tree Species Richness in Northern Iran Forests
Bourque, Charles P.-A.; Bayat, Mahmoud
2015-01-01
Mapping landscape variation in tree species richness (SR) is essential to the long term management and conservation of forest ecosystems. The current study examines the prospect of mapping field assessments of SR in a high-elevation, deciduous forest in northern Iran as a function of 16 biophysical variables representative of the area’s unique physiography, including topography and coastal placement, biophysical environment, and forests. Basic to this study is the development of moderate-resolution biophysical surfaces and associated plot-estimates for 202 permanent sampling plots. The biophysical variables include: (i) three topographic variables generated directly from the area’s digital terrain model; (ii) four ecophysiologically-relevant variables derived from process models or from first principles; and (iii) seven variables of Landsat-8-acquired surface reflectance and two, of surface radiance. With symbolic regression, it was shown that only four of the 16 variables were needed to explain 85% of observed plot-level variation in SR (i.e., wind velocity, surface reflectance of blue light, and topographic wetness indices representative of soil water content), yielding mean-absolute and root-mean-squared error of 0.50 and 0.78, respectively. Overall, localised calculations of wind velocity and surface reflectance of blue light explained about 63% of observed variation in SR, with wind velocity accounting for 51% of that variation. The remaining 22% was explained by linear combinations of soil-water-related topographic indices and associated thresholds. In general, SR and diversity tended to be greatest for plots dominated by Carpinus betulus (involving ≥ 33% of all trees in a plot), than by Fagus orientalis (median difference of one species). This study provides a significant step towards describing landscape variation in SR as a function of modelled and satellite-based information and symbolic regression. Methods in this study are sufficiently general to be applicable to the characterisation of SR in other forested regions of the world, providing plot-scale data are available for model generation. PMID:25849029
Landscape variation in tree species richness in northern Iran forests.
Bourque, Charles P-A; Bayat, Mahmoud
2015-01-01
Mapping landscape variation in tree species richness (SR) is essential to the long term management and conservation of forest ecosystems. The current study examines the prospect of mapping field assessments of SR in a high-elevation, deciduous forest in northern Iran as a function of 16 biophysical variables representative of the area's unique physiography, including topography and coastal placement, biophysical environment, and forests. Basic to this study is the development of moderate-resolution biophysical surfaces and associated plot-estimates for 202 permanent sampling plots. The biophysical variables include: (i) three topographic variables generated directly from the area's digital terrain model; (ii) four ecophysiologically-relevant variables derived from process models or from first principles; and (iii) seven variables of Landsat-8-acquired surface reflectance and two, of surface radiance. With symbolic regression, it was shown that only four of the 16 variables were needed to explain 85% of observed plot-level variation in SR (i.e., wind velocity, surface reflectance of blue light, and topographic wetness indices representative of soil water content), yielding mean-absolute and root-mean-squared error of 0.50 and 0.78, respectively. Overall, localised calculations of wind velocity and surface reflectance of blue light explained about 63% of observed variation in SR, with wind velocity accounting for 51% of that variation. The remaining 22% was explained by linear combinations of soil-water-related topographic indices and associated thresholds. In general, SR and diversity tended to be greatest for plots dominated by Carpinus betulus (involving ≥ 33% of all trees in a plot), than by Fagus orientalis (median difference of one species). This study provides a significant step towards describing landscape variation in SR as a function of modelled and satellite-based information and symbolic regression. Methods in this study are sufficiently general to be applicable to the characterisation of SR in other forested regions of the world, providing plot-scale data are available for model generation.
Herring, Bradley; Trish, Erin
2015-01-01
The slowed growth in national health care spending over the past decade has led analysts to question the extent to which this recent slowdown can be explained by predictable factors such as the Great Recession or must be driven by some unpredictable structural change in the health care sector. To help address this question, we first estimate a regression model for state personal health care spending for 1991-2009, with an emphasis on the explanatory power of income, insurance, and provider market characteristics. We then use the results from this simple predictive model to produce state-level projections of health care spending for 2010-2013 to subsequently compare those average projected state values with actual national spending for 2010-2013, finding that at least 70% of the recent slowdown in health care spending can likely be explained by long-standing patterns. We also use the results from this predictive model to both examine the Great Recession’s likely reduction in health care spending and project the Affordable Care Act’s insurance expansion’s likely increase in health care spending. PMID:26655685
NASA Astrophysics Data System (ADS)
Santoro, R.; Ingraffea, A. R.
2015-12-01
Previous modeling (ingraffea et al. PNAS, 2014) indicated roughly two-times higher cumulative risk for wellbore impairment in unconventional wells, relative to conventional wells, and large spatial variation in risk for oil and gas wells drilled in the state of Pennsylvania. Impairment risk for wells in the northeast portion of the state were found to be 8.5-times greater than that of wells drilled in the rest of the state. Here, we set out to explain this apparent regional variability through Boosted Regression Tree (BRT) analysis of geographic, developmental, and general well attributes. We find that regional variability is largely driven by the nature of the development, i.e. whether conventional or unconventional development is dominant. Oil and natural gas market prices and total well depths present as major influences in wellbore impairment, with moderate influences from well densities and geologic factors. The figure depicts influence paths for predictors of impairments for the state (top left), SW region (top right), unconventional/NE region (bottom left) and conventional/NW region (bottom right) models. Influences are scaled to reflect percent contributions in explaining variability in the model.
Schnall, Rebecca; Bakken, Suzanne
2011-09-01
To assess the applicability of the Technology Acceptance Model (TAM) constructs in explaining HIV case managers' behavioural intention to use a continuity of care record (CCR) with context-specific links designed to meet their information needs. Data were collected from 94 case managers who provide care to persons living with HIV (PLWH) using an online survey comprising three components: (1) demographic information: age, gender, ethnicity, race, Internet usage and computer experience; (2) mock-up of CCR with context-specific links; and items related to TAM constructs. Data analysis included: principal components factor analysis (PCA), assessment of internal consistency reliability and univariate and multivariate analysis. PCA extracted three factors (Perceived Ease of Use, Perceived Usefulness and Perceived Barriers to Use), explained variance = 84.9%, Cronbach's ά = 0.69-0.91. In a linear regression model, Perceived Ease of Use, Perceived Usefulness and Perceived Barriers to Use explained 43.6% (p < 0.001) of the variance in Behavioural Intention to use a CCR with context-specific links. Our study contributes to the evidence base regarding TAM in health care through expanding the type of professional surveyed, study setting and Health Information Technology assessed.
Patients whom neurologists find difficult to help
Carson, A; Stone, J; Warlow, C; Sharpe, M
2004-01-01
Objective: To test the hypothesis that patients whose symptoms were less explained by organic disease would be perceived as more difficult to help. Methods: In a consecutive series of 300 new neurology outpatients, neurologists indicated on four point Likert-type scales how "difficult to help" they found the patient and to what extent the patient's symptoms were explained by organic disease. The patients' demographics, health status, number of somatic symptoms, and mental state were also assessed. Results: The neurologists rated 143 patients (48%) as "not at all difficult" to help, 111 (37%) as "somewhat difficult", 27 (9%) as "very difficult", and 18 (6%) as "extremely difficult". A logistic regression model was constructed and the hypothesis that patients whose symptoms were less explained by organic disease would be perceived as more difficult to help was supported. The only other measured variable that contributed to perceived difficulty was physical disability, but it explained only a small amount of the variance. Conclusions: Neurologists find patients whose symptoms are not explained by organic disease more difficult to help than their other patients. PMID:15548505
Heteroscedasticity as a Basis of Direction Dependence in Reversible Linear Regression Models.
Wiedermann, Wolfgang; Artner, Richard; von Eye, Alexander
2017-01-01
Heteroscedasticity is a well-known issue in linear regression modeling. When heteroscedasticity is observed, researchers are advised to remedy possible model misspecification of the explanatory part of the model (e.g., considering alternative functional forms and/or omitted variables). The present contribution discusses another source of heteroscedasticity in observational data: Directional model misspecifications in the case of nonnormal variables. Directional misspecification refers to situations where alternative models are equally likely to explain the data-generating process (e.g., x → y versus y → x). It is shown that the homoscedasticity assumption is likely to be violated in models that erroneously treat true nonnormal predictors as response variables. Recently, Direction Dependence Analysis (DDA) has been proposed as a framework to empirically evaluate the direction of effects in linear models. The present study links the phenomenon of heteroscedasticity with DDA and describes visual diagnostics and nine homoscedasticity tests that can be used to make decisions concerning the direction of effects in linear models. Results of a Monte Carlo simulation that demonstrate the adequacy of the approach are presented. An empirical example is provided, and applicability of the methodology in cases of violated assumptions is discussed.
Zoellner, Jamie M; Porter, Kathleen J; Chen, Yvonnes; Hedrick, Valisa E; You, Wen; Hickman, Maja; Estabrooks, Paul A
2017-05-01
Guided by the theory of planned behaviour (TPB) and health literacy concepts, SIPsmartER is a six-month multicomponent intervention effective at improving SSB behaviours. Using SIPsmartER data, this study explores prediction of SSB behavioural intention (BI) and behaviour from TPB constructs using: (1) cross-sectional and prospective models and (2) 11 single-item assessments from interactive voice response (IVR) technology. Quasi-experimental design, including pre- and post-outcome data and repeated-measures process data of 155 intervention participants. Validated multi-item TPB measures, single-item TPB measures, and self-reported SSB behaviours. Hypothesised relationships were investigated using correlation and multiple regression models. TPB constructs explained 32% of the variance cross sectionally and 20% prospectively in BI; and explained 13-20% of variance cross sectionally and 6% prospectively. Single-item scale models were significant, yet explained less variance. All IVR models predicting BI (average 21%, range 6-38%) and behaviour (average 30%, range 6-55%) were significant. Findings are interpreted in the context of other cross-sectional, prospective and experimental TPB health and dietary studies. Findings advance experimental application of the TPB, including understanding constructs at outcome and process time points and applying theory in all intervention development, implementation and evaluation phases.
Ekbäck, Gunnar; Åstrøm, Anne Nordrehaug; Klock, Kristin; Ordell, Sven; Unell, Lennart
2012-07-01
The aims of this study were to identify explanatory factors of satisfaction with oral health among Norwegian and Swedish 65 year olds in terms of items from four different domains of ICF and to compare the strengths of the various ICF domains in explaining satisfaction with oral health. Further it was to assess whether the explanatory factors of ICF domains vary between Norway and Sweden. In 2007, standardized questionnaires were mailed to all the residents in certain counties of Sweden and Norway who were born in 1942. Response rates were 73.1% (n = 6078) in Sweden and 56.0% (n = 4062) in Norway. In total, 33 questions based on four different ICF domains were chosen to explain satisfaction with oral health. Logistic regression showed that four different ICF domains in terms of body function, body structure, activity/participation and environmental factors explained, respectively, 53%, 31%, 12% and 34% of the explanatory variance in the satisfaction with oral health. In the final analysis, only nine items were statistically significant (p < 0.05). This study indicates that ICF as a conceptual model could cover a broad spectrum of factors embedded in OHRQoL measured by a global question in Sweden and Norway. Nine items, representing four ICF domains, were important in the final model for explaining satisfaction with oral health.
Predictors of effects of lifestyle intervention on diabetes mellitus type 2 patients.
Jacobsen, Ramune; Vadstrup, Eva; Røder, Michael; Frølich, Anne
2012-01-01
The main aim of the study was to identify predictors of the effects of lifestyle intervention on diabetes mellitus type 2 patients by means of multivariate analysis. Data from a previously published randomised clinical trial, which compared the effects of a rehabilitation programme including standardised education and physical training sessions in the municipality's health care centre with the same duration of individual counseling in the diabetes outpatient clinic, were used. Data from 143 diabetes patients were analysed. The merged lifestyle intervention resulted in statistically significant improvements in patients' systolic blood pressure, waist circumference, exercise capacity, glycaemic control, and some aspects of general health-related quality of life. The linear multivariate regression models explained 45% to 80% of the variance in these improvements. The baseline outcomes in accordance to the logic of the regression to the mean phenomenon were the only statistically significant and robust predictors in all regression models. These results are important from a clinical point of view as they highlight the more urgent need for and better outcomes following lifestyle intervention for those patients who have worse general and disease-specific health.
Wagle, Jørgen; Farner, Lasse; Flekkøy, Kjell; Bruun Wyller, Torgeir; Sandvik, Leiv; Fure, Brynjar; Stensrød, Brynhild; Engedal, Knut
2011-01-01
To identify prognostic factors associated with functional outcome at 13 months in a sample of stroke rehabilitation patients. Specifically, we hypothesized that cognitive functioning early after stroke would predict long-term functional outcome independently of other factors. 163 stroke rehabilitation patients underwent a structured neuropsychological examination 2-3 weeks after hospital admittance, and their functional status was subsequently evaluated 13 months later with the modified Rankin Scale (mRS) as outcome measure. Three predictive models were built using linear regression analyses: a biological model (sociodemographics, apolipoprotein E genotype, prestroke vascular factors, lesion characteristics and neurological stroke-related impairment); a functional model (pre- and early post-stroke cognitive functioning, personal and instrumental activities of daily living, ADL, and depressive symptoms), and a combined model (including significant variables, with p value <0.05, from the biological and functional models). A combined model of 4 variables best predicted long-term functional outcome with explained variance of 49%: neurological impairment (National Institute of Health Stroke Scale; β = 0.402, p < 0.001), age (β = 0.233, p = 0.001), post-stroke cognitive functioning (Repeatable Battery of Neuropsychological Status, RBANS; β = -0.248, p = 0.001) and prestroke personal ADL (Barthel Index; β = -0.217, p = 0.002). Further linear regression analyses of which RBANS indexes and subtests best predicted long-term functional outcome showed that Coding (β = -0.484, p < 0.001) and Figure Copy (β = -0.233, p = 0.002) raw scores at baseline explained 42% of the variance in mRS scores at follow-up. Early post-stroke cognitive functioning as measured by the RBANS is a significant and independent predictor of long-term functional post-stroke outcome. Copyright © 2011 S. Karger AG, Basel.
Malosetti, Marcos; Ribaut, Jean-Marcel; van Eeuwijk, Fred A.
2013-01-01
Genotype-by-environment interaction (GEI) is an important phenomenon in plant breeding. This paper presents a series of models for describing, exploring, understanding, and predicting GEI. All models depart from a two-way table of genotype by environment means. First, a series of descriptive and explorative models/approaches are presented: Finlay–Wilkinson model, AMMI model, GGE biplot. All of these approaches have in common that they merely try to group genotypes and environments and do not use other information than the two-way table of means. Next, factorial regression is introduced as an approach to explicitly introduce genotypic and environmental covariates for describing and explaining GEI. Finally, QTL modeling is presented as a natural extension of factorial regression, where marker information is translated into genetic predictors. Tests for regression coefficients corresponding to these genetic predictors are tests for main effect QTL expression and QTL by environment interaction (QEI). QTL models for which QEI depends on environmental covariables form an interesting model class for predicting GEI for new genotypes and new environments. For realistic modeling of genotypic differences across multiple environments, sophisticated mixed models are necessary to allow for heterogeneity of genetic variances and correlations across environments. The use and interpretation of all models is illustrated by an example data set from the CIMMYT maize breeding program, containing environments differing in drought and nitrogen stress. To help readers to carry out the statistical analyses, GenStat® programs, 15th Edition and Discovery® version, are presented as “Appendix.” PMID:23487515
Eynon, Michael John; O'Donnell, Christopher; Williams, Lynn
2017-10-01
Given the mixed findings concerning self-determination theory in explaining adherence to exercise referral schemes (ERS), the present study attempted to examine whether autonomous motivation and psychological need satisfaction could predict ERS adherence. Participants referred to an 8-week ERS completed self-report measures grounded in self-determination theory and basic needs theory at baseline (N = 124), mid-scheme (N = 58), and at the end of the scheme (N = 40). Logistic regressions were used to analyse the data. Autonomous motivation measured at mid-scheme explained between 12 and 16% of the variance in ERS adherence. Autonomy, relatedness and competence measured at mid-scheme explained between 18 and 26% of the variance in ERS adherence. This model also explained between 18 and 25% when measured at the end of the scheme. The study found limited evidence for the role of autonomous motivation in explaining ERS adherence. Stronger support was found for the satisfaction of the three needs for autonomy, relatedness and competence in predicting ERS adherence. Future research should tap into the satisfaction of all three needs collectively to help foster ERS adherence.
Psychosocial correlates of HIV protection motivation among black adolescents in Venda, South Africa.
Boer, Henk; Mashamba, M Tshilidzi
2005-12-01
We assessed the usefulness of the theory of planned behavior (TPB) and protection motivation theory (PMT) to predict intended condom use among 201 adolescents from Venda, South Africa. Results indicated that both the TPB and the PMT could significantly predict intended condom use, although the level of explained variance was limited. Hierarchical regression analysis indicated that there was considerable overlap between the TPB and the PMT in predicting condom use intention. In the regression analysis that used both the TPB and the PMT variables subjective norms and response efficacy were positively related to intended condom use. The results indicated that both the TPB and the PMT were valuable in explaining intended condom use among African adolescents. The TPB made clear that the social environment is an important contextual factor, whereas the PMT made clear that response efficacy is positively related to condom use intention. The results of this study indicated that social cognition models have some value in the analysis of condom use intention of African adolescents, but the role of other factors like myths about condoms should be further examined.
Importance of participation in major life areas matters for return to work.
Kvam, Lisbeth; Vik, Kjersti; Eide, Arne Henning
2015-06-01
The complexity of the process and outcome of vocational rehabilitation yearns for a multifaceted approach. This article investigates whether importance of participation in major life areas for men and women predicts the outcome of vocational rehabilitation. This longitudinal study provides measure points at the start of the intervention (T1), at the end of the intervention (T2) and at a follow-up 6-12 months after completing the rehabilitation program (T3). Associations were assessed by nominal logistic regression. The importance of participation in work was positively associated to return to work (RTW), while the importance of participation in leisure activities and importance of participation in family was negatively associated with RTW after the rehabilitation. Gender and number of children also contributed significantly to the regression model. To identify individuals' subjective evaluation of the importance of participation may be of value in explaining return or not RTW and contribute to explain gender differences in outcomes. It may also inform rehabilitation counselors in collaboration with clients and facilitate tailoring interventions to the individual's needs.
Fossati, Andrea; Somma, Antonella; Borroni, Serena; Maffei, Cesare; Markon, Kristian E; Krueger, Robert F
2016-02-01
In order to evaluate if measures of DSM-5 Alternative PD Model domains predicted interview-based scores of general personality pathology when compared to self-report measures of DSM-IV Axis II/DSM-5 Section II PD criteria, 300 Italian community adults were administered the Iowa Personality Disorder Screen (IPDS) interview, the Personality Inventory for DSM-5 (PID-5), and the Personality Diagnostic Questionnaire-4+ (PDQ-4+). Multiple regression analyses showed that the five PID-5 domain scales collectively explained an adequate rate of the variance of the IPDS interview total score. This result was slightly lower than the amount of variance in the IPDS total score explained by the 10 PDQ-4+ scales. The PID-5 traits scales performed better than the PDQ-4+, although the difference was marginal. Hierarchical regression analyses revealed that the PID-5 domain and trait scales provided a moderate, but significant increase in the prediction of the general level of personality pathology above and beyond the PDQ-4+ scales.
Schetter, Timothy A; Walters, Timothy L; Root, Karen V
2013-09-01
Impacts of human land use pose an increasing threat to global biodiversity. Resource managers must respond rapidly to this threat by assessing existing natural areas and prioritizing conservation actions across multiple spatial scales. Plant species richness is a useful measure of biodiversity but typically can only be evaluated on small portions of a given landscape. Modeling relationships between spatial heterogeneity and species richness may allow conservation planners to make predictions of species richness patterns within unsampled areas. We utilized a combination of field data, remotely sensed data, and landscape pattern metrics to develop models of native and exotic plant species richness at two spatial extents (60- and 120-m windows) and at four ecological levels for northwestern Ohio's Oak Openings region. Multiple regression models explained 37-77 % of the variation in plant species richness. These models consistently explained more variation in exotic richness than in native richness. Exotic richness was better explained at the 120-m extent while native richness was better explained at the 60-m extent. Land cover composition of the surrounding landscape was an important component of all models. We found that percentage of human-modified land cover (negatively correlated with native richness and positively correlated with exotic richness) was a particularly useful predictor of plant species richness and that human-caused disturbances exert a strong influence on species richness patterns within a mixed-disturbance oak savanna landscape. Our results emphasize the importance of using a multi-scale approach to examine the complex relationships between spatial heterogeneity and plant species richness.
Explaining regional variation in home care use by demand and supply variables.
van Noort, Olivier; Schotanus, Fredo; van de Klundert, Joris; Telgen, Jan
2018-02-01
In the Netherlands, home care services like district nursing and personal assistance are provided by private service provider organizations and covered by private health insurance companies which bear legal responsibility for purchasing these services. To improve value for money, their procurement increasingly replaces fee-for-service payments with population based budgets. Setting appropriate population budgets requires adaptation to the legitimate needs of the population, whereas historical costs are likely to be influenced by supply factors as well, not all of which are necessarily legitimate. Our purpose is to explain home care costs in terms of demand and supply factors. This allows for adjusting historical cost patterns when setting population based budgets. Using expenses claims of 60 Dutch municipalities, we analyze eight demand variables and five supply variables with a multiple regression model to explain variance in the number of clients per inhabitant, costs per client and costs per inhabitant. Our models explain 69% of variation in the number of clients per inhabitant, 28% of costs per client and 56% of costs per inhabitant using demand factors. Moreover, we find that supply factors explain an additional 17-23% of variation. Predictors of higher utilization are home care organizations that are integrated with intramural nursing homes, higher competition levels among home care organizations and the availability of complementary services. Copyright © 2017. Published by Elsevier B.V.
[On the effectiveness of the homeopathic remedy Arnica montana].
Lüdtke, Rainer; Hacke, Daniela
2005-11-01
Arnica montana is a homeopathic remedy often prescribed after traumata and injuries. To assess whether Arnica is effective beyond placebo and to identify factors which support or contradict this effectiveness. All prospective, controlled trials on the effectiveness of homeopathic Arnica were included. Overall effectiveness was assessed by meta-analysis and meta-regression techniques. 68 comparisons from 49 clinical trials show a significant effectiveness of Arnica in traumatic injuries in random effects meta-analysis (odds ratio [OR], 0.36; 95% confidence interval [CI], 0.24-0.55), but not in meta-regression models (OR, 0.37; CI, 0.11-1.24). We found no evidence for publication bias. Studies from Medline-listed journals and high-quality studies are less likely to report positive results (p = 0.0006 and p = 0.0167). The hypothesis that homeopathic Arnica is effective could neither be proved nor rejected. All trials were highly heterogeneous, meta-regression does not help to explain this heterogeneity substantially.
Air Leakage of US Homes: Regression Analysis and Improvements from Retrofit
DOE Office of Scientific and Technical Information (OSTI.GOV)
Chan, Wanyu R.; Joh, Jeffrey; Sherman, Max H.
2012-08-01
LBNL Residential Diagnostics Database (ResDB) contains blower door measurements and other diagnostic test results of homes in United States. Of these, approximately 134,000 single-family detached homes have sufficient information for the analysis of air leakage in relation to a number of housing characteristics. We performed regression analysis to consider the correlation between normalized leakage and a number of explanatory variables: IECC climate zone, floor area, height, year built, foundation type, duct location, and other characteristics. The regression model explains 68% of the observed variability in normalized leakage. ResDB also contains the before and after retrofit air leakage measurements of approximatelymore » 23,000 homes that participated in weatherization assistant programs (WAPs) or residential energy efficiency programs. The two types of programs achieve rather similar reductions in normalized leakage: 30% for WAPs and 20% for other energy programs.« less
Doran, Kara S.; Howd, Peter A.; Sallenger,, Asbury H.
2016-01-04
Recent studies, and most of their predecessors, use tide gage data to quantify SL acceleration, ASL(t). In the current study, three techniques were used to calculate acceleration from tide gage data, and of those examined, it was determined that the two techniques based on sliding a regression window through the time series are more robust compared to the technique that fits a single quadratic form to the entire time series, particularly if there is temporal variation in the magnitude of the acceleration. The single-fit quadratic regression method has been the most commonly used technique in determining acceleration in tide gage data. The inability of the single-fit method to account for time-varying acceleration may explain some of the inconsistent findings between investigators. Properly quantifying ASL(t) from field measurements is of particular importance in evaluating numerical models of past, present, and future SLR resulting from anticipated climate change.
Disconcordance in Statistical Models of Bisphenol A and Chronic Disease Outcomes in NHANES 2003-08
Casey, Martin F.; Neidell, Matthew
2013-01-01
Background Bisphenol A (BPA), a high production chemical commonly found in plastics, has drawn great attention from researchers due to the substance’s potential toxicity. Using data from three National Health and Nutrition Examination Survey (NHANES) cycles, we explored the consistency and robustness of BPA’s reported effects on coronary heart disease and diabetes. Methods And Findings We report the use of three different statistical models in the analysis of BPA: (1) logistic regression, (2) log-linear regression, and (3) dose-response logistic regression. In each variation, confounders were added in six blocks to account for demographics, urinary creatinine, source of BPA exposure, healthy behaviours, and phthalate exposure. Results were sensitive to the variations in functional form of our statistical models, but no single model yielded consistent results across NHANES cycles. Reported ORs were also found to be sensitive to inclusion/exclusion criteria. Further, observed effects, which were most pronounced in NHANES 2003-04, could not be explained away by confounding. Conclusions Limitations in the NHANES data and a poor understanding of the mode of action of BPA have made it difficult to develop informative statistical models. Given the sensitivity of effect estimates to functional form, researchers should report results using multiple specifications with different assumptions about BPA measurement, thus allowing for the identification of potential discrepancies in the data. PMID:24223205
Overcoming multicollinearity in multiple regression using correlation coefficient
NASA Astrophysics Data System (ADS)
Zainodin, H. J.; Yap, S. J.
2013-09-01
Multicollinearity happens when there are high correlations among independent variables. In this case, it would be difficult to distinguish between the contributions of these independent variables to that of the dependent variable as they may compete to explain much of the similar variance. Besides, the problem of multicollinearity also violates the assumption of multiple regression: that there is no collinearity among the possible independent variables. Thus, an alternative approach is introduced in overcoming the multicollinearity problem in achieving a well represented model eventually. This approach is accomplished by removing the multicollinearity source variables on the basis of the correlation coefficient values based on full correlation matrix. Using the full correlation matrix can facilitate the implementation of Excel function in removing the multicollinearity source variables. It is found that this procedure is easier and time-saving especially when dealing with greater number of independent variables in a model and a large number of all possible models. Hence, in this paper detailed insight of the procedure is shown, compared and implemented.
Sense of coherence and hardiness as predictors of the mental health of college students.
Knowlden, Adam P; Sharma, Manoj; Kanekar, Amar; Atri, Ashutosh
Psychological distress has a deleterious impact on the mental health of college students. The purpose of this study was to specify a theoretical, sense of coherence, and hardiness-based regression model to predict the mental health of college students. The instruments employed to build the model included the Kessler Psychological Distress Scale K-6, the Sense of Coherence-29, and the College Student Hardiness Measure. Data were collected from a sample of college students (n = 220) attending a Midwestern university. Each of the theoretical predictors regressed on mental health was deemed significant. Collectively, the significant predictors produced an R2 adjusted value of 0.434 (p < 0.001), suggesting the final specified model explained 43.4% of the variance in mental health in the sample of participants. Qualitative cut-points were developed for each scale to aid in measurement of health promotion and education interventions designed to improve the mental health of college students.
Lannau, B; Van Geyt, C; Van Maele, G; Beele, H
2015-03-01
During the procurement of musculoskeletal grafts contamination may occur. As this might be detrimental for the acceptor, it is important to know which variables influence this occurrence and to alter procurement protocols accordingly. From 2004 to 2012 we gathered information on 6,428 allografts obtained from 291 donors. Using a multiple regression model we attempted to determine the factors that influence the contamination risk during procurement. We used the following variables: cause of death, type of hospital (i.e. university hospital vs. general hospital), previous blood vessel donation, previous organ donation, donor age, time between death and the start of the procurement, duration of the procurement, number of people attending the procurement and the number of procured grafts. The multiple regression model was only able to explain 5 % of the variability of the used outcome variable. None of the variables examined appear to have an important influence on the contamination risk.
Psychosocial factors influencing smokeless tobacco use by teen-age military dependents.
Lee, S; Raker, T; Chisick, M C
1994-02-01
Using bivariate and logistic regression analysis, we explored psychosocial correlates of smokeless tobacco (SLT) use in a sample of 2,257 teenage military dependents. We built separate regression models for males and females to explain triers and users of SLT. Results show female and male triers share five factors regarding SLT use--parental and peer approval, trying smoking, relatives using SLT, and athletic team membership. Male trial of SLT was additionally associated with race, difficulty in purchasing SLT, relatives who smoke, current smoking, and belief that SLT can cause mouth cancer. Male use of SLT was associated with race, seeing a dentist regularly, SLT counseling by a dentist, parental approval, trying and current smoking, and grade level. In all models, trying smoking was the strongest explanatory variable. Relatives and peers exert considerable influence on SLT use. Few triers or users had received SLT counseling from their dentist despite high dental utilization rates.
Folta, Sara C; Bell, Rick; Economos, Christina; Landers, Stewart; Goldberg, Jeanne P
2006-01-01
The purpose of this study was to test the utility of the Theory of Reasoned Action (TRA) in explaining young elementary school children's intention to consume legumes. A survey was conducted with children in an urban, multicultural community in Massachusetts. A total of 336 children participated. Logistic regression analysis was used to assess the strength of the relationship between attitude and subjective norm and intention. Although attitude was significantly associated with intention, the pseudo-R2 for the regression model that included only the TRA constructs was extremely low (.01). Adding demographic factors and preference improved the model's predictive ability, but attitude was no longer significant. The results of this study do not provide support for the predictive utility of the TRA with young elementary school children for this behavior, when demographic factors are accounted for. Hedonic factors, rather than reasoned judgments, may help drive children's intentions.
DeLisi, Matt; Angton, Alexia; Vaughn, Michael G; Trulson, Chad R; Caudill, Jonathan W; Beaver, Kevin M
2014-12-01
The association between psychopathy and crime is established, but the specific components of the personality disorders that most contribute to crime are largely unknown. Drawing on data from 723 confined delinquents in Missouri, the present study delved into the eight subscales of the Psychopathic Personality Inventory-Short Form to empirically assess the specific aspects of the disorder that are most responsible for explaining variation in career delinquency. Blame externalization emerged as the strongest predictor of career delinquency in ordinary least squares regression, logistic regression, and t-test models. Fearlessness and carefree nonplanfulness were also significant in all models. Other features of psychopathy, such as stress immunity, social potency, and coldheartedness were weakly and inconsistently predictive of career delinquency. Implications of these findings for the study of psychopathy and delinquent careers are discussed in this article. © The Author(s) 2013.
Eaton, Jennifer L; Mohr, David C; Hodgson, Michael J; McPhaul, Kathleen M
2018-02-01
To describe development and validation of the work-related well-being (WRWB) index. Principal components analysis was performed using Federal Employee Viewpoint Survey (FEVS) data (N = 392,752) to extract variables representing worker well-being constructs. Confirmatory factor analysis was performed to verify factor structure. To validate the WRWB index, we used multiple regression analysis to examine relationships with burnout associated outcomes. Principal Components Analysis identified three positive psychology constructs: "Work Positivity", "Co-worker Relationships", and "Work Mastery". An 11 item index explaining 63.5% of variance was achieved. The structural equation model provided a very good fit to the data. Higher WRWB scores were positively associated with all three employee experience measures examined in regression models. The new WRWB index shows promise as a valid and widely accessible instrument to assess worker well-being.
Eaton, Jennifer L; Mohr, David C; Hodgson, Michael J; McPhaul, Kathleen M
2017-10-11
To describe development and validation of the Work-Related Well-Being (WRWB) Index. Principal Components Analysis was performed using Federal Employee Viewpoint Survey (FEVS) data (N = 392,752) to extract variables representing worker well-being constructs. Confirmatory factor analysis was performed to verify factor structure. To validate the WRWB index, we used multiple regression analysis to examine relationships with burnout associated outcomes. PCA identified three positive psychology constructs: "Work Positivity", "Co-worker Relationships", and "Work Mastery". An 11 item index explaining 63.5% of variance was achieved. The structural equation model provided a very good fit to the data. Higher WRWB scores were positively associated with all 3 employee experience measures examined in regression models. The new WRWB index shows promise as a valid and widely accessible instrument to assess worker well-being.
Patient cost-sharing, socioeconomic status, and children's health care utilization.
Nilsson, Anton; Paul, Alexander
2018-05-01
This paper estimates the effect of cost-sharing on the demand for children's and adolescents' use of medical care. We use a large population-wide registry dataset including detailed information on contacts with the health care system as well as family income. Two different estimation strategies are used: regression discontinuity design exploiting age thresholds above which fees are charged, and difference-in-differences models exploiting policy changes. We also estimate combined regression discontinuity difference-in-differences models that take into account discontinuities around age thresholds caused by factors other than cost-sharing. We find that when care is free of charge, individuals increase their number of doctor visits by 5-10%. Effects are similar in middle childhood and adolescence, and are driven by those from low-income families. The differences across income groups cannot be explained by other factors that correlate with income, such as maternal education. Copyright © 2018 Elsevier B.V. All rights reserved.
Discrete Emotion Effects on Lexical Decision Response Times
Briesemeister, Benny B.; Kuchinke, Lars; Jacobs, Arthur M.
2011-01-01
Our knowledge about affective processes, especially concerning effects on cognitive demands like word processing, is increasing steadily. Several studies consistently document valence and arousal effects, and although there is some debate on possible interactions and different notions of valence, broad agreement on a two dimensional model of affective space has been achieved. Alternative models like the discrete emotion theory have received little interest in word recognition research so far. Using backward elimination and multiple regression analyses, we show that five discrete emotions (i.e., happiness, disgust, fear, anger and sadness) explain as much variance as two published dimensional models assuming continuous or categorical valence, with the variables happiness, disgust and fear significantly contributing to this account. Moreover, these effects even persist in an experiment with discrete emotion conditions when the stimuli are controlled for emotional valence and arousal levels. We interpret this result as evidence for discrete emotion effects in visual word recognition that cannot be explained by the two dimensional affective space account. PMID:21887307
Discrete emotion effects on lexical decision response times.
Briesemeister, Benny B; Kuchinke, Lars; Jacobs, Arthur M
2011-01-01
Our knowledge about affective processes, especially concerning effects on cognitive demands like word processing, is increasing steadily. Several studies consistently document valence and arousal effects, and although there is some debate on possible interactions and different notions of valence, broad agreement on a two dimensional model of affective space has been achieved. Alternative models like the discrete emotion theory have received little interest in word recognition research so far. Using backward elimination and multiple regression analyses, we show that five discrete emotions (i.e., happiness, disgust, fear, anger and sadness) explain as much variance as two published dimensional models assuming continuous or categorical valence, with the variables happiness, disgust and fear significantly contributing to this account. Moreover, these effects even persist in an experiment with discrete emotion conditions when the stimuli are controlled for emotional valence and arousal levels. We interpret this result as evidence for discrete emotion effects in visual word recognition that cannot be explained by the two dimensional affective space account.
Hietapakka, Laura; Elovainio, Marko; Heponiemi, Tarja; Presseau, Justin; Eccles, Martin; Aalto, Anna-Mari; Pekkarinen, Laura; Kuokkanen, Liisa; Sinervo, Timo
2013-10-01
We examined whether organizational justice is associated with sleep quality and performance in a population-based sample of 1,729 Finnish registered nurses working full time. In addition, we tested psychological mechanisms mediating the potential association. The results of multivariate linear regression analyses showed higher organizational justice to be associated with fewer sleeping problems (β values range from -.20 to -.11) and higher self-reported performance (β values range from .05 to .35). Furthermore, psychological distress (related to the psychological stress model) and job involvement (related to the psychosocial resource model) mediated the association between organizational justice and sleep. Sleeping problems partly mediated the association between organizational justice and performance. Psychological distress explained 51% to 83% and job involvement explained 10% to 15% of the total effects of justice variables on sleeping problems. The findings provide support for the psychological stress model and offer practical implications for reducing nurses' sleeping problems.
Occupational conditions and workers' sense of community: variations by gender and race.
Lambert, S J; Hopkins, K
1995-04-01
The literature is reviewed to define a sense of community in the workplace and to identify factors that may foster it. A model is developed and estimated with survey data from a culturally diverse sample of men and women performing lower-level jobs at a medium-sized manufacturing firm. Results of regression analyses are reported that correct for sample selection bias resulting from the lower response rates of minority workers. Findings suggest that well-designed jobs and supportive workplace relationships and policies are important in explaining workers' sense of community, defined as workers' perceptions of mutual commitment between employee and employer. Informal sources of support play a larger role in explaining men's sense of community, while formal sources of support are more important in explaining women's sense of community. Findings further suggest that African American workers, especially women, have a difficult time experiencing a sense of community at work.
NASA Astrophysics Data System (ADS)
Madani, Nima; Kimball, John S.; Running, Steven W.
2017-11-01
In the light use efficiency (LUE) approach of estimating the gross primary productivity (GPP), plant productivity is linearly related to absorbed photosynthetically active radiation assuming that plants absorb and convert solar energy into biomass within a maximum LUE (LUEmax) rate, which is assumed to vary conservatively within a given biome type. However, it has been shown that photosynthetic efficiency can vary within biomes. In this study, we used 149 global CO2 flux towers to derive the optimum LUE (LUEopt) under prevailing climate conditions for each tower location, stratified according to model training and test sites. Unlike LUEmax, LUEopt varies according to heterogeneous landscape characteristics and species traits. The LUEopt data showed large spatial variability within and between biome types, so that a simple biome classification explained only 29% of LUEopt variability over 95 global tower training sites. The use of explanatory variables in a mixed effect regression model explained 62.2% of the spatial variability in tower LUEopt data. The resulting regression model was used for global extrapolation of the LUEopt data and GPP estimation. The GPP estimated using the new LUEopt map showed significant improvement relative to global tower data, including a 15% R2 increase and 34% root-mean-square error reduction relative to baseline GPP calculations derived from biome-specific LUEmax constants. The new global LUEopt map is expected to improve the performance of LUE-based GPP algorithms for better assessment and monitoring of global terrestrial productivity and carbon dynamics.
Chau, Kénora; Kabuth, Bernard; Chau, Nearkasen
2016-01-01
The risk of suicide behaviors in immigrant adolescents varies across countries and remains partly understood. We conducted a study in France to examine immigrant adolescents’ likelihood of experiencing suicide ideation in the last 12 months (SI) and lifetime suicide attempts (SA) compared with their native counterparts, and the contribution of socioeconomic factors and school, behavior, and health-related difficulties. Questionnaires were completed by 1559 middle-school adolescents from north-eastern France including various risk factors, SI, SA, and their first occurrence over adolescent’s life course (except SI). Data were analyzed using logistic regression models for SI and Cox regression models for SA (retaining only school, behavior, and health-related difficulties that started before SA). Immigrant adolescents had a two-time higher risk of SI and SA than their native counterparts. Using nested models, the excess SI risk was highly explained by socioeconomic factors (27%) and additional school, behavior, and health-related difficulties (24%) but remained significant. The excess SA risk was more highly explained by these issues (40% and 85%, respectively) and became non-significant. These findings demonstrate the risk patterns of SI and SA and the prominent confounding roles of socioeconomic factors and school, behavior, and health-related difficulties. They may be provided to policy makers, schools, carers, and various organizations interested in immigrant, adolescent, and suicide-behavior problems. PMID:27809296
Headey, Derek; Frongillo, Edward A; Tran, Lan Mai; Rawat, Rahul; Ruel, Marie T; Menon, Purnima
2017-01-01
Background: Child linear growth sometimes improves in both intervention and comparison groups in evaluations of nutrition interventions, possibly because of spillover intervention effects to nonintervention areas or improvements in underlying determinants of nutritional change in both areas. Objective: We aimed to understand what changes in underlying socioeconomic characteristics and behavioral factors are important in explaining improvements in child linear growth. Methods: Baseline (2010) and endline (2014) surveys from the Alive & Thrive impact evaluation were used to identify the underlying determinants of height-for-age z scores (HAZs) among children aged 24–48 mo in Bangladesh (n = 4311) and 24–59 mo in Vietnam (n = 4002). Oaxaca-Blinder regression decompositions were used to examine which underlying determinants contributed to HAZ changes over time. Results: HAZs improved significantly between 2010 and 2014 in Bangladesh (∼0.18 SDs) and Vietnam (0.25 SDs). Underlying determinants improved substantially over time and were larger in Vietnam than in Bangladesh. Multiple regression models revealed significant associations between changes in HAZs and socioeconomic status (SES), food security, maternal education, hygiene, and birth weight in both countries. Changes in HAZs were significantly associated with maternal nutrition knowledge and child dietary diversity in Bangladesh, and with prenatal visits in Vietnam. Improvements in maternal nutrition knowledge in Bangladesh accounted for 20% of the total HAZ change, followed by maternal education (13%), SES (12%), hygiene (10%), and food security (9%). HAZ improvements in Vietnam were accounted for by changes in SES (26%), prenatal visits (25%), hygiene (19%), child birth weight (10%), and maternal education (7%). The decomposition models in both countries performed well, explaining >75% of the HAZ changes. Conclusions: Decomposition is a useful and simple technique for analyzing nonintervention drivers of nutritional change in intervention and comparison areas. Improvements in underlying determinants explained rapid improvements in HAZs between 2010 and 2014 in Bangladesh and Vietnam. PMID:28122930
Nguyen, Phuong Hong; Headey, Derek; Frongillo, Edward A; Tran, Lan Mai; Rawat, Rahul; Ruel, Marie T; Menon, Purnima
2017-03-01
Background: Child linear growth sometimes improves in both intervention and comparison groups in evaluations of nutrition interventions, possibly because of spillover intervention effects to nonintervention areas or improvements in underlying determinants of nutritional change in both areas. Objective: We aimed to understand what changes in underlying socioeconomic characteristics and behavioral factors are important in explaining improvements in child linear growth. Methods: Baseline (2010) and endline (2014) surveys from the Alive & Thrive impact evaluation were used to identify the underlying determinants of height-for-age z scores (HAZs) among children aged 24-48 mo in Bangladesh ( n = 4311) and 24-59 mo in Vietnam ( n = 4002). Oaxaca-Blinder regression decompositions were used to examine which underlying determinants contributed to HAZ changes over time. Results: HAZs improved significantly between 2010 and 2014 in Bangladesh (∼0.18 SDs) and Vietnam (0.25 SDs). Underlying determinants improved substantially over time and were larger in Vietnam than in Bangladesh. Multiple regression models revealed significant associations between changes in HAZs and socioeconomic status (SES), food security, maternal education, hygiene, and birth weight in both countries. Changes in HAZs were significantly associated with maternal nutrition knowledge and child dietary diversity in Bangladesh, and with prenatal visits in Vietnam. Improvements in maternal nutrition knowledge in Bangladesh accounted for 20% of the total HAZ change, followed by maternal education (13%), SES (12%), hygiene (10%), and food security (9%). HAZ improvements in Vietnam were accounted for by changes in SES (26%), prenatal visits (25%), hygiene (19%), child birth weight (10%), and maternal education (7%). The decomposition models in both countries performed well, explaining >75% of the HAZ changes. Conclusions: Decomposition is a useful and simple technique for analyzing nonintervention drivers of nutritional change in intervention and comparison areas. Improvements in underlying determinants explained rapid improvements in HAZs between 2010 and 2014 in Bangladesh and Vietnam.
Gupta, Nidhi; Heiden, Marina; Mathiassen, Svend Erik; Holtermann, Andreas
2016-05-01
We aimed at developing and evaluating statistical models predicting objectively measured occupational time spent sedentary or in physical activity from self-reported information available in large epidemiological studies and surveys. Two-hundred-and-fourteen blue-collar workers responded to a questionnaire containing information about personal and work related variables, available in most large epidemiological studies and surveys. Workers also wore accelerometers for 1-4 days measuring time spent sedentary and in physical activity, defined as non-sedentary time. Least-squares linear regression models were developed, predicting objectively measured exposures from selected predictors in the questionnaire. A full prediction model based on age, gender, body mass index, job group, self-reported occupational physical activity (OPA), and self-reported occupational sedentary time (OST) explained 63% (R (2)adjusted) of the variance of both objectively measured time spent sedentary and in physical activity since these two exposures were complementary. Single-predictor models based only on self-reported information about either OPA or OST explained 21% and 38%, respectively, of the variance of the objectively measured exposures. Internal validation using bootstrapping suggested that the full and single-predictor models would show almost the same performance in new datasets as in that used for modelling. Both full and single-predictor models based on self-reported information typically available in most large epidemiological studies and surveys were able to predict objectively measured occupational time spent sedentary or in physical activity, with explained variances ranging from 21-63%.
2011-01-01
Background Informal payments for health care are common in most former communist countries. This paper explores the demand side of these payments in Albania. By using data from the Living Standard Measurement Survey 2005 we control for individual determinants of informal payments in inpatient and outpatient health care. We use these results to explain the main factors contributing to the occurrence and extent of informal payments in Albania. Methods Using multivariate methods (logit and OLS) we test three models to explain informal payments: the cultural, economic and governance model. The results of logit models are presented here as odds ratios (OR) and results from OLS models as regression coefficients (RC). Results Our findings suggest differences in determinants of informal payments in inpatient and outpatient care. Generally our results show that informal payments are dependent on certain characteristics of patients, including age, area of residence, education, health status and health insurance. However, they are less dependent on income, suggesting homogeneity of payments across income categories. Conclusions We have found more evidence for the validity of governance and economic models than for the cultural model. PMID:21605459
Prediction of forest fires occurrences with area-level Poisson mixed models.
Boubeta, Miguel; Lombardía, María José; Marey-Pérez, Manuel Francisco; Morales, Domingo
2015-05-01
The number of fires in forest areas of Galicia (north-west of Spain) during the summer period is quite high. Local authorities are interested in analyzing the factors that explain this phenomenon. Poisson regression models are good tools for describing and predicting the number of fires per forest areas. This work employs area-level Poisson mixed models for treating real data about fires in forest areas. A parametric bootstrap method is applied for estimating the mean squared errors of fires predictors. The developed methodology and software are applied to a real data set of fires in forest areas of Galicia. Copyright © 2015 Elsevier Ltd. All rights reserved.
What Explains Patterns of Diversification and Richness among Animal Phyla?
Jezkova, Tereza; Wiens, John J.
2016-01-01
Animal phyla vary dramatically in species richness (from 1 species to >1.2 million), but the causes of this variation remain largely unknown. Animals have also evolved striking variation in morphology and ecology, including sessile marine taxa lacking heads, eyes, limbs, and complex organs (e.g. sponges), parasitic worms (e.g. nematodes, platyhelminths), and taxa with eyes, skeletons, limbs, and complex organs that dominate terrestrial ecosystems (arthropods, chordates). Relating this remarkable variation in traits to the diversification and richness of animal phyla is a fundamental yet unresolved problem in biology. Here, we test the impacts of 18 traits (including morphology, ecology, reproduction, and development) on diversification and richness of extant animal phyla. Using phylogenetic multiple regression, the best-fitting model includes five traits that explain ~74% of the variation in diversification rates (dioecy, parasitism, eyes/photoreceptors, a skeleton, non-marine habitat). However, a model including just three (skeleton, parasitism, habitat) explains nearly as much variation (~67%). Diversification rates then largely explain richness patterns. Our results also identify many striking traits that have surprisingly little impact on diversification (e.g. head, limbs, and complex circulatory and digestive systems). Overall, our results reveal the key factors that shape large-scale patterns of diversification and richness across >80% of all extant, described species. PMID:28221832
What Explains Patterns of Diversification and Richness among Animal Phyla?
Jezkova, Tereza; Wiens, John J
2017-03-01
Animal phyla vary dramatically in species richness (from one species to >1.2 million), but the causes of this variation remain largely unknown. Animals have also evolved striking variation in morphology and ecology, including sessile marine taxa lacking heads, eyes, limbs, and complex organs (e.g., sponges), parasitic worms (e.g., nematodes, platyhelminths), and taxa with eyes, skeletons, limbs, and complex organs that dominate terrestrial ecosystems (arthropods, chordates). Relating this remarkable variation in traits to the diversification and richness of animal phyla is a fundamental yet unresolved problem in biology. Here, we test the impacts of 18 traits (including morphology, ecology, reproduction, and development) on diversification and richness of extant animal phyla. Using phylogenetic multiple regression, the best-fitting model includes five traits that explain ∼74% of the variation in diversification rates (dioecy, parasitism, eyes/photoreceptors, a skeleton, nonmarine habitat). However, a model including just three (skeleton, parasitism, habitat) explains nearly as much variation (∼67%). Diversification rates then largely explain richness patterns. Our results also identify many striking traits that have surprisingly little impact on diversification (e.g., head, limbs, and complex circulatory and digestive systems). Overall, our results reveal the key factors that shape large-scale patterns of diversification and richness across >80% of all extant, described species.
Dutton, Daniel J; McLaren, Lindsay
2016-04-01
Obesity prevalence varies between geographic regions in Canada. The reasons for this variation are unclear but most likely implicate both individual-level and population-level factors. The objective of this study was to examine whether equalising correlates of body mass index (BMI) across these geographic regions could be reasonably expected to reduce differences in BMI distributions between regions. Using data from three cycles of the Canadian Community Health Survey (CCHS) 2001, 2003 and 2007 for males and females, we modelled between-region BMI cross-sectionally using quantile regression and Blinder-Oaxaca decomposition of the quantile regression results. We show that while individual-level variables (ie, age, income, education, physical activity level, fruit and vegetable consumption, smoking status, drinking status, family doctor status, rural status, employment in the past 12 months and marital status) may be Caucasian important correlates of BMI within geographic regions, those variables are not capable of explaining variation in BMI between regions. Equalisation of common correlates of BMI between regions cannot be reasonably expected to reduce differences in the BMI distributions between regions. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://www.bmj.com/company/products-services/rights-and-licensing/
Noh, Hwayoung; Freisling, Heinz; Assi, Nada; Zamora-Ros, Raul; Achaintre, David; Affret, Aurélie; Mancini, Francesca; Boutron-Ruault, Marie-Christine; Flögel, Anna; Boeing, Heiner; Kühn, Tilman; Schübel, Ruth; Trichopoulou, Antonia; Naska, Androniki; Kritikou, Maria; Palli, Domenico; Pala, Valeria; Tumino, Rosario; Ricceri, Fulvio; Santucci de Magistris, Maria; Cross, Amanda; Slimani, Nadia; Scalbert, Augustin; Ferrari, Pietro
2017-07-25
We identified urinary polyphenol metabolite patterns by a novel algorithm that combines dimension reduction and variable selection methods to explain polyphenol-rich food intake, and compared their respective performance with that of single biomarkers in the European Prospective Investigation into Cancer and Nutrition (EPIC) study. The study included 475 adults from four European countries (Germany, France, Italy, and Greece). Dietary intakes were assessed with 24-h dietary recalls (24-HDR) and dietary questionnaires (DQ). Thirty-four polyphenols were measured by ultra-performance liquid chromatography-electrospray ionization-tandem mass spectrometry (UPLC-ESI-MS-MS) in 24-h urine. Reduced rank regression-based variable importance in projection (RRR-VIP) and least absolute shrinkage and selection operator (LASSO) methods were used to select polyphenol metabolites. Reduced rank regression (RRR) was then used to identify patterns in these metabolites, maximizing the explained variability in intake of pre-selected polyphenol-rich foods. The performance of RRR models was evaluated using internal cross-validation to control for over-optimistic findings from over-fitting. High performance was observed for explaining recent intake (24-HDR) of red wine ( r = 0.65; AUC = 89.1%), coffee ( r = 0.51; AUC = 89.1%), and olives ( r = 0.35; AUC = 82.2%). These metabolite patterns performed better or equally well compared to single polyphenol biomarkers. Neither metabolite patterns nor single biomarkers performed well in explaining habitual intake (as reported in the DQ) of polyphenol-rich foods. This proposed strategy of biomarker pattern identification has the potential of expanding the currently still limited list of available dietary intake biomarkers.
Predicting dynamic knee joint load with clinical measures in people with medial knee osteoarthritis.
Hunt, Michael A; Bennell, Kim L
2011-08-01
Knee joint loading, as measured by the knee adduction moment (KAM), has been implicated in the pathogenesis of knee osteoarthritis (OA). Given that the KAM can only currently be accurately measured in the laboratory setting with sophisticated and expensive equipment, its utility in the clinical setting is limited. This study aimed to determine the ability of a combination of four clinical measures to predict KAM values. Three-dimensional motion analysis was used to calculate the peak KAM at a self-selected walking speed in 47 consecutive individuals with medial compartment knee OA and varus malalignment. Clinical predictors included: body mass; tibial angle measured using an inclinometer; walking speed; and visually observed trunk lean toward the affected limb during the stance phase of walking. Multiple linear regression was performed to predict KAM magnitudes using the four clinical measures. A regression model including body mass (41% explained variance), tibial angle (17% explained variance), and walking speed (9% explained variance) explained a total of 67% of variance in the peak KAM. Our study demonstrates that a set of measures easily obtained in the clinical setting (body mass, tibial alignment, and walking speed) can help predict the KAM in people with medial knee OA. Identifying those patients who are more likely to experience high medial knee loads could assist clinicians in deciding whether load-modifying interventions may be appropriate for patients, whilst repeated assessment of joint load could provide a mechanism to monitor disease progression or success of treatment. Copyright © 2010 Elsevier B.V. All rights reserved.
NASA Astrophysics Data System (ADS)
Mfumu Kihumba, Antoine; Vanclooster, Marnik
2013-04-01
Drinking water in Kinshasa, the capital of the Democratic Republic of Congo, is provided by extracting groundwater from the local aquifer, particularly in peripheral areas. The exploited groundwater body is mainly unconfined and located within a continuous detrital aquifer, primarily composed of sedimentary formations. However, the aquifer is subjected to an increasing threat of anthropogenic pollution pressure. Understanding the detailed origin of this pollution pressure is important for sustainable drinking water management in Kinshasa. The present study aims to explain the observed nitrate pollution problem, nitrate being considered as a good tracer for other pollution threats. The analysis is made in terms of physical attributes that are readily available using a statistical modelling approach. For the nitrate data, use was made of a historical groundwater quality assessment study, for which the data were re-analysed. The physical attributes are related to the topography, land use, geology and hydrogeology of the region. Prior to the statistical modelling, intrinsic and specific vulnerability for nitrate pollution was assessed. This vulnerability assessment showed that the alluvium area in the northern part of the region is the most vulnerable area. This area consists of urban land use with poor sanitation. Re-analysis of the nitrate pollution data demonstrated that the spatial variability of nitrate concentrations in the groundwater body is high, and coherent with the fragmented land use of the region and the intrinsic and specific vulnerability maps. For the statistical modeling use was made of multiple regression and regression tree analysis. The results demonstrated the significant impact of land use variables on the Kinshasa groundwater nitrate pollution and the need for a detailed delineation of groundwater capture zones around the monitoring stations. Key words: Groundwater , Isotopic, Kinshasa, Modelling, Pollution, Physico-chemical.
Individualized prediction of lung-function decline in chronic obstructive pulmonary disease
Zafari, Zafar; Sin, Don D.; Postma, Dirkje S.; Löfdahl, Claes-Göran; Vonk, Judith; Bryan, Stirling; Lam, Stephen; Tammemagi, C. Martin; Khakban, Rahman; Man, S.F. Paul; Tashkin, Donald; Wise, Robert A.; Connett, John E.; McManus, Bruce; Ng, Raymond; Hollander, Zsuszanna; Sadatsafavi, Mohsen
2016-01-01
Background: The rate of lung-function decline in chronic obstructive pulmonary disease (COPD) varies substantially among individuals. We sought to develop and validate an individualized prediction model for forced expiratory volume at 1 second (FEV1) in current smokers with mild-to-moderate COPD. Methods: Using data from a large long-term clinical trial (the Lung Health Study), we derived mixed-effects regression models to predict future FEV1 values over 11 years according to clinical traits. We modelled heterogeneity by allowing regression coefficients to vary across individuals. Two independent cohorts with COPD were used for validating the equations. Results: We used data from 5594 patients (mean age 48.4 yr, 63% men, mean baseline FEV1 2.75 L) to create the individualized prediction equations. There was significant between-individual variability in the rate of FEV1 decline, with the interval for the annual rate of decline that contained 95% of individuals being −124 to −15 mL/yr for smokers and −83 to 15 mL/yr for sustained quitters. Clinical variables in the final model explained 88% of variation around follow-up FEV1. The C statistic for predicting severity grades was 0.90. Prediction equations performed robustly in the 2 external data sets. Interpretation: A substantial part of individual variation in FEV1 decline can be explained by easily measured clinical variables. The model developed in this work can be used for prediction of future lung health in patients with mild-to-moderate COPD. Trial registration: Lung Health Study — ClinicalTrials.gov, no. NCT00000568; Pan-Canadian Early Detection of Lung Cancer Study — ClinicalTrials.gov, no. NCT00751660 PMID:27486205
Factors associated with fear of falling in people with Parkinson’s disease
2014-01-01
Background This study aimed to comprehensibly investigate potential contributing factors to fear of falling (FOF) among people with idiopathic Parkinson’s disease (PD). Methods The study included 104 people with PD. Mean (SD) age and PD-duration were 68 (9.4) and 5 (4.2) years, respectively, and the participants’ PD-symptoms were relatively mild. FOF (the dependent variable) was investigated with the Swedish version of the Falls Efficacy Scale, i.e. FES(S). The first multiple linear regression model replicated a previous study and independent variables targeted: walking difficulties in daily life; freezing of gait; dyskinesia; fatigue; need of help in daily activities; age; PD-duration; history of falls/near falls and pain. Model II included also the following clinically assessed variables: motor symptoms, cognitive functions, gait speed, dual-task difficulties and functional balance performance as well as reactive postural responses. Results Both regression models showed that the strongest contributing factor to FOF was walking difficulties, i.e. explaining 60% and 64% of the variance in FOF-scores, respectively. Other significant independent variables in both models were needing help from others in daily activities and fatigue. Functional balance was the only clinical variable contributing additional significant information to model I, increasing the explained variance from 66% to 73%. Conclusions The results imply that one should primarily target walking difficulties in daily life in order to reduce FOF in people mildly affected by PD. This finding applies even when considering a broad variety of aspects not previously considered in PD-studies targeting FOF. Functional balance performance, dependence in daily activities, and fatigue were also independently associated with FOF, but to a lesser extent. Longitudinal studies are warranted to gain an increased understanding of predictors of FOF in PD and who is at risk of developing a FOF. PMID:24456482
Sanders, Elizabeth A.; Berninger, Virginia W.; Abbott, Robert D.
2017-01-01
Sequential regression was used to evaluate whether language-related working memory components uniquely predict reading and writing achievement beyond cognitive-linguistic translation for students in grades 4–9 (N=103) with specific learning disabilities (SLDs) in subword handwriting (dysgraphia, n=25), word reading and spelling (dyslexia, n=60), or oral and written language (OWL LD, n=18). That is, SLDs are defined on basis of cascading level of language impairment (subword, word, and syntax/text). A 5-block regression model sequentially predicted literacy achievement from cognitive-linguistic translation (Block 1); working memory components for word form coding (Block 2), phonological and orthographic loops (Block 3), and supervisory focused or switching attention (Block4); and SLD groups (Block 5). Results showed that cognitive-linguistic translation explained an average of 27% and 15% of the variance in reading and writing achievement, respectively, but working memory components explained an additional 39% and 27% variance. Orthographic word form coding uniquely predicted nearly every measure, whereas attention switching only uniquely predicted reading. Finally, differences in reading and writing persisted between dyslexia and dysgraphia, with dysgraphia higher, even after controlling for Block 1 to 4 predictors. Differences in literacy achievement between students with dyslexia and OWL LD were largely explained by the Block 1 predictors. Applications to identifying and teaching students with these SLDs are discussed. PMID:28199175
Determinants of Internet use as a preferred source of information on personal health.
Lemire, Marc; Paré, Guy; Sicotte, Claude; Harvey, Charmian
2008-11-01
To understand the personal, social and cultural factors likely to explain recourse to the Internet as a preferred source of personal health information. A cross-sectional survey was conducted among a population of 2923 Internet users visiting a firmly established website that offers information on personal health. Multiple regression analysis was performed to identify the determinants of site use. The analysis template comprised four classes of determinants likely to explain Internet use: beliefs, intentions, user satisfaction and socio-demographic characteristics. Seven-point Likert scales were used. An analysis of the psychometric qualities of the variables provided compelling evidence of the construct's validity and reliability. A confirmatory factor analysis confirmed the correspondence with the factors predicted by the theoretical model. The regression analysis explained 35% of the variance in Internet use. Use was directly associated with five factors: perceived usefulness, importance given to written media in searches for health information, concern for personal health, importance given to the opinions of physicians and other health professionals, and the trust placed in the information available on the site itself. This study confirms the importance of the credibility of information on the frequency of Internet use as a preferred source of information on personal health. It also shows the potentially influential role of the Internet in the development of personal knowledge of health issues.
Physiological and anthropometric determinants of rhythmic gymnastics performance.
Douda, Helen T; Toubekis, Argyris G; Avloniti, Alexandra A; Tokmakidis, Savvas P
2008-03-01
To identify the physiological and anthropometric predictors of rhythmic gymnastics performance, which was defined from the total ranking score of each athlete in a national competition. Thirty-four rhythmic gymnasts were divided into 2 groups, elite (n = 15) and nonelite (n = 19), and they underwent a battery of anthropometric, physical fitness, and physiological measurements. The principal-components analysis extracted 6 components: anthropometric, flexibility, explosive strength, aerobic capacity, body dimensions, and anaerobic metabolism. These were used in a simultaneous multiple-regression procedure to determine which best explain the variance in rhythmic gymnastics performance. Based on the principal-component analysis, the anthropometric component explained 45% of the total variance, flexibility 12.1%, explosive strength 9.2%, aerobic capacity 7.4%, body dimensions 6.8%, and anaerobic metabolism 4.6%. Components of anthropometric (r = .50) and aerobic capacity (r = .49) were significantly correlated with performance (P < .01). When the multiple-regression model-y = 10.708 + (0.0005121 x VO2max) + (0.157 x arm span) + (0.814 x midthigh circumference) - (0.293 x body mass)-was applied to elite gymnasts, 92.5% of the variation was explained by VO2max (58.9%), arm span (12%), midthigh circumference (13.1%), and body mass (8.5%). Selected anthropometric characteristics, aerobic power, flexibility, and explosive strength are important determinants of successful performance. These findings might have practical implications for both training and talent identification in rhythmic gymnastics.
Rasmussen, Patrick P.; Ziegler, Andrew C.
2003-01-01
The sanitary quality of water and its use as a public-water supply and for recreational activities, such as swimming, wading, boating, and fishing, can be evaluated on the basis of fecal coliform and Escherichia coli (E. coli) bacteria densities. This report describes the overall sanitary quality of surface water in selected Kansas streams, the relation between fecal coliform and E. coli, the relation between turbidity and bacteria densities, and how continuous bacteria estimates can be used to evaluate the water-quality conditions in selected Kansas streams. Samples for fecal coliform and E. coli were collected at 28 surface-water sites in Kansas. Of the 318 samples collected, 18 percent exceeded the current Kansas Department of Health and Environment (KDHE) secondary contact recreational, single-sample criterion for fecal coliform (2,000 colonies per 100 milliliters of water). Of the 219 samples collected during the recreation months (April 1 through October 31), 21 percent exceeded the current (2003) KDHE single-sample fecal coliform criterion for secondary contact rec-reation (2,000 colonies per 100 milliliters of water) and 36 percent exceeded the U.S. Environmental Protection Agency (USEPA) recommended single-sample primary contact recreational criterion for E. coli (576 colonies per 100 milliliters of water). Comparisons of fecal coliform and E. coli criteria indicated that more than one-half of the streams sampled could exceed USEPA recommended E. coli criteria more frequently than the current KDHE fecal coliform criteria. In addition, the ratios of E. coli to fecal coliform (EC/FC) were smallest for sites with slightly saline water (specific conductance greater than 1,000 microsiemens per centimeter at 25 degrees Celsius), indicating that E. coli may not be a good indicator of sanitary quality for those streams. Enterococci bacteria may provide a more accurate assessment of the potential for swimming-related illnesses in these streams. Ratios of EC/FC and linear regression models were developed for estimating E. coli densities on the basis of measured fecal coliform densities for six individual and six groups of surface-water sites. Regression models developed for the six individual surface-water sites and six groups of sites explain at least 89 percent of the variability in E. coli densities. The EC/FC ratios and regression models are site specific and make it possible to convert historic fecal coliform bacteria data to estimated E. coli densities for the selected sites. The EC/FC ratios can be used to estimate E. coli for any range of historical fecal coliform densities, and in some cases with less error than the regression models. The basin- and statewide regression models explained at least 93 percent of the variance and best represent the sites where a majority of the data used to develop the models were collected (Kansas and Little Arkansas Basins). Comparison of the current (2003) KDHE geometric-mean primary contact criterion for fecal coliform bacteria of 200 col/100 mL to the 2002 USEPA recommended geometric-mean criterion of 126 col/100 mL for E. coli results in an EC/FC ratio of 0.63. The geometric-mean EC/FC ratio for all sites except Rattlesnake Creek (site 21) is 0.77, indicating that considerably more than 63 percent of the fecal coliform is E. coli. This potentially could lead to more exceedances of the recommended E. coli criterion, where the water now meets the current (2003) 200-col/100 mL fecal coliform criterion. In this report, turbidity was found to be a reliable estimator of bacteria densities. Regression models are provided for estimating fecal coliform and E. coli bacteria densities using continuous turbidity measurements. Prediction intervals also are provided to show the uncertainty associated with using the regression models. Eighty percent of all measured sample densities and individual turbidity-based estimates from the regression models were in agreement as exceedi
On statistical analysis of factors affecting anthocyanin extraction from Ixora siamensis
NASA Astrophysics Data System (ADS)
Mat Nor, N. A.; Arof, A. K.
2016-10-01
This study focused on designing an experimental model in order to evaluate the influence of operative extraction parameters employed for anthocyanin extraction from Ixora siamensis on CIE color measurements (a*, b* and color saturation). Extractions were conducted at temperatures of 30, 55 and 80°C, soaking time of 60, 120 and 180 min using acidified methanol solvent with different trifluoroacetic acid (TFA) contents of 0.5, 1.75 and 3% (v/v). The statistical evaluation was performed by running analysis of variance (ANOVA) and regression calculation to investigate the significance of the generated model. Results show that the generated regression models adequately explain the data variation and significantly represented the actual relationship between the independent variables and the responses. Analysis of variance (ANOVA) showed high coefficient determination values (R2) of 0.9687 for a*, 0.9621 for b* and 0.9758 for color saturation, thus ensuring a satisfactory fit of the developed models with the experimental data. Interaction between TFA content and extraction temperature exhibited to the highest significant influence on CIE color parameter.
Visentin, G; Penasa, M; Gottardo, P; Cassandro, M; De Marchi, M
2016-10-01
Milk minerals and coagulation properties are important for both consumers and processors, and they can aid in increasing milk added value. However, large-scale monitoring of these traits is hampered by expensive and time-consuming reference analyses. The objective of the present study was to develop prediction models for major mineral contents (Ca, K, Mg, Na, and P) and milk coagulation properties (MCP: rennet coagulation time, curd-firming time, and curd firmness) using mid-infrared spectroscopy. Individual milk samples (n=923) of Holstein-Friesian, Brown Swiss, Alpine Grey, and Simmental cows were collected from single-breed herds between January and December 2014. Reference analysis for the determination of both mineral contents and MCP was undertaken with standardized methods. For each milk sample, the mid-infrared spectrum in the range from 900 to 5,000cm(-1) was stored. Prediction models were calibrated using partial least squares regression coupled with a wavenumber selection technique called uninformative variable elimination, to improve model accuracy, and validated both internally and externally. The average reduction of wavenumbers used in partial least squares regression was 80%, which was accompanied by an average increment of 20% of the explained variance in external validation. The proportion of explained variance in external validation was about 70% for P, K, Ca, and Mg, and it was lower (40%) for Na. Milk coagulation properties prediction models explained between 54% (rennet coagulation time) and 56% (curd-firming time) of the total variance in external validation. The ratio of standard deviation of each trait to the respective root mean square error of prediction, which is an indicator of the predictive ability of an equation, suggested that the developed models might be effective for screening and collection of milk minerals and coagulation properties at the population level. Although prediction equations were not accurate enough to be proposed for analytic purposes, mid-infrared spectroscopy predictions could be evaluated as phenotypic information to genetically improve milk minerals and MCP on a large scale. Copyright © 2016 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
Explaining the heterogeneous scrapie surveillance figures across Europe: a meta-regression approach.
Del Rio Vilas, Victor J; Hopp, Petter; Nunes, Telmo; Ru, Giuseppe; Sivam, Kumar; Ortiz-Pelaez, Angel
2007-06-28
Two annual surveys, the abattoir and the fallen stock, monitor the presence of scrapie across Europe. A simple comparison between the prevalence estimates in different countries reveals that, in 2003, the abattoir survey appears to detect more scrapie in some countries. This is contrary to evidence suggesting the greater ability of the fallen stock survey to detect the disease. We applied meta-analysis techniques to study this apparent heterogeneity in the behaviour of the surveys across Europe. Furthermore, we conducted a meta-regression analysis to assess the effect of country-specific characteristics on the variability. We have chosen the odds ratios between the two surveys to inform the underlying relationship between them and to allow comparisons between the countries under the meta-regression framework. Baseline risks, those of the slaughtered populations across Europe, and country-specific covariates, available from the European Commission Report, were inputted in the model to explain the heterogeneity. Our results show the presence of significant heterogeneity in the odds ratios between countries and no reduction in the variability after adjustment for the different risks in the baseline populations. Three countries contributed the most to the overall heterogeneity: Germany, Ireland and The Netherlands. The inclusion of country-specific covariates did not, in general, reduce the variability except for one variable: the proportion of the total adult sheep population sampled as fallen stock by each country. A large residual heterogeneity remained in the model indicating the presence of substantial effect variability between countries. The meta-analysis approach was useful to assess the level of heterogeneity in the implementation of the surveys and to explore the reasons for the variation between countries.
Harmsen, Wouter J; Ribbers, Gerard M; Slaman, Jorrit; Heijenbrok-Kal, Majanka H; Khajeh, Ladbon; van Kooten, Fop; Neggers, Sebastiaan J C M M; van den Berg-Emons, Rita J
2017-05-01
Peak oxygen uptake (VO 2peak ) established during progressive cardiopulmonary exercise testing (CPET) is the "gold-standard" for cardiorespiratory fitness. However, CPET measurements may be limited in patients with aneurysmal subarachnoid hemorrhage (a-SAH) by disease-related complaints, such as cardiovascular health-risks or anxiety. Furthermore, CPET with gas-exchange analyses require specialized knowledge and infrastructure with limited availability in most rehabilitation facilities. To determine whether an easy-to-administer six-minute walk test (6MWT) is a valid clinical alternative to progressive CPET in order to predict VO 2peak in individuals with a-SAH. Twenty-seven patients performed the 6MWT and CPET with gas-exchange analyses on a cycle ergometer. Univariate and multivariate regression models were made to investigate the predictability of VO 2peak from the six-minute walk distance (6MWD). Univariate regression showed that the 6MWD was strongly related to VO 2peak (r = 0.75, p < 0.001), with an explained variance of 56% and a prediction error of 4.12 ml/kg/min, representing 18% of mean VO 2peak . Adding age and sex to an extended multivariate regression model improved this relationship (r = 0.82, p < 0.001), with an explained variance of 67% and a prediction error of 3.67 ml/kg/min corresponding to 16% of mean VO 2peak . The 6MWT is an easy-to-administer submaximal exercise test that can be selected to estimate cardiorespiratory fitness at an aggregated level, in groups of patients with a-SAH, which may help to evaluate interventions in a clinical or research setting. However, the relatively large prediction error does not allow for an accurate prediction in individual patients.
Georgiades, Anastasia; Davis, Vicki G; Atkins, Alexandra S; Khan, Anzalee; Walker, Trina W; Loebel, Antony; Haig, George; Hilt, Dana C; Dunayevich, Eduardo; Umbricht, Daniel; Sand, Michael; Keefe, Richard S E
2017-12-01
The MATRICS Consensus Cognitive Battery (MCCB) was developed to assess cognitive treatment effects in schizophrenia clinical trials, and is considered the FDA gold standard outcome measure for that purpose. The aim of the present study was to establish pre-treatment psychometric characteristics of the MCCB in a large pooled sample. The dataset included 2616 stable schizophrenia patients enrolled in 15 different clinical trials between 2007 and 2016 within the United States (94%) and Canada (6%). The MCCB was administered twice prior to the initiation of treatment in 1908 patients. Test-retest reliability and practice effects of the cognitive composite score, the neurocognitive composite score, which excludes the domain Social Cognition, and the subtests/domains were examined using Intra-Class Correlations (ICC) and Cohen's d. Simulated regression models explored which domains explained the greatest portion of variance in composite scores. Test-retest reliability was high (ICC=0.88) for both composite scores. Practice effects were small for the cognitive (d=0.15) and neurocognitive (d=0.17) composites. Simulated bootstrap regression analyses revealed that 3 of the 7 domains explained 86% of the variance for both composite scores. The domains that entered most frequently in the top 3 positions of the regression models were Speed of Processing, Working Memory, and Visual Learning. Findings provide definitive psychometric characteristics and a benchmark comparison for clinical trials using the MCCB. The test-retest reliability of the MCCB composite scores is considered excellent and the learning effects are small, fulfilling two of the key criteria for outcome measures in cognition clinical trials. Copyright © 2017 Elsevier B.V. All rights reserved.
Black Clouds vs Random Variation in Hospital Admissions.
Ong, Luei Wern; Dawson, Jeffrey D; Ely, John W
2018-06-01
Physicians often accuse their peers of being "black clouds" if they repeatedly have more than the average number of hospital admissions while on call. Our purpose was to determine whether the black-cloud phenomenon is real or explainable by random variation. We analyzed hospital admissions to the University of Iowa family medicine service from July 1, 2010 to June 30, 2015. Analyses were stratified by peer group (eg, night shift attending physicians, day shift senior residents). We analyzed admission numbers to find evidence of black-cloud physicians (those with significantly more admissions than their peers) and white-cloud physicians (those with significantly fewer admissions). The statistical significance of whether there were actual differences across physicians was tested with mixed-effects negative binomial regression. The 5-year study included 96 physicians and 6,194 admissions. The number of daytime admissions ranged from 0 to 10 (mean 2.17, SD 1.63). Night admissions ranged from 0 to 11 (mean 1.23, SD 1.22). Admissions increased from 1,016 in the first year to 1,523 in the fifth year. We found 18 white-cloud and 16 black-cloud physicians in simple regression models that did not control for this upward trend. After including study year and other potential confounding variables in the regression models, there were no significant associations between physicians and admission numbers and therefore no true black or white clouds. In this study, apparent black-cloud and white-cloud physicians could be explained by random variation in hospital admissions. However, this randomness incorporated a wide range in workload among physicians, with potential impact on resident education at the low end and patient safety at the high end.
QSAR modeling of flotation collectors using principal components extracted from topological indices.
Natarajan, R; Nirdosh, Inderjit; Basak, Subhash C; Mills, Denise R
2002-01-01
Several topological indices were calculated for substituted-cupferrons that were tested as collectors for the froth flotation of uranium. The principal component analysis (PCA) was used for data reduction. Seven principal components (PC) were found to account for 98.6% of the variance among the computed indices. The principal components thus extracted were used in stepwise regression analyses to construct regression models for the prediction of separation efficiencies (Es) of the collectors. A two-parameter model with a correlation coefficient of 0.889 and a three-parameter model with a correlation coefficient of 0.913 were formed. PCs were found to be better than partition coefficient to form regression equations, and inclusion of an electronic parameter such as Hammett sigma or quantum mechanically derived electronic charges on the chelating atoms did not improve the correlation coefficient significantly. The method was extended to model the separation efficiencies of mercaptobenzothiazoles (MBT) and aminothiophenols (ATP) used in the flotation of lead and zinc ores, respectively. Five principal components were found to explain 99% of the data variability in each series. A three-parameter equation with correlation coefficient of 0.985 and a two-parameter equation with correlation coefficient of 0.926 were obtained for MBT and ATP, respectively. The amenability of separation efficiencies of chelating collectors to QSAR modeling using PCs based on topological indices might lead to the selection of collectors for synthesis and testing from a virtual database.
Data Analysis & Statistical Methods for Command File Errors
NASA Technical Reports Server (NTRS)
Meshkat, Leila; Waggoner, Bruce; Bryant, Larry
2014-01-01
This paper explains current work on modeling for managing the risk of command file errors. It is focused on analyzing actual data from a JPL spaceflight mission to build models for evaluating and predicting error rates as a function of several key variables. We constructed a rich dataset by considering the number of errors, the number of files radiated, including the number commands and blocks in each file, as well as subjective estimates of workload and operational novelty. We have assessed these data using different curve fitting and distribution fitting techniques, such as multiple regression analysis, and maximum likelihood estimation to see how much of the variability in the error rates can be explained with these. We have also used goodness of fit testing strategies and principal component analysis to further assess our data. Finally, we constructed a model of expected error rates based on the what these statistics bore out as critical drivers to the error rate. This model allows project management to evaluate the error rate against a theoretically expected rate as well as anticipate future error rates.
Chun, Heeran; Khang, Young-Ho; Kim, Il-Ho; Cho, Sung-Il
2008-09-01
This study examines and explains the gender disparity in health despite rapid modernization in South Korea where the social structure is still based on traditional gender relations. A nationally representative sample of 2897 men and 3286 women aged 25-64 from the 2001 Korean National Health and Nutrition Examination Survey was analyzed. Health indicators included self rated health and chronic disease. Age-adjusted prevalence was computed according to a gender and odds ratios (OR) derived from logistic regression. Percentage changes in OR by inclusion of determinant variables (socio-structural, psychosocial, and behavioral) into the base logistic regression model were used to estimate the contributions to the gender gap in two morbidity measures. Results showed a substantial female excess in ill-health in both measures, revealing an increasing disparity in the older age group. Group-specific age-adjusted prevalence of ill-health showed an inverse relationship to socioeconomic position. When adjusting for each determinant, employment status, education, and depression contributed the greatest to the gender gap. After adjusting for all suggested determinants, 78% for self rated health and 86% for chronic disease in excess OR could be explained. After stratifying for age, the full model provided a complete explanation for the female excess in chronic illness, but for self rated health a female excess was still evident for the younger age group. Socio-structural factors played a crucial role in accounting for female excess in ill-health. This result calls for greater attention to gender-based health inequality stemming from socio-structural determinants in South Korea. Cross-cultural validation studies are suggested for further discussion of the link between changing gender relations and the gender health gap in morbidity in diverse settings.
Parsimonious model for blood glucose level monitoring in type 2 diabetes patients.
Zhao, Fang; Ma, Yan Fen; Wen, Jing Xiao; DU, Yan Fang; Li, Chun Lin; Li, Guang Wei
2014-07-01
To establish the parsimonious model for blood glucose monitoring in patients with type 2 diabetes receiving oral hypoglycemic agent treatment. One hundred and fifty-nine adult Chinese type 2 diabetes patients were randomized to receive rapid-acting or sustained-release gliclazide therapy for 12 weeks. Their blood glucose levels were measured at 10 time points in a 24 h period before and after treatment, and the 24 h mean blood glucose levels were measured. Contribution of blood glucose levels to the mean blood glucose level and HbA1c was assessed by multiple regression analysis. The correlation coefficients of blood glucose level measured at 10 time points to the daily MBG were 0.58-0.74 and 0.59-0.79, respectively, before and after treatment (P<0.0001). The multiple stepwise regression analysis showed that the blood glucose levels measured at 6 of the 10 time points could explain 95% and 97% of the changes in MBG before and after treatment. The three blood glucose levels, which were measured at fasting, 2 h after breakfast and before dinner, of the 10 time points could explain 84% and 86% of the changes in MBG before and after treatment, but could only explain 36% and 26% of the changes in HbA1c before and after treatment, and they had a poorer correlation with the HbA1c than with the 24 h MBG. The blood glucose levels measured at fasting, 2 h after breakfast and before dinner truly reflected the change 24 h blood glucose level, suggesting that they are appropriate for the self-monitoring of blood glucose levels in diabetes patients receiving oral anti-diabetes therapy. Copyright © 2014 The Editorial Board of Biomedical and Environmental Sciences. Published by China CDC. All rights reserved.
Time trends in physical activity in the state of São Paulo, Brazil: 2002-2008.
Matsudo, Victor K R; Matsudo, Sandra M; Araújo, Timóteo L; Andrade, Douglas R; Oliveira, Luis C; Hallal, Pedro C
2010-12-01
To document time trends in physical activity in the state of São Paulo, Brazil (2002-2008). In addition, we discuss the role of Agita São Paulo at explaining such trends. Cross-sectional surveys were carried out in 2002, 2003, 2006, and 2008 in the state of São Paulo, Brazil, using comparable sampling approaches and similar sample sizes. In all surveys, physical activity was measured using the short version of the International Physical Activity Questionnaire. Separate weekly scores of walking and moderate- and vigorous-intensity physical activities were generated; cutoff points of 0 and 150 min·wk were used. Also, we created a total physical activity score by summing these three types of activity. We used logistic regression models for adjusting time trends for the different sociodemographic compositions of the samples. The prevalence of no physical activity decreased from 9.6% in 2002 to 2.7% in 2008, whereas the proportion of subjects below the 150-min threshold decreased from 43.7% in 2002 to 11.6% in 2008. These trends were mainly explained by increases in walking and moderate-intensity physical activity. Increases in physical activity were slightly greater among females than among males. Logistic regression models confirmed that these trends were not due to the different compositions of the samples. Physical activity levels are increasing in the state of São Paulo, Brazil. Considering that the few data available in Brazil using the same instrument indicate exactly the opposite trend and that Agita São Paulo primarily incentives the involvement in moderate-intensity physical activity and walking, it seems that at least part of the trends described here are explained by the Agita São Paulo program.
Santos Tavares Silva, I; Sunnerhagen, K S; Willén, C; Ottenvall Hammar, I
2016-11-18
Fatigue is reported as one of the most disabling symptoms and is common among persons living with late effects of polio. Although fatigue has been studied in the context of people living with late effects of polio, there is a lack of knowledge concerning the association of fatigue and variables of importance for participation in daily life. Therefore, the aim of this study was to explore possible factors associated with fatigue among persons with late effects of polio in Sweden. This retrospective registry study consisted of 89 persons with late effects of polio living in Sweden. Fatigue was measured with the Multidimensional Fatigue Inventory (MFI-20) scale, Swedish version. Pearson's correlation coefficient was used to analyse the correlation between the factors and fatigue, and a multiple linear regression was carried out to explore factors for fatigue. Fatigue statistically significantly correlated with age (r = 0.234, p < 0.05) and the use of mobility assistive devices (r = 0.255, p < 0.05). The multiple linear regression model showed that the factors age (β = 0.304, p < 0.019) and mobility assistive devices (β = 0.262, p < 0.017) were associated with fatigue among persons living with late effects of polio, and the model partly explained 14% of the variation of fatigue. Fatigue could partly be explained by the extent of using mobility assistive devices and age. Healthcare professionals should provide and demonstrate the importance of assistive devices to ensure management of fatigue in persons living with late effects of polio.
Dyer, Betsey D.; Kahn, Michael J.; LeBlanc, Mark D.
2008-01-01
Classification and regression tree (CART) analysis was applied to genome-wide tetranucleotide frequencies (genomic signatures) of 195 archaea and bacteria. Although genomic signatures have typically been used to classify evolutionary divergence, in this study, convergent evolution was the focus. Temperature optima for most of the organisms examined could be distinguished by CART analyses of tetranucleotide frequencies. This suggests that pervasive (nonlinear) qualities of genomes may reflect certain environmental conditions (such as temperature) in which those genomes evolved. The predominant use of GAGA and AGGA as the discriminating tetramers in CART models suggests that purine-loading and codon biases of thermophiles may explain some of the results. PMID:19054742
Explaining Support Vector Machines: A Color Based Nomogram
Van Belle, Vanya; Van Calster, Ben; Van Huffel, Sabine; Suykens, Johan A. K.; Lisboa, Paulo
2016-01-01
Problem setting Support vector machines (SVMs) are very popular tools for classification, regression and other problems. Due to the large choice of kernels they can be applied with, a large variety of data can be analysed using these tools. Machine learning thanks its popularity to the good performance of the resulting models. However, interpreting the models is far from obvious, especially when non-linear kernels are used. Hence, the methods are used as black boxes. As a consequence, the use of SVMs is less supported in areas where interpretability is important and where people are held responsible for the decisions made by models. Objective In this work, we investigate whether SVMs using linear, polynomial and RBF kernels can be explained such that interpretations for model-based decisions can be provided. We further indicate when SVMs can be explained and in which situations interpretation of SVMs is (hitherto) not possible. Here, explainability is defined as the ability to produce the final decision based on a sum of contributions which depend on one single or at most two input variables. Results Our experiments on simulated and real-life data show that explainability of an SVM depends on the chosen parameter values (degree of polynomial kernel, width of RBF kernel and regularization constant). When several combinations of parameter values yield the same cross-validation performance, combinations with a lower polynomial degree or a larger kernel width have a higher chance of being explainable. Conclusions This work summarizes SVM classifiers obtained with linear, polynomial and RBF kernels in a single plot. Linear and polynomial kernels up to the second degree are represented exactly. For other kernels an indication of the reliability of the approximation is presented. The complete methodology is available as an R package and two apps and a movie are provided to illustrate the possibilities offered by the method. PMID:27723811
NASA Astrophysics Data System (ADS)
Gonçalves, Karen dos Santos; Winkler, Mirko S.; Benchimol-Barbosa, Paulo Roberto; de Hoogh, Kees; Artaxo, Paulo Eduardo; de Souza Hacon, Sandra; Schindler, Christian; Künzli, Nino
2018-07-01
Epidemiological studies generally use particulate matter measurements with diameter less 2.5 μm (PM2.5) from monitoring networks. Satellite aerosol optical depth (AOD) data has considerable potential in predicting PM2.5 concentrations, and thus provides an alternative method for producing knowledge regarding the level of pollution and its health impact in areas where no ground PM2.5 measurements are available. This is the case in the Brazilian Amazon rainforest region where forest fires are frequent sources of high pollution. In this study, we applied a non-linear model for predicting PM2.5 concentration from AOD retrievals using interaction terms between average temperature, relative humidity, sine, cosine of date in a period of 365,25 days and the square of the lagged relative residual. Regression performance statistics were tested comparing the goodness of fit and R2 based on results from linear regression and non-linear regression for six different models. The regression results for non-linear prediction showed the best performance, explaining on average 82% of the daily PM2.5 concentrations when considering the whole period studied. In the context of Amazonia, it was the first study predicting PM2.5 concentrations using the latest high-resolution AOD products also in combination with the testing of a non-linear model performance. Our results permitted a reliable prediction considering the AOD-PM2.5 relationship and set the basis for further investigations on air pollution impacts in the complex context of Brazilian Amazon Region.
Family and school environmental predictors of sleep bruxism in children.
Rossi, Debora; Manfredini, Daniele
2013-01-01
To identify potential predictors of self-reported sleep bruxism (SB) within children's family and school environments. A total of 65 primary school children (55.4% males, mean age 9.3 ± 1.9 years) were administered a 10-item questionnaire investigating the prevalence of self-reported SB as well as nine family and school-related potential bruxism predictors. Regression analyses were performed to assess the correlation between the potential predictors and SB. A positive answer to the self-reported SB item was endorsed by 18.8% of subjects, with no sex differences. Multiple variable regression analysis identified a final model showing that having divorced parents and not falling asleep easily were the only two weak predictors of self-reported SB. The percentage of explained variance for SB by the final multiple regression model was 13.3% (Nagelkerke's R² = 0.133). While having a high specificity and a good negative predictive value, the model showed unacceptable sensitivity and positive predictive values. The resulting accuracy to predict the presence of self-reported SB was 73.8%. The present investigation suggested that, among family and school-related matters, having divorced parents and not falling asleep easily were two predictors, even if weak, of a child's self-report of SB.
1992-05-01
regression analysis. The strength of any one variable can be estimated along with the strength of the entire model in explaining the variance of percent... applicable a set of damage functions is to a particular situation. Sometimes depth- damage functions are embedded in computer programs which calculate...functions. Chapter Six concludes with recommended policies on the development and application of depth-damage functions. 5 6 CHAPTER TWO CONSTRUCTION OF
Explaining and modeling the concentration and loading of Escherichia coli in a stream-A case study.
Wang, Chaozi; Schneider, Rebecca L; Parlange, Jean-Yves; Dahlke, Helen E; Walter, M Todd
2018-09-01
Escherichia coli (E. coli) level in streams is a public health indicator. Therefore, being able to explain why E. coli levels are sometimes high and sometimes low is important. Using citizen science data from Fall Creek in central NY we found that complementarily using principal component analysis (PCA) and partial least squares (PLS) regression provided insights into the drivers of E. coli and a mechanism for predicting E. coli levels, respectively. We found that stormwater, temperature/season and shallow subsurface flow are the three dominant processes driving the fate and transport of E. coli. PLS regression modeling provided very good predictions under stormwater conditions (R 2 = 0.85 for log (E. coli concentration) and R 2 = 0.90 for log (E. coli loading)); predictions under baseflow conditions were less robust. But, in our case, both E. coli concentration and E. coli loading were significantly higher under stormwater condition, so it is probably more important to predict high-flow E. coli hazards than low-flow conditions. Besides previously reported good indicators of in-stream E. coli level, nitrate-/nitrite-nitrogen and soluble reactive phosphorus were also found to be good indicators of in-stream E. coli levels. These findings suggest management practices to reduce E. coli concentrations and loads in-streams and, eventually, reduce the risk of waterborne disease outbreak. Copyright © 2018. Published by Elsevier B.V.
Tsunami Size Distributions at Far-Field Locations from Aggregated Earthquake Sources
NASA Astrophysics Data System (ADS)
Geist, E. L.; Parsons, T.
2015-12-01
The distribution of tsunami amplitudes at far-field tide gauge stations is explained by aggregating the probability of tsunamis derived from individual subduction zones and scaled by their seismic moment. The observed tsunami amplitude distributions of both continental (e.g., San Francisco) and island (e.g., Hilo) stations distant from subduction zones are examined. Although the observed probability distributions nominally follow a Pareto (power-law) distribution, there are significant deviations. Some stations exhibit varying degrees of tapering of the distribution at high amplitudes and, in the case of the Hilo station, there is a prominent break in slope on log-log probability plots. There are also differences in the slopes of the observed distributions among stations that can be significant. To explain these differences we first estimate seismic moment distributions of observed earthquakes for major subduction zones. Second, regression models are developed that relate the tsunami amplitude at a station to seismic moment at a subduction zone, correcting for epicentral distance. The seismic moment distribution is then transformed to a site-specific tsunami amplitude distribution using the regression model. Finally, a mixture distribution is developed, aggregating the transformed tsunami distributions from all relevant subduction zones. This mixture distribution is compared to the observed distribution to assess the performance of the method described above. This method allows us to estimate the largest tsunami that can be expected in a given time period at a station.
The Influence of Contextual and Psychosocial Factors on Handwashing.
Seimetz, Elisabeth; Boyayo, Anne-Marie; Mosler, Hans-Joachim
2016-06-01
Even though washing hands with soap is among the most effective measures to reduce the risk of infection, handwashing rates in infrastructure-restricted settings remain seriously low. Little is known about how context alone and in interaction with psychosocial factors influence hand hygiene behavior. The aim of this article was to explore how both contextual and psychosocial factors affect handwashing practices. A cross-sectional survey was conducted with 660 caregivers of primary school children in rural Burundi. Hierarchical regression analyses revealed that household wealth, the amount of water per person, and having a designated place for washing hands were contextual factors significantly predicting handwashing frequency, whereas the contextual factors, time spent collecting water and amount of money spent on soap, were not significant predictors. The contextual factors explained about 13% of the variance of reported handwashing frequency. The addition of the psychosocial factors to the regression model resulted in a significant 41% increase of explained variation in handwashing frequency. In this final model, the amount of water was the only contextual factor that remained a significant predictor. The most important predictors were a belief of self-efficacy, planning how, when, and where to wash hands, and always remembering to do so. The findings suggest that contextual constraints might be perceived rather than actual barriers and highlight the role of psychosocial factors in understanding hygiene behaviors. © The American Society of Tropical Medicine and Hygiene.
NASA Astrophysics Data System (ADS)
Molina, Armando; Govers, Gerard; Poesen, Jean; Van Hemelryck, Hendrik; De Bièvre, Bert; Vanacker, Veerle
2008-06-01
A large spatial variability in sediment yield was observed from small streams in the Ecuadorian Andes. The objective of this study was to analyze the environmental factors controlling these variations in sediment yield in the Paute basin, Ecuador. Sediment yield data were calculated based on sediment volumes accumulated behind checkdams for 37 small catchments. Mean annual specific sediment yield (SSY) shows a large spatial variability and ranges between 26 and 15,100 Mg km - 2 year - 1 . Mean vegetation cover (C, fraction) in the catchment, i.e. the plant cover at or near the surface, exerts a first order control on sediment yield. The fractional vegetation cover alone explains 57% of the observed variance in ln(SSY). The negative exponential relation (SSY = a × e- b C) which was found between vegetation cover and sediment yield at the catchment scale (10 3-10 9 m 2), is very similar to the equations derived from splash, interrill and rill erosion experiments at the plot scale (1-10 3 m 2). This affirms the general character of an exponential decrease of sediment yield with increasing vegetation cover at a wide range of spatial scales, provided the distribution of cover can be considered to be essentially random. Lithology also significantly affects the sediment yield, and explains an additional 23% of the observed variance in ln(SSY). Based on these two catchment parameters, a multiple regression model was built. This empirical regression model already explains more than 75% of the total variance in the mean annual sediment yield. These results highlight the large potential of revegetation programs for controlling sediment yield. They show that a slight increase in the overall fractional vegetation cover of degraded land is likely to have a large effect on sediment production and delivery. Moreover, they point to the importance of detailed surface vegetation data for predicting and modeling sediment production rates.
Factors affecting road mortality of white-tailed deer in eastern South Dakota
Grovenburg, Troy W.; Jenks, Jonathan A.; Klaver, Robert W.; Monteith, Kevin L.; Galster, Dwight H.; Schauer, Ron J.; Morlock, Wilbert W.; Delger, Joshua A.
2008-01-01
White-tailed deer (Odocoileus virginianus) mortalities (n = 4,433) caused by collisions with automobiles during 2003 were modeled in 35 counties in eastern South Dakota. Seventeen independent variables and 5 independent variable interactions were evaluated to explain deer mortalities. A negative binomial regression model (Ln Y = 1.25 – 0.12 [percentage tree coverage] + 0.0002 [county area] + 5.39 [county hunter success rate] + 0.0023 [vehicle proxy 96–104 km/hr roads], model deviance = 33.43, χ2 = 27.53, df = 27) was chosen using a combination of a priori model selection and AICc. Management options include use of the model to predict road mortalities and to increase the number of hunting licenses, which could result in fewer DVCs.
Early experiences building a software quality prediction model
NASA Technical Reports Server (NTRS)
Agresti, W. W.; Evanco, W. M.; Smith, M. C.
1990-01-01
Early experiences building a software quality prediction model are discussed. The overall research objective is to establish a capability to project a software system's quality from an analysis of its design. The technical approach is to build multivariate models for estimating reliability and maintainability. Data from 21 Ada subsystems were analyzed to test hypotheses about various design structures leading to failure-prone or unmaintainable systems. Current design variables highlight the interconnectivity and visibility of compilation units. Other model variables provide for the effects of reusability and software changes. Reported results are preliminary because additional project data is being obtained and new hypotheses are being developed and tested. Current multivariate regression models are encouraging, explaining 60 to 80 percent of the variation in error density of the subsystems.
Innovating patient care delivery: DSRIP's interrupted time series analysis paradigm.
Shenoy, Amrita G; Begley, Charles E; Revere, Lee; Linder, Stephen H; Daiger, Stephen P
2017-12-08
Adoption of Medicaid Section 1115 waiver is one of the many ways of innovating healthcare delivery system. The Delivery System Reform Incentive Payment (DSRIP) pool, one of the two funding pools of the waiver has four categories viz. infrastructure development, program innovation and redesign, quality improvement reporting and lastly, bringing about population health improvement. A metric of the fourth category, preventable hospitalization (PH) rate was analyzed in the context of eight conditions for two time periods, pre-reporting years (2010-2012) and post-reporting years (2013-2015) for two hospital cohorts, DSRIP participating and non-participating hospitals. The study explains how DSRIP impacted Preventable Hospitalization (PH) rates of eight conditions for both hospital cohorts within two time periods. Eight PH rates were regressed as the dependent variable with time, intervention and post-DSRIP Intervention as independent variables. PH rates of eight conditions were then consolidated into one rate for regressing with the above independent variables to evaluate overall impact of DSRIP. An interrupted time series regression was performed after accounting for auto-correlation, stationarity and seasonality in the dataset. In the individual regression model, PH rates showed statistically significant coefficients for seven out of eight conditions in DSRIP participating hospitals. In the combined regression model, the coefficient of the PH rate showed a statistically significant decrease with negative p-values for regression coefficients in DSRIP participating hospitals compared to positive/increased p-values for regression coefficients in DSRIP non-participating hospitals. Several macro- and micro-level factors may have likely contributed DSRIP hospitals outperforming DSRIP non-participating hospitals. Healthcare organization/provider collaboration, support from healthcare professionals, DSRIP's design, state reimbursement and coordination in care delivery methods may have led to likely success of DSRIP. IV, a retrospective cohort study based on longitudinal data. Copyright © 2017 Elsevier Inc. All rights reserved.
Lanfredi, Mariangela; Candini, Valentina; Buizza, Chiara; Ferrari, Clarissa; Boero, Maria E; Giobbio, Gian M; Goldschmidt, Nicoletta; Greppo, Stefania; Iozzino, Laura; Maggi, Paolo; Melegari, Anna; Pasqualetti, Patrizio; Rossi, Giuseppe; de Girolamo, Giovanni
2014-05-15
Quality of life (QOL) has been considered an important outcome measure in psychiatric research and determinants of QOL have been widely investigated. We aimed at detecting predictors of QOL at baseline and at testing the longitudinal interrelations of the baseline predictors with QOL scores at a 1-year follow-up in a sample of patients living in Residential Facilities (RFs). Logistic regression models were adopted to evaluate the association between WHOQoL-Bref scores and potential determinants of QOL. In addition, all variables significantly associated with QOL domains in the final logistic regression model were included by using the Structural Equation Modeling (SEM). We included 139 patients with a diagnosis of schizophrenia spectrum. In the final logistic regression model level of activity, social support, age, service satisfaction, spiritual well-being and symptoms' severity were identified as predictors of QOL scores at baseline. Longitudinal analyses carried out by SEM showed that 40% of QOL follow-up variability was explained by QOL at baseline, and significant indirect effects toward QOL at follow-up were found for satisfaction with services and for social support. Rehabilitation plans for people with schizophrenia living in RFs should also consider mediators of change in subjective QOL such as satisfaction with mental health services. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
Khan, I.; Hawlader, Sophie Mohammad Delwer Hossain; Arifeen, Shams El; Moore, Sophie; Hills, Andrew P.; Wells, Jonathan C.; Persson, Lars-Åke; Kabir, Iqbal
2012-01-01
The aim of this study was to investigate the validity of the Tanita TBF 300A leg-to-leg bioimpedance analyzer for estimating fat-free mass (FFM) in Bangladeshi children aged 4-10 years and to develop novel prediction equations for use in this population, using deuterium dilution as the reference method. Two hundred Bangladeshi children were enrolled. The isotope dilution technique with deuterium oxide was used for estimation of total body water (TBW). FFM estimated by Tanita was compared with results of deuterium oxide dilution technique. Novel prediction equations were created for estimating FFM, using linear regression models, fitting child's height and impedance as predictors. There was a significant difference in FFM and percentage of body fat (BF%) between methods (p<0.01), Tanita underestimating TBW in boys (p=0.001) and underestimating BF% in girls (p<0.001). A basic linear regression model with height and impedance explained 83% of the variance in FFM estimated by deuterium oxide dilution technique. The best-fit equation to predict FFM from linear regression modelling was achieved by adding weight, sex, and age to the basic model, bringing the adjusted R2 to 89% (standard error=0.90, p<0.001). These data suggest Tanita analyzer may be a valid field-assessment technique in Bangladeshi children when using population-specific prediction equations, such as the ones developed here. PMID:23082630
NASA Astrophysics Data System (ADS)
Zhao, Wei; Fan, Shaojia; Guo, Hai; Gao, Bo; Sun, Jiaren; Chen, Laiguo
2016-11-01
The quantile regression (QR) method has been increasingly introduced to atmospheric environmental studies to explore the non-linear relationship between local meteorological conditions and ozone mixing ratios. In this study, we applied QR for the first time, together with multiple linear regression (MLR), to analyze the dominant meteorological parameters influencing the mean, 10th percentile, 90th percentile and 99th percentile of maximum daily 8-h average (MDA8) ozone concentrations in 2000-2015 in Hong Kong. The dominance analysis (DA) was used to assess the relative importance of meteorological variables in the regression models. Results showed that the MLR models worked better at suburban and rural sites than at urban sites, and worked better in winter than in summer. QR models performed better in summer for 99th and 90th percentiles and performed better in autumn and winter for 10th percentile. And QR models also performed better in suburban and rural areas for 10th percentile. The top 3 dominant variables associated with MDA8 ozone concentrations, changing with seasons and regions, were frequently associated with the six meteorological parameters: boundary layer height, humidity, wind direction, surface solar radiation, total cloud cover and sea level pressure. Temperature rarely became a significant variable in any season, which could partly explain the peak of monthly average ozone concentrations in October in Hong Kong. And we found the effect of solar radiation would be enhanced during extremely ozone pollution episodes (i.e., the 99th percentile). Finally, meteorological effects on MDA8 ozone had no significant changes before and after the 2010 Asian Games.
Blanco, Luis E; Aragón, Aurora; Lundberg, Ingvar; Lidén, Carola; Wesseling, Catharina; Nise, Gun
2005-01-01
Identification of pesticide exposure determinants has become an issue in explaining exposure variability and improving control measures. Most studies have been conducted in industrialized countries. The aim of this study was to identify relevant dermal exposure determinants among Nicaraguan subsistence farmers. Field data on possible determinants were collected during 32 pesticide applications through observation and supplementary videorecording. A multistep reduction strategy brought down the 110 potential exposure determinants to 27 variables, which were grouped as worksite, spray equipment, working practices, clothing or hygiene practices related. Dermal exposure was quantified with a modification of Fenske's visual scoring method. Multivariate linear regression modeling within groups and across groups was performed. In the within-group analyses, work practices, spray equipment and worksite related determinants explained 52, 33 and 25% of the exposure variability, respectively. Clothing and hygiene practices were weaker determinants and did not always reduce the exposure. The final model included determinants from all groups except hygiene practices and explained 69% of the exposure variability. A less restricted model increased the explained variability to 75%. Several novel determinants were identified, including spraying on a muddy terrain, dew on plants, sealing the tank lid with a cloth and wiping sweat from the face. This study showed that a combination of observation and visual scoring techniques can provide valuable information on determinants of pesticide exposure and affected body parts under developing country conditions. The results could be used to develop job-specific questionnaires and to design training and preventive programs.
Moloney, Niamh; Beales, Darren; Azoory, Roxanne; Hübscher, Markus; Waller, Robert; Gibbons, Rebekah; Rebbeck, Trudy
2018-06-14
Pain sensitivity and psychosocial issues are prognostic of poor outcome in acute neck disorders. However, knowledge of associations between pain sensitivity and ongoing pain and disability in chronic neck pain are lacking. We aimed to investigate associations of pain sensitivity with pain and disability at the 12-month follow-up in people with chronic neck pain. The predictor variables were: clinical and quantitative sensory testing (cold, pressure); neural tissue sensitivity; neuropathic symptoms; comorbidities; sleep; psychological distress; pain catastrophizing; pain intensity (for the model explaining disability at 12 months only); and disability (for the model explaining pain at 12 months only). Data were analysed using uni- and multivariate regression models to assess associations with pain and disability at the 12-month follow-up (n = 64 at baseline, n = 51 at follow-up). Univariable associations between all predictor variables and pain and disability were evident (r > 0.3; p < 0.05), except for cold and pressure pain thresholds and cold sensitivity. For disability at the 12-month follow-up, 24.0% of the variance was explained by psychological distress and comorbidities. For pain at 12 months, 39.8% of the variance was explained primarily by baseline disability. Neither clinical nor quantitative measures of pain sensitivity were meaningfully associated with long-term patient-reported outcomes in people with chronic neck pain, limiting their clinical application in evaluating prognosis. Copyright © 2018 John Wiley & Sons, Ltd.
Aerobic fitness, maturation, and training experience in youth basketball.
Carvalho, Humberto M; Coelho-e-Silva, Manuel J; Eisenmann, Joey C; Malina, Robert M
2013-07-01
Relationships among chronological age (CA), maturation, training experience, and body dimensions with peak oxygen uptake (VO2max) were considered in male basketball players 14-16 y of age. Data for all players included maturity status estimated as percentage of predicted adult height attained at the time of the study (Khamis-Roche protocol), years of training, body dimensions, and VO2max (incremental maximal test on a treadmill). Proportional allometric models derived from stepwise regressions were used to incorporate either CA or maturity status and to incorporate years of formal training in basketball. Estimates for size exponents (95% CI) from the separate allometric models for VO2max were height 2.16 (1.23-3.09), body mass 0.65 (0.37-0.93), and fat-free mass 0.73 (0.46-1.02). Body dimensions explained 39% to 44% of variance. The independent variables in the proportional allometric models explained 47% to 60% of variance in VO2max. Estimated maturity status (11-16% of explained variance) and training experience (7-11% of explained variance) were significant predictors with either body mass or estimated fat-free mass (P ≤ .01) but not with height. Biological maturity status and training experience in basketball had a significant contribution to VO2max via body mass and fat-free fat mass and also had an independent positive relation with aerobic performance. The results highlight the importance of considering variation associated with biological maturation in aerobic performance of late-adolescent boys.
Exploring patient satisfaction predictors in relation to a theoretical model.
Grøndahl, Vigdis Abrahamsen; Hall-Lord, Marie Louise; Karlsson, Ingela; Appelgren, Jari; Wilde-Larsson, Bodil
2013-01-01
The aim is to describe patients' care quality perceptions and satisfaction and to explore potential patient satisfaction predictors as person-related conditions, external objective care conditions and patients' perception of actual care received ("PR") in relation to a theoretical model. A cross-sectional design was used. Data were collected using one questionnaire combining questions from four instruments: Quality from patients' perspective; Sense of coherence; Big five personality trait; and Emotional stress reaction questionnaire (ESRQ), together with questions from previous research. In total, 528 patients (83.7 per cent response rate) from eight medical, three surgical and one medical/surgical ward in five Norwegian hospitals participated. Answers from 373 respondents with complete ESRQ questionnaires were analysed. Sequential multiple regression analysis with ESRQ as dependent variable was run in three steps: person-related conditions, external objective care conditions, and PR (p < 0.05). Step 1 (person-related conditions) explained 51.7 per cent of the ESRQ variance. Step 2 (external objective care conditions) explained an additional 2.4 per cent. Step 3 (PR) gave no significant additional explanation (0.05 per cent). Steps 1 and 2 contributed statistical significance to the model. Patients rated both quality-of-care and satisfaction highly. The paper shows that the theoretical model using an emotion-oriented approach to assess patient satisfaction can explain 54 per cent of patient satisfaction in a statistically significant manner.
Zoellner, Jamie M.; Porter, Kathleen J.; Chen, Yvonnes; Hedrick, Valisa E.; You, Wen; Hickman, Maja; Estabrooks, Paul A.
2017-01-01
Objective Guided by the theory of planned behaviour (TPB) and health literacy concepts, SIPsmartER is a six-month multicomponent intervention effective at improving SSB behaviours. Using SIPsmartER data, this study explores prediction of SSB behavioural intention (BI) and behaviour from TPB constructs using: (1) cross-sectional and prospective models and (2) 11 single-item assessments from interactive voice response (IVR) technology. Design Quasi-experimental design, including pre- and post-outcome data and repeated-measures process data of 155 intervention participants. Main Outcome Measures Validated multi-item TPB measures, single-item TPB measures, and self-reported SSB behaviours. Hypothesised relationships were investigated using correlation and multiple regression models. Results TPB constructs explained 32% of the variance cross sectionally and 20% prospectively in BI; and explained 13–20% of variance cross sectionally and 6% prospectively. Single-item scale models were significant, yet explained less variance. All IVR models predicting BI (average 21%, range 6–38%) and behaviour (average 30%, range 6–55%) were significant. Conclusion Findings are interpreted in the context of other cross-sectional, prospective and experimental TPB health and dietary studies. Findings advance experimental application of the TPB, including understanding constructs at outcome and process time points and applying theory in all intervention development, implementation and evaluation phases. PMID:28165771
Influenza epidemics, seasonality, and the effects of cold weather on cardiac mortality
2012-01-01
Background More people die in the winter from cardiac disease, and there are competing hypotheses to explain this. The authors conducted a study in 48 US cities to determine how much of the seasonal pattern in cardiac deaths could be explained by influenza epidemics, whether that allowed a more parsimonious control for season than traditional spline models, and whether such control changed the short term association with temperature. Methods The authors obtained counts of daily cardiac deaths and of emergency hospital admissions of the elderly for influenza during 1992–2000. Quasi-Poisson regression models were conducted estimating the association between daily cardiac mortality, and temperature. Results Controlling for influenza admissions provided a more parsimonious model with better Generalized Cross-Validation, lower residual serial correlation, and better captured Winter peaks. The temperature-response function was not greatly affected by adjusting for influenza. The pooled estimated increase in risk for a temperature decrease from 0 to −5°C was 1.6% (95% confidence interval (CI) 1.1-2.1%). Influenza accounted for 2.3% of cardiac deaths over this period. Conclusions The results suggest that including epidemic data explained most of the irregular seasonal pattern (about 18% of the total seasonal variation), allowing more parsimonious models than when adjusting for seasonality only with smooth functions of time. The effect of cold temperature is not confounded by epidemics. PMID:23025494
Pisanti, R
2012-01-01
Nursing is generally considered to be a stressful profession. The purpose of the present study was to test the core hypotheses of the job demands-control-social support model (JDCS) of Karasek & Theorell (1990). In order to refine and extend the JDCS model, we also analyzed the direct and interactive role of three coping strategies: task- oriented, emotion-oriented, and avoidance-oriented coping. Questionnaire data from 1383 nurses (77%female) were collected. Controlling for demographic variables and non-linearity of the associations between job characteristics and outcomes (job satisfaction; burnout dimensions, psychological distress, and somatic complaints), hierarchical regression analyses indicated that job control and social support combined additively (p < 0.001) with job demands to explain the wellbeing outcomes (explained variance between 6% and 28%). Coping strategies accounted for additional variance (p < 0.001; explained variance between 4% and 15%) in all outcomes except in job satisfaction. Support was found for main effects of coping. Coping strategies did not moderate the impact of job characteristics on burnout and wellbeing. Emotion-oriented coping emerged as the most important predictor and was consistently associated with higher burnout levels and lower wellbeing levels. The results demonstrated the need to include the role of individual variables in the JDCS model. The limitations of the study, and theoretical and practical implications are discussed.
Zhang, Xingyu; Kim, Joyce; Patzer, Rachel E; Pitts, Stephen R; Patzer, Aaron; Schrager, Justin D
2017-10-26
To describe and compare logistic regression and neural network modeling strategies to predict hospital admission or transfer following initial presentation to Emergency Department (ED) triage with and without the addition of natural language processing elements. Using data from the National Hospital Ambulatory Medical Care Survey (NHAMCS), a cross-sectional probability sample of United States EDs from 2012 and 2013 survey years, we developed several predictive models with the outcome being admission to the hospital or transfer vs. discharge home. We included patient characteristics immediately available after the patient has presented to the ED and undergone a triage process. We used this information to construct logistic regression (LR) and multilayer neural network models (MLNN) which included natural language processing (NLP) and principal component analysis from the patient's reason for visit. Ten-fold cross validation was used to test the predictive capacity of each model and receiver operating curves (AUC) were then calculated for each model. Of the 47,200 ED visits from 642 hospitals, 6,335 (13.42%) resulted in hospital admission (or transfer). A total of 48 principal components were extracted by NLP from the reason for visit fields, which explained 75% of the overall variance for hospitalization. In the model including only structured variables, the AUC was 0.824 (95% CI 0.818-0.830) for logistic regression and 0.823 (95% CI 0.817-0.829) for MLNN. Models including only free-text information generated AUC of 0.742 (95% CI 0.731- 0.753) for logistic regression and 0.753 (95% CI 0.742-0.764) for MLNN. When both structured variables and free text variables were included, the AUC reached 0.846 (95% CI 0.839-0.853) for logistic regression and 0.844 (95% CI 0.836-0.852) for MLNN. The predictive accuracy of hospital admission or transfer for patients who presented to ED triage overall was good, and was improved with the inclusion of free text data from a patient's reason for visit regardless of modeling approach. Natural language processing and neural networks that incorporate patient-reported outcome free text may increase predictive accuracy for hospital admission.
David, Ingrid; Garreau, Hervé; Balmisse, Elodie; Billon, Yvon; Canario, Laurianne
2017-01-20
Some genetic studies need to take into account correlations between traits that are repeatedly measured over time. Multiple-trait random regression models are commonly used to analyze repeated traits but suffer from several major drawbacks. In the present study, we developed a multiple-trait extension of the structured antedependence model (SAD) to overcome this issue and validated its usefulness by modeling the association between litter size (LS) and average birth weight (ABW) over parities in pigs and rabbits. The single-trait SAD model assumes that a random effect at time [Formula: see text] can be explained by the previous values of the random effect (i.e. at previous times). The proposed multiple-trait extension of the SAD model consists in adding a cross-antedependence parameter to the single-trait SAD model. This model can be easily fitted using ASReml and the OWN Fortran program that we have developed. In comparison with the random regression model, we used our multiple-trait SAD model to analyze the LS and ABW of 4345 litters from 1817 Large White sows and 8706 litters from 2286 L-1777 does over a maximum of five successive parities. For both species, the multiple-trait SAD fitted the data better than the random regression model. The difference between AIC of the two models (AIC_random regression-AIC_SAD) were equal to 7 and 227 for pigs and rabbits, respectively. A similar pattern of heritability and correlation estimates was obtained for both species. Heritabilities were lower for LS (ranging from 0.09 to 0.29) than for ABW (ranging from 0.23 to 0.39). The general trend was a decrease of the genetic correlation for a given trait between more distant parities. Estimates of genetic correlations between LS and ABW were negative and ranged from -0.03 to -0.52 across parities. No correlation was observed between the permanent environmental effects, except between the permanent environmental effects of LS and ABW of the same parity, for which the estimate of the correlation was strongly negative (ranging from -0.57 to -0.67). We demonstrated that application of our multiple-trait SAD model is feasible for studying several traits with repeated measurements and showed that it provided a better fit to the data than the random regression model.
Modelling Ecuador's rainfall distribution according to geographical characteristics.
NASA Astrophysics Data System (ADS)
Tobar, Vladimiro; Wyseure, Guido
2017-04-01
It is known that rainfall is affected by terrain characteristics and some studies had focussed on its distribution over complex terrain. Ecuador's temporal and spatial rainfall distribution is affected by its location on the ITCZ, the marine currents in the Pacific, the Amazon rainforest, and the Andes mountain range. Although all these factors are important, we think that the latter one may hold a key for modelling spatial and temporal distribution of rainfall. The study considered 30 years of monthly data from 319 rainfall stations having at least 10 years of data available. The relatively low density of stations and their location in accessible sites near to main roads or rivers, leave large and important areas ungauged, making it not appropriate to rely on traditional interpolation techniques to estimate regional rainfall for water balance. The aim of this research was to come up with a useful model for seasonal rainfall distribution in Ecuador based on geographical characteristics to allow its spatial generalization. The target for modelling was the seasonal rainfall, characterized by nine percentiles for each one of the 12 months of the year that results in 108 response variables, later on reduced to four principal components comprising 94% of the total variability. Predictor variables for the model were: geographic coordinates, elevation, main wind effects from the Amazon and Coast, Valley and Hill indexes, and average and maximum elevation above the selected rainfall station to the east and to the west, for each one of 18 directions (50-135°, by 5°) adding up to 79 predictors. A multiple linear regression model by the Elastic-net algorithm with cross-validation was applied for each one of the PC as response to select the most important ones from the 79 predictor variables. The Elastic-net algorithm deals well with collinearity problems, while allowing variable selection in a blended approach between the Ridge and Lasso regression. The model fitting produced explained variances of 59%, 81%, 49% and 17% for PC1, PC2, PC3 and PC4, respectively, backing up the hypothesis of good correlation between geographical characteristics and seasonal rainfall patterns (comprised in the four principal components). With the obtained coefficients from the regression, the 108 rainfall percentiles for each station were back estimated giving very good results when compared with the original ones, with an overall 60% explained variance.
Li, Aihua; Dhakal, Shital; Glenn, Nancy F.; Spaete, Luke P.; Shinneman, Douglas; Pilliod, David S.; Arkle, Robert; McIlroy, Susan
2017-01-01
Our study objectives were to model the aboveground biomass in a xeric shrub-steppe landscape with airborne light detection and ranging (Lidar) and explore the uncertainty associated with the models we created. We incorporated vegetation vertical structure information obtained from Lidar with ground-measured biomass data, allowing us to scale shrub biomass from small field sites (1 m subplots and 1 ha plots) to a larger landscape. A series of airborne Lidar-derived vegetation metrics were trained and linked with the field-measured biomass in Random Forests (RF) regression models. A Stepwise Multiple Regression (SMR) model was also explored as a comparison. Our results demonstrated that the important predictors from Lidar-derived metrics had a strong correlation with field-measured biomass in the RF regression models with a pseudo R2 of 0.76 and RMSE of 125 g/m2 for shrub biomass and a pseudo R2 of 0.74 and RMSE of 141 g/m2 for total biomass, and a weak correlation with field-measured herbaceous biomass. The SMR results were similar but slightly better than RF, explaining 77–79% of the variance, with RMSE ranging from 120 to 129 g/m2 for shrub and total biomass, respectively. We further explored the computational efficiency and relative accuracies of using point cloud and raster Lidar metrics at different resolutions (1 m to 1 ha). Metrics derived from the Lidar point cloud processing led to improved biomass estimates at nearly all resolutions in comparison to raster-derived Lidar metrics. Only at 1 m were the results from the point cloud and raster products nearly equivalent. The best Lidar prediction models of biomass at the plot-level (1 ha) were achieved when Lidar metrics were derived from an average of fine resolution (1 m) metrics to minimize boundary effects and to smooth variability. Overall, both RF and SMR methods explained more than 74% of the variance in biomass, with the most important Lidar variables being associated with vegetation structure and statistical measures of this structure (e.g., standard deviation of height was a strong predictor of biomass). Using our model results, we developed spatially-explicit Lidar estimates of total and shrub biomass across our study site in the Great Basin, U.S.A., for monitoring and planning in this imperiled ecosystem.
Fire frequency in the Interior Columbia River Basin: Building regional models from fire history data
McKenzie, D.; Peterson, D.L.; Agee, James K.
2000-01-01
Fire frequency affects vegetation composition and successional pathways; thus it is essential to understand fire regimes in order to manage natural resources at broad spatial scales. Fire history data are lacking for many regions for which fire management decisions are being made, so models are needed to estimate past fire frequency where local data are not yet available. We developed multiple regression models and tree-based (classification and regression tree, or CART) models to predict fire return intervals across the interior Columbia River basin at 1-km resolution, using georeferenced fire history, potential vegetation, cover type, and precipitation databases. The models combined semiqualitative methods and rigorous statistics. The fire history data are of uneven quality; some estimates are based on only one tree, and many are not cross-dated. Therefore, we weighted the models based on data quality and performed a sensitivity analysis of the effects on the models of estimation errors that are due to lack of cross-dating. The regression models predict fire return intervals from 1 to 375 yr for forested areas, whereas the tree-based models predict a range of 8 to 150 yr. Both types of models predict latitudinal and elevational gradients of increasing fire return intervals. Examination of regional-scale output suggests that, although the tree-based models explain more of the variation in the original data, the regression models are less likely to produce extrapolation errors. Thus, the models serve complementary purposes in elucidating the relationships among fire frequency, the predictor variables, and spatial scale. The models can provide local managers with quantitative information and provide data to initialize coarse-scale fire-effects models, although predictions for individual sites should be treated with caution because of the varying quality and uneven spatial coverage of the fire history database. The models also demonstrate the integration of qualitative and quantitative methods when requisite data for fully quantitative models are unavailable. They can be tested by comparing new, independent fire history reconstructions against their predictions and can be continually updated, as better fire history data become available.
Mahdavi, Mahdi; Vissers, Jan; Elkhuizen, Sylvia; van Dijk, Mattees; Vanhala, Antero; Karampli, Eleftheria; Faubel, Raquel; Forte, Paul; Coroian, Elena; van de Klundert, Joris
2018-01-01
While health service provisioning for the chronic condition Type 2 Diabetes (T2D) often involves a network of organisations and professionals, most evidence on the relationships between the structures and processes of service provisioning and the outcomes considers single organisations or solo practitioners. Extending Donabedian's Structure-Process-Outcome (SPO) model, we investigate how differences in quality of life, effective coverage of diabetes, and service satisfaction are associated with differences in the structures, processes, and context of T2D services in six regions in Finland, Germany, Greece, Netherlands, Spain, and UK. Data collection consisted of: a) systematic modelling of provider network's structures and processes, and b) a cross-sectional survey of patient reported outcomes and other information. The survey resulted in data from 1459 T2D patients, during 2011-2012. Stepwise linear regression models were used to identify how independent cumulative proportion of variance in quality of life and service satisfaction are related to differences in context, structure and process. The selected context, structure and process variables are based on Donabedian's SPO model, a service quality research instrument (SERVQUAL), and previous organization and professional level evidence. Additional analysis deepens the possible bidirectional relation between outcomes and processes. The regression models explain 44% of variance in service satisfaction, mostly by structure and process variables (such as human resource use and the SERVQUAL dimensions). The models explained 23% of variance in quality of life between the networks, much of which is related to contextual variables. Our results suggest that effectiveness of A1c control is negatively correlated with process variables such as total hours of care provided per year and cost of services per year. While the selected structure and process variables explain much of the variance in service satisfaction, this is less the case for quality of life. Moreover, it appears that the effect of the clinical outcome A1c control on processes is stronger than the other way around, as poorer control seems to relate to more service use, and higher cost. The standardized operational models used in this research prove to form a basis for expanding the network level evidence base for effective T2D service provisioning.
Elkhuizen, Sylvia; van Dijk, Mattees; Vanhala, Antero; Karampli, Eleftheria; Faubel, Raquel; Forte, Paul; Coroian, Elena
2018-01-01
Background While health service provisioning for the chronic condition Type 2 Diabetes (T2D) often involves a network of organisations and professionals, most evidence on the relationships between the structures and processes of service provisioning and the outcomes considers single organisations or solo practitioners. Extending Donabedian’s Structure-Process-Outcome (SPO) model, we investigate how differences in quality of life, effective coverage of diabetes, and service satisfaction are associated with differences in the structures, processes, and context of T2D services in six regions in Finland, Germany, Greece, Netherlands, Spain, and UK. Methods Data collection consisted of: a) systematic modelling of provider network’s structures and processes, and b) a cross-sectional survey of patient reported outcomes and other information. The survey resulted in data from 1459 T2D patients, during 2011–2012. Stepwise linear regression models were used to identify how independent cumulative proportion of variance in quality of life and service satisfaction are related to differences in context, structure and process. The selected context, structure and process variables are based on Donabedian’s SPO model, a service quality research instrument (SERVQUAL), and previous organization and professional level evidence. Additional analysis deepens the possible bidirectional relation between outcomes and processes. Results The regression models explain 44% of variance in service satisfaction, mostly by structure and process variables (such as human resource use and the SERVQUAL dimensions). The models explained 23% of variance in quality of life between the networks, much of which is related to contextual variables. Our results suggest that effectiveness of A1c control is negatively correlated with process variables such as total hours of care provided per year and cost of services per year. Conclusions While the selected structure and process variables explain much of the variance in service satisfaction, this is less the case for quality of life. Moreover, it appears that the effect of the clinical outcome A1c control on processes is stronger than the other way around, as poorer control seems to relate to more service use, and higher cost. The standardized operational models used in this research prove to form a basis for expanding the network level evidence base for effective T2D service provisioning. PMID:29447220
Cairney, John; Eisenmann, Joe; Pfeiffer, Karin; Gould, Dan
2018-01-01
Children who are overweight and obese display lower physical activity levels than normal weight peers. Measures of weight status, perceived motor competence, and motor skill performance have been identified as potential correlates explaining this discrepancy. 1881 children (955 males; 926 females; 9.9 years) were assessed as part of the Physical Health Activity Study Team project. The age, habitual physical activity participation (PAP), body mass index (BMI), socioeconomic status (SES), motor performance (MP), and perceived athletic competence (PAC) of each child included were assessed. Gender-specific linear regression analyses (main effects model) were conducted to identify the percent variance in PAP explained by the following variables: BMI, MP, and PAC. For males, 18.3% of the variance in PAP was explained by BMI, MP, and PAC. PAC explained 17% of the variance, while MP, BMI, and SES only accounted for 0.6%, 0.7%, and 0.5%, respectively. PAC explained 17.5% of PAP variance in females; MP explained 0.8%. BMI, SES, and chronological age were not significant correlates of PAP in girls. An established repertoire of motor skill performance has been seen as a vehicle to PAP in children; however, this study indicates that PAC should not be overlooked in intervention strategies to promote increased PAP. PMID:29854437
Morrison, Kyle M; Cairney, John; Eisenmann, Joe; Pfeiffer, Karin; Gould, Dan
2018-01-01
Children who are overweight and obese display lower physical activity levels than normal weight peers. Measures of weight status, perceived motor competence, and motor skill performance have been identified as potential correlates explaining this discrepancy. 1881 children (955 males; 926 females; 9.9 years) were assessed as part of the Physical Health Activity Study Team project. The age, habitual physical activity participation (PAP), body mass index (BMI), socioeconomic status (SES), motor performance (MP), and perceived athletic competence (PAC) of each child included were assessed. Gender-specific linear regression analyses (main effects model) were conducted to identify the percent variance in PAP explained by the following variables: BMI, MP, and PAC. For males, 18.3% of the variance in PAP was explained by BMI, MP, and PAC. PAC explained 17% of the variance, while MP, BMI, and SES only accounted for 0.6%, 0.7%, and 0.5%, respectively. PAC explained 17.5% of PAP variance in females; MP explained 0.8%. BMI, SES, and chronological age were not significant correlates of PAP in girls. An established repertoire of motor skill performance has been seen as a vehicle to PAP in children; however, this study indicates that PAC should not be overlooked in intervention strategies to promote increased PAP.
Estimation of Particulate Mass and Manganese Exposure Levels among Welders
Hobson, Angela; Seixas, Noah; Sterling, David; Racette, Brad A.
2011-01-01
Background: Welders are frequently exposed to Manganese (Mn), which may increase the risk of neurological impairment. Historical exposure estimates for welding-exposed workers are needed for epidemiological studies evaluating the relationship between welding and neurological or other health outcomes. The objective of this study was to develop and validate a multivariate model to estimate quantitative levels of welding fume exposures based on welding particulate mass and Mn concentrations reported in the published literature. Methods: Articles that described welding particulate and Mn exposures during field welding activities were identified through a comprehensive literature search. Summary measures of exposure and related determinants such as year of sampling, welding process performed, type of ventilation used, degree of enclosure, base metal, and location of sampling filter were extracted from each article. The natural log of the reported arithmetic mean exposure level was used as the dependent variable in model building, while the independent variables included the exposure determinants. Cross-validation was performed to aid in model selection and to evaluate the generalizability of the models. Results: A total of 33 particulate and 27 Mn means were included in the regression analysis. The final model explained 76% of the variability in the mean exposures and included welding process and degree of enclosure as predictors. There was very little change in the explained variability and root mean squared error between the final model and its cross-validation model indicating the final model is robust given the available data. Conclusions: This model may be improved with more detailed exposure determinants; however, the relatively large amount of variance explained by the final model along with the positive generalizability results of the cross-validation increases the confidence that the estimates derived from this model can be used for estimating welder exposures in absence of individual measurement data. PMID:20870928
Validity of VO(2 max) in predicting blood volume: implications for the effect of fitness on aging
NASA Technical Reports Server (NTRS)
Convertino, V. A.; Ludwig, D. A.
2000-01-01
A multiple regression model was constructed to investigate the premise that blood volume (BV) could be predicted using several anthropometric variables, age, and maximal oxygen uptake (VO(2 max)). To test this hypothesis, age, calculated body surface area (height/weight composite), percent body fat (hydrostatic weight), and VO(2 max) were regressed on to BV using data obtained from 66 normal healthy men. Results from the evaluation of the full model indicated that the most parsimonious result was obtained when age and VO(2 max) were regressed on BV expressed per kilogram body weight. The full model accounted for 52% of the total variance in BV per kilogram body weight. Both age and VO(2 max) were related to BV in the positive direction. Percent body fat contributed <1% to the explained variance in BV when expressed in absolute BV (ml) or as BV per kilogram body weight. When the model was cross validated on 41 new subjects and BV per kilogram body weight was reexpressed as raw BV, the results indicated that the statistical model would be stable under cross validation (e.g., predictive applications) with an accuracy of +/- 1,200 ml at 95% confidence. Our results support the hypothesis that BV is an increasing function of aerobic fitness and to a lesser extent the age of the subject. The results may have implication as to a mechanism by which aerobic fitness and activity may be protective against reduced BV associated with aging.
Scheerman, Janneke F M; van Empelen, Pepijn; van Loveren, Cor; Pakpour, Amir H; van Meijel, Berno; Gholami, Maryam; Mierzaie, Zaher; van den Braak, Matheus C T; Verrips, Gijsbert H W
2017-11-01
The Health Action Process Approach (HAPA) model addresses health behaviours, but it has never been applied to model adolescents' oral hygiene behaviour during fixed orthodontic treatment. This study aimed to apply the HAPA model to explain adolescents' oral hygiene behaviour and dental plaque during orthodontic treatment with fixed appliances. In this cross-sectional study, 116 adolescents with fixed appliances from an orthodontic clinic situated in Almere (the Netherlands) completed a questionnaire assessing oral health behaviours and the psychosocial factors of the HAPA model. Linear regression analyses were performed to examine the factors associated with dental plaque, toothbrushing, and the use of a proxy brush. Stepwise regression analysis showed that lower amounts of plaque were significantly associated with higher frequency of the use of a proxy brush (R 2 = 45%), higher intention of the use of a proxy brush (R 2 = 5%), female gender (R 2 = 2%), and older age (R 2 = 2%). The multiple regression analyses revealed that higher action self-efficacy, intention, maintenance self-efficacy, and a higher education were significantly associated with the use of a proxy brush (R 2 = 45%). Decreased levels of dental plaque are mainly associated with increased use of a proxy brush that is subsequently associated with a higher intention and self-efficacy to use the proxy brush. © 2017 BSPD, IAPD and John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Advances in nowcasting influenza-like illness rates using search query logs
NASA Astrophysics Data System (ADS)
Lampos, Vasileios; Miller, Andrew C.; Crossan, Steve; Stefansen, Christian
2015-08-01
User-generated content can assist epidemiological surveillance in the early detection and prevalence estimation of infectious diseases, such as influenza. Google Flu Trends embodies the first public platform for transforming search queries to indications about the current state of flu in various places all over the world. However, the original model significantly mispredicted influenza-like illness rates in the US during the 2012-13 flu season. In this work, we build on the previous modeling attempt, proposing substantial improvements. Firstly, we investigate the performance of a widely used linear regularized regression solver, known as the Elastic Net. Then, we expand on this model by incorporating the queries selected by the Elastic Net into a nonlinear regression framework, based on a composite Gaussian Process. Finally, we augment the query-only predictions with an autoregressive model, injecting prior knowledge about the disease. We assess predictive performance using five consecutive flu seasons spanning from 2008 to 2013 and qualitatively explain certain shortcomings of the previous approach. Our results indicate that a nonlinear query modeling approach delivers the lowest cumulative nowcasting error, and also suggest that query information significantly improves autoregressive inferences, obtaining state-of-the-art performance.
Advances in nowcasting influenza-like illness rates using search query logs.
Lampos, Vasileios; Miller, Andrew C; Crossan, Steve; Stefansen, Christian
2015-08-03
User-generated content can assist epidemiological surveillance in the early detection and prevalence estimation of infectious diseases, such as influenza. Google Flu Trends embodies the first public platform for transforming search queries to indications about the current state of flu in various places all over the world. However, the original model significantly mispredicted influenza-like illness rates in the US during the 2012-13 flu season. In this work, we build on the previous modeling attempt, proposing substantial improvements. Firstly, we investigate the performance of a widely used linear regularized regression solver, known as the Elastic Net. Then, we expand on this model by incorporating the queries selected by the Elastic Net into a nonlinear regression framework, based on a composite Gaussian Process. Finally, we augment the query-only predictions with an autoregressive model, injecting prior knowledge about the disease. We assess predictive performance using five consecutive flu seasons spanning from 2008 to 2013 and qualitatively explain certain shortcomings of the previous approach. Our results indicate that a nonlinear query modeling approach delivers the lowest cumulative nowcasting error, and also suggest that query information significantly improves autoregressive inferences, obtaining state-of-the-art performance.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Lebersorger, S.; Beigl, P., E-mail: peter.beigl@boku.ac.at
Waste management planning requires reliable data concerning waste generation, influencing factors on waste generation and forecasts of waste quantities based on facts. This paper aims at identifying and quantifying differences between different municipalities' municipal solid waste (MSW) collection quantities based on data from waste management and on socio-economic indicators. A large set of 116 indicators from 542 municipalities in the Province of Styria was investigated. The resulting regression model included municipal tax revenue per capita, household size and the percentage of buildings with solid fuel heating systems. The model explains 74.3% of the MSW variation and the model assumptions aremore » met. Other factors such as tourism, home composting or age distribution of the population did not significantly improve the model. According to the model, 21% of MSW collected in Styria was commercial waste and 18% of the generated MSW was burned in domestic heating systems. While the percentage of commercial waste is consistent with literature data, practically no literature data are available for the quantity of MSW burned, which seems to be overestimated by the model. The resulting regression model was used as basis for a waste prognosis model (Beigl and Lebersorger, in preparation).« less
Lebersorger, S; Beigl, P
2011-01-01
Waste management planning requires reliable data concerning waste generation, influencing factors on waste generation and forecasts of waste quantities based on facts. This paper aims at identifying and quantifying differences between different municipalities' municipal solid waste (MSW) collection quantities based on data from waste management and on socio-economic indicators. A large set of 116 indicators from 542 municipalities in the Province of Styria was investigated. The resulting regression model included municipal tax revenue per capita, household size and the percentage of buildings with solid fuel heating systems. The model explains 74.3% of the MSW variation and the model assumptions are met. Other factors such as tourism, home composting or age distribution of the population did not significantly improve the model. According to the model, 21% of MSW collected in Styria was commercial waste and 18% of the generated MSW was burned in domestic heating systems. While the percentage of commercial waste is consistent with literature data, practically no literature data are available for the quantity of MSW burned, which seems to be overestimated by the model. The resulting regression model was used as basis for a waste prognosis model (Beigl and Lebersorger, in preparation). Copyright © 2011 Elsevier Ltd. All rights reserved.
Yang, Y-M; Lee, J; Kim, Y-I; Cho, B-H; Park, S-B
2014-08-01
This study aimed to determine the viability of using axial cervical vertebrae (ACV) as biological indicators of skeletal maturation and to build models that estimate ossification level with improved explanatory power over models based only on chronological age. The study population comprised 74 female and 47 male patients with available hand-wrist radiographs and cone-beam computed tomography images. Generalized Procrustes analysis was used to analyze the shape, size, and form of the ACV regions of interest. The variabilities of these factors were analyzed by principal component analysis. Skeletal maturation was then estimated using a multiple regression model. Separate models were developed for male and female participants. For the female estimation model, the adjusted R(2) explained 84.8% of the variability of the Sempé maturation level (SML), representing a 7.9% increase in SML explanatory power over that using chronological age alone (76.9%). For the male estimation model, the adjusted R(2) was over 90%, representing a 1.7% increase relative to the reference model. The simplest possible ACV morphometric information provided a statistically significant explanation of the portion of skeletal-maturation variability not dependent on chronological age. These results verify that ACV is a strong biological indicator of ossification status. © 2014 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
The use of modelling to evaluate and adapt strategies for animal disease control.
Saegerman, C; Porter, S R; Humblet, M F
2011-08-01
Disease is often associated with debilitating clinical signs, disorders or production losses in animals and/or humans, leading to severe socio-economic repercussions. This explains the high priority that national health authorities and international organisations give to selecting control strategies for and the eradication of specific diseases. When a control strategy is selected and implemented, an effective method of evaluating its efficacy is through modelling. To illustrate the usefulness of models in evaluating control strategies, the authors describe several examples in detail, including three examples of classification and regression tree modelling to evaluate and improve the early detection of disease: West Nile fever in equids, bovine spongiform encephalopathy (BSE) and multifactorial diseases, such as colony collapse disorder (CCD) in the United States. Also examined are regression modelling to evaluate skin test practices and the efficacy of an awareness campaign for bovine tuberculosis (bTB); mechanistic modelling to monitor the progress of a control strategy for BSE; and statistical nationwide modelling to analyse the spatio-temporal dynamics of bTB and search for potential risk factors that could be used to target surveillance measures more effectively. In the accurate application of models, an interdisciplinary rather than a multidisciplinary approach is required, with the fewest assumptions possible.
The crux of the method: assumptions in ordinary least squares and logistic regression.
Long, Rebecca G
2008-10-01
Logistic regression has increasingly become the tool of choice when analyzing data with a binary dependent variable. While resources relating to the technique are widely available, clear discussions of why logistic regression should be used in place of ordinary least squares regression are difficult to find. The current paper compares and contrasts the assumptions of ordinary least squares with those of logistic regression and explains why logistic regression's looser assumptions make it adept at handling violations of the more important assumptions in ordinary least squares.
Røislien, Jo; Lossius, Hans Morten; Kristiansen, Thomas
2015-01-01
Background Trauma is a leading global cause of death. Trauma mortality rates are higher in rural areas, constituting a challenge for quality and equality in trauma care. The aim of the study was to explore population density and transport time to hospital care as possible predictors of geographical differences in mortality rates, and to what extent choice of statistical method might affect the analytical results and accompanying clinical conclusions. Methods Using data from the Norwegian Cause of Death registry, deaths from external causes 1998–2007 were analysed. Norway consists of 434 municipalities, and municipality population density and travel time to hospital care were entered as predictors of municipality mortality rates in univariate and multiple regression models of increasing model complexity. We fitted linear regression models with continuous and categorised predictors, as well as piecewise linear and generalised additive models (GAMs). Models were compared using Akaike's information criterion (AIC). Results Population density was an independent predictor of trauma mortality rates, while the contribution of transport time to hospital care was highly dependent on choice of statistical model. A multiple GAM or piecewise linear model was superior, and similar, in terms of AIC. However, while transport time was statistically significant in multiple models with piecewise linear or categorised predictors, it was not in GAM or standard linear regression. Conclusions Population density is an independent predictor of trauma mortality rates. The added explanatory value of transport time to hospital care is marginal and model-dependent, highlighting the importance of exploring several statistical models when studying complex associations in observational data. PMID:25972600
Harmon, Brook E.; Nigg, Claudio R.; Long, Camonia; Amato, Katie; Anwar, Mahabub-Ul; Kutchman, Eve; Anthamatten, Peter; Browning, Raymond C.; Brink, Lois; Hill, James O.
2014-01-01
Objectives Social Cognitive Theory (SCT) has often been used as a guide to predict and modify physical activity (PA) behavior. We assessed the ability of commonly investigated SCT variables and perceived school environment variables to predict PA among elementary students. We also examined differences in influences between Hispanic and non-Hispanic students. Design This analysis used baseline data collected from eight schools who participated in a four-year study of a combined school-day curriculum and environmental intervention. Methods Data were collected from 393 students. A 3-step linear regression was used to measure associations between PA level, SCT variables (self-efficacy, social support, enjoyment), and perceived environment variables (schoolyard structures, condition, equipment/supervision). Logistic regression assessed associations between variables and whether students met PA recommendations. Results School and sex explained 6% of the moderate-to-vigorous PA models' variation. SCT variables explained an additional 15% of the models' variation, with much of the model's predictive ability coming from self-efficacy and social support. Sex was more strongly associated with PA level among Hispanic students, while self-efficacy was more strongly associated among non-Hispanic students. Perceived environment variables contributed little to the models. Conclusions Our findings add to the literature on the influences of PA among elementary-aged students. The differences seen in the influence of sex and self-efficacy among non-Hispanic and Hispanic students suggests these are areas where PA interventions could be tailored to improve efficacy. Additional research is needed to understand if different measures of perceived environment or perceptions at different ages may better predict PA. PMID:24772004
McConnon, Aine; Raats, Monique; Astrup, Arne; Bajzová, Magda; Handjieva-Darlenska, Teodora; Lindroos, Anna Karin; Martinez, J Alfredo; Larson, Thomas Meinert; Papadaki, Angeliki; Pfeiffer, Andreas; van Baak, Marleen A; Shepherd, Richard
2012-02-01
Using the Theory of Planned Behaviour (TPB), this study investigates weight control in overweight and obese participants (27 kg/m(2)≤BMI<45 kg/m(2)) taking part in a dietary intervention trial targeted at weight loss maintenance (n=932). Respondents completed TPB measures investigating "weight gain prevention" at three time points. Correlation and regression analyses were used to investigate the relationship between TPB variables and weight regain. The TPB explained up to 27% variance in expectation, 14% in intention and 20% in desire scores. No relationship was established between intention, expectation or desire and behaviour at Time 1 or Time 2. Perceived need and subjective norm were found to be significantly related to weight regain, however, the model explained a maximum of 11% of the variation in weight regain. Better understanding of overweight individuals' trajectories of weight control is needed to help inform studies investigating people's weight regain behaviours. Future research using the TPB model to explain weight control should consider the likely behaviours being sought by individuals. Copyright © 2011 Elsevier Ltd. All rights reserved.
Comparing flood loss models of different complexity
NASA Astrophysics Data System (ADS)
Schröter, Kai; Kreibich, Heidi; Vogel, Kristin; Riggelsen, Carsten; Scherbaum, Frank; Merz, Bruno
2013-04-01
Any deliberation on flood risk requires the consideration of potential flood losses. In particular, reliable flood loss models are needed to evaluate cost-effectiveness of mitigation measures, to assess vulnerability, for comparative risk analysis and financial appraisal during and after floods. In recent years, considerable improvements have been made both concerning the data basis and the methodological approaches used for the development of flood loss models. Despite of that, flood loss models remain an important source of uncertainty. Likewise the temporal and spatial transferability of flood loss models is still limited. This contribution investigates the predictive capability of different flood loss models in a split sample cross regional validation approach. For this purpose, flood loss models of different complexity, i.e. based on different numbers of explaining variables, are learned from a set of damage records that was obtained from a survey after the Elbe flood in 2002. The validation of model predictions is carried out for different flood events in the Elbe and Danube river basins in 2002, 2005 and 2006 for which damage records are available from surveys after the flood events. The models investigated are a stage-damage model, the rule based model FLEMOps+r as well as novel model approaches which are derived using data mining techniques of regression trees and Bayesian networks. The Bayesian network approach to flood loss modelling provides attractive additional information concerning the probability distribution of both model predictions and explaining variables.
Preston, Stephen D.; Alexander, Richard B.; Woodside, Michael D.
2011-01-01
The U.S. Geological Survey (USGS) recently completed assessments of stream nutrients in six major regions extending over much of the conterminous United States. SPARROW (SPAtially Referenced Regressions On Watershed attributes) models were developed for each region to explain spatial patterns in monitored stream nutrient loads in relation to human activities and natural resources and processes. The model information, reported by stream reach and catchment, provides contrasting views of the spatial patterns of nutrient source contributions, including those from urban (wastewater effluent and diffuse runoff from developed land), agricultural (farm fertilizers and animal manure), and specific background sources (atmospheric nitrogen deposition, soil phosphorus, forest nitrogen fixation, and channel erosion).
NASA Astrophysics Data System (ADS)
Colen, Charles Raymond, Jr.
There have been numerous studies with ultrasonic nondestructive testing and wood fiber composites. The problem of the study was to ascertain whether ultrasonic nondestructive testing can be used in place of destructive testing to obtain the modulus of elasticity (MOE) of the wood/agricultural material with comparable results. The uniqueness of this research is that it addressed the type of content (cornstalks and switchgrass) being used with the wood fibers and the type of adhesives (soybean-based) associated with the production of these composite materials. Two research questions were addressed in the study. The major objective was to determine if one can predict the destructive test MOE value based on the nondestructive test MOE value. The population of the study was wood/agricultural fiberboards made from wood fibers, cornstalks, and switchgrass bonded together with soybean-based, urea-formaldehyde, and phenol-formaldehyde adhesives. Correlational analysis was used to determine if there was a relationship between the two tests. Regression analysis was performed to determine a prediction equation for the destructive test MOE value. Data were collected on both procedures using ultrasonic nondestructing testing and 3-point destructive testing. The results produced a simple linear regression model for this study which was adequate in the prediction of destructive MOE values if the nondestructive MOE value is known. An approximation very close to the entire error in the model equation was explained from the destructive test MOE values for the composites. The nondestructive MOE values used to produce a linear regression model explained 83% of the variability in the destructive test MOE values. The study also showed that, for the particular destructive test values obtained with the equipment used, the model associated with the study is as good as it could be due to the variability in the results from the destructive tests. In this study, an ultrasonic signal was used to determine the MOE values on nondestructive tests. Future research studies could use the same or other hardboards to examine how the resins affect the ultrasonic signal.
The Role of Habit and Perceived Control on Health Behavior among Pregnant Women.
Mullan, Barbara; Henderson, Joanna; Kothe, Emily; Allom, Vanessa; Orbell, Sheina; Hamilton, Kyra
2016-05-01
Many pregnant women do not adhere to physical activity and dietary recommendations. Research investigating what psychological processes might predict physical activity and healthy eating (fruit and vegetable consumption) during pregnancy is scant. We explored the role of intention, habit, and perceived behavioral control as predictors of physical activity and healthy eating. Pregnant women (N = 195, Mage = 30.17, SDage = 4.46) completed questionnaires at 2 time points. At Time 1, participants completed measures of intention, habit, and perceived behavioral control. At Time 2, participants reported on their behavior (physical activity and healthy eating) within the intervening week. Regression analysis determined whether Time 1 variables predicted behavior at Time 2. Interaction terms also were tested. Final regression models indicated that only intention and habit explained significant variance in physical activity, whereas habit and the interaction between intention and habit explained significant variance in healthy eating. Simple slopes analysis indicated that the relationship between intention and healthy eating behavior was only significant at high levels of habit. Findings highlight the influence of habit on behavior and suggest that automaticity interventions may be useful in changing health behaviors during pregnancy.
van der Meer, D; Hoekstra, P J; van Donkelaar, M; Bralten, J; Oosterlaan, J; Heslenfeld, D; Faraone, S V; Franke, B; Buitelaar, J K; Hartman, C A
2017-01-01
Identifying genetic variants contributing to attention-deficit/hyperactivity disorder (ADHD) is complicated by the involvement of numerous common genetic variants with small effects, interacting with each other as well as with environmental factors, such as stress exposure. Random forest regression is well suited to explore this complexity, as it allows for the analysis of many predictors simultaneously, taking into account any higher-order interactions among them. Using random forest regression, we predicted ADHD severity, measured by Conners’ Parent Rating Scales, from 686 adolescents and young adults (of which 281 were diagnosed with ADHD). The analysis included 17 374 single-nucleotide polymorphisms (SNPs) across 29 genes previously linked to hypothalamic–pituitary–adrenal (HPA) axis activity, together with information on exposure to 24 individual long-term difficulties or stressful life events. The model explained 12.5% of variance in ADHD severity. The most important SNP, which also showed the strongest interaction with stress exposure, was located in a region regulating the expression of telomerase reverse transcriptase (TERT). Other high-ranking SNPs were found in or near NPSR1, ESR1, GABRA6, PER3, NR3C2 and DRD4. Chronic stressors were more influential than single, severe, life events. Top hits were partly shared with conduct problems. We conclude that random forest regression may be used to investigate how multiple genetic and environmental factors jointly contribute to ADHD. It is able to implicate novel SNPs of interest, interacting with stress exposure, and may explain inconsistent findings in ADHD genetics. This exploratory approach may be best combined with more hypothesis-driven research; top predictors and their interactions with one another should be replicated in independent samples. PMID:28585928
Nicotine Dependence and Urinary Nicotine, Cotinine and Hydroxycotinine Levels in Daily Smokers.
Van Overmeire, Ilse P I; De Smedt, Tom; Dendale, Paul; Nackaerts, Kristiaan; Vanacker, Hilde; Vanoeteren, Jan F A; Van Laethem, Danny M G; Van Loco, Joris; De Cremer, Koen A J
2016-09-01
Nicotine dependence and smoking frequency are critical factors for smoking cessation. The aims of this study are (1) to determine if nicotine dependence Fagerström Test for Nicotine Dependence (FTND) scores are associated with urinary levels of nicotine metabolites, (2) to assess the relationship of hydroxycotinine/cotinine ratio with FTND score and cigarettes smoked per day (CPD), and (3) to identify significant predictors of cigarettes per day among biomarker concentrations and individual FTND items. Urine samples and questionnaire data of 239 daily smokers were obtained. Nicotine, cotinine and hydroxycotinine urinary levels were determined by UPLC MS/MS.Multiple linear regression models were developed to explore the relationship between nicotine, cotinine, hydroxycotinine levels and separate FTND scores (for all six items). We found significant correlations between the different urinary biomarker concentrations, and the FTND score. The time before the first cigarette after waking (TTFC) was significantly associated with the nicotine, cotinine and hydroxycotinine concentrations. No association was found between the ratio of hydroxycotinine to cotinine and either the FTND or the CPD. A model including four FTND questions, sex, age, and the cotinine concentration, accounted for 45% of the variance of CPD. There are significant relationships between urinary levels of nicotine, cotinine, and hydroxycotinine and the FTND score. Especially the FTND question about TTFC is relevant for explaining the biomarker concentrations. CPD (below 15) was significantly explained by four FTND dependence items and urinary cotinine levels in a regression model. We investigated associations between urinary levels of nicotine, cotinine, and hydroxycotinine in daily smokers and the FTND scores for nicotine dependence. We did not find association between the hydroxycotinine/cotinine ratio and CPD. We developed a model that explains the cigarettes smoked daily (CPD) in a group of light smokers by combining FTND items, urinary cotinine levels, sex, and age. Our results might be of importance for clinical use or future studies on larger smoking populations. © The Author 2016. Published by Oxford University Press on behalf of the Society for Research on Nicotine and Tobacco. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
NASA Astrophysics Data System (ADS)
Leung, D. M.; Tai, A. P. K.; Shen, L.; Moch, J. M.; van Donkelaar, A.; Mickley, L. J.
2017-12-01
Fine particulate matter (PM2.5) air quality is strongly dependent on not only on emissions but also meteorological conditions. Here we examine the dominant synoptic circulation patterns that control day-to-day PM2.5 variability over China. We perform principal component (PC) analysis on 1998-2016 NCEP/NCAR Reanalysis I daily meteorological fields to diagnose distinct synoptic meteorological modes, and perform PC regression on spatially interpolated 2014-2016 daily mean PM2.5 concentrations in China to identify modes dominantly explaining PM2.5 variability. We find that synoptic systems, e.g., cold-frontal passages, maritime inflow and frontal precipitation, can explain up to 40% of the day-to-day PM2.5 variability in major metropolitan regions in China. We further investigate how annually changing frequencies of synoptic systems, as well as changing local meteorology, drive interannual PM2.5 variability. We apply a spectral analysis on the PC time series to obtain the 1998-2016 annual median synoptic frequency, and use a forward-selection multiple linear regression (MLR) model of satellite-derived 1998-2015 annual mean PM2.5 concentrations on local meteorology and synoptic frequency, selecting predictors that explain the highest fraction of interannual PM2.5 variability while guarding against multicollinearity. To estimate the effect of climate change on future PM2.5 air quality, we project a multimodel ensemble of 15 CMIP5 models under the RCP8.5 scenario on the PM2.5-to-meteorology sensitivities derived for the present-day from the MLR model. Our results show that climate change could be responsible for increases in PM2.5 of more than 25 μg m-3 in northwestern China and 10 mg m-3 in northeastern China by the 2050s. Increases in synoptic frequency of cold-frontal passages cause only a modest 1 μg m-3 decrease in PM2.5 in North China Plain. Our analyses show that climate change imposes a significant penalty on air quality over China and poses serious threat on human health under the RCP8.5 future.
Lubin, Molly; Chen, Hubert; Elicker, Brett; Jones, Kirk D; Collard, Harold R; Lee, Joyce S
2014-06-01
Patients with interstitial lung disease (ILD) have poor health-related quality of life (HRQL). However, whether HRQL differs among different subtypes of ILD is unclear. The aim of this study was to determine whether HRQL was different among patients with idiopathic pulmonary fibrosis (IPF) and chronic hypersensitivity pneumonitis (CHP). We identified patients from an ongoing longitudinal cohort of patients with ILD. HRQL was assessed using the Short Form (SF)-36 medical outcomes form (version 2.0). Regression analysis was used to determine the association between clinical covariates and HRQL, primarily the physical component summary (PCS) and mental component summary (MCS) score. A multivariate regression model was created to identify potential covariates that could help explain the association between the ILD subtype and HRQL. Patients with IPF (n = 102) were older, more likely to be men, and more likely to have smoked. Pulmonary function was similar between the groups. The patients with CHP (n = 69) had worse HRQL across all eight domains of the SF-36, as well as the PCS and MCS, compared with patients with IPF (P < .01-.09). This pattern remained after controlling for age and pulmonary function (P < .01-.02). Covariates explaining part of the relationship between disease subtype and PCS score included severity of dyspnea (P < .01) and fatigue (P < .01). Covariates explaining part of the relationship between disease subtype and MCS score included severity of dyspnea (P < .01), female sex (P = .02), and fatigue (P = .02). HRQL is worse in CHP compared with IPF. HRQL differences between ILD subtypes are explained in part by differences in sex, dyspnea, and fatigue.
Improving prediction of conditions that modulate dengue fever risks in Yucatán, México.
NASA Astrophysics Data System (ADS)
Laureano-Rosario, A. E.; Garcia-Rejon, J. E.; Gomez-Carro, S.; Farfan-Ale, J.; Muller-Karger, F. E.
2015-12-01
Accurately predicting vector-borne diseases is essential for communities everywhere around the world. Yet this is a difficult task, even in areas where annual epidemics occur. The primary vector for dengue virus disease (DENV) is Aedes aegypti. This is a tropical-subtropical mosquito that proliferates in urban areas. Precipitation and increased temperatures are known to promote growth, reproduction and transmission of DENV. This study assesses potential health risks on coastal communities in the northwest Yucatan Peninsula, Mexico. We studied the relation between DENV incidences and environmental data. We hypothesized that environmental parameters such as rainfall, sea surface temperature (SST), air temperature, humidity, and past DENV cases are the primary drivers of DENV incidences. We collected DENV data from the National Health Information System and demographic data from the National Institute of Statistics and Geography. Precipitation and air temperature were obtained from the National Water Commission. SST was derived from the NOAA Advanced Very High Resolution Radiometer (AVHRR) satellite sensor. In addition, incidence of DENV cases per year was calculated. Multiple regression analyses show that previous DENV cases, minimum air temperature, humidity, and precipitation are positively related to DENV cases and explain 82% of the variation, with 77% explained by previous DENV cases (cases that took place 2-weeks before the target). A second regression model without the previous DENV cases showed 30% of the variation explained by humidity and precipitation (p<0.05). Satellite-derived SST was also included to test whether the percent variation of DENV explained increased. These results imply that if these environmental variables continue to increase with time, the trend of DENV cases will also increase. This study suggests that it is possible to significantly improve DENV prevention and prediction of potential outcomes in Yucatan using remote sensing data.
Nitrate removal in stream ecosystems measured by 15N addition experiments: Total uptake
Hall, R.O.; Tank, J.L.; Sobota, D.J.; Mulholland, P.J.; O'Brien, J. M.; Dodds, W.K.; Webster, J.R.; Valett, H.M.; Poole, G.C.; Peterson, B.J.; Meyer, J.L.; McDowell, W.H.; Johnson, S.L.; Hamilton, S.K.; Grimm, N. B.; Gregory, S.V.; Dahm, Clifford N.; Cooper, L.W.; Ashkenas, L.R.; Thomas, S.M.; Sheibley, R.W.; Potter, J.D.; Niederlehner, B.R.; Johnson, L.T.; Helton, A.M.; Crenshaw, C.M.; Burgin, A.J.; Bernot, M.J.; Beaulieu, J.J.; Arangob, C.P.
2009-01-01
We measured uptake length of 15NO-3 in 72 streams in eight regions across the United States and Puerto Rico to develop quantitative predictive models on controls of NO-3 uptake length. As part of the Lotic Intersite Nitrogen eXperiment II project, we chose nine streams in each region corresponding to natural (reference), suburban-urban, and agricultural land uses. Study streams spanned a range of human land use to maximize variation in NO-3 concentration, geomorphology, and metabolism. We tested a causal model predicting controls on NO-3 uptake length using structural equation modeling. The model included concomitant measurements of ecosystem metabolism, hydraulic parameters, and nitrogen concentration. We compared this structural equation model to multiple regression models which included additional biotic, catchment, and riparian variables. The structural equation model explained 79% of the variation in log uptake length (S Wtot). Uptake length increased with specific discharge (Q/w) and increasing NO-3 concentrations, showing a loss in removal efficiency in streams with high NO-3 concentration. Uptake lengths shortened with increasing gross primary production, suggesting autotrophic assimilation dominated NO-3 removal. The fraction of catchment area as agriculture and suburban-urban land use weakly predicted NO-3 uptake in bivariate regression, and did improve prediction in a set of multiple regression models. Adding land use to the structural equation model showed that land use indirectly affected NO-3 uptake lengths via directly increasing both gross primary production and NO-3 concentration. Gross primary production shortened SWtot, while increasing NO-3 lengthened SWtot resulting in no net effect of land use on NO- 3 removal. ?? 2009.
Sahadevan, S; Earnest, A; Koh, Y L; Lee, K M; Soh, C H; Ding, Y Y
2004-09-01
This study first aimed to determine the adequacy of the Diagnosis Related Grouping (DRG) model's ability to explain (1) the variance in the actual length of stay (LOS) of elderly medical inpatients and (2) the LOS difference in the same cohort between the departments of Geriatric Medicine (GRM) and General Medicine (GM). We then looked at how these explanatory abilities of the DRG changed when patients' function-linked variables (ignored by DRG) were incorporated into the model. Basic demographic data of a consecutively hospitalised cohort of elderly medical inpatients from GRM and GM, as well as their actual LOS, discharge DRG codes [with their corresponding trimmed average length of stay (ALOS)] and selected function-linked variables (including premorbid functional status, change in functional profile during hospitalisation and number of therapists seen) were recorded. Beginning with ALOS, function-linked variables that were significantly associated with LOS were then added into two multiple liner regression models so as to quantify how the functional dimension improved the DRGs' abilities to explain LOS variances and interdepartmental LOS differences. Forward selection procedure was employed to determine the final models. For the interdepartmental analysis, the study sample was restricted to patients who shared common DRG codes. 114 GRM and 118 GM patients were studied. Trimmed ALOS alone explained 8% of the actual LOS variance. With the addition of function-linked variables, the adjusted R2 of the final model increased to 28%. Due to common code restrictions, the data of 79 GRM and 78 GM patients were available for the analysis of interdepartmental LOS differences. At the unadjusted stage, the median stay of GRM patients was 4.3 days longer than GM's and with adjustments made for the DRGs, this difference was reduced to 3.9 days. Additionally adjusting for the patients' functional features diminished the interdepartmental LOS discrepancy even further, to 2.1 days. This study demonstrates that for elderly medical inpatients, the incorporation of patients' functional status significantly improves the DRG model's ability to predict the patients' actual LOS as well as to explain interdepartmental LOS differences between GRM and GM.
Can biomechanical variables predict improvement in crouch gait?
Hicks, Jennifer L.; Delp, Scott L.; Schwartz, Michael H.
2011-01-01
Many patients respond positively to treatments for crouch gait, yet surgical outcomes are inconsistent and unpredictable. In this study, we developed a multivariable regression model to determine if biomechanical variables and other subject characteristics measured during a physical exam and gait analysis can predict which subjects with crouch gait will demonstrate improved knee kinematics on a follow-up gait analysis. We formulated the model and tested its performance by retrospectively analyzing 353 limbs of subjects who walked with crouch gait. The regression model was able to predict which subjects would demonstrate ‘improved’ and ‘unimproved’ knee kinematics with over 70% accuracy, and was able to explain approximately 49% of the variance in subjects’ change in knee flexion between gait analyses. We found that improvement in stance phase knee flexion was positively associated with three variables that were drawn from knowledge about the biomechanical contributors to crouch gait: i) adequate hamstrings lengths and velocities, possibly achieved via hamstrings lengthening surgery, ii) normal tibial torsion, possibly achieved via tibial derotation osteotomy, and iii) sufficient muscle strength. PMID:21616666
Health service costs in Europe: cost and reimbursement of primary hip replacement in nine countries.
Stargardt, Tom
2008-01-01
This paper assesses variations in the cost of primary hip replacement between and within nine member states of the European Union (EU). It also compares the cost of service with public-payer reimbursements. To do so, data on cost and reimbursement were surveyed at the micro-level in 42 hospitals in Denmark, England, France, Germany, Hungary, Italy, The Netherlands, Poland, and Spain. The total cost of treatment ranged from 1290 euros (Hungary) to 8739 euros (The Netherlands), with a mean cost of 5043 euros (STD +/- 2071 euros). The main cost drivers were found to be implants (34% of total cost on average) and ward costs (20.9% of total cost on average). A one-way random effects analysis of variance model indicated that 74.0% of variation was between and only 26% of variation was within countries. In a two-level random-intercept regression model, purchasing-power parities explained 79.4% of the explainable between-country variation, while the percentage of uncemented implants used and the number of beds explained 12.1 and 1.6% of explainable within-country variation, respectively. The large differences in cost and reimbursement between Poland, Hungary, and the other EU member states shows that primary total hip replacement is a highly relevant case for cross-border care. Copyright 2008 John Wiley & Sons, Ltd.
Terry, Douglas P; Puente, Antonio N; Brown, Courtney L; Faraco, Carlos C; Miller, L Stephen
2013-01-01
The personality traits Openness to experience and Neuroticism of the five-factor model have previously been associated with memory performance in nondemented older adults, but this relationship has not been investigated in samples with memory impairment. Our examination of 50 community-dwelling older adults (29 cognitively intact; 21 with questionable dementia as determined by the Clinical Dementia Rating Scale) showed that demographic variables (age, years of education, gender, and estimated premorbid IQ) and current depressive symptoms explained a significant amount of variance of Repeatable Battery of Neuropsychological Status Delayed Memory (adjusted R (2) = 0.23). After controlling for these variables, a measure of global cognitive status further explained a significant portion of variance in memory performance (ΔR(2) = 0.13; adjusted R(2) = 0.36; p < .01). Finally, adding Openness to this hierarchical linear regression model explained a significant additional portion of variance (ΔR(2) = 0.08; adjusted R(2) = 0.44; p < .01) but adding Neuroticism did not explain any additional variance. This significant relationship between Openness and better memory performance above and beyond one's cognitive status and demographic variables may suggest that a lifelong pattern of involvement in new cognitive activities could be preserved in old age or protect from memory decline. This study suggests that personality may be a powerful predictor of memory ability and clinically useful in this heterogeneous population.
Miller, Nathan; Prevatt, Frances
2017-10-01
The purpose of this study was to reexamine the latent structure of ADHD and sluggish cognitive tempo (SCT) due to issues with construct validity. Two proposed changes to the construct include viewing hyperactivity and sluggishness (hypoactivity) as a single continuum of activity level, and viewing inattention as a separate dimension from activity level. Data were collected from 1,398 adults using Amazon's MTurk. A new scale measuring activity level was developed, and scores of Inattention were regressed onto scores of Activity Level using curvilinear regression. The Activity Level scale showed acceptable levels of internal consistency, normality, and unimodality. Curvilinear regression indicates that a quadratic (curvilinear) model accurately explains a small but significant portion of the variance in levels of inattention. Hyperactivity and hypoactivity may be viewed as a continuum, rather than separate disorders. Inattention may have a U-shaped relationship with activity level. Linear analyses may be insufficient and inaccurate for studying ADHD.
Lee, Seung Hee; Jang, Hyung Suk; Yang, Young Hee
2016-10-01
This study was done to investigate factors influencing successful aging in middle-aged women. A convenience sample of 103 middle-aged women was selected from the community. Data were collected using a structured questionnaire and analyzed using descriptive statistics, two-sample t-test, one-way ANOVA, Kruskal Wallis test, Pearson correlations, Spearman correlations and multiple regression analysis with the SPSS/WIN 22.0 program. Results of regression analysis showed that significant factors influencing successful aging were post-traumatic growth and social support. This regression model explained 48% of the variance in successful aging. Findings show that the concept 'post-traumatic growth' is an important factor influencing successful aging in middle-aged women. In addition, social support from friends/co-workers had greater influence on successful aging than social support from family. Thus, we need to consider the positive impact of post-traumatic growth and increase the chances of social participation in a successful aging program for middle-aged women.
Sanders, Elizabeth A; Berninger, Virginia W; Abbott, Robert D
Sequential regression was used to evaluate whether language-related working memory components uniquely predict reading and writing achievement beyond cognitive-linguistic translation for students in Grades 4 through 9 ( N = 103) with specific learning disabilities (SLDs) in subword handwriting (dysgraphia, n = 25), word reading and spelling (dyslexia, n = 60), or oral and written language (oral and written language learning disabilities, n = 18). That is, SLDs are defined on the basis of cascading level of language impairment (subword, word, and syntax/text). A five-block regression model sequentially predicted literacy achievement from cognitive-linguistic translation (Block 1); working memory components for word-form coding (Block 2), phonological and orthographic loops (Block 3), and supervisory focused or switching attention (Block 4); and SLD groups (Block 5). Results showed that cognitive-linguistic translation explained an average of 27% and 15% of the variance in reading and writing achievement, respectively, but working memory components explained an additional 39% and 27% of variance. Orthographic word-form coding uniquely predicted nearly every measure, whereas attention switching uniquely predicted only reading. Finally, differences in reading and writing persisted between dyslexia and dysgraphia, with dysgraphia higher, even after controlling for Block 1 to 4 predictors. Differences in literacy achievement between students with dyslexia and oral and written language learning disabilities were largely explained by the Block 1 predictors. Applications to identifying and teaching students with these SLDs are discussed.
Donta, Balaiah; Dasgupta, Anindita; Ghule, Mohan; Battala, Madhusudana; Nair, Saritha; Silverman, Jay G.; Jadhav, Arun; Palaye, Prajakta; Saggurti, Niranjan; Raj, Anita
2015-01-01
Objective Evidence has linked economic hardship with increased intimate partner violence (IPV) perpetration among males. However, less is known about how economic debt or gender norms related to men's roles in relationships or the household, which often underlie IPV perpetration, intersect in or may explain these associations. We assessed the intersection of economic debt, attitudes toward gender norms, and IPV perpetration among married men in India. Methods Data were from the evaluation of a family planning intervention among young married couples (n=1,081) in rural Maharashtra, India. Crude and adjusted logistic regression models for dichotomous outcome variables and linear regression models for continuous outcomes were used to examine debt in relation to husbands' attitudes toward gender-based norms (i.e., beliefs supporting IPV and beliefs regarding male dominance in relationships and the household), as well as sexual and physical IPV perpetration. Results Twenty percent of husbands reported debt. In adjusted linear regression models, debt was associated with husbands' attitudes supportive of IPV (b=0.015, p=0.004) and norms supporting male dominance in relationships and the household (b=0.006, p=0.003). In logistic regression models adjusted for relevant demographics, debt was associated with perpetration of physical IPV (adjusted odds ratio [AOR] = 1.4, 95% confidence interval [CI] 1.1, 1.9) and sexual IPV (AOR=1.6, 95% CI 1.1, 2.1) from husbands. These findings related to debt and relation to IPV were slightly attenuated when further adjusted for men's attitudes toward gender norms. Conclusion Findings suggest the need for combined gender equity and economic promotion interventions to address high levels of debt and related IPV reported among married couples in rural India. PMID:26556938
Spatial patterns of species richness in New World coral snakes and the metabolic theory of ecology
NASA Astrophysics Data System (ADS)
Terribile, Levi Carina; Diniz-Filho, José Alexandre Felizola
2009-03-01
The metabolic theory of ecology (MTE) has attracted great interest because it proposes an explanation for species diversity gradients based on temperature-metabolism relationships of organisms. Here we analyse the spatial richness pattern of 73 coral snake species from the New World in the context of MTE. We first analysed the association between ln-transformed richness and environmental variables, including the inverse transformation of annual temperature (1/ kT). We used eigenvector-based spatial filtering to remove the residual spatial autocorrelation in the data and geographically weighted regression to account for non-stationarity in data. In a model I regression (OLS), the observed slope between ln-richness and 1/ kT was -0.626 ( r2 = 0.413), but a model II regression generated a much steeper slope (-0.975). When we added additional environmental correlates and the spatial filters in the OLS model, the R2 increased to 0.863 and the partial regression coefficient of 1/ kT was -0.676. The GWR detected highly significant non-stationarity, in data, and the median of local slopes of ln-richness against 1/ kT was -0.38. Our results expose several problems regarding the assumptions needed to test MTE: although the slope of OLS fell within that predicted by the theory and the dataset complied with the assumption of temperature-independence of average body size, the fact that coral snakes consist of a restricted taxonomic group and the non-stationarity of slopes across geographical space makes MTE invalid to explain richness in this case. Also, it is clear that other ecological and historical factors are important drivers of species richness patterns and must be taken into account both in theoretical modeling and data analysis.
Reed, Elizabeth; Donta, Balaiah; Dasgupta, Anindita; Ghule, Mohan; Battala, Madhusudana; Nair, Saritha; Silverman, Jay G; Jadhav, Arun; Palaye, Prajakta; Saggurti, Niranjan; Raj, Anita
2015-01-01
Evidence has linked economic hardship with increased intimate partner violence (IPV) perpetration among males. However, less is known about how economic debt or gender norms related to men's roles in relationships or the household, which often underlie IPV perpetration, intersect in or may explain these associations. We assessed the intersection of economic debt, attitudes toward gender norms, and IPV perpetration among married men in India. Data were from the evaluation of a family planning intervention among young married couples (n=1,081) in rural Maharashtra, India. Crude and adjusted logistic regression models for dichotomous outcome variables and linear regression models for continuous outcomes were used to examine debt in relation to husbands' attitudes toward gender-based norms (i.e., beliefs supporting IPV and beliefs regarding male dominance in relationships and the household), as well as sexual and physical IPV perpetration. Twenty percent of husbands reported debt. In adjusted linear regression models, debt was associated with husbands' attitudes supportive of IPV (b=0.015, p=0.004) and norms supporting male dominance in relationships and the household (b=0.006, p=0.003). In logistic regression models adjusted for relevant demographics, debt was associated with perpetration of physical IPV (adjusted odds ratio [AOR] = 1.4, 95% confidence interval [CI] 1.1, 1.9) and sexual IPV (AOR=1.6, 95% CI 1.1, 2.1) from husbands. These findings related to debt and relation to IPV were slightly attenuated when further adjusted for men's attitudes toward gender norms. Findings suggest the need for combined gender equity and economic promotion interventions to address high levels of debt and related IPV reported among married couples in rural India.
The intermediate endpoint effect in logistic and probit regression
MacKinnon, DP; Lockwood, CM; Brown, CH; Wang, W; Hoffman, JM
2010-01-01
Background An intermediate endpoint is hypothesized to be in the middle of the causal sequence relating an independent variable to a dependent variable. The intermediate variable is also called a surrogate or mediating variable and the corresponding effect is called the mediated, surrogate endpoint, or intermediate endpoint effect. Clinical studies are often designed to change an intermediate or surrogate endpoint and through this intermediate change influence the ultimate endpoint. In many intermediate endpoint clinical studies the dependent variable is binary, and logistic or probit regression is used. Purpose The purpose of this study is to describe a limitation of a widely used approach to assessing intermediate endpoint effects and to propose an alternative method, based on products of coefficients, that yields more accurate results. Methods The intermediate endpoint model for a binary outcome is described for a true binary outcome and for a dichotomization of a latent continuous outcome. Plots of true values and a simulation study are used to evaluate the different methods. Results Distorted estimates of the intermediate endpoint effect and incorrect conclusions can result from the application of widely used methods to assess the intermediate endpoint effect. The same problem occurs for the proportion of an effect explained by an intermediate endpoint, which has been suggested as a useful measure for identifying intermediate endpoints. A solution to this problem is given based on the relationship between latent variable modeling and logistic or probit regression. Limitations More complicated intermediate variable models are not addressed in the study, although the methods described in the article can be extended to these more complicated models. Conclusions Researchers are encouraged to use an intermediate endpoint method based on the product of regression coefficients. A common method based on difference in coefficient methods can lead to distorted conclusions regarding the intermediate effect. PMID:17942466
Yu, Wenxi; Liu, Yang; Ma, Zongwei; Bi, Jun
2017-08-01
Using satellite-based aerosol optical depth (AOD) measurements and statistical models to estimate ground-level PM 2.5 is a promising way to fill the areas that are not covered by ground PM 2.5 monitors. The statistical models used in previous studies are primarily Linear Mixed Effects (LME) and Geographically Weighted Regression (GWR) models. In this study, we developed a new regression model between PM 2.5 and AOD using Gaussian processes in a Bayesian hierarchical setting. Gaussian processes model the stochastic nature of the spatial random effects, where the mean surface and the covariance function is specified. The spatial stochastic process is incorporated under the Bayesian hierarchical framework to explain the variation of PM 2.5 concentrations together with other factors, such as AOD, spatial and non-spatial random effects. We evaluate the results of our model and compare them with those of other, conventional statistical models (GWR and LME) by within-sample model fitting and out-of-sample validation (cross validation, CV). The results show that our model possesses a CV result (R 2 = 0.81) that reflects higher accuracy than that of GWR and LME (0.74 and 0.48, respectively). Our results indicate that Gaussian process models have the potential to improve the accuracy of satellite-based PM 2.5 estimates.
2014-01-01
Background There have been large-scale outbreaks of hand, foot and mouth disease (HFMD) in Mainland China over the last decade. These events varied greatly across the country. It is necessary to identify the spatial risk factors and spatial distribution patterns of HFMD for public health control and prevention. Climate risk factors associated with HFMD occurrence have been recognized. However, few studies discussed the socio-economic determinants of HFMD risk at a space scale. Methods HFMD records in Mainland China in May 2008 were collected. Both climate and socio-economic factors were selected as potential risk exposures of HFMD. Odds ratio (OR) was used to identify the spatial risk factors. A spatial autologistic regression model was employed to get OR values of each exposures and model the spatial distribution patterns of HFMD risk. Results Results showed that both climate and socio-economic variables were spatial risk factors for HFMD transmission in Mainland China. The statistically significant risk factors are monthly average precipitation (OR = 1.4354), monthly average temperature (OR = 1.379), monthly average wind speed (OR = 1.186), the number of industrial enterprises above designated size (OR = 17.699), the population density (OR = 1.953), and the proportion of student population (OR = 1.286). The spatial autologistic regression model has a good goodness of fit (ROC = 0.817) and prediction accuracy (Correct ratio = 78.45%) of HFMD occurrence. The autologistic regression model also reduces the contribution of the residual term in the ordinary logistic regression model significantly, from 17.25 to 1.25 for the odds ratio. Based on the prediction results of the spatial model, we obtained a map of the probability of HFMD occurrence that shows the spatial distribution pattern and local epidemic risk over Mainland China. Conclusions The autologistic regression model was used to identify spatial risk factors and model spatial risk patterns of HFMD. HFMD occurrences were found to be spatially heterogeneous over the Mainland China, which is related to both the climate and socio-economic variables. The combination of socio-economic and climate exposures can explain the HFMD occurrences more comprehensively and objectively than those with only climate exposures. The modeled probability of HFMD occurrence at the county level reveals not only the spatial trends, but also the local details of epidemic risk, even in the regions where there were no HFMD case records. PMID:24731248
The M Word: Multicollinearity in Multiple Regression.
ERIC Educational Resources Information Center
Morrow-Howell, Nancy
1994-01-01
Notes that existence of substantial correlation between two or more independent variables creates problems of multicollinearity in multiple regression. Discusses multicollinearity problem in social work research in which independent variables are usually intercorrelated. Clarifies problems created by multicollinearity, explains detection of…
Cho, Eunsoo; Capin, Philip; Roberts, Greg; Vaughn, Sharon
2017-07-01
Within multitiered instructional delivery models, progress monitoring is a key mechanism for determining whether a child demonstrates an adequate response to instruction. One measure commonly used to monitor the reading progress of students is oral reading fluency (ORF). This study examined the extent to which ORF slope predicts reading comprehension outcomes for fifth-grade struggling readers ( n = 102) participating in an intensive reading intervention. Quantile regression models showed that ORF slope significantly predicted performance on a sentence-level fluency and comprehension assessment, regardless of the students' reading skills, controlling for initial ORF performance. However, ORF slope was differentially predictive of a passage-level comprehension assessment based on students' reading skills when controlling for initial ORF status. Results showed that ORF explained unique variance for struggling readers whose posttest performance was at the upper quantiles at the end of the reading intervention, but slope was not a significant predictor of passage-level comprehension for students whose reading problems were the most difficult to remediate.
Predicting assemblages and species richness of endemic fish in the upper Yangtze River.
He, Yongfeng; Wang, Jianwei; Lek-Ang, Sithan; Lek, Sovan
2010-09-01
The present work describes the ability of two modeling methods, Classification and Regression Tree (CART) and Random Forest (RF), to predict endemic fish assemblages and species richness in the upper Yangtze River, and then to identify the determinant environmental factors contributing to the models. The models included 24 predictor variables and 2 response variables (fish assemblage and species richness) for a total of 46 site units. The predictive quality of the modeling approaches was judged with a leave-one-out validation procedure. There was an average success of 60.9% and 71.7% to assign each site unit to the correct assemblage of fish, and 73% and 84% to explain the variance in species richness, by using CART and RF models, respectively. RF proved to be better than CART in terms of accuracy and efficiency in ecological applications. In any case, the mixed models including both land cover and river characteristic variables were more powerful than either individual one in explaining the endemic fish distribution pattern in the upper Yangtze River. For instance, altitude, slope, length, discharge, runoff, farmland and alpine and sub-alpine meadow played important roles in driving the observed endemic fish assemblage structure, while farmland, slope grassland, discharge, runoff, altitude and drainage area in explaining the observed patterns of endemic species richness. Therefore, the various effects of human activity on natural aquatic ecosystems, in particular, the flow modification of the river and the land use changes may have a considerable effect on the endemic fish distribution patterns on a regional scale. Copyright 2010 Elsevier B.V. All rights reserved.
Voit, E O; Knapp, R G
1997-08-15
The linear-logistic regression model and Cox's proportional hazard model are widely used in epidemiology. Their successful application leaves no doubt that they are accurate reflections of observed disease processes and their associated risks or incidence rates. In spite of their prominence, it is not a priori evident why these models work. This article presents a derivation of the two models from the framework of canonical modeling. It begins with a general description of the dynamics between risk sources and disease development, formulates this description in the canonical representation of an S-system, and shows how the linear-logistic model and Cox's proportional hazard model follow naturally from this representation. The article interprets the model parameters in terms of epidemiological concepts as well as in terms of general systems theory and explains the assumptions and limitations generally accepted in the application of these epidemiological models.
NASA Astrophysics Data System (ADS)
Ronsmans, Gaétane; Wespes, Catherine; Hurtmans, Daniel; Clerbaux, Cathy; Coheur, Pierre-François
2018-04-01
This study aims to understand the spatial and temporal variability of HNO3 total columns in terms of explanatory variables. To achieve this, multiple linear regressions are used to fit satellite-derived time series of HNO3 daily averaged total columns. First, an analysis of the IASI 9-year time series (2008-2016) is conducted based on various equivalent latitude bands. The strong and systematic denitrification of the southern polar stratosphere is observed very clearly. It is also possible to distinguish, within the polar vortex, three regions which are differently affected by the denitrification. Three exceptional denitrification episodes in 2011, 2014 and 2016 are also observed in the Northern Hemisphere, due to unusually low arctic temperatures. The time series are then fitted by multivariate regressions to identify what variables are responsible for HNO3 variability in global distributions and time series, and to quantify their respective influence. Out of an ensemble of proxies (annual cycle, solar flux, quasi-biennial oscillation, multivariate ENSO index, Arctic and Antarctic oscillations and volume of polar stratospheric clouds), only the those defined as significant (p value < 0.05) by a selection algorithm are retained for each equivalent latitude band. Overall, the regression gives a good representation of HNO3 variability, with especially good results at high latitudes (60-80 % of the observed variability explained by the model). The regressions show the dominance of annual variability in all latitudinal bands, which is related to specific chemistry and dynamics depending on the latitudes. We find that the polar stratospheric clouds (PSCs) also have a major influence in the polar regions, and that their inclusion in the model improves the correlation coefficients and the residuals. However, there is still a relatively large portion of HNO3 variability that remains unexplained by the model, especially in the intertropical regions, where factors not included in the regression model (such as vegetation fires or lightning) may be at play.
Emotion avoidance in patients with anorexia nervosa: initial test of a functional model.
Wildes, Jennifer E; Ringham, Rebecca M; Marcus, Marsha D
2010-07-01
This study aimed to evaluate emotion avoidance in patients with anorexia nervosa (AN) and to examine whether emotion avoidance helps to explain (i.e., mediates) the relation between depressive and anxiety symptoms and eating disorder (ED) psychopathology in this group. Seventy-five patients with AN completed questionnaires to assess study variables. Rates of emotion avoidance were compared to published data, and regression models were used to test the hypothesis that emotion avoidance mediates the relation between depressive and anxiety symptoms and ED psychopathology in AN. Patients with AN endorsed levels of emotion avoidance that were comparable to or higher than other psychiatric populations and exceeded community controls. As predicted, emotion avoidance significantly explained the relations of depressive and anxiety symptoms to ED psychopathology. Findings confirm that emotion avoidance is present in patients with AN and provide initial support for the idea that anorexic symptoms function, in part, to help individuals avoid aversive emotional states. 2009 by Wiley Periodicals, Inc.
[The role of supply-side characteristics of services in AIDS mortality in Mexico].
Bautista-Arredondo, Sergio; Serván-Mori, Edson; Silverman-Retana, Omar; Contreras-Loya, David; Romero-Martínez, Martín; Magis-Rodríguez, Carlos; Uribe-Zúñiga, Patricia; Lozano, Rafael
2015-01-01
To document the association between supply-side determinants and AIDS mortality in Mexico between 2008 and 2013. We analyzed the SALVAR database (system for antiretroviral management, logistics and surveillance) as well as data collected through a nationally representative survey in health facilities. We used multivariate logit regression models to estimate the association between supply-side characteristics, namely management, training and experience of health care providers, and AIDS mortality, distinguishing early and non-early mortality and controlling for clinical indicators of the patients. Clinic status of the patients (initial CD4 and viral load) explain 44.4% of the variability of early mortality across clinics and 13.8% of the variability in non-early mortality. Supply-side characteristics increase explanatory power of the models by 16% in the case of early mortality, and 96% in the case of non-early mortality. Aspects of management and implementation of services contribute significantly to explain AIDS mortality in Mexico. Improving these aspects of the national program, can similarly improve its results.
Nisén, Jessica; Myrskylä, Mikko; Silventoinen, Karri; Martikainen, Pekka
2014-01-01
An inverse association between education and fertility in women has been found in many societies but the causes of this association remain inadequately understood. We investigated whether observed and unobserved family-background characteristics explained educational differences in lifetime fertility among 35,212 Finnish women born in 1940–50. Poisson and logistic regression models, adjusted for measured socio-demographic family-background characteristics and for unobserved family characteristics shared by siblings, were used to analyse the relationship between education and the number of children, having any children, and fertility beyond the first child. The woman's education and the socio-economic position of the family were negatively associated with fertility. Observed family characteristics moderately (3–28 per cent) explained the association between education and fertility, and results from models including unobserved characteristics supported this interpretation. The remaining association may represent a causal relationship between education and fertility or joint preferences that form independently of our measures of background. PMID:24946905
Nisén, Jessica; Myrskylä, Mikko; Silventoinen, Karri; Martikainen, Pekka
2014-01-01
An inverse association between education and fertility in women has been found in many societies but the causes of this association remain inadequately understood. We investigated whether observed and unobserved family-background characteristics explained educational differences in lifetime fertility among 35,212 Finnish women born in 1940-50. Poisson and logistic regression models, adjusted for measured socio-demographic family-background characteristics and for unobserved family characteristics shared by siblings, were used to analyse the relationship between education and the number of children, having any children, and fertility beyond the first child. The woman's education and the socio-economic position of the family were negatively associated with fertility. Observed family characteristics moderately (3-28 per cent) explained the association between education and fertility, and results from models including unobserved characteristics supported this interpretation. The remaining association may represent a causal relationship between education and fertility or joint preferences that form independently of our measures of background.
Work-related fatigue: the specific case of highly educated women in the Netherlands.
Verdonk, Petra; Hooftman, Wendela E; van Veldhoven, Marc J P M; Boelens, Louise R M; Koppes, Lando L J
2010-03-01
This study aims to establish the prevalence of high work-related fatigue (need for recovery, NFR) among employees and to explain group differences categorized by gender, age, and education. The study particularly aims to clarify prevalence and explanatory factors in highly educated women. In 2005 and 2006, large representative samples of 80,000 Dutch employees (net response rate 33.0%; N = 47,263) received the Netherlands working conditions survey questionnaire. First, we calculated the prevalence of high NFR for men and women with different age and education levels. The average prevalence of high NFR was 28.8% and was highest among highly educated women (35.2%) in particular those aged 50-64 years (40.3%). Second, logistic regression analyses were used to compare subgroups' NFR in relation to situational factors, working conditions, and health. Three comparisons were made: (1) highly educated women versus men; (2) highly educated versus lower educated women and; (3) older highly educated versus younger highly educated women. The situational, working conditions and health factors in our model did not explain the gender differences among highly educated employees (OR = 1.37; CI = 1.3-1.5, adjusted for all factors OR = 1.32; CI = 1.2-1.5). Despite that lower autonomy and workplace violence explained highly educated women's NFR, working fewer hours counterbalanced this. Time pressure in work largely explained the differences in NFR among women at different education levels (crude OR 1.44; CI = 1.4-1.5, adjusted OR 1.14; CI = 1.0-1.3). In the age comparison, lower health ratings, more adverse working conditions, and working as a teacher explained older highly educated women's high prevalence of high NFR (crude OR 1.32; CI = 1.2-1.5, adjusted OR 0.94; CI = 0.8-1.2). NFR has high prevalence in highly educated women (35.2%) in particular those aged 50-64 years (40.3%). Our model did not explain gender differences in NFR, because working fewer hours counterbalanced the effects of lower autonomy and external workplace violence. Our model, in particular time pressure, largely explained differences in NFR between women at different education levels. Age differences in the prevalence of high NFR among highly educated women's were fully explained by our model. Main factors were lower health ratings, adverse working conditions, and working as a teacher.
Chen, Gongbo; Li, Shanshan; Knibbs, Luke D; Hamm, N A S; Cao, Wei; Li, Tiantian; Guo, Jianping; Ren, Hongyan; Abramson, Michael J; Guo, Yuming
2018-09-15
Machine learning algorithms have very high predictive ability. However, no study has used machine learning to estimate historical concentrations of PM 2.5 (particulate matter with aerodynamic diameter ≤ 2.5 μm) at daily time scale in China at a national level. To estimate daily concentrations of PM 2.5 across China during 2005-2016. Daily ground-level PM 2.5 data were obtained from 1479 stations across China during 2014-2016. Data on aerosol optical depth (AOD), meteorological conditions and other predictors were downloaded. A random forests model (non-parametric machine learning algorithms) and two traditional regression models were developed to estimate ground-level PM 2.5 concentrations. The best-fit model was then utilized to estimate the daily concentrations of PM 2.5 across China with a resolution of 0.1° (≈10 km) during 2005-2016. The daily random forests model showed much higher predictive accuracy than the other two traditional regression models, explaining the majority of spatial variability in daily PM 2.5 [10-fold cross-validation (CV) R 2 = 83%, root mean squared prediction error (RMSE) = 28.1 μg/m 3 ]. At the monthly and annual time-scale, the explained variability of average PM 2.5 increased up to 86% (RMSE = 10.7 μg/m 3 and 6.9 μg/m 3 , respectively). Taking advantage of a novel application of modeling framework and the most recent ground-level PM 2.5 observations, the machine learning method showed higher predictive ability than previous studies. Random forests approach can be used to estimate historical exposure to PM 2.5 in China with high accuracy. Copyright © 2018 Elsevier B.V. All rights reserved.
Holtschlag, David J.; Shively, Dawn; Whitman, Richard L.; Haack, Sheridan K.; Fogarty, Lisa R.
2008-01-01
Regression analyses and hydrodynamic modeling were used to identify environmental factors and flow paths associated with Escherichia coli (E. coli) concentrations at Memorial and Metropolitan Beaches on Lake St. Clair in Macomb County, Mich. Lake St. Clair is part of the binational waterway between the United States and Canada that connects Lake Huron with Lake Erie in the Great Lakes Basin. Linear regression, regression-tree, and logistic regression models were developed from E. coli concentration and ancillary environmental data. Linear regression models on log10 E. coli concentrations indicated that rainfall prior to sampling, water temperature, and turbidity were positively associated with bacteria concentrations at both beaches. Flow from Clinton River, changes in water levels, wind conditions, and log10 E. coli concentrations 2 days before or after the target bacteria concentrations were statistically significant at one or both beaches. In addition, various interaction terms were significant at Memorial Beach. Linear regression models for both beaches explained only about 30 percent of the variability in log10 E. coli concentrations. Regression-tree models were developed from data from both Memorial and Metropolitan Beaches but were found to have limited predictive capability in this study. The results indicate that too few observations were available to develop reliable regression-tree models. Linear logistic models were developed to estimate the probability of E. coli concentrations exceeding 300 most probable number (MPN) per 100 milliliters (mL). Rainfall amounts before bacteria sampling were positively associated with exceedance probabilities at both beaches. Flow of Clinton River, turbidity, and log10 E. coli concentrations measured before or after the target E. coli measurements were related to exceedances at one or both beaches. The linear logistic models were effective in estimating bacteria exceedances at both beaches. A receiver operating characteristic (ROC) analysis was used to determine cut points for maximizing the true positive rate prediction while minimizing the false positive rate. A two-dimensional hydrodynamic model was developed to simulate horizontal current patterns on Lake St. Clair in response to wind, flow, and water-level conditions at model boundaries. Simulated velocity fields were used to track hypothetical massless particles backward in time from the beaches along flow paths toward source areas. Reverse particle tracking for idealized steady-state conditions shows changes in expected flow paths and traveltimes with wind speeds and directions from 24 sectors. The results indicate that three to four sets of contiguous wind sectors have similar effects on flow paths in the vicinity of the beaches. In addition, reverse particle tracking was used for transient conditions to identify expected flow paths for 10 E. coli sampling events in 2004. These results demonstrate the ability to track hypothetical particles from the beaches, backward in time, to likely source areas. This ability, coupled with a greater frequency of bacteria sampling, may provide insight into changes in bacteria concentrations between source and sink areas.
Non-Linear Approach in Kinesiology Should Be Preferred to the Linear--A Case of Basketball.
Trninić, Marko; Jeličić, Mario; Papić, Vladan
2015-07-01
In kinesiology, medicine, biology and psychology, in which research focus is on dynamical self-organized systems, complex connections exist between variables. Non-linear nature of complex systems has been discussed and explained by the example of non-linear anthropometric predictors of performance in basketball. Previous studies interpreted relations between anthropometric features and measures of effectiveness in basketball by (a) using linear correlation models, and by (b) including all basketball athletes in the same sample of participants regardless of their playing position. In this paper the significance and character of linear and non-linear relations between simple anthropometric predictors (AP) and performance criteria consisting of situation-related measures of effectiveness (SE) in basketball were determined and evaluated. The sample of participants consisted of top-level junior basketball players divided in three groups according to their playing time (8 minutes and more per game) and playing position: guards (N = 42), forwards (N = 26) and centers (N = 40). Linear (general model) and non-linear (general model) regression models were calculated simultaneously and separately for each group. The conclusion is viable: non-linear regressions are frequently superior to linear correlations when interpreting actual association logic among research variables.
Factors associated with self-medication in Spain: a cross-sectional study in different age groups.
Niclós, Gracia; Olivar, Teresa; Rodilla, Vicent
2018-06-01
The identification of factors which may influence a patient's decision to self-medicate. Descriptive, cross-sectional study of the adult population (at least 16 years old), using data from the 2009 European Health Interview Survey in Spain, which included 22 188 subjects. Logistic regression models enabled us to estimate the effect of each analysed variable on self-medication. In total, 14 863 (67%) individuals reported using medication (prescribed and non-prescribed) and 3274 (22.0%) of them self-medicated. Using logistic regression and stratifying by age, four different models have been constructed. Our results include different variables in each of the models to explain self-medication, but the one that appears on all four models is education level. Age is the other important factor which influences self-medication. Self-medication is strongly associated with factors related to socio-demographic, such as sex, educational level or age, as well as several health factors such as long-standing illness or physical activity. When our data are compared to those from previous Spanish surveys carried out in 2003 and 2006, we can conclude that self-medication is increasing in Spain. © 2017 Royal Pharmaceutical Society.
Crop weather models of barley and spring wheat yield for agrophysical units in North Dakota
NASA Technical Reports Server (NTRS)
Leduc, S. (Principal Investigator)
1982-01-01
Models based on multiple regression were developed to estimate barley yield and spring wheat yield from weather data for Agrophysical units(APU) in North Dakota. The predictor variables are derived from monthly average temperature and monthly total precipitation data at meteorological stations in the cooperative network. The models are similar in form to the previous models developed for Crop Reporting Districts (CRD). The trends and derived variables were the same and the approach to select the significant predictors was similar to that used in developing the CRD models. The APU models show sight improvements in some of the statistics of the models, e.g., explained variation. These models are to be independently evaluated and compared to the previously evaluated CRD models. The comparison will indicate the preferred model area for this application, i.e., APU or CRD.
Su, Peng-Hao; Tomy, Gregg T; Hou, Chun-Yan; Yin, Fang; Feng, Dao-Lun; Ding, Yong-Sheng; Li, Yi-Fan
2018-04-01
A size-segregated gas/particle partitioning coefficient K Pi was proposed and evaluated in the predicting models on the basis of atmospheric polybrominated diphenyl ether (PBDE) field data comparing with the bulk coefficient K P . Results revealed that the characteristics of atmospheric PBDEs in southeast Shanghai rural area were generally consistent with previous investigations, suggesting that this investigation was representative to the present pollution status of atmospheric PBDEs. K Pi was generally greater than bulk K P , indicating an overestimate of TSP (the mass concentration of total suspended particles) in the expression of bulk K P . In predicting models, K Pi led to a significant shift in regression lines as compared to K P , thus it should be more cautious to investigate sorption mechanisms using the regression lines. The differences between the performances of K Pi and K P were helpful to explain some phenomenon in predicting investigations, such as P L 0 and K OA models overestimate the particle fractions of PBDEs and the models work better at high temperature than at low temperature. Our findings are important because they enabled an insight into the influence of particle size on predicting models. Copyright © 2018 Elsevier Ltd. All rights reserved.
Sebastian, Tunny; Jeyaseelan, Visalakshi; Jeyaseelan, Lakshmanan; Anandan, Shalini; George, Sebastian; Bangdiwala, Shrikant I
2018-01-01
Hidden Markov models are stochastic models in which the observations are assumed to follow a mixture distribution, but the parameters of the components are governed by a Markov chain which is unobservable. The issues related to the estimation of Poisson-hidden Markov models in which the observations are coming from mixture of Poisson distributions and the parameters of the component Poisson distributions are governed by an m-state Markov chain with an unknown transition probability matrix are explained here. These methods were applied to the data on Vibrio cholerae counts reported every month for 11-year span at Christian Medical College, Vellore, India. Using Viterbi algorithm, the best estimate of the state sequence was obtained and hence the transition probability matrix. The mean passage time between the states were estimated. The 95% confidence interval for the mean passage time was estimated via Monte Carlo simulation. The three hidden states of the estimated Markov chain are labelled as 'Low', 'Moderate' and 'High' with the mean counts of 1.4, 6.6 and 20.2 and the estimated average duration of stay of 3, 3 and 4 months, respectively. Environmental risk factors were studied using Markov ordinal logistic regression analysis. No significant association was found between disease severity levels and climate components.
Huppert, Theodore J
2016-01-01
Functional near-infrared spectroscopy (fNIRS) is a noninvasive neuroimaging technique that uses low levels of light to measure changes in cerebral blood oxygenation levels. In the majority of NIRS functional brain studies, analysis of this data is based on a statistical comparison of hemodynamic levels between a baseline and task or between multiple task conditions by means of a linear regression model: the so-called general linear model. Although these methods are similar to their implementation in other fields, particularly for functional magnetic resonance imaging, the specific application of these methods in fNIRS research differs in several key ways related to the sources of noise and artifacts unique to fNIRS. In this brief communication, we discuss the application of linear regression models in fNIRS and the modifications needed to generalize these models in order to deal with structured (colored) noise due to systemic physiology and noise heteroscedasticity due to motion artifacts. The objective of this work is to present an overview of these noise properties in the context of the linear model as it applies to fNIRS data. This work is aimed at explaining these mathematical issues to the general fNIRS experimental researcher but is not intended to be a complete mathematical treatment of these concepts.
Hao, Chen; LiJun, Chen; Albright, Thomas P.
2007-01-01
Invasive exotic species pose a growing threat to the economy, public health, and ecological integrity of nations worldwide. Explaining and predicting the spatial distribution of invasive exotic species is of great importance to prevention and early warning efforts. We are investigating the potential distribution of invasive exotic species, the environmental factors that influence these distributions, and the ability to predict them using statistical and information-theoretic approaches. For some species, detailed presence/absence occurrence data are available, allowing the use of a variety of standard statistical techniques. However, for most species, absence data are not available. Presented with the challenge of developing a model based on presence-only information, we developed an improved logistic regression approach using Information Theory and Frequency Statistics to produce a relative suitability map. This paper generated a variety of distributions of ragweed (Ambrosia artemisiifolia L.) from logistic regression models applied to herbarium specimen location data and a suite of GIS layers including climatic, topographic, and land cover information. Our logistic regression model was based on Akaike's Information Criterion (AIC) from a suite of ecologically reasonable predictor variables. Based on the results we provided a new Frequency Statistical method to compartmentalize habitat-suitability in the native range. Finally, we used the model and the compartmentalized criterion developed in native ranges to "project" a potential distribution onto the exotic ranges to build habitat-suitability maps. ?? Science in China Press 2007.
The Role of Violent Thinking in Violent Behavior: It's More About Thinking Than Drinking.
Bowes, Nicola; Walker, Julian; Hughes, Elise; Lewis, Rhiannon; Hyde, Gemma
2017-08-01
This article aims to explore and report on violent thinking and alcohol misuse; how these factors may predict self-reported violence. The role of violent thinking in violent behavior is both well established in theoretical models, yet there are few measures that explain this role. One measure that has been identified is the Maudsley Violence Questionnaire (MVQ). This is the first study to explore the use of the MVQ with a general (nonoffender) adult sample, having already been shown to be valid with young people (under 18 years old), adult male offenders, and mentally disordered offenders. This study involved 808 adult participants-569 female and 239 male participants. As figures demonstrate that around half of all violent crime in the United Kingdom is alcohol related, we also explored the role of alcohol misuse. Regression was used to explore how these factors predicted violence. The results demonstrate the important role of violent thinking in violent behavior. The MVQ factor of "Machismo" was the primary factor in regression models for both male and female self-reported violence. The role of alcohol in the regression models differed slightly between the male and female participants, with alcohol misuse involved in male violence. The study supports theoretical models including the role of violent thinking and encourages those hoping to address violence, to consider "Machismo" as a treatment target. The study also provides further validation of the MVQ as a helpful tool for clinicians or researchers who may be interested in "measuring" violent thinking.
New insights into the earliest stages of colorectal tumorigenesis.
Sievers, Chelsie K; Grady, William M; Halberg, Richard B; Pickhardt, Perry J
2017-08-01
Tumors in the large intestine have been postulated to arise via a stepwise accumulation of mutations, a process that takes up to 20 years. Recent advances in lineage tracing and DNA sequencing, however, are revealing new evolutionary models that better explain the vast amount of heterogeneity observed within and across colorectal tumors. Areas covered: A review of the literature supporting a novel model of colorectal tumor evolution was conducted. The following commentary examines the basic science and clinical evidence supporting a modified view of tumor initiation and progression in the colon. Expert commentary: The proposed 'cancer punctuated equilibrium' model of tumor evolution better explains the variability seen within and across polyps of the colon and rectum. Small colorectal polyps (6-9mm) followed longitudinally by interval imaging with CT colonography have been reported to have multiple fates: some growing, some remaining static in size, and others regressing in size over time. This new model allows for this variability in growth behavior and supports the hypothesis that some tumors can be 'born to be bad' as originally postulated by Sottoriva and colleagues, with very early molecular events impacting tumor fitness and growth behavior in the later stages of the disease process.
Cognitive predictors of balance in Parkinson's disease.
Fernandes, Ângela; Mendes, Andreia; Rocha, Nuno; Tavares, João Manuel R S
2016-06-01
Postural instability is one of the most incapacitating symptoms of Parkinson's disease (PD) and appears to be related to cognitive deficits. This study aims to determine the cognitive factors that can predict deficits in static and dynamic balance in individuals with PD. A sociodemographic questionnaire characterized 52 individuals with PD for this work. The Trail Making Test, Rule Shift Cards Test, and Digit Span Test assessed the executive functions. The static balance was assessed using a plantar pressure platform, and dynamic balance was based on the Timed Up and Go Test. The results were statistically analysed using SPSS Statistics software through linear regression analysis. The results show that a statistically significant model based on cognitive outcomes was able to explain the variance of motor variables. Also, the explanatory value of the model tended to increase with the addition of individual and clinical variables, although the resulting model was not statistically significant The model explained 25-29% of the variability of the Timed Up and Go Test, while for the anteroposterior displacement it was 23-34%, and for the mediolateral displacement it was 24-39%. From the findings, we conclude that the cognitive performance, especially the executive functions, is a predictor of balance deficit in individuals with PD.
Conceptual hierarchical modeling to describe wetland plant community organization
Little, A.M.; Guntenspergen, G.R.; Allen, T.F.H.
2010-01-01
Using multivariate analysis, we created a hierarchical modeling process that describes how differently-scaled environmental factors interact to affect wetland-scale plant community organization in a system of small, isolated wetlands on Mount Desert Island, Maine. We followed the procedure: 1) delineate wetland groups using cluster analysis, 2) identify differently scaled environmental gradients using non-metric multidimensional scaling, 3) order gradient hierarchical levels according to spatiotem-poral scale of fluctuation, and 4) assemble hierarchical model using group relationships with ordination axes and post-hoc tests of environmental differences. Using this process, we determined 1) large wetland size and poor surface water chemistry led to the development of shrub fen wetland vegetation, 2) Sphagnum and water chemistry differences affected fen vs. marsh / sedge meadows status within small wetlands, and 3) small-scale hydrologic differences explained transitions between forested vs. non-forested and marsh vs. sedge meadow vegetation. This hierarchical modeling process can help explain how upper level contextual processes constrain biotic community response to lower-level environmental changes. It creates models with more nuanced spatiotemporal complexity than classification and regression tree procedures. Using this process, wetland scientists will be able to generate more generalizable theories of plant community organization, and useful management models. ?? Society of Wetland Scientists 2009.
NASA Astrophysics Data System (ADS)
Aulenbach, B. T.; Burns, D. A.; Shanley, J. B.; Yanai, R. D.; Bae, K.; Wild, A.; Yang, Y.; Dong, Y.
2013-12-01
There are many sources of uncertainty in estimates of streamwater solute flux. Flux is the product of discharge and concentration (summed over time), each of which has measurement uncertainty of its own. Discharge can be measured almost continuously, but concentrations are usually determined from discrete samples, which increases uncertainty dependent on sampling frequency and how concentrations are assigned for the periods between samples. Gaps between samples can be estimated by linear interpolation or by models that that use the relations between concentration and continuously measured or known variables such as discharge, season, temperature, and time. For this project, developed in cooperation with QUEST (Quantifying Uncertainty in Ecosystem Studies), we evaluated uncertainty for three flux estimation methods and three different sampling frequencies (monthly, weekly, and weekly plus event). The constituents investigated were dissolved NO3, Si, SO4, and dissolved organic carbon (DOC), solutes whose concentration dynamics exhibit strongly contrasting behavior. The evaluation was completed for a 10-year period at five small, forested watersheds in Georgia, New Hampshire, New York, Puerto Rico, and Vermont. Concentration regression models were developed for each solute at each of the three sampling frequencies for all five watersheds. Fluxes were then calculated using (1) a linear interpolation approach, (2) a regression-model method, and (3) the composite method - which combines the regression-model method for estimating concentrations and the linear interpolation method for correcting model residuals to the observed sample concentrations. We considered the best estimates of flux to be derived using the composite method at the highest sampling frequencies. We also evaluated the importance of sampling frequency and estimation method on flux estimate uncertainty; flux uncertainty was dependent on the variability characteristics of each solute and varied for different reporting periods (e.g. 10-year, study period vs. annually vs. monthly). The usefulness of the two regression model based flux estimation approaches was dependent upon the amount of variance in concentrations the regression models could explain. Our results can guide the development of optimal sampling strategies by weighing sampling frequency with improvements in uncertainty in stream flux estimates for solutes with particular characteristics of variability. The appropriate flux estimation method is dependent on a combination of sampling frequency and the strength of concentration regression models. Sites: Biscuit Brook (Frost Valley, NY), Hubbard Brook Experimental Forest and LTER (West Thornton, NH), Luquillo Experimental Forest and LTER (Luquillo, Puerto Rico), Panola Mountain (Stockbridge, GA), Sleepers River Research Watershed (Danville, VT)
The intergenerational transmission of conduct problems.
Raudino, Alessandra; Fergusson, David M; Woodward, Lianne J; Horwood, L John
2013-03-01
Drawing on prospective longitudinal data, this paper examines the intergenerational transmission of childhood conduct problems in a sample of 209 parents and their 331 biological offspring studied as part of the Christchurch Health and Developmental Study. The aims were to estimate the association between parental and offspring conduct problems and to examine the extent to which this association could be explained by (a) confounding social/family factors from the parent's childhood and (b) intervening factors reflecting parental behaviours and family functioning. The same item set was used to assess childhood conduct problems in parents and offspring. Two approaches to data analysis (generalised estimating equation regression methods and latent variable structural equation modelling) were used to examine possible explanations of the intergenerational continuity in behaviour. Regression analysis suggested that there was moderate intergenerational continuity (r = 0.23, p < 0.001) between parental and offspring conduct problems. This continuity was not explained by confounding factors but was partially mediated by parenting behaviours, particularly parental over-reactivity. Latent variable modelling designed to take account of non-observed common genetic and environmental factors underlying the continuities in problem behaviours across generations also suggested that parenting behaviour played a role in mediating the intergenerational transmission of conduct problems. There is clear evidence of intergenerational continuity in conduct problems. In part this association reflects a causal chain process in which parental conduct problems are associated (directly or indirectly) with impaired parenting behaviours that in turn influence risks of conduct problems in offspring.
Weijman, I; Ros, W; Rutten, G; Schaufeli, W; Schabracq, M; Winnubst, J
2003-01-01
Aims: To examine the relations between work characteristics as defined by the Job Demand-Control-Support model (JDCS) (that is, job demands, decision latitude, and social support), diabetes related burden (symptoms, seriousness of disease, self care activities, and disease duration), and fatigue in employees with diabetes mellitus. Methods: Employees (n = 292) aged 30–60 years, with insulin treated diabetes, filled in self administered questionnaires that assess the above mentioned components of the JDCS model and diabetes related burdens. Results: Both work and diabetes related factors are related to fatigue in employees with diabetes. Regression analyses revealed that work characteristics explain 19.1% of the variance in fatigue; lack of support, and the interaction of job demands and job control contribute significantly. Diabetes related factors explain another 29.0% of the variance, with the focus on diabetes related symptoms and the burden of adjusting insulin dosage to circumstances. Fatigue is more severe in case of lack of social support at work, high job demands in combination with a lack of decision latitude, more burden of adjusting insulin dosage to circumstances, and more diabetic symptoms. Furthermore, regression analysis revealed that diabetic symptoms and the burden of adjusting the insulin dosage to circumstances are especially relevant in combination with high job demands. Conclusions: Both diabetes and work should be taken into consideration—by (occupational) physicians as well as supervisors—in the communication with people with diabetes. PMID:12782754
May, Philip A; Tabachnick, Barbara G; Gossage, J Phillip; Kalberg, Wendy O; Marais, Anna-Susan; Robinson, Luther K; Manning, Melanie; Buckley, David; Hoyme, H Eugene
2011-12-01
Previous research in South Africa revealed very high rates of fetal alcohol syndrome (FAS), of 46-89 per 1000 among young children. Maternal and child data from studies in this community summarize the multiple predictors of FAS and partial fetal alcohol syndrome (PFAS). Sequential regression was employed to examine influences on child physical characteristics and dysmorphology from four categories of maternal traits: physical, demographic, childbearing, and drinking. Then, a structural equation model (SEM) was constructed to predict influences on child physical characteristics. Individual sequential regressions revealed that maternal drinking measures were the most powerful predictors of a child's physical anomalies (R² = .30, p < .001), followed by maternal demographics (R² = .24, p < .001), maternal physical characteristics (R²=.15, p < .001), and childbearing variables (R² = .06, p < .001). The SEM utilized both individual variables and the four composite categories of maternal traits to predict a set of child physical characteristics, including a total dysmorphology score. As predicted, drinking behavior is a relatively strong predictor of child physical characteristics (β = 0.61, p < .001), even when all other maternal risk variables are included; higher levels of drinking predict child physical anomalies. Overall, the SEM model explains 62% of the variance in child physical anomalies. As expected, drinking variables explain the most variance. But this highly controlled estimation of multiple effects also reveals a significant contribution played by maternal demographics and, to a lesser degree, maternal physical and childbearing variables. Copyright © 2011 Elsevier Ireland Ltd. All rights reserved.
Seasonal and spatial patterns of erosivity in a tropical watershed of the Colombian Andes
NASA Astrophysics Data System (ADS)
Hoyos, Natalia; Waylen, Peter R.; Jaramillo, Álvaro
2005-11-01
The Dosquebradas Basin, in the central coffee growing region of Colombia, covers an area of 58 km 2 between 1350 and 2150 m of elevation, with an annual precipitation of 2600-3200 mm. Seasonal erosivity (EI30), as defined by the Revised Universal Soil Loss Equation (RUSLE), was calculated for 11 years of record (1987-1997) from six pluviographic stations located within 21 km of the basin. Regression models for each station indicated that storm rainfall explained 61-70% of the variation in storm erosivity. Individual storms represented as much as 25% of the annual EI30 (10,409-15,975 MJ mm ha -1 h -1 yr -1). At the seasonal scale, the explained variation increased to 75-86%. There was a significant difference between wet and dry seasons, with higher values and larger increases in erosivity per unit increase in rainfall during the wet seasons. Two pooled regression models, one for the wet and one for the dry seasons, were created and used to estimate seasonal erosivity for 10 stations with pluviometric data. Interpolation surfaces were created from seasonal values using the local polynomial algorithm. Spatial patterns of erosivity were related to (a) the regional elevation gradient, particularly important during the dry seasons, and (b) local topographic effects, particularly during the wet seasons. Our findings underscore the importance of using seasonal erosivity values and local rainfall intensity records in tropical mountainous regions characterized by marked rainfall seasonality and complex topography.
Agarwal, Parul; Sambamoorthi, Usha
2015-12-01
Depression is common among individuals with osteoarthritis and leads to increased healthcare burden. The objective of this study was to examine excess total healthcare expenditures associated with depression among individuals with osteoarthritis in the US. Adults with self-reported osteoarthritis (n = 1881) were identified using data from the 2010 Medical Expenditure Panel Survey (MEPS). Among those with osteoarthritis, chi-square tests and ordinary least square regressions (OLS) were used to examine differences in healthcare expenditures between those with and without depression. Post-regression linear decomposition technique was used to estimate the relative contribution of different constructs of the Anderson's behavioral model, i.e., predisposing, enabling, need, personal healthcare practices, and external environment factors, to the excess expenditures associated with depression among individuals with osteoarthritis. All analysis accounted for the complex survey design of MEPS. Depression coexisted among 20.6 % of adults with osteoarthritis. The average total healthcare expenditures were $13,684 among adults with depression compared to $9284 among those without depression. Multivariable OLS regression revealed that adults with depression had 38.8 % higher healthcare expenditures (p < 0.001) compared to those without depression. Post-regression linear decomposition analysis indicated that 50 % of differences in expenditures among adults with and without depression can be explained by differences in need factors. Among individuals with coexisting osteoarthritis and depression, excess healthcare expenditures associated with depression were mainly due to comorbid anxiety, chronic conditions and poor health status. These expenditures may potentially be reduced by providing timely intervention for need factors or by providing care under a collaborative care model.
Modeling variability in air pollution-related health damages from individual airport emissions.
Penn, Stefani L; Boone, Scott T; Harvey, Brian C; Heiger-Bernays, Wendy; Tripodis, Yorghos; Arunachalam, Sarav; Levy, Jonathan I
2017-07-01
In this study, we modeled concentrations of fine particulate matter (PM 2.5 ) and ozone (O 3 ) attributable to precursor emissions from individual airports in the United States, developing airport-specific health damage functions (deaths per 1000t of precursor emissions) and physically-interpretable regression models to explain variability in these functions. We applied the Community Multiscale Air Quality model using the Decoupled Direct Method to isolate PM 2.5 - or O 3 -related contributions from precursor pollutants emitted by 66 individual airports. We linked airport- and pollutant-specific concentrations with population data and literature-based concentration-response functions to create health damage functions. Deaths per 1000t of primary PM 2.5 emissions ranged from 3 to 160 across airports, with variability explained by population patterns within 500km of the airport. Deaths per 1000t of precursors for secondary PM 2.5 varied across airports from 0.1 to 2.7 for NOx, 0.06 to 2.9 for SO 2 , and 0.06 to 11 for VOCs, with variability explained by population patterns and ambient concentrations influencing particle formation. Deaths per 1000t of O 3 precursors ranged from -0.004 to 1.0 for NOx and 0.03 to 1.5 for VOCs, with strong seasonality and influence of ambient concentrations. Our findings reinforce the importance of location- and source-specific health damage functions in design of health-maximizing emissions control policies. Copyright © 2017 Elsevier Inc. All rights reserved.
NASA Astrophysics Data System (ADS)
Kirchner-Bossi, Nicolas; Befort, Daniel J.; Wild, Simon B.; Ulbrich, Uwe; Leckebusch, Gregor C.
2016-04-01
Time-clustered winter storms are responsible for a majority of the wind-induced losses in Europe. Over last years, different atmospheric and oceanic large-scale mechanisms as the North Atlantic Oscillation (NAO) or the Meridional Overturning Circulation (MOC) have been proven to drive some significant portion of the windstorm variability over Europe. In this work we systematically investigate the influence of different large-scale natural variability modes: more than 20 indices related to those mechanisms with proven or potential influence on the windstorm frequency variability over Europe - mostly SST- or pressure-based - are derived by means of ECMWF ERA-20C reanalysis during the last century (1902-2009), and compared to the windstorm variability for the European winter (DJF). Windstorms are defined and tracked as in Leckebusch et al. (2008). The derived indices are then employed to develop a statistical procedure including a stepwise Multiple Linear Regression (MLR) and an Artificial Neural Network (ANN), aiming to hindcast the inter-annual (DJF) regional windstorm frequency variability in a case study for the British Isles. This case study reveals 13 indices with a statistically significant coupling with seasonal windstorm counts. The Scandinavian Pattern (SCA) showed the strongest correlation (0.61), followed by the NAO (0.48) and the Polar/Eurasia Pattern (0.46). The obtained indices (standard-normalised) are selected as predictors for a windstorm variability hindcast model applied for the British Isles. First, a stepwise linear regression is performed, to identify which mechanisms can explain windstorm variability best. Finally, the indices retained by the stepwise regression are used to develop a multlayer perceptron-based ANN that hindcasted seasonal windstorm frequency and clustering. Eight indices (SCA, NAO, EA, PDO, W.NAtl.SST, AMO (unsmoothed), EA/WR and Trop.N.Atl SST) are retained by the stepwise regression. Among them, SCA showed the highest linear coefficient, followed by SST in western Atlantic, AMO and NAO. The explanatory regression model (considering all time steps) provided a Coefficient of Determination (R^2) of 0.75. A predictive version of the linear model applying a leave-one-out cross-validation (LOOCV) shows an R2 of 0.56 and a relative RMSE of 4.67 counts/season. An ANN-based nonlinear hindcast model for the seasonal windstorm frequency is developed with the aim to improve the stepwise hindcast ability and thus better predict a time-clustered season over the case study. A 7 node-hidden layer perceptron is set, and the LOOCV procedure reveals a R2 of 0.71. In comparison to the stepwise MLR the RMSE is reduced a 20%. This work shows that for the British Isles case study, most of the interannual variability can be explained by certain large-scale mechanisms, considering also nonlinear effects (ANN). This allows to discern a time-clustered season from a non-clustered one - a key issue for applications e.g., in the (re)insurance industry.
Development of land use regression models for particle composition in twenty study areas in Europe.
de Hoogh, Kees; Wang, Meng; Adam, Martin; Badaloni, Chiara; Beelen, Rob; Birk, Matthias; Cesaroni, Giulia; Cirach, Marta; Declercq, Christophe; Dėdelė, Audrius; Dons, Evi; de Nazelle, Audrey; Eeftens, Marloes; Eriksen, Kirsten; Eriksson, Charlotta; Fischer, Paul; Gražulevičienė, Regina; Gryparis, Alexandros; Hoffmann, Barbara; Jerrett, Michael; Katsouyanni, Klea; Iakovides, Minas; Lanki, Timo; Lindley, Sarah; Madsen, Christian; Mölter, Anna; Mosler, Gioia; Nádor, Gizella; Nieuwenhuijsen, Mark; Pershagen, Göran; Peters, Annette; Phuleria, Harisch; Probst-Hensch, Nicole; Raaschou-Nielsen, Ole; Quass, Ulrich; Ranzi, Andrea; Stephanou, Euripides; Sugiri, Dorothea; Schwarze, Per; Tsai, Ming-Yi; Yli-Tuomi, Tarja; Varró, Mihály J; Vienneau, Danielle; Weinmayr, Gudrun; Brunekreef, Bert; Hoek, Gerard
2013-06-04
Land Use Regression (LUR) models have been used to describe and model spatial variability of annual mean concentrations of traffic related pollutants such as nitrogen dioxide (NO2), nitrogen oxides (NOx) and particulate matter (PM). No models have yet been published of elemental composition. As part of the ESCAPE project, we measured the elemental composition in both the PM10 and PM2.5 fraction sizes at 20 sites in each of 20 study areas across Europe. LUR models for eight a priori selected elements (copper (Cu), iron (Fe), potassium (K), nickel (Ni), sulfur (S), silicon (Si), vanadium (V), and zinc (Zn)) were developed. Good models were developed for Cu, Fe, and Zn in both fractions (PM10 and PM2.5) explaining on average between 67 and 79% of the concentration variance (R(2)) with a large variability between areas. Traffic variables were the dominant predictors, reflecting nontailpipe emissions. Models for V and S in the PM10 and PM2.5 fractions and Si, Ni, and K in the PM10 fraction performed moderately with R(2) ranging from 50 to 61%. Si, NI, and K models for PM2.5 performed poorest with R(2) under 50%. The LUR models are used to estimate exposures to elemental composition in the health studies involved in ESCAPE.
Foster, Guy M.
2014-01-01
The Neosho River and its primary tributary, the Cottonwood River, are the primary sources of inflow to the John Redmond Reservoir in east-central Kansas. Sedimentation rate in the John Redmond Reservoir was estimated as 743 acre-feet per year for 1964–2006. This estimated sedimentation rate is more than 80 percent larger than the projected design sedimentation rate of 404 acre-feet per year, and resulted in a loss of 40 percent of the conservation pool since its construction in 1964. To reduce sediment input into the reservoir, the Kansas Water Office implemented stream bank stabilization techniques along an 8.3 mile reach of the Neosho River during 2010 through 2011. The U.S. Geological Survey, in cooperation with the Kansas Water Office and funded in part through the Kansas State Water Plan Fund, operated continuous real-time water-quality monitors upstream and downstream from stream bank stabilization efforts before, during, and after construction. Continuously measured water-quality properties include streamflow, specific conductance, water temperature, and turbidity. Discrete sediment samples were collected from June 2009 through September 2012 and analyzed for suspended-sediment concentration (SSC), percentage of sediments less than 63 micrometers (sand-fine break), and loss of material on ignition (analogous to amount of organic matter). Regression models were developed to establish relations between discretely measured SSC samples, and turbidity or streamflow to estimate continuously SSC. Continuous water-quality monitors represented between 96 and 99 percent of the cross-sectional variability for turbidity, and had slopes between 0.91 and 0.98. Because consistent bias was not observed, values from continuous water-quality monitors were considered representative of stream conditions. On average, turbidity-based SSC models explained 96 percent of the variance in SSC. Streamflow-based regressions explained 53 to 60 percent of the variance. Mean squared prediction error for turbidity-based regression relations ranged from -32 to 48 percent, whereas mean square prediction error for streamflow-based regressions ranged from -69 to 218 percent. These models are useful for evaluating the variability of SSC during rapidly changing conditions, computing loads and yields to assess SSC transport through the watershed, and for providing more accurate load estimates compared to streamflow-only based estimation methods used in the past. These models can be used to evaluate the efficacy of streambank stabilization efforts.
Balk, Benjamin; Elder, Kelly
2000-01-01
We model the spatial distribution of snow across a mountain basin using an approach that combines binary decision tree and geostatistical techniques. In April 1997 and 1998, intensive snow surveys were conducted in the 6.9‐km2 Loch Vale watershed (LVWS), Rocky Mountain National Park, Colorado. Binary decision trees were used to model the large‐scale variations in snow depth, while the small‐scale variations were modeled through kriging interpolation methods. Binary decision trees related depth to the physically based independent variables of net solar radiation, elevation, slope, and vegetation cover type. These decision tree models explained 54–65% of the observed variance in the depth measurements. The tree‐based modeled depths were then subtracted from the measured depths, and the resulting residuals were spatially distributed across LVWS through kriging techniques. The kriged estimates of the residuals were added to the tree‐based modeled depths to produce a combined depth model. The combined depth estimates explained 60–85% of the variance in the measured depths. Snow densities were mapped across LVWS using regression analysis. Snow‐covered area was determined from high‐resolution aerial photographs. Combining the modeled depths and densities with a snow cover map produced estimates of the spatial distribution of snow water equivalence (SWE). This modeling approach offers improvement over previous methods of estimating SWE distribution in mountain basins.
van der Zijden, A M; Groen, B E; Tanck, E; Nienhuis, B; Verdonschot, N; Weerdesteyn, V
2017-03-21
Many research groups have studied fall impact mechanics to understand how fall severity can be reduced to prevent hip fractures. Yet, direct impact force measurements with force plates are restricted to a very limited repertoire of experimental falls. The purpose of this study was to develop a generic model for estimating hip impact forces (i.e. fall severity) in in vivo sideways falls without the use of force plates. Twelve experienced judokas performed sideways Martial Arts (MA) and Block ('natural') falls on a force plate, both with and without a mat on top. Data were analyzed to determine the hip impact force and to derive 11 selected (subject-specific and kinematic) variables. Falls from kneeling height were used to perform a stepwise regression procedure to assess the effects of these input variables and build the model. The final model includes four input variables, involving one subject-specific measure and three kinematic variables: maximum upper body deceleration, body mass, shoulder angle at the instant of 'maximum impact' and maximum hip deceleration. The results showed that estimated and measured hip impact forces were linearly related (explained variances ranging from 46 to 63%). Hip impact forces of MA falls onto the mat from a standing position (3650±916N) estimated by the final model were comparable with measured values (3698±689N), even though these data were not used for training the model. In conclusion, a generic linear regression model was developed that enables the assessment of fall severity through kinematic measures of sideways falls, without using force plates. Copyright © 2017 Elsevier Ltd. All rights reserved.
Giménez, J; Manjabacas, A; Tuset, V M; Lombarte, A
2016-10-01
Regressions between fish length and otolith size are provided for 40 species from the north-eastern Atlantic Ocean and 142 species from the Mediterranean Sea. Regressions were also estimated at genus level. Most of the regressions (c. 84%) explained a high percentage of the deviance (>75%). © 2016 The Fisheries Society of the British Isles.
Patient casemix classification for medicare psychiatric prospective payment.
Drozd, Edward M; Cromwell, Jerry; Gage, Barbara; Maier, Jan; Greenwald, Leslie M; Goldman, Howard H
2006-04-01
For a proposed Medicare prospective payment system for inpatient psychiatric facility treatment, the authors developed a casemix classification to capture differences in patients' real daily resource use. Primary data on patient characteristics and daily time spent in various activities were collected in a survey of 696 patients from 40 inpatient psychiatric facilities. Survey data were combined with Medicare claims data to estimate intensity-adjusted daily cost. Classification and Regression Trees (CART) analysis of average daily routine and ancillary costs yielded several hierarchical classification groupings. Regression analysis was used to control for facility and day-of-stay effects in order to compare hierarchical models with models based on the recently proposed payment system of the Centers for Medicare & Medicaid Services. CART analysis identified a small set of patient characteristics strongly associated with higher daily costs, including age, psychiatric diagnosis, deficits in daily living activities, and detox or ECT use. A parsimonious, 16-group, fully interactive model that used five major DSM-IV categories and stratified by age, illness severity, deficits in daily living activities, dangerousness, and use of ECT explained 40% (out of a possible 76%) of daily cost variation not attributable to idiosyncratic daily changes within patients. A noninteractive model based on diagnosis-related groups, age, and medical comorbidity had explanatory power of only 32%. A regression model with 16 casemix groups restricted to using "appropriate" payment variables (i.e., those with clinical face validity and low administrative burden that are easily validated and provide proper care incentives) produced more efficient and equitable payments than did a noninteractive system based on diagnosis-related groups.
Facchinello, Yann; Beauséjour, Marie; Richard-Denis, Andreane; Thompson, Cynthia; Mac-Thiong, Jean-Marc
2017-10-25
Predicting the long-term functional outcome following traumatic spinal cord injury is needed to adapt medical strategies and to plan an optimized rehabilitation. This study investigates the use of regression tree for the development of predictive models based on acute clinical and demographic predictors. This prospective study was performed on 172 patients hospitalized following traumatic spinal cord injury. Functional outcome was quantified using the Spinal Cord Independence Measure collected within the first-year post injury. Age, delay prior to surgery and Injury Severity Score were considered as continuous predictors while energy of injury, trauma mechanisms, neurological level of injury, injury severity, occurrence of early spasticity, urinary tract infection, pressure ulcer and pneumonia were coded as categorical inputs. A simplified model was built using only injury severity, neurological level, energy and age as predictor and was compared to a more complex model considering all 11 predictors mentioned above The models built using 4 and 11 predictors were found to explain 51.4% and 62.3% of the variance of the Spinal Cord Independence Measure total score after validation, respectively. The severity of the neurological deficit at admission was found to be the most important predictor. Other important predictors were the Injury Severity Score, age, neurological level and delay prior to surgery. Regression trees offer promising performances for predicting the functional outcome after a traumatic spinal cord injury. It could help to determine the number and type of predictors leading to a prediction model of the functional outcome that can be used clinically in the future.
Sensitivity of ALOS/PALSAR imagery to forest degradation by fire in northern Amazon
NASA Astrophysics Data System (ADS)
Martins, Flora da Silva Ramos Vieira; dos Santos, João Roberto; Galvão, Lênio Soares; Xaud, Haron Abrahim Magalhães
2016-07-01
We evaluated the sensitivity of the full polarimetric Phased Array type L-band Synthetic Aperture Radar (PALSAR), onboard the Advanced Land Observing Satellite (ALOS), to forest degradation caused by fires in northern Amazon, Brazil. We searched for changes in PALSAR signal and tri-dimensional polarimetric responses for different classes of fire disturbance defined by fire frequency and severity. Since the aboveground biomass (AGB) is affected by fire, multiple regression models to estimate AGB were obtained for the whole set of coherent and incoherent attributes (general model) and for each set separately (specific models). The results showed that the polarimetric L-band PALSAR attributes were sensitive to variations in canopy structure and AGB caused by forest fire. However, except for the unburned and thrice burned classes, no single PALSAR attribute was able to discriminate between the intermediate classes of forest degradation by fire. Both the coherent and incoherent polarimetric attributes were important to explain AGB variations in tropical forests affected by fire. The HV backscattering coefficient, anisotropy, double-bounce component, orientation angle, volume index and HH-VV phase difference were PALSAR attributes selected from multiple regression analysis to estimate AGB. The general regression model, combining phase and power radar metrics, presented better results than specific models using coherent or incoherent attributes. The polarimetric responses indicated the dominance of VV-oriented backscattering in primary forest and lightly burned forests. The HH-oriented backscattering predominated in heavily and frequently burned forests. The results suggested a greater contribution of horizontally arranged constituents such as fallen trunks or branches in areas severely affected by fire.
Saraf, Sanatan; Mathew, Thomas; Roy, Anindya
2015-01-01
For the statistical validation of surrogate endpoints, an alternative formulation is proposed for testing Prentice's fourth criterion, under a bivariate normal model. In such a setup, the criterion involves inference concerning an appropriate regression parameter, and the criterion holds if the regression parameter is zero. Testing such a null hypothesis has been criticized in the literature since it can only be used to reject a poor surrogate, and not to validate a good surrogate. In order to circumvent this, an equivalence hypothesis is formulated for the regression parameter, namely the hypothesis that the parameter is equivalent to zero. Such an equivalence hypothesis is formulated as an alternative hypothesis, so that the surrogate endpoint is statistically validated when the null hypothesis is rejected. Confidence intervals for the regression parameter and tests for the equivalence hypothesis are proposed using bootstrap methods and small sample asymptotics, and their performances are numerically evaluated and recommendations are made. The choice of the equivalence margin is a regulatory issue that needs to be addressed. The proposed equivalence testing formulation is also adopted for other parameters that have been proposed in the literature on surrogate endpoint validation, namely, the relative effect and proportion explained.
On neural networks in identification and control of dynamic systems
NASA Technical Reports Server (NTRS)
Phan, Minh; Juang, Jer-Nan; Hyland, David C.
1993-01-01
This paper presents a discussion of the applicability of neural networks in the identification and control of dynamic systems. Emphasis is placed on the understanding of how the neural networks handle linear systems and how the new approach is related to conventional system identification and control methods. Extensions of the approach to nonlinear systems are then made. The paper explains the fundamental concepts of neural networks in their simplest terms. Among the topics discussed are feed forward and recurrent networks in relation to the standard state-space and observer models, linear and nonlinear auto-regressive models, linear, predictors, one-step ahead control, and model reference adaptive control for linear and nonlinear systems. Numerical examples are presented to illustrate the application of these important concepts.
Hyde, Melissa K; White, Katherine M
2010-05-01
To explore whether people's organ donation consent decisions occur via a reasoned and/or social reaction pathway. We examined prospectively students' and community members' decisions to register consent on a donor register and discuss organ donation wishes with family. Participants completed items assessing theory of planned behaviour (TPB; attitude, subjective norm, perceived behavioural control (PBC)), prototype/willingness model (PWM; donor prototype favourability/similarity, past behaviour), and proposed additional influences (moral norm, self-identity, recipient prototypes) for registering (N=339) and discussing (N=315) intentions/willingness. Participants self-reported their registering (N=177) and discussing (N=166) behaviour 1 month later. The utility of the (1) TPB, (2) PWM, (3) augmented TPB with PWM, and (4) augmented TPB with PWM and extensions was tested using structural equation modelling for registering and discussing intentions/willingness, and logistic regression for behaviour. While the TPB proved a more parsimonious model, fit indices suggested that the other proposed models offered viable options, explaining greater variance in communication intentions/willingness. The TPB, augmented TPB with PWM, and extended augmented TPB with PWM best explained registering and discussing decisions. The proposed and revised PWM also proved an adequate fit for discussing decisions. Respondents with stronger intentions (and PBC for registering) had a higher likelihood of registering and discussing. People's decisions to communicate donation wishes may be better explained via a reasoned pathway (especially for registering); however, discussing involves more reactive elements. The role of moral norm, self-identity, and prototypes as influences predicting communication decisions were highlighted also.
Relationships between Adaptive Behaviours, Personal Factors, and Participation of Young Children.
Killeen, Hazel; Shiel, Agnes; Law, Mary; O'Donovan, Donough J; Segurado, Ricardo; Anaby, Dana
2017-12-19
To examine the extent to which personal factors (age, socioeconomic grouping, and preterm birth) and adaptive behaviour explain the participation patterns of young children. 65 Children 2-5 years old with and without a history of preterm birth and no physical or intellectual disability were selected by convenience sampling from Galway University Hospital, Ireland. Interviews with parents were conducted using the Adaptive Behaviour Assessment System, Second Edition (ABAS-II) and the Assessment of Preschool Children's Participation (APCP). Linear regression models were used to identify associations between the ABAS-II scores, personal factors, and APCP scores for intensity and diversity of participation. Adaptive behaviour explained 21% of variance in intensity of play, 18% in intensity of Skill Development, 7% in intensity of Active Physical Recreation, and 6% in intensity of Social Activities controlling for age, preterm birth, and socioeconomic grouping. Age explained between 1% and 11% of variance in intensity of participation scores. Adapted behaviour (13%), Age (17%), and socioeconomic grouping (5%) explained a significant percentage of variance in diversity of participation controlling for the other variables. Adaptive behaviour had a unique contribution to children's intensity and diversity of participation, suggesting its importance.
Wolff, Dana; Fitzhugh, Eugene C.
2011-01-01
The purpose of this study was to examine relationships between weather and outdoor physical activity (PA). An online weather source was used to obtain daily max temperature [DMT], precipitation, and wind speed. An infra-red trail counter provided data on daily trail use along a greenway, over a 2-year period. Multiple regression analysis was used to examine associations between PA and weather, while controlling for day of the week and month of the year. The overall regression model explained 77.0% of the variance in daily PA (p < 0.001). DMT (b = 10.5), max temp-squared (b = −4.0), precipitation (b = −70.0), and max wind speed (b = 1.9) contributed significantly. Conclusion: Aggregated daily data can detect relationships between weather and outdoor PA. PMID:21556205
Measurements of interfacial thermal contact conductance between pressed alloys at low temperatures
NASA Astrophysics Data System (ADS)
Zheng, Jiang; Li, Yanzhong; Chen, Pengwei; Yin, Geyuan; Luo, Huaihua
2016-12-01
Interfacial thermal contact conductance is the primary factor limiting the heat transfer in many cryogenic engineering applications. This paper presents an experimental apparatus to measure interfacial thermal contact conductance between pressed alloys in a vacuum environment at low temperatures. The measurements of thermal contact conductance between pressed alloys are conducted by using the developed apparatus. The results show that the contact conductance increases with the decrease of surface roughness, the increase of interface temperature and contact pressure. The temperature dependence of thermal conductivity and mechanical properties is analyzed to explain the results. Thermal contact conductance of a pair of stainless steel specimens is obtained in the interface temperature range of 135-245 K and in the contact pressure range of 1-9 MPa. The results are regressed as a power function of temperature and load. Thermal conductance is also obtained between aluminums as well as between stainless steel and aluminum. The load exponents of the regressed relations for different contacts are compared. Existing theoretical models (the Cooper-Mikic-Yovanovich plastic model, the Mikic elastic model and the improved Kimura model) are reviewed and compared with the experimental results. The Cooper-Mikic-Yovanovich model predictions are found to be in good agreement with experimental results, especially with measurements between aluminums.
SPSS and SAS programming for the testing of mediation models.
Dudley, William N; Benuzillo, Jose G; Carrico, Mineh S
2004-01-01
Mediation modeling can explain the nature of the relation among three or more variables. In addition, it can be used to show how a variable mediates the relation between levels of intervention and outcome. The Sobel test, developed in 1990, provides a statistical method for determining the influence of a mediator on an intervention or outcome. Although interactive Web-based and stand-alone methods exist for computing the Sobel test, SPSS and SAS programs that automatically run the required regression analyses and computations increase the accessibility of mediation modeling to nursing researchers. To illustrate the utility of the Sobel test and to make this programming available to the Nursing Research audience in both SAS and SPSS. The history, logic, and technical aspects of mediation testing are introduced. The syntax files sobel.sps and sobel.sas, created to automate the computation of the regression analysis and test statistic, are available from the corresponding author. The reported programming allows the user to complete mediation testing with the user's own data in a single-step fashion. A technical manual included with the programming provides instruction on program use and interpretation of the output. Mediation modeling is a useful tool for describing the relation between three or more variables. Programming and manuals for using this model are made available.
Predicting water consumption habits for seven arsenic-safe water options in Bangladesh.
Inauen, Jennifer; Tobias, Robert; Mosler, Hans-Joachim
2013-05-01
In Bangladesh, 20 million people are at the risk of developing arsenicosis because of excessive arsenic intake. Despite increased awareness, many of the implemented arsenic-safe water options are not being sufficiently used by the population. This study investigated the role of social-cognitive factors in explaining the habitual use of arsenic-safe water options. Eight hundred seventy-two randomly selected households in six arsenic-affected districts of rural Bangladesh, which had access to an arsenic-safe water option, were interviewed using structured face-to-face interviews in November 2009. Habitual use of arsenic-safe water options, severity, vulnerability, affective and instrumental attitudes, injunctive and descriptive norms, self-efficacy, and coping planning were measured. The data were analyzed using multiple linear regressions. Linear regression revealed that self-efficacy (B = 0.42, SE = .03, p < .001), the instrumental attitude towards the safe water option (B = 0.24, SE = .04, p < .001), the affective attitude towards contaminated tube wells (B = -0.04, SE = .02, p = .024), vulnerability (B = -0.20, SE = .02, p < .001), as well as injunctive (B = 0.08, SE = 0.04, p = .049) and descriptive norms (B = 0.34, SE = .03, p < .001) primarily explained the habitual use of arsenic-safe water options (R2 = 0.688). This model proved highly generalizable to all seven arsenic-safe water options investigated, even though habitual use of single options were predicted on the basis of parameters estimated without these options. This general model for the habitual use of arsenic-safe water options may prove useful to predict other water consumption habits. Behavior-change interventions are derived from the model to promote the habitual use of arsenic-safe water options.
Maakip, Ismail; Keegel, Tessa; Oakman, Jodi
2017-04-01
Prevalence and predictors associated with musculoskeletal disorders (MSDs) vary considerably between countries. It is plausible that socio-cultural contexts may contribute to these differences. We conducted a cross-sectional survey with 1184 Malaysian and Australian office workers with the aim to examine predictors associated with MSD discomfort. The 6-month period prevalence of self-reported MSD discomfort for Malaysian office workers was 92.8% and 71.2% among Australian workers. In Malaysia, a model regressing level of musculoskeletal discomfort against possible risk factors was significant overall (F [6, 370] = 17.35; p < 0.001) and explained 22% (r = 0.46) of its variance. MSD discomfort was significantly associated with predictors that included gender (β = 14), physical (β = 0.38) and psychosocial hazards (β = -0.10), and work-life balance (β = -0.13). In Australia, the regression model is also significant (F [6, 539] = 16.47; p < 0.001) with the model explaining 15.5% (r = 0.39) of the variance in MSD discomfort. Predictors such as gender (β = 0.14), physical (β = 24) and psychosocial hazards (β = -0.17), were associated with MSD discomfort in Australian office workers. Predictors associated with MSD discomfort were similar, but their relative importance differed. Work-life balance was significantly associated with increased MSD discomfort for the Malaysian population only. Design and implementation of MSD risk management needs to take into account the work practices and culture of the target population. Crown Copyright © 2016. Published by Elsevier Ltd. All rights reserved.
Burns, Douglas A.; Aiken, George R.; Bradley, Paul M.; Journey, Celeste A.; Schelker, Jakob
2013-01-01
The Adirondack region of New York has been identified as a hot spot where high methylmercury concentrations are found in surface waters and biota, yet mercury (Hg) concentrations vary widely in this region. We collected stream and groundwater samples for Hg and organic carbon analyses across the upper Hudson River, a 493 km2 basin in the central Adirondacks to evaluate and model the sources of variation in filtered total Hg (FTHg) concentrations. Variability in FTHg concentrations during the growing seasons (May-Oct) of 2007-2009 in Fishing Brook, a 66-km2 sub-basin, was better explained by specific ultra-violet absorbance at 254 nm (SUVA254), a measure of organic carbon aromaticity, than by dissolved organic carbon (DOC) concentrations, a commonly used Hg indicator. SUVA254 was a stronger predictor of FTHg concentrations during the growing season than during the dormant season. Multiple linear regression models that included SUVA254 values and DOC concentrations could explain 75 % of the variation in FTHg concentrations on an annual basis and 84 % during the growing season. A multiple linear regression landscape modeling approach applied to 27 synoptic sites across the upper Hudson basin found that higher SUVA254 values are associated with gentler slopes, and greater riparian area, and lower SUVA254 values are associated with an increasing influence of open water. We hypothesize that the strong Hg?SUVA254 relation in this basin reflects distinct patterns of FTHg and SUVA254 that are characteristic of source areas that control the mobilization of Hg to surface waters, and that the seasonal influence of these source areas varies in this heterogeneous basin landscape.
Vila-Rodriguez, F; Ochoa, S; Autonell, J; Usall, J; Haro, J M
2011-12-01
Social functioning (SF) is the ultimate target aimed in treatment plans in schizophrenia, thus it is critical to know what are the factors that determine SF. Gender is a well-established variable influencing SF, yet it is not known how social variables and symptoms interact in schizophrenia patients. Furthermore, it remains unclear whether the interaction between social variables and symptoms is different in men compared to women. Our aim is to test whether social variables are better predictors of SF in community-dwelled individuals with schizophrenia, and whether men and women differ in how symptoms and social variables interact to impact SF. Community-dwelling individuals with schizophrenia (N = 231) were randomly selected from a register. Participants were assessed with symptom measures (PANSS), performance-based social scale (LSP), objective social and demographic variables. Stratification by gender and stepwise multivariate regression analyses by gender were used to find the best-fitting models that predict SF in both gender. Men had poorer SF than women in spite of showing similar symptom scores. On stepwise regression analyses, gender was the main variable explaining SF, with a significant contribution by disorganized and excitatory symptoms. Age of onset made a less marked, yet significant, contribution to explain SF. When the sample was stratified by gender, disorganized symptoms and 'Income' variable entered the model and accounted for a 30.8% of the SF variance in women. On the other hand, positive and disorganized symptoms entered the model and accounted for a 36.1% of the SF variance in men. Community-dwelling men and women with schizophrenia differ in the constellation of variables associated with SF. Symptom scores still account for most of the variance in SF in both genders.
Cohen, Gregory H.; Sampson, Laura A.; Fink, David S.; Wang, Jing; Russell, Dale; Gifford, Robert; Fullerton, Carol; Ursano, Robert; Galea, Sandro
2016-01-01
BACKGROUND Recent United States military operations in Iraq and Afghanistan have seen dramatic increases in the proportion of women serving, and the breadth of their occupational roles. General population studies suggest that women, compared to men, and persons with lower, as compared to higher, social position may be at greater risk of post-traumatic stress disorder (PTSD) and depression. However, these relations remain unclear in military populations. Accordingly, we aimed to estimate the effects of (1) gender, (2) military authority (i.e., rank) and (3) the interaction of gender and military authority upon: (a) risk of most-recent-deployment-related PTSD, and (b) risk of depression since most-recent-deployment. METHODS Using a nationally representative sample of 1024 previously deployed Reserve Component personnel surveyed in 2010, we constructed multivariable logistic regression models to estimate effects of interest. RESULTS Weighted multivariable logistic regression models demonstrated no statistically significant associations between gender or authority, and either PTSD or depression. Interaction models demonstrated multiplicative statistical interaction between gender and authority for PTSD (beta= −2.37;p=0.01), and depression (beta=-1.21; p=0.057). Predicted probabilities of PTSD and depression, respectively, were lowest in male officers (0.06, 0.09), followed by male enlisted (0.07, 0.14), female enlisted (0.07, 0.15), and female officers (0.30, 0.25). CONCLUSIONS Female officers in the Reserve Component may be at greatest risk for PTSD and depression following deployment, relative to their male and enlisted counterparts, and this relation is not explained by deployment trauma exposure. Future studies may fruitfully examine whether social support, family responsibilities peri-deployment, or contradictory class status may explain these findings. PMID:26899583
Schuhmacher, Nils; Collard, Jenny; Kärtner, Joscha
2017-02-01
This study analyzes temperamental and social correlates of 18-month-olds' (N=58) instrumental helping (i.e., handing over out-of-reach objects) and comforting (i.e., alleviating experimenter's distress). While out-of-reach helping as a basic type of prosocial behavior was not associated with any of the social and temperamental variables, comforting was associated with maternal responsible parenting, day care attendance, and temperamental fear, accounting for 34% of the total variance in a corresponding regression model. The data of the present study suggest that, while simple instrumental helping seems to be a robust developmental phenomenon, comforting is associated with specific social experiences and child temperament that constitute interindividual differences and thereby help to explain the domain-specific development of prosociality. Copyright © 2017 Elsevier Inc. All rights reserved.
Linking land cover and water quality in New York City's water supply watersheds.
Mehaffey, M H; Nash, M S; Wade, T G; Ebert, D W; Jones, K B; Rager, A
2005-08-01
The Catskill/Delaware reservoirs supply 90% of New York City's drinking water. The City has implemented a series of watershed protection measures, including land acquisition, aimed at preserving water quality in the Catskill/Delaware watersheds. The objective of this study was to examine how relationships between landscape and surface water measurements change between years. Thirty-two drainage areas delineated from surface water sample points (total nitrogen, total phosphorus, and fecal coliform bacteria concentrations) were used in step-wise regression analyses to test landscape and surface-water quality relationships. Two measurements of land use, percent agriculture and percent urban development, were positively related to water quality and consistently present in all regression models. Together these two land uses explained 25 to 75% of the regression model variation. However, the contribution of agriculture to water quality condition showed a decreasing trend with time as overall agricultural land cover decreased. Results from this study demonstrate that relationships between land cover and surface water concentrations of total nitrogen, total phosphorus, and fecal coliform bacteria counts over a large area can be evaluated using a relatively simple geographic information system method. Land managers may find this method useful for targeting resources in relation to a particular water quality concern, focusing best management efforts, and maximizing benefits to water quality with minimal costs.
Effect of motivational interviewing on rates of early childhood caries: a randomized trial.
Harrison, Rosamund; Benton, Tonya; Everson-Stewart, Siobhan; Weinstein, Phil
2007-01-01
The purposes of this randomized controlled trial were to: (1) test motivational interviewing (MI) to prevent early childhood caries; and (2) use Poisson regression for data analysis. A total of 240 South Asian children 6 to 18 months old were enrolled and randomly assigned to either the MI or control condition. Children had a dental exam, and their mothers completed pretested instruments at baseline and 1 and 2 years postintervention. Other covariates that might explain outcomes over and above treatment differences were modeled using Poisson regression. Hazard ratios were produced. Analyses included all participants whenever possible. Poisson regression supported a protective effect of MI (hazard ratio [HR]=0.54 (95%CI=035-0.84)-that is, the M/ group had about a 46% lower rate of dmfs at 2 years than did control children. Similar treatment effect estimates were obtained from models that included, as alternative outcomes, ds, dms, and dmfs, including "white spot lesions." Exploratory analyses revealed that rates of dmfs were higher in children whose mothers had: (1) prechewed their food; (2) been raised in a rural environment; and (3) a higher family income (P<.05). A motivational interviewing-style intervention shows promise to promote preventive behaviors in mothers of young children at high risk for caries.
Impact of previous ART and of ART initiation on outcome of HIV-associated tuberculosis.
Girardi, Enrico; Palmieri, Fabrizio; Angeletti, Claudio; Vanacore, Paola; Matteelli, Alberto; Gori, Andrea; Carbonara, Sergio; Ippolito, Giuseppe
2012-01-01
Combination antiretroviral therapy (cART) has progressively decreased mortality of HIV-associated tuberculosis .To date, however, limited data on tuberculosis treatment outcomes among coinfected patients who are not ART-naive at the time of tuberculosis diagnosis are available. A multicenter, observational study enrolled 246 HIV-infected patients diagnosed with tuberculosis, in 96 Italian infectious diseases hospital units, who started tuberculosis treatment. A polytomous logistic regression model was used to identify baseline factors associated with the outcome. A Poisson regression model was used to explain the effect of ART during tuberculosis treatment on mortality, as a time-varying covariate, adjusting for baseline characteristics. Outcomes of tuberculosis treatment were as follows: 130 (52.8%) were successfully treated, 36 (14.6%) patients died in a median time of 2 months (range: 0-16), and 80 (32.6%) had an unsuccessful outcome. Being foreign born or injecting drug users was associated with unsuccessful outcomes. In multivariable Poisson regression, cART during tuberculosis treatment decreased the risk of death, while this risk increased for those who were not ART-naive at tuberculosis diagnosis. ART during tuberculosis treatment is associated with a substantial reduction of death rate among HIV-infected patients. However, patients who are not ART-naive when they develop tuberculosis remain at elevated risk of death.
Explaining ethnic disparities in lung function among young adults: A pilot investigation
Patel, Jaymini; Minelli, Cosetta; Burney, Peter G. J.
2017-01-01
Background Ethnic disparities in lung function have been linked mainly to anthropometric factors but have not been fully explained. We conducted a cross-sectional pilot study to investigate how best to study ethnic differences in lung function in young adults and evaluate whether these could be explained by birth weight and socio-economic factors. Methods We recruited 112 university students of White and South Asian British ethnicity, measured post-bronchodilator lung function, obtained information on respiratory symptoms and socio-economic factors through questionnaires, and acquired birth weight through data linkage. We regressed lung function against ethnicity and candidate predictors defined a priori using linear regression, and used penalised regression to examine a wider range of factors. We reviewed the implications of our findings for the feasibility of a larger study. Results There was a similar parental socio-economic environment and no difference in birth weight between the two ethnic groups, but the ethnic difference in FVC adjusted for sex, age, height, demi-span, father’s occupation, birth weight, maternal educational attainment and maternal upbringing was 0.81L (95%CI: -1.01 to -0.54L). Difference in body proportions did not explain the ethnic differences although parental immigration was an important predictor of FVC independent of ethnic group. Participants were comfortable with study procedures and we were able to link birth weight data to clinical measurements. Conclusion Studies of ethnic disparities in lung function among young adults are feasible. Future studies should recruit a socially more diverse sample and investigate the role of markers of acculturation in explaining such differences. PMID:28575113
Manyaapelo, Thabang; Nyembezi, Anam; Ruiter, Robert A. C.; van den Borne, Bart; Sifunda, Sibusiso; Reddy, Priscilla
2017-01-01
South Africa leads the world with the number of people infected with HIV. Even with all attempts that have been made to curb HIV, it is still evident that new infections are on the rise. Condom use remains one of the best tools against this challenge yet a small number of sexually active men use them. This study investigates the psychosocial correlates of the intention to use condoms among young men in KwaZulu-Natal province. Using the Theory of Planned Behaviour as a framework, hierarchical linear regression models were used to determine the unique contribution of the study measures in explaining the overall variance of intention to consistently use condoms. Subjective norms and perceived behavioural control towards consistent condom use explained 46% of the variance in the intention to use a condom, suggesting that health behaviour interventions should focus on targeting the normative beliefs as well as control beliefs of the target population. Furthermore, subjective norms and intentions towards reducing alcohol and marijuana use explained an additional 7% to the final model in intentions to condom use, implying that substance use and condom usage may influence each other. No significant contributions were found for beliefs underlying cultural aspects of responsible manhood. PMID:28333100
Sherman, Brent J.; Rochelle, Gary T.
2016-12-16
Explanations for the mass transfer behavior of 2-amino-2-methyl-1-propanol (AMP) are conflicting, despite extensive study of the amine for CO 2 capture. At equilibrium, aqueous AMP reacts with CO 2 to give bicarbonate in a 1:1 ratio. While this is the same stoichiometry as a tertiary amine, the reaction rate of AMP is 100 times faster. This work aims to explain the mass transfer behavior of AMP, specifically the stoichiometry and kinetics. An eNRTL thermodynamic model was used to regress wetted-wall column mass transfer data with two activity-based reactions: formation of carbamate and formation of bicarbonate. Data spanned 40–100 C andmore » 0.15–0.60 mol CO 2/mol alk). The fitted carbamate rate constant is three orders of magnitude greater than the bicarbonate rate constant. Rapid carbamate formation explains the kinetics, while the stoichiometry is explained by the carbamate reverting in the bulk liquid to allow CO 2 to form bicarbonate. Understanding the role of carbamate formation and diffusion in hindered amines enables optimizing solvent amine concentration by balancing viscosity and free amine concentration. Furthermore, this improves absorber design for CO 2 capture.« less
The Role of Graphlets in Viral Processes on Networks
NASA Astrophysics Data System (ADS)
Khorshidi, Samira; Al Hasan, Mohammad; Mohler, George; Short, Martin B.
2018-05-01
Predicting the evolution of viral processes on networks is an important problem with applications arising in biology, the social sciences, and the study of the Internet. In existing works, mean-field analysis based upon degree distribution is used for the prediction of viral spreading across networks of different types. However, it has been shown that degree distribution alone fails to predict the behavior of viruses on some real-world networks and recent attempts have been made to use assortativity to address this shortcoming. In this paper, we show that adding assortativity does not fully explain the variance in the spread of viruses for a number of real-world networks. We propose using the graphlet frequency distribution in combination with assortativity to explain variations in the evolution of viral processes across networks with identical degree distribution. Using a data-driven approach by coupling predictive modeling with viral process simulation on real-world networks, we show that simple regression models based on graphlet frequency distribution can explain over 95% of the variance in virality on networks with the same degree distribution but different network topologies. Our results not only highlight the importance of graphlets but also identify a small collection of graphlets which may have the highest influence over the viral processes on a network.
A Decade in Dental Care Utilization among Adults and Children (2001–2010)
Vujicic, Marko; Nasseh, Kamyar
2014-01-01
Objective To decompose the change in pediatric and adult dental care utilization over the last decade. Data 2001 through 2010 Medical Expenditure Panel Survey. Study Design The Blinder-Oaxaca decomposition was used to explain the change in dental care utilization among adults and children. Changes in dental care utilization were attributed to changes in explained covariates and changes due to movements in estimated coefficients. Controlling for demographics, overall health status, and dental benefits variables, we estimated year-specific logistic regression models. Outputs from these models were used to compute the Blinder-Oaxaca decomposition. Principal Findings Dental care utilization decreased from 40.5 percent in 2001 to 37.0 percent in 2010 for adults and increased from 43.2 percent in 2001 to 46.3 percent in 2010 for children (p < .05). Among adults, changes in insurance status, race, and income contributed to a decline in adult dental care utilization (−0.018, p < .01). Among children, changes in controlled factors did not substantially change dental care utilization, which instead may be explained by changes in policy, oral health status, or preferences. Conclusions Dental care utilization for adults has declined, especially among the poor and uninsured. Without further policy intervention, disadvantaged adults face increasing barriers to dental care. PMID:24299620
Shen, Yong; Yu, Shixiao; Lian, Juyu; Shen, Hao; Cao, Honglin; Lu, Huanping; Ye, Wanhui
2016-01-01
Tropical forests play a disproportionately important role in the global carbon (C) cycle, but it remains unclear how local environments and functional diversity regulate tree aboveground C storage. We examined how three components (environments, functional dominance and diversity) affected C storage in Dinghushan 20-ha plot in China. There was large fine-scale variation in C storage. The three components significantly contributed to regulate C storage, but dominance and diversity of traits were associated with C storage in different directions. Structural equation models (SEMs) of dominance and diversity explained 34% and 32% of variation in C storage. Environments explained 26–44% of variation in dominance and diversity. Similar proportions of variation in C storage were explained by dominance and diversity in regression models, they were improved after adding environments. Diversity of maximum diameter was the best predictor of C storage. Complementarity and selection effects contributed to C storage simultaneously, and had similar importance. The SEMs disengaged the complex relationships among the three components and C storage, and established a framework to show the direct and indirect effects (via dominance and diversity) of local environments on C storage. We concluded that local environments are important for regulating functional diversity and C storage. PMID:27278688
Lundy, J Jason; Coons, Stephen Joel; Wendel, Christopher; Hornbrook, Mark C; Herrinton, Lisa; Grant, Marcia; Krouse, Robert S
2009-03-01
The purpose of this analysis was to determine the unique contribution of household income to the variance explained in psychological well-being (PWB) among a sample of colorectal cancer (CRC) survivors. This study is a secondary analysis of data collected as part of the Health-Related Quality of Life in Long-Term Colorectal Cancer Survivors Study, which included CRC survivors with (cases) and without (controls) ostomies. The dataset included socio-demographic, health status, and health-related quality of life (HRQOL) information. HRQOL was assessed with the modified City of Hope Quality of Life (mCOH-QOL)-Ostomy questionnaire and SF-36v2. To assess the relationship between income and PWB, a hierarchical linear regression model was constructed combining data from both cases and controls. After accounting for the proportion of variance in PWB explained by the other independent variables in the model, the additional variance explained by income was significant (R (2) increased from 0.228 to 0.250; P = 0.006). Although the study design does not allow causal inference, these results demonstrate a significant relationship between income and PWB in CRC survivors. The findings suggest that for non-randomized group comparisons of HRQOL, income should, at the very least, be included as a control variable in the analysis.
Spatio-temporal prediction of daily temperatures using time-series of MODIS LST images
NASA Astrophysics Data System (ADS)
Hengl, Tomislav; Heuvelink, Gerard B. M.; Perčec Tadić, Melita; Pebesma, Edzer J.
2012-01-01
A computational framework to generate daily temperature maps using time-series of publicly available MODIS MOD11A2 product Land Surface Temperature (LST) images (1 km resolution; 8-day composites) is illustrated using temperature measurements from the national network of meteorological stations (159) in Croatia. The input data set contains 57,282 ground measurements of daily temperature for the year 2008. Temperature was modeled as a function of latitude, longitude, distance from the sea, elevation, time, insolation, and the MODIS LST images. The original rasters were first converted to principal components to reduce noise and filter missing pixels in the LST images. The residual were next analyzed for spatio-temporal auto-correlation; sum-metric separable variograms were fitted to account for zonal and geometric space-time anisotropy. The final predictions were generated for time-slices of a 3D space-time cube, constructed in the R environment for statistical computing. The results show that the space-time regression model can explain a significant part of the variation in station-data (84%). MODIS LST 8-day (cloud-free) images are unbiased estimator of the daily temperature, but with relatively low precision (±4.1°C); however their added value is that they systematically improve detection of local changes in land surface temperature due to local meteorological conditions and/or active heat sources (urban areas, land cover classes). The results of 10-fold cross-validation show that use of spatio-temporal regression-kriging and incorporation of time-series of remote sensing images leads to significantly more accurate maps of temperature than if plain spatial techniques were used. The average (global) accuracy of mapping temperature was ±2.4°C. The regression-kriging explained 91% of variability in daily temperatures, compared to 44% for ordinary kriging. Further software advancement—interactive space-time variogram exploration and automated retrieval, resampling and filtering of MODIS images—are anticipated.
Schleicher, Rosemary L; Sternberg, Maya R; Pfeiffer, Christine M
2016-01-01
Sociodemographic and lifestyle factors exert important influences on nutritional status; however, information on their association with biomarkers of fat-soluble nutrients is limited, particularly in a representative sample of adults. Serum or plasma concentrations of vitamin A (VIA), vitamin E (VIE), carotenes (CAR), xanthophylls (XAN), 25-hydroxyvitamin D (25OHD), saturated- (SFA), monounsaturated- (MUFA), polyunsaturated- (PUFA) and total fatty acids (tFA) were measured in adults (≥20 y) during all or part of NHANES 2003–2006. Simple and multiple linear regression were used to assess 5 sociodemographic variables (age, sex, race-ethnicity, education, income) and 5 lifestyle behaviors (smoking, alcohol consumption, BMI, physical activity, supplement use) and their relation to biomarker concentrations. Adjustment for total serum cholesterol and lipid-altering drug use was added to the full regression model. Adjustment for latitude and season was added to the full model for 25OHD. Based on simple linear regression, race-ethnicity, BMI and supplement use were significantly related to all fat-soluble biomarkers. Sociodemographic variables as a groupexplained 5–17% of biomarker variability, whereas together, sociodemographic and lifestyle variables explained 22–23% (25OHD, VIE, XAN), 17% (VIA), 15% (MUFA), 10–11% (SFA, CAR, tFA) and 6% (PUFA). Although lipid adjustment explained additional variability for all biomarkers except 25OHD, it appeared to be largely independent of sociodemographic and lifestyle variables. After adjusting for sociodemographic, lifestyle and lipid-related variables, major differences in biomarkers were associated with race-ethnicity (from −44% to 57%); smoking (up to −25%); supplement use (up to 21%); and BMI (up to −15%). Latitude and season attenuated some race-ethnic differences. Of the sociodemographic and lifestyle variables examined, with or without lipid-adjustment, most fat-soluble nutrient biomarkers were significantly associated with race-ethnicity. PMID:23596163
Jarvis, J; Seed, M; Elton, R; Sawyer, L; Agius, R
2005-01-01
Aims: To investigate quantitatively, relationships between chemical structure and reported occupational asthma hazard for low molecular weight (LMW) organic compounds; to develop and validate a model linking asthma hazard with chemical substructure; and to generate mechanistic hypotheses that might explain the relationships. Methods: A learning dataset used 78 LMW chemical asthmagens reported in the literature before 1995, and 301 control compounds with recognised occupational exposures and hazards other than respiratory sensitisation. The chemical structures of the asthmagens and control compounds were characterised by the presence of chemical substructure fragments. Odds ratios were calculated for these fragments to determine which were associated with a likelihood of being reported as an occupational asthmagen. Logistic regression modelling was used to identify the independent contribution of these substructures. A post-1995 set of 21 asthmagens and 77 controls were selected to externally validate the model. Results: Nitrogen or oxygen containing functional groups such as isocyanate, amine, acid anhydride, and carbonyl were associated with an occupational asthma hazard, particularly when the functional group was present twice or more in the same molecule. A logistic regression model using only statistically significant independent variables for occupational asthma hazard correctly assigned 90% of the model development set. The external validation showed a sensitivity of 86% and specificity of 99%. Conclusions: Although a wide variety of chemical structures are associated with occupational asthma, bifunctional reactivity is strongly associated with occupational asthma hazard across a range of chemical substructures. This suggests that chemical cross-linking is an important molecular mechanism leading to the development of occupational asthma. The logistic regression model is freely available on the internet and may offer a useful but inexpensive adjunct to the prediction of occupational asthma hazard. PMID:15778257
NASA Astrophysics Data System (ADS)
Walawender, Ewelina; Walawender, Jakub P.; Ustrnul, Zbigniew
2017-02-01
The main purpose of the study is to introduce methods for mapping the spatial distribution of the occurrence of selected atmospheric phenomena (thunderstorms, fog, glaze and rime) over Poland from 1966 to 2010 (45 years). Limited in situ observations as well the discontinuous and location-dependent nature of these phenomena make traditional interpolation inappropriate. Spatially continuous maps were created with the use of geospatial predictive modelling techniques. For each given phenomenon, an algorithm identifying its favourable meteorological and environmental conditions was created on the basis of observations recorded at 61 weather stations in Poland. Annual frequency maps presenting the probability of a day with a thunderstorm, fog, glaze or rime were created with the use of a modelled, gridded dataset by implementing predefined algorithms. Relevant explanatory variables were derived from NCEP/NCAR reanalysis and downscaled with the use of a Regional Climate Model. The resulting maps of favourable meteorological conditions were found to be valuable and representative on the country scale but at different correlation ( r) strength against in situ data (from r = 0.84 for thunderstorms to r = 0.15 for fog). A weak correlation between gridded estimates of fog occurrence and observations data indicated the very local nature of this phenomenon. For this reason, additional environmental predictors of fog occurrence were also examined. Topographic parameters derived from the SRTM elevation model and reclassified CORINE Land Cover data were used as the external, explanatory variables for the multiple linear regression kriging used to obtain the final map. The regression model explained 89 % of annual frequency of fog variability in the study area. Regression residuals were interpolated via simple kriging.
Memory complaints in epilepsy: An examination of the role of mood and illness perceptions.
Tinson, Deborah; Crockford, Christopher; Gharooni, Sara; Russell, Helen; Zoeller, Sophie; Leavy, Yvonne; Lloyd, Rachel; Duncan, Susan
2018-03-01
The study examined the role of mood and illness perceptions in explaining the variance in the memory complaints of patients with epilepsy. Forty-four patients from an outpatient tertiary care center and 43 volunteer controls completed a formal assessment of memory and a verbal fluency test, as well as validated self-report questionnaires on memory complaints, mood, and illness perceptions. In hierarchical multiple regression analyses, objective memory test performance and verbal fluency did not contribute significantly to the variance in memory complaints for either patients or controls. In patients, illness perceptions and mood were highly correlated. Illness perceptions correlated more highly with memory complaints than mood and were therefore added to the multiple regression analysis. This accounted for an additional 25% of the variance, after controlling for objective memory test performance and verbal fluency, and the model was significant (model B). In order to compare with other studies, mood was added to a second model, instead of illness perceptions. This accounted for an additional 24% of the variance, which was again significant (model C). In controls, low mood accounted for 11% of the variance in memory complaints (model C2). A measure of illness perceptions was more highly correlated with the memory complaints of patients with epilepsy than with a measure of mood. In a hierarchical multiple regression model, illness perceptions accounted for 25% of the variance in memory complaints. Illness perceptions could provide useful information in a clinical investigation into the self-reported memory complaints of patients with epilepsy, alongside the assessment of mood and formal memory testing. Copyright © 2017 Elsevier Inc. All rights reserved.
Genetic predisposition scores associate with muscular strength, size, and trainability.
Thomaes, Tom; Thomis, Martine; Onkelinx, Steven; Goetschalckx, Kaatje; Fagard, Robert; Lambrechts, Diether; Vanhees, Luc
2013-08-01
The number of studies trying to identify genetic sequence variation related to muscular phenotypes has increased enormously. The aim of this study was to identify the role of a genetic predisposition score (GPS) based on earlier identified gene variants for different muscular endophenotypes to explain the individual differences in muscular fitness characteristics and the response to training in patients with coronary artery disease. Two hundred and sixty coronary artery disease patients followed a standard ambulatory, 3-month supervised training program for cardiac patients. Maximal knee extension strength (KES) and rectus femoris diameter were measured at baseline and after rehabilitation. Sixty-five single nucleotide polymorphisms (SNP) in 30 genes were selected based on genotype-phenotype association literature. Backward regression analysis revealed subsets of SNP associated with the different phenotypes. GPS were constructed for all sets of SNP by adding up the strength-increasing alleles. General linear models and multiple stepwise regression analysis were used to test the explained variance of the GPS in baseline and strength responses. Receiver operating characteristic curve analyses were performed to discriminate between high- and low-responder status. GPS were significantly associated with the rectus femoris diameter (P < 0.01) and its response (P < 0.0001), the isometric KES (P < 0.05) and its response (P < 0.01), the isokinetic KES at 60° · s (P < 0.05) and 180° · s (P < 0.001) and their responses to training (P < 0.0001), and the isokinetic KES endurance (P < 0.001) and its change after training (P < 0.0001). The GPS was shown as an independent determinant in baseline and response phenotypes with partial explained variance up to 23%. Receiver operating characteristic analysis showed a significant discriminating accuracy of the models, including the GPS for responses to training, with areas under the curve ranging from 0.62 to 0.85. GPS for muscular phenotypes showed to be associated with baseline KES, muscle diameter, and the response to training in cardiac rehabilitation patients.
Rantalainen, T; Valtonen, A; Sipilä, S; Pöyhönen, T; Heinonen, A
2012-03-01
It is currently unknown whether knee replacement-associated bone loss is modified by rehabilitation programs. Thus, a sample of 45 (18 men and 25 women) persons with unilateral knee replacement were recruited; age 66 years (sd 6), height 169 cm (sd 8), body mass 83 kg (sd 15), time since operation 10 months (sd 4) to explore the associations between maximal torque/power in knee extension/flexion and femoral mid-shaft bone traits (Cortical cross-sectional area (CoA, mm(2)), cortical volumetric bone mineral density (CoD, mg/mm(3)) and bone bending strength index (SSI, mm(3))). Bone traits were calculated from a single computed tomography slice from the femoral mid-shaft. Pain in the operated knee was assessed with the WOMAC questionnaire. Stepwise regression models were built for the operated leg bone traits, with knee extension and flexion torque and power, age, height, body mass, pain score and time since operation as independent variables. CoA was 2.3% (P=0.015), CoD 1.2% (P<0.001) and SSI 1.6% (P=0.235) lower in the operated compared to non-operated leg. The overall proportions of the variation explained by the regression models were 50%, 29% and 55% for CoA, CoD and SSI, respectively. Body mass explained 12% of Coa, 11% of CoD and 11% of SSI (P≤0.003). Maximal knee flexion torque explained 38% of Coa, 7% of CoD and 44% of SSI (p≤0.047). For CoD time since operation also became a significant predictor (11%, P=0.045). Knee flexion torque of the operated leg was positively associated with bone strength in the operated leg. Thus, successful rehabilitation may diminish bone loss in the operated leg. Copyright © 2011 Elsevier B.V. All rights reserved.
Kontopantelis, Evangelos; Mamas, Mamas A; van Marwijk, Harm; Ryan, Andrew M; Bower, Peter; Guthrie, Bruce; Doran, Tim
2018-02-14
Primary care provides the foundation for most modern health-care systems, and in the interests of equity, it should be resourced according to local need. We aimed to describe spatially the burden of chronic conditions and primary medical care funding in England at a low geographical level, and to measure how much variation in funding is explained by chronic condition prevalence and other patient and regional factors. We used multiple administrative data sets including chronic condition prevalence and management data (2014/15), funding for primary-care practices (2015-16), and geographical and area deprivation data (2015). Data were assigned to a low geographical level (average 1500 residents). We investigated the overall morbidity burden across 19 chronic conditions and its regional variation, spatial clustering and association with funding and area deprivation. A linear regression model was used to explain local variation in spending using patient demographics, morbidity, deprivation and regional characteristics. Levels of morbidity varied within and between regions, with several clusters of very high morbidity identified. At the regional level, morbidity was modestly associated with practice funding, with the North East and North West appearing underfunded. The regression model explained 39% of the variability in practice funding, but even after adjusting for covariates, a large amount of variability in funding existed across regions. High morbidity and, especially, rural location were very strongly associated with higher practice funding, while associations were more modest for high deprivation and older age. Primary care funding in England does not adequately reflect the contemporary morbidity burden. More equitable resource allocation could be achieved by making better use of routinely available information and big data resources. Similar methods could be deployed in other countries where comparable data are collected, to identify morbidity clusters and to target funding to areas of greater need.
Modeling Drought Impact Occurrence Based on Climatological Drought Indices for Europe
NASA Astrophysics Data System (ADS)
Stagge, J. H.; Kohn, I.; Tallaksen, L. M.; Stahl, K.
2014-12-01
Meteorological drought indices are often assumed to accurately characterize the severity of a drought event; however, these indices do not necessarily reflect the likelihood or severity of a particular type of drought impact experienced on the ground. In previous research, this link between index and impact was often estimated based on thresholds found by experience, measured using composite indices with assumed weighting schemes, or defined based on very narrow impact measures, using either a narrow spatial extent or very specific impacts. This study expands on earlier work by demonstrating the feasibility of relating user-provided impact reports to the climatological drought indices SPI and SPEI by logistic regression. The user-provided drought impact reports are based on the European Drought Impact Inventory (EDII, www.geo.uio.no/edc/droughtdb/), a newly developed online database that allows both public report submission and querying the more than 4,000 reported impacts spanning 33 European countries. This new tool is used to quantify the link between meteorological drought indices and impacts focusing on four primary impact types, spanning agriculture, energy and industry, public water supply, and freshwater ecosystem across five European countries. Statistically significant climate indices are retained as predictors using step-wise regression and used to compare the most relevant drought indices and accumulation periods for different impact types and regions. Agricultural impacts are explained best by 2-12 month anomalies, with 2-3 month anomalies found in predominantly rain-fed agricultural regions, and anomalies greater than 3 months related to agricultural management practices. Energy and industry impacts, related to hydropower and energy cooling water in these countries, respond to longer accumulated precipitation anomalies (6-12 months). Public water supply and freshwater ecosystem impacts are explained by a more complex combination of short (1-3 month) and seasonal (6-12 month) anomalies. A mean of 47.0% (22.4-71.6%) impact deviance is explained by the resulting models, highlighting the feasibility of using such statistical techniques and drought impact databases to model drought impact likelihood based on relatively easily calculated meteorological drought indices.
Eeftens, Marloes; Meier, Reto; Schindler, Christian; Aguilera, Inmaculada; Phuleria, Harish; Ineichen, Alex; Davey, Mark; Ducret-Stich, Regina; Keidel, Dirk; Probst-Hensch, Nicole; Künzli, Nino; Tsai, Ming-Yi
2016-04-18
Land Use Regression (LUR) is a popular method to explain and predict spatial contrasts in air pollution concentrations, but LUR models for ultrafine particles, such as particle number concentration (PNC) are especially scarce. Moreover, no models have been previously presented for the lung deposited surface area (LDSA) of ultrafine particles. The additional value of ultrafine particle metrics has not been well investigated due to lack of exposure measurements and models. Air pollution measurements were performed in 2011 and 2012 in the eight areas of the Swiss SAPALDIA study at up to 40 sites per area for NO2 and at 20 sites in four areas for markers of particulate air pollution. We developed multi-area LUR models for biannual average concentrations of PM2.5, PM2.5 absorbance, PM10, PMcoarse, PNC and LDSA, as well as alpine, non-alpine and study area specific models for NO2, using predictor variables which were available at a national level. Models were validated using leave-one-out cross-validation, as well as independent external validation with routine monitoring data. Model explained variance (R(2)) was moderate for the various PM mass fractions PM2.5 (0.57), PM10 (0.63) and PMcoarse (0.45), and was high for PM2.5 absorbance (0.81), PNC (0.87) and LDSA (0.91). Study-area specific LUR models for NO2 (R(2) range 0.52-0.89) outperformed combined-area alpine (R (2) = 0.53) and non-alpine (R (2) = 0.65) models in terms of both cross-validation and independent external validation, and were better able to account for between-area variability. Predictor variables related to traffic and national dispersion model estimates were important predictors. LUR models for all pollutants captured spatial variability of long-term average concentrations, performed adequately in validation, and could be successfully applied to the SAPALDIA cohort. Dispersion model predictions or area indicators served well to capture the between area variance. For NO2, applying study-area specific models was preferable over applying combined-area alpine/non-alpine models. Correlations between pollutants were higher in the model predictions than in the measurements, so it will remain challenging to disentangle their health effects.
Artes, Paul H; Crabb, David P
2010-01-01
To investigate why the specificity of the Moorfields Regression Analysis (MRA) of the Heidelberg Retina Tomograph (HRT) varies with disc size, and to derive accurate normative limits for neuroretinal rim area to address this problem. Two datasets from healthy subjects (Manchester, UK, n = 88; Halifax, Nova Scotia, Canada, n = 75) were used to investigate the physiological relationship between the optic disc and neuroretinal rim area. Normative limits for rim area were derived by quantile regression (QR) and compared with those of the MRA (derived by linear regression). Logistic regression analyses were performed to quantify the association between disc size and positive classifications with the MRA, as well as with the QR-derived normative limits. In both datasets, the specificity of the MRA depended on optic disc size. The odds of observing a borderline or outside-normal-limits classification increased by approximately 10% for each 0.1 mm(2) increase in disc area (P < 0.1). The lower specificity of the MRA with large optic discs could be explained by the failure of linear regression to model the extremes of the rim area distribution (observations far from the mean). In comparison, the normative limits predicted by QR were larger for smaller discs (less specific, more sensitive), and smaller for larger discs, such that false-positive rates became independent of optic disc size. Normative limits derived by quantile regression appear to remove the size-dependence of specificity with the MRA. Because quantile regression does not rely on the restrictive assumptions of standard linear regression, it may be a more appropriate method for establishing normative limits in other clinical applications where the underlying distributions are nonnormal or have nonconstant variance.
Comprehension of texts by deaf elementary school students: The role of grammatical understanding.
Barajas, Carmen; González-Cuenca, Antonia M; Carrero, Francisco
2016-12-01
The aim of this study was to analyze how the reading process of deaf Spanish elementary school students is affected both by those components that explain reading comprehension according to the Simple View of Reading model: decoding and linguistic comprehension (both lexical and grammatical) and by other variables that are external to the reading process: the type of assistive technology used, the age at which it is implanted or fitted, the participant's socioeconomic status and school stage. Forty-seven students aged between 6 and 13 years participated in the study; all presented with profound or severe prelingual bilateral deafness, and all used digital hearing aids or cochlear implants. Students' text comprehension skills, decoding skills and oral comprehension skills (both lexical and grammatical) were evaluated. Logistic regression analysis indicated that neither the type of assistive technology, age at time of fitting or activation, socioeconomic status, nor school stage could predict the presence or absence of difficulties in text comprehension. Furthermore, logistic regression analysis indicated that neither decoding skills, nor lexical age could predict competency in text comprehension; however, grammatical age could explain 41% of the variance. Probing deeper into the effect of grammatical understanding, logistic regression analysis indicated that a participant's understanding of reversible passive object-verb-subject sentences and reversible predicative subject-verb-object sentences accounted for 38% of the variance in text comprehension. Based on these results, we suggest that it might be beneficial to devise and evaluate interventions that focus specifically on grammatical comprehension. Copyright © 2016 Elsevier Ltd. All rights reserved.
NASA Astrophysics Data System (ADS)
Aguilera, Fátima; Fornaciari, Marco; Ruiz-Valenzuela, Luis; Galán, Carmen; Msallem, Monji; Dhiab, Ali Ben; la Guardia, Consuelo Díaz-de; del Mar Trigo, María; Bonofiglio, Tommaso; Orlandi, Fabio
2015-05-01
The aim of the present study was to develop pheno-meteorological models to explain and forecast the main olive flowering phenological phases within the Mediterranean basin, across a latitudinal and longitudinal gradient that includes Tunisia, Spain, and Italy. To analyze the aerobiological sampling points, study periods from 13 years (1999-2011) to 19 years (1993-2011) were used. The forecasting models were constructed using partial least-squares regression, considering both the flowering start and full-flowering dates as dependent variables. The percentages of variance explained by the full-flowering models (mean 84 %) were greater than those explained by the flowering start models (mean 77 %). Moreover, given the time lag from the North African areas to the central Mediterranean areas in the main olive flowering dates, the regional full-flowering predictive models are proposed as the most useful to improve the knowledge of the influence of climate on the olive tree floral phenology. The meteorological parameters related to the previous autumn and both the winter and the spring seasons, and above all the temperatures, regulate the reproductive phenology of olive trees in the Mediterranean area. The mean anticipation of flowering start and full flowering for the future period from 2081 to 2100 was estimated at 10 and 12 days, respectively. One question can be raised: Will the olive trees located in the warmest areas be northward displaced or will they be able to adapt their physiology in response to the higher temperatures? The present study can be considered as an approach to design more detailed future bioclimate research.
WASP (Write a Scientific Paper) using Excel - 13: Correlation and Regression.
Grech, Victor
2018-07-01
Correlation and regression measure the closeness of association between two continuous variables. This paper explains how to perform these tests in Microsoft Excel and their interpretation, as well as how to apply these tests dynamically using Excel's functions. Copyright © 2018 Elsevier B.V. All rights reserved.
Patient satisfaction in Dental Healthcare Centers.
Ali, Dena A
2016-01-01
This study aimed to (1) measure the degree of patient satisfaction among the clinical and nonclinical dental services offered at specialty dental centers and (2) investigate the factors associated with the degree of overall satisfaction. Four hundred and ninety-seven participants from five dental centers were recruited for this study. Each participant completed a self-administered questionnaire to measure patient satisfaction with clinical and nonclinical dental services. Analysis of variance, t-tests, a general linear model, and stepwise regression analysis was applied. The respondents were generally satisfied, but internal differences were observed. The exhibited highest satisfaction with the dentists' performance, followed by the dental assistants' services, and the lowest satisfaction with the center's physical appearance and accessibility. Females, participants with less than a bachelor's degree, and younger individuals were more satisfied with the clinical and nonclinical dental services. The stepwise regression analysis revealed that the coefficient of determination (R (2)) was 40.4%. The patient satisfaction with the performance of the dentists explained 42.6% of the overall satisfaction, whereas their satisfaction with the clinical setting explained 31.5% of the overall satisfaction. Additional improvements with regard to the accessibility and physical appearance of the dental centers are needed. In addition, interventions regarding accessibility, particularly when booking an appointment, are required.
Modelling ecological flow regime: an example from the Tennessee and Cumberland River basins
Knight, Rodney R.; Gain, W. Scott; Wolfe, William J.
2012-01-01
Predictive equations were developed for 19 ecologically relevant streamflow characteristics within five major groups of flow variables (magnitude, ratio, frequency, variability, and date) for use in the Tennessee and Cumberland River basins using stepbackward regression. Basin characteristics explain 50% or more of the variation for 12 of the 19 equations. Independent variables identified through stepbackward regression were statistically significant in 78 of 304 cases (α > 0.0001) and represent four major groups: climate, physical landscape features, regional indicators, and land use. Of these groups, the regional and climate variables were the most influential for determining hydrologic response. Daily temperature range, geologic factor, and rock depth were major factors explaining the variability in 17, 15, and 13 equations, respectively. The equations and independent datasets were used to explore the broad relation between basin properties and streamflow and the implication of streamflow to the study of ecological flow requirements. Key results include a high degree of hydrologic variability among least disturbed Blue Ridge streams, similar hydrologic behaviour for watersheds with widely varying degrees of forest cover, and distinct hydrologic profiles for streams in different geographic regions. Published in 2011. This article is a US Government work and is in the public domain in the USA.
Psychophysiological responses to competition and the big five personality traits.
Binboga, Erdal; Guven, Senol; Catıkkaş, Fatih; Bayazıt, Onur; Tok, Serdar
2012-06-01
This study examines the relationship between psychophysiological arousal, cognitive anxiety, and personality traits in young taekwondo athletes. A total of 20 male and 10 female taekwondo athletes (mean age = 18.6 years; ± 1.8) volunteered for the study. The Five Factor Personality Inventory and the state scale of the Spielberger State-Trait Anxiety Inventory (STAI) were used to measure personality and cognitive state anxiety. Electrodermal activity (EDA) was measured twice, one day and approximately one hour prior to the competition, to determine psychophysiological arousal. Descriptive statistics, Pearson product-moment correlations, and stepwise regression were used to analyze the data. Several "Big Five" facets were related to the EDA delta scores that were measured both one day and one hour before the competition. Two stepwise regressions were conducted to examine whether personality traits could significantly predict both EDA delta scores. The final model, containing only neuroticism from the Big Five factors, can significantly explain the variations in the EDA delta scores measured one day before the competition. Agreeableness can significantly explain variations in the EDA delta scores measured one hour before the competition. No relationship was found between cognitive anxiety and the EDA delta scores measured one hour before the competition. In conclusion, personality traits, especially agreeableness and neuroticism, might be useful in understanding arousal responses to competition.
Psychophysiological Responses to Competition and the Big Five Personality Traits
Binboga, Erdal; Guven, Senol; Çatıkkaş, Fatih; Bayazıt, Onur; Tok, Serdar
2012-01-01
This study examines the relationship between psychophysiological arousal, cognitive anxiety, and personality traits in young taekwondo athletes. A total of 20 male and 10 female taekwondo athletes (mean age = 18.6 years; ± 1.8) volunteered for the study. The Five Factor Personality Inventory and the state scale of the Spielberger State-Trait Anxiety Inventory (STAI) were used to measure personality and cognitive state anxiety. Electrodermal activity (EDA) was measured twice, one day and approximately one hour prior to the competition, to determine psychophysiological arousal. Descriptive statistics, Pearson product-moment correlations, and stepwise regression were used to analyze the data. Several “Big Five” facets were related to the EDA delta scores that were measured both one day and one hour before the competition. Two stepwise regressions were conducted to examine whether personality traits could significantly predict both EDA delta scores. The final model, containing only neuroticism from the Big Five factors, can significantly explain the variations in the EDA delta scores measured one day before the competition. Agreeableness can significantly explain variations in the EDA delta scores measured one hour before the competition. No relationship was found between cognitive anxiety and the EDA delta scores measured one hour before the competition. In conclusion, personality traits, especially agreeableness and neuroticism, might be useful in understanding arousal responses to competition. PMID:23486906
Medical team interdependence as a determinant of use of clinical resources.
Sicotte, C; Pineault, R; Lambert, J
1993-01-01
OBJECTIVE. Our objective, based on organization theory, is to examine whether interdependence among physicians leads to coordination problems that in turn may explain variations observed in the use of clinical resources. DATA SOURCES/STUDY SETTING. Secondary data about episodes of in-hospital care were collected over a 14-month period in two midsize acute care hospitals located in two suburbs of Montreal, Quebec. STUDY DESIGN. Hierarchical regression analysis was used to assess the marginal effect of medical team interdependence on clinical resource utilization after taking into account the effect attributable to the nature of several morbidities taken as specific and distinct tasks. PRINCIPAL FINDINGS. Medical team interdependence is found within medical specialties as well as between specialties. The largest portion of resource utilization was explained by morbidity characteristics, whereas team interdependence had a weaker, but systematic effect for all morbidities studied (15 regression models out of 18 performed). Task coordination was found to become more difficult as the number of physicians coming from different specialties increased in the context of teamwork. CONCLUSIONS. Results suggest that team practice does not entirely overcome coordination problems inherent to task (morbidity) interdependence. In considering the individual (especially the attending) physician as the main factor responsible for resource utilization, other factors related to team practice may too readily be overlooked. PMID:8270423
Patient satisfaction in Dental Healthcare Centers
Ali, Dena A.
2016-01-01
Objectives: This study aimed to (1) measure the degree of patient satisfaction among the clinical and nonclinical dental services offered at specialty dental centers and (2) investigate the factors associated with the degree of overall satisfaction. Materials and Methods: Four hundred and ninety-seven participants from five dental centers were recruited for this study. Each participant completed a self-administered questionnaire to measure patient satisfaction with clinical and nonclinical dental services. Analysis of variance, t-tests, a general linear model, and stepwise regression analysis was applied. Results: The respondents were generally satisfied, but internal differences were observed. The exhibited highest satisfaction with the dentists’ performance, followed by the dental assistants’ services, and the lowest satisfaction with the center's physical appearance and accessibility. Females, participants with less than a bachelor's degree, and younger individuals were more satisfied with the clinical and nonclinical dental services. The stepwise regression analysis revealed that the coefficient of determination (R2) was 40.4%. The patient satisfaction with the performance of the dentists explained 42.6% of the overall satisfaction, whereas their satisfaction with the clinical setting explained 31.5% of the overall satisfaction. Conclusion: Additional improvements with regard to the accessibility and physical appearance of the dental centers are needed. In addition, interventions regarding accessibility, particularly when booking an appointment, are required. PMID:27403045
Admixture Analysis of Spontaneous Hepatitis C Virus Clearance in Individuals of African-Descent
Wojcik, Genevieve L.; Thio, Chloe L.; Kao, WH Linda; Latanich, Rachel; Goedert, James J.; Mehta, Shruti H.; Kirk, Gregory D.; Peters, Marion G.; Cox, Andrea L.; Kim, Arthur Y.; Chung, Raymond T.; Thomas, David L.; Duggal, Priya
2015-01-01
Hepatitis C virus (HCV) infects an estimated 3% of the global population with the majority of individuals (75–85%) failing to clear the virus without treatment, leading to chronic liver disease. Individuals of African-descent have lower rates of clearance compared to individuals of European-descent and this is not fully explained by social and environmental factors. This suggests that differences in genetic background may contribute to this difference in clinical outcome following HCV infection. Using 473 individuals and 792,721 SNPs from a genome-wide association study (GWAS), we estimated local African ancestry across the genome. Using admixture mapping and logistic regression we identified two regions of interest associated with spontaneous clearance of HCV (15q24, 20p12). A genome-wide significant variant was identified on chromosome 15 at the imputed SNP, rs55817928 (P=6.18×10−8) between the genes SCAPER and RCN. Each additional copy of the African ancestral C allele is associated with 2.4 times the odds of spontaneous clearance. Conditional analysis using this SNP in the logistic regression model explained one-third of the local ancestry association. Additionally, signals of selection in this area suggest positive selection due to some ancestral pathogen or environmental pressure in African, but not in European populations. PMID:24622687
Evans, A H; Lawrence, A D; Potts, J; MacGregor, L; Katzenschlager, R; Shaw, K; Zijlmans, J; Lees, A J
2006-03-01
An inverse relation exists between smoking and coffee intake and Parkinson's disease (PD). The present study explored whether this is explained by low sensation seeking, a personality trait believed to characterise PD. A total of 106 non-demented patients with PD and 106 age and sex matched healthy controls completed a short version of Zuckerman's Sensation Seeking Scale (SSS), the Geriatric Depression Scale, and the Trait Anxiety Inventory. Data were collected on past and current cigarette smoking, and participants also completed food frequency questionnaires to estimate current caffeine and alcohol intake. Patients with PD had lower sensation seeking and higher depression and anxiety scores. They were also less likely to have ever smoked, and had lower caffeine and alcohol intakes. Analysis of the data using conditional logistic regression suggested that the inverse association of PD risk with sensation seeking was independent of smoking, and caffeine and alcohol intake. Moreover, low sensation seeking explained some of the apparent effect of caffeine and alcohol intake on PD. However, the effect of smoking was weakened only slightly when SSS was included in the regression model. This study raises the possibility that there is a neurobiological link between low sensation seeking traits--which might underlie the parkinsonian personality--and the hypothetical protective effect of cigarette smoking and caffeine consumption on PD.
Meijer, Kim A; Muhlert, Nils; Cercignani, Mara; Sethi, Varun; Ron, Maria A; Thompson, Alan J; Miller, David H; Chard, Declan; Geurts, Jeroen Jg; Ciccarelli, Olga
2016-10-01
While our knowledge of white matter (WM) pathology underlying cognitive impairment in relapsing remitting multiple sclerosis (MS) is increasing, equivalent understanding in those with secondary progressive (SP) MS lags behind. The aim of this study is to examine whether the extent and severity of WM tract damage differ between cognitively impaired (CI) and cognitively preserved (CP) secondary progressive multiple sclerosis (SPMS) patients. Conventional magnetic resonance imaging (MRI) and diffusion MRI were acquired from 30 SPMS patients and 32 healthy controls (HC). Cognitive domains commonly affected in MS patients were assessed. Linear regression was used to predict cognition. Diffusion measures were compared between groups using tract-based spatial statistics (TBSS). A total of 12 patients were classified as CI, and processing speed was the most commonly affected domain. The final regression model including demographic variables and radial diffusivity explained the greatest variance of cognitive performance (R 2 = 0.48, p = 0.002). SPMS patients showed widespread loss of WM integrity throughout the WM skeleton when compared with HC. When compared with CP patients, CI patients showed more extensive and severe damage of several WM tracts, including the fornix, superior longitudinal fasciculus and forceps major. Loss of WM integrity assessed using TBSS helps to explain cognitive decline in SPMS patients. © The Author(s), 2016.
The effect of multiple stressors on salt marsh end-of-season biomass
Visser, J.M.; Sasser, C.E.; Cade, B.S.
2006-01-01
It is becoming more apparent that commonly used statistical methods (e.g., analysis of variance and regression) are not the best methods for estimating limiting relationships or stressor effects. A major challenge of estimating the effects associated with a measured subset of limiting factors is to account for the effects of unmeasured factors in an ecologically realistic matter. We used quantile regression to elucidate multiple stressor effects on end-of-season biomass data from two salt marsh sites in coastal Louisiana collected for 18 yr. Stressor effects evaluated based on available data were flooding, salinity, air temperature, cloud cover, precipitation deficit, grazing by muskrat, and surface water nitrogen and phosphorus. Precipitation deficit combined with surface water nitrogen provided the best two-parameter model to explain variation in the peak biomass with different slopes and intercepts for the two study sites. Precipitation deficit, cloud cover, and temperature were significantly correlated with each other. Surface water nitrogen was significantly correlated with surface water phosphorus and muskrat density. The site with the larger duration of flooding showed reduced peak biomass, when cloud cover and surface water nitrogen were optimal. Variation in the relatively low salinity occurring in our study area did not explain any of the variation in Spartina alterniflora biomass. ?? 2006 Estuarine Research Federation.
The effect of multiple stressors on salt marsh end-of-season biomass
Visser, J.M.; Sasser, C.E.; Cade, B.S.
2006-01-01
It is becoming more apparent that commonly used statistical methods (e.g. analysis of variance and regression) are not the best methods for estimating limiting relationships or stressor effects. A major challenge of estimating the effects associated with a measured subset of limiting factors is to account for the effects of unmeasured factors in an ecologically realistic matter. We used quantile regression to elucidate multiple stressor effects on end-of-season biomass data from two salt marsh sites in coastal Louisiana collected for 18 yr. Stressor effects evaluated based on available data were flooding, salinity air temperature, cloud cover, precipitation deficit, grazing by muskrat, and surface water nitrogen and phosphorus. Precipitation deficit combined with surface water nitrogen provided the best two-parameter model to explain variation in the peak biomass with different slopes and intercepts for the two study sites. Precipitation deficit, cloud cover, and temperature were significantly correlated with each other. Surface water nitrogen was significantly correlated with surface water phosphorus and muskrat density. The site with the larger duration of flooding showed reduced peak biomass, when cloud cover and surface water nitrogen were optimal. Variation in the relatively low salinity occurring in our study area did not explain any of the variation in Spartina alterniflora biomass.
Davies-Venn, Evelyn; Nelson, Peggy; Souza, Pamela
2015-01-01
Some listeners with hearing loss show poor speech recognition scores in spite of using amplification that optimizes audibility. Beyond audibility, studies have suggested that suprathreshold abilities such as spectral and temporal processing may explain differences in amplified speech recognition scores. A variety of different methods has been used to measure spectral processing. However, the relationship between spectral processing and speech recognition is still inconclusive. This study evaluated the relationship between spectral processing and speech recognition in listeners with normal hearing and with hearing loss. Narrowband spectral resolution was assessed using auditory filter bandwidths estimated from simultaneous notched-noise masking. Broadband spectral processing was measured using the spectral ripple discrimination (SRD) task and the spectral ripple depth detection (SMD) task. Three different measures were used to assess unamplified and amplified speech recognition in quiet and noise. Stepwise multiple linear regression revealed that SMD at 2.0 cycles per octave (cpo) significantly predicted speech scores for amplified and unamplified speech in quiet and noise. Commonality analyses revealed that SMD at 2.0 cpo combined with SRD and equivalent rectangular bandwidth measures to explain most of the variance captured by the regression model. Results suggest that SMD and SRD may be promising clinical tools for diagnostic evaluation and predicting amplification outcomes. PMID:26233047
Davies-Venn, Evelyn; Nelson, Peggy; Souza, Pamela
2015-07-01
Some listeners with hearing loss show poor speech recognition scores in spite of using amplification that optimizes audibility. Beyond audibility, studies have suggested that suprathreshold abilities such as spectral and temporal processing may explain differences in amplified speech recognition scores. A variety of different methods has been used to measure spectral processing. However, the relationship between spectral processing and speech recognition is still inconclusive. This study evaluated the relationship between spectral processing and speech recognition in listeners with normal hearing and with hearing loss. Narrowband spectral resolution was assessed using auditory filter bandwidths estimated from simultaneous notched-noise masking. Broadband spectral processing was measured using the spectral ripple discrimination (SRD) task and the spectral ripple depth detection (SMD) task. Three different measures were used to assess unamplified and amplified speech recognition in quiet and noise. Stepwise multiple linear regression revealed that SMD at 2.0 cycles per octave (cpo) significantly predicted speech scores for amplified and unamplified speech in quiet and noise. Commonality analyses revealed that SMD at 2.0 cpo combined with SRD and equivalent rectangular bandwidth measures to explain most of the variance captured by the regression model. Results suggest that SMD and SRD may be promising clinical tools for diagnostic evaluation and predicting amplification outcomes.
Veni, T; Boyas, S; Beaune, B; Bourgeois, H; Rahmani, A; Landry, S; Bochereau, A; Durand, S; Morel, B
2018-06-24
As a subjective symptom, cancer-related fatigue is assessed via patient-reported outcomes. Due to the inherent bias of such evaluation, screening and treatment for cancer-related fatigue remains suboptimal. The purpose is to evaluate whether objective cancer patients' hand muscle mechanical parameters (maximal force, critical force, force variability) extracted from a fatiguing handgrip exercise may be correlated to the different dimensions (physical, emotional, and cognitive) of cancer-related fatigue. Fourteen women with advanced breast cancer, still under or having previously received chemotherapy within the preceding 3 months, and 11 healthy women participated to the present study. Cancer-related fatigue was first assessed through the EORTC QLQ-30 and its fatigue module. Fatigability was then measured during 60 maximal repeated handgrip contractions. The maximum force, critical force (asymptote of the force-time evolution), and force variability (root mean square of the successive differences) were extracted. Multiple regression models were performed to investigate the influence of the force parameters on cancer-related fatigue's dimensions. The multiple linear regression analysis evidenced that physical fatigue was best explained by maximum force and critical force (r = 0.81; p = 0.029). The emotional fatigue was best explained by maximum force, critical force, and force variability (r = 0.83; p = 0.008). The cognitive fatigue was best explained by critical force and force variability (r = 0.62; p = 0.035). The handgrip maximal force, critical force, and force variability may offer objective measures of the different dimensions of cancer-related fatigue and could provide a complementary approach to the patient reported outcomes.
Fountoulakis, Konstantinos N; Savopoulos, Christos; Zannis, Prodromos; Apostolopoulou, Martha; Fountoukidis, Ilias; Kakaletsis, Nikolaos; Kanellos, Ilias; Dimellis, Dimos; Hyphantis, Thomas; Tsikerdekis, Athanasios; Pompili, Maurizio; Hatzitolios, Apostolos I
2016-03-15
Recently there was a debate concerning the etiology behind attempts and completed suicides. The aim of the current study was to search for possible correlations between the rates of attempted and completed suicide and climate variables and regional unemployment per year in the county of Thessaloniki, Macedonia, northern Greece, for the years 2000-12. The regional rates of suicide and attempted suicide as well as regional unemployment were available from previous publications of the authors. The climate variables were calculated from the daily E-OBS gridded dataset which is based on observational data Only the male suicide rates correlate significantly with high mean annual temperature but not with unemployment. The multiple linear regression analysis results suggest that temperature is the only variable that determines male suicides and explains 51% of their variance. Unemployment fails to contribute significantly to the model. There seems to be a seasonal distribution for attempts with mean rates being higher for the period from May to October and the rates clearly correlate with temperature. The highest mean rates were observed during May and August and the lowest during December and February. Multiple linear regression analysis suggests that temperature also determines the female attempts rate although the explained variable is significant but very low (3-5%) Climate variables and specifically high temperature correlate both with suicide and attempted suicide rates but with a different way between males and females. The climate effect was stronger than the effect of unemployment. Copyright © 2016 Elsevier B.V. All rights reserved.
Hojman, D E
1996-03-01
This analysis involves empirically testing a theoretical model among 22 Central American and Caribbean countries during the 1990s that explains differences in infant and child mortality. Explanatory measures capture demographic, economic, health care, and educational characteristics. The model is expected to allow for an assessment of the potential impact of structural adjustment and external debt. It is pointed out that birth rates and child mortality rates followed similar patterns over time and between countries. In this study's regression analyses all variables in the three models that explain infant mortality are exogenous: low birth weight, immunization, gross domestic product per capita, years of schooling for women, population/nurse, and debt as a proportion of gross national product. As nations became richer, infant mortality declined. Infant mortality was lower in countries with high external debt. In models for explaining the birth rate and the child mortality rate, the best fit included variables for debt, real public expenditure on health care, water supply, and malnutrition. Analysis in a simultaneous model for 10 countries revealed that the birth rate and the child mortality rate were more responsive to shocks in exogenous variables in Barbados than in the Dominican Republic, and more responsive in the Dominican Republic than in Guatemala. The impact of each exogenous variable varied by country. In Barbados education was four times more effective in explaining the birth rate than water. In Guatemala, the most effective exogenous variable was malnutrition. Child mortality rates were affected more by multiplier effects. In richer countries, the most important impact on child survival was improved access to safe water, and the most important impact on the birth rate was increased real public expenditure on education per capita. For the poorest countries, findings suggest first improvement in malnutrition and then improvement in safe water supplies. Structural adjustment variables were found to have small impacts on the birth rate or limited impacts on child survival in poorer countries.
[Analyzing consumer preference by using the latest semantic model for verbal protocol].
Tamari, Yuki; Takemura, Kazuhisa
2012-02-01
This paper examines consumers' preferences for competing brands by using a preference model of verbal protocols. Participants were 150 university students, who reported their opinions and feelings about McDonalds and Mos Burger (competing hamburger restaurants in Japan). Their verbal protocols were analyzed by using the singular value decomposition method, and the latent decision frames were estimated. The verbal protocols having a large value in the decision frames could be interpreted as showing attributes that consumers emphasize. Based on the estimated decision frames, we predicted consumers' preferences using the logistic regression analysis method. The results indicate that the decision frames projected from the verbal protocol data explained consumers' preferences effectively.
Techniques for estimating flood-peak discharges of rural, unregulated streams in Ohio
Koltun, G.F.; Roberts, J.W.
1990-01-01
Multiple-regression equations are presented for estimating flood-peak discharges having recurrence intervals of 2, 5, 10, 25, 50, and 100 years at ungaged sites on rural, unregulated streams in Ohio. The average standard errors of prediction for the equations range from 33.4% to 41.4%. Peak discharge estimates determined by log-Pearson Type III analysis using data collected through the 1987 water year are reported for 275 streamflow-gaging stations. Ordinary least-squares multiple-regression techniques were used to divide the State into three regions and to identify a set of basin characteristics that help explain station-to- station variation in the log-Pearson estimates. Contributing drainage area, main-channel slope, and storage area were identified as suitable explanatory variables. Generalized least-square procedures, which include historical flow data and account for differences in the variance of flows at different gaging stations, spatial correlation among gaging station records, and variable lengths of station record were used to estimate the regression parameters. Weighted peak-discharge estimates computed as a function of the log-Pearson Type III and regression estimates are reported for each station. A method is provided to adjust regression estimates for ungaged sites by use of weighted and regression estimates for a gaged site located on the same stream. Limitations and shortcomings cited in an earlier report on the magnitude and frequency of floods in Ohio are addressed in this study. Geographic bias is no longer evident for the Maumee River basin of northwestern Ohio. No bias is found to be associated with the forested-area characteristic for the range used in the regression analysis (0.0 to 99.0%), nor is this characteristic significant in explaining peak discharges. Surface-mined area likewise is not significant in explaining peak discharges, and the regression equations are not biased when applied to basins having approximately 30% or less surface-mined area. Analyses of residuals indicate that the equations tend to overestimate flood-peak discharges for basins having approximately 30% or more surface-mined area. (USGS)
Wen, L; Bowen, C R; Hartman, G L
2017-10-01
Dispersal of urediniospores by wind is the primary means of spread for Phakopsora pachyrhizi, the cause of soybean rust. Our research focused on the short-distance movement of urediniospores from within the soybean canopy and up to 61 m from field-grown rust-infected soybean plants. Environmental variables were used to develop and compare models including the least absolute shrinkage and selection operator regression, zero-inflated Poisson/regular Poisson regression, random forest, and neural network to describe deposition of urediniospores collected in passive and active traps. All four models identified distance of trap from source, humidity, temperature, wind direction, and wind speed as the five most important variables influencing short-distance movement of urediniospores. The random forest model provided the best predictions, explaining 76.1 and 86.8% of the total variation in the passive- and active-trap datasets, respectively. The prediction accuracy based on the correlation coefficient (r) between predicted values and the true values were 0.83 (P < 0.0001) and 0.94 (P < 0.0001) for the passive and active trap datasets, respectively. Overall, multiple machine learning techniques identified the most important variables to make the most accurate predictions of movement of P. pachyrhizi urediniospores short-distance.
[Using fractional polynomials to estimate the safety threshold of fluoride in drinking water].
Pan, Shenling; An, Wei; Li, Hongyan; Yang, Min
2014-01-01
To study the dose-response relationship between fluoride content in drinking water and prevalence of dental fluorosis on the national scale, then to determine the safety threshold of fluoride in drinking water. Meta-regression analysis was applied to the 2001-2002 national endemic fluorosis survey data of key wards. First, fractional polynomial (FP) was adopted to establish fixed effect model, determining the best FP structure, after that restricted maximum likelihood (REML) was adopted to estimate between-study variance, then the best random effect model was established. The best FP structure was first-order logarithmic transformation. Based on the best random effect model, the benchmark dose (BMD) of fluoride in drinking water and its lower limit (BMDL) was calculated as 0.98 mg/L and 0.78 mg/L. Fluoride in drinking water can only explain 35.8% of the variability of the prevalence, among other influencing factors, ward type was a significant factor, while temperature condition and altitude were not. Fractional polynomial-based meta-regression method is simple, practical and can provide good fitting effect, based on it, the safety threshold of fluoride in drinking water of our country is determined as 0.8 mg/L.
Henry, Teague; Campbell, Ashley
2015-01-01
Objective. To examine factors that determine the interindividual variability of learning within a team-based learning environment. Methods. Students in a pharmacokinetics course were given 4 interim, low-stakes cumulative assessments throughout the semester and a cumulative final examination. Students’ Myers-Briggs personality type was assessed, as well as their study skills, motivations, and attitudes towards team-learning. A latent curve model (LCM) was applied and various covariates were assessed to improve the regression model. Results. A quadratic LCM was applied for the first 4 assessments to predict final examination performance. None of the covariates examined significantly impacted the regression model fit except metacognitive self-regulation, which explained some of the variability in the rate of learning. There were some correlations between personality type and attitudes towards team learning, with introverts having a lower opinion of team-learning than extroverts. Conclusion. The LCM could readily describe the learning curve. Extroverted and introverted personality types had the same learning performance even though preference for team-learning was lower in introverts. Other personality traits, study skills, or practice did not significantly contribute to the learning variability in this course. PMID:25861101
Persky, Adam M; Henry, Teague; Campbell, Ashley
2015-03-25
To examine factors that determine the interindividual variability of learning within a team-based learning environment. Students in a pharmacokinetics course were given 4 interim, low-stakes cumulative assessments throughout the semester and a cumulative final examination. Students' Myers-Briggs personality type was assessed, as well as their study skills, motivations, and attitudes towards team-learning. A latent curve model (LCM) was applied and various covariates were assessed to improve the regression model. A quadratic LCM was applied for the first 4 assessments to predict final examination performance. None of the covariates examined significantly impacted the regression model fit except metacognitive self-regulation, which explained some of the variability in the rate of learning. There were some correlations between personality type and attitudes towards team learning, with introverts having a lower opinion of team-learning than extroverts. The LCM could readily describe the learning curve. Extroverted and introverted personality types had the same learning performance even though preference for team-learning was lower in introverts. Other personality traits, study skills, or practice did not significantly contribute to the learning variability in this course.
Hanmer, Janel; Cherepanov, Dasha
2016-09-01
To evaluate a general question about ability to meet monthly bills as an alternative to direct questions about income and assets in health utility studies. We used data from the National Health Measurement Study-a US nationally representative telephone survey collected in 2005-2006. It included health utility measures (EuroQol-5D-3L, Health Utilities Index Mark 3, Short Form-6D, and Quality of Well-being Index) and household income, assets, and financial ability to meet monthly bills questions. Each utility score was regressed on: income and assets (Model 1); difficulty paying bills (DPB) (Model 2); income, assets, and DPB (Model 3). All models used survey weights and adjusted for demographics and education. Among 3666 respondents, as income and assets increased, DPB decreased. The DPB question had fewer missing values (n = 30) than income (n = 311) or assets (n = 373). Model 2 (DPB only) explained more variance in health utility than Model 1 (income and assets only). Including all measures (Model 3) had very modest improvement in R (2), e.g., values were 0.112 (Model 1), 0.166 (Model 2), and 0.175 (Model 3) for EuroQol-5D-3L. The single question on DPB yields more information and has less missing values than the traditionally used income and assets questions.
NASA Astrophysics Data System (ADS)
Rao, M.; Vuong, H.
2013-12-01
The overall objective of this study is to develop a method for estimating total aboveground biomass of redwood stands in Jackson Demonstration State Forest, Mendocino, California using airborne LiDAR data. LiDAR data owing to its vertical and horizontal accuracy are increasingly being used to characterize landscape features including ground surface elevation and canopy height. These LiDAR-derived metrics involving structural signatures at higher precision and accuracy can help better understand ecological processes at various spatial scales. Our study is focused on two major species of the forest: redwood (Sequoia semperirens [D.Don] Engl.) and Douglas-fir (Pseudotsuga mensiezii [Mirb.] Franco). Specifically, the objectives included linear regression models fitting tree diameter at breast height (dbh) to LiDAR derived height for each species. From 23 random points on the study area, field measurement (dbh and tree coordinate) were collected for more than 500 trees of Redwood and Douglas-fir over 0.2 ha- plots. The USFS-FUSION application software along with its LiDAR Data Viewer (LDV) were used to to extract Canopy Height Model (CHM) from which tree heights would be derived. Based on the LiDAR derived height and ground based dbh, a linear regression model was developed to predict dbh. The predicted dbh was used to estimate the biomass at the single tree level using Jenkin's formula (Jenkin et al 2003). The linear regression models were able to explain 65% of the variability associated with Redwood's dbh and 80% of that associated with Douglas-fir's dbh.