The Geometry of Enhancement in Multiple Regression
ERIC Educational Resources Information Center
Waller, Niels G.
2011-01-01
In linear multiple regression, "enhancement" is said to occur when R[superscript 2] = b[prime]r greater than r[prime]r, where b is a p x 1 vector of standardized regression coefficients and r is a p x 1 vector of correlations between a criterion y and a set of standardized regressors, x. When p = 1 then b [is congruent to] r and…
Using Robust Standard Errors to Combine Multiple Regression Estimates with Meta-Analysis
ERIC Educational Resources Information Center
Williams, Ryan T.
2012-01-01
Combining multiple regression estimates with meta-analysis has continued to be a difficult task. A variety of methods have been proposed and used to combine multiple regression slope estimates with meta-analysis, however, most of these methods have serious methodological and practical limitations. The purpose of this study was to explore the use…
ERIC Educational Resources Information Center
Hafner, Lawrence E.
A study developed a multiple regression prediction equation for each of six selected achievement variables in a popular standardized test of achievement. Subjects, 42 fourth-grade pupils randomly selected across several classes in a large elementary school in a north Florida city, were administered several standardized tests to determine predictor…
Population heterogeneity in the salience of multiple risk factors for adolescent delinquency.
Lanza, Stephanie T; Cooper, Brittany R; Bray, Bethany C
2014-03-01
To present mixture regression analysis as an alternative to more standard regression analysis for predicting adolescent delinquency. We demonstrate how mixture regression analysis allows for the identification of population subgroups defined by the salience of multiple risk factors. We identified population subgroups (i.e., latent classes) of individuals based on their coefficients in a regression model predicting adolescent delinquency from eight previously established risk indices drawn from the community, school, family, peer, and individual levels. The study included N = 37,763 10th-grade adolescents who participated in the Communities That Care Youth Survey. Standard, zero-inflated, and mixture Poisson and negative binomial regression models were considered. Standard and mixture negative binomial regression models were selected as optimal. The five-class regression model was interpreted based on the class-specific regression coefficients, indicating that risk factors had varying salience across classes of adolescents. Standard regression showed that all risk factors were significantly associated with delinquency. Mixture regression provided more nuanced information, suggesting a unique set of risk factors that were salient for different subgroups of adolescents. Implications for the design of subgroup-specific interventions are discussed. Copyright © 2014 Society for Adolescent Health and Medicine. Published by Elsevier Inc. All rights reserved.
A Technique of Fuzzy C-Mean in Multiple Linear Regression Model toward Paddy Yield
NASA Astrophysics Data System (ADS)
Syazwan Wahab, Nur; Saifullah Rusiman, Mohd; Mohamad, Mahathir; Amira Azmi, Nur; Che Him, Norziha; Ghazali Kamardan, M.; Ali, Maselan
2018-04-01
In this paper, we propose a hybrid model which is a combination of multiple linear regression model and fuzzy c-means method. This research involved a relationship between 20 variates of the top soil that are analyzed prior to planting of paddy yields at standard fertilizer rates. Data used were from the multi-location trials for rice carried out by MARDI at major paddy granary in Peninsular Malaysia during the period from 2009 to 2012. Missing observations were estimated using mean estimation techniques. The data were analyzed using multiple linear regression model and a combination of multiple linear regression model and fuzzy c-means method. Analysis of normality and multicollinearity indicate that the data is normally scattered without multicollinearity among independent variables. Analysis of fuzzy c-means cluster the yield of paddy into two clusters before the multiple linear regression model can be used. The comparison between two method indicate that the hybrid of multiple linear regression model and fuzzy c-means method outperform the multiple linear regression model with lower value of mean square error.
Tracking the Gender Pay Gap: A Case Study
ERIC Educational Resources Information Center
Travis, Cheryl B.; Gross, Louis J.; Johnson, Bruce A.
2009-01-01
This article provides a short introduction to standard considerations in the formal study of wages and illustrates the use of multiple regression and resampling simulation approaches in a case study of faculty salaries at one university. Multiple regression is especially beneficial where it provides information on strength of association, specific…
Rahman, Md. Jahanur; Shamim, Abu Ahmed; Klemm, Rolf D. W.; Labrique, Alain B.; Rashid, Mahbubur; Christian, Parul; West, Keith P.
2017-01-01
Birth weight, length and circumferences of the head, chest and arm are key measures of newborn size and health in developing countries. We assessed maternal socio-demographic factors associated with multiple measures of newborn size in a large rural population in Bangladesh using partial least squares (PLS) regression method. PLS regression, combining features from principal component analysis and multiple linear regression, is a multivariate technique with an ability to handle multicollinearity while simultaneously handling multiple dependent variables. We analyzed maternal and infant data from singletons (n = 14,506) born during a double-masked, cluster-randomized, placebo-controlled maternal vitamin A or β-carotene supplementation trial in rural northwest Bangladesh. PLS regression results identified numerous maternal factors (parity, age, early pregnancy MUAC, living standard index, years of education, number of antenatal care visits, preterm delivery and infant sex) significantly (p<0.001) associated with newborn size. Among them, preterm delivery had the largest negative influence on newborn size (Standardized β = -0.29 − -0.19; p<0.001). Scatter plots of the scores of first two PLS components also revealed an interaction between newborn sex and preterm delivery on birth size. PLS regression was found to be more parsimonious than both ordinary least squares regression and principal component regression. It also provided more stable estimates than the ordinary least squares regression and provided the effect measure of the covariates with greater accuracy as it accounts for the correlation among the covariates and outcomes. Therefore, PLS regression is recommended when either there are multiple outcome measurements in the same study, or the covariates are correlated, or both situations exist in a dataset. PMID:29261760
Kabir, Alamgir; Rahman, Md Jahanur; Shamim, Abu Ahmed; Klemm, Rolf D W; Labrique, Alain B; Rashid, Mahbubur; Christian, Parul; West, Keith P
2017-01-01
Birth weight, length and circumferences of the head, chest and arm are key measures of newborn size and health in developing countries. We assessed maternal socio-demographic factors associated with multiple measures of newborn size in a large rural population in Bangladesh using partial least squares (PLS) regression method. PLS regression, combining features from principal component analysis and multiple linear regression, is a multivariate technique with an ability to handle multicollinearity while simultaneously handling multiple dependent variables. We analyzed maternal and infant data from singletons (n = 14,506) born during a double-masked, cluster-randomized, placebo-controlled maternal vitamin A or β-carotene supplementation trial in rural northwest Bangladesh. PLS regression results identified numerous maternal factors (parity, age, early pregnancy MUAC, living standard index, years of education, number of antenatal care visits, preterm delivery and infant sex) significantly (p<0.001) associated with newborn size. Among them, preterm delivery had the largest negative influence on newborn size (Standardized β = -0.29 - -0.19; p<0.001). Scatter plots of the scores of first two PLS components also revealed an interaction between newborn sex and preterm delivery on birth size. PLS regression was found to be more parsimonious than both ordinary least squares regression and principal component regression. It also provided more stable estimates than the ordinary least squares regression and provided the effect measure of the covariates with greater accuracy as it accounts for the correlation among the covariates and outcomes. Therefore, PLS regression is recommended when either there are multiple outcome measurements in the same study, or the covariates are correlated, or both situations exist in a dataset.
Krishan, Kewal; Kanchan, Tanuj; Sharma, Abhilasha
2012-05-01
Estimation of stature is an important parameter in identification of human remains in forensic examinations. The present study is aimed to compare the reliability and accuracy of stature estimation and to demonstrate the variability in estimated stature and actual stature using multiplication factor and regression analysis methods. The study is based on a sample of 246 subjects (123 males and 123 females) from North India aged between 17 and 20 years. Four anthropometric measurements; hand length, hand breadth, foot length and foot breadth taken on the left side in each subject were included in the study. Stature was measured using standard anthropometric techniques. Multiplication factors were calculated and linear regression models were derived for estimation of stature from hand and foot dimensions. Derived multiplication factors and regression formula were applied to the hand and foot measurements in the study sample. The estimated stature from the multiplication factors and regression analysis was compared with the actual stature to find the error in estimated stature. The results indicate that the range of error in estimation of stature from regression analysis method is less than that of multiplication factor method thus, confirming that the regression analysis method is better than multiplication factor analysis in stature estimation. Copyright © 2012 Elsevier Ltd and Faculty of Forensic and Legal Medicine. All rights reserved.
Suppression Situations in Multiple Linear Regression
ERIC Educational Resources Information Center
Shieh, Gwowen
2006-01-01
This article proposes alternative expressions for the two most prevailing definitions of suppression without resorting to the standardized regression modeling. The formulation provides a simple basis for the examination of their relationship. For the two-predictor regression, the author demonstrates that the previous results in the literature are…
Standardized Regression Coefficients as Indices of Effect Sizes in Meta-Analysis
ERIC Educational Resources Information Center
Kim, Rae Seon
2011-01-01
When conducting a meta-analysis, it is common to find many collected studies that report regression analyses, because multiple regression analysis is widely used in many fields. Meta-analysis uses effect sizes drawn from individual studies as a means of synthesizing a collection of results. However, indices of effect size from regression analyses…
Lorenzo-Seva, Urbano; Ferrando, Pere J
2011-03-01
We provide an SPSS program that implements currently recommended techniques and recent developments for selecting variables in multiple linear regression analysis via the relative importance of predictors. The approach consists of: (1) optimally splitting the data for cross-validation, (2) selecting the final set of predictors to be retained in the equation regression, and (3) assessing the behavior of the chosen model using standard indices and procedures. The SPSS syntax, a short manual, and data files related to this article are available as supplemental materials from brm.psychonomic-journals.org/content/supplemental.
Akkus, Zeki; Camdeviren, Handan; Celik, Fatma; Gur, Ali; Nas, Kemal
2005-09-01
To determine the risk factors of osteoporosis using a multiple binary logistic regression method and to assess the risk variables for osteoporosis, which is a major and growing health problem in many countries. We presented a case-control study, consisting of 126 postmenopausal healthy women as control group and 225 postmenopausal osteoporotic women as the case group. The study was carried out in the Department of Physical Medicine and Rehabilitation, Dicle University, Diyarbakir, Turkey between 1999-2002. The data from the 351 participants were collected using a standard questionnaire that contains 43 variables. A multiple logistic regression model was then used to evaluate the data and to find the best regression model. We classified 80.1% (281/351) of the participants using the regression model. Furthermore, the specificity value of the model was 67% (84/126) of the control group while the sensitivity value was 88% (197/225) of the case group. We found the distribution of residual values standardized for final model to be exponential using the Kolmogorow-Smirnow test (p=0.193). The receiver operating characteristic curve was found successful to predict patients with risk for osteoporosis. This study suggests that low levels of dietary calcium intake, physical activity, education, and longer duration of menopause are independent predictors of the risk of low bone density in our population. Adequate dietary calcium intake in combination with maintaining a daily physical activity, increasing educational level, decreasing birth rate, and duration of breast-feeding may contribute to healthy bones and play a role in practical prevention of osteoporosis in Southeast Anatolia. In addition, the findings of the present study indicate that the use of multivariate statistical method as a multiple logistic regression in osteoporosis, which maybe influenced by many variables, is better than univariate statistical evaluation.
A Constrained Linear Estimator for Multiple Regression
ERIC Educational Resources Information Center
Davis-Stober, Clintin P.; Dana, Jason; Budescu, David V.
2010-01-01
"Improper linear models" (see Dawes, Am. Psychol. 34:571-582, "1979"), such as equal weighting, have garnered interest as alternatives to standard regression models. We analyze the general circumstances under which these models perform well by recasting a class of "improper" linear models as "proper" statistical models with a single predictor. We…
Multiple regression for physiological data analysis: the problem of multicollinearity.
Slinker, B K; Glantz, S A
1985-07-01
Multiple linear regression, in which several predictor variables are related to a response variable, is a powerful statistical tool for gaining quantitative insight into complex in vivo physiological systems. For these insights to be correct, all predictor variables must be uncorrelated. However, in many physiological experiments the predictor variables cannot be precisely controlled and thus change in parallel (i.e., they are highly correlated). There is a redundancy of information about the response, a situation called multicollinearity, that leads to numerical problems in estimating the parameters in regression equations; the parameters are often of incorrect magnitude or sign or have large standard errors. Although multicollinearity can be avoided with good experimental design, not all interesting physiological questions can be studied without encountering multicollinearity. In these cases various ad hoc procedures have been proposed to mitigate multicollinearity. Although many of these procedures are controversial, they can be helpful in applying multiple linear regression to some physiological problems.
Chen, Ying-Jen; Ho, Meng-Yang; Chen, Kwan-Ju; Hsu, Chia-Fen; Ryu, Shan-Jin
2009-08-01
The aims of the present study were to (i) investigate if traditional Chinese word reading ability can be used for estimating premorbid general intelligence; and (ii) to provide multiple regression equations for estimating premorbid performance on Raven's Standard Progressive Matrices (RSPM), using age, years of education and Chinese Graded Word Reading Test (CGWRT) scores as predictor variables. Four hundred and twenty-six healthy volunteers (201 male, 225 female), aged 16-93 years (mean +/- SD, 41.92 +/- 18.19 years) undertook the tests individually under supervised conditions. Seventy percent of subjects were randomly allocated to the derivation group (n = 296), and the rest to the validation group (n = 130). RSPM score was positively correlated with CGWRT score and years of education. RSPM and CGWRT scores and years of education were also inversely correlated with age, but the declining trend for RSPM performance against age was steeper than that for CGWRT performance. Separate multiple regression equations were derived for estimating RSPM scores using different combinations of age, years of education, and CGWRT score for both groups. The multiple regression coefficient of each equation ranged from 0.71 to 0.80 with the standard error of estimate between 7 and 8 RSPM points. When fitting the data of one group to the equations derived from its counterpart group, the cross-validation multiple regression coefficients ranged from 0.71 to 0.79. There were no significant differences in the 'predicted-obtained' RSPM discrepancies between any equations. The regression equations derived in the present study may provide a basis for estimating premorbid RSPM performance.
Parrett, Charles; Omang, R.J.; Hull, J.A.
1983-01-01
Equations for estimating mean annual runoff and peak discharge from measurements of channel geometry were developed for western and northeastern Montana. The study area was divided into two regions for the mean annual runoff analysis, and separate multiple-regression equations were developed for each region. The active-channel width was determined to be the most important independent variable in each region. The standard error of estimate for the estimating equation using active-channel width was 61 percent in the Northeast Region and 38 percent in the West region. The study area was divided into six regions for the peak discharge analysis, and multiple regression equations relating channel geometry and basin characteristics to peak discharges having recurrence intervals of 2, 5, 10, 25, 50 and 100 years were developed for each region. The standard errors of estimate for the regression equations using only channel width as an independent variable ranged from 35 to 105 percent. The standard errors improved in four regions as basin characteristics were added to the estimating equations. (USGS)
Rasmussen, Patrick P.; Gray, John R.; Glysson, G. Douglas; Ziegler, Andrew C.
2009-01-01
In-stream continuous turbidity and streamflow data, calibrated with measured suspended-sediment concentration data, can be used to compute a time series of suspended-sediment concentration and load at a stream site. Development of a simple linear (ordinary least squares) regression model for computing suspended-sediment concentrations from instantaneous turbidity data is the first step in the computation process. If the model standard percentage error (MSPE) of the simple linear regression model meets a minimum criterion, this model should be used to compute a time series of suspended-sediment concentrations. Otherwise, a multiple linear regression model using paired instantaneous turbidity and streamflow data is developed and compared to the simple regression model. If the inclusion of the streamflow variable proves to be statistically significant and the uncertainty associated with the multiple regression model results in an improvement over that for the simple linear model, the turbidity-streamflow multiple linear regression model should be used to compute a suspended-sediment concentration time series. The computed concentration time series is subsequently used with its paired streamflow time series to compute suspended-sediment loads by standard U.S. Geological Survey techniques. Once an acceptable regression model is developed, it can be used to compute suspended-sediment concentration beyond the period of record used in model development with proper ongoing collection and analysis of calibration samples. Regression models to compute suspended-sediment concentrations are generally site specific and should never be considered static, but they represent a set period in a continually dynamic system in which additional data will help verify any change in sediment load, type, and source.
RRegrs: an R package for computer-aided model selection with multiple regression models.
Tsiliki, Georgia; Munteanu, Cristian R; Seoane, Jose A; Fernandez-Lozano, Carlos; Sarimveis, Haralambos; Willighagen, Egon L
2015-01-01
Predictive regression models can be created with many different modelling approaches. Choices need to be made for data set splitting, cross-validation methods, specific regression parameters and best model criteria, as they all affect the accuracy and efficiency of the produced predictive models, and therefore, raising model reproducibility and comparison issues. Cheminformatics and bioinformatics are extensively using predictive modelling and exhibit a need for standardization of these methodologies in order to assist model selection and speed up the process of predictive model development. A tool accessible to all users, irrespectively of their statistical knowledge, would be valuable if it tests several simple and complex regression models and validation schemes, produce unified reports, and offer the option to be integrated into more extensive studies. Additionally, such methodology should be implemented as a free programming package, in order to be continuously adapted and redistributed by others. We propose an integrated framework for creating multiple regression models, called RRegrs. The tool offers the option of ten simple and complex regression methods combined with repeated 10-fold and leave-one-out cross-validation. Methods include Multiple Linear regression, Generalized Linear Model with Stepwise Feature Selection, Partial Least Squares regression, Lasso regression, and Support Vector Machines Recursive Feature Elimination. The new framework is an automated fully validated procedure which produces standardized reports to quickly oversee the impact of choices in modelling algorithms and assess the model and cross-validation results. The methodology was implemented as an open source R package, available at https://www.github.com/enanomapper/RRegrs, by reusing and extending on the caret package. The universality of the new methodology is demonstrated using five standard data sets from different scientific fields. Its efficiency in cheminformatics and QSAR modelling is shown with three use cases: proteomics data for surface-modified gold nanoparticles, nano-metal oxides descriptor data, and molecular descriptors for acute aquatic toxicity data. The results show that for all data sets RRegrs reports models with equal or better performance for both training and test sets than those reported in the original publications. Its good performance as well as its adaptability in terms of parameter optimization could make RRegrs a popular framework to assist the initial exploration of predictive models, and with that, the design of more comprehensive in silico screening applications.Graphical abstractRRegrs is a computer-aided model selection framework for R multiple regression models; this is a fully validated procedure with application to QSAR modelling.
Simple and multiple linear regression: sample size considerations.
Hanley, James A
2016-11-01
The suggested "two subjects per variable" (2SPV) rule of thumb in the Austin and Steyerberg article is a chance to bring out some long-established and quite intuitive sample size considerations for both simple and multiple linear regression. This article distinguishes two of the major uses of regression models that imply very different sample size considerations, neither served well by the 2SPV rule. The first is etiological research, which contrasts mean Y levels at differing "exposure" (X) values and thus tends to focus on a single regression coefficient, possibly adjusted for confounders. The second research genre guides clinical practice. It addresses Y levels for individuals with different covariate patterns or "profiles." It focuses on the profile-specific (mean) Y levels themselves, estimating them via linear compounds of regression coefficients and covariates. By drawing on long-established closed-form variance formulae that lie beneath the standard errors in multiple regression, and by rearranging them for heuristic purposes, one arrives at quite intuitive sample size considerations for both research genres. Copyright © 2016 Elsevier Inc. All rights reserved.
Estimating standard errors in feature network models.
Frank, Laurence E; Heiser, Willem J
2007-05-01
Feature network models are graphical structures that represent proximity data in a discrete space while using the same formalism that is the basis of least squares methods employed in multidimensional scaling. Existing methods to derive a network model from empirical data only give the best-fitting network and yield no standard errors for the parameter estimates. The additivity properties of networks make it possible to consider the model as a univariate (multiple) linear regression problem with positivity restrictions on the parameters. In the present study, both theoretical and empirical standard errors are obtained for the constrained regression parameters of a network model with known features. The performance of both types of standard error is evaluated using Monte Carlo techniques.
Parameter estimation in Cox models with missing failure indicators and the OPPERA study.
Brownstein, Naomi C; Cai, Jianwen; Slade, Gary D; Bair, Eric
2015-12-30
In a prospective cohort study, examining all participants for incidence of the condition of interest may be prohibitively expensive. For example, the "gold standard" for diagnosing temporomandibular disorder (TMD) is a physical examination by a trained clinician. In large studies, examining all participants in this manner is infeasible. Instead, it is common to use questionnaires to screen for incidence of TMD and perform the "gold standard" examination only on participants who screen positively. Unfortunately, some participants may leave the study before receiving the "gold standard" examination. Within the framework of survival analysis, this results in missing failure indicators. Motivated by the Orofacial Pain: Prospective Evaluation and Risk Assessment (OPPERA) study, a large cohort study of TMD, we propose a method for parameter estimation in survival models with missing failure indicators. We estimate the probability of being an incident case for those lacking a "gold standard" examination using logistic regression. These estimated probabilities are used to generate multiple imputations of case status for each missing examination that are combined with observed data in appropriate regression models. The variance introduced by the procedure is estimated using multiple imputation. The method can be used to estimate both regression coefficients in Cox proportional hazard models as well as incidence rates using Poisson regression. We simulate data with missing failure indicators and show that our method performs as well as or better than competing methods. Finally, we apply the proposed method to data from the OPPERA study. Copyright © 2015 John Wiley & Sons, Ltd.
Normalization Ridge Regression in Practice II: The Estimation of Multiple Feedback Linkages.
ERIC Educational Resources Information Center
Bulcock, J. W.
The use of the two-stage least squares (2 SLS) procedure for estimating nonrecursive social science models is often impractical when multiple feedback linkages are required. This is because 2 SLS is extremely sensitive to multicollinearity. The standard statistical solution to the multicollinearity problem is a biased, variance reduced procedure…
Almalki, Mohammed J; FitzGerald, Gerry; Clark, Michele
2012-09-12
Quality of work life (QWL) has been found to influence the commitment of health professionals, including nurses. However, reliable information on QWL and turnover intention of primary health care (PHC) nurses is limited. The aim of this study was to examine the relationship between QWL and turnover intention of PHC nurses in Saudi Arabia. A cross-sectional survey was used in this study. Data were collected using Brooks' survey of Quality of Nursing Work Life, the Anticipated Turnover Scale and demographic data questions. A total of 508 PHC nurses in the Jazan Region, Saudi Arabia, completed the questionnaire (RR = 87%). Descriptive statistics, t-test, ANOVA, General Linear Model (GLM) univariate analysis, standard multiple regression, and hierarchical multiple regression were applied for analysis using SPSS v17 for Windows. Findings suggested that the respondents were dissatisfied with their work life, with almost 40% indicating a turnover intention from their current PHC centres. Turnover intention was significantly related to QWL. Using standard multiple regression, 26% of the variance in turnover intention was explained by QWL, p < 0.001, with R2 = .263. Further analysis using hierarchical multiple regression found that the total variance explained by the model as a whole (demographics and QWL) was 32.1%, p < 0.001. QWL explained an additional 19% of the variance in turnover intention, after controlling for demographic variables. Creating and maintaining a healthy work life for PHC nurses is very important to improve their work satisfaction, reduce turnover, enhance productivity and improve nursing care outcomes.
2012-01-01
Background Quality of work life (QWL) has been found to influence the commitment of health professionals, including nurses. However, reliable information on QWL and turnover intention of primary health care (PHC) nurses is limited. The aim of this study was to examine the relationship between QWL and turnover intention of PHC nurses in Saudi Arabia. Methods A cross-sectional survey was used in this study. Data were collected using Brooks’ survey of Quality of Nursing Work Life, the Anticipated Turnover Scale and demographic data questions. A total of 508 PHC nurses in the Jazan Region, Saudi Arabia, completed the questionnaire (RR = 87%). Descriptive statistics, t-test, ANOVA, General Linear Model (GLM) univariate analysis, standard multiple regression, and hierarchical multiple regression were applied for analysis using SPSS v17 for Windows. Results Findings suggested that the respondents were dissatisfied with their work life, with almost 40% indicating a turnover intention from their current PHC centres. Turnover intention was significantly related to QWL. Using standard multiple regression, 26% of the variance in turnover intention was explained by QWL, p < 0.001, with R2 = .263. Further analysis using hierarchical multiple regression found that the total variance explained by the model as a whole (demographics and QWL) was 32.1%, p < 0.001. QWL explained an additional 19% of the variance in turnover intention, after controlling for demographic variables. Conclusions Creating and maintaining a healthy work life for PHC nurses is very important to improve their work satisfaction, reduce turnover, enhance productivity and improve nursing care outcomes. PMID:22970764
Multiple Imputation of a Randomly Censored Covariate Improves Logistic Regression Analysis.
Atem, Folefac D; Qian, Jing; Maye, Jacqueline E; Johnson, Keith A; Betensky, Rebecca A
2016-01-01
Randomly censored covariates arise frequently in epidemiologic studies. The most commonly used methods, including complete case and single imputation or substitution, suffer from inefficiency and bias. They make strong parametric assumptions or they consider limit of detection censoring only. We employ multiple imputation, in conjunction with semi-parametric modeling of the censored covariate, to overcome these shortcomings and to facilitate robust estimation. We develop a multiple imputation approach for randomly censored covariates within the framework of a logistic regression model. We use the non-parametric estimate of the covariate distribution or the semiparametric Cox model estimate in the presence of additional covariates in the model. We evaluate this procedure in simulations, and compare its operating characteristics to those from the complete case analysis and a survival regression approach. We apply the procedures to an Alzheimer's study of the association between amyloid positivity and maternal age of onset of dementia. Multiple imputation achieves lower standard errors and higher power than the complete case approach under heavy and moderate censoring and is comparable under light censoring. The survival regression approach achieves the highest power among all procedures, but does not produce interpretable estimates of association. Multiple imputation offers a favorable alternative to complete case analysis and ad hoc substitution methods in the presence of randomly censored covariates within the framework of logistic regression.
NASA Astrophysics Data System (ADS)
George, Anna Ray Bayless
A study was conducted to determine the relationship between the credentials held by science teachers who taught at a school that administered the Science Texas Assessment on Knowledge and Skills (Science TAKS), the state standardized exam in science, at grade 11 and student performance on a state standardized exam in science administered in grade 11. Years of teaching experience, teacher certification type(s), highest degree level held, teacher and school demographic information, and the percentage of students who met the passing standard on the Science TAKS were obtained through a public records request to the Texas Education Agency (TEA) and the State Board for Educator Certification (SBEC). Analysis was performed through the use of canonical correlation analysis and multiple linear regression analysis. The results of the multiple linear regression analysis indicate that a larger percentage of students met the passing standard on the Science TAKS state attended schools in which a large portion of the high school science teachers held post baccalaureate degrees, elementary and physical science certifications, and had 11-20 years of teaching experience.
NASA Technical Reports Server (NTRS)
Stolzer, Alan J.; Halford, Carl
2007-01-01
In a previous study, multiple regression techniques were applied to Flight Operations Quality Assurance-derived data to develop parsimonious model(s) for fuel consumption on the Boeing 757 airplane. The present study examined several data mining algorithms, including neural networks, on the fuel consumption problem and compared them to the multiple regression results obtained earlier. Using regression methods, parsimonious models were obtained that explained approximately 85% of the variation in fuel flow. In general data mining methods were more effective in predicting fuel consumption. Classification and Regression Tree methods reported correlation coefficients of .91 to .92, and General Linear Models and Multilayer Perceptron neural networks reported correlation coefficients of about .99. These data mining models show great promise for use in further examining large FOQA databases for operational and safety improvements.
ERIC Educational Resources Information Center
Si, Yajuan; Reiter, Jerome P.
2013-01-01
In many surveys, the data comprise a large number of categorical variables that suffer from item nonresponse. Standard methods for multiple imputation, like log-linear models or sequential regression imputation, can fail to capture complex dependencies and can be difficult to implement effectively in high dimensions. We present a fully Bayesian,…
Bankfull characteristics of Ohio streams and their relation to peak streamflows
Sherwood, James M.; Huitger, Carrie A.
2005-01-01
Regional curves, simple-regression equations, and multiple-regression equations were developed to estimate bankfull width, bankfull mean depth, bankfull cross-sectional area, and bankfull discharge of rural, unregulated streams in Ohio. The methods are based on geomorphic, basin, and flood-frequency data collected at 50 study sites on unregulated natural alluvial streams in Ohio, of which 40 sites are near streamflow-gaging stations. The regional curves and simple-regression equations relate the bankfull characteristics to drainage area. The multiple-regression equations relate the bankfull characteristics to drainage area, main-channel slope, main-channel elevation index, median bed-material particle size, bankfull cross-sectional area, and local-channel slope. Average standard errors of prediction for bankfull width equations range from 20.6 to 24.8 percent; for bankfull mean depth, 18.8 to 20.6 percent; for bankfull cross-sectional area, 25.4 to 30.6 percent; and for bankfull discharge, 27.0 to 78.7 percent. The simple-regression (drainage-area only) equations have the highest average standard errors of prediction. The multiple-regression equations in which the explanatory variables included drainage area, main-channel slope, main-channel elevation index, median bed-material particle size, bankfull cross-sectional area, and local-channel slope have the lowest average standard errors of prediction. Field surveys were done at each of the 50 study sites to collect the geomorphic data. Bankfull indicators were identified and evaluated, cross-section and longitudinal profiles were surveyed, and bed- and bank-material were sampled. Field data were analyzed to determine various geomorphic characteristics such as bankfull width, bankfull mean depth, bankfull cross-sectional area, bankfull discharge, streambed slope, and bed- and bank-material particle-size distribution. The various geomorphic characteristics were analyzed by means of a combination of graphical and statistical techniques. The logarithms of the annual peak discharges for the 40 gaged study sites were fit by a Pearson Type III frequency distribution to develop flood-peak discharges associated with recurrence intervals of 2, 5, 10, 25, 50, and 100 years. The peak-frequency data were related to geomorphic, basin, and climatic variables by multiple-regression analysis. Simple-regression equations were developed to estimate 2-, 5-, 10-, 25-, 50-, and 100-year flood-peak discharges of rural, unregulated streams in Ohio from bankfull channel cross-sectional area. The average standard errors of prediction are 31.6, 32.6, 35.9, 41.5, 46.2, and 51.2 percent, respectively. The study and methods developed are intended to improve understanding of the relations between geomorphic, basin, and flood characteristics of streams in Ohio and to aid in the design of hydraulic structures, such as culverts and bridges, where stability of the stream and structure is an important element of the design criteria. The study was done in cooperation with the Ohio Department of Transportation and the U.S. Department of Transportation, Federal Highway Administration.
ERIC Educational Resources Information Center
Bostic, Timothy B.
2014-01-01
The purpose of this research study was to ascertain whether there is a relationship between teachers' cognitive role taking aspect of empathy and the Virginia Standards of Learning (VSOL), English/Reading scores of their students. A correlational research design using hierarchical multiple regression was used to look for this relationship. In…
Krasikova, Dina V; Le, Huy; Bachura, Eric
2018-06-01
To address a long-standing concern regarding a gap between organizational science and practice, scholars called for more intuitive and meaningful ways of communicating research results to users of academic research. In this article, we develop a common language effect size index (CLβ) that can help translate research results to practice. We demonstrate how CLβ can be computed and used to interpret the effects of continuous and categorical predictors in multiple linear regression models. We also elaborate on how the proposed CLβ index is computed and used to interpret interactions and nonlinear effects in regression models. In addition, we test the robustness of the proposed index to violations of normality and provide means for computing standard errors and constructing confidence intervals around its estimates. (PsycINFO Database Record (c) 2018 APA, all rights reserved).
Practical Session: Simple Linear Regression
NASA Astrophysics Data System (ADS)
Clausel, M.; Grégoire, G.
2014-12-01
Two exercises are proposed to illustrate the simple linear regression. The first one is based on the famous Galton's data set on heredity. We use the lm R command and get coefficients estimates, standard error of the error, R2, residuals …In the second example, devoted to data related to the vapor tension of mercury, we fit a simple linear regression, predict values, and anticipate on multiple linear regression. This pratical session is an excerpt from practical exercises proposed by A. Dalalyan at EPNC (see Exercises 1 and 2 of http://certis.enpc.fr/~dalalyan/Download/TP_ENPC_4.pdf).
Adjusted variable plots for Cox's proportional hazards regression model.
Hall, C B; Zeger, S L; Bandeen-Roche, K J
1996-01-01
Adjusted variable plots are useful in linear regression for outlier detection and for qualitative evaluation of the fit of a model. In this paper, we extend adjusted variable plots to Cox's proportional hazards model for possibly censored survival data. We propose three different plots: a risk level adjusted variable (RLAV) plot in which each observation in each risk set appears, a subject level adjusted variable (SLAV) plot in which each subject is represented by one point, and an event level adjusted variable (ELAV) plot in which the entire risk set at each failure event is represented by a single point. The latter two plots are derived from the RLAV by combining multiple points. In each point, the regression coefficient and standard error from a Cox proportional hazards regression is obtained by a simple linear regression through the origin fit to the coordinates of the pictured points. The plots are illustrated with a reanalysis of a dataset of 65 patients with multiple myeloma.
Esserman, Denise A.; Moore, Charity G.; Roth, Mary T.
2009-01-01
Older community dwelling adults often take multiple medications for numerous chronic diseases. Non-adherence to these medications can have a large public health impact. Therefore, the measurement and modeling of medication adherence in the setting of polypharmacy is an important area of research. We apply a variety of different modeling techniques (standard linear regression; weighted linear regression; adjusted linear regression; naïve logistic regression; beta-binomial (BB) regression; generalized estimating equations (GEE)) to binary medication adherence data from a study in a North Carolina based population of older adults, where each medication an individual was taking was classified as adherent or non-adherent. In addition, through simulation we compare these different methods based on Type I error rates, bias, power, empirical 95% coverage, and goodness of fit. We find that estimation and inference using GEE is robust to a wide variety of scenarios and we recommend using this in the setting of polypharmacy when adherence is dichotomously measured for multiple medications per person. PMID:20414358
Ono, Tomohiro; Nakamura, Mitsuhiro; Hirose, Yoshinori; Kitsuda, Kenji; Ono, Yuka; Ishigaki, Takashi; Hiraoka, Masahiro
2017-09-01
To estimate the lung tumor position from multiple anatomical features on four-dimensional computed tomography (4D-CT) data sets using single regression analysis (SRA) and multiple regression analysis (MRA) approach and evaluate an impact of the approach on internal target volume (ITV) for stereotactic body radiotherapy (SBRT) of the lung. Eleven consecutive lung cancer patients (12 cases) underwent 4D-CT scanning. The three-dimensional (3D) lung tumor motion exceeded 5 mm. The 3D tumor position and anatomical features, including lung volume, diaphragm, abdominal wall, and chest wall positions, were measured on 4D-CT images. The tumor position was estimated by SRA using each anatomical feature and MRA using all anatomical features. The difference between the actual and estimated tumor positions was defined as the root-mean-square error (RMSE). A standard partial regression coefficient for the MRA was evaluated. The 3D lung tumor position showed a high correlation with the lung volume (R = 0.92 ± 0.10). Additionally, ITVs derived from SRA and MRA approaches were compared with ITV derived from contouring gross tumor volumes on all 10 phases of the 4D-CT (conventional ITV). The RMSE of the SRA was within 3.7 mm in all directions. Also, the RMSE of the MRA was within 1.6 mm in all directions. The standard partial regression coefficient for the lung volume was the largest and had the most influence on the estimated tumor position. Compared with conventional ITV, average percentage decrease of ITV were 31.9% and 38.3% using SRA and MRA approaches, respectively. The estimation accuracy of lung tumor position was improved by the MRA approach, which provided smaller ITV than conventional ITV. © 2017 The Authors. Journal of Applied Clinical Medical Physics published by Wiley Periodicals, Inc. on behalf of American Association of Physicists in Medicine.
ERIC Educational Resources Information Center
Bodner, Todd E.
2016-01-01
This article revisits how the end points of plotted line segments should be selected when graphing interactions involving a continuous target predictor variable. Under the standard approach, end points are chosen at ±1 or 2 standard deviations from the target predictor mean. However, when the target predictor and moderator are correlated or the…
NASA Astrophysics Data System (ADS)
Lu, Lin; Chang, Yunlong; Li, Yingmin; He, Youyou
2013-05-01
A transverse magnetic field was introduced to the arc plasma in the process of welding stainless steel tubes by high-speed Tungsten Inert Gas Arc Welding (TIG for short) without filler wire. The influence of external magnetic field on welding quality was investigated. 9 sets of parameters were designed by the means of orthogonal experiment. The welding joint tensile strength and form factor of weld were regarded as the main standards of welding quality. A binary quadratic nonlinear regression equation was established with the conditions of magnetic induction and flow rate of Ar gas. The residual standard deviation was calculated to adjust the accuracy of regression model. The results showed that, the regression model was correct and effective in calculating the tensile strength and aspect ratio of weld. Two 3D regression models were designed respectively, and then the impact law of magnetic induction on welding quality was researched.
Statistical Evaluation of Time Series Analysis Techniques
NASA Technical Reports Server (NTRS)
Benignus, V. A.
1973-01-01
The performance of a modified version of NASA's multivariate spectrum analysis program is discussed. A multiple regression model was used to make the revisions. Performance improvements were documented and compared to the standard fast Fourier transform by Monte Carlo techniques.
ERIC Educational Resources Information Center
Balfanz, Robert; Legters, Nettie; Jordan, Will
2004-01-01
Little is known about the feasibility and rapidity with which the academic learning of students who enter high school multiple years behind grade level can be accelerated. This study uses multiple regression analyses of standardized test and survey data from high-poverty high schools in two large urban districts to evaluate initial effects of the…
Black Male Labor Force Participation.
ERIC Educational Resources Information Center
Baer, Roger K.
This study attempts to test (via multiple regression analysis) hypothesized relationships between designated independent variables and age specific incidences of labor force participation for black male subpopulations in 54 Standard Metropolitan Statistical Areas. Leading independent variables tested include net migration, earnings, unemployment,…
The prediction of intelligence in preschool children using alternative models to regression.
Finch, W Holmes; Chang, Mei; Davis, Andrew S; Holden, Jocelyn E; Rothlisberg, Barbara A; McIntosh, David E
2011-12-01
Statistical prediction of an outcome variable using multiple independent variables is a common practice in the social and behavioral sciences. For example, neuropsychologists are sometimes called upon to provide predictions of preinjury cognitive functioning for individuals who have suffered a traumatic brain injury. Typically, these predictions are made using standard multiple linear regression models with several demographic variables (e.g., gender, ethnicity, education level) as predictors. Prior research has shown conflicting evidence regarding the ability of such models to provide accurate predictions of outcome variables such as full-scale intelligence (FSIQ) test scores. The present study had two goals: (1) to demonstrate the utility of a set of alternative prediction methods that have been applied extensively in the natural sciences and business but have not been frequently explored in the social sciences and (2) to develop models that can be used to predict premorbid cognitive functioning in preschool children. Predictions of Stanford-Binet 5 FSIQ scores for preschool-aged children is used to compare the performance of a multiple regression model with several of these alternative methods. Results demonstrate that classification and regression trees provided more accurate predictions of FSIQ scores than does the more traditional regression approach. Implications of these results are discussed.
Nguyen, Quynh C.; Osypuk, Theresa L.; Schmidt, Nicole M.; Glymour, M. Maria; Tchetgen Tchetgen, Eric J.
2015-01-01
Despite the recent flourishing of mediation analysis techniques, many modern approaches are difficult to implement or applicable to only a restricted range of regression models. This report provides practical guidance for implementing a new technique utilizing inverse odds ratio weighting (IORW) to estimate natural direct and indirect effects for mediation analyses. IORW takes advantage of the odds ratio's invariance property and condenses information on the odds ratio for the relationship between the exposure (treatment) and multiple mediators, conditional on covariates, by regressing exposure on mediators and covariates. The inverse of the covariate-adjusted exposure-mediator odds ratio association is used to weight the primary analytical regression of the outcome on treatment. The treatment coefficient in such a weighted regression estimates the natural direct effect of treatment on the outcome, and indirect effects are identified by subtracting direct effects from total effects. Weighting renders treatment and mediators independent, thereby deactivating indirect pathways of the mediators. This new mediation technique accommodates multiple discrete or continuous mediators. IORW is easily implemented and is appropriate for any standard regression model, including quantile regression and survival analysis. An empirical example is given using data from the Moving to Opportunity (1994–2002) experiment, testing whether neighborhood context mediated the effects of a housing voucher program on obesity. Relevant Stata code (StataCorp LP, College Station, Texas) is provided. PMID:25693776
Alpha-synuclein levels in patients with multiple system atrophy: a meta-analysis.
Yang, Fei; Li, Wan-Jun; Huang, Xu-Sheng
2018-05-01
This study evaluates the relationship between multiple system atrophy and α-synuclein levels in the cerebrospinal fluid, plasma and neural tissue. Literature search for relevant research articles was undertaken in electronic databases and study selection was based on a priori eligibility criteria. Random-effects meta-analyses of standardized mean differences in α-synuclein levels between multiple system atrophy patients and normal controls were conducted to obtain the overall and subgroup effect sizes. Meta-regression analyses were performed to evaluate the effect of age, gender and disease severity on standardized mean differences. Data were obtained from 11 studies involving 378 multiple system atrophy patients and 637 healthy controls (age: multiple system atrophy patients 64.14 [95% confidence interval 62.05, 66.23] years; controls 64.16 [60.06, 68.25] years; disease duration: 44.41 [26.44, 62.38] months). Cerebrospinal fluid α-synuclein levels were significantly lower in multiple system atrophy patients than in controls but in plasma and neural tissue, α-synuclein levels were significantly higher in multiple system atrophy patients (standardized mean difference: -0.99 [-1.65, -0.32]; p = 0.001). Percentage of male multiple system atrophy patients was significantly positively associated with the standardized mean differences of cerebrospinal fluid α-synuclein levels (p = 0.029) whereas the percentage of healthy males was not associated with the standardized mean differences of cerebrospinal fluid α-synuclein levels (p = 0.920). In multiple system atrophy patients, α-synuclein levels were significantly lower in the cerebrospinal fluid and were positively associated with the male gender.
Campos-Filho, N; Franco, E L
1989-02-01
A frequent procedure in matched case-control studies is to report results from the multivariate unmatched analyses if they do not differ substantially from the ones obtained after conditioning on the matching variables. Although conceptually simple, this rule requires that an extensive series of logistic regression models be evaluated by both the conditional and unconditional maximum likelihood methods. Most computer programs for logistic regression employ only one maximum likelihood method, which requires that the analyses be performed in separate steps. This paper describes a Pascal microcomputer (IBM PC) program that performs multiple logistic regression by both maximum likelihood estimation methods, which obviates the need for switching between programs to obtain relative risk estimates from both matched and unmatched analyses. The program calculates most standard statistics and allows factoring of categorical or continuous variables by two distinct methods of contrast. A built-in, descriptive statistics option allows the user to inspect the distribution of cases and controls across categories of any given variable.
Two SPSS programs for interpreting multiple regression results.
Lorenzo-Seva, Urbano; Ferrando, Pere J; Chico, Eliseo
2010-02-01
When multiple regression is used in explanation-oriented designs, it is very important to determine both the usefulness of the predictor variables and their relative importance. Standardized regression coefficients are routinely provided by commercial programs. However, they generally function rather poorly as indicators of relative importance, especially in the presence of substantially correlated predictors. We provide two user-friendly SPSS programs that implement currently recommended techniques and recent developments for assessing the relevance of the predictors. The programs also allow the user to take into account the effects of measurement error. The first program, MIMR-Corr.sps, uses a correlation matrix as input, whereas the second program, MIMR-Raw.sps, uses the raw data and computes bootstrap confidence intervals of different statistics. The SPSS syntax, a short manual, and data files related to this article are available as supplemental materials from http://brm.psychonomic-journals.org/content/supplemental.
Flood characteristics of Alaskan streams
Lamke, R.D.
1979-01-01
Peak discharge data for Alaskan streams are summarized and analyzed. Multiple-regression equations relating peak discharge magnitude and frequency to climatic and physical characteristics of 260 gaged basins were determined in order to estimate average recurrence interval of floods at ungaged sites. These equations are for 1.25-, 2-, 5-, 10-, 25-, and 50-year average recurrence intervals. In this report, Alaska was divided into two regions, one having a maritime climate with fall and winter rains and floods, the other having spring and summer floods of a variety or combinations of causes. Average standard errors of the six multiple-regression equations for these two regions were 48 and 74 percent, respectively. Maximum recorded floods at more than 400 sites throughout Alaska are tabulated. Maps showing lines of equal intensity of the principal climatic variables found to be significant (mean annual precipitation and mean minimum January temperature), and location of the 260 sites used in the multiple-regression analyses are included. Little flood data have been collected in western and arctic Alaska, and the predictive equations are therefore less reliable for those areas. (Woodard-USGS)
NASA Astrophysics Data System (ADS)
Grotti, Marco; Abelmoschi, Maria Luisa; Soggia, Francesco; Tiberiade, Christian; Frache, Roberto
2000-12-01
The multivariate effects of Na, K, Mg and Ca as nitrates on the electrothermal atomisation of manganese, cadmium and iron were studied by multiple linear regression modelling. Since the models proved to efficiently predict the effects of the considered matrix elements in a wide range of concentrations, they were applied to correct the interferences occurring in the determination of trace elements in seawater after pre-concentration of the analytes. In order to obtain a statistically significant number of samples, a large volume of the certified seawater reference materials CASS-3 and NASS-3 was treated with Chelex-100 resin; then, the chelating resin was separated from the solution, divided into several sub-samples, each of them was eluted with nitric acid and analysed by electrothermal atomic absorption spectrometry (for trace element determinations) and inductively coupled plasma optical emission spectrometry (for matrix element determinations). To minimise any other systematic error besides that due to matrix effects, accuracy of the pre-concentration step and contamination levels of the procedure were checked by inductively coupled plasma mass spectrometric measurements. Analytical results obtained by applying the multiple linear regression models were compared with those obtained with other calibration methods, such as external calibration using acid-based standards, external calibration using matrix-matched standards and the analyte addition technique. Empirical models proved to efficiently reduce interferences occurring in the analysis of real samples, allowing an improvement of accuracy better than for other calibration methods.
Incremental Validity in the Clinical Assessment of Early Childhood Development
ERIC Educational Resources Information Center
Liu, Xin; Zhou, Xiaobin; Lackaff, Julie
2013-01-01
The authors demonstrate the increment of clinical validity in early childhood assessment of physical impairment (PI), developmental delay (DD), and autism (AUT) using multiple standardized developmental screening measures such as performance measures and parent and teacher rating scales. Hierarchical regression and sensitivity/specificity analyses…
Ecologists are often faced with problem of small sample size, correlated and large number of predictors, and high noise-to-signal relationships. This necessitates excluding important variables from the model when applying standard multiple or multivariate regression analyses. In ...
Estimation of standard liver volume in Chinese adult living donors.
Fu-Gui, L; Lu-Nan, Y; Bo, L; Yong, Z; Tian-Fu, W; Ming-Qing, X; Wen-Tao, W; Zhe-Yu, C
2009-12-01
To determine a formula predicting the standard liver volume based on body surface area (BSA) or body weight in Chinese adults. A total of 115 consecutive right-lobe living donors not including the middle hepatic vein underwent right hemi-hepatectomy. No organs were used from prisoners, and no subjects were prisoners. Donor anthropometric data including age, gender, body weight, and body height were recorded prospectively. The weights and volumes of the right lobe liver grafts were measured at the back table. Liver weights and volumes were calculated from the right lobe graft weight and volume obtained at the back table, divided by the proportion of the right lobe on computed tomography. By simple linear regression analysis and stepwise multiple linear regression analysis, we correlated calculated liver volume and body height, body weight, or body surface area. The subjects had a mean age of 35.97 +/- 9.6 years, and a female-to-male ratio of 60:55. The mean volume of the right lobe was 727.47 +/- 136.17 mL, occupying 55.59% +/- 6.70% of the whole liver by computed tomography. The volume of the right lobe was 581.73 +/- 96.137 mL, and the estimated liver volume was 1053.08 +/- 167.56 mL. Females of the same body weight showed a slightly lower liver weight. By simple linear regression analysis and stepwise multiple linear regression analysis, a formula was derived based on body weight. All formulae except the Hong Kong formula overestimated liver volume compared to this formula. The formula of standard liver volume, SLV (mL) = 11.508 x body weight (kg) + 334.024, may be applied to estimate liver volumes in Chinese adults.
Father and adolescent son variables related to son's HIV prevention.
Glenn, Betty L; Demi, Alice; Kimble, Laura P
2008-02-01
The purpose of this study was to examine the relationship between fathers' influences and African American male adolescents' perceptions of self-efficacy to reduce high-risk sexual behavior. A convenience sample of 70 fathers was recruited from churches in a large metropolitan area in the South. Hierarchical multiple linear regression analysis indicated father-related factors and son-related factors were associated with 26.1% of the variance in son's self-efficacy to be abstinent. In the regression model greater son's perception of the communication of sexual standards and greater father's perception of his son's self-efficacy were significantly related to greater son's self-efficacy for abstinence. The second regression model with son's self-efficacy for safer sex as the criterion was not statistically significant. Data support the need for fathers to express confidence in their sons' ability to be abstinent or practice safer sex and to communicate with their sons regarding sexual issues and standards.
A Computer Program for Preliminary Data Analysis
Dennis L. Schweitzer
1967-01-01
ABSTRACT. -- A computer program written in FORTRAN has been designed to summarize data. Class frequencies, means, and standard deviations are printed for as many as 100 independent variables. Cross-classifications of an observed dependent variable and of a dependent variable predicted by a multiple regression equation can also be generated.
Predictors of Incomes. AIR Forum 1981 Paper.
ERIC Educational Resources Information Center
Witmer, David R.
Income predictions that provide some indication of the potential value of attending college are considered. Standard multiple regression analysis of data describing the income experiences of men 25 years old and older were used to determine differences in incomes of high school and college graduates. Information on the gross national product was…
USDA-ARS?s Scientific Manuscript database
Agricultural research increasingly seeks to quantify complex interactions of processes for a wide range of environmental conditions and crop management scenarios, leading to investigation where multiple sets of experimental data are examined using tools such as simulation and regression. The use of ...
Kabeshova, A; Annweiler, C; Fantino, B; Philip, T; Gromov, V A; Launay, C P; Beauchet, O
2014-06-01
Regression tree (RT) analyses are particularly adapted to explore the risk of recurrent falling according to various combinations of fall risk factors compared to logistic regression models. The aims of this study were (1) to determine which combinations of fall risk factors were associated with the occurrence of recurrent falls in older community-dwellers, and (2) to compare the efficacy of RT and multiple logistic regression model for the identification of recurrent falls. A total of 1,760 community-dwelling volunteers (mean age ± standard deviation, 71.0 ± 5.1 years; 49.4 % female) were recruited prospectively in this cross-sectional study. Age, gender, polypharmacy, use of psychoactive drugs, fear of falling (FOF), cognitive disorders and sad mood were recorded. In addition, the history of falls within the past year was recorded using a standardized questionnaire. Among 1,760 participants, 19.7 % (n = 346) were recurrent fallers. The RT identified 14 nodes groups and 8 end nodes with FOF as the first major split. Among participants with FOF, those who had sad mood and polypharmacy formed the end node with the greatest OR for recurrent falls (OR = 6.06 with p < 0.001). Among participants without FOF, those who were male and not sad had the lowest OR for recurrent falls (OR = 0.25 with p < 0.001). The RT correctly classified 1,356 from 1,414 non-recurrent fallers (specificity = 95.6 %), and 65 from 346 recurrent fallers (sensitivity = 18.8 %). The overall classification accuracy was 81.0 %. The multiple logistic regression correctly classified 1,372 from 1,414 non-recurrent fallers (specificity = 97.0 %), and 61 from 346 recurrent fallers (sensitivity = 17.6 %). The overall classification accuracy was 81.4 %. Our results show that RT may identify specific combinations of risk factors for recurrent falls, the combination most associated with recurrent falls involving FOF, sad mood and polypharmacy. The FOF emerged as the risk factor strongly associated with recurrent falls. In addition, RT and multiple logistic regression were not sensitive enough to identify the majority of recurrent fallers but appeared efficient in detecting individuals not at risk of recurrent falls.
Sherwood, J.M.
1986-01-01
Methods are presented for estimating peak discharges, flood volumes and hydrograph shapes of small (less than 5 sq mi) urban streams in Ohio. Examples of how to use the various regression equations and estimating techniques also are presented. Multiple-regression equations were developed for estimating peak discharges having recurrence intervals of 2, 5, 10, 25, 50, and 100 years. The significant independent variables affecting peak discharge are drainage area, main-channel slope, average basin-elevation index, and basin-development factor. Standard errors of regression and prediction for the peak discharge equations range from +/-37% to +/-41%. An equation also was developed to estimate the flood volume of a given peak discharge. Peak discharge, drainage area, main-channel slope, and basin-development factor were found to be the significant independent variables affecting flood volumes for given peak discharges. The standard error of regression for the volume equation is +/-52%. A technique is described for estimating the shape of a runoff hydrograph by applying a specific peak discharge and the estimated lagtime to a dimensionless hydrograph. An equation for estimating the lagtime of a basin was developed. Two variables--main-channel length divided by the square root of the main-channel slope and basin-development factor--have a significant effect on basin lagtime. The standard error of regression for the lagtime equation is +/-48%. The data base for the study was established by collecting rainfall-runoff data at 30 basins distributed throughout several metropolitan areas of Ohio. Five to eight years of data were collected at a 5-min record interval. The USGS rainfall-runoff model A634 was calibrated for each site. The calibrated models were used in conjunction with long-term rainfall records to generate a long-term streamflow record for each site. Each annual peak-discharge record was fitted to a Log-Pearson Type III frequency curve. Multiple-regression techniques were then used to analyze the peak discharge data as a function of the basin characteristics of the 30 sites. (Author 's abstract)
Nguyen, Quynh C; Osypuk, Theresa L; Schmidt, Nicole M; Glymour, M Maria; Tchetgen Tchetgen, Eric J
2015-03-01
Despite the recent flourishing of mediation analysis techniques, many modern approaches are difficult to implement or applicable to only a restricted range of regression models. This report provides practical guidance for implementing a new technique utilizing inverse odds ratio weighting (IORW) to estimate natural direct and indirect effects for mediation analyses. IORW takes advantage of the odds ratio's invariance property and condenses information on the odds ratio for the relationship between the exposure (treatment) and multiple mediators, conditional on covariates, by regressing exposure on mediators and covariates. The inverse of the covariate-adjusted exposure-mediator odds ratio association is used to weight the primary analytical regression of the outcome on treatment. The treatment coefficient in such a weighted regression estimates the natural direct effect of treatment on the outcome, and indirect effects are identified by subtracting direct effects from total effects. Weighting renders treatment and mediators independent, thereby deactivating indirect pathways of the mediators. This new mediation technique accommodates multiple discrete or continuous mediators. IORW is easily implemented and is appropriate for any standard regression model, including quantile regression and survival analysis. An empirical example is given using data from the Moving to Opportunity (1994-2002) experiment, testing whether neighborhood context mediated the effects of a housing voucher program on obesity. Relevant Stata code (StataCorp LP, College Station, Texas) is provided. © The Author 2015. Published by Oxford University Press on behalf of the Johns Hopkins Bloomberg School of Public Health. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Flood-frequency characteristics of Wisconsin streams
Walker, John F.; Peppler, Marie C.; Danz, Mari E.; Hubbard, Laura E.
2017-05-22
Flood-frequency characteristics for 360 gaged sites on unregulated rural streams in Wisconsin are presented for percent annual exceedance probabilities ranging from 0.2 to 50 using a statewide skewness map developed for this report. Equations of the relations between flood-frequency and drainage-basin characteristics were developed by multiple-regression analyses. Flood-frequency characteristics for ungaged sites on unregulated, rural streams can be estimated by use of the equations presented in this report. The State was divided into eight areas of similar physiographic characteristics. The most significant basin characteristics are drainage area, soil saturated hydraulic conductivity, main-channel slope, and several land-use variables. The standard error of prediction for the equation for the 1-percent annual exceedance probability flood ranges from 56 to 70 percent for Wisconsin Streams; these values are larger than results presented in previous reports. The increase in the standard error of prediction is likely due to increased variability of the annual-peak discharges, resulting in increased variability in the magnitude of flood peaks at higher frequencies. For each of the unregulated rural streamflow-gaging stations, a weighted estimate based on the at-site log Pearson type III analysis and the multiple regression results was determined. The weighted estimate generally has a lower uncertainty than either the Log Pearson type III or multiple regression estimates. For regulated streams, a graphical method for estimating flood-frequency characteristics was developed from the relations of discharge and drainage area for selected annual exceedance probabilities. Graphs for the major regulated streams in Wisconsin are presented in the report.
Intent to Persist in College Freshmen: The Role of Self-Efficacy and Mentorship
ERIC Educational Resources Information Center
Baier, Stefanie T.; Markman, Barry S.; Pernice-Duca, Francesca M.
2016-01-01
We surveyed 237 first-time college students to examine the extent to which social-cognitive factors--self-efficacy, perceptions of mentorship, high school GPA, ACT scores, first-semester college GPA, and demographic characteristics-- influence freshmen's intent to persist. Standard multiple regression and MANOVA were conducted to determine the…
A SAS Interface for Bayesian Analysis with WinBUGS
ERIC Educational Resources Information Center
Zhang, Zhiyong; McArdle, John J.; Wang, Lijuan; Hamagami, Fumiaki
2008-01-01
Bayesian methods are becoming very popular despite some practical difficulties in implementation. To assist in the practical application of Bayesian methods, we show how to implement Bayesian analysis with WinBUGS as part of a standard set of SAS routines. This implementation procedure is first illustrated by fitting a multiple regression model…
Yan, Chao-Gan; Craddock, R. Cameron; Zuo, Xi-Nian; Zang, Yu-Feng; Milham, Michael P.
2014-01-01
As researchers increase their efforts to characterize variations in the functional connectome across studies and individuals, concerns about the many sources of nuisance variation present and their impact on resting state fMRI (R-fMRI) measures continue to grow. Although substantial within-site variation can exist, efforts to aggregate data across multiple sites such as the 1000 Functional Connectomes Project (FCP) and International Neuroimaging Data-sharing Initiative (INDI) datasets amplify these concerns. The present work draws upon standardization approaches commonly used in the microarray gene expression literature, and to a lesser extent recent imaging studies, and compares them with respect to their impact on relationships between common R-fMRI measures and nuisance variables (e.g., imaging site, motion), as well as phenotypic variables of interest (age, sex). Standardization approaches differed with regard to whether they were applied post-hoc vs. during pre-processing, and at the individual vs. group level; additionally they varied in whether they addressed additive effects vs. additive + multiplicative effects, and were parametric vs. non-parametric. While all standardization approaches were effective at reducing undesirable relationships with nuisance variables, post-hoc approaches were generally more effective than global signal regression (GSR). Across approaches, correction for additive effects (global mean) appeared to be more important than for multiplicative effects (global SD) for all R-fMRI measures, with the exception of amplitude of low frequency fluctuations (ALFF). Group-level post-hoc standardizations for mean-centering and variance-standardization were found to be advantageous in their ability to avoid the introduction of artifactual relationships with standardization parameters; though results between individual and group-level post-hoc approaches were highly similar overall. While post-hoc standardization procedures drastically increased test–retest (TRT) reliability for ALFF, modest reductions were observed for other measures after post-hoc standardizations—a phenomena likely attributable to the separation of voxel-wise from global differences among subjects (global mean and SD demonstrated moderate TRT reliability for these measures). Finally, the present work calls into question previous observations of increased anatomical specificity for GSR over mean centering, and draws attention to the near equivalence of global and gray matter signal regression. PMID:23631983
Smith, S. Jerrod; Lewis, Jason M.; Graves, Grant M.
2015-09-28
Generalized-least-squares multiple-linear regression analysis was used to formulate regression relations between peak-streamflow frequency statistics and basin characteristics. Contributing drainage area was the only basin characteristic determined to be statistically significant for all percentage of annual exceedance probabilities and was the only basin characteristic used in regional regression equations for estimating peak-streamflow frequency statistics on unregulated streams in and near the Oklahoma Panhandle. The regression model pseudo-coefficient of determination, converted to percent, for the Oklahoma Panhandle regional regression equations ranged from about 38 to 63 percent. The standard errors of prediction and the standard model errors for the Oklahoma Panhandle regional regression equations ranged from about 84 to 148 percent and from about 76 to 138 percent, respectively. These errors were comparable to those reported for regional peak-streamflow frequency regression equations for the High Plains areas of Texas and Colorado. The root mean square errors for the Oklahoma Panhandle regional regression equations (ranging from 3,170 to 92,000 cubic feet per second) were less than the root mean square errors for the Oklahoma statewide regression equations (ranging from 18,900 to 412,000 cubic feet per second); therefore, the Oklahoma Panhandle regional regression equations produce more accurate peak-streamflow statistic estimates for the irrigated period of record in the Oklahoma Panhandle than do the Oklahoma statewide regression equations. The regression equations developed in this report are applicable to streams that are not substantially affected by regulation, impoundment, or surface-water withdrawals. These regression equations are intended for use for stream sites with contributing drainage areas less than or equal to about 2,060 square miles, the maximum value for the independent variable used in the regression analysis.
Marston, Louise; Peacock, Janet L; Yu, Keming; Brocklehurst, Peter; Calvert, Sandra A; Greenough, Anne; Marlow, Neil
2009-07-01
Studies of prematurely born infants contain a relatively large percentage of multiple births, so the resulting data have a hierarchical structure with small clusters of size 1, 2 or 3. Ignoring the clustering may lead to incorrect inferences. The aim of this study was to compare statistical methods which can be used to analyse such data: generalised estimating equations, multilevel models, multiple linear regression and logistic regression. Four datasets which differed in total size and in percentage of multiple births (n = 254, multiple 18%; n = 176, multiple 9%; n = 10 098, multiple 3%; n = 1585, multiple 8%) were analysed. With the continuous outcome, two-level models produced similar results in the larger dataset, while generalised least squares multilevel modelling (ML GLS 'xtreg' in Stata) and maximum likelihood multilevel modelling (ML MLE 'xtmixed' in Stata) produced divergent estimates using the smaller dataset. For the dichotomous outcome, most methods, except generalised least squares multilevel modelling (ML GH 'xtlogit' in Stata) gave similar odds ratios and 95% confidence intervals within datasets. For the continuous outcome, our results suggest using multilevel modelling. We conclude that generalised least squares multilevel modelling (ML GLS 'xtreg' in Stata) and maximum likelihood multilevel modelling (ML MLE 'xtmixed' in Stata) should be used with caution when the dataset is small. Where the outcome is dichotomous and there is a relatively large percentage of non-independent data, it is recommended that these are accounted for in analyses using logistic regression with adjusted standard errors or multilevel modelling. If, however, the dataset has a small percentage of clusters greater than size 1 (e.g. a population dataset of children where there are few multiples) there appears to be less need to adjust for clustering.
Societal integration and age-standardized suicide rates in 21 developed countries, 1955-1989.
Fernquist, R M; Cutright, P
1998-01-01
Gender-specific age-standardized suicide rates for 21 developed countries over seven 5-year periods (1955-59...1985-89) form the two dependent variables. Durkheim's theory of societal integration is the framework used to generate the independent variables, although several recent theories are also examined. The results from a MGLS multiple regression analysis of both male and female rates provide overwhelming support for a multidimensional theory of societal integration and suicide, as first suggested by Durkheim.
SU-F-R-20: Image Texture Features Correlate with Time to Local Failure in Lung SBRT Patients
DOE Office of Scientific and Technical Information (OSTI.GOV)
Andrews, M; Abazeed, M; Woody, N
Purpose: To explore possible correlation between CT image-based texture and histogram features and time-to-local-failure in early stage non-small cell lung cancer (NSCLC) patients treated with stereotactic body radiotherapy (SBRT).Methods and Materials: From an IRB-approved lung SBRT registry for patients treated between 2009–2013 we selected 48 (20 male, 28 female) patients with local failure. Median patient age was 72.3±10.3 years. Mean time to local failure was 15 ± 7.1 months. Physician-contoured gross tumor volumes (GTV) on the planning CT images were processed and 3D gray-level co-occurrence matrix (GLCM) based texture and histogram features were calculated in Matlab. Data were exported tomore » R and a multiple linear regression model was used to examine the relationship between texture features and time-to-local-failure. Results: Multiple linear regression revealed that entropy (p=0.0233, multiple R2=0.60) from GLCM-based texture analysis and the standard deviation (p=0.0194, multiple R2=0.60) from the histogram-based features were statistically significantly correlated with the time-to-local-failure. Conclusion: Image-based texture analysis can be used to predict certain aspects of treatment outcomes of NSCLC patients treated with SBRT. We found entropy and standard deviation calculated for the GTV on the CT images displayed a statistically significant correlation with and time-to-local-failure in lung SBRT patients.« less
Fossum, Kenneth D.; O'Day, Christie M.; Wilson, Barbara J.; Monical, Jim E.
2001-01-01
Stormwater and streamflow in Maricopa County were monitored to (1) describe the physical, chemical, and toxicity characteristics of stormwater from areas having different land uses, (2) describe the physical, chemical, and toxicity characteristics of streamflow from areas that receive urban stormwater, and (3) estimate constituent loads in stormwater. Urban stormwater and streamflow had similar ranges in most constituent concentrations. The mean concentration of dissolved solids in urban stormwater was lower than in streamflow from the Salt River and Indian Bend Wash. Urban stormwater, however, had a greater chemical oxygen demand and higher concentrations of most nutrients. Mean seasonal loads and mean annual loads of 11 constituents and volumes of runoff were estimated for municipalities in the metropolitan Phoenix area, Arizona, by adjusting regional regression equations of loads. This adjustment procedure uses the original regional regression equation and additional explanatory variables that were not included in the original equation. The adjusted equations had standard errors that ranged from 161 to 196 percent. The large standard errors of the prediction result from the large variability of the constituent concentration data used in the regression analysis. Adjustment procedures produced unsatisfactory results for nine of the regressions?suspended solids, dissolved solids, total phosphorus, dissolved phosphorus, total recoverable cadmium, total recoverable copper, total recoverable lead, total recoverable zinc, and storm runoff. These equations had no consistent direction of bias and no other additional explanatory variables correlated with the observed loads. A stepwise-multiple regression or a three-variable regression (total storm rainfall, drainage area, and impervious area) and local data were used to develop local regression equations for these nine constituents. These equations had standard errors from 15 to 183 percent.
ERIC Educational Resources Information Center
Stokes, D. Michelle.
2013-01-01
This correlational study used standard multiple regression to determine if there was a relationship between the factors of cultural intelligence (metacognitive CQ, cognitive CQ, motivational CQ, and behavioral CQ) and transformational leadership in doctoral students. This study also sought to determine the best predictor of burnout by using a…
The Draw a Scientist Test: A Different Population and a Somewhat Different Story
ERIC Educational Resources Information Center
Thomas, Mark D.; Henley, Tracy B.; Snell, Catherine M.
2006-01-01
This study examined Draw-a-Scientist-Test (DAST) images solicited from 212 undergraduate students for the presence of traditional gender stereotypes. Participants were 100 males and 112 females enrolled in psychology or computer science courses with a mean age of 21.02 years. A standard multiple regression generated a model that accounts for the…
NASA Astrophysics Data System (ADS)
Mai, W.; Zhang, J.-F.; Zhao, X.-M.; Li, Z.; Xu, Z.-W.
2017-11-01
Wastewater from the dye industry is typically analyzed using a standard method for measurement of chemical oxygen demand (COD) or by a single-wavelength spectroscopic method. To overcome the disadvantages of these methods, ultraviolet-visible (UV-Vis) spectroscopy was combined with principal component regression (PCR) and partial least squares regression (PLSR) in this study. Unlike the standard method, this method does not require digestion of the samples for preparation. Experiments showed that the PLSR model offered high prediction performance for COD, with a mean relative error of about 5% for two dyes. This error is similar to that obtained with the standard method. In this study, the precision of the PLSR model decreased with the number of dye compounds present. It is likely that multiple models will be required in reality, and the complexity of a COD monitoring system would be greatly reduced if the PLSR model is used because it can include several dyes. UV-Vis spectroscopy with PLSR successfully enhanced the performance of COD prediction for dye wastewater and showed good potential for application in on-line water quality monitoring.
Multiple regression technique for Pth degree polynominals with and without linear cross products
NASA Technical Reports Server (NTRS)
Davis, J. W.
1973-01-01
A multiple regression technique was developed by which the nonlinear behavior of specified independent variables can be related to a given dependent variable. The polynomial expression can be of Pth degree and can incorporate N independent variables. Two cases are treated such that mathematical models can be studied both with and without linear cross products. The resulting surface fits can be used to summarize trends for a given phenomenon and provide a mathematical relationship for subsequent analysis. To implement this technique, separate computer programs were developed for the case without linear cross products and for the case incorporating such cross products which evaluate the various constants in the model regression equation. In addition, the significance of the estimated regression equation is considered and the standard deviation, the F statistic, the maximum absolute percent error, and the average of the absolute values of the percent of error evaluated. The computer programs and their manner of utilization are described. Sample problems are included to illustrate the use and capability of the technique which show the output formats and typical plots comparing computer results to each set of input data.
1990-05-01
0.759 0.744 0.768 0.753 106 (THUMBBR) THUMB BREADTH -0.652 -0.673 -0.539 -0.663 217 (LIPLGTHH) LIP LENGTH HEADBOARD 0.017 0.019 0.020 51 (FTBRHOR) FOOT...DEPENDENT VARIABLE: (106) THUMB BREADTH (THUBBR) MODEL INDEPENDENT VARIABLE 1 2 3 4 5 INTERCEPT 6.621 5.016 6.267 5.697 4.528 59 (HANDCIRC) HAND...95 (SLLSPEL) SLEEVE LENGTH: SPINE-ELBOW -0.020 -0.019 -C.018 9 (BLFTCIRC) BALL OF FOOT CIRCUMFERENCE -0.032 -0.039 106 (THUMBBR) THUMB BREADTH 0.228
Riccardi, M; Mele, G; Pulvento, C; Lavini, A; d'Andria, R; Jacobsen, S-E
2014-06-01
Leaf chlorophyll content provides valuable information about physiological status of plants; it is directly linked to photosynthetic potential and primary production. In vitro assessment by wet chemical extraction is the standard method for leaf chlorophyll determination. This measurement is expensive, laborious, and time consuming. Over the years alternative methods, rapid and non-destructive, have been explored. The aim of this work was to evaluate the applicability of a fast and non-invasive field method for estimation of chlorophyll content in quinoa and amaranth leaves based on RGB components analysis of digital images acquired with a standard SLR camera. Digital images of leaves from different genotypes of quinoa and amaranth were acquired directly in the field. Mean values of each RGB component were evaluated via image analysis software and correlated to leaf chlorophyll provided by standard laboratory procedure. Single and multiple regression models using RGB color components as independent variables have been tested and validated. The performance of the proposed method was compared to that of the widely used non-destructive SPAD method. Sensitivity of the best regression models for different genotypes of quinoa and amaranth was also checked. Color data acquisition of the leaves in the field with a digital camera was quick, more effective, and lower cost than SPAD. The proposed RGB models provided better correlation (highest R (2)) and prediction (lowest RMSEP) of the true value of foliar chlorophyll content and had a lower amount of noise in the whole range of chlorophyll studied compared with SPAD and other leaf image processing based models when applied to quinoa and amaranth.
Estimation of Magnitude and Frequency of Floods for Streams on the Island of Oahu, Hawaii
Wong, Michael F.
1994-01-01
This report describes techniques for estimating the magnitude and frequency of floods for the island of Oahu. The log-Pearson Type III distribution and methodology recommended by the Interagency Committee on Water Data was used to determine the magnitude and frequency of floods at 79 gaging stations that had 11 to 72 years of record. Multiple regression analysis was used to construct regression equations to transfer the magnitude and frequency information from gaged sites to ungaged sites. Oahu was divided into three hydrologic regions to define relations between peak discharge and drainage-basin and climatic characteristics. Regression equations are provided to estimate the 2-, 5-, 10-, 25-, 50-, and 100-year peak discharges at ungaged sites. Significant basin and climatic characteristics included in the regression equations are drainage area, median annual rainfall, and the 2-year, 24-hour rainfall intensity. Drainage areas for sites used in this study ranged from 0.03 to 45.7 square miles. Standard error of prediction for the regression equations ranged from 34 to 62 percent. Peak-discharge data collected through water year 1988, geographic information system (GIS) technology, and generalized least-squares regression were used in the analyses. The use of GIS seems to be a more flexible and consistent means of defining and calculating basin and climatic characteristics than using manual methods. Standard errors of estimate for the regression equations in this report are an average of 8 percent less than those published in previous studies.
ERIC Educational Resources Information Center
Chen, Xinguang; Stanton, Bonita; Li, Xiaoming; Fang, Xiaoyi; Lin, Danhua; Xiong, Qing
2009-01-01
Objective: To determine whether rural-to-urban migrants in China are more likely than rural and urban residents to engage in risk behaviors. Methods: Comparative analysis of survey data between migrants and rural and urban residents using age standardized rate and multiple logistic regression. Results: The prevalence and frequency of tobacco…
Analysis of Sequence Data Under Multivariate Trait-Dependent Sampling.
Tao, Ran; Zeng, Donglin; Franceschini, Nora; North, Kari E; Boerwinkle, Eric; Lin, Dan-Yu
2015-06-01
High-throughput DNA sequencing allows for the genotyping of common and rare variants for genetic association studies. At the present time and for the foreseeable future, it is not economically feasible to sequence all individuals in a large cohort. A cost-effective strategy is to sequence those individuals with extreme values of a quantitative trait. We consider the design under which the sampling depends on multiple quantitative traits. Under such trait-dependent sampling, standard linear regression analysis can result in bias of parameter estimation, inflation of type I error, and loss of power. We construct a likelihood function that properly reflects the sampling mechanism and utilizes all available data. We implement a computationally efficient EM algorithm and establish the theoretical properties of the resulting maximum likelihood estimators. Our methods can be used to perform separate inference on each trait or simultaneous inference on multiple traits. We pay special attention to gene-level association tests for rare variants. We demonstrate the superiority of the proposed methods over standard linear regression through extensive simulation studies. We provide applications to the Cohorts for Heart and Aging Research in Genomic Epidemiology Targeted Sequencing Study and the National Heart, Lung, and Blood Institute Exome Sequencing Project.
Functional capacity following univentricular repair--midterm outcome.
Sen, Supratim; Bandyopadhyay, Biswajit; Eriksson, Peter; Chattopadhyay, Amitabha
2012-01-01
Previous studies have seldom compared functional capacity in children following Fontan procedure alongside those with Glenn operation as destination therapy. We hypothesized that Fontan circulation enables better midterm submaximal exercise capacity as compared to Glenn physiology and evaluated this using the 6-minute walk test. Fifty-seven children aged 5-18 years with Glenn (44) or Fontan (13) operations were evaluated with standard 6-minute walk protocols. Baseline SpO(2) was significantly lower in Glenn patients younger than 10 years compared to Fontan counterparts and similar in the two groups in older children. Postexercise SpO(2) fell significantly in Glenn patients compared to the Fontan group. There was no statistically significant difference in baseline, postexercise, or postrecovery heart rates (HRs), or 6-minute walk distances in the two groups. Multiple regression analysis revealed lower resting HR, higher resting SpO(2) , and younger age at latest operation to be significant determinants of longer 6-minute walk distance. Multiple regression analysis also established that younger age at operation, higher resting SpO(2) , Fontan operation, lower resting HR, and lower postexercise HR were significant determinants of higher postexercise SpO(2) . Younger age at operation and exercise, lower resting HR and postexercise HR, higher resting SpO(2) and postexercise SpO(2) , and dominant ventricular morphology being left ventricular or indeterminate/mixed had significant association with better 6-minute work on multiple regression analysis. Lower resting HR had linear association with longer 6-minute walk distances in the Glenn patients. Compared to Glenn physiology, Fontan operation did not have better submaximal exercise capacity assessed by walk distance or work on multiple regression analysis. Lower resting HR, higher resting SpO(2) , and younger age at operation were factors uniformly associated with better submaximal exercise capacity. © 2012 Wiley Periodicals, Inc.
Howley, Donna; Howley, Peter; Oxenham, Marc F
2018-06-01
Stature and a further 8 anthropometric dimensions were recorded from the arms and hands of a sample of 96 staff and students from the Australian National University and The University of Newcastle, Australia. These dimensions were used to create simple and multiple logistic regression models for sex estimation and simple and multiple linear regression equations for stature estimation of a contemporary Australian population. Overall sex classification accuracies using the models created were comparable to similar studies. The stature estimation models achieved standard errors of estimates (SEE) which were comparable to and in many cases lower than those achieved in similar research. Generic, non sex-specific models achieved similar SEEs and R 2 values to the sex-specific models indicating stature may be accurately estimated when sex is unknown. Copyright © 2018 Elsevier B.V. All rights reserved.
Guo, Ying; Little, Roderick J; McConnell, Daniel S
2012-01-01
Covariate measurement error is common in epidemiologic studies. Current methods for correcting measurement error with information from external calibration samples are insufficient to provide valid adjusted inferences. We consider the problem of estimating the regression of an outcome Y on covariates X and Z, where Y and Z are observed, X is unobserved, but a variable W that measures X with error is observed. Information about measurement error is provided in an external calibration sample where data on X and W (but not Y and Z) are recorded. We describe a method that uses summary statistics from the calibration sample to create multiple imputations of the missing values of X in the regression sample, so that the regression coefficients of Y on X and Z and associated standard errors can be estimated using simple multiple imputation combining rules, yielding valid statistical inferences under the assumption of a multivariate normal distribution. The proposed method is shown by simulation to provide better inferences than existing methods, namely the naive method, classical calibration, and regression calibration, particularly for correction for bias and achieving nominal confidence levels. We also illustrate our method with an example using linear regression to examine the relation between serum reproductive hormone concentrations and bone mineral density loss in midlife women in the Michigan Bone Health and Metabolism Study. Existing methods fail to adjust appropriately for bias due to measurement error in the regression setting, particularly when measurement error is substantial. The proposed method corrects this deficiency.
Barth, Nancy A.; Veilleux, Andrea G.
2012-01-01
The U.S. Geological Survey (USGS) is currently updating at-site flood frequency estimates for USGS streamflow-gaging stations in the desert region of California. The at-site flood-frequency analysis is complicated by short record lengths (less than 20 years is common) and numerous zero flows/low outliers at many sites. Estimates of the three parameters (mean, standard deviation, and skew) required for fitting the log Pearson Type 3 (LP3) distribution are likely to be highly unreliable based on the limited and heavily censored at-site data. In a generalization of the recommendations in Bulletin 17B, a regional analysis was used to develop regional estimates of all three parameters (mean, standard deviation, and skew) of the LP3 distribution. A regional skew value of zero from a previously published report was used with a new estimated mean squared error (MSE) of 0.20. A weighted least squares (WLS) regression method was used to develop both a regional standard deviation and a mean model based on annual peak-discharge data for 33 USGS stations throughout California’s desert region. At-site standard deviation and mean values were determined by using an expected moments algorithm (EMA) method for fitting the LP3 distribution to the logarithms of annual peak-discharge data. Additionally, a multiple Grubbs-Beck (MGB) test, a generalization of the test recommended in Bulletin 17B, was used for detecting multiple potentially influential low outliers in a flood series. The WLS regression found that no basin characteristics could explain the variability of standard deviation. Consequently, a constant regional standard deviation model was selected, resulting in a log-space value of 0.91 with a MSE of 0.03 log units. Yet drainage area was found to be statistically significant at explaining the site-to-site variability in mean. The linear WLS regional mean model based on drainage area had a Pseudo- 2 R of 51 percent and a MSE of 0.32 log units. The regional parameter estimates were then used to develop a set of equations for estimating flows with 50-, 20-, 10-, 4-, 2-, 1-, 0.5-, and 0.2-percent annual exceedance probabilities for ungaged basins. The final equations are functions of drainage area.Average standard errors of prediction for these regression equations range from 214.2 to 856.2 percent.
NASA Astrophysics Data System (ADS)
Zhang, Ying; Bi, Peng; Hiller, Janet
2008-01-01
This is the first study to identify appropriate regression models for the association between climate variation and salmonellosis transmission. A comparison between different regression models was conducted using surveillance data in Adelaide, South Australia. By using notified salmonellosis cases and climatic variables from the Adelaide metropolitan area over the period 1990-2003, four regression methods were examined: standard Poisson regression, autoregressive adjusted Poisson regression, multiple linear regression, and a seasonal autoregressive integrated moving average (SARIMA) model. Notified salmonellosis cases in 2004 were used to test the forecasting ability of the four models. Parameter estimation, goodness-of-fit and forecasting ability of the four regression models were compared. Temperatures occurring 2 weeks prior to cases were positively associated with cases of salmonellosis. Rainfall was also inversely related to the number of cases. The comparison of the goodness-of-fit and forecasting ability suggest that the SARIMA model is better than the other three regression models. Temperature and rainfall may be used as climatic predictors of salmonellosis cases in regions with climatic characteristics similar to those of Adelaide. The SARIMA model could, thus, be adopted to quantify the relationship between climate variations and salmonellosis transmission.
Interquantile Shrinkage in Regression Models
Jiang, Liewen; Wang, Huixia Judy; Bondell, Howard D.
2012-01-01
Conventional analysis using quantile regression typically focuses on fitting the regression model at different quantiles separately. However, in situations where the quantile coefficients share some common feature, joint modeling of multiple quantiles to accommodate the commonality often leads to more efficient estimation. One example of common features is that a predictor may have a constant effect over one region of quantile levels but varying effects in other regions. To automatically perform estimation and detection of the interquantile commonality, we develop two penalization methods. When the quantile slope coefficients indeed do not change across quantile levels, the proposed methods will shrink the slopes towards constant and thus improve the estimation efficiency. We establish the oracle properties of the two proposed penalization methods. Through numerical investigations, we demonstrate that the proposed methods lead to estimations with competitive or higher efficiency than the standard quantile regression estimation in finite samples. Supplemental materials for the article are available online. PMID:24363546
Stature estimation from the lengths of the growing foot-a study on North Indian adolescents.
Krishan, Kewal; Kanchan, Tanuj; Passi, Neelam; DiMaggio, John A
2012-12-01
Stature estimation is considered as one of the basic parameters of the investigation process in unknown and commingled human remains in medico-legal case work. Race, age and sex are the other parameters which help in this process. Stature estimation is of the utmost importance as it completes the biological profile of a person along with the other three parameters of identification. The present research is intended to formulate standards for stature estimation from foot dimensions in adolescent males from North India and study the pattern of foot growth during the growing years. 154 male adolescents from the Northern part of India were included in the study. Besides stature, five anthropometric measurements that included the length of the foot from each toe (T1, T2, T3, T4, and T5 respectively) to pternion were measured on each foot. The data was analyzed statistically using Student's t-test, Pearson's correlation, linear and multiple regression analysis for estimation of stature and growth of foot during ages 13-18 years. Correlation coefficients between stature and all the foot measurements were found to be highly significant and positively correlated. Linear regression models and multiple regression models (with age as a co-variable) were derived for estimation of stature from the different measurements of the foot. Multiple regression models (with age as a co-variable) estimate stature with greater accuracy than the regression models for 13-18 years age group. The study shows the growth pattern of feet in North Indian adolescents and indicates that anthropometric measurements of the foot and its segments are valuable in estimation of stature in growing individuals of that population. Copyright © 2012 Elsevier Ltd. All rights reserved.
The impact of a standardized program on short and long-term outcomes in bariatric surgery.
Aird, Lisa N F; Hong, Dennis; Gmora, Scott; Breau, Ruth; Anvari, Mehran
2017-02-01
The purpose of this study was to determine whether there has been an improvement in short- and long-term clinical outcomes since 2010, when the Ontario Bariatric Network led a province-wide initiative to establish a standardized system of care for bariatric patients. The system includes nine bariatric centers, a centralized referral system, and a research registry. Standardization of procedures has progressed yearly, including guidelines for preoperative assessment and perioperative care. Analysis of the OBN registry data was performed by fiscal year between April 2010 and March 2015. Three-month overall postoperative complication rates and 30 day postoperative mortality were calculated. The mean percentage of weight loss at 1, 2, and 3 years postoperative, and regression of obesity-related diseases were calculated. The analysis of continuous and nominal data was performed using ANOVA, Chi-square, and McNemar's testing. A multiple logistic regression analysis was performed for factors affecting postoperative complication rate. Eight thousand and forty-three patients were included in the bariatric registry between April 2010 and March 2015. Thirty-day mortality was rare (<0.075 %) and showed no significant difference between years. Three-month overall postoperative complication rates significantly decreased with standardization (p < 0.001), as did intra-operative complication rates (p < -0.001). Regression analysis demonstrated increasing standardization to be a predictor of 3 month complication rate OR of 0.59 (95 %CI 0.41-0.85, p = 0.00385). The mean percentage of weight loss at 1, 2, and 3 years postoperative showed stability at 33.2 % (9.0 SD), 34.1 % (10.1 SD), and 32.7 % (10.1 SD), respectively. Sustained regression in obesity-related comorbidities was demonstrated at 1, 2, and 3 years postoperative. Evidence indicates the implementation of a standardized system of bariatric care has contributed to improvements in complication rates and supported prolonged weight loss and regression of obesity-related diseases in patients undergoing bariatric surgery in Ontario.
Ibidunni, Ayodotun Stephen; Ibidunni, Oyebisi Mary; Olokundun, Maxwell Ayodele; Falola, Hezekiah Olubusayo; Salau, Odunayo Paul; Borishade, Taiye Tairat
2018-06-01
This article present data on the disposition of SME operators towards enhancing SMEs Performance through entrepreneurial orientations. Copies of structured questionnaire were administered to 102 SME owners/managers. Using descriptive and standard multiple regression statistical analysis, the data described how proactiveness, risk-taking and autonomy orientations significantly influenced SMEs' profitability, sales growth, customer satisfaction and new product success.
Zhang, L; Liu, X J
2016-06-03
With the rapid development of next-generation high-throughput sequencing technology, RNA-seq has become a standard and important technique for transcriptome analysis. For multi-sample RNA-seq data, the existing expression estimation methods usually deal with each single-RNA-seq sample, and ignore that the read distributions are consistent across multiple samples. In the current study, we propose a structured sparse regression method, SSRSeq, to estimate isoform expression using multi-sample RNA-seq data. SSRSeq uses a non-parameter model to capture the general tendency of non-uniformity read distribution for all genes across multiple samples. Additionally, our method adds a structured sparse regularization, which not only incorporates the sparse specificity between a gene and its corresponding isoform expression levels, but also reduces the effects of noisy reads, especially for lowly expressed genes and isoforms. Four real datasets were used to evaluate our method on isoform expression estimation. Compared with other popular methods, SSRSeq reduced the variance between multiple samples, and produced more accurate isoform expression estimations, and thus more meaningful biological interpretations.
Lee, L.; Helsel, D.
2005-01-01
Trace contaminants in water, including metals and organics, often are measured at sufficiently low concentrations to be reported only as values below the instrument detection limit. Interpretation of these "less thans" is complicated when multiple detection limits occur. Statistical methods for multiply censored, or multiple-detection limit, datasets have been developed for medical and industrial statistics, and can be employed to estimate summary statistics or model the distributions of trace-level environmental data. We describe S-language-based software tools that perform robust linear regression on order statistics (ROS). The ROS method has been evaluated as one of the most reliable procedures for developing summary statistics of multiply censored data. It is applicable to any dataset that has 0 to 80% of its values censored. These tools are a part of a software library, or add-on package, for the R environment for statistical computing. This library can be used to generate ROS models and associated summary statistics, plot modeled distributions, and predict exceedance probabilities of water-quality standards. ?? 2005 Elsevier Ltd. All rights reserved.
On the relation between personality and job performance of airline pilots.
Hormann, H J; Maschke, P
1996-01-01
The validity of a personality questionnaire for the prediction of job success of airline pilots is compared to validities of a simulator checkflight and of flying experience data. During selection, 274 pilots applying for employment with a European charter airline were examined with a multidimensional personality questionnaire (Temperature Structure Scales; TSS). Additionally, the applicants were graded in a simulator checkflight. On the basis of training records, the pilots were classified as performing at standard or below standard after about 3 years of employment in the hiring company. In a multiple-regression model, this dichotomous criterion for job success can be predicted with 73.8% accuracy through the simulator checkflight and flying experience prior to employment. By adding the personality questionnaire to the regression equation, the number of correct classifications increases to 79.3%. On average, successful pilots score substantially higher on interpersonal scales and lower on emotional scales of the TSS.
Measurement of lung volumes from supine portable chest radiographs.
Ries, A L; Clausen, J L; Friedman, P J
1979-12-01
Lung volumes in supine nonambulatory patients are physiological parameters often difficult to measure with current techniques (plethysmograph, gas dilution). Existing radiographic methods for measuring lung volumes require standard upright chest radiographs. Accordingly, in 31 normal supine adults, we determined helium-dilution functional residual and total lung capacities and measured planimetric lung field areas (LFA) from corresponding portable anteroposterior and lateral radiographs. Low radiation dose methods, which delivered less than 10% of that from standard portable X-ray technique, were utilized. Correlation between lung volume and radiographic LFA was highly significant (r = 0.96, SEE = 10.6%). Multiple-step regressions using height and chest diameter correction factors reduced variance, but weight and radiographic magnification factors did not. In 17 additional subjects studied for validation, the regression equations accurately predicted radiographic lung volume. Thus, this technique can provide accurate and rapid measurement of lung volume in studies involving supine patients.
NASA Astrophysics Data System (ADS)
Denli, H. H.; Koc, Z.
2015-12-01
Estimation of real properties depending on standards is difficult to apply in time and location. Regression analysis construct mathematical models which describe or explain relationships that may exist between variables. The problem of identifying price differences of properties to obtain a price index can be converted into a regression problem, and standard techniques of regression analysis can be used to estimate the index. Considering regression analysis for real estate valuation, which are presented in real marketing process with its current characteristics and quantifiers, the method will help us to find the effective factors or variables in the formation of the value. In this study, prices of housing for sale in Zeytinburnu, a district in Istanbul, are associated with its characteristics to find a price index, based on information received from a real estate web page. The associated variables used for the analysis are age, size in m2, number of floors having the house, floor number of the estate and number of rooms. The price of the estate represents the dependent variable, whereas the rest are independent variables. Prices from 60 real estates have been used for the analysis. Same price valued locations have been found and plotted on the map and equivalence curves have been drawn identifying the same valued zones as lines.
Estimation of stature from the foot and its segments in a sub-adult female population of North India
2011-01-01
Background Establishing personal identity is one of the main concerns in forensic investigations. Estimation of stature forms a basic domain of the investigation process in unknown and co-mingled human remains in forensic anthropology case work. The objective of the present study was to set up standards for estimation of stature from the foot and its segments in a sub-adult female population. Methods The sample for the study constituted 149 young females from the Northern part of India. The participants were aged between 13 and 18 years. Besides stature, seven anthropometric measurements that included length of the foot from each toe (T1, T2, T3, T4, and T5 respectively), foot breadth at ball (BBAL) and foot breadth at heel (BHEL) were measured on both feet in each participant using standard methods and techniques. Results The results indicated that statistically significant differences (p < 0.05) between left and right feet occur in both the foot breadth measurements (BBAL and BHEL). Foot length measurements (T1 to T5 lengths) did not show any statistically significant bilateral asymmetry. The correlation between stature and all the foot measurements was found to be positive and statistically significant (p-value < 0.001). Linear regression models and multiple regression models were derived for estimation of stature from the measurements of the foot. The present study indicates that anthropometric measurements of foot and its segments are valuable in the estimation of stature. Foot length measurements estimate stature with greater accuracy when compared to foot breadth measurements. Conclusions The present study concluded that foot measurements have a strong relationship with stature in the sub-adult female population of North India. Hence, the stature of an individual can be successfully estimated from the foot and its segments using different regression models derived in the study. The regression models derived in the study may be applied successfully for the estimation of stature in sub-adult females, whenever foot remains are brought for forensic examination. Stepwise multiple regression models tend to estimate stature more accurately than linear regression models in female sub-adults. PMID:22104433
Krishan, Kewal; Kanchan, Tanuj; Passi, Neelam
2011-11-21
Establishing personal identity is one of the main concerns in forensic investigations. Estimation of stature forms a basic domain of the investigation process in unknown and co-mingled human remains in forensic anthropology case work. The objective of the present study was to set up standards for estimation of stature from the foot and its segments in a sub-adult female population. The sample for the study constituted 149 young females from the Northern part of India. The participants were aged between 13 and 18 years. Besides stature, seven anthropometric measurements that included length of the foot from each toe (T1, T2, T3, T4, and T5 respectively), foot breadth at ball (BBAL) and foot breadth at heel (BHEL) were measured on both feet in each participant using standard methods and techniques. The results indicated that statistically significant differences (p < 0.05) between left and right feet occur in both the foot breadth measurements (BBAL and BHEL). Foot length measurements (T1 to T5 lengths) did not show any statistically significant bilateral asymmetry. The correlation between stature and all the foot measurements was found to be positive and statistically significant (p-value < 0.001). Linear regression models and multiple regression models were derived for estimation of stature from the measurements of the foot. The present study indicates that anthropometric measurements of foot and its segments are valuable in the estimation of stature. Foot length measurements estimate stature with greater accuracy when compared to foot breadth measurements. The present study concluded that foot measurements have a strong relationship with stature in the sub-adult female population of North India. Hence, the stature of an individual can be successfully estimated from the foot and its segments using different regression models derived in the study. The regression models derived in the study may be applied successfully for the estimation of stature in sub-adult females, whenever foot remains are brought for forensic examination. Stepwise multiple regression models tend to estimate stature more accurately than linear regression models in female sub-adults.
Development of quantitative screen for 1550 chemicals with GC-MS.
Bergmann, Alan J; Points, Gary L; Scott, Richard P; Wilson, Glenn; Anderson, Kim A
2018-05-01
With hundreds of thousands of chemicals in the environment, effective monitoring requires high-throughput analytical techniques. This paper presents a quantitative screening method for 1550 chemicals based on statistical modeling of responses with identification and integration performed using deconvolution reporting software. The method was evaluated with representative environmental samples. We tested biological extracts, low-density polyethylene, and silicone passive sampling devices spiked with known concentrations of 196 representative chemicals. A multiple linear regression (R 2 = 0.80) was developed with molecular weight, logP, polar surface area, and fractional ion abundance to predict chemical responses within a factor of 2.5. Linearity beyond the calibration had R 2 > 0.97 for three orders of magnitude. Median limits of quantitation were estimated to be 201 pg/μL (1.9× standard deviation). The number of detected chemicals and the accuracy of quantitation were similar for environmental samples and standard solutions. To our knowledge, this is the most precise method for the largest number of semi-volatile organic chemicals lacking authentic standards. Accessible instrumentation and software make this method cost effective in quantifying a large, customizable list of chemicals. When paired with silicone wristband passive samplers, this quantitative screen will be very useful for epidemiology where binning of concentrations is common. Graphical abstract A multiple linear regression of chemical responses measured with GC-MS allowed quantitation of 1550 chemicals in samples such as silicone wristbands.
Musuku, Adrien; Tan, Aimin; Awaiye, Kayode; Trabelsi, Fethi
2013-09-01
Linear calibration is usually performed using eight to ten calibration concentration levels in regulated LC-MS bioanalysis because a minimum of six are specified in regulatory guidelines. However, we have previously reported that two-concentration linear calibration is as reliable as or even better than using multiple concentrations. The purpose of this research is to compare two-concentration with multiple-concentration linear calibration through retrospective data analysis of multiple bioanalytical projects that were conducted in an independent regulated bioanalytical laboratory. A total of 12 bioanalytical projects were randomly selected: two validations and two studies for each of the three most commonly used types of sample extraction methods (protein precipitation, liquid-liquid extraction, solid-phase extraction). When the existing data were retrospectively linearly regressed using only the lowest and the highest concentration levels, no extra batch failure/QC rejection was observed and the differences in accuracy and precision between the original multi-concentration regression and the new two-concentration linear regression are negligible. Specifically, the differences in overall mean apparent bias (square root of mean individual bias squares) are within the ranges of -0.3% to 0.7% and 0.1-0.7% for the validations and studies, respectively. The differences in mean QC concentrations are within the ranges of -0.6% to 1.8% and -0.8% to 2.5% for the validations and studies, respectively. The differences in %CV are within the ranges of -0.7% to 0.9% and -0.3% to 0.6% for the validations and studies, respectively. The average differences in study sample concentrations are within the range of -0.8% to 2.3%. With two-concentration linear regression, an average of 13% of time and cost could have been saved for each batch together with 53% of saving in the lead-in for each project (the preparation of working standard solutions, spiking, and aliquoting). Furthermore, examples are given as how to evaluate the linearity over the entire concentration range when only two concentration levels are used for linear regression. To conclude, two-concentration linear regression is accurate and robust enough for routine use in regulated LC-MS bioanalysis and it significantly saves time and cost as well. Copyright © 2013 Elsevier B.V. All rights reserved.
The effects of normal aging on multiple aspects of financial decision-making.
Bangma, Dorien F; Fuermaier, Anselm B M; Tucha, Lara; Tucha, Oliver; Koerts, Janneke
2017-01-01
Financial decision-making (FDM) is crucial for independent living. Due to cognitive decline that accompanies normal aging, older adults might have difficulties in some aspects of FDM. However, an improved knowledge, personal experience and affective decision-making, which are also related to normal aging, may lead to a stable or even improved age-related performance in some other aspects of FDM. Therefore, the present explorative study examines the effects of normal aging on multiple aspects of FDM. One-hundred and eighty participants (range 18-87 years) were assessed with eight FDM tests and several standard neuropsychological tests. Age effects were evaluated using hierarchical multiple regression analyses. The validity of the prediction models was examined by internal validation (i.e. bootstrap resampling procedure) as well as external validation on another, independent, sample of participants (n = 124). Multiple regression and correlation analyses were applied to investigate the mediation effect of standard measures of cognition on the observed effects of age on FDM. On a relatively basic level of FDM (e.g., paying bills or using FDM styles) no significant effects of aging were found. However more complex FDM, such as making decisions in accordance with specific rules, becomes more difficult with advancing age. Furthermore, an older age was found to be related to a decreased sensitivity for impulsive buying. These results were confirmed by the internal and external validation analyses. Mediation effects of numeracy and planning were found to explain parts of the association between one aspect of FDM (i.e. Competence in decision rules) and age; however, these cognitive domains were not able to completely explain the relation between age and FDM. Normal aging has a negative influence on a complex aspect of FDM, however, other aspects appear to be unaffected by normal aging or improve.
ERIC Educational Resources Information Center
Federal Trade Commission, Washington, DC. Bureau of Consumer Protection.
The effect of commercial coaching on Scholastic Aptitude Test (SAT) scores was analyzed, using 1974-1977 test results of 2,500 non-coached students and 1,568 enrollees in two coaching schools. (The Stanley H. Kaplan Educational Center, Inc., and the Test Preparation Center, Inc.). Multiple regression analysis was used to control for student…
An analysis of collegiate band directors' exposure to sound pressure levels
NASA Astrophysics Data System (ADS)
Roebuck, Nikole Moore
Noise-induced hearing loss (NIHL) is a significant but unfortunate common occupational hazard. The purpose of the current study was to measure the magnitude of sound pressure levels generated within a collegiate band room and determine if those sound pressure levels are of a magnitude that exceeds the policy standards and recommendations of the Occupational Safety and Health Administration (OSHA), and the National Institute of Occupational Safety and Health (NIOSH). In addition, reverberation times were measured and analyzed in order to determine the appropriateness of acoustical conditions for the band rehearsal environment. Sound pressure measurements were taken from the rehearsal of seven collegiate marching bands. Single sample t test were conducted to compare the sound pressure levels of all bands to the noise exposure standards of OSHA and NIOSH. Multiple regression analysis were conducted and analyzed in order to determine the effect of the band room's conditions on the sound pressure levels and reverberation times. Time weighted averages (TWA), noise percentage doses, and peak levels were also collected. The mean Leq for all band directors was 90.5 dBA. The total accumulated noise percentage dose for all band directors was 77.6% of the maximum allowable daily noise dose under the OSHA standard. The total calculated TWA for all band directors was 88.2% of the maximum allowable daily noise dose under the OSHA standard. The total accumulated noise percentage dose for all band directors was 152.1% of the maximum allowable daily noise dose under the NIOSH standards, and the total calculated TWA for all band directors was 93dBA of the maximum allowable daily noise dose under the NIOSH standard. Multiple regression analysis revealed that the room volume, the level of acoustical treatment and the mean room reverberation time predicted 80% of the variance in sound pressure levels in this study.
Standards for Standardized Logistic Regression Coefficients
ERIC Educational Resources Information Center
Menard, Scott
2011-01-01
Standardized coefficients in logistic regression analysis have the same utility as standardized coefficients in linear regression analysis. Although there has been no consensus on the best way to construct standardized logistic regression coefficients, there is now sufficient evidence to suggest a single best approach to the construction of a…
Age estimation standards for a Western Australian population using the coronal pulp cavity index.
Karkhanis, Shalmira; Mack, Peter; Franklin, Daniel
2013-09-10
Age estimation is a vital aspect in creating a biological profile and aids investigators by narrowing down potentially matching identities from the available pool. In addition to routine casework, in the present global political scenario, age estimation in living individuals is required in cases of refugees, asylum seekers, human trafficking and to ascertain age of criminal responsibility. Thus robust methods that are simple, non-invasive and ethically viable are required. The aim of the present study is, therefore, to test the reliability and applicability of the coronal pulp cavity index method, for the purpose of developing age estimation standards for an adult Western Australian population. A total of 450 orthopantomograms (220 females and 230 males) of Australian individuals were analyzed. Crown and coronal pulp chamber heights were measured in the mandibular left and right premolars, and the first and second molars. These measurements were then used to calculate the tooth coronal index. Data was analyzed using paired sample t-tests to assess bilateral asymmetry followed by simple linear and multiple regressions to develop age estimation models. The most accurate age estimation based on simple linear regression model was with mandibular right first molar (SEE ±8.271 years). Multiple regression models improved age prediction accuracy considerably and the most accurate model was with bilateral first and second molars (SEE ±6.692 years). This study represents the first investigation of this method in a Western Australian population and our results indicate that the method is suitable for forensic application. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
Phung, Dung; Huang, Cunrui; Rutherford, Shannon; Chu, Cordia; Wang, Xiaoming; Nguyen, Minh; Nguyen, Nga Huy; Manh, Cuong Do
2015-01-01
The Mekong Delta is highly vulnerable to climate change and a dengue endemic area in Vietnam. This study aims to examine the association between climate factors and dengue incidence and to identify the best climate prediction model for dengue incidence in Can Tho city, the Mekong Delta area in Vietnam. We used three different regression models comprising: standard multiple regression model (SMR), seasonal autoregressive integrated moving average model (SARIMA), and Poisson distributed lag model (PDLM) to examine the association between climate factors and dengue incidence over the period 2003-2010. We validated the models by forecasting dengue cases for the period of January-December, 2011 using the mean absolute percentage error (MAPE). Receiver operating characteristics curves were used to analyze the sensitivity of the forecast of a dengue outbreak. The results indicate that temperature and relative humidity are significantly associated with changes in dengue incidence consistently across the model methods used, but not cumulative rainfall. The Poisson distributed lag model (PDLM) performs the best prediction of dengue incidence for a 6, 9, and 12-month period and diagnosis of an outbreak however the SARIMA model performs a better prediction of dengue incidence for a 3-month period. The simple or standard multiple regression performed highly imprecise prediction of dengue incidence. We recommend a follow-up study to validate the model on a larger scale in the Mekong Delta region and to analyze the possibility of incorporating a climate-based dengue early warning method into the national dengue surveillance system. Copyright © 2014 Elsevier B.V. All rights reserved.
Arsenyev, P A; Trezvov, V V; Saratovskaya, N V
1997-01-01
This work represents a method, which allows to determine phase composition of calcium hydroxylapatite basing on its infrared spectrum. The method uses factor analysis of the spectral data of calibration set of samples to determine minimal number of factors required to reproduce the spectra within experimental error. Multiple linear regression is applied to establish correlation between factor scores of calibration standards and their properties. The regression equations can be used to predict the property value of unknown sample. The regression model was built for determination of beta-tricalcium phosphate content in hydroxylapatite. Statistical estimation of quality of the model was carried out. Application of the factor analysis on spectral data allows to increase accuracy of beta-tricalcium phosphate determination and expand the range of determination towards its less concentration. Reproducibility of results is retained.
Identifying predictors of childhood anaemia in north-east India.
Dey, Sanku; Goswami, Sankar; Dey, Tanujit
2013-12-01
The objective of this study is to examine the factors that influence the occurrence of childhood anaemia in North-East India by exploring dataset of the Reproductive and Child Health-II Survey (RCH-II). The study population consisted of 10,137 children in the age-group of 0-6 year(s) from North-East India to explore the predictors of childhood anaemia by means of different background characteristics, such as place of residence, religion, household standard of living, literacy of mother, total children ever born to a mother, age of mother at marriage. Prevalence of anaemia among children was taken as a polytomous variable. The predicted probabilities of anaemia were established via multinomial logistic regression model. These probabilities provided the degree of assessment of the contribution of predictors in the prevalence of childhood anaemia. The mean haemoglobin concentration in children aged 0-6 year(s) was found to be 11.85 g/dL, with a standard deviation of 5.61 g/dL. The multiple logistic regression analysis showed that rural children were at greater risk of severe (OR = 2.035; p = 0.003) and moderate (OR = 1.23; p = 0.003) anaemia. All types of anaemia (severe, moderate, and mild) were more prevalent among Hindu children (OR = 2.971; p = 0.000), (OR = 1.195; p = 0.010), and (OR = 1.201; p = 0.011) than among children of other religions whereas moderate (OR = 1.406; p = 0.001) and mild (OR = 1.857; p=0.000) anaemia were more prevalent among Muslim children. The fecundity of the mother was found to have significant effect on anaemia. Women with multiple children were prone to greater risk of anaemia. The multiple logistic regression analysis also confirmed that children of literate mothers were comparatively at lesser risk of severe anaemia. Mother's age at marriage had a significant effect on anaemia of their children as well.
Regression analysis of current-status data: an application to breast-feeding.
Grummer-strawn, L M
1993-09-01
"Although techniques for calculating mean survival time from current-status data are well known, their use in multiple regression models is somewhat troublesome. Using data on current breast-feeding behavior, this article considers a number of techniques that have been suggested in the literature, including parametric, nonparametric, and semiparametric models as well as the application of standard schedules. Models are tested in both proportional-odds and proportional-hazards frameworks....I fit [the] models to current status data on breast-feeding from the Demographic and Health Survey (DHS) in six countries: two African (Mali and Ondo State, Nigeria), two Asian (Indonesia and Sri Lanka), and two Latin American (Colombia and Peru)." excerpt
Use of streamflow data to estimate base flowground-water recharge for Wisconsin
Gebert, W.A.; Radloff, M.J.; Considine, E.J.; Kennedy, J.L.
2007-01-01
The average annual base flow/recharge was determined for streamflow-gaging stations throughout Wisconsin by base-flow separation. A map of the State was prepared that shows the average annual base flow for the period 1970-99 for watersheds at 118 gaging stations. Trend analysis was performed on 22 of the 118 streamflow-gaging stations that had long-term records, unregulated flow, and provided aerial coverage of the State. The analysis found that a statistically significant increasing trend was occurring for watersheds where the primary land use was agriculture. Most gaging stations where the land cover was forest had no significant trend. A method to estimate the average annual base flow at ungaged sites was developed by multiple-regression analysis using basin characteristics. The equation with the lowest standard error of estimate, 9.5%, has drainage area, soil infiltration and base flow factor as independent variables. To determine the average annual base flow for smaller watersheds, estimates were made at low-flow partial-record stations in 3 of the 12 major river basins in Wisconsin. Regression equations were developed for each of the three major river basins using basin characteristics. Drainage area, soil infiltration, basin storage and base-flow factor were the independent variables in the regression equations with the lowest standard error of estimate. The standard error of estimate ranged from 17% to 52% for the three river basins. ?? 2007 American Water Resources Association.
Mini vs standard percutaneous nephrolithotomy for renal stones: a comparative study.
ElSheemy, Mohammed S; Elmarakbi, Akram A; Hytham, Mohammed; Ibrahim, Hamdy; Khadgi, Sanjay; Al-Kandari, Ahmed M
2018-03-16
To compare the outcome of mini-percutaneous nephrolithotomy (Mini-PNL) versus standard-PNL for renal stones. Retrospective study was performed between March 2010 and May 2013 for patients treated by Mini-PNL or standard-PNL through 18 and 30 Fr tracts, respectively, using pneumatic lithotripsy. Semirigid ureteroscope (8.5/11.5 Fr) was used for Mini-PNL and 24 Fr nephroscope for standard-PNL. Both groups were compared in stone free rate(SFR), complications and operative time using Student-t, Mann-Whitney, Chi square or Fisher's exact tests as appropriate in addition to logistic regression analysis. P < 0.05 was considered statistically significant. Mini-PNL (378) and standard-PNL (151) were nearly comparable in patients and stones criteria including stone burden (3.77 ± 2.21 vs 3.77 ± 2.43 cm 2 ; respectively). There was no significant difference in number of tracts or supracostal puncture. Mini-PNL had longer operative time (68.6 ± 29.09 vs 60.49 ± 11.38 min; p = 0.434), significantly shorter hospital stay (2.43 ± 1.46 vs 4.29 ± 1.28 days) and significantly higher rate of tubeless PNL (75.1 vs 4.6%). Complications were significantly higher in standard-PNL (7.9 vs 20.5%; p < 0.001). SFR was significantly lower in Mini-PNL (89.9 vs 96%; p = 0.022). This significant difference was found with multiple stones and large stone burden (> 2 cm 2 ), but the SFR was comparable between both groups with single stone or stone burden ≤ 2 cm. Logistic regression analysis confirmed significantly higher complications and SFR with standard-PNL but with significantly shorter operative time. Mini-PNL has significantly lower SFR when compared to standard-PNL (but clinically comparable) with markedly reduced complications and hospital stay. Most of cases can be performed tubeless. The significant difference in SFR was found with multiple stones or large stone burden (> 2 cm 2 ), but not with single stones or stone burden ≤ 2 cm 2 .
Shen, Minxue; Tan, Hongzhuan; Zhou, Shujin; Retnakaran, Ravi; Smith, Graeme N.; Davidge, Sandra T.; Trasler, Jacquetta; Walker, Mark C.; Wen, Shi Wu
2016-01-01
Background It has been reported that higher folate intake from food and supplementation is associated with decreased blood pressure (BP). The association between serum folate concentration and BP has been examined in few studies. We aim to examine the association between serum folate and BP levels in a cohort of young Chinese women. Methods We used the baseline data from a pre-conception cohort of women of childbearing age in Liuyang, China, for this study. Demographic data were collected by structured interview. Serum folate concentration was measured by immunoassay, and homocysteine, blood glucose, triglyceride and total cholesterol were measured through standardized clinical procedures. Multiple linear regression and principal component regression model were applied in the analysis. Results A total of 1,532 healthy normotensive non-pregnant women were included in the final analysis. The mean concentration of serum folate was 7.5 ± 5.4 nmol/L and 55% of the women presented with folate deficiency (< 6.8 nmol/L). Multiple linear regression and principal component regression showed that serum folate levels were inversely associated with systolic and diastolic BP, after adjusting for demographic, anthropometric, and biochemical factors. Conclusions Serum folate is inversely associated with BP in non-pregnant women of childbearing age with high prevalence of folate deficiency. PMID:27182603
NASA Technical Reports Server (NTRS)
Allord, G. J. (Principal Investigator); Scarpace, F. L.
1981-01-01
Estimates of low flow and flood frequency in several southwestern Wisconsin basins were improved by determining land cover from LANDSAT imagery. With the use of estimates of land cover in multiple-regression techniques, the standard error of estimate (SE) for the least annual 7-day low flow for 2- and 10-year recurrence intervals of ungaged sites were lowered by 9% each. The SE of flood frequency in the 'Driftless Area' of Wisconsin for 10-, 50-, and 100-year recurrence intervals were lowered by 14%. Four of nine basin characteristics determined from satellite imagery were significant variables in the multiple-regression techniques, whereas only 1 of the 12 characteristics determined from topographic maps was significant. The percentages of land cover categories in each basin were determined by merging basin boundaries, digitized from quadrangles, with a classified LANDSAT scene. Both the basin boundary X-Y polygon coordinates and the satellite coordinates were converted to latitude-longitude for merging compatibility.
Multivariate meta-analysis for non-linear and other multi-parameter associations
Gasparrini, A; Armstrong, B; Kenward, M G
2012-01-01
In this paper, we formalize the application of multivariate meta-analysis and meta-regression to synthesize estimates of multi-parameter associations obtained from different studies. This modelling approach extends the standard two-stage analysis used to combine results across different sub-groups or populations. The most straightforward application is for the meta-analysis of non-linear relationships, described for example by regression coefficients of splines or other functions, but the methodology easily generalizes to any setting where complex associations are described by multiple correlated parameters. The modelling framework of multivariate meta-analysis is implemented in the package mvmeta within the statistical environment R. As an illustrative example, we propose a two-stage analysis for investigating the non-linear exposure–response relationship between temperature and non-accidental mortality using time-series data from multiple cities. Multivariate meta-analysis represents a useful analytical tool for studying complex associations through a two-stage procedure. Copyright © 2012 John Wiley & Sons, Ltd. PMID:22807043
Demidenko, Eugene
2017-09-01
The exact density distribution of the nonlinear least squares estimator in the one-parameter regression model is derived in closed form and expressed through the cumulative distribution function of the standard normal variable. Several proposals to generalize this result are discussed. The exact density is extended to the estimating equation (EE) approach and the nonlinear regression with an arbitrary number of linear parameters and one intrinsically nonlinear parameter. For a very special nonlinear regression model, the derived density coincides with the distribution of the ratio of two normally distributed random variables previously obtained by Fieller (1932), unlike other approximations previously suggested by other authors. Approximations to the density of the EE estimators are discussed in the multivariate case. Numerical complications associated with the nonlinear least squares are illustrated, such as nonexistence and/or multiple solutions, as major factors contributing to poor density approximation. The nonlinear Markov-Gauss theorem is formulated based on the near exact EE density approximation.
Multiple Correlation versus Multiple Regression.
ERIC Educational Resources Information Center
Huberty, Carl J.
2003-01-01
Describes differences between multiple correlation analysis (MCA) and multiple regression analysis (MRA), showing how these approaches involve different research questions and study designs, different inferential approaches, different analysis strategies, and different reported information. (SLD)
Francoeur, Richard B
2015-01-01
Background The majority of patients with advanced cancer experience symptom pairs or clusters among pain, fatigue, and insomnia. Improved methods are needed to detect and interpret interactions among symptoms or diesease markers to reveal influential pairs or clusters. In prior work, I developed and validated sequential residual centering (SRC), a method that improves the sensitivity of multiple regression to detect interactions among predictors, by conditioning for multicollinearity (shared variation) among interactions and component predictors. Materials and methods Using a hypothetical three-way interaction among pain, fatigue, and sleep to predict depressive affect, I derive and explain SRC multiple regression. Subsequently, I estimate raw and SRC multiple regressions using real data for these symptoms from 268 palliative radiation outpatients. Results Unlike raw regression, SRC reveals that the three-way interaction (pain × fatigue/weakness × sleep problems) is statistically significant. In follow-up analyses, the relationship between pain and depressive affect is aggravated (magnified) within two partial ranges: 1) complete-to-some control over fatigue/weakness when there is complete control over sleep problems (ie, a subset of the pain–fatigue/weakness symptom pair), and 2) no control over fatigue/weakness when there is some-to-no control over sleep problems (ie, a subset of the pain–fatigue/weakness–sleep problems symptom cluster). Otherwise, the relationship weakens (buffering) as control over fatigue/weakness or sleep problems diminishes. Conclusion By reducing the standard error, SRC unmasks a three-way interaction comprising a symptom pair and cluster. Low-to-moderate levels of the moderator variable for fatigue/weakness magnify the relationship between pain and depressive affect. However, when the comoderator variable for sleep problems accompanies fatigue/weakness, only frequent or unrelenting levels of both symptoms magnify the relationship. These findings suggest that a countervailing mechanism involving depressive affect could account for the effectiveness of a cognitive behavioral intervention to reduce the severity of a pain, fatigue, and sleep disturbance cluster in a previous randomized trial. PMID:25565865
Francoeur, Richard B
2015-01-01
The majority of patients with advanced cancer experience symptom pairs or clusters among pain, fatigue, and insomnia. Improved methods are needed to detect and interpret interactions among symptoms or diesease markers to reveal influential pairs or clusters. In prior work, I developed and validated sequential residual centering (SRC), a method that improves the sensitivity of multiple regression to detect interactions among predictors, by conditioning for multicollinearity (shared variation) among interactions and component predictors. Using a hypothetical three-way interaction among pain, fatigue, and sleep to predict depressive affect, I derive and explain SRC multiple regression. Subsequently, I estimate raw and SRC multiple regressions using real data for these symptoms from 268 palliative radiation outpatients. Unlike raw regression, SRC reveals that the three-way interaction (pain × fatigue/weakness × sleep problems) is statistically significant. In follow-up analyses, the relationship between pain and depressive affect is aggravated (magnified) within two partial ranges: 1) complete-to-some control over fatigue/weakness when there is complete control over sleep problems (ie, a subset of the pain-fatigue/weakness symptom pair), and 2) no control over fatigue/weakness when there is some-to-no control over sleep problems (ie, a subset of the pain-fatigue/weakness-sleep problems symptom cluster). Otherwise, the relationship weakens (buffering) as control over fatigue/weakness or sleep problems diminishes. By reducing the standard error, SRC unmasks a three-way interaction comprising a symptom pair and cluster. Low-to-moderate levels of the moderator variable for fatigue/weakness magnify the relationship between pain and depressive affect. However, when the comoderator variable for sleep problems accompanies fatigue/weakness, only frequent or unrelenting levels of both symptoms magnify the relationship. These findings suggest that a countervailing mechanism involving depressive affect could account for the effectiveness of a cognitive behavioral intervention to reduce the severity of a pain, fatigue, and sleep disturbance cluster in a previous randomized trial.
NASA Astrophysics Data System (ADS)
Keat, Sim Chong; Chun, Beh Boon; San, Lim Hwee; Jafri, Mohd Zubir Mat
2015-04-01
Climate change due to carbon dioxide (CO2) emissions is one of the most complex challenges threatening our planet. This issue considered as a great and international concern that primary attributed from different fossil fuels. In this paper, regression model is used for analyzing the causal relationship among CO2 emissions based on the energy consumption in Malaysia using time series data for the period of 1980-2010. The equations were developed using regression model based on the eight major sources that contribute to the CO2 emissions such as non energy, Liquefied Petroleum Gas (LPG), diesel, kerosene, refinery gas, Aviation Turbine Fuel (ATF) and Aviation Gasoline (AV Gas), fuel oil and motor petrol. The related data partly used for predict the regression model (1980-2000) and partly used for validate the regression model (2001-2010). The results of the prediction model with the measured data showed a high correlation coefficient (R2=0.9544), indicating the model's accuracy and efficiency. These results are accurate and can be used in early warning of the population to comply with air quality standards.
ERIC Educational Resources Information Center
Jaccard, James; And Others
1990-01-01
Issues in the detection and interpretation of interaction effects between quantitative variables in multiple regression analysis are discussed. Recent discussions associated with problems of multicollinearity are reviewed in the context of the conditional nature of multiple regression with product terms. (TJH)
Panel regressions to estimate low-flow response to rainfall variability in ungaged basins
Bassiouni, Maoya; Vogel, Richard M.; Archfield, Stacey A.
2016-01-01
Multicollinearity and omitted-variable bias are major limitations to developing multiple linear regression models to estimate streamflow characteristics in ungaged areas and varying rainfall conditions. Panel regression is used to overcome limitations of traditional regression methods, and obtain reliable model coefficients, in particular to understand the elasticity of streamflow to rainfall. Using annual rainfall and selected basin characteristics at 86 gaged streams in the Hawaiian Islands, regional regression models for three stream classes were developed to estimate the annual low-flow duration discharges. Three panel-regression structures (random effects, fixed effects, and pooled) were compared to traditional regression methods, in which space is substituted for time. Results indicated that panel regression generally was able to reproduce the temporal behavior of streamflow and reduce the standard errors of model coefficients compared to traditional regression, even for models in which the unobserved heterogeneity between streams is significant and the variance inflation factor for rainfall is much greater than 10. This is because both spatial and temporal variability were better characterized in panel regression. In a case study, regional rainfall elasticities estimated from panel regressions were applied to ungaged basins on Maui, using available rainfall projections to estimate plausible changes in surface-water availability and usable stream habitat for native species. The presented panel-regression framework is shown to offer benefits over existing traditional hydrologic regression methods for developing robust regional relations to investigate streamflow response in a changing climate.
Panel regressions to estimate low-flow response to rainfall variability in ungaged basins
NASA Astrophysics Data System (ADS)
Bassiouni, Maoya; Vogel, Richard M.; Archfield, Stacey A.
2016-12-01
Multicollinearity and omitted-variable bias are major limitations to developing multiple linear regression models to estimate streamflow characteristics in ungaged areas and varying rainfall conditions. Panel regression is used to overcome limitations of traditional regression methods, and obtain reliable model coefficients, in particular to understand the elasticity of streamflow to rainfall. Using annual rainfall and selected basin characteristics at 86 gaged streams in the Hawaiian Islands, regional regression models for three stream classes were developed to estimate the annual low-flow duration discharges. Three panel-regression structures (random effects, fixed effects, and pooled) were compared to traditional regression methods, in which space is substituted for time. Results indicated that panel regression generally was able to reproduce the temporal behavior of streamflow and reduce the standard errors of model coefficients compared to traditional regression, even for models in which the unobserved heterogeneity between streams is significant and the variance inflation factor for rainfall is much greater than 10. This is because both spatial and temporal variability were better characterized in panel regression. In a case study, regional rainfall elasticities estimated from panel regressions were applied to ungaged basins on Maui, using available rainfall projections to estimate plausible changes in surface-water availability and usable stream habitat for native species. The presented panel-regression framework is shown to offer benefits over existing traditional hydrologic regression methods for developing robust regional relations to investigate streamflow response in a changing climate.
Beyond Multiple Regression: Using Commonality Analysis to Better Understand R[superscript 2] Results
ERIC Educational Resources Information Center
Warne, Russell T.
2011-01-01
Multiple regression is one of the most common statistical methods used in quantitative educational research. Despite the versatility and easy interpretability of multiple regression, it has some shortcomings in the detection of suppressor variables and for somewhat arbitrarily assigning values to the structure coefficients of correlated…
Peak oxygen consumption measured during the stair-climbing test in lung resection candidates.
Brunelli, Alessandro; Xiumé, Francesco; Refai, Majed; Salati, Michele; Di Nunzio, Luca; Pompili, Cecilia; Sabbatini, Armando
2010-01-01
The stair-climbing test is commonly used in the preoperative evaluation of lung resection candidates, but it is difficult to standardize and provides little physiologic information on the performance. To verify the association between the altitude and the V(O2peak) measured during the stair-climbing test. 109 consecutive candidates for lung resection performed a symptom-limited stair-climbing test with direct breath-by-breath measurement of V(O2peak) by a portable gas analyzer. Stepwise logistic regression and bootstrap analyses were used to verify the association of several perioperative variables with a V(O2peak) <15 ml/kg/min. Subsequently, multiple regression analysis was also performed to develop an equation to estimate V(O2peak) from stair-climbing parameters and other patient-related variables. 56% of patients climbing <14 m had a V(O2peak) <15 ml/kg/min, whereas 98% of those climbing >22 m had a V(O2peak) >15 ml/kg/min. The altitude reached at stair-climbing test resulted in the only significant predictor of a V(O2peak) <15 ml/kg/min after logistic regression analysis. Multiple regression analysis yielded an equation to estimate V(O2peak) factoring altitude (p < 0.0001), speed of ascent (p = 0.005) and body mass index (p = 0.0008). There was an association between altitude and V(O2peak) measured during the stair-climbing test. Most of the patients climbing more than 22 m are able to generate high values of V(O2peak) and can proceed to surgery without any additional tests. All others need to be referred for a formal cardiopulmonary exercise test. In addition, we were able to generate an equation to estimate V(O2peak), which could assist in streamlining the preoperative workup and could be used across different settings to standardize this test. Copyright (c) 2010 S. Karger AG, Basel.
Enns, Murray W; Bernstein, Charles N; Kroeker, Kristine; Graff, Lesley; Walker, John R; Lix, Lisa M; Hitchon, Carol A; El-Gabalawy, Renée; Fisk, John D; Marrie, Ruth Ann
2018-01-01
Impairment in work function is a frequent outcome in patients with chronic conditions such as immune-mediated inflammatory diseases (IMID), depression and anxiety disorders. The personal and economic costs of work impairment in these disorders are immense. Symptoms of pain, fatigue, depression and anxiety are potentially remediable forms of distress that may contribute to work impairment in chronic health conditions such as IMID. The present study evaluated the association between pain [Medical Outcomes Study Pain Effects Scale], fatigue [Daily Fatigue Impact Scale], depression and anxiety [Hospital Anxiety and Depression Scale] and work impairment [Work Productivity and Activity Impairment Scale] in four patient populations: multiple sclerosis (n = 255), inflammatory bowel disease (n = 248, rheumatoid arthritis (n = 154) and a depression and anxiety group (n = 307), using quantile regression, controlling for the effects of sociodemographic factors, physical disability, and cognitive deficits. Each of pain, depression symptoms, anxiety symptoms, and fatigue individually showed significant associations with work absenteeism, presenteeism, and general activity impairment (quantile regression standardized estimates ranging from 0.3 to 1.0). When the distress variables were entered concurrently into the regression models, fatigue was a significant predictor of work and activity impairment in all models (quantile regression standardized estimates ranging from 0.2 to 0.5). These findings have important clinical implications for understanding the determinants of work impairment and for improving work-related outcomes in chronic disease.
NASA Astrophysics Data System (ADS)
Yang, Peng; Xia, Jun; Zhang, Yongyong; Han, Jian; Wu, Xia
2017-11-01
Because drought is a very common and widespread natural disaster, it has attracted a great deal of academic interest. Based on 12-month time scale standardized precipitation indices (SPI12) calculated from precipitation data recorded between 1960 and 2015 at 22 weather stations in the Tarim River Basin (TRB), this study aims to identify the trends of SPI and drought duration, severity, and frequency at various quantiles and to perform cluster analysis of drought events in the TRB. The results indicated that (1) both precipitation and temperature at most stations in the TRB exhibited significant positive trends during 1960-2015; (2) multiple scales of SPIs changed significantly around 1986; (3) based on quantile regression analysis of temporal drought changes, the positive SPI slopes indicated less severe and less frequent droughts at lower quantiles, but clear variation was detected in the drought frequency; and (4) significantly different trends were found in drought frequency probably between severe droughts and drought frequency.
NASA Astrophysics Data System (ADS)
Di, Nur Faraidah Muhammad; Satari, Siti Zanariah
2017-05-01
Outlier detection in linear data sets has been done vigorously but only a small amount of work has been done for outlier detection in circular data. In this study, we proposed multiple outliers detection in circular regression models based on the clustering algorithm. Clustering technique basically utilizes distance measure to define distance between various data points. Here, we introduce the similarity distance based on Euclidean distance for circular model and obtain a cluster tree using the single linkage clustering algorithm. Then, a stopping rule for the cluster tree based on the mean direction and circular standard deviation of the tree height is proposed. We classify the cluster group that exceeds the stopping rule as potential outlier. Our aim is to demonstrate the effectiveness of proposed algorithms with the similarity distances in detecting the outliers. It is found that the proposed methods are performed well and applicable for circular regression model.
Deconstructing alcohol use on a night out in England: promotions, preloading and consumption.
McClatchley, Kirstie; Shorter, Gillian W; Chalmers, Jenny
2014-07-01
To examine alcohol consumed during a drinking event (a single drinking occasion) by those attending public house/on-trade establishments on nights with standard pricing and nights with promotional prices. Data (n = 425) were collected in an ecological momentary assessment over eight nights in two locations (Midlands and London) on both promotional and standard (Saturday) nights. Multiple regression was used to predict event alcohol consumption by sex, age, type of night, alcohol preloading behaviour, marital and employment status, education, Alcohol Use Disorders Identification Test alcohol consumption questions separately or total AUDIT-C and social group size. Mean (UK) units consumed were 11.8 (London) and 14.4 (Midlands). In London, consumption was similar on promotional and standard nights, but in the Midlands, standard night consumption was three units higher. Preloading was reported by 30%; more common on standard nights. Regression analyses revealed being male, preloading and past-year total AUDIT-C were associated with higher event consumption. However, when AUDIT-C questions were added separately, being a standard night was associated with increased event consumption and different AUDIT-C questions were significantly associated with event consumption in each location. Event consumption reflected heavy episodic drinking and was influenced by price. Promotional night consumption either matched standard Saturday night consumption or was slightly lower. In London, there was a significant preference for drinking at least one promotional beverage on promotional nights. On standard nights, consumption was over a wider range of venues, and preloading with off-trade alcohol was more likely. © 2014 Australasian Professional Society on Alcohol and other Drugs.
Moderation analysis using a two-level regression model.
Yuan, Ke-Hai; Cheng, Ying; Maxwell, Scott
2014-10-01
Moderation analysis is widely used in social and behavioral research. The most commonly used model for moderation analysis is moderated multiple regression (MMR) in which the explanatory variables of the regression model include product terms, and the model is typically estimated by least squares (LS). This paper argues for a two-level regression model in which the regression coefficients of a criterion variable on predictors are further regressed on moderator variables. An algorithm for estimating the parameters of the two-level model by normal-distribution-based maximum likelihood (NML) is developed. Formulas for the standard errors (SEs) of the parameter estimates are provided and studied. Results indicate that, when heteroscedasticity exists, NML with the two-level model gives more efficient and more accurate parameter estimates than the LS analysis of the MMR model. When error variances are homoscedastic, NML with the two-level model leads to essentially the same results as LS with the MMR model. Most importantly, the two-level regression model permits estimating the percentage of variance of each regression coefficient that is due to moderator variables. When applied to data from General Social Surveys 1991, NML with the two-level model identified a significant moderation effect of race on the regression of job prestige on years of education while LS with the MMR model did not. An R package is also developed and documented to facilitate the application of the two-level model.
[High Risk Sex Behaviors and Associated Factors in Young Men in Chengdu].
2015-11-01
To determine the prevalence of high risk sex behaviors and associated factors in 18-34 years old men in Chengdu. Methods An anonymous questionnaire survey was conducted in 18-34 years old men selected by multi-stage random sampling in Chengdu. Data of 1536 respondents who reported having sex contacts were analyzed. 23.6% of respondents had multiple sex partners in the past 12 months; 11.8% were involved commercial sex; 9.0% had group sex; 4. 7% had anal sex; 15.6% had never used a condom; 37.7% had sex under the influence of alcohol or drugs. Logistic regression analysis revealed that marital status [married, standardized partial regression coefficient (B) = -0.086, P<0.05] , level of education (bachelor or above, B= -0.063, P<0.05), frequency of exposure to pornography (B=0.058, P<0.05), childhood sexual abuse (B= 0.042, P<0.05), first sexual intercourse at an earlier age (B=0.162, P<0.05), frequency of sex under the influence of alcohol or drugs (B=0.054, P<0.05) were significant predictors of having multiple sexual partners. Sexual orientation, age, smoking, alcohol abuse, drug use, anxiety, depression, childhood physical abuse did not appear to be significant factors associated with having multiple sexual partners. Having multiple sexual partners is the main high risk sex behavior of young men in Chengdu. Childhood sexual abuse and early start of sexual intercourse are the major predictors of having multiple sexual partners.
Ertefaie, Ashkan; Shortreed, Susan; Chakraborty, Bibhas
2016-06-15
Q-learning is a regression-based approach that uses longitudinal data to construct dynamic treatment regimes, which are sequences of decision rules that use patient information to inform future treatment decisions. An optimal dynamic treatment regime is composed of a sequence of decision rules that indicate how to optimally individualize treatment using the patients' baseline and time-varying characteristics to optimize the final outcome. Constructing optimal dynamic regimes using Q-learning depends heavily on the assumption that regression models at each decision point are correctly specified; yet model checking in the context of Q-learning has been largely overlooked in the current literature. In this article, we show that residual plots obtained from standard Q-learning models may fail to adequately check the quality of the model fit. We present a modified Q-learning procedure that accommodates residual analyses using standard tools. We present simulation studies showing the advantage of the proposed modification over standard Q-learning. We illustrate this new Q-learning approach using data collected from a sequential multiple assignment randomized trial of patients with schizophrenia. Copyright © 2016 John Wiley & Sons, Ltd. Copyright © 2016 John Wiley & Sons, Ltd.
Ahearn, Elizabeth A.
2010-01-01
Multiple linear regression equations for determining flow-duration statistics were developed to estimate select flow exceedances ranging from 25- to 99-percent for six 'bioperiods'-Salmonid Spawning (November), Overwinter (December-February), Habitat Forming (March-April), Clupeid Spawning (May), Resident Spawning (June), and Rearing and Growth (July-October)-in Connecticut. Regression equations also were developed to estimate the 25- and 99-percent flow exceedances without reference to a bioperiod. In total, 32 equations were developed. The predictive equations were based on regression analyses relating flow statistics from streamgages to GIS-determined basin and climatic characteristics for the drainage areas of those streamgages. Thirty-nine streamgages (and an additional 6 short-term streamgages and 28 partial-record sites for the non-bioperiod 99-percent exceedance) in Connecticut and adjacent areas of neighboring States were used in the regression analysis. Weighted least squares regression analysis was used to determine the predictive equations; weights were assigned based on record length. The basin characteristics-drainage area, percentage of area with coarse-grained stratified deposits, percentage of area with wetlands, mean monthly precipitation (November), mean seasonal precipitation (December, January, and February), and mean basin elevation-are used as explanatory variables in the equations. Standard errors of estimate of the 32 equations ranged from 10.7 to 156 percent with medians of 19.2 and 55.4 percent to predict the 25- and 99-percent exceedances, respectively. Regression equations to estimate high and median flows (25- to 75-percent exceedances) are better predictors (smaller variability of the residual values around the regression line) than the equations to estimate low flows (less than 75-percent exceedance). The Habitat Forming (March-April) bioperiod had the smallest standard errors of estimate, ranging from 10.7 to 20.9 percent. In contrast, the Rearing and Growth (July-October) bioperiod had the largest standard errors, ranging from 30.9 to 156 percent. The adjusted coefficient of determination of the equations ranged from 77.5 to 99.4 percent with medians of 98.5 and 90.6 percent to predict the 25- and 99-percent exceedances, respectively. Descriptive information on the streamgages used in the regression, measured basin and climatic characteristics, and estimated flow-duration statistics are provided in this report. Flow-duration statistics and the 32 regression equations for estimating flow-duration statistics in Connecticut are stored on the U.S. Geological Survey World Wide Web application ?StreamStats? (http://water.usgs.gov/osw/streamstats/index.html). The regression equations developed in this report can be used to produce unbiased estimates of select flow exceedances statewide.
Suzuki, Hideaki; Tabata, Takahisa; Koizumi, Hiroki; Hohchi, Nobusuke; Takeuchi, Shoko; Kitamura, Takuro; Fujino, Yoshihisa; Ohbuchi, Toyoaki
2014-12-01
This study aimed to create a multiple regression model for predicting hearing outcomes of idiopathic sudden sensorineural hearing loss (ISSNHL). The participants were 205 consecutive patients (205 ears) with ISSNHL (hearing level ≥ 40 dB, interval between onset and treatment ≤ 30 days). They received systemic steroid administration combined with intratympanic steroid injection. Data were examined by simple and multiple regression analyses. Three hearing indices (percentage hearing improvement, hearing gain, and posttreatment hearing level [HLpost]) and 7 prognostic factors (age, days from onset to treatment, initial hearing level, initial hearing level at low frequencies, initial hearing level at high frequencies, presence of vertigo, and contralateral hearing level) were included in the multiple regression analysis as dependent and explanatory variables, respectively. In the simple regression analysis, the percentage hearing improvement, hearing gain, and HLpost showed significant correlation with 2, 5, and 6 of the 7 prognostic factors, respectively. The multiple correlation coefficients were 0.396, 0.503, and 0.714 for the percentage hearing improvement, hearing gain, and HLpost, respectively. Predicted values of HLpost calculated by the multiple regression equation were reliable with 70% probability with a 40-dB-width prediction interval. Prediction of HLpost by the multiple regression model may be useful to estimate the hearing prognosis of ISSNHL. © The Author(s) 2014.
Chen, Gang; Wu, Yulian; Wang, Tao; Liang, Jixing; Lin, Wei; Li, Liantao; Wen, Junping; Lin, Lixiang; Huang, Huibin
2012-10-01
The role of the endogenous secretory receptor for advanced glycation end products (esRAGE) in depression of diabetes patients and its clinical significance are unclear. This study investigated the role of serum esRAGE in patients with type 2 diabetes mellitus with depression in the Chinese population. One hundred nineteen hospitalized patients with type 2 diabetes were recruited at Fujian Provincial Hospital (Fuzhou, China) from February 2010 to January 2011. All selected subjects were assessed with the Hamilton Rating Scale for Depression (HAMD). Among them, 71 patients with both type 2 diabetes and depression were included. All selected subjects were examined for the following: esRAGE concentration, glycosylated hemoglobin (HbA1c), blood lipids, C-reactive protein, trace of albumin in urine, and carotid artery intima-media thickness (IMT). Association between serum esRAGE levels and risk of type 2 diabetes mellitus with depression was also analyzed. There were statistically significant differences in gender, age, body mass index, waist circumference, and treatment methods between the group with depression and the group without depression (P<0.05). Multiple linear regression analysis showed that HAMD scores were negatively correlated with esRAGE levels (standard regression coefficient -0.270, P<0.01). HAMD-17 scores were positively correlated with IMT (standard regression coefficient 0.183, P<0.05) and with HbA1c (standard regression coefficient 0.314, P<0.01). Female gender, younger age, obesity, poor glycemic control, complications, and insulin therapy are all risk factors of type 2 diabetes mellitus with combined depression in the Chinese population. Inflammation and atherosclerosis play an important role in the pathogenesis of depression. esRAGE is a protective factor of depression among patients who have type 2 diabetes.
ERIC Educational Resources Information Center
Shear, Benjamin R.; Zumbo, Bruno D.
2013-01-01
Type I error rates in multiple regression, and hence the chance for false positive research findings, can be drastically inflated when multiple regression models are used to analyze data that contain random measurement error. This article shows the potential for inflated Type I error rates in commonly encountered scenarios and provides new…
John W. Edwards; Susan C. Loeb; David C. Guynn
1994-01-01
Multiple regression and use-availability analyses are two methods for examining habitat selection. Use-availability analysis is commonly used to evaluate macrohabitat selection whereas multiple regression analysis can be used to determine microhabitat selection. We compared these techniques using behavioral observations (n = 5534) and telemetry locations (n = 2089) of...
Woo, Sungmin; Suh, Chong Hyun; Kim, Sang Youn; Cho, Jeong Yeon; Kim, Seung Hyup
2018-01-01
The purpose of this study was to perform a head-to-head comparison between high-b-value (> 1000 s/mm 2 ) and standard-b-value (800-1000 s/mm 2 ) DWI regarding diagnostic performance in the detection of prostate cancer. The MEDLINE and EMBASE databases were searched up to April 1, 2017. The analysis included diagnostic accuracy studies in which high- and standard-b-value DWI were used for prostate cancer detection with histopathologic examination as the reference standard. Methodologic quality was assessed with the revised Quality Assessment of Diagnostic Accuracy Studies tool. Sensitivity and specificity of all studies were calculated and were pooled and plotted in a hierarchic summary ROC plot. Meta-regression and multiple-subgroup analyses were performed to compare the diagnostic performances of high- and standard-b-value DWI. Eleven studies (789 patients) were included. High-b-value DWI had greater pooled sensitivity (0.80 [95% CI, 0.70-0.87]) (p = 0.03) and specificity (0.92 [95% CI, 0.87-0.95]) (p = 0.01) than standard-b-value DWI (sensitivity, 0.78 [95% CI, 0.66-0.86]); specificity, 0.87 [95% CI, 0.77-0.93] (p < 0.01). Multiple-subgroup analyses showed that specificity was consistently higher for high- than for standard-b-value DWI (p ≤ 0.05). Sensitivity was significantly higher for high- than for standard-b-value DWI only in the following subgroups: peripheral zone only, transition zone only, multiparametric protocol (DWI and T2-weighted imaging), visual assessment of DW images, and per-lesion analysis (p ≤ 0.04). In a head-to-head comparison, high-b-value DWI had significantly better sensitivity and specificity for detection of prostate cancer than did standard-b-value DWI. Multiple-subgroup analyses showed that specificity was consistently superior for high-b-value DWI.
Building Regression Models: The Importance of Graphics.
ERIC Educational Resources Information Center
Dunn, Richard
1989-01-01
Points out reasons for using graphical methods to teach simple and multiple regression analysis. Argues that a graphically oriented approach has considerable pedagogic advantages in the exposition of simple and multiple regression. Shows that graphical methods may play a central role in the process of building regression models. (Author/LS)
Testing Different Model Building Procedures Using Multiple Regression.
ERIC Educational Resources Information Center
Thayer, Jerome D.
The stepwise regression method of selecting predictors for computer assisted multiple regression analysis was compared with forward, backward, and best subsets regression, using 16 data sets. The results indicated the stepwise method was preferred because of its practical nature, when the models chosen by different selection methods were similar…
Decreasing Multicollinearity: A Method for Models with Multiplicative Functions.
ERIC Educational Resources Information Center
Smith, Kent W.; Sasaki, M. S.
1979-01-01
A method is proposed for overcoming the problem of multicollinearity in multiple regression equations where multiplicative independent terms are entered. The method is not a ridge regression solution. (JKS)
Geostatistics as a validation tool for setting ozone standards for durum wheat.
De Marco, Alessandra; Screpanti, Augusto; Paoletti, Elena
2010-02-01
Which is the best standard for protecting plants from ozone? To answer this question, we must validate the standards by testing biological responses vs. ambient data in the field. A validation is missing for European and USA standards, because the networks for ozone, meteorology and plant responses are spatially independent. We proposed geostatistics as validation tool, and used durum wheat in central Italy as a test. The standards summarized ozone impact on yield better than hourly averages. Although USA criteria explained ozone-induced yield losses better than European criteria, USA legal level (75 ppb) protected only 39% of sites. European exposure-based standards protected > or =90%. Reducing the USA level to the Canadian 65 ppb or using W126 protected 91% and 97%, respectively. For a no-threshold accumulated stomatal flux, 22 mmol m(-2) was suggested to protect 97% of sites. In a multiple regression, precipitation explained 22% and ozone explained <0.9% of yield variability. Copyright (c) 2009 Elsevier Ltd. All rights reserved.
Multiple-Instance Regression with Structured Data
NASA Technical Reports Server (NTRS)
Wagstaff, Kiri L.; Lane, Terran; Roper, Alex
2008-01-01
We present a multiple-instance regression algorithm that models internal bag structure to identify the items most relevant to the bag labels. Multiple-instance regression (MIR) operates on a set of bags with real-valued labels, each containing a set of unlabeled items, in which the relevance of each item to its bag label is unknown. The goal is to predict the labels of new bags from their contents. Unlike previous MIR methods, MI-ClusterRegress can operate on bags that are structured in that they contain items drawn from a number of distinct (but unknown) distributions. MI-ClusterRegress simultaneously learns a model of the bag's internal structure, the relevance of each item, and a regression model that accurately predicts labels for new bags. We evaluated this approach on the challenging MIR problem of crop yield prediction from remote sensing data. MI-ClusterRegress provided predictions that were more accurate than those obtained with non-multiple-instance approaches or MIR methods that do not model the bag structure.
Self-regulated learning and achievement by middle-school children.
Sink, C A; Barnett, J E; Hixon, J E
1991-12-01
The relationship of self-regulated learning to the achievement test scores of 62 Grade 6 students was studied. Generally, the metacognitive and affective variables correlated significantly with teachers' grades and standardized test scores in mathematics, reading, and science. Planning and self-assessment significantly predicted the six measures of achievement. Step-wise multiple regression analyses using the metacognitive and affective variables largely indicate that students' and teachers' perceptions of scholastic ability and planning appear to be the most salient factors in predicting academic performance. The locus of control dimension had no utility in predicting classroom grades and performance on standardized measures of achievement. The implications of the findings for teaching and learning are discussed.
The effects of normal aging on multiple aspects of financial decision-making
Bangma, Dorien F.; Fuermaier, Anselm B. M.; Tucha, Lara; Tucha, Oliver; Koerts, Janneke
2017-01-01
Objectives Financial decision-making (FDM) is crucial for independent living. Due to cognitive decline that accompanies normal aging, older adults might have difficulties in some aspects of FDM. However, an improved knowledge, personal experience and affective decision-making, which are also related to normal aging, may lead to a stable or even improved age-related performance in some other aspects of FDM. Therefore, the present explorative study examines the effects of normal aging on multiple aspects of FDM. Methods One-hundred and eighty participants (range 18–87 years) were assessed with eight FDM tests and several standard neuropsychological tests. Age effects were evaluated using hierarchical multiple regression analyses. The validity of the prediction models was examined by internal validation (i.e. bootstrap resampling procedure) as well as external validation on another, independent, sample of participants (n = 124). Multiple regression and correlation analyses were applied to investigate the mediation effect of standard measures of cognition on the observed effects of age on FDM. Results On a relatively basic level of FDM (e.g., paying bills or using FDM styles) no significant effects of aging were found. However more complex FDM, such as making decisions in accordance with specific rules, becomes more difficult with advancing age. Furthermore, an older age was found to be related to a decreased sensitivity for impulsive buying. These results were confirmed by the internal and external validation analyses. Mediation effects of numeracy and planning were found to explain parts of the association between one aspect of FDM (i.e. Competence in decision rules) and age; however, these cognitive domains were not able to completely explain the relation between age and FDM. Conclusion Normal aging has a negative influence on a complex aspect of FDM, however, other aspects appear to be unaffected by normal aging or improve. PMID:28792973
Modeling Longitudinal Data Containing Non-Normal Within Subject Errors
NASA Technical Reports Server (NTRS)
Feiveson, Alan; Glenn, Nancy L.
2013-01-01
The mission of the National Aeronautics and Space Administration’s (NASA) human research program is to advance safe human spaceflight. This involves conducting experiments, collecting data, and analyzing data. The data are longitudinal and result from a relatively few number of subjects; typically 10 – 20. A longitudinal study refers to an investigation where participant outcomes and possibly treatments are collected at multiple follow-up times. Standard statistical designs such as mean regression with random effects and mixed–effects regression are inadequate for such data because the population is typically not approximately normally distributed. Hence, more advanced data analysis methods are necessary. This research focuses on four such methods for longitudinal data analysis: the recently proposed linear quantile mixed models (lqmm) by Geraci and Bottai (2013), quantile regression, multilevel mixed–effects linear regression, and robust regression. This research also provides computational algorithms for longitudinal data that scientists can directly use for human spaceflight and other longitudinal data applications, then presents statistical evidence that verifies which method is best for specific situations. This advances the study of longitudinal data in a broad range of applications including applications in the sciences, technology, engineering and mathematics fields.
Inhibitory saccadic dysfunction is associated with cerebellar injury in multiple sclerosis.
Kolbe, Scott C; Kilpatrick, Trevor J; Mitchell, Peter J; White, Owen; Egan, Gary F; Fielding, Joanne
2014-05-01
Cognitive dysfunction is common in patients with multiple sclerosis (MS). Saccadic eye movement paradigms such as antisaccades (AS) can sensitively interrogate cognitive function, in particular, the executive and attentional processes of response selection and inhibition. Although we have previously demonstrated significant deficits in the generation of AS in MS patients, the neuropathological changes underlying these deficits were not elucidated. In this study, 24 patients with relapsing-remitting MS underwent testing using an AS paradigm. Rank correlation and multiple regression analyses were subsequently used to determine whether AS errors in these patients were associated with: (i) neurological and radiological abnormalities, as measured by standard clinical techniques, (ii) cognitive dysfunction, and (iii) regionally specific cerebral white and gray-matter damage. Although AS error rates in MS patients did not correlate with clinical disability (using the Expanded Disability Status Score), T2 lesion load or brain parenchymal fraction, AS error rate did correlate with performance on the Paced Auditory Serial Addition Task and the Symbol Digit Modalities Test, neuropsychological tests commonly used in MS. Further, voxel-wise regression analyses revealed associations between AS errors and reduced fractional anisotropy throughout most of the cerebellum, and increased mean diffusivity in the cerebellar vermis. Region-wise regression analyses confirmed that AS errors also correlated with gray-matter atrophy in the cerebellum right VI subregion. These results support the use of the AS paradigm as a marker for cognitive dysfunction in MS and implicate structural and microstructural changes to the cerebellum as a contributing mechanism for AS deficits in these patients. Copyright © 2013 Wiley Periodicals, Inc.
Impact of divorce on the quality of life in school-age children.
Eymann, Alfredo; Busaniche, Julio; Llera, Julián; De Cunto, Carmen; Wahren, Carlos
2009-01-01
To assess psychosocial quality of life in school-age children of divorced parents. A cross-sectional survey was conducted at the pediatric outpatient clinic of a community hospital. Children 5 to 12 years old from married families and divorced families were included. Child quality of life was assessed through maternal reports using a Child Health Questionnaire-Parent Form 50. A multiple linear regression model was constructed including clinically relevant variables significant on univariate analysis (beta coefficient and 95%CI). Three hundred and thirty families were invited to participate and 313 completed the questionnaire. Univariate analysis showed that quality of life was significantly associated with parental separation, child sex, time spent with the father, standard of living, and maternal education. In a multiple linear regression model, quality of life scores decreased in boys -4.5 (-6.8 to -2.3) and increased for time spent with the father 0.09 (0.01 to 0.2). In divorced families, multiple linear regression showed that quality of life scores increased when parents had separated by mutual agreement 6.1 (2.7 to 9.4), when the mother had university level education 5.9 (1.7 to 10.1) and for each year elapsed since separation 0.6 (0.2 to 1.1), whereas scores decreased in boys -5.4 (-9.5 to -1.3) and for each one-year increment of maternal age -0.4 (-0.7 to -0.05). Children's psychosocial quality of life was affected by divorce. The Child Health Questionnaire can be useful to detect a decline in the psychosocial quality of life.
Bartlett, Jonathan W; Keogh, Ruth H
2018-06-01
Bayesian approaches for handling covariate measurement error are well established and yet arguably are still relatively little used by researchers. For some this is likely due to unfamiliarity or disagreement with the Bayesian inferential paradigm. For others a contributory factor is the inability of standard statistical packages to perform such Bayesian analyses. In this paper, we first give an overview of the Bayesian approach to handling covariate measurement error, and contrast it with regression calibration, arguably the most commonly adopted approach. We then argue why the Bayesian approach has a number of statistical advantages compared to regression calibration and demonstrate that implementing the Bayesian approach is usually quite feasible for the analyst. Next, we describe the closely related maximum likelihood and multiple imputation approaches and explain why we believe the Bayesian approach to generally be preferable. We then empirically compare the frequentist properties of regression calibration and the Bayesian approach through simulation studies. The flexibility of the Bayesian approach to handle both measurement error and missing data is then illustrated through an analysis of data from the Third National Health and Nutrition Examination Survey.
Hydrology and trout populations of cold-water rivers of Michigan and Wisconsin
Hendrickson, G.E.; Knutilla, R.L.
1974-01-01
Statistical multiple-regression analyses showed significant relationships between trout populations and hydrologic parameters. Parameters showing the higher levels of significance were temperature, hardness of water, percentage of gravel bottom, percentage of bottom vegetation, variability of streamflow, and discharge per unit drainage area. Trout populations increase with lower levels of annual maximum water temperatures, with increase in water hardness, and with increase in percentage of gravel and bottom vegetation. Trout populations also increase with decrease in variability of streamflow, and with increase in discharge per unit drainage area. Most hydrologic parameters were significant when evaluated collectively, but no parameter, by itself, showed a high degree of correlation with trout populations in regression analyses that included all the streams sampled. Regression analyses of stream segments that were restricted to certain limits of hardness, temperature, or percentage of gravel bottom showed improvements in correlation. Analyses of trout populations, in pounds per acre and pounds per mile and hydrologic parameters resulted in regression equations from which trout populations could be estimated with standard errors of 89 and 84 per cent, respectively.
Arday, D R; Brundage, J F; Gardner, L I; Goldenbaum, M; Wann, F; Wright, S
1991-06-15
The authors conducted a population-based study to attempt to estimate the effect of human immunodeficiency virus type 1 (HIV-1) seropositivity on Armed Services Vocational Aptitude Battery test scores in otherwise healthy individuals with early HIV-1 infection. The Armed Services Vocational Aptitude Battery is a 10-test written multiple aptitude battery administered to all civilian applicants for military enlistment prior to serologic screening for HIV-1 antibodies. A total of 975,489 induction testing records containing both Armed Services Vocational Aptitude Battery and HIV-1 results from October 1985 through March 1987 were examined. An analysis data set (n = 7,698) was constructed by choosing five controls for each of the 1,283 HIV-1-positive cases, matched on five-digit ZIP code, and a multiple linear regression analysis was performed to control for demographic and other factors that might influence test scores. Years of education was the strongest predictor of test scores, raising an applicant's score on a composite test nearly 0.16 standard deviation per year. The HIV-1-positive effect on the composite score was -0.09 standard deviation (99% confidence interval -0.17 to -0.02). Separate regressions on each component test within the battery showed HIV-1 effects between -0.39 and +0.06 standard deviation. The two Armed Services Vocational Aptitude Battery component tests felt a priori to be the most sensitive to HIV-1-positive status showed the least decrease with seropositivity. Much of the variability in test scores was not predicted by either HIV-1 serostatus or the demographic and other factors included in the model. There appeared to be little evidence of a strong HIV-1 effect.
Limbers, Christine A; Young, Danielle
2015-05-01
Executive functions play a critical role in regulating eating behaviors and have been shown to be associated with overeating which over time can result in overweight and obesity. There has been a paucity of research examining the associations among healthy dietary behaviors and executive functions utilizing behavioral rating scales of executive functioning. The objective of the present cross-sectional study was to evaluate the associations among fruit and vegetable consumption, intake of foods high in saturated fat, and executive functions using the Behavioral Rating Inventory of Executive Functioning-Adult Version. A total of 240 university students completed the Behavioral Rating Inventory of Executive Functioning-Adult Version, the 26-Item Eating Attitudes Test, and the Diet subscale of the Summary of Diabetes Self-Care Activities Questionnaire. Multiple linear regression analysis was conducted with two separate models in which fruit and vegetable consumption and saturated fat intake were the outcomes. Demographic variables, body mass index, and eating styles were controlled for in the analysis. Better initiation skills were associated with greater intake of fruits and vegetables in the last 7 days (standardized beta = -0.17; p < 0.05). Stronger inhibitory control was associated with less consumption of high fat foods in the last 7 days (standardized beta = 0.20; p < 0.05) in the multiple linear regression analysis. Executive functions that predict fruit and vegetable consumption are distinct from those that predict avoidance of foods high in saturated fat. Future research should investigate whether continued skill enhancement in initiation and inhibition following standard behavioral interventions improves long-term maintenance of weight loss. © The Author(s) 2015.
Ruder, Avima M; Hein, Misty J; Hopf, Nancy B; Waters, Martha A
2014-03-01
The objective of this analysis was to evaluate mortality among a cohort of 24,865 capacitor-manufacturing workers exposed to polychlorinated biphenyls (PCBs) at plants in Indiana, Massachusetts, and New York and followed for mortality through 2008. Cumulative PCB exposure was estimated using plant-specific job-exposure matrices. External comparisons to US and state-specific populations used standardized mortality ratios, adjusted for gender, race, age and calendar year. Among long-term workers employed 3 months or longer, within-cohort comparisons used standardized rate ratios and multivariable Poisson regression modeling. Through 2008, more than one million person-years at risk and 8749 deaths were accrued. Among long-term employees, all-cause and all-cancer mortality were not elevated; of the a priori outcomes assessed only melanoma mortality was elevated. Mortality was elevated for some outcomes of a priori interest among subgroups of long-term workers: all cancer, intestinal cancer and amyotrophic lateral sclerosis (women); melanoma (men); melanoma and brain and nervous system cancer (Indiana plant); and melanoma and multiple myeloma (New York plant). Standardized rates of stomach and uterine cancer and multiple myeloma mortality increased with estimated cumulative PCB exposure. Poisson regression modeling showed significant associations with estimated cumulative PCB exposure for prostate and stomach cancer mortality. For other outcomes of a priori interest--rectal, liver, ovarian, breast, and thyroid cancer, non-Hodgkin lymphoma, Alzheimer disease, and Parkinson disease--neither elevated mortality nor positive associations with PCB exposure were observed. Associations between estimated cumulative PCB exposure and stomach, uterine, and prostate cancer and myeloma mortality confirmed our previous positive findings. Published by Elsevier GmbH.
Ruder, Avima M.; Hein, Misty J.; Hopf, Nancy B.; Waters, Martha A.
2015-01-01
The objective of this analysis was to evaluate mortality among a cohort of 24,865 capacitor-manufacturing workers exposed to polychlorinated biphenyls (PCBs) at plants in Indiana, Massachusetts, and New York and followed for mortality through 2008. Cumulative PCB exposure was estimated using plant-specific job-exposure matrices. External comparisons to US and state-specific populations used standardized mortality ratios, adjusted for gender, race, age and calendar year. Among long-term workers employed 3 months or longer, within-cohort comparisons used standardized rate ratios and multivariable Poisson regression modeling. Through 2008, more than one million person-years at risk and 8749 deaths were accrued. Among long-term employees, all-cause and all-cancer mortality were not elevated; of the a priori outcomes assessed only melanoma mortality was elevated. Mortality was elevated for some outcomes of a priori interest among subgroups of long-term workers: all cancer, intestinal cancer and amyotrophic lateral sclerosis (women); melanoma (men); melanoma and brain and nervous system cancer (Indiana plant); and melanoma and multiple myeloma (New York plant). Standardized rates of stomach and uterine cancer and multiple myeloma mortality increased with estimated cumulative PCB exposure. Poisson regression modeling showed significant associations with estimated cumulative PCB exposure for prostate and stomach cancer mortality. For other outcomes of a priori interest – rectal, liver, ovarian, breast, and thyroid cancer, non-Hodgkin lymphoma, Alzheimer disease, and Parkinson disease – neither elevated mortality nor positive associations with PCB exposure were observed. Associations between estimated cumulative PCB exposure and stomach, uterine, and prostate cancer and myeloma mortality confirmed our previous positive findings. PMID:23707056
Huang, Shi; MacKinnon, David P.; Perrino, Tatiana; Gallo, Carlos; Cruden, Gracelyn; Brown, C Hendricks
2016-01-01
Mediation analysis often requires larger sample sizes than main effect analysis to achieve the same statistical power. Combining results across similar trials may be the only practical option for increasing statistical power for mediation analysis in some situations. In this paper, we propose a method to estimate: 1) marginal means for mediation path a, the relation of the independent variable to the mediator; 2) marginal means for path b, the relation of the mediator to the outcome, across multiple trials; and 3) the between-trial level variance-covariance matrix based on a bivariate normal distribution. We present the statistical theory and an R computer program to combine regression coefficients from multiple trials to estimate a combined mediated effect and confidence interval under a random effects model. Values of coefficients a and b, along with their standard errors from each trial are the input for the method. This marginal likelihood based approach with Monte Carlo confidence intervals provides more accurate inference than the standard meta-analytic approach. We discuss computational issues, apply the method to two real-data examples and make recommendations for the use of the method in different settings. PMID:28239330
Tighe, Elizabeth L.; Schatschneider, Christopher
2015-01-01
The purpose of this study was to investigate the joint and unique contributions of morphological awareness and vocabulary knowledge at five reading comprehension levels in Adult Basic Education (ABE) students. We introduce the statistical technique of multiple quantile regression, which enabled us to assess the predictive utility of morphological awareness and vocabulary knowledge at multiple points (quantiles) along the continuous distribution of reading comprehension. To demonstrate the efficacy of our multiple quantile regression analysis, we compared and contrasted our results with a traditional multiple regression analytic approach. Our results indicated that morphological awareness and vocabulary knowledge accounted for a large portion of the variance (82-95%) in reading comprehension skills across all quantiles. Morphological awareness exhibited the greatest unique predictive ability at lower levels of reading comprehension whereas vocabulary knowledge exhibited the greatest unique predictive ability at higher levels of reading comprehension. These results indicate the utility of using multiple quantile regression to assess trajectories of component skills across multiple levels of reading comprehension. The implications of our findings for ABE programs are discussed. PMID:25351773
ℓ(p)-Norm multikernel learning approach for stock market price forecasting.
Shao, Xigao; Wu, Kun; Liao, Bifeng
2012-01-01
Linear multiple kernel learning model has been used for predicting financial time series. However, ℓ(1)-norm multiple support vector regression is rarely observed to outperform trivial baselines in practical applications. To allow for robust kernel mixtures that generalize well, we adopt ℓ(p)-norm multiple kernel support vector regression (1 ≤ p < ∞) as a stock price prediction model. The optimization problem is decomposed into smaller subproblems, and the interleaved optimization strategy is employed to solve the regression model. The model is evaluated on forecasting the daily stock closing prices of Shanghai Stock Index in China. Experimental results show that our proposed model performs better than ℓ(1)-norm multiple support vector regression model.
An examination of racial bias in the Beck Depression Inventory-II.
Sashidharan, Tracy; Pawlow, Laura A; Pettibone, Jonathan C
2012-04-01
Historically, many psychological measures were developed and standardized based on a primarily Caucasian population. These tests are subsequently applied to minorities and may be inappropriate and possibly even pathologizing. The widely used Beck Depression Inventory-II (BDI-II) was initially standardized on a sample of Caucasian university students and its use with minorities has only recently been investigated. This study examined the possibility of racial bias in the BDI-II by comparing Caucasian and African American Midwestern university students. A hierarchical multiple regression compared the scores of the BDI-II with a similar measure of depression that is standardized for use with African Americans. There was no evidence of racial bias discovered in the BDI-II in this sample. Implications and future directions of research are discussed. PsycINFO Database Record (c) 2012 APA, all rights reserved.
Relationship between non-standard work arrangements and work-related accident absence in Belgium
Alali, Hanan; Braeckman, Lutgart; Van Hecke, Tanja; De Clercq, Bart; Janssens, Heidi; Wahab, Magd Abdel
2017-01-01
Objectives: The main objective of this study is to examine the relationship between indicators of non-standard work arrangements, including precarious contract, long working hours, multiple jobs, shift work, and work-related accident absence, using a representative Belgian sample and considering several socio-demographic and work characteristics. Methods: This study was based on the data of the fifth European Working Conditions Survey (EWCS). For the analysis, the sample was restricted to 3343 respondents from Belgium who were all employed workers. The associations between non-standard work arrangements and work-related accident absence were studied with multivariate logistic regression modeling techniques while adjusting for several confounders. Results: During the last 12 months, about 11.7% of workers were absent from work because of work-related accident. A multivariate regression model showed an increased injury risk for those performing shift work (OR 1.546, 95% CI 1.074-2.224). The relationship between contract type and occupational injuries was not significant (OR 1.163, 95% CI 0.739-1.831). Furthermore, no statistically significant differences were observed for those performing long working hours (OR 1.217, 95% CI 0.638-2.321) and those performing multiple jobs (OR 1.361, 95% CI 0.827-2.240) in relation to work-related accident absence. Those who rated their health as bad, low educated workers, workers from the construction sector, and those exposed to biomechanical exposure (BM) were more frequent victims of work-related accident absence. No significant gender difference was observed. Conclusion: Indicators of non-standard work arrangements under this study, except shift work, were not significantly associated with work-related accident absence. To reduce the burden of occupational injuries, not only risk reduction strategies and interventions are needed but also policy efforts are to be undertaken to limit shift work. In general, preventive measures and more training on the job are needed to ensure the safety and well-being of all workers. PMID:28111414
Relationship between non-standard work arrangements and work-related accident absence in Belgium.
Alali, Hanan; Braeckman, Lutgart; Van Hecke, Tanja; De Clercq, Bart; Janssens, Heidi; Wahab, Magd Abdel
2017-03-28
The main objective of this study is to examine the relationship between indicators of non-standard work arrangements, including precarious contract, long working hours, multiple jobs, shift work, and work-related accident absence, using a representative Belgian sample and considering several socio-demographic and work characteristics. This study was based on the data of the fifth European Working Conditions Survey (EWCS). For the analysis, the sample was restricted to 3343 respondents from Belgium who were all employed workers. The associations between non-standard work arrangements and work-related accident absence were studied with multivariate logistic regression modeling techniques while adjusting for several confounders. During the last 12 months, about 11.7% of workers were absent from work because of work-related accident. A multivariate regression model showed an increased injury risk for those performing shift work (OR 1.546, 95% CI 1.074-2.224). The relationship between contract type and occupational injuries was not significant (OR 1.163, 95% CI 0.739-1.831). Furthermore, no statistically significant differences were observed for those performing long working hours (OR 1.217, 95% CI 0.638-2.321) and those performing multiple jobs (OR 1.361, 95% CI 0.827-2.240) in relation to work-related accident absence. Those who rated their health as bad, low educated workers, workers from the construction sector, and those exposed to biomechanical exposure (BM) were more frequent victims of work-related accident absence. No significant gender difference was observed. Indicators of non-standard work arrangements under this study, except shift work, were not significantly associated with work-related accident absence. To reduce the burden of occupational injuries, not only risk reduction strategies and interventions are needed but also policy efforts are to be undertaken to limit shift work. In general, preventive measures and more training on the job are needed to ensure the safety and well-being of all workers.
Advanced statistics: linear regression, part II: multiple linear regression.
Marill, Keith A
2004-01-01
The applications of simple linear regression in medical research are limited, because in most situations, there are multiple relevant predictor variables. Univariate statistical techniques such as simple linear regression use a single predictor variable, and they often may be mathematically correct but clinically misleading. Multiple linear regression is a mathematical technique used to model the relationship between multiple independent predictor variables and a single dependent outcome variable. It is used in medical research to model observational data, as well as in diagnostic and therapeutic studies in which the outcome is dependent on more than one factor. Although the technique generally is limited to data that can be expressed with a linear function, it benefits from a well-developed mathematical framework that yields unique solutions and exact confidence intervals for regression coefficients. Building on Part I of this series, this article acquaints the reader with some of the important concepts in multiple regression analysis. These include multicollinearity, interaction effects, and an expansion of the discussion of inference testing, leverage, and variable transformations to multivariate models. Examples from the first article in this series are expanded on using a primarily graphic, rather than mathematical, approach. The importance of the relationships among the predictor variables and the dependence of the multivariate model coefficients on the choice of these variables are stressed. Finally, concepts in regression model building are discussed.
Biases and Standard Errors of Standardized Regression Coefficients
ERIC Educational Resources Information Center
Yuan, Ke-Hai; Chan, Wai
2011-01-01
The paper obtains consistent standard errors (SE) and biases of order O(1/n) for the sample standardized regression coefficients with both random and given predictors. Analytical results indicate that the formulas for SEs given in popular text books are consistent only when the population value of the regression coefficient is zero. The sample…
ERIC Educational Resources Information Center
Anderson, Carolyn J.; Verkuilen, Jay; Peyton, Buddy L.
2010-01-01
Survey items with multiple response categories and multiple-choice test questions are ubiquitous in psychological and educational research. We illustrate the use of log-multiplicative association (LMA) models that are extensions of the well-known multinomial logistic regression model for multiple dependent outcome variables to reanalyze a set of…
NASA Technical Reports Server (NTRS)
Smith, James A.
1992-01-01
The inversion of the leaf area index (LAI) canopy parameter from optical spectral reflectance measurements is obtained using a backpropagation artificial neural network trained using input-output pairs generated by a multiple scattering reflectance model. The problem of LAI estimation over sparse canopies (LAI < 1.0) with varying soil reflectance backgrounds is particularly difficult. Standard multiple regression methods applied to canopies within a single homogeneous soil type yield good results but perform unacceptably when applied across soil boundaries, resulting in absolute percentage errors of >1000 percent for low LAI. Minimization methods applied to merit functions constructed from differences between measured reflectances and predicted reflectances using multiple-scattering models are unacceptably sensitive to a good initial guess for the desired parameter. In contrast, the neural network reported generally yields absolute percentage errors of <30 percent when weighting coefficients trained on one soil type were applied to predicted canopy reflectance at a different soil background.
Peak-flow characteristics of Wyoming streams
Miller, Kirk A.
2003-01-01
Peak-flow characteristics for unregulated streams in Wyoming are described in this report. Frequency relations for annual peak flows through water year 2000 at 364 streamflow-gaging stations in and near Wyoming were evaluated and revised or updated as needed. Analyses of historical floods, temporal trends, and generalized skew were included in the evaluation. Physical and climatic basin characteristics were determined for each gaging station using a geographic information system. Gaging stations with similar peak-flow and basin characteristics were grouped into six hydrologic regions. Regional statistical relations between peak-flow and basin characteristics were explored using multiple-regression techniques. Generalized least squares regression equations for estimating magnitudes of annual peak flows with selected recurrence intervals from 1.5 to 500 years were developed for each region. Average standard errors of estimate range from 34 to 131 percent. Average standard errors of prediction range from 35 to 135 percent. Several statistics for evaluating and comparing the errors in these estimates are described. Limitations of the equations are described. Methods for applying the regional equations for various circumstances are listed and examples are given.
Feri, Rose; Soemantri, Diantha; Jusuf, Anwar
2016-12-29
This study applied self-determination theory (SDT) to investigate the relationship between students' autonomous motivation and tutors' autonomy support in medical students' academic achievement. This was a cross-sectional study. Out of 204 students in a fundamental medical science course, 199 participated in the study. Data was collected using two questionnaires: the Learning Self-Regulation and Learning Climate Questionnaires. The score of the course assessment was the measure of academic achievement. Data was analyzed and reported with descriptive and inferential statistics (mean, standard deviation and multiple regression analysis). Mean score (±standard deviation) of the autonomous motivation, tutors' autonomy support, and academic achievement were 5.48±0.89, 5.22±0.92, and 5.22±0.92. Multiple regression results reported students' autonomous motivation was associated with improvement of students' academic achievement (β=15.2, p=0.004). However, augmentation of tutors' autonomy support was not reflected in the improvement of students' academic achievement (β = -12.6, p = 0.019). Both students' autonomous motivation and tutors' autonomy support had a contribution of about 4.2% students' academic achievement (F = 4.343, p = 0.014, R 2 = 0.042). Due to the unique characteristic of our medical students' educational background, our study shows that tutors' autonomy support is inconsistent with students' academic achievement. However, both autonomous motivation and support are essential to students' academic achievement. Further study is needed to explore students' educational background and self-regulated learning competence to improve students' academic achievement.
Isolating and Examining Sources of Suppression and Multicollinearity in Multiple Linear Regression
ERIC Educational Resources Information Center
Beckstead, Jason W.
2012-01-01
The presence of suppression (and multicollinearity) in multiple regression analysis complicates interpretation of predictor-criterion relationships. The mathematical conditions that produce suppression in regression analysis have received considerable attention in the methodological literature but until now nothing in the way of an analytic…
General Nature of Multicollinearity in Multiple Regression Analysis.
ERIC Educational Resources Information Center
Liu, Richard
1981-01-01
Discusses multiple regression, a very popular statistical technique in the field of education. One of the basic assumptions in regression analysis requires that independent variables in the equation should not be highly correlated. The problem of multicollinearity and some of the solutions to it are discussed. (Author)
ℓ p-Norm Multikernel Learning Approach for Stock Market Price Forecasting
Shao, Xigao; Wu, Kun; Liao, Bifeng
2012-01-01
Linear multiple kernel learning model has been used for predicting financial time series. However, ℓ 1-norm multiple support vector regression is rarely observed to outperform trivial baselines in practical applications. To allow for robust kernel mixtures that generalize well, we adopt ℓ p-norm multiple kernel support vector regression (1 ≤ p < ∞) as a stock price prediction model. The optimization problem is decomposed into smaller subproblems, and the interleaved optimization strategy is employed to solve the regression model. The model is evaluated on forecasting the daily stock closing prices of Shanghai Stock Index in China. Experimental results show that our proposed model performs better than ℓ 1-norm multiple support vector regression model. PMID:23365561
Sample size determination for logistic regression on a logit-normal distribution.
Kim, Seongho; Heath, Elisabeth; Heilbrun, Lance
2017-06-01
Although the sample size for simple logistic regression can be readily determined using currently available methods, the sample size calculation for multiple logistic regression requires some additional information, such as the coefficient of determination ([Formula: see text]) of a covariate of interest with other covariates, which is often unavailable in practice. The response variable of logistic regression follows a logit-normal distribution which can be generated from a logistic transformation of a normal distribution. Using this property of logistic regression, we propose new methods of determining the sample size for simple and multiple logistic regressions using a normal transformation of outcome measures. Simulation studies and a motivating example show several advantages of the proposed methods over the existing methods: (i) no need for [Formula: see text] for multiple logistic regression, (ii) available interim or group-sequential designs, and (iii) much smaller required sample size.
Ding, Changfeng; Li, Xiaogang; Zhang, Taolin; Ma, Yibing; Wang, Xingxiang
2014-10-01
Soil environmental quality standards in respect of heavy metals for farmlands should be established considering both their effects on crop yield and their accumulation in the edible part. A greenhouse experiment was conducted to investigate the effects of chromium (Cr) on biomass production and Cr accumulation in carrot plants grown in a wide range of soils. The results revealed that carrot yield significantly decreased in 18 of the total 20 soils with Cr addition being the soil environmental quality standard of China. The Cr content of carrot grown in the five soils with pH>8.0 exceeded the maximum allowable level (0.5mgkg(-1)) according to the Chinese General Standard for Contaminants in Foods. The relationship between carrot Cr concentration and soil pH could be well fitted (R(2)=0.70, P<0.0001) by a linear-linear segmented regression model. The addition of Cr to soil influenced carrot yield firstly rather than the food quality. The major soil factors controlling Cr phytotoxicity and the prediction models were further identified and developed using path analysis and stepwise multiple linear regression analysis. Soil Cr thresholds for phytotoxicity meanwhile ensuring food safety were then derived on the condition of 10 percent yield reduction. Copyright © 2014 Elsevier Inc. All rights reserved.
Failure of Standard Training Sets in the Analysis of Fast-Scan Cyclic Voltammetry Data.
Johnson, Justin A; Rodeberg, Nathan T; Wightman, R Mark
2016-03-16
The use of principal component regression, a multivariate calibration method, in the analysis of in vivo fast-scan cyclic voltammetry data allows for separation of overlapping signal contributions, permitting evaluation of the temporal dynamics of multiple neurotransmitters simultaneously. To accomplish this, the technique relies on information about current-concentration relationships across the scan-potential window gained from analysis of training sets. The ability of the constructed models to resolve analytes depends critically on the quality of these data. Recently, the use of standard training sets obtained under conditions other than those of the experimental data collection (e.g., with different electrodes, animals, or equipment) has been reported. This study evaluates the analyte resolution capabilities of models constructed using this approach from both a theoretical and experimental viewpoint. A detailed discussion of the theory of principal component regression is provided to inform this discussion. The findings demonstrate that the use of standard training sets leads to misassignment of the current-concentration relationships across the scan-potential window. This directly results in poor analyte resolution and, consequently, inaccurate quantitation, which may lead to erroneous conclusions being drawn from experimental data. Thus, it is strongly advocated that training sets be obtained under the experimental conditions to allow for accurate data analysis.
MBS Measurement Tool for Swallow Impairment—MBSImp: Establishing a Standard
Martin-Harris, Bonnie; Brodsky, Martin B.; Michel, Yvonne; Castell, Donald O.; Schleicher, Melanie; Sandidge, John; Maxwell, Rebekah; Blair, Julie
2014-01-01
The aim of this study was to test reliability, content, construct, and external validity of a new modified barium swallowing study (MBSS) tool (MBSImp) that is used to quantify swallowing impairment. Multiple regression, confirmatory factor, and correlation analyses were used to analyze 300 in- and outpatients with heterogeneous medical and surgical diagnoses who were sequentially referred for MBS exams at a university medical center and private tertiary care community hospital. Main outcome measures were the MBSImp and index scores of aspiration, health status, and quality of life. Inter- and intrarater concordance were 80% or greater for blinded scoring of MBSSs. Regression analysis revealed contributions of eight of nine swallow types to impressions of overall swallowing impairment (p ≤ 0.05). Factor analysis revealed 13 significant components (loadings ≥ 0.5) that formed two impairment groupings (oral and pharyngeal). Significant correlations were found between Oral and Pharyngeal Impairment scores and Penetration-Aspiration Scale scores, and indexes of intake status, nutrition, health status, and quality of life. The MBSImp demonstrated clinical practicality, favorable inter- and intrarater reliability following standardized training, content, and external validity. This study reflects potential for establishment of a new standard for quantification and comparison of oropharyngeal swallowing impairment across patient diagnoses as measured on MBSS. PMID:18855050
Mandel, Micha; Gauthier, Susan A; Guttmann, Charles R G; Weiner, Howard L; Betensky, Rebecca A
2007-12-01
The expanded disability status scale (EDSS) is an ordinal score that measures progression in multiple sclerosis (MS). Progression is defined as reaching EDSS of a certain level (absolute progression) or increasing of one point of EDSS (relative progression). Survival methods for time to progression are not adequate for such data since they do not exploit the EDSS level at the end of follow-up. Instead, we suggest a Markov transitional model applicable for repeated categorical or ordinal data. This approach enables derivation of covariate-specific survival curves, obtained after estimation of the regression coefficients and manipulations of the resulting transition matrix. Large sample theory and resampling methods are employed to derive pointwise confidence intervals, which perform well in simulation. Methods for generating survival curves for time to EDSS of a certain level, time to increase of EDSS of at least one point, and time to two consecutive visits with EDSS greater than three are described explicitly. The regression models described are easily implemented using standard software packages. Survival curves are obtained from the regression results using packages that support simple matrix calculation. We present and demonstrate our method on data collected at the Partners MS center in Boston, MA. We apply our approach to progression defined by time to two consecutive visits with EDSS greater than three, and calculate crude (without covariates) and covariate-specific curves.
Techniques for estimating flood-peak discharges of rural, unregulated streams in Ohio
Koltun, G.F.; Roberts, J.W.
1990-01-01
Multiple-regression equations are presented for estimating flood-peak discharges having recurrence intervals of 2, 5, 10, 25, 50, and 100 years at ungaged sites on rural, unregulated streams in Ohio. The average standard errors of prediction for the equations range from 33.4% to 41.4%. Peak discharge estimates determined by log-Pearson Type III analysis using data collected through the 1987 water year are reported for 275 streamflow-gaging stations. Ordinary least-squares multiple-regression techniques were used to divide the State into three regions and to identify a set of basin characteristics that help explain station-to- station variation in the log-Pearson estimates. Contributing drainage area, main-channel slope, and storage area were identified as suitable explanatory variables. Generalized least-square procedures, which include historical flow data and account for differences in the variance of flows at different gaging stations, spatial correlation among gaging station records, and variable lengths of station record were used to estimate the regression parameters. Weighted peak-discharge estimates computed as a function of the log-Pearson Type III and regression estimates are reported for each station. A method is provided to adjust regression estimates for ungaged sites by use of weighted and regression estimates for a gaged site located on the same stream. Limitations and shortcomings cited in an earlier report on the magnitude and frequency of floods in Ohio are addressed in this study. Geographic bias is no longer evident for the Maumee River basin of northwestern Ohio. No bias is found to be associated with the forested-area characteristic for the range used in the regression analysis (0.0 to 99.0%), nor is this characteristic significant in explaining peak discharges. Surface-mined area likewise is not significant in explaining peak discharges, and the regression equations are not biased when applied to basins having approximately 30% or less surface-mined area. Analyses of residuals indicate that the equations tend to overestimate flood-peak discharges for basins having approximately 30% or more surface-mined area. (USGS)
Tighe, Elizabeth L; Schatschneider, Christopher
2016-07-01
The purpose of this study was to investigate the joint and unique contributions of morphological awareness and vocabulary knowledge at five reading comprehension levels in adult basic education (ABE) students. We introduce the statistical technique of multiple quantile regression, which enabled us to assess the predictive utility of morphological awareness and vocabulary knowledge at multiple points (quantiles) along the continuous distribution of reading comprehension. To demonstrate the efficacy of our multiple quantile regression analysis, we compared and contrasted our results with a traditional multiple regression analytic approach. Our results indicated that morphological awareness and vocabulary knowledge accounted for a large portion of the variance (82%-95%) in reading comprehension skills across all quantiles. Morphological awareness exhibited the greatest unique predictive ability at lower levels of reading comprehension whereas vocabulary knowledge exhibited the greatest unique predictive ability at higher levels of reading comprehension. These results indicate the utility of using multiple quantile regression to assess trajectories of component skills across multiple levels of reading comprehension. The implications of our findings for ABE programs are discussed. © Hammill Institute on Disabilities 2014.
Stepwise versus Hierarchical Regression: Pros and Cons
ERIC Educational Resources Information Center
Lewis, Mitzi
2007-01-01
Multiple regression is commonly used in social and behavioral data analysis. In multiple regression contexts, researchers are very often interested in determining the "best" predictors in the analysis. This focus may stem from a need to identify those predictors that are supportive of theory. Alternatively, the researcher may simply be interested…
NASA Technical Reports Server (NTRS)
Parker, Peter A.; Geoffrey, Vining G.; Wilson, Sara R.; Szarka, John L., III; Johnson, Nels G.
2010-01-01
The calibration of measurement systems is a fundamental but under-studied problem within industrial statistics. The origins of this problem go back to basic chemical analysis based on NIST standards. In today's world these issues extend to mechanical, electrical, and materials engineering. Often, these new scenarios do not provide "gold standards" such as the standard weights provided by NIST. This paper considers the classic "forward regression followed by inverse regression" approach. In this approach the initial experiment treats the "standards" as the regressor and the observed values as the response to calibrate the instrument. The analyst then must invert the resulting regression model in order to use the instrument to make actual measurements in practice. This paper compares this classical approach to "reverse regression," which treats the standards as the response and the observed measurements as the regressor in the calibration experiment. Such an approach is intuitively appealing because it avoids the need for the inverse regression. However, it also violates some of the basic regression assumptions.
Distiller, Larry A; Joffe, Barry I; Melville, Vanessa; Welman, Tania; Distiller, Greg B
2006-01-01
The factors responsible for premature coronary atherosclerosis in patients with type 1 diabetes are ill defined. We therefore assessed carotid intima-media complex thickness (IMT) in relatively long-surviving patients with type 1 diabetes as a marker of atherosclerosis and correlated this with traditional risk factors. Cross-sectional study of 148 patients with relatively long-surviving (>18 years) type 1 diabetes (76 men and 72 women) attending the Centre for Diabetes and Endocrinology, Johannesburg. The mean common carotid artery IMT and presence or absence of plaque was evaluated by high-resolution B-mode ultrasound. Their median age was 48 years and duration of diabetes 26 years (range 18-59 years). Traditional risk factors (age, duration of diabetes, glycemic control, hypertension, smoking and lipoprotein concentrations) were recorded. Three response variables were defined and modeled. Standard multiple regression was used for a continuous IMT variable, logistic regression for the presence/absence of plaque and ordinal logistic regression to model three categories of "risk." The median common carotid IMT was 0.62 mm (range 0.44-1.23 mm) with plaque detected in 28 cases. The multiple regression model found significant associations between IMT and current age (P=.001), duration of diabetes (P=.033), BMI (P=.008) and diagnosed hypertension (P=.046) with HDL showing a protective effect (P=.022). Current age (P=.001) and diagnosed hypertension (P=.004), smoking (P=.008) and retinopathy (P=.033) were significant in the logistic regression model. Current age was also significant in the ordinal logistic regression model (P<.001), as was total cholesterol/HDL ratio (P<.001) and mean HbA(1c) concentration (P=.073). The major factors influencing common carotid IMT in patients with relatively long-surviving type 1 diabetes are age, duration of diabetes, existing hypertension and HDL (protective) with a relatively minor role ascribed to relatively long-standing glycemic control.
Tokunaga, Makoto; Watanabe, Susumu; Sonoda, Shigeru
2017-09-01
Multiple linear regression analysis is often used to predict the outcome of stroke rehabilitation. However, the predictive accuracy may not be satisfactory. The objective of this study was to elucidate the predictive accuracy of a method of calculating motor Functional Independence Measure (mFIM) at discharge from mFIM effectiveness predicted by multiple regression analysis. The subjects were 505 patients with stroke who were hospitalized in a convalescent rehabilitation hospital. The formula "mFIM at discharge = mFIM effectiveness × (91 points - mFIM at admission) + mFIM at admission" was used. By including the predicted mFIM effectiveness obtained through multiple regression analysis in this formula, we obtained the predicted mFIM at discharge (A). We also used multiple regression analysis to directly predict mFIM at discharge (B). The correlation between the predicted and the measured values of mFIM at discharge was compared between A and B. The correlation coefficients were .916 for A and .878 for B. Calculating mFIM at discharge from mFIM effectiveness predicted by multiple regression analysis had a higher degree of predictive accuracy of mFIM at discharge than that directly predicted. Copyright © 2017 National Stroke Association. Published by Elsevier Inc. All rights reserved.
Use of Empirical Estimates of Shrinkage in Multiple Regression: A Caution.
ERIC Educational Resources Information Center
Kromrey, Jeffrey D.; Hines, Constance V.
1995-01-01
The accuracy of four empirical techniques to estimate shrinkage in multiple regression was studied through Monte Carlo simulation. None of the techniques provided unbiased estimates of the population squared multiple correlation coefficient, but the normalized jackknife and bootstrap techniques demonstrated marginally acceptable performance with…
Enhance-Synergism and Suppression Effects in Multiple Regression
ERIC Educational Resources Information Center
Lipovetsky, Stan; Conklin, W. Michael
2004-01-01
Relations between pairwise correlations and the coefficient of multiple determination in regression analysis are considered. The conditions for the occurrence of enhance-synergism and suppression effects when multiple determination becomes bigger than the total of squared correlations of the dependent variable with the regressors are discussed. It…
Clinical Decision Support Model to Predict Occlusal Force in Bruxism Patients.
Thanathornwong, Bhornsawan; Suebnukarn, Siriwan
2017-10-01
The aim of this study was to develop a decision support model for the prediction of occlusal force from the size and color of articulating paper markings in bruxism patients. We used the information from the datasets of 30 bruxism patients in which digital measurements of the size and color of articulating paper markings (12-µm Hanel; Coltene/Whaledent GmbH, Langenau, Germany) on canine protected hard stabilization splints were measured in pixels (P) and in red (R), green (G), and blue (B) values using Adobe Photoshop software (Adobe Systems, San Jose, CA, USA). The occlusal force (F) was measured using T-Scan III (Tekscan Inc., South Boston, MA, USA). The multiple regression equation was applied to predict F from the P and RGB. Model evaluation was performed using the datasets from 10 new patients. The patient's occlusal force measured by T-Scan III was used as a 'gold standard' to compare with the occlusal force predicted by the multiple regression model. The results demonstrate that the correlation between the occlusal force and the pixels and RGB of the articulating paper markings was positive (F = 1.62×P + 0.07×R -0.08×G + 0.08×B + 4.74; R 2 = 0.34). There was a high degree of agreement between the occlusal force of the patient measured using T-Scan III and the occlusal force predicted by the model (kappa value = 0.82). The results obtained demonstrate that the multiple regression model can predict the occlusal force using the digital values for the size and color of the articulating paper markings in bruxism patients.
Analysis of the Environmental Management System based on ISO 14001 on the American continent.
Neves, Fábio de Oliveira; Salgado, Eduardo G; Beijo, Luiz A
2017-09-01
The American continent is in broad economic and industrial development. Consequently, a more detailed discussion of the impacts generated by such development is needed. Moreover, there is an increase in the number of ISO 14001 certificates issued to this continent. Given the above, no studies were found that bridge the gap to identify the influence of different factors on ISO 14001 in the Americas. Thus, this article has as its main aim to check which economic, environmental and cultural factors have influence on ISO 14001 Certification in the American Continent. The data were collected in the ISO Survey, World Bank, United Nations Development Programme and International Energy Agency. Among the countries of that continent, thirteen were analyzed and only two did not show the economic factors as the influence factor in the multiple regression models fitted with Brazil and the United State. In these models, all presented environmental factors as influencing factors. Only in Brazil the index HDI presented as cultural factor in multiple regression model fitted. The economic factors: Gross Domestic Product and exports of goods and services and environmental: Carbon Dioxide (CO 2 ) and fossil fuel consumption were the most influential in ISO 14001 certification. Venezuela, Uruguay, Colombia and the United States were countries that had factors dependent on each other, featuring the environmental marketing. Briefly, this study brings up several implications: to the academy, with the proposal of new concepts and guidance on the factors that assist in ISO 14001 certification in the American Continent. Additionally, taking into account the industry, the factors serve as efficiency parameters for the implementation of ISO 14001 standard, and for the Government to improve through factors that do not fit in multiple regression models. Copyright © 2017 Elsevier Ltd. All rights reserved.
Røislien, Jo; Lossius, Hans Morten; Kristiansen, Thomas
2015-01-01
Background Trauma is a leading global cause of death. Trauma mortality rates are higher in rural areas, constituting a challenge for quality and equality in trauma care. The aim of the study was to explore population density and transport time to hospital care as possible predictors of geographical differences in mortality rates, and to what extent choice of statistical method might affect the analytical results and accompanying clinical conclusions. Methods Using data from the Norwegian Cause of Death registry, deaths from external causes 1998–2007 were analysed. Norway consists of 434 municipalities, and municipality population density and travel time to hospital care were entered as predictors of municipality mortality rates in univariate and multiple regression models of increasing model complexity. We fitted linear regression models with continuous and categorised predictors, as well as piecewise linear and generalised additive models (GAMs). Models were compared using Akaike's information criterion (AIC). Results Population density was an independent predictor of trauma mortality rates, while the contribution of transport time to hospital care was highly dependent on choice of statistical model. A multiple GAM or piecewise linear model was superior, and similar, in terms of AIC. However, while transport time was statistically significant in multiple models with piecewise linear or categorised predictors, it was not in GAM or standard linear regression. Conclusions Population density is an independent predictor of trauma mortality rates. The added explanatory value of transport time to hospital care is marginal and model-dependent, highlighting the importance of exploring several statistical models when studying complex associations in observational data. PMID:25972600
Qing, Si-han; Chang, Yun-feng; Dong, Xiao-ai; Li, Yuan; Chen, Xiao-gang; Shu, Yong-kang; Deng, Zhen-hua
2013-10-01
To establish the mathematical models of stature estimation for Sichuan Han female with measurement of lumbar vertebrae by X-ray to provide essential data for forensic anthropology research. The samples, 206 Sichuan Han females, were divided into three groups including group A, B and C according to the ages. Group A (206 samples) consisted of all ages, group B (116 samples) were 20-45 years old and 90 samples over 45 years old were group C. All the samples were examined lumbar vertebrae through CR technology, including the parameters of five centrums (L1-L5) as anterior border, posterior border and central heights (x1-x15), total central height of lumbar spine (x16), and the real height of every sample. The linear regression analysis was produced using the parameters to establish the mathematical models of stature estimation. Sixty-two trained subjects were tested to verify the accuracy of the mathematical models. The established mathematical models by hypothesis test of linear regression equation model were statistically significant (P<0.05). The standard errors of the equation were 2.982-5.004 cm, while correlation coefficients were 0.370-0.779 and multiple correlation coefficients were 0.533-0.834. The return tests of the highest correlation coefficient and multiple correlation coefficient of each group showed that the highest accuracy of the multiple regression equation, y = 100.33 + 1.489 x3 - 0.548 x6 + 0.772 x9 + 0.058 x12 + 0.645 x15, in group A were 80.6% (+/- lSE) and 100% (+/- 2SE). The established mathematical models in this study could be applied for the stature estimation for Sichuan Han females.
DFT study on oxidation of HS(CH2) m SH ( m = 1-8) in oxidative desulfurization
NASA Astrophysics Data System (ADS)
Song, Y. Z.; Song, J. J.; Zhao, T. T.; Chen, C. Y.; He, M.; Du, J.
2016-06-01
Density functional theory was employed for calculation of HS(CH2) m SH ( m = 1-8) and its derivatives at B3LYP method at 6-31++g ( d, p) level. Using eigenvalues of LUMO and HOMO for HS(CH2) m SH, the standard electrode potentials were estimated by a stepwise multiple regression techniques (MLR), and obtained as E° = 1.500 + 7.167 × 10-3 HOMO-0.229 LUMO with high correlation coefficients of 0.973 and F values of 43.973.
An Effect Size for Regression Predictors in Meta-Analysis
ERIC Educational Resources Information Center
Aloe, Ariel M.; Becker, Betsy Jane
2012-01-01
A new effect size representing the predictive power of an independent variable from a multiple regression model is presented. The index, denoted as r[subscript sp], is the semipartial correlation of the predictor with the outcome of interest. This effect size can be computed when multiple predictor variables are included in the regression model…
Regression Analysis: Legal Applications in Institutional Research
ERIC Educational Resources Information Center
Frizell, Julie A.; Shippen, Benjamin S., Jr.; Luna, Andrew L.
2008-01-01
This article reviews multiple regression analysis, describes how its results should be interpreted, and instructs institutional researchers on how to conduct such analyses using an example focused on faculty pay equity between men and women. The use of multiple regression analysis will be presented as a method with which to compare salaries of…
RAWS II: A MULTIPLE REGRESSION ANALYSIS PROGRAM,
This memorandum gives instructions for the use and operation of a revised version of RAWS, a multiple regression analysis program. The program...of preprocessed data, the directed retention of variable, listing of the matrix of the normal equations and its inverse, and the bypassing of the regression analysis to provide the input variable statistics only. (Author)
Incremental Net Effects in Multiple Regression
ERIC Educational Resources Information Center
Lipovetsky, Stan; Conklin, Michael
2005-01-01
A regular problem in regression analysis is estimating the comparative importance of the predictors in the model. This work considers the 'net effects', or shares of the predictors in the coefficient of the multiple determination, which is a widely used characteristic of the quality of a regression model. Estimation of the net effects can be a…
Floating Data and the Problem with Illustrating Multiple Regression.
ERIC Educational Resources Information Center
Sachau, Daniel A.
2000-01-01
Discusses how to introduce basic concepts of multiple regression by creating a large-scale, three-dimensional regression model using the classroom walls and floor. Addresses teaching points that should be covered and reveals student reaction to the model. Finds that the greatest benefit of the model is the low fear, walk-through, nonmathematical…
Nurse practitioners: leadership behaviors and organizational climate.
Jones, L C; Guberski, T D; Soeken, K L
1990-01-01
The purpose of this article is to examine the relationships of individual nurse practitioners' perceptions of the leadership climate in their organizations and self-reported formal and informal leadership behaviors. The nine climate dimensions (Structure, Responsibility, Reward, Perceived Support of Risk Taking, Warmth, Support, Standard Setting, Conflict, and Identity) identified by Litwin and Stringer in 1968 were used to predict five leadership dimensions (Meeting Organizational Needs, Managing Resources, Leadership Competence, Task Accomplishment, and Communications). Demographic variables of age, educational level, and percent of time spent performing administrative functions were forced as a first step in each multiple regression analysis and used to explain a significant amount of variance in all but one analysis. All leadership dimensions were predicted by at least one organizational climate dimension: (1) Meeting Organizational Needs by Risk and Reward; (2) Managing Resources by Risk and Structure; (3) Leadership Competence by Risk and Standards; (4) Task Accomplishment by Structure, Risk, and Standards; and (5) Communication by Rewards.
Mann, Georgianna; Hosig, Kathy; Zhang, Angang; Shen, Sumin; Serrano, Elena
To assess the effects of the national Smart Snacks in School standards, which include nutrient and ingredient limitations for school competitive foods and beverages effective July, 2014, on student intake in low-income rural Appalachian middle schools. Food-frequency questionnaires were administered to students before and after implementation. Multiple ordinal logistic regression models were conducted to examine effects from year of data collection, grade, and free or reduced price lunch participation rates. No significant changes were observed after implementation except a decrease in consumption of 1% or nonfat flavored milk at school. Smart Snacks in School standards did not result in significant dietary changes in this study. Longitudinal studies could evaluate long-term impacts of nutrition standards. Copyright © 2017 Society for Nutrition Education and Behavior. Published by Elsevier Inc. All rights reserved.
Lindholdt, Louise; Labriola, Merete; Nielsen, Claus Vinther; Horsbøl, Trine Allerslev; Lund, Thomas
2017-01-01
Introduction The return-to-work (RTW) process after long-term sickness absence is often complex and long and implies multiple shifts between different labour market states for the absentee. Standard methods for examining RTW research typically rely on the analysis of one outcome measure at a time, which will not capture the many possible states and transitions the absentee can go through. The purpose of this study was to explore the potential added value of sequence analysis in supplement to standard regression analysis of a multidisciplinary RTW intervention among patients with low back pain (LBP). Methods The study population consisted of 160 patients randomly allocated to either a hospital-based brief or a multidisciplinary intervention. Data on labour market participation following intervention were obtained from a national register and analysed in two ways: as a binary outcome expressed as active or passive relief at a 1-year follow-up and as four different categories for labour market participation. Logistic regression and sequence analysis were performed. Results The logistic regression analysis showed no difference in labour market participation for patients in the two groups after 1 year. Applying sequence analysis showed differences in subsequent labour market participation after 2 years after baseline in favour of the brief intervention group versus the multidisciplinary intervention group. Conclusion The study indicated that sequence analysis could provide added analytical value as a supplement to traditional regression analysis in prospective studies of RTW among patients with LBP. PMID:28729315
2017-03-23
PUBLIC RELEASE; DISTRIBUTION UNLIMITED Using Multiple and Logistic Regression to Estimate the Median Will- Cost and Probability of Cost and... Cost and Probability of Cost and Schedule Overrun for Program Managers Ryan C. Trudelle Follow this and additional works at: https://scholar.afit.edu...afit.edu. Recommended Citation Trudelle, Ryan C., "Using Multiple and Logistic Regression to Estimate the Median Will- Cost and Probability of Cost and
Motl, Robert W; Fernhall, Bo
2012-03-01
To examine the accuracy of predicting peak oxygen consumption (VO(2peak)) primarily from peak work rate (WR(peak)) recorded during a maximal, incremental exercise test on a cycle ergometer among persons with relapsing-remitting multiple sclerosis (RRMS) who had minimal disability. Cross-sectional study. Clinical research laboratory. Women with RRMS (n=32) and sex-, age-, height-, and weight-matched healthy controls (n=16) completed an incremental exercise test on a cycle ergometer to volitional termination. Not applicable. Measured and predicted VO(2peak) and WR(peak). There were strong, statistically significant associations between measured and predicted VO(2peak) in the overall sample (R(2)=.89, standard error of the estimate=127.4 mL/min) and subsamples with (R(2)=.89, standard error of the estimate=131.3 mL/min) and without (R(2)=.85, standard error of the estimate=126.8 mL/min) multiple sclerosis (MS) based on the linear regression analyses. Based on the 95% confidence limits for worst-case errors, the equation predicted VO(2peak) within 10% of its true value in 95 of every 100 subjects with MS. Peak VO(2) can be accurately predicted in persons with RRMS who have minimal disability as it is in controls by using established equations and WR(peak) recorded from a maximal, incremental exercise test on a cycle ergometer. Copyright © 2012 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.
The effect of social deprivation on local authority sickness absence rates.
Wynn, P; Low, A
2008-06-01
There is an extensive body of research relating to the association between ergonomic and psychosocial factors on sickness absence rates. The impact of deprivation on health indices has also been extensively investigated. However, published research has not investigated the extent of any association between standard measures of deprivation and sickness absence and ill-health retirement rates. To establish if a relationship exists between standard measures of deprivation, used by the UK central government to determine regional health and social welfare funding, and sickness absence and ill-health early retirement rates in English local government employers. Local authority sickness absence rates for 2001-02 were regressed against the 2004 Indices of Multiple Deprivation in a multiple regression model that also included size and type of organization as independent variables. A second model using ill-health retirement as the dependent variable was also estimated. In the full regression models, organization size was not significant and reduced models with deprivation and organization type (depending on whether teachers were employed by the organization or not) were estimated. For the sickness absence model, the adjusted R(2) was 0.20, with 17% of the variation in sickness absence rates being explained by deprivation rank. Ill-health retirement showed a similar relationship with deprivation. In both models, the deprivation coefficients were highly significant: for sickness absence [t = -7.85 (P = 0.00)] and for ill-health retirement [t = -4.79 (P = 0.00)]. A significant proportion of variation in sickness absence and ill-health retirement rates in local government in England are associated with local measures of deprivation. Recognition of the impact of deprivation on sickness absence has implications for a number of different areas of work. These include target setting for Local Government Best Value Performance Indicators, history taking in sickness absence consultations and the role of deprivation as a confounding factor in sickness absence intervention studies.
Tools to Support Interpreting Multiple Regression in the Face of Multicollinearity
Kraha, Amanda; Turner, Heather; Nimon, Kim; Zientek, Linda Reichwein; Henson, Robin K.
2012-01-01
While multicollinearity may increase the difficulty of interpreting multiple regression (MR) results, it should not cause undue problems for the knowledgeable researcher. In the current paper, we argue that rather than using one technique to investigate regression results, researchers should consider multiple indices to understand the contributions that predictors make not only to a regression model, but to each other as well. Some of the techniques to interpret MR effects include, but are not limited to, correlation coefficients, beta weights, structure coefficients, all possible subsets regression, commonality coefficients, dominance weights, and relative importance weights. This article will review a set of techniques to interpret MR effects, identify the elements of the data on which the methods focus, and identify statistical software to support such analyses. PMID:22457655
Tools to support interpreting multiple regression in the face of multicollinearity.
Kraha, Amanda; Turner, Heather; Nimon, Kim; Zientek, Linda Reichwein; Henson, Robin K
2012-01-01
While multicollinearity may increase the difficulty of interpreting multiple regression (MR) results, it should not cause undue problems for the knowledgeable researcher. In the current paper, we argue that rather than using one technique to investigate regression results, researchers should consider multiple indices to understand the contributions that predictors make not only to a regression model, but to each other as well. Some of the techniques to interpret MR effects include, but are not limited to, correlation coefficients, beta weights, structure coefficients, all possible subsets regression, commonality coefficients, dominance weights, and relative importance weights. This article will review a set of techniques to interpret MR effects, identify the elements of the data on which the methods focus, and identify statistical software to support such analyses.
Ramsthaler, Frank; Kettner, Mattias; Verhoff, Marcel A
2014-01-01
In forensic anthropological casework, estimating age-at-death is key to profiling unknown skeletal remains. The aim of this study was to examine the reliability of a new, simple, fast, and inexpensive digital odontological method for age-at-death estimation. The method is based on the original Lamendin method, which is a widely used technique in the repertoire of odontological aging methods in forensic anthropology. We examined 129 single root teeth employing a digital camera and imaging software for the measurement of the luminance of the teeth's translucent root zone. Variability in luminance detection was evaluated using statistical technical error of measurement analysis. The method revealed stable values largely unrelated to observer experience, whereas requisite formulas proved to be camera-specific and should therefore be generated for an individual recording setting based on samples of known chronological age. Multiple regression analysis showed a highly significant influence of the coefficients of the variables "arithmetic mean" and "standard deviation" of luminance for the regression formula. For the use of this primer multivariate equation for age-at-death estimation in casework, a standard error of the estimate of 6.51 years was calculated. Step-by-step reduction of the number of embedded variables to linear regression analysis employing the best contributor "arithmetic mean" of luminance yielded a regression equation with a standard error of 6.72 years (p < 0.001). The results of this study not only support the premise of root translucency as an age-related phenomenon, but also demonstrate that translucency reflects a number of other influencing factors in addition to age. This new digital measuring technique of the zone of dental root luminance can broaden the array of methods available for estimating chronological age, and furthermore facilitate measurement and age classification due to its low dependence on observer experience.
NASA Astrophysics Data System (ADS)
Zahari, Siti Meriam; Ramli, Norazan Mohamed; Moktar, Balkiah; Zainol, Mohammad Said
2014-09-01
In the presence of multicollinearity and multiple outliers, statistical inference of linear regression model using ordinary least squares (OLS) estimators would be severely affected and produces misleading results. To overcome this, many approaches have been investigated. These include robust methods which were reported to be less sensitive to the presence of outliers. In addition, ridge regression technique was employed to tackle multicollinearity problem. In order to mitigate both problems, a combination of ridge regression and robust methods was discussed in this study. The superiority of this approach was examined when simultaneous presence of multicollinearity and multiple outliers occurred in multiple linear regression. This study aimed to look at the performance of several well-known robust estimators; M, MM, RIDGE and robust ridge regression estimators, namely Weighted Ridge M-estimator (WRM), Weighted Ridge MM (WRMM), Ridge MM (RMM), in such a situation. Results of the study showed that in the presence of simultaneous multicollinearity and multiple outliers (in both x and y-direction), the RMM and RIDGE are more or less similar in terms of superiority over the other estimators, regardless of the number of observation, level of collinearity and percentage of outliers used. However, when outliers occurred in only single direction (y-direction), the WRMM estimator is the most superior among the robust ridge regression estimators, by producing the least variance. In conclusion, the robust ridge regression is the best alternative as compared to robust and conventional least squares estimators when dealing with simultaneous presence of multicollinearity and outliers.
Chronic atrophic gastritis in association with hair mercury level.
Xue, Zeyun; Xue, Huiping; Jiang, Jianlan; Lin, Bing; Zeng, Si; Huang, Xiaoyun; An, Jianfu
2014-11-01
The objective of this study was to explore hair mercury level in association with chronic atrophic gastritis, a precancerous stage of gastric cancer (GC), and thus provide a brand new angle of view on the timely intervention of precancerous stage of GC. We recruited 149 healthy volunteers as controls and 152 patients suffering from chronic gastritis as cases. The controls denied upper gastrointestinal discomforts, and the cases were diagnosed as chronic superficial gastritis (n=68) or chronic atrophic gastritis (n=84). We utilized Mercury Automated Analyzer (NIC MA-3000) to detect hair mercury level of both healthy controls and cases of chronic gastritis. The statistic of measurement data was expressed as mean ± standard deviation, which was analyzed using Levene variance equality test and t test. Pearson correlation analysis was employed to determine associated factors affecting hair mercury levels, and multiple stepwise regression analysis was performed to deduce regression equations. Statistical significance is considered if p value is less than 0.05. The overall hair mercury level was 0.908949 ± 0.8844490 ng/g [mean ± standard deviation (SD)] in gastritis cases and 0.460198 ± 0.2712187 ng/g (mean±SD) in healthy controls; the former level was significantly higher than the latter one (p=0.000<0.01). The hair mercury level in chronic atrophic gastritis subgroup was 1.155220 ± 0.9470246 ng/g (mean ± SD) and that in chronic superficial gastritis subgroup was 0.604732 ± 0.6942509 ng/g (mean ± SD); the former level was significantly higher than the latter level (p<0.01). The hair mercury level in chronic superficial gastritis cases was significantly higher than that in healthy controls (p<0.05). The hair mercury level in chronic atrophic gastritis cases was significantly higher than that in healthy controls (p<0.01). Stratified analysis indicated that the hair mercury level in healthy controls with eating seafood was significantly higher than that in healthy controls without eating seafood (p<0.01) and that the hair mercury level in chronic atrophic gastritis cases was significantly higher than that in chronic superficial gastritis cases (p<0.01). Pearson correlation analysis indicated that eating seafood was most correlated with hair mercury level and positively correlated in the healthy controls and that the severity of gastritis was most correlated with hair mercury level and positively correlated in the gastritis cases. Multiple stepwise regression analysis indicated that the regression equation of hair mercury level in controls could be expressed as 0.262 multiplied the value of eating seafood plus 0.434, the model that was statistically significant (p<0.01). Multiple stepwise regression analysis also indicated that the regression equation of hair mercury level in gastritis cases could be expressed as 0.305 multiplied the severity of gastritis, the model that was also statistically significant (p<0.01). The graphs of regression standardized residual for both controls and cases conformed to normal distribution. The main positively correlated factor affecting the hair mercury level is eating seafood in healthy people whereas the predominant positively correlated factor affecting the hair mercury level is the severity of gastritis in chronic gastritis patients. That is to say, the severity of chronic gastritis is positively correlated with the level of hair mercury. The incessantly increased level of hair mercury possibly reflects the development of gastritis from normal stomach to superficial gastritis and to atrophic gastritis. The detection of hair mercury is potentially a means to predict the severity of chronic gastritis and possibly to insinuate the environmental mercury threat to human health in terms of gastritis or even carcinogenesis.
An improved multiple linear regression and data analysis computer program package
NASA Technical Reports Server (NTRS)
Sidik, S. M.
1972-01-01
NEWRAP, an improved version of a previous multiple linear regression program called RAPIER, CREDUC, and CRSPLT, allows for a complete regression analysis including cross plots of the independent and dependent variables, correlation coefficients, regression coefficients, analysis of variance tables, t-statistics and their probability levels, rejection of independent variables, plots of residuals against the independent and dependent variables, and a canonical reduction of quadratic response functions useful in optimum seeking experimentation. A major improvement over RAPIER is that all regression calculations are done in double precision arithmetic.
Santric-Milicevic, M; Vasic, V; Terzic-Supic, Z
2016-08-15
In times of austerity, the availability of econometric health knowledge assists policy-makers in understanding and balancing health expenditure with health care plans within fiscal constraints. The objective of this study is to explore whether the health workforce supply of the public health care sector, population number, and utilization of inpatient care significantly contribute to total health expenditure. The dependent variable is the total health expenditure (THE) in Serbia from the years 2003 to 2011. The independent variables are the number of health workers employed in the public health care sector, population number, and inpatient care discharges per 100 population. The statistical analyses include the quadratic interpolation method, natural logarithm and differentiation, and multiple linear regression analyses. The level of significance is set at P < 0.05. The regression model captures 90 % of all variations of observed dependent variables (adjusted R square), and the model is significant (P < 0.001). Total health expenditure increased by 1.21 standard deviations, with an increase in health workforce growth rate by 1 standard deviation. Furthermore, this rate decreased by 1.12 standard deviations, with an increase in (negative) population growth rate by 1 standard deviation. Finally, the growth rate increased by 0.38 standard deviation, with an increase of the growth rate of inpatient care discharges per 100 population by 1 standard deviation (P < 0.001). Study results demonstrate that the government has been making an effort to control strongly health budget growth. Exploring causality relationships between health expenditure and health workforce is important for countries that are trying to consolidate their public health finances and achieve universal health coverage at the same time.
Female homicide in Rio Grande do Sul, Brazil.
Leites, Gabriela Tomedi; Meneghel, Stela Nazareth; Hirakata, Vania Noemi
2014-01-01
This study aimed to assess the female homicide rate due to aggression in Rio Grande do Sul, Brazil, using this as a "proxy" of femicide. This was an ecological study which correlated the female homicide rate due to aggression in Rio Grande do Sul, according to the 35 microregions defined by the Brazilian Institute of Geography and Statistics (IBGE), with socioeconomic and demographic variables access and health indicators. Pearson's correlation test was performed with the selected variables. After this, multiple linear regressions were performed with variables with p < 0.20. The standardized average of female homicide rate due to aggression in the period from 2003 to 2007 was 3.1 obits per 100 thousand. After multiple regression analysis, the final model included male mortality due to aggression (p = 0.016), the percentage of hospital admissions for alcohol (p = 0.005) and the proportion of ill-defined deaths (p = 0.015). The model have an explanatory power of 39% (adjusted r2 = 0.391). The results are consistent with other studies and indicate a strong relationship between structural violence in society and violence against women, in addition to a higher incidence of female deaths in places with high alcohol hospitalization.
Time-localized wavelet multiple regression and correlation
NASA Astrophysics Data System (ADS)
Fernández-Macho, Javier
2018-02-01
This paper extends wavelet methodology to handle comovement dynamics of multivariate time series via moving weighted regression on wavelet coefficients. The concept of wavelet local multiple correlation is used to produce one single set of multiscale correlations along time, in contrast with the large number of wavelet correlation maps that need to be compared when using standard pairwise wavelet correlations with rolling windows. Also, the spectral properties of weight functions are investigated and it is argued that some common time windows, such as the usual rectangular rolling window, are not satisfactory on these grounds. The method is illustrated with a multiscale analysis of the comovements of Eurozone stock markets during this century. It is shown how the evolution of the correlation structure in these markets has been far from homogeneous both along time and across timescales featuring an acute divide across timescales at about the quarterly scale. At longer scales, evidence from the long-term correlation structure can be interpreted as stable perfect integration among Euro stock markets. On the other hand, at intramonth and intraweek scales, the short-term correlation structure has been clearly evolving along time, experiencing a sharp increase during financial crises which may be interpreted as evidence of financial 'contagion'.
ERIC Educational Resources Information Center
Baylor, Carolyn; Yorkston, Kathryn; Bamer, Alyssa; Britton, Deanna; Amtmann, Dagmar
2010-01-01
Purpose: To explore variables associated with self-reported communicative participation in a sample (n = 498) of community-dwelling adults with multiple sclerosis (MS). Method: A battery of questionnaires was administered online or on paper per participant preference. Data were analyzed using multiple linear backward stepwise regression. The…
Prediction models for clustered data: comparison of a random intercept and standard regression model
2013-01-01
Background When study data are clustered, standard regression analysis is considered inappropriate and analytical techniques for clustered data need to be used. For prediction research in which the interest of predictor effects is on the patient level, random effect regression models are probably preferred over standard regression analysis. It is well known that the random effect parameter estimates and the standard logistic regression parameter estimates are different. Here, we compared random effect and standard logistic regression models for their ability to provide accurate predictions. Methods Using an empirical study on 1642 surgical patients at risk of postoperative nausea and vomiting, who were treated by one of 19 anesthesiologists (clusters), we developed prognostic models either with standard or random intercept logistic regression. External validity of these models was assessed in new patients from other anesthesiologists. We supported our results with simulation studies using intra-class correlation coefficients (ICC) of 5%, 15%, or 30%. Standard performance measures and measures adapted for the clustered data structure were estimated. Results The model developed with random effect analysis showed better discrimination than the standard approach, if the cluster effects were used for risk prediction (standard c-index of 0.69 versus 0.66). In the external validation set, both models showed similar discrimination (standard c-index 0.68 versus 0.67). The simulation study confirmed these results. For datasets with a high ICC (≥15%), model calibration was only adequate in external subjects, if the used performance measure assumed the same data structure as the model development method: standard calibration measures showed good calibration for the standard developed model, calibration measures adapting the clustered data structure showed good calibration for the prediction model with random intercept. Conclusion The models with random intercept discriminate better than the standard model only if the cluster effect is used for predictions. The prediction model with random intercept had good calibration within clusters. PMID:23414436
Bouwmeester, Walter; Twisk, Jos W R; Kappen, Teus H; van Klei, Wilton A; Moons, Karel G M; Vergouwe, Yvonne
2013-02-15
When study data are clustered, standard regression analysis is considered inappropriate and analytical techniques for clustered data need to be used. For prediction research in which the interest of predictor effects is on the patient level, random effect regression models are probably preferred over standard regression analysis. It is well known that the random effect parameter estimates and the standard logistic regression parameter estimates are different. Here, we compared random effect and standard logistic regression models for their ability to provide accurate predictions. Using an empirical study on 1642 surgical patients at risk of postoperative nausea and vomiting, who were treated by one of 19 anesthesiologists (clusters), we developed prognostic models either with standard or random intercept logistic regression. External validity of these models was assessed in new patients from other anesthesiologists. We supported our results with simulation studies using intra-class correlation coefficients (ICC) of 5%, 15%, or 30%. Standard performance measures and measures adapted for the clustered data structure were estimated. The model developed with random effect analysis showed better discrimination than the standard approach, if the cluster effects were used for risk prediction (standard c-index of 0.69 versus 0.66). In the external validation set, both models showed similar discrimination (standard c-index 0.68 versus 0.67). The simulation study confirmed these results. For datasets with a high ICC (≥15%), model calibration was only adequate in external subjects, if the used performance measure assumed the same data structure as the model development method: standard calibration measures showed good calibration for the standard developed model, calibration measures adapting the clustered data structure showed good calibration for the prediction model with random intercept. The models with random intercept discriminate better than the standard model only if the cluster effect is used for predictions. The prediction model with random intercept had good calibration within clusters.
Suzuki, Taku; Iwamoto, Takuji; Shizu, Kanae; Suzuki, Katsuji; Yamada, Harumoto; Sato, Kazuki
2017-05-01
This retrospective study was designed to investigate prognostic factors for postoperative outcomes for cubital tunnel syndrome (CubTS) using multiple logistic regression analysis with a large number of patients. Eighty-three patients with CubTS who underwent surgeries were enrolled. The following potential prognostic factors for disease severity were selected according to previous reports: sex, age, type of surgery, disease duration, body mass index, cervical lesion, presence of diabetes mellitus, Workers' Compensation status, preoperative severity, and preoperative electrodiagnostic testing. Postoperative severity of disease was assessed 2 years after surgery by Messina's criteria which is an outcome measure specifically for CubTS. Bivariate analysis was performed to select candidate prognostic factors for multiple linear regression analyses. Multiple logistic regression analysis was conducted to identify the association between postoperative severity and selected prognostic factors. Both bivariate and multiple linear regression analysis revealed only preoperative severity as an independent risk factor for poor prognosis, while other factors did not show any significant association. Although conflicting results exist regarding prognosis of CubTS, this study supports evidence from previous studies and concludes early surgical intervention portends the most favorable prognosis. Copyright © 2017 The Japanese Orthopaedic Association. Published by Elsevier B.V. All rights reserved.
Oka, Mayumi; Yamamoto, Mio; Mure, Kanae; Takeshita, Tatsuya; Arita, Mikio
2016-01-01
This study aims to investigate factors that contribute to the differences in incidence of hypertension between different regions in Japan, by accounting for not only individual lifestyles, but also their living environments. The target participants of this survey were individuals who received medical treatment for hypertension, as well as hypertension patients who have not received any treatment. The objective variable for analysis was the incidence of hypertension as data aggregated per prefecture. We used data (in men) including obesity, salt intake, vegetable intake, habitual alcohol consumption, habitual smoking, and number of steps walked per day. The variables within living environment included number of rail stations, standard/light vehicle usage, and slope of habitable land. In addition, we analyzed data for the variables related to medical environment including, participation rate in medical check-ups and number of hospitals. We performed multiple stepwise regression analyses to elucidate the correlation of these variables by using hypertension incidence as the objective variable. Hypertension incidence showed a significant negative correlation with walking and medical check-ups, and a significant positive correlation with light-vehicle usage and slope. Between the number of steps and variables related to the living environment, number of rail stations showed a significant positive correlation, while, standard- and light-vehicle usage showed significant negative correlation. Moreover, with stepwise multiple regression analysis, walking showed the strongest effect. The differences in daily walking based on living environment were associated with the disparities in the hypertension incidence in Japan. PMID:27788198
Soemantri, Diantha; Jusuf, Anwar
2016-01-01
Objectives This study applied self-determination theory (SDT) to investigate the relationship between students’ autonomous motivation and tutors’ autonomy support in medical students’ academic achievement. Methods This was a cross-sectional study. Out of 204 students in a fundamental medical science course, 199 participated in the study. Data was collected using two questionnaires: the Learning Self-Regulation and Learning Climate Questionnaires. The score of the course assessment was the measure of academic achievement. Data was analyzed and reported with descriptive and inferential statistics (mean, standard deviation and multiple regression analysis). Results Mean score (±standard deviation) of the autonomous motivation, tutors’ autonomy support, and academic achievement were 5.48±0.89, 5.22±0.92, and 5.22±0.92. Multiple regression results reported students’ autonomous motivation was associated with improvement of students’ academic achievement (β=15.2, p=0.004). However, augmentation of tutors’ autonomy support was not reflected in the improvement of students’ academic achievement (β = -12.6, p = 0.019). Both students’ autonomous motivation and tutors’ autonomy support had a contribution of about 4.2% students’ academic achievement (F = 4.343, p = 0.014, R2 = 0.042). Conclusions Due to the unique characteristic of our medical students’ educational background, our study shows that tutors’ autonomy support is inconsistent with students’ academic achievement. However, both autonomous motivation and support are essential to students’ academic achievement. Further study is needed to explore students’ educational background and self-regulated learning competence to improve students’ academic achievement. PMID:28035054
Inverse Association between Air Pressure and Rheumatoid Arthritis Synovitis
Furu, Moritoshi; Nakabo, Shuichiro; Ohmura, Koichiro; Nakashima, Ran; Imura, Yoshitaka; Yukawa, Naoichiro; Yoshifuji, Hajime; Matsuda, Fumihiko; Ito, Hiromu; Fujii, Takao; Mimori, Tsuneyo
2014-01-01
Rheumatoid arthritis (RA) is a bone destructive autoimmune disease. Many patients with RA recognize fluctuations of their joint synovitis according to changes of air pressure, but the correlations between them have never been addressed in large-scale association studies. To address this point we recruited large-scale assessments of RA activity in a Japanese population, and performed an association analysis. Here, a total of 23,064 assessments of RA activity from 2,131 patients were obtained from the KURAMA (Kyoto University Rheumatoid Arthritis Management Alliance) database. Detailed correlations between air pressure and joint swelling or tenderness were analyzed separately for each of the 326 patients with more than 20 assessments to regulate intra-patient correlations. Association studies were also performed for seven consecutive days to identify the strongest correlations. Standardized multiple linear regression analysis was performed to evaluate independent influences from other meteorological factors. As a result, components of composite measures for RA disease activity revealed suggestive negative associations with air pressure. The 326 patients displayed significant negative mean correlations between air pressure and swellings or the sum of swellings and tenderness (p = 0.00068 and 0.00011, respectively). Among the seven consecutive days, the most significant mean negative correlations were observed for air pressure three days before evaluations of RA synovitis (p = 1.7×10−7, 0.00027, and 8.3×10−8, for swellings, tenderness and the sum of them, respectively). Standardized multiple linear regression analysis revealed these associations were independent from humidity and temperature. Our findings suggest that air pressure is inversely associated with synovitis in patients with RA. PMID:24454853
Castelo, Paula Midori; Gavião, Maria Beatriz Duarte; Pereira, Luciano José; Bonjardim, Leonardo Rigoldi
2010-01-01
The maintenance of normal conditions of the masticatory function is determinant for the correct growth and development of its structures. Thus, the aims of this study were to evaluate the influence of sucking habits on the presence of crossbite and its relationship with maximal bite force, facial morphology and body variables in 67 children of both genders (3.5-7 years) with primary or early mixed dentition. The children were divided in four groups: primary-normocclusion (PN, n=19), primary-crossbite (PC, n=19), mixed-normocclusion (MN, n=13), and mixed-crossbite (MC, n=16). Bite force was measured with a pressurized tube, and facial morphology was determined by standardized frontal photographs: AFH (anterior face height) and BFW (bizygomatic facial width). It was observed that MC group showed lower bite force than MN, and AFH/BFW was significantly smaller in PN than PC (t-test). Weight and height were only significantly correlated with bite force in PC group (Pearson's correlation test). In the primary dentition, AFH/BFW and breast-feeding (at least six months) were positive and negatively associated with crossbite, respectively (multiple logistic regression). In the mixed dentition, breast-feeding and bite force showed negative associations with crossbite (univariate regression), while nonnutritive sucking (up to 3 years) associated significantly with crossbite in all groups (multiple logistic regression). In the studied sample, sucking habits played an important role in the etiology of crossbite, which was associated with lower bite force and long-face tendency.
Clinical and Radiologic Predictive Factors of Rib Fractures in Outpatients With Chest Pain.
Zhang, Liang; McMahon, Colm J; Shah, Samir; Wu, Jim S; Eisenberg, Ronald L; Kung, Justin W
To identify the clinical and radiologic predictive factors of rib fractures in stable adult outpatients presenting with chest pain and to determine the utility of dedicated rib radiographs in this population of patients. Following Institutional Review Board approval, we performed a retrospective review of 339 consecutive cases in which a frontal chest radiograph and dedicated rib series had been obtained for chest pain in the outpatient setting. The frontal chest radiograph and dedicated rib series were sequentially reviewed in consensus by two fellowship-trained musculoskeletal radiologists blinded to the initial report. The consensus interpretation of the dedicated rib series was used as the gold standard. Multiple variable logistic regression analysis assessed clinical and radiological factors associated with rib fractures. Fisher exact test was used to assess differences in medical treatment between the 2 groups. Of the 339 patients, 53 (15.6%) had at least 1 rib fracture. Only 20 of the 53 (37.7%) patients' fractures could be identified on the frontal chest radiograph. The frontal chest radiograph had a sensitivity of 38% and specificity of 100% when using the rib series as the reference standard. No pneumothorax, new mediastinal widening or pulmonary contusion was identified. Multiple variable logistic regression analysis of clinical factors associated with the presence of rib fractures revealed a significant association of trauma history (odds ratio 5.7 [p < 0.05]) and age ≥40 (odds radio 3.1 [p < 0.05]). Multiple variable logistic regression analysis of radiographic factors associated with rib fractures in this population demonstrated a significant association of pleural effusion with rib fractures (odds ratio 18.9 [p < 0.05]). Patients with rib fractures received narcotic analgesia in 47.2% of the cases, significantly more than those without rib fractures (21.3%, p < 0.05). None of the patients required hospitalization. In the stable outpatient setting, rib fractures have a higher association with a history of minor trauma and age ≥40 in the adult population. Radiographic findings associated with rib fractures include pleural effusion. The frontal chest radiograph alone has low sensitivity in detecting rib fractures. The dedicated rib series detected a greater number of rib fractures. Although no patients required hospitalization, those with rib fractures were more likely to receive narcotic analgesia. Copyright © 2018 Elsevier Inc. All rights reserved.
Pediatric magnet ingestions: the dark side of the force.
Brown, Julie C; Otjen, Jeffrey P; Drugas, George T
2014-05-01
Pediatric magnet ingestions are increasing. Commercial availability of rare-earth magnets poses a serious health risk. This study defines incidence, characteristics, and management of ingestions over time. Cases were identified by searching radiology reports from June 2002 to December 2012 at a children's hospital and verified by chart and imaging review. Relative risk (RR) regressions determined changes in incidence and interventions over time. In all, 98% of ingestions occurred since 2006; 57% involved multiple magnets. Median age was 8 years (range 0 to 18); 0% of single and 56% of multiple ingestions required intervention. Compared with 2007 to 2009, ingestions increased from 2010 to 2012 (RR = 1.9, 95% confidence interval 1.2 to 3.0). Intervention proportion was unchanged (RR = .94, 95% confidence interval .4 to 2.2). Small spherical magnets comprised 26.8% of ingestions since 2010; 86% involved multiple magnets and 47% required intervention. Pediatric magnet ingestions and interventions have increased. Multiple ingestions prompt more imaging and surgical interventions. Magnet safety standards are needed to decrease risk to children. Copyright © 2014 Elsevier Inc. All rights reserved.
Lee, Wanhyung; Yeom, Hyungseon; Yoon, Jin-Ha; Won, Jong-Uk; Jung, Pil Kyun; Lee, June-Hee; Seok, Hongdeok; Roh, Jaehoon
2016-08-01
Occupation influences the risk for developing chronic metabolic diseases. We compared the prevalence of MetS by International Standard Classification of Occupations using the nationally representative data in Korea (KNHANES). We enrolled 16,763 workers (9,175 males; 7,588 females) who had measurements for the National Cholesterol Education Program criteria III and other variables. OR and 95%CIs for MetS and its components were estimated according to occupation using the multiple logistic regression models. The occupational groups with the highest age-standardized prevalence of MetS were lower skilled white-collar men (31.1 ± 2.4%) and green-collar women (24.2 ± 2.9%). Compared with the unskilled male blue-collar group, which had the lowest prevalence of MetS, the OR (95%CIs) of MetS in men were 1.77 (1.45-2.15) in higher skilled white-collar, 1.82 (1.47-2.26) in lower-skilled white-collar, 1.63 (1.32-2.01) in pink-collar and 1.37 (1.13-1.66) in skilled blue-collar workers in final logistic regression model. MetS and its components vary by occupational category and gender in ways that may guide health interventions. Am. J. Ind. Med. 59:685-694, 2016. © 2016 Wiley Periodicals, Inc. © 2016 Wiley Periodicals, Inc.
Major controlling factors and prediction models for arsenic uptake from soil to wheat plants.
Dai, Yunchao; Lv, Jialong; Liu, Ke; Zhao, Xiaoyan; Cao, Yingfei
2016-08-01
The application of current Chinese agriculture soil quality standards fails to evaluate the land utilization functions appropriately due to the diversity of soil properties and plant species. Therefore, the standards should be amended. A greenhouse experiment was conducted to investigate arsenic (As) enrichment in various soils from 18 Chinese provinces in parallel with As transfer to 8 wheat varieties. The goal of the study was to build and calibrate soil-wheat threshold models to forecast the As threshold of wheat soils. In Shaanxi soils, Wanmai and Jimai were the most sensitive and insensitive wheat varieties, respectively; and in Jiangxi soils, Zhengmai and Xumai were the most sensitive and insensitive wheat varieties, respectively. Relationships between soil properties and the bioconcentration factor (BCF) were built based on stepwise multiple linear regressions. Soil pH was the best predictor of BCF, and after normalizing the regression equation (Log BCF=0.2054 pH- 3.2055, R(2)=0.8474, n=14, p<0.001), we obtained a calibrated model. Using the calibrated model, a continuous soil-wheat threshold equation (HC5=10((-0.2054 pH+2.9935))+9.2) was obtained for the species-sensitive distribution curve, which was built on Chinese food safety standards. The threshold equation is a helpful tool that can be applied to estimate As uptake from soil to wheat. Copyright © 2016 Elsevier Inc. All rights reserved.
Advanced statistics: linear regression, part I: simple linear regression.
Marill, Keith A
2004-01-01
Simple linear regression is a mathematical technique used to model the relationship between a single independent predictor variable and a single dependent outcome variable. In this, the first of a two-part series exploring concepts in linear regression analysis, the four fundamental assumptions and the mechanics of simple linear regression are reviewed. The most common technique used to derive the regression line, the method of least squares, is described. The reader will be acquainted with other important concepts in simple linear regression, including: variable transformations, dummy variables, relationship to inference testing, and leverage. Simplified clinical examples with small datasets and graphic models are used to illustrate the points. This will provide a foundation for the second article in this series: a discussion of multiple linear regression, in which there are multiple predictor variables.
Gíslason, Magnús; Sigurðsson, Sigurður; Guðnason, Vilmundur; Harris, Tamara; Carraro, Ugo; Gargiulo, Paolo
2018-01-01
Sarcopenic muscular degeneration has been consistently identified as an independent risk factor for mortality in aging populations. Recent investigations have realized the quantitative potential of computed tomography (CT) image analysis to describe skeletal muscle volume and composition; however, the optimum approach to assessing these data remains debated. Current literature reports average Hounsfield unit (HU) values and/or segmented soft tissue cross-sectional areas to investigate muscle quality. However, standardized methods for CT analyses and their utility as a comorbidity index remain undefined, and no existing studies compare these methods to the assessment of entire radiodensitometric distributions. The primary aim of this study was to present a comparison of nonlinear trimodal regression analysis (NTRA) parameters of entire radiodensitometric muscle distributions against extant CT metrics and their correlation with lower extremity function (LEF) biometrics (normal/fast gait speed, timed up-and-go, and isometric leg strength) and biochemical and nutritional parameters, such as total solubilized cholesterol (SCHOL) and body mass index (BMI). Data were obtained from 3,162 subjects, aged 66–96 years, from the population-based AGES-Reykjavik Study. 1-D k-means clustering was employed to discretize each biometric and comorbidity dataset into twelve subpopulations, in accordance with Sturges’ Formula for Class Selection. Dataset linear regressions were performed against eleven NTRA distribution parameters and standard CT analyses (fat/muscle cross-sectional area and average HU value). Parameters from NTRA and CT standards were analogously assembled by age and sex. Analysis of specific NTRA parameters with standard CT results showed linear correlation coefficients greater than 0.85, but multiple regression analysis of correlative NTRA parameters yielded a correlation coefficient of 0.99 (P<0.005). These results highlight the specificities of each muscle quality metric to LEF biometrics, SCHOL, and BMI, and particularly highlight the value of the connective tissue regime in this regard. PMID:29513690
Edmunds, Kyle; Gíslason, Magnús; Sigurðsson, Sigurður; Guðnason, Vilmundur; Harris, Tamara; Carraro, Ugo; Gargiulo, Paolo
2018-01-01
Sarcopenic muscular degeneration has been consistently identified as an independent risk factor for mortality in aging populations. Recent investigations have realized the quantitative potential of computed tomography (CT) image analysis to describe skeletal muscle volume and composition; however, the optimum approach to assessing these data remains debated. Current literature reports average Hounsfield unit (HU) values and/or segmented soft tissue cross-sectional areas to investigate muscle quality. However, standardized methods for CT analyses and their utility as a comorbidity index remain undefined, and no existing studies compare these methods to the assessment of entire radiodensitometric distributions. The primary aim of this study was to present a comparison of nonlinear trimodal regression analysis (NTRA) parameters of entire radiodensitometric muscle distributions against extant CT metrics and their correlation with lower extremity function (LEF) biometrics (normal/fast gait speed, timed up-and-go, and isometric leg strength) and biochemical and nutritional parameters, such as total solubilized cholesterol (SCHOL) and body mass index (BMI). Data were obtained from 3,162 subjects, aged 66-96 years, from the population-based AGES-Reykjavik Study. 1-D k-means clustering was employed to discretize each biometric and comorbidity dataset into twelve subpopulations, in accordance with Sturges' Formula for Class Selection. Dataset linear regressions were performed against eleven NTRA distribution parameters and standard CT analyses (fat/muscle cross-sectional area and average HU value). Parameters from NTRA and CT standards were analogously assembled by age and sex. Analysis of specific NTRA parameters with standard CT results showed linear correlation coefficients greater than 0.85, but multiple regression analysis of correlative NTRA parameters yielded a correlation coefficient of 0.99 (P<0.005). These results highlight the specificities of each muscle quality metric to LEF biometrics, SCHOL, and BMI, and particularly highlight the value of the connective tissue regime in this regard.
Guo, How-Ran
2011-10-20
Despite its limitations, ecological study design is widely applied in epidemiology. In most cases, adjustment for age is necessary, but different methods may lead to different conclusions. To compare three methods of age adjustment, a study on the associations between arsenic in drinking water and incidence of bladder cancer in 243 townships in Taiwan was used as an example. A total of 3068 cases of bladder cancer, including 2276 men and 792 women, were identified during a ten-year study period in the study townships. Three methods were applied to analyze the same data set on the ten-year study period. The first (Direct Method) applied direct standardization to obtain standardized incidence rate and then used it as the dependent variable in the regression analysis. The second (Indirect Method) applied indirect standardization to obtain standardized incidence ratio and then used it as the dependent variable in the regression analysis instead. The third (Variable Method) used proportions of residents in different age groups as a part of the independent variables in the multiple regression models. All three methods showed a statistically significant positive association between arsenic exposure above 0.64 mg/L and incidence of bladder cancer in men and women, but different results were observed for the other exposure categories. In addition, the risk estimates obtained by different methods for the same exposure category were all different. Using an empirical example, the current study confirmed the argument made by other researchers previously that whereas the three different methods of age adjustment may lead to different conclusions, only the third approach can obtain unbiased estimates of the risks. The third method can also generate estimates of the risk associated with each age group, but the other two are unable to evaluate the effects of age directly.
Hacisalihoglu, Gokhan; Larbi, Bismark; Settles, A Mark
2010-01-27
The objective of this study was to explore the potential of near-infrared reflectance (NIR) spectroscopy to determine individual seed composition in common bean ( Phaseolus vulgaris L.). NIR spectra and analytical measurements of seed weight, protein, and starch were collected from 267 individual bean seeds representing 91 diverse genotypes. Partial least-squares (PLS) regression models were developed with 61 bean accessions randomly assigned to a calibration data set and 30 accessions assigned to an external validation set. Protein gave the most accurate PLS regression, with the external validation set having a standard error of prediction (SEP) = 1.6%. PLS regressions for seed weight and starch had sufficient accuracy for seed sorting applications, with SEP = 41.2 mg and 4.9%, respectively. Seed color had a clear effect on the NIR spectra, with black beans having a distinct spectral type. Seed coat color did not impact the accuracy of PLS predictions. This research demonstrates that NIR is a promising technique for simultaneous sorting of multiple seed traits in single bean seeds with no sample preparation.
Estimating Interaction Effects With Incomplete Predictor Variables
Enders, Craig K.; Baraldi, Amanda N.; Cham, Heining
2014-01-01
The existing missing data literature does not provide a clear prescription for estimating interaction effects with missing data, particularly when the interaction involves a pair of continuous variables. In this article, we describe maximum likelihood and multiple imputation procedures for this common analysis problem. We outline 3 latent variable model specifications for interaction analyses with missing data. These models apply procedures from the latent variable interaction literature to analyses with a single indicator per construct (e.g., a regression analysis with scale scores). We also discuss multiple imputation for interaction effects, emphasizing an approach that applies standard imputation procedures to the product of 2 raw score predictors. We thoroughly describe the process of probing interaction effects with maximum likelihood and multiple imputation. For both missing data handling techniques, we outline centering and transformation strategies that researchers can implement in popular software packages, and we use a series of real data analyses to illustrate these methods. Finally, we use computer simulations to evaluate the performance of the proposed techniques. PMID:24707955
NASA Astrophysics Data System (ADS)
Nishidate, Izumi; Wiswadarma, Aditya; Hase, Yota; Tanaka, Noriyuki; Maeda, Takaaki; Niizeki, Kyuichi; Aizu, Yoshihisa
2011-08-01
In order to visualize melanin and blood concentrations and oxygen saturation in human skin tissue, a simple imaging technique based on multispectral diffuse reflectance images acquired at six wavelengths (500, 520, 540, 560, 580 and 600nm) was developed. The technique utilizes multiple regression analysis aided by Monte Carlo simulation for diffuse reflectance spectra. Using the absorbance spectrum as a response variable and the extinction coefficients of melanin, oxygenated hemoglobin, and deoxygenated hemoglobin as predictor variables, multiple regression analysis provides regression coefficients. Concentrations of melanin and total blood are then determined from the regression coefficients using conversion vectors that are deduced numerically in advance, while oxygen saturation is obtained directly from the regression coefficients. Experiments with a tissue-like agar gel phantom validated the method. In vivo experiments with human skin of the human hand during upper limb occlusion and of the inner forearm exposed to UV irradiation demonstrated the ability of the method to evaluate physiological reactions of human skin tissue.
ERIC Educational Resources Information Center
Quinino, Roberto C.; Reis, Edna A.; Bessegato, Lupercio F.
2013-01-01
This article proposes the use of the coefficient of determination as a statistic for hypothesis testing in multiple linear regression based on distributions acquired by beta sampling. (Contains 3 figures.)
Estimating annual suspended-sediment loads in the northern and central Appalachian Coal region
Koltun, G.F.
1985-01-01
Multiple-regression equations were developed for estimating the annual suspended-sediment load, for a given year, from small to medium-sized basins in the northern and central parts of the Appalachian coal region. The regression analysis was performed with data for land use, basin characteristics, streamflow, rainfall, and suspended-sediment load for 15 sites in the region. Two variables, the maximum mean-daily discharge occurring within the year and the annual peak discharge, explained much of the variation in the annual suspended-sediment load. Separate equations were developed employing each of these discharge variables. Standard errors for both equations are relatively large, which suggests that future predictions will probably have a low level of precision. This level of precision, however, may be acceptable for certain purposes. It is therefore left to the user to asses whether the level of precision provided by these equations is acceptable for the intended application.
NASA Technical Reports Server (NTRS)
Johnson, R. W.; Bahn, G. S.
1977-01-01
Statistical analysis techniques were applied to develop quantitative relationships between in situ river measurements and the remotely sensed data that were obtained over the James River in Virginia on 28 May 1974. The remotely sensed data were collected with a multispectral scanner and with photographs taken from an aircraft platform. Concentration differences among water quality parameters such as suspended sediment, chlorophyll a, and nutrients indicated significant spectral variations. Calibrated equations from the multiple regression analysis were used to develop maps that indicated the quantitative distributions of water quality parameters and the dispersion characteristics of a pollutant plume entering the turbid river system. Results from further analyses that use only three preselected multispectral scanner bands of data indicated that regression coefficients and standard errors of estimate were not appreciably degraded compared with results from the 10-band analysis.
Comparative amino acid digestibility for broiler chickens and White Pekin ducks.
Kong, C; Adeola, O
2013-09-01
A total of 608 three-week-old male broiler chickens and White Pekin ducks were used in a 5-d trial to compare ileal amino acid (AA) digestibility of soybean meal (SBM) and canola meal (CM) using the regression method. A corn-casein-cornstarch-based diet was mixed to contain 15% CP. Cornstarch was replaced with test ingredient (SBM or CM) to contain 18 or 21% of CP in 4 other diets. A nitrogen-free diet (NFD) was used for standardization of apparent digestibility. Birds received a standard starter diet (23% CP) from d 0 to 14 posthatch and then 6 experimental diets for 5 d. On d 19 posthatch, birds were asphyxiated with CO(2), and digesta from the distal section of ileum was collected. The ileal digestibility of AA from the test ingredients was assessed by multiple linear regression analysis using data on daily apparent ileal digestible AA and total AA intakes. The basal endogenous losses of N and all AA for ducks were significantly higher than those for broilers. For ileal AA digestibility by regression of apparent digestible AA intake against AA intake, there was a higher (P < 0.05) digestibility for Cys and Pro in ducks compared with broilers (P < 0.05). Within species, digestibility was not different between SBM and CM except for Lys of ducks, and Lys and Pro of broilers (P < 0.05). The results of this study showed that ducks have higher basal endogenous AA losses compared with broiler chickens as well as higher ileal Cys and Pro digestibility estimates derived from regression approach, indicating that data obtained from broilers should not be used to formulate diets for ducks.
Lindholdt, Louise; Labriola, Merete; Nielsen, Claus Vinther; Horsbøl, Trine Allerslev; Lund, Thomas
2017-07-20
The return-to-work (RTW) process after long-term sickness absence is often complex and long and implies multiple shifts between different labour market states for the absentee. Standard methods for examining RTW research typically rely on the analysis of one outcome measure at a time, which will not capture the many possible states and transitions the absentee can go through. The purpose of this study was to explore the potential added value of sequence analysis in supplement to standard regression analysis of a multidisciplinary RTW intervention among patients with low back pain (LBP). The study population consisted of 160 patients randomly allocated to either a hospital-based brief or a multidisciplinary intervention. Data on labour market participation following intervention were obtained from a national register and analysed in two ways: as a binary outcome expressed as active or passive relief at a 1-year follow-up and as four different categories for labour market participation. Logistic regression and sequence analysis were performed. The logistic regression analysis showed no difference in labour market participation for patients in the two groups after 1 year. Applying sequence analysis showed differences in subsequent labour market participation after 2 years after baseline in favour of the brief intervention group versus the multidisciplinary intervention group. The study indicated that sequence analysis could provide added analytical value as a supplement to traditional regression analysis in prospective studies of RTW among patients with LBP. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2017. All rights reserved. No commercial use is permitted unless otherwise expressly granted.
Ibidunni, Ayodotun Stephen; Kehinde, Oladele Joseph; Ibidunni, Oyebisi Mary; Olokundun, Maxwell Ayodele; Olubusayo, Falola Hezekiah; Salau, Odunayo Paul; Borishade, Taiye Tairat; Fred, Peter
2018-06-01
The article presents data on the relationship between financing strategies, entrepreneurial competencies and business growth of technology-based SMEs in Nigeria. Copies of structured questionnaire were administered to 233 SME owners and financial managers. Using descriptive and standard multiple regression statistical analysis, the data revealed that venture capital and business donations significantly influences profit growth of technology-based SMEs. Moreover, the data revealed that technology-`based firms can enhance their access to financing through capacity building in entrepreneurial competencies, such as acquiring the right skills and attitude.
The M Word: Multicollinearity in Multiple Regression.
ERIC Educational Resources Information Center
Morrow-Howell, Nancy
1994-01-01
Notes that existence of substantial correlation between two or more independent variables creates problems of multicollinearity in multiple regression. Discusses multicollinearity problem in social work research in which independent variables are usually intercorrelated. Clarifies problems created by multicollinearity, explains detection of…
Ling, Ru; Liu, Jiawang
2011-12-01
To construct prediction model for health workforce and hospital beds in county hospitals of Hunan by multiple linear regression. We surveyed 16 counties in Hunan with stratified random sampling according to uniform questionnaires,and multiple linear regression analysis with 20 quotas selected by literature view was done. Independent variables in the multiple linear regression model on medical personnels in county hospitals included the counties' urban residents' income, crude death rate, medical beds, business occupancy, professional equipment value, the number of devices valued above 10 000 yuan, fixed assets, long-term debt, medical income, medical expenses, outpatient and emergency visits, hospital visits, actual available bed days, and utilization rate of hospital beds. Independent variables in the multiple linear regression model on county hospital beds included the the population of aged 65 and above in the counties, disposable income of urban residents, medical personnel of medical institutions in county area, business occupancy, the total value of professional equipment, fixed assets, long-term debt, medical income, medical expenses, outpatient and emergency visits, hospital visits, actual available bed days, utilization rate of hospital beds, and length of hospitalization. The prediction model shows good explanatory and fitting, and may be used for short- and mid-term forecasting.
Qidwai, Tabish; Yadav, Dharmendra K; Khan, Feroz; Dhawan, Sangeeta; Bhakuni, R S
2012-01-01
This work presents the development of quantitative structure activity relationship (QSAR) model to predict the antimalarial activity of artemisinin derivatives. The structures of the molecules are represented by chemical descriptors that encode topological, geometric, and electronic structure features. Screening through QSAR model suggested that compounds A24, A24a, A53, A54, A62 and A64 possess significant antimalarial activity. Linear model is developed by the multiple linear regression method to link structures to their reported antimalarial activity. The correlation in terms of regression coefficient (r(2)) was 0.90 and prediction accuracy of model in terms of cross validation regression coefficient (rCV(2)) was 0.82. This study indicates that chemical properties viz., atom count (all atoms), connectivity index (order 1, standard), ring count (all rings), shape index (basic kappa, order 2), and solvent accessibility surface area are well correlated with antimalarial activity. The docking study showed high binding affinity of predicted active compounds against antimalarial target Plasmepsins (Plm-II). Further studies for oral bioavailability, ADMET and toxicity risk assessment suggest that compound A24, A24a, A53, A54, A62 and A64 exhibits marked antimalarial activity comparable to standard antimalarial drugs. Later one of the predicted active compound A64 was chemically synthesized, structure elucidated by NMR and in vivo tested in multidrug resistant strain of Plasmodium yoelii nigeriensis infected mice. The experimental results obtained agreed well with the predicted values.
Liu, Gang; Mukherjee, Bhramar; Lee, Seunggeun; Lee, Alice W; Wu, Anna H; Bandera, Elisa V; Jensen, Allan; Rossing, Mary Anne; Moysich, Kirsten B; Chang-Claude, Jenny; Doherty, Jennifer A; Gentry-Maharaj, Aleksandra; Kiemeney, Lambertus; Gayther, Simon A; Modugno, Francesmary; Massuger, Leon; Goode, Ellen L; Fridley, Brooke L; Terry, Kathryn L; Cramer, Daniel W; Ramus, Susan J; Anton-Culver, Hoda; Ziogas, Argyrios; Tyrer, Jonathan P; Schildkraut, Joellen M; Kjaer, Susanne K; Webb, Penelope M; Ness, Roberta B; Menon, Usha; Berchuck, Andrew; Pharoah, Paul D; Risch, Harvey; Pearce, Celeste Leigh
2018-02-01
There have been recent proposals advocating the use of additive gene-environment interaction instead of the widely used multiplicative scale, as a more relevant public health measure. Using gene-environment independence enhances statistical power for testing multiplicative interaction in case-control studies. However, under departure from this assumption, substantial bias in the estimates and inflated type I error in the corresponding tests can occur. In this paper, we extend the empirical Bayes (EB) approach previously developed for multiplicative interaction, which trades off between bias and efficiency in a data-adaptive way, to the additive scale. An EB estimator of the relative excess risk due to interaction is derived, and the corresponding Wald test is proposed with a general regression setting under a retrospective likelihood framework. We study the impact of gene-environment association on the resultant test with case-control data. Our simulation studies suggest that the EB approach uses the gene-environment independence assumption in a data-adaptive way and provides a gain in power compared with the standard logistic regression analysis and better control of type I error when compared with the analysis assuming gene-environment independence. We illustrate the methods with data from the Ovarian Cancer Association Consortium. © The Author(s) 2017. Published by Oxford University Press on behalf of the Johns Hopkins Bloomberg School of Public Health. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
ERIC Educational Resources Information Center
Li, Deping; Oranje, Andreas
2007-01-01
Two versions of a general method for approximating standard error of regression effect estimates within an IRT-based latent regression model are compared. The general method is based on Binder's (1983) approach, accounting for complex samples and finite populations by Taylor series linearization. In contrast, the current National Assessment of…
Anderson, Carl A; McRae, Allan F; Visscher, Peter M
2006-07-01
Standard quantitative trait loci (QTL) mapping techniques commonly assume that the trait is both fully observed and normally distributed. When considering survival or age-at-onset traits these assumptions are often incorrect. Methods have been developed to map QTL for survival traits; however, they are both computationally intensive and not available in standard genome analysis software packages. We propose a grouped linear regression method for the analysis of continuous survival data. Using simulation we compare this method to both the Cox and Weibull proportional hazards models and a standard linear regression method that ignores censoring. The grouped linear regression method is of equivalent power to both the Cox and Weibull proportional hazards methods and is significantly better than the standard linear regression method when censored observations are present. The method is also robust to the proportion of censored individuals and the underlying distribution of the trait. On the basis of linear regression methodology, the grouped linear regression model is computationally simple and fast and can be implemented readily in freely available statistical software.
Hein, R; Abbas, S; Seibold, P; Salazar, R; Flesch-Janys, D; Chang-Claude, J
2012-01-01
Menopausal hormone therapy (MHT) is associated with an increased breast cancer risk in postmenopausal women, with combined estrogen-progestagen therapy posing a greater risk than estrogen monotherapy. However, few studies focused on potential effect modification of MHT-associated breast cancer risk by genetic polymorphisms in the progesterone metabolism. We assessed effect modification of MHT use by five coding single nucleotide polymorphisms (SNPs) in the progesterone metabolizing enzymes AKR1C3 (rs7741), AKR1C4 (rs3829125, rs17134592), and SRD5A1 (rs248793, rs3736316) using a two-center population-based case-control study from Germany with 2,502 postmenopausal breast cancer patients and 4,833 matched controls. An empirical-Bayes procedure that tests for interaction using a weighted combination of the prospective and the retrospective case-control estimators as well as standard prospective logistic regression were applied to assess multiplicative statistical interaction between polymorphisms and duration of MHT use with regard to breast cancer risk assuming a log-additive mode of inheritance. No genetic marginal effects were observed. Breast cancer risk associated with duration of combined therapy was significantly modified by SRD5A1_rs3736316, showing a reduced risk elevation in carriers of the minor allele (p (interaction,empirical-Bayes) = 0.006 using the empirical-Bayes method, p (interaction,logistic regression) = 0.013 using logistic regression). The risk associated with duration of use of monotherapy was increased by AKR1C3_rs7741 in minor allele carriers (p (interaction,empirical-Bayes) = 0.083, p (interaction,logistic regression) = 0.029) and decreased in minor allele carriers of two SNPs in AKR1C4 (rs3829125: p (interaction,empirical-Bayes) = 0.07, p (interaction,logistic regression) = 0.021; rs17134592: p (interaction,empirical-Bayes) = 0.101, p (interaction,logistic regression) = 0.038). After Bonferroni correction for multiple testing only SRD5A1_rs3736316 assessed using the empirical-Bayes method remained significant. Postmenopausal breast cancer risk associated with combined therapy may be modified by genetic variation in SRD5A1. Further well-powered studies are, however, required to replicate our finding.
Kuiper, Gerhardus J A J M; Houben, Rik; Wetzels, Rick J H; Verhezen, Paul W M; Oerle, Rene van; Ten Cate, Hugo; Henskens, Yvonne M C; Lancé, Marcus D
2017-11-01
Low platelet counts and hematocrit levels hinder whole blood point-of-care testing of platelet function. Thus far, no reference ranges for MEA (multiple electrode aggregometry) and PFA-100 (platelet function analyzer 100) devices exist for low ranges. Through dilution methods of volunteer whole blood, platelet function at low ranges of platelet count and hematocrit levels was assessed on MEA for four agonists and for PFA-100 in two cartridges. Using (multiple) regression analysis, 95% reference intervals were computed for these low ranges. Low platelet counts affected MEA in a positive correlation (all agonists showed r 2 ≥ 0.75) and PFA-100 in an inverse correlation (closure times were prolonged with lower platelet counts). Lowered hematocrit did not affect MEA testing, except for arachidonic acid activation (ASPI), which showed a weak positive correlation (r 2 = 0.14). Closure time on PFA-100 testing was inversely correlated with hematocrit for both cartridges. Regression analysis revealed different 95% reference intervals in comparison with originally established intervals for both MEA and PFA-100 in low platelet or hematocrit conditions. Multiple regression analysis of ASPI and both tests on the PFA-100 for combined low platelet and hematocrit conditions revealed that only PFA-100 testing should be adjusted for both thrombocytopenia and anemia. 95% reference intervals were calculated using multiple regression analysis. However, coefficients of determination of PFA-100 were poor, and some variance remained unexplained. Thus, in this pilot study using (multiple) regression analysis, we could establish reference intervals of platelet function in anemia and thrombocytopenia conditions on PFA-100 and in thrombocytopenia conditions on MEA.
Zhang, Hualing
2014-03-01
To learn characteristics and their mutual relations of self-esteem, self-harmony and interpersonal-harmony of university students, in order to provide the basis for mental health education. With a stratified cluster random sampling method, a questionnaire survey was conducted in 820 university students from 16 classes of four universities, chosen from 30 universities in Anhui Province. Meanwhile, Rosenberg Self-esteem Scale, Self-harmony Scale and Interpersonal-harmony Diagnostic Scale were used for assessment. Self-esteem of university students has an average score of (30.71 +/- 4.77), higher than median thoery 25, and there existed statistical significance in the dimensions of gender (P = 0.004), origin (P = 0.038) and only-child (P = 0.005). University students' self-harmony has an average score of (98.66 +/- 8.69), among which there were 112 students in the group of low score, counting for 13.7%, 442 in that of middle score, counting for 53.95%, 265 in that of high score, counting for 32.33%. And there existed no statistical significance in the total-score of self-harmony and score differences from most of subscales in the dimention of gender and origin, but satistical significance did exist in the dimention of only-child (P = 0.004). It was statistically significant (P = 0.006) on the "stereotype" subscales, on the differences between university students from urban areas and rural areas. Every dimension of self-esteem and self -harmony and interpersonal harmony was correlated and statistically significant. Multiple regression analysis found that when there was a variable in self-esteem, the amount of the variable of self-harmony for explaination of interpersonal conversation dropped from 22.6% to 12%, and standard regression coefficient changing from 0.087 to 0.035. The trouble of interpersonal dating fell from 27.6% to 13.1%, the standard regression coefficient changing from 0.104 to 0.019. The bother of treating people fell from 30.9% to 15%, and the standard regression coefficient changing from 0.079 to 0.020. The problem of heterosexual contact fell from 23.4% to 17.3%, and the standard regression coefficient changing from 0.095 to 0.024. Self-esteem was a mediator variable between self-harmony and interpersonal-harmony. By cultivating university students' level of self-esteem to achieve their self-harmony and interpersonal-harmony, university students' mental health level can be improved.
Seasonal forecasting of high wind speeds over Western Europe
NASA Astrophysics Data System (ADS)
Palutikof, J. P.; Holt, T.
2003-04-01
As financial losses associated with extreme weather events escalate, there is interest from end users in the forestry and insurance industries, for example, in the development of seasonal forecasting models with a long lead time. This study uses exceedences of the 90th, 95th, and 99th percentiles of daily maximum wind speed over the period 1958 to present to derive predictands of winter wind extremes. The source data is the 6-hourly NCEP Reanalysis gridded surface wind field. Predictor variables include principal components of Atlantic sea surface temperature and several indices of climate variability, including the NAO and SOI. Lead times of up to a year are considered, in monthly increments. Three regression techniques are evaluated; multiple linear regression (MLR), principal component regression (PCR), and partial least squares regression (PLS). PCR and PLS proved considerably superior to MLR with much lower standard errors. PLS was chosen to formulate the predictive model since it offers more flexibility in experimental design and gave slightly better results than PCR. The results indicate that winter windiness can be predicted with considerable skill one year ahead for much of coastal Europe, but that this deteriorates rapidly in the hinterland. The experiment succeeded in highlighting PLS as a very useful method for developing more precise forecasting models, and in identifying areas of high predictability.
Tchetgen Tchetgen, Eric
2011-03-01
This article considers the detection and evaluation of genetic effects incorporating gene-environment interaction and independence. Whereas ordinary logistic regression cannot exploit the assumption of gene-environment independence, the proposed approach makes explicit use of the independence assumption to improve estimation efficiency. This method, which uses both cases and controls, fits a constrained retrospective regression in which the genetic variant plays the role of the response variable, and the disease indicator and the environmental exposure are the independent variables. The regression model constrains the association of the environmental exposure with the genetic variant among the controls to be null, thus explicitly encoding the gene-environment independence assumption, which yields substantial gain in accuracy in the evaluation of genetic effects. The proposed retrospective regression approach has several advantages. It is easy to implement with standard software, and it readily accounts for multiple environmental exposures of a polytomous or of a continuous nature, while easily incorporating extraneous covariates. Unlike the profile likelihood approach of Chatterjee and Carroll (Biometrika. 2005;92:399-418), the proposed method does not require a model for the association of a polytomous or continuous exposure with the disease outcome, and, therefore, it is agnostic to the functional form of such a model and completely robust to its possible misspecification.
As a fast and effective technique, the multiple linear regression (MLR) method has been widely used in modeling and prediction of beach bacteria concentrations. Among previous works on this subject, however, several issues were insufficiently or inconsistently addressed. Those is...
MULTIPLE REGRESSION MODELS FOR HINDCASTING AND FORECASTING MIDSUMMER HYPOXIA IN THE GULF OF MEXICO
A new suite of multiple regression models were developed that describe the relationship between the area of bottom water hypoxia along the northern Gulf of Mexico and Mississippi-Atchafalaya River nitrate concentration, total phosphorus (TP) concentration, and discharge. Variabil...
Khalil, Mohamed H.; Shebl, Mostafa K.; Kosba, Mohamed A.; El-Sabrout, Karim; Zaki, Nesma
2016-01-01
Aim: This research was conducted to determine the most affecting parameters on hatchability of indigenous and improved local chickens’ eggs. Materials and Methods: Five parameters were studied (fertility, early and late embryonic mortalities, shape index, egg weight, and egg weight loss) on four strains, namely Fayoumi, Alexandria, Matrouh, and Montazah. Multiple linear regression was performed on the studied parameters to determine the most influencing one on hatchability. Results: The results showed significant differences in commercial and scientific hatchability among strains. Alexandria strain has the highest significant commercial hatchability (80.70%). Regarding the studied strains, highly significant differences in hatching chick weight among strains were observed. Using multiple linear regression analysis, fertility made the greatest percent contribution (71.31%) to hatchability, and the lowest percent contributions were made by shape index and egg weight loss. Conclusion: A prediction of hatchability using multiple regression analysis could be a good tool to improve hatchability percentage in chickens. PMID:27651666
Kontopantelis, Evangelos; Parisi, Rosa; Springate, David A; Reeves, David
2017-01-13
In modern health care systems, the computerization of all aspects of clinical care has led to the development of large data repositories. For example, in the UK, large primary care databases hold millions of electronic medical records, with detailed information on diagnoses, treatments, outcomes and consultations. Careful analyses of these observational datasets of routinely collected data can complement evidence from clinical trials or even answer research questions that cannot been addressed in an experimental setting. However, 'missingness' is a common problem for routinely collected data, especially for biological parameters over time. Absence of complete data for the whole of a individual's study period is a potential bias risk and standard complete-case approaches may lead to biased estimates. However, the structure of the data values makes standard cross-sectional multiple-imputation approaches unsuitable. In this paper we propose and evaluate mibmi, a new command for cleaning and imputing longitudinal body mass index data. The regression-based data cleaning aspects of the algorithm can be useful when researchers analyze messy longitudinal data. Although the multiple imputation algorithm is computationally expensive, it performed similarly or even better to existing alternatives, when interpolating observations. The mibmi algorithm can be a useful tool for analyzing longitudinal body mass index data, or other longitudinal data with very low individual-level variability.
An Occupational Performance Test Validation Program for Fire Fighters at the Kennedy Space Center
NASA Technical Reports Server (NTRS)
Schonfeld, Brian R.; Doerr, Donald F.; Convertino, Victor A.
1990-01-01
We evaluated performance of a modified Combat Task Test (CTT) and of standard fitness tests in 20 male subjects to assess the prediction of occupational performance standards for Kennedy Space Center fire fighters. The CTT consisted of stair-climbing, a chopping simulation, and a victim rescue simulation. Average CTT performance time was 3.61 +/- 0.25 min (SEM) and all CTT tasks required 93% to 97% maximal heart rate. By using scores from the standard fitness tests, a multiple linear regression model was fitted to each parameter: the stairclimb (r(exp 2) = .905, P less than .05), the chopping performance time (r(exp 2) = .582, P less than .05), the victim rescue time (r(exp 2) = .218, P = not significant), and the total performance time (r(exp 2) = .769, P less than .05). Treadmill time was the predominant variable, being the major predictor in two of four models. These results indicated that standardized fitness tests can predict performance on some CTT tasks and that test predictors were amenable to exercise training.
Mean centering, multicollinearity, and moderators in multiple regression: The reconciliation redux.
Iacobucci, Dawn; Schneider, Matthew J; Popovich, Deidre L; Bakamitsos, Georgios A
2017-02-01
In this article, we attempt to clarify our statements regarding the effects of mean centering. In a multiple regression with predictors A, B, and A × B (where A × B serves as an interaction term), mean centering A and B prior to computing the product term can clarify the regression coefficients (which is good) and the overall model fit R 2 will remain undisturbed (which is also good).
2013-01-01
application of the Hammett equation with the constants rph in the chemistry of organophosphorus compounds, Russ. Chem. Rev. 38 (1969) 795–811. [13...of oximes and OP compounds and the ability of oximes to reactivate OP- inhibited AChE. Multiple linear regression equations were analyzed using...phosphonate pairs, 21 oxime/ phosphoramidate pairs and 12 oxime/phosphate pairs. The best linear regression equation resulting from multiple regression anal
Pautex, Sophie; Herrmann, François R; Michon, Agnès; Giannakopoulos, Panteleimon; Gold, Gabriel
2007-01-01
Self-report is the "gold standard" for pain assessment, however, observational pain scales, such as Doloplus-2 must be used for patients who cannot communicate. In this follow-up study, we report the psychometric properties of the observational Doloplus-2 scale using the visual analog scale (VAS) pain score as a gold standard and evaluate its performance. Prospective clinical study of 180 hospitalized older patients who demonstrated good comprehension and reliable use of the VAS: 131 participants with dementia and 49 without. All participants assessed their chronic pain using the VAS. Doloplus-2 was independently completed by the nursing team. Mean age of patients (133 women, 47 men) was 83.7+/-6.5. Median mini-mental state examination of patients with diagnosis of dementia was 18.0+/-7.7. Nearly half of the patients (49%) reported that they experienced pain in response to a direct question. The administration of Doloplus-2 was possible in all 180 patients. Doloplus-2 correlated moderately with self-assessment (Spearman coefficient: 0.46). In a multiple regression model, Doloplus-2 predicted 41% of the variability in pain intensity measured by VAS. The somatic dimension alone explained 36% of the variance, the psychosocial bloc 5% with no better contribution of the psychomotor bloc. To shorten Doloplus-2, we constructed a version with only the 5 items that were significantly associated with the VAS score in the multiple regression models. The observational Doloplus-2 scale correlates moderately with self-assessment pain score and has adequate internal consistency. Our data also suggest that Doloplus-2 could be substantially shortened as the brief version performed similarly to the complete Doloplus-2.
Relationship between self-esteem and living conditions among stroke survivors at home.
Shida, Junko; Sugawara, Kyoko; Goto, Junko; Sekito, Yoshiko
2014-10-01
To clarify the relationship between self-esteem of stroke survivors at home and their living conditions. Study participants were stroke survivors who lived at home and commuted to one of two medical facilities in the Tohoku region of Japan. Stroke survivors were recruited for the present study when they came to the hospital for a routine visit. The researcher or research assistant explained the study objective and methods to the stroke survivor, and the questionnaire survey was conducted. Survey contents included the Japanese version of the Rosenberg Self-Esteem Scale (RSE) and questions designed to assess living conditions. A total of 65 participants with complete RSE data were included in the analysis. The mean (standard deviation) age of participants was 70.9 years (± 11.1), with a mean RSE score of 32.12 (± 8.32). Only a minor decrease in participant self-esteem was observed, even after having experienced a stroke. Factors associated with self-esteem, including "independent bathing" (standardized partial regression coefficient, β = 0.405, P < 0.001), "being needed by family members" (β = 0.389, P < 0.001), "independent grooming" (β = 0.292, P = 0.009), and "sleep satisfaction" (β = 0.237, P = 0.017), were analyzed by stepwise multiple regression analysis. The multiple correlation coefficient adjusted for the degrees of freedom was 0.738 (P < 0.001). Our analysis revealed that the maintenance of activities of daily living, and the presence of a suitable environment that enhances physical function recovery and promotes activity and participation, are necessary to improve self-esteem in stroke survivors living at home. © 2013 The Authors. Japan Journal of Nursing Science © 2013 Japan Academy of Nursing Science.
Yan, H C; Hao, Y T; Guo, Y F; Wei, Y H; Zhang, J H; Huang, G P; Mao, L M; Zhang, Z Q
2017-11-10
Objective: To evaluate the accuracy of simple anthropometric parameters in diagnosing obesity in children in Guangzhou. Methods: A cross-sectional study, including 465 children aged 6-9 years, was carried out in Guangzhou. Their body height and weight, waist circumference (WC) and hip circumference were measured according to standard procedure. Body mass index (BMI), waist to hip ratio (WHR) and waist-to-height ratio (WHtR) were calculated. Body fat percentage (BF%) was determined by dual-energy X-ray absorptiometry. Multiple regression analysis was applied to evaluate the correlations between age-adjusted physical indicators and BF%, after the adjustment for age. Obesity was defined by BF%. Receiver operating characteristic (ROC) curve analyses were performed to assess the diagnostic accuracy of the indicators for childhood obesity. Area under-ROC curves (AUCs) were calculated and the best cut-off point that maximizing 'sensitivity + specificity-1' was determined. Results: BMI showed the strongest association with BF% through multiple regression analysis. For 'per-standard deviation increase' of BMI, BF% increased by 5.3% ( t =23.1, P <0.01) in boys and 4.6% ( t =17.5, P <0.01) in girls, respectively. The ROC curve analysis indicated that BMI exhibited the largest AUC in both boys (AUC=0.908) and girls (AUC=0.895). The sensitivity was 80.8% in boys and 81.8% in girls, and the specificity was 88.2% in boys and 87.1% in girls. Both the AUCs for WHtR and WC were less than 0.8 in boys and girls. WHR had the smallest AUCs (<0.8) in both boys and girls. Conclusion: BMI appeared to be a good predicator for BF% in children aged 6-9 years in Guangzhou.
Linardakis, M; Papadaki, A; Smpokos, E; Sarri, K; Vozikaki, M; Philalithis, A
2015-05-01
Behavioural risk factors for chronic diseases involve factors relating to lifestyle habits. This study examined the relationship of religious and spiritual beliefs with the adoption and presence of multiple behavioural risk factors (MBRFs) in European adults. Cross-sectional study. Data were used from 16,557 individuals, aged 50+ years, participating in the Survey of Health, Ageing and Retirement in Europe (2004/05). MBRFs clustering was defined by high body weight, smoking, physical inactivity and risky alcohol consumption, and regression estimations with religiosity and prayer use were assessed based on sampling weights. In total, 79.4% of participants had received religious education, 33.4% had used prayer '≥1 time/day' and 53.3% had clustering of 2+ MBRFs. Lower prevalence of smoking was found in males (20.6% vs. 29.4%, P < 0.05), as well as in females (13.1% vs. 22.6%, P < 0.05), who prayed '≥1 time/day', compared to those who never prayed. Categorical regression analysis revealed that the presence of MBRFs was associated negatively with religious education (standardized beta = -0.048, P < 0.001) and positively with low frequency of prayer use (standardized beta = 0.056, P < 0.001). Having received religious education and prayer use were related to the presence of fewer MBRFs in European adults aged 50+ years. These lifestyle factors should be assessed as potential determinants of MBRFs adoption when examining chronic disease development in multicultural populations. Copyright © 2015 The Royal Society for Public Health. Published by Elsevier Ltd. All rights reserved.
Nishioka, Shinta; Wakabayashi, Hidetaka; Yoshida, Tomomi; Mori, Natsumi; Watanabe, Riko; Nishioka, Emi
2016-01-01
A protective effect of excessive body mass index (BMI) on mortality or functional outcome in patients with stroke is not well established in the Asian population. This study aimed to explore whether obese patients with stroke have advantages for functional improvement in Japanese rehabilitation wards. This retrospective cohort study included consecutive patients with stroke admitted and discharged from convalescent rehabilitation wards between 2011 and 2015. Demographic data, BMI, Functional Independence Measure (FIM) score, and nutritional status were analyzed. Participants were classified into 4 groups according to BMI (underweight <18.5 kg/m(2), standard 18.5-<23 kg/m(2), overweight 23-<27.5 kg/m(2), obese ≥27.5 kg/m(2)). The primary outcome was the FIM gain, and the secondary outcome was the FIM score at discharge. Multiple regression analysis was performed to analyze the relationship between BMI and functional recovery. In total, 897 participants (males 484, females 413; mean age 71.6 years) were analyzed and classified as underweight (134), standard (432), overweight (277), and obese (54). The median FIM gain and the FIM score at discharge were 30 and 114, respectively. The FIM gain in the obese group was significantly higher than those in the other groups. Multiple regression analysis revealed that obesity was independently correlated with the FIM gain, and those at discharge after adjusting for confounders such as age, gender, and FIM score on admission. Obese Japanese convalescent patients with stroke may have some advantages for functional recovery in rehabilitation wards. Copyright © 2015 National Stroke Association. Published by Elsevier Inc. All rights reserved.
He, Dan; Kuhn, David; Parida, Laxmi
2016-06-15
Given a set of biallelic molecular markers, such as SNPs, with genotype values encoded numerically on a collection of plant, animal or human samples, the goal of genetic trait prediction is to predict the quantitative trait values by simultaneously modeling all marker effects. Genetic trait prediction is usually represented as linear regression models. In many cases, for the same set of samples and markers, multiple traits are observed. Some of these traits might be correlated with each other. Therefore, modeling all the multiple traits together may improve the prediction accuracy. In this work, we view the multitrait prediction problem from a machine learning angle: as either a multitask learning problem or a multiple output regression problem, depending on whether different traits share the same genotype matrix or not. We then adapted multitask learning algorithms and multiple output regression algorithms to solve the multitrait prediction problem. We proposed a few strategies to improve the least square error of the prediction from these algorithms. Our experiments show that modeling multiple traits together could improve the prediction accuracy for correlated traits. The programs we used are either public or directly from the referred authors, such as MALSAR (http://www.public.asu.edu/~jye02/Software/MALSAR/) package. The Avocado data set has not been published yet and is available upon request. dhe@us.ibm.com. © The Author 2016. Published by Oxford University Press.
Multiple imputation for cure rate quantile regression with censored data.
Wu, Yuanshan; Yin, Guosheng
2017-03-01
The main challenge in the context of cure rate analysis is that one never knows whether censored subjects are cured or uncured, or whether they are susceptible or insusceptible to the event of interest. Considering the susceptible indicator as missing data, we propose a multiple imputation approach to cure rate quantile regression for censored data with a survival fraction. We develop an iterative algorithm to estimate the conditionally uncured probability for each subject. By utilizing this estimated probability and Bernoulli sample imputation, we can classify each subject as cured or uncured, and then employ the locally weighted method to estimate the quantile regression coefficients with only the uncured subjects. Repeating the imputation procedure multiple times and taking an average over the resultant estimators, we obtain consistent estimators for the quantile regression coefficients. Our approach relaxes the usual global linearity assumption, so that we can apply quantile regression to any particular quantile of interest. We establish asymptotic properties for the proposed estimators, including both consistency and asymptotic normality. We conduct simulation studies to assess the finite-sample performance of the proposed multiple imputation method and apply it to a lung cancer study as an illustration. © 2016, The International Biometric Society.
Linard, Joshua I.
2013-01-01
Mitigating the effects of salt and selenium on water quality in the Grand Valley and lower Gunnison River Basin in western Colorado is a major concern for land managers. Previous modeling indicated means to improve the models by including more detailed geospatial data and a more rigorous method for developing the models. After evaluating all possible combinations of geospatial variables, four multiple linear regression models resulted that could estimate irrigation-season salt yield, nonirrigation-season salt yield, irrigation-season selenium yield, and nonirrigation-season selenium yield. The adjusted r-squared and the residual standard error (in units of log-transformed yield) of the models were, respectively, 0.87 and 2.03 for the irrigation-season salt model, 0.90 and 1.25 for the nonirrigation-season salt model, 0.85 and 2.94 for the irrigation-season selenium model, and 0.93 and 1.75 for the nonirrigation-season selenium model. The four models were used to estimate yields and loads from contributing areas corresponding to 12-digit hydrologic unit codes in the lower Gunnison River Basin study area. Each of the 175 contributing areas was ranked according to its estimated mean seasonal yield of salt and selenium.
[The importance of handprint morphometry for determining the human body length].
Grigor'eva, M A
2018-01-01
Handprint morphometry for the purpose of personality identification still remains a relatively novel approach. The methods employed for the measurements are not infrequently difficult to reproduce and therefore cause controversy. The objective of the present study was to introduce the system of methods for the measurement of handprints suitable for the reliable determination of the human body length. The study included the measurement of the size of 40 handprints left by124 adult subjects (52 men and 72 women). Two methods of the regression analysis, stepwise and forced inclusion, were applied to the combined group of handprints to select the equations with the high (R>0.800) coefficients of multiple correlation with the body length. 13 equations of multiple regression were obtained and analyzed. The standard error of estimating (SEE) varied from 4.30 to 5.19 cm. The best results were obtained with the equations constructed from the sizes I, II, and III of the rays without their distal phalanges. It was shown that the body length can be successfully reconstructed within the height range from 168 to 183 cm for men and from 157 to 176 cm for women. The examples of the use of the equations for the purpose of expertise of illegible and incomplete handprints are presented.
Yamakado, Minoru; Tanaka, Takayuki; Nagao, Kenji; Imaizumi, Akira; Komatsu, Michiharu; Daimon, Takashi; Miyano, Hiroshi; Tani, Mizuki; Toda, Akiko; Yamamoto, Hiroshi; Horimoto, Katsuhisa; Ishizaka, Yuko
2017-11-03
Fatty liver disease (FLD) increases the risk of diabetes, cardiovascular disease, and steatohepatitis, which leads to fibrosis, cirrhosis, and hepatocellular carcinoma. Thus, the early detection of FLD is necessary. We aimed to find a quantitative and feasible model for discriminating the FLD, based on plasma free amino acid (PFAA) profiles. We constructed models of the relationship between PFAA levels in 2,000 generally healthy Japanese subjects and the diagnosis of FLD by abdominal ultrasound scan by multiple logistic regression analysis with variable selection. The performance of these models for FLD discrimination was validated using an independent data set of 2,160 subjects. The generated PFAA-based model was able to identify FLD patients. The area under the receiver operating characteristic curve for the model was 0.83, which was higher than those of other existing liver function-associated markers ranging from 0.53 to 0.80. The value of the linear discriminant in the model yielded the adjusted odds ratio (with 95% confidence intervals) for a 1 standard deviation increase of 2.63 (2.14-3.25) in the multiple logistic regression analysis with known liver function-associated covariates. Interestingly, the linear discriminant values were significantly associated with the progression of FLD, and patients with nonalcoholic steatohepatitis also exhibited higher values.
NASA Astrophysics Data System (ADS)
Laborda, Francisco; Medrano, Jesús; Castillo, Juan R.
2004-06-01
The quality of the quantitative results obtained from transient signals in high-performance liquid chromatography-inductively coupled plasma mass spectrometry (HPLC-ICPMS) and flow injection-inductively coupled plasma mass spectrometry (FI-ICPMS) was investigated under multielement conditions. Quantification methods were based on multiple-point calibration by simple and weighted linear regression, and double-point calibration (measurement of the baseline and one standard). An uncertainty model, which includes the main sources of uncertainty from FI-ICPMS and HPLC-ICPMS (signal measurement, sample flow rate and injection volume), was developed to estimate peak area uncertainties and statistical weights used in weighted linear regression. The behaviour of the ICPMS instrument was characterized in order to be considered in the model, concluding that the instrument works as a concentration detector when it is used to monitorize transient signals from flow injection or chromatographic separations. Proper quantification by the three calibration methods was achieved when compared to reference materials, although the double-point calibration allowed to obtain results of the same quality as the multiple-point calibration, shortening the calibration time. Relative expanded uncertainties ranged from 10-20% for concentrations around the LOQ to 5% for concentrations higher than 100 times the LOQ.
Searching for a neurologic injury's Wechsler Adult Intelligence Scale-Third Edition profile.
Gonçalves, Marta A; Moura, Octávio; Castro-Caldas, Alexandre; Simões, Mário R
2017-01-01
This study aimed to investigate the presence of a Wechsler Adult Intelligence Scale-Third Edition (WAIS-III) cognitive profile in a Portuguese neurologic injured sample. The Portuguese WAIS-III was administered to 81 mixed neurologic patients and 81 healthy matched controls selected from the Portuguese standardization sample. Although the mixed neurologic injury group performed significantly lower than the healthy controls for the majority of the WAIS-III scores (i.e., composite measures, discrepancies, and subtests), the mean scores were within the normal range and, therefore, at risk of being unobserved in a clinical evaluation. ROC curves analysis showed poor to acceptable diagnostic accuracy for the WAIS-III composite measures and subtests (Working Memory Index and Digit Span revealed the highest accuracy for discriminating between participants, respectively). Multiple regression analysis showed that both literacy and the presence of brain injury were significant predictors for all of the composite measures. In addition, multiple regression analysis also showed that literacy, age of injury onset, and years of survival predicted all seven composite measures for the mixed neurologic injured group. Despite the failure to find a WAIS-III cognitive profile for mixed neurologic patients, the results showed a significant influence of brain lesion and literacy in the performance of the WAIS-III.
A New Metric for Land-Atmosphere Coupling Strength: Applications on Observations and Modeling
NASA Astrophysics Data System (ADS)
Tang, Q.; Xie, S.; Zhang, Y.; Phillips, T. J.; Santanello, J. A., Jr.; Cook, D. R.; Riihimaki, L.; Gaustad, K.
2017-12-01
A new metric is proposed to quantify the land-atmosphere (LA) coupling strength and is elaborated by correlating the surface evaporative fraction and impacting land and atmosphere variables (e.g., soil moisture, vegetation, and radiation). Based upon multiple linear regression, this approach simultaneously considers multiple factors and thus represents complex LA coupling mechanisms better than existing single variable metrics. The standardized regression coefficients quantify the relative contributions from individual drivers in a consistent manner, avoiding the potential inconsistency in relative influence of conventional metrics. Moreover, the unique expendable feature of the new method allows us to verify and explore potentially important coupling mechanisms. Our observation-based application of the new metric shows moderate coupling with large spatial variations at the U.S. Southern Great Plains. The relative importance of soil moisture vs. vegetation varies by location. We also show that LA coupling strength is generally underestimated by single variable methods due to their incompleteness. We also apply this new metric to evaluate the representation of LA coupling in the Accelerated Climate Modeling for Energy (ACME) V1 Contiguous United States (CONUS) regionally refined model (RRM). This work is performed under the auspices of the U.S. Department of Energy by Lawrence Livermore National Laboratory under Contract DE-AC52-07NA27344. LLNL-ABS-734201
Examining the Relationship of Textbooks and Labs on Student Achievement in Eighth-Grade Science
NASA Astrophysics Data System (ADS)
Sugalan, Anacita Noromor
One of the most important objectives of teachers, parents, school administrators, and students is to improve student scores on standardized tests such as the State of Texas Assessment for Academic Readiness (STAAR) in eighth-grade science. This quasi experimental study examined the science achievement scores between schools that use textbooks and labs when delivering instruction. This study utilized a quantitative approach using archival data and survey design. Analysis of covariance (ANCOVA) and multiple regression were used to analyze the data while controlling STAAR eighth-grade reading scores to reveal significant differences between classes. The sample and population for this study were predominantly eighth-grade Hispanic students in South Texas. Analysis of covariance showed that classes that used high labs got higher science scores and that the reading scores were significantly related to science scores. Multiple regression findings indicated that textbooks and labs were significant predictors of student achievement on the STAAR eighth- grade science class result in South Texas for Spring 2015. The findings of this study may serve as a catalyst for improving student achievement in science through changes in textbook adoption and doing labs in science. The result suggests the need to research further to investigate other contributing factors of student achievement.
Multiple imputation for handling missing outcome data when estimating the relative risk.
Sullivan, Thomas R; Lee, Katherine J; Ryan, Philip; Salter, Amy B
2017-09-06
Multiple imputation is a popular approach to handling missing data in medical research, yet little is known about its applicability for estimating the relative risk. Standard methods for imputing incomplete binary outcomes involve logistic regression or an assumption of multivariate normality, whereas relative risks are typically estimated using log binomial models. It is unclear whether misspecification of the imputation model in this setting could lead to biased parameter estimates. Using simulated data, we evaluated the performance of multiple imputation for handling missing data prior to estimating adjusted relative risks from a correctly specified multivariable log binomial model. We considered an arbitrary pattern of missing data in both outcome and exposure variables, with missing data induced under missing at random mechanisms. Focusing on standard model-based methods of multiple imputation, missing data were imputed using multivariate normal imputation or fully conditional specification with a logistic imputation model for the outcome. Multivariate normal imputation performed poorly in the simulation study, consistently producing estimates of the relative risk that were biased towards the null. Despite outperforming multivariate normal imputation, fully conditional specification also produced somewhat biased estimates, with greater bias observed for higher outcome prevalences and larger relative risks. Deleting imputed outcomes from analysis datasets did not improve the performance of fully conditional specification. Both multivariate normal imputation and fully conditional specification produced biased estimates of the relative risk, presumably since both use a misspecified imputation model. Based on simulation results, we recommend researchers use fully conditional specification rather than multivariate normal imputation and retain imputed outcomes in the analysis when estimating relative risks. However fully conditional specification is not without its shortcomings, and so further research is needed to identify optimal approaches for relative risk estimation within the multiple imputation framework.
Undergraduate Student Motivation in Modularized Developmental Mathematics Courses
ERIC Educational Resources Information Center
Pachlhofer, Keith A.
2017-01-01
This study used the Motivated Strategies for Learning Questionnaire in modularized courses at three institutions across the nation (N = 189), and multiple regression was completed to investigate five categories of student motivation that predicted academic success and course completion. The overall multiple regression analysis was significant and…
MULGRES: a computer program for stepwise multiple regression analysis
A. Jeff Martin
1971-01-01
MULGRES is a computer program source deck that is designed for multiple regression analysis employing the technique of stepwise deletion in the search for most significant variables. The features of the program, along with inputs and outputs, are briefly described, with a note on machine compatibility.
Categorical Variables in Multiple Regression: Some Cautions.
ERIC Educational Resources Information Center
O'Grady, Kevin E.; Medoff, Deborah R.
1988-01-01
Limitations of dummy coding and nonsense coding as methods of coding categorical variables for use as predictors in multiple regression analysis are discussed. The combination of these approaches often yields estimates and tests of significance that are not intended by researchers for inclusion in their models. (SLD)
A Unified Framework for Association Analysis with Multiple Related Phenotypes
Stephens, Matthew
2013-01-01
We consider the problem of assessing associations between multiple related outcome variables, and a single explanatory variable of interest. This problem arises in many settings, including genetic association studies, where the explanatory variable is genotype at a genetic variant. We outline a framework for conducting this type of analysis, based on Bayesian model comparison and model averaging for multivariate regressions. This framework unifies several common approaches to this problem, and includes both standard univariate and standard multivariate association tests as special cases. The framework also unifies the problems of testing for associations and explaining associations – that is, identifying which outcome variables are associated with genotype. This provides an alternative to the usual, but conceptually unsatisfying, approach of resorting to univariate tests when explaining and interpreting significant multivariate findings. The method is computationally tractable genome-wide for modest numbers of phenotypes (e.g. 5–10), and can be applied to summary data, without access to raw genotype and phenotype data. We illustrate the methods on both simulated examples, and to a genome-wide association study of blood lipid traits where we identify 18 potential novel genetic associations that were not identified by univariate analyses of the same data. PMID:23861737
Fleischer, Adam E; Hshieh, Shenche; Crews, Ryan T; Waverly, Brett J; Jones, Jacob M; Klein, Erin E; Weil, Lowell; Weil, Lowell Scott
2018-05-01
Metatarsal length is believed to play a role in plantar plate dysfunction, although the mechanism through which progressive injury occurs is still uncertain. We aimed to clarify whether length of the second metatarsal was associated with increased plantar pressure measurements in the forefoot while walking. Weightbearing radiographs and corresponding pedobarographic data from 100 patients in our practice walking without a limp were retrospectively reviewed. Radiographs were assessed for several anatomic relationships, including metatarsal length, by a single rater. Pearson correlation analyses and multiple linear regression models were used to determine whether metatarsal length was associated with forefoot loading parameters. The relative length of the second to first metatarsal was positively associated with the ratio of peak pressure beneath the respective metatarsophalangeal joints ( r = 0.243, P = .015). The relative length of the second to third metatarsal was positively associated with the ratios of peak pressure ( r = 0.292, P = .003), pressure-time integral ( r = 0.249, P = .013), and force-time integral ( r = 0.221, P = .028) beneath the respective metatarsophalangeal joints. Although the variability in loading predicted by the various regression analyses was not large (4%-14%), the relative length of the second metatarsal (to the first and to the third) was maintained in each of the multiple regression models and remained the strongest predictor (highest standardized β-coefficient) in each of the models. Patients with longer second metatarsals exhibited relatively higher loads beneath the second metatarsophalangeal joint during barefoot walking. These findings provide a mechanism through which elongated second metatarsals may contribute to plantar plate injuries. Level III, comparative study.
The base rates and factors associated with reported access to firearms in psychiatric inpatients.
Kolla, Bhanu Prakash; O'Connor, Stephen S; Lineberry, Timothy W
2011-01-01
The aim of this study was to define whether specific patient demographic groups, diagnoses or other factors are associated with psychiatric inpatients reporting firearms access. A retrospective medical records review study was conducted using information on access to firearms from electronic medical records for all patients 16 years and older admitted between July 2007 and May 2008 at the Mayo Clinic Psychiatric Hospital in Rochester, MN. Data were obtained only on patients providing authorization for record review. Data were analyzed using univariate and multivariate logistic regression analyses accounting for gender, diagnostic groups, comorbid substance use, history of suicide attempts and family history of suicide/suicide attempts. Seventy-four percent (1169/1580) of patients provided research authorization. The ratio of men to women was identical in both research and nonresearch authorization groups. There were 14.6% of inpatients who reported firearms access. In univariate analysis, men were more likely (P<.0001) to report access than women, and a history of previous suicide attempt(s) was associated with decreased access (P=.02). Multiple logistic regression analyses controlling for other factors found females and patients with history of previous suicide attempt(s) less likely to report access, while patients with a family history of suicide or suicide attempts reported increased firearms access. Diagnostic groups were not associated with access on univariate or multiple logistic regression analyses. Men and inpatients with a family history of suicide/suicide attempts were more likely to report firearms access. Clinicians should develop standardized systems of identification of firearms access and provide guidance on removal. Copyright © 2011 Elsevier Inc. All rights reserved.
Cardarelli, Roberto; Singh, Meharvan; Meyer, Jason; Balyakina, Elizabeth; Perez, Oscar; King, Michael
2014-07-01
Hypogonadism is highly prevalent in men older than 45 years and is associated with an increased risk of chronic diseases, including obesity, metabolic syndrome, diabetes, and cardiovascular disease. The objective of this study was to determine whether lifestyle factors such as smoking, diet, and exercise are associated with reduced testosterone levels. In this cross-sectional study, 147 men older than 44 years were recruited from a collaborative network of primary care clinics in the Dallas/Fort Worth, Texas, metropolitan area. Free testosterone levels were measured in plasma samples via an enzyme-linked immunosorbent assay-based method, and analyzed by simple and multiple linear regression in relationship to age, race/ethnicity, smoking, diet, exercise, obesity, diabetes, hypertension, and dyslipidemia. The participants had a mean free testosterone level of 3.1 ng/mL (standard deviation [SD] = 1.5) and mean age of 56.8 years (SD = 7.9). In simple regression analysis, free testosterone levels were associated with increased age (β = -0.04; P = .02), diet (β = -0.49; P = .05), diabetes (β = -0.9; P = .003), and hypertension (β = -0.55; P = .03) but not with race/ethnicity, smoking, exercise, obesity, or dyslipidemia. In multiple regression analysis, free testosterone values were significantly associated only with age (β = -0.05; P = .01) and diet (β = -0.72; P = .01). This study implicates diet, in addition to advanced age as a possible risk factor in the development of reduced testosterone levels. © The Author(s) 2014.
CASTELO, Paula Midori; GAVIÃO, Maria Beatriz Duarte; PEREIRA, Luciano José; BONJARDIM, Leonardo Rigoldi
2010-01-01
Objective The maintenance of normal conditions of the masticatory function is determinant for the correct growth and development of its structures. Thus, the aims of this study were to evaluate the influence of sucking habits on the presence of crossbite and its relationship with maximal bite force, facial morphology and body variables in 67 children of both genders (3.5-7 years) with primary or early mixed dentition. Material and methods The children were divided in four groups: primary-normocclusion (PN, n=19), primary-crossbite (PC, n=19), mixed-normocclusion (MN, n=13), and mixed-crossbite (MC, n=16). Bite force was measured with a pressurized tube, and facial morphology was determined by standardized frontal photographs: AFH (anterior face height) and BFW (bizygomatic facial width). Results It was observed that MC group showed lower bite force than MN, and AFH/ BFW was significantly smaller in PN than PC (t-test). Weight and height were only significantly correlated with bite force in PC group (Pearson’s correlation test). In the primary dentition, AFH/BFW and breast-feeding (at least six months) were positive and negatively associated with crossbite, respectively (multiple logistic regression). In the mixed dentition, breastfeeding and bite force showed negative associations with crossbite (univariate regression), while nonnutritive sucking (up to 3 years) associated significantly with crossbite in all groups (multiple logistic regression). Conclusions In the studied sample, sucking habits played an important role in the etiology of crossbite, which was associated with lower bite force and long-face tendency. PMID:20485925
Morioka, Noriko; Tomio, Jun; Seto, Toshikazu; Kobayashi, Yasuki
2017-01-01
In Japan, the revision of the fee schedules in 2006 introduced a new category of general care ward for more advanced care, with a higher staffing standard, a patient-to-nurse ratio of 7:1. Previous studies have suggested that these changes worsened inequalities in the geographic distribution of nurses, but there have been few quantitative studies evaluating this effect. This study aimed to investigate the association between the distribution of 7:1 beds and the geographic distribution of hospital nursing staffs. We conducted a secondary data analysis of hospital reimbursement reports in 2012 in Japan. The study units were secondary medical areas (SMAs) in Japan, which are roughly comparable to hospital service areas in the United States. The outcome variable was the nurse density per 100,000 population in each SMA. The 7:1 bed density per 100,000 population was the main independent variable. To investigate the association between the nurse density and 7:1 bed density, adjusting for other variables, we applied a multiple linear regression model, with nurse density as an outcome variable, and the bed densities by functional category of inpatient ward as independent variables, adding other variables related to socio-economic status and nurse workforce. To investigate whether 7:1 bed density made the largest contribution to the nurse density, compared to other bed densities, we estimated the standardized regression coefficients. There were 344 SMAs in the study period, of which 343 were used because of data availability. There were approximately 553,600 full time equivalent nurses working in inpatient wards in hospitals. The mean (standard deviation) of the full time equivalent nurse density was 426.4 (147.5) and for 7:1 bed density, the figures were 271.9 (185.9). The 7:1 bed density ranged from 0.0 to 1,295.5. After adjusting for the possible confounders, there were more hospital nurses in the areas with higher densities of 7:1 beds (standardized regression coefficient 0.62, 95% confidence interval 0.56-0.68). We found that the 7:1 nurse staffing standard made the largest contribution to the geographic distribution of hospital nurses, adjusted for socio-economic status and nurse workforce-related factors.
Advanced Statistics for Exotic Animal Practitioners.
Hodsoll, John; Hellier, Jennifer M; Ryan, Elizabeth G
2017-09-01
Correlation and regression assess the association between 2 or more variables. This article reviews the core knowledge needed to understand these analyses, moving from visual analysis in scatter plots through correlation, simple and multiple linear regression, and logistic regression. Correlation estimates the strength and direction of a relationship between 2 variables. Regression can be considered more general and quantifies the numerical relationships between an outcome and 1 or multiple variables in terms of a best-fit line, allowing predictions to be made. Each technique is discussed with examples and the statistical assumptions underlying their correct application. Copyright © 2017 Elsevier Inc. All rights reserved.
ERIC Educational Resources Information Center
Cole, Russell; Deke, John; Seftor, Neil
2016-01-01
The What Works Clearinghouse (WWC) maintains design standards to identify rigorous, internally valid education research. As education researchers advance new methodologies, the WWC must revise its standards to include an assessment of the new designs. Recently, the WWC has revised standards for two emerging study designs: regression discontinuity…
Use of Thematic Mapper for water quality assessment
NASA Technical Reports Server (NTRS)
Horn, E. M.; Morrissey, L. A.
1984-01-01
The evaluation of simulated TM data obtained on an ER-2 aircraft at twenty-five predesignated sample sites for mapping water quality factors such as conductivity, pH, suspended solids, turbidity, temperature, and depth, is discussed. Using a multiple regression for the seven TM bands, an equation is developed for the suspended solids. TM bands 1, 2, 3, 4, and 6 are used with logarithm conductivity in a multiple regression. The assessment of regression equations for a high coefficient of determination (R-squared) and statistical significance is considered. Confidence intervals about the mean regression point are calculated in order to assess the robustness of the regressions used for mapping conductivity, turbidity, and suspended solids, and by regressing random subsamples of sites and comparing the resultant range of R-squared, cross validation is conducted.
Relationship of aerobic and anaerobic parameters with 400 m front crawl swimming performance
Kalva-Filho, CA; Campos, EZ; Andrade, VL; Silva, ASR; Zagatto, AM; Lima, MCS
2015-01-01
The aims of the present study were to investigate the relationship of aerobic and anaerobic parameters with 400 m performance, and establish which variable better explains long distance performance in swimming. Twenty-two swimmers (19.1±1.5 years, height 173.9±10.0 cm, body mass 71.2±10.2 kg; 76.6±5.3% of 400 m world record) underwent a lactate minimum test to determine lactate minimum speed (LMS) (i.e., aerobic capacity index). Moreover, the swimmers performed a 400 m maximal effort to determine mean speed (S400m), peak oxygen uptake (V.O2PEAK) and total anaerobic contribution (CANA). The CANA was assumed as the sum of alactic and lactic contributions. Physiological parameters of 400 m were determined using the backward extrapolation technique (V.O2PEAK and alactic contributions of CANA) and blood lactate concentration analysis (lactic anaerobic contributions of CANA). The Pearson correlation test and backward multiple regression analysis were used to verify the possible correlations between the physiological indices (predictor factors) and S400m (independent variable) (p < 0.05). Values are presented as mean ± standard deviation. Significant correlations were observed between S400m (1.4±0.1 m·s-1) and LMS (1.3±0.1 m·s-1; r = 0.80), V.O2PEAK (4.5±3.9 L·min-1; r = 0.72) and CANA (4.7±1.5 L·O2; r= 0.44). The best model constructed using multiple regression analysis demonstrated that LMS and V.O2PEAK explained 85% of the 400 m performance variance. When backward multiple regression analysis was performed, CANA lost significance. Thus, the results demonstrated that both aerobic parameters (capacity and power) can be used to predict 400 m swimming performance. PMID:28479663
Multiple network-constrained regressions expand insights into influenza vaccination responses.
Avey, Stefan; Mohanty, Subhasis; Wilson, Jean; Zapata, Heidi; Joshi, Samit R; Siconolfi, Barbara; Tsang, Sui; Shaw, Albert C; Kleinstein, Steven H
2017-07-15
Systems immunology leverages recent technological advancements that enable broad profiling of the immune system to better understand the response to infection and vaccination, as well as the dysregulation that occurs in disease. An increasingly common approach to gain insights from these large-scale profiling experiments involves the application of statistical learning methods to predict disease states or the immune response to perturbations. However, the goal of many systems studies is not to maximize accuracy, but rather to gain biological insights. The predictors identified using current approaches can be biologically uninterpretable or present only one of many equally predictive models, leading to a narrow understanding of the underlying biology. Here we show that incorporating prior biological knowledge within a logistic modeling framework by using network-level constraints on transcriptional profiling data significantly improves interpretability. Moreover, incorporating different types of biological knowledge produces models that highlight distinct aspects of the underlying biology, while maintaining predictive accuracy. We propose a new framework, Logistic Multiple Network-constrained Regression (LogMiNeR), and apply it to understand the mechanisms underlying differential responses to influenza vaccination. Although standard logistic regression approaches were predictive, they were minimally interpretable. Incorporating prior knowledge using LogMiNeR led to models that were equally predictive yet highly interpretable. In this context, B cell-specific genes and mTOR signaling were associated with an effective vaccination response in young adults. Overall, our results demonstrate a new paradigm for analyzing high-dimensional immune profiling data in which multiple networks encoding prior knowledge are incorporated to improve model interpretability. The R source code described in this article is publicly available at https://bitbucket.org/kleinstein/logminer . steven.kleinstein@yale.edu or stefan.avey@yale.edu. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com
Navarrete-Muñoz, Eva María; Valera-Gran, Desirée; Garcia-de-la-Hera, Manuela; Gonzalez-Palacios, Sandra; Riaño, Isolina; Murcia, Mario; Lertxundi, Aitana; Guxens, Mònica; Tardón, Adonina; Amiano, Pilar; Vrijheid, Martine; Rebagliato, Marisa; Vioque, Jesus
2017-11-27
We investigated the association between maternal use of folic acid (FA) during pregnancy and child anthropometric measures at birth. We included 2302 mother-child pairs from a population-based birth cohort in Spain (INMA Project). FA dosages at first and third trimester of pregnancy were assessed using a specific battery questionnaire and were categorized in non-user, < 1000, 1000-4999, and ≥ 5000 µg/day. Anthropometric measures at birth (weight in grams, length and head circumference in centimetres) were obtained from medical records. Small for gestational age according to weight (SGA-w), length (SGA-l) and head circumference (SGA-hc) were defined using the 10th percentile based on Spanish standardized growth reference charts. Multiple linear and logistic regression analyses were used to explore the association between FA dosages in different stages of pregnancy and child anthropometric measures at birth. In the multiple linear regression analysis, we found a tendency for a negative association between the use of high dosages of FA (≥ 5000 µg/day) in the periconceptional period of pregnancy and weight at birth compared to mothers who were non-users of FA (β = - 73.83; 95% CI - 151.71, 4.06). In the multiple logistic regression, a greater risk of SGA-w was also evident among children whose mothers took FA dosages of 1000-4999 (OR = 2.21; 95% CI 1.17, 4.19) and of ≥ 5000 µg/day (OR = 2.32; 95% CI 1.06, 5.08) compared to mothers non-users of FA in the periconceptional period of pregnancy. Our findings suggest that a high dosage of FA (≥ 1000 µg/day) may be associated with an increased risk of SGA-w at birth.
Due to the complexity of the processes contributing to beach bacteria concentrations, many researchers rely on statistical modeling, among which multiple linear regression (MLR) modeling is most widely used. Despite its ease of use and interpretation, there may be time dependence...
Data from the Interagency Monitoring of Protected Visual Environments (IMPROVE) network are used to estimate organic mass to organic carbon (OM/OC) ratios across the United States by extending previously published multiple regression techniques. Our new methodology addresses com...
Analysis and Interpretation of Findings Using Multiple Regression Techniques
ERIC Educational Resources Information Center
Hoyt, William T.; Leierer, Stephen; Millington, Michael J.
2006-01-01
Multiple regression and correlation (MRC) methods form a flexible family of statistical techniques that can address a wide variety of different types of research questions of interest to rehabilitation professionals. In this article, we review basic concepts and terms, with an emphasis on interpretation of findings relevant to research questions…
Estimating air drying times of lumber with multiple regression
William T. Simpson
2004-01-01
In this study, the applicability of a multiple regression equation for estimating air drying times of red oak, sugar maple, and ponderosa pine lumber was evaluated. The equation allows prediction of estimated air drying times from historic weather records of temperature and relative humidity at any desired location.
Using Robust Variance Estimation to Combine Multiple Regression Estimates with Meta-Analysis
ERIC Educational Resources Information Center
Williams, Ryan
2013-01-01
The purpose of this study was to explore the use of robust variance estimation for combining commonly specified multiple regression models and for combining sample-dependent focal slope estimates from diversely specified models. The proposed estimator obviates traditionally required information about the covariance structure of the dependent…
Multiple Regression: A Leisurely Primer.
ERIC Educational Resources Information Center
Daniel, Larry G.; Onwuegbuzie, Anthony J.
Multiple regression is a useful statistical technique when the researcher is considering situations in which variables of interest are theorized to be multiply caused. It may also be useful in those situations in which the researchers is interested in studies of predictability of phenomena of interest. This paper provides an introduction to…
Using Monte Carlo Techniques to Demonstrate the Meaning and Implications of Multicollinearity
ERIC Educational Resources Information Center
Vaughan, Timothy S.; Berry, Kelly E.
2005-01-01
This article presents an in-class Monte Carlo demonstration, designed to demonstrate to students the implications of multicollinearity in a multiple regression study. In the demonstration, students already familiar with multiple regression concepts are presented with a scenario in which the "true" relationship between the response and…
ERIC Educational Resources Information Center
Bates, Reid A.; Holton, Elwood F., III; Burnett, Michael F.
1999-01-01
A case study of learning transfer demonstrates the possible effect of influential observation on linear regression analysis. A diagnostic method that tests for violation of assumptions, multicollinearity, and individual and multiple influential observations helps determine which observation to delete to eliminate bias. (SK)
A mixed-effects regression model for longitudinal multivariate ordinal data.
Liu, Li C; Hedeker, Donald
2006-03-01
A mixed-effects item response theory model that allows for three-level multivariate ordinal outcomes and accommodates multiple random subject effects is proposed for analysis of multivariate ordinal outcomes in longitudinal studies. This model allows for the estimation of different item factor loadings (item discrimination parameters) for the multiple outcomes. The covariates in the model do not have to follow the proportional odds assumption and can be at any level. Assuming either a probit or logistic response function, maximum marginal likelihood estimation is proposed utilizing multidimensional Gauss-Hermite quadrature for integration of the random effects. An iterative Fisher scoring solution, which provides standard errors for all model parameters, is used. An analysis of a longitudinal substance use data set, where four items of substance use behavior (cigarette use, alcohol use, marijuana use, and getting drunk or high) are repeatedly measured over time, is used to illustrate application of the proposed model.
Lopez, M.A.; Giovannelli, R.F.
1984-01-01
Rainfall, runoff, and water quality data were collected at nine urban watersheds in the Tampa Bay area from 1975 to 1980. Watershed drainage area ranged from 0.34 to 0.45 sq mi. Land use was mixed. Development ranged from a mostly residential watershed with a 19% impervious surface, to a commercial-residential watershed with a 61% impervious surface. Average biochemical oxygen demand concentrations of base flow at two sites and of stormwater runoff at five sites exceeded treated sewage effluent standards. Average coliform concentrations of stormwater runoff at all sites were several orders of magnitude greater than standards for Florida Class III receiving water (for recreation or propagation and management of fish and wildlife). Average concentrations of lead and zinc in stormwater runoff were consistently higher than Class III standards. Stormwater-runoff loads and base-flow concentrations of biochemical oxygen demand, chemical oxygen demand, total nitrogen, total organic nitrogen, total phosphorus, and lead were related to runoff volume, land use, urban development, and antecedent daily rainfall by multiple linear regression. Stormwater-runoff volume was related to pervious area, hydraulically connected impervious surfaces, storm rainfall, and soil-infiltration index. Base-flow daily discharge was related to drainage area and antecedent daily rainfall. The flow regression equations of this report were used to compute 1979 water-year loads of biochemical oxygen demand, chemical oxygen demand, total nitrogen, total organic nitrogen, total phosphorus , and total lead for the nine Tampa Bay area urban watersheds. (Lantz-PTT)
Statistical tools for transgene copy number estimation based on real-time PCR.
Yuan, Joshua S; Burris, Jason; Stewart, Nathan R; Mentewab, Ayalew; Stewart, C Neal
2007-11-01
As compared with traditional transgene copy number detection technologies such as Southern blot analysis, real-time PCR provides a fast, inexpensive and high-throughput alternative. However, the real-time PCR based transgene copy number estimation tends to be ambiguous and subjective stemming from the lack of proper statistical analysis and data quality control to render a reliable estimation of copy number with a prediction value. Despite the recent progresses in statistical analysis of real-time PCR, few publications have integrated these advancements in real-time PCR based transgene copy number determination. Three experimental designs and four data quality control integrated statistical models are presented. For the first method, external calibration curves are established for the transgene based on serially-diluted templates. The Ct number from a control transgenic event and putative transgenic event are compared to derive the transgene copy number or zygosity estimation. Simple linear regression and two group T-test procedures were combined to model the data from this design. For the second experimental design, standard curves were generated for both an internal reference gene and the transgene, and the copy number of transgene was compared with that of internal reference gene. Multiple regression models and ANOVA models can be employed to analyze the data and perform quality control for this approach. In the third experimental design, transgene copy number is compared with reference gene without a standard curve, but rather, is based directly on fluorescence data. Two different multiple regression models were proposed to analyze the data based on two different approaches of amplification efficiency integration. Our results highlight the importance of proper statistical treatment and quality control integration in real-time PCR-based transgene copy number determination. These statistical methods allow the real-time PCR-based transgene copy number estimation to be more reliable and precise with a proper statistical estimation. Proper confidence intervals are necessary for unambiguous prediction of trangene copy number. The four different statistical methods are compared for their advantages and disadvantages. Moreover, the statistical methods can also be applied for other real-time PCR-based quantification assays including transfection efficiency analysis and pathogen quantification.
Prediction of adult height in girls: the Beunen-Malina-Freitas method.
Beunen, Gaston P; Malina, Robert M; Freitas, Duarte L; Thomis, Martine A; Maia, José A; Claessens, Albrecht L; Gouveia, Elvio R; Maes, Hermine H; Lefevre, Johan
2011-12-01
The purpose of this study was to validate and cross-validate the Beunen-Malina-Freitas method for non-invasive prediction of adult height in girls. A sample of 420 girls aged 10-15 years from the Madeira Growth Study were measured at yearly intervals and then 8 years later. Anthropometric dimensions (lengths, breadths, circumferences, and skinfolds) were measured; skeletal age was assessed using the Tanner-Whitehouse 3 method and menarcheal status (present or absent) was recorded. Adult height was measured and predicted using stepwise, forward, and maximum R (2) regression techniques. Multiple correlations, mean differences, standard errors of prediction, and error boundaries were calculated. A sample of the Leuven Longitudinal Twin Study was used to cross-validate the regressions. Age-specific coefficients of determination (R (2)) between predicted and measured adult height varied between 0.57 and 0.96, while standard errors of prediction varied between 1.1 and 3.9 cm. The cross-validation confirmed the validity of the Beunen-Malina-Freitas method in girls aged 12-15 years, but at lower ages the cross-validation was less consistent. We conclude that the Beunen-Malina-Freitas method is valid for the prediction of adult height in girls aged 12-15 years. It is applicable to European populations or populations of European ancestry.
Montgomery, Katherine L; Vaughn, Michael G; Thompson, Sanna J; Howard, Matthew O
2013-11-01
Research on juvenile offenders has largely treated this population as a homogeneous group. However, recent findings suggest that this at-risk population may be considerably more heterogeneous than previously believed. This study compared mixture regression analyses with standard regression techniques in an effort to explain how known factors such as distress, trauma, and personality are associated with drug abuse among juvenile offenders. Researchers recruited 728 juvenile offenders from Missouri juvenile correctional facilities for participation in this study. Researchers investigated past-year substance use in relation to the following variables: demographic characteristics (gender, ethnicity, age, familial use of public assistance), antisocial behavior, and mental illness symptoms (psychopathic traits, psychiatric distress, and prior trauma). Results indicated that standard and mixed regression approaches identified significant variables related to past-year substance use among this population; however, the mixture regression methods provided greater specificity in results. Mixture regression analytic methods may help policy makers and practitioners better understand and intervene with the substance-related subgroups of juvenile offenders.
Perquier, Florence; Duroy, David; Oudinet, Camille; Maamar, Alya; Choquet, Christophe; Casalino, Enrique; Lejoyeux, Michel
2017-07-01
Among patients examined after a suicide attempt in a Parisian emergency department, we aimed to compare individual characteristics of i) first time and multiple suicide attempters, ii) attempters whose principal motive was "to die" and attempters who had any other motive. Information regarding sociodemographics, clinical characteristics, prior mental health care and outgoing referral was collected in 168 suicide attempters using a standardized form. Associations of these variables with suicide attempt repetition (yes or no) and with the motive underlying the attempt (to die or not) were examined using descriptive statistics and multivariable logistic regression models. Multiple attempters were more likely to have no occupation and to report previous mental health care: mental health follow-up, psychiatric medication or psychiatric hospitalization. The motive to die was not associated with the risk of multiple suicide attempts but related to past suicidal ideation and to some specific precipitating factors, including psychiatric disorder. Patients who intended to die were also more likely to be referred to inpatient than to outpatient psychiatric care. Multiple attempters and attempters who desire to die might represent two distinct high-risk groups regarding clinical characteristics and care pathways. They would probably not benefit from the same intervention strategies. Copyright © 2017 Elsevier Ireland Ltd. All rights reserved.
Talbert, Steven
2009-01-01
This study evaluated the association between changing physiological status (delta data) with severe injury (SI) or need for trauma center resources (TCR). Prehospital and emergency department arrival weighted RTS (RTSw) were computed for patients with complete records entered into the registry from 2002 to 2004 (n = 23,753). Physiological change was classified as unchanged, deteriorated, or improved (PreRTSw vs EDRTSw). Performance of delta data was evaluated using standard epidemiological approaches and multiple logistic regression. Deterioration status predicted SI (operating room [OR] = 1.38) and TCR (OR = 2.09). Improved status predicted TCR (OR = 1.27). Delta data independently predicted both SI and TCR.
Analysis of the thermal comfort model in an environment of metal mechanical branch.
Pinto, N M; Xavier, A A P; do Amaral, Regiane T
2012-01-01
This study aims to identify the correlation between the Predicted Mean Vote (PMV) with the thermal sensation (S) of 55 employees, establishing a linear multiple regression equation. The measurement of environmental variables followed established standards. The survey was conducted in a metal industry located in Ponta Grossa of the State of Parana in Brazil. It was applied the physical model of thermal comfort to the environmental variables and also to the subjective data on the thermal sensations of employees. The survey was conducted from May to November, 2010, with 48 measurements. This study will serve as the basis for a dissertation consisting of 72 measurements.
Jung, Juergen
2013-01-01
We explore the determinants of inspection outcomes across 1.6 million Occupational Safety and Health Agency (OSHA) audits from 1990 through 2010. We find that discretion in enforcement differs in state and federally conducted inspections. State agencies are more sensitive to local economic conditions, finding fewer standard violations and fewer serious violations as unemployment increases. Larger companies receive greater lenience in multiple dimensions. Inspector issued fines and final fines, after negotiated reductions, are both smaller during Republican presidencies. Quantile regression analysis reveals that Presidential and Congressional party affiliations have their greatest impact on the largest negotiated reductions in fines. PMID:24659856
Afantitis, Antreas; Melagraki, Georgia; Sarimveis, Haralambos; Koutentis, Panayiotis A; Markopoulos, John; Igglessi-Markopoulou, Olga
2006-08-01
A quantitative-structure activity relationship was obtained by applying Multiple Linear Regression Analysis to a series of 80 1-[2-hydroxyethoxy-methyl]-6-(phenylthio) thymine (HEPT) derivatives with significant anti-HIV activity. For the selection of the best among 37 different descriptors, the Elimination Selection Stepwise Regression Method (ES-SWR) was utilized. The resulting QSAR model (R (2) (CV) = 0.8160; S (PRESS) = 0.5680) proved to be very accurate both in training and predictive stages.
Riley, Richard D; Ensor, Joie; Jackson, Dan; Burke, Danielle L
2017-01-01
Many meta-analysis models contain multiple parameters, for example due to multiple outcomes, multiple treatments or multiple regression coefficients. In particular, meta-regression models may contain multiple study-level covariates, and one-stage individual participant data meta-analysis models may contain multiple patient-level covariates and interactions. Here, we propose how to derive percentage study weights for such situations, in order to reveal the (otherwise hidden) contribution of each study toward the parameter estimates of interest. We assume that studies are independent, and utilise a decomposition of Fisher's information matrix to decompose the total variance matrix of parameter estimates into study-specific contributions, from which percentage weights are derived. This approach generalises how percentage weights are calculated in a traditional, single parameter meta-analysis model. Application is made to one- and two-stage individual participant data meta-analyses, meta-regression and network (multivariate) meta-analysis of multiple treatments. These reveal percentage study weights toward clinically important estimates, such as summary treatment effects and treatment-covariate interactions, and are especially useful when some studies are potential outliers or at high risk of bias. We also derive percentage study weights toward methodologically interesting measures, such as the magnitude of ecological bias (difference between within-study and across-study associations) and the amount of inconsistency (difference between direct and indirect evidence in a network meta-analysis).
Magnitude and frequency of floods in Washington
Cummans, J.E.; Collings, Michael R.; Nasser, Edmund George
1975-01-01
Relations are provided to estimate the magnitude and frequency of floods on Washington streams. Annual-peak-flow data from stream gaging stations on unregulated streams having 1 years or more of record were used to determine a log-Pearson Type III frequency curve for each station. Flood magnitudes having recurrence intervals of 2, 5, i0, 25, 50, and 10years were then related to physical and climatic indices of the drainage basins by multiple-regression analysis using the Biomedical Computer Program BMDO2R. These regression relations are useful for estimating flood magnitudes of the specified recurrence intervals at ungaged or short-record sites. Separate sets of regression equations were defined for western and eastern parts of the State, and the State was further subdivided into 12 regions in which the annual floods exhibit similar flood characteristics. Peak flows are related most significantly in western Washington to drainage-area size and mean annual precipitation. In eastern Washington-they are related most significantly to drainage-area size, mean annual precipitation, and percentage of forest cover. Standard errors of estimate of the estimating relations range from 25 to 129 percent, and the smallest errors are generally associated with the more humid regions.
Nie, Z Q; Ou, Y Q; Zhuang, J; Qu, Y J; Mai, J Z; Chen, J M; Liu, X Q
2016-05-01
Conditional logistic regression analysis and unconditional logistic regression analysis are commonly used in case control study, but Cox proportional hazard model is often used in survival data analysis. Most literature only refer to main effect model, however, generalized linear model differs from general linear model, and the interaction was composed of multiplicative interaction and additive interaction. The former is only statistical significant, but the latter has biological significance. In this paper, macros was written by using SAS 9.4 and the contrast ratio, attributable proportion due to interaction and synergy index were calculated while calculating the items of logistic and Cox regression interactions, and the confidence intervals of Wald, delta and profile likelihood were used to evaluate additive interaction for the reference in big data analysis in clinical epidemiology and in analysis of genetic multiplicative and additive interactions.
Wavelet regression model in forecasting crude oil price
NASA Astrophysics Data System (ADS)
Hamid, Mohd Helmie; Shabri, Ani
2017-05-01
This study presents the performance of wavelet multiple linear regression (WMLR) technique in daily crude oil forecasting. WMLR model was developed by integrating the discrete wavelet transform (DWT) and multiple linear regression (MLR) model. The original time series was decomposed to sub-time series with different scales by wavelet theory. Correlation analysis was conducted to assist in the selection of optimal decomposed components as inputs for the WMLR model. The daily WTI crude oil price series has been used in this study to test the prediction capability of the proposed model. The forecasting performance of WMLR model were also compared with regular multiple linear regression (MLR), Autoregressive Moving Average (ARIMA) and Generalized Autoregressive Conditional Heteroscedasticity (GARCH) using root mean square errors (RMSE) and mean absolute errors (MAE). Based on the experimental results, it appears that the WMLR model performs better than the other forecasting technique tested in this study.
Staal, Jason A; Sacks, Amanda; Matheis, Robert; Collier, Lesley; Calia, Tina; Hanif, Henry; Kofman, Eugene S
2007-01-01
A randomized, controlled, single-blinded, between group study of 24 participants with moderate to severe dementia was conducted on a geriatric psychiatric unit. All participants received pharmacological therapy, occupational therapy, structured hospital environment, and were randomized to receive multi sensory behavior therapy (MSBT) or a structured activity session. Greater independence in activities of daily living (ADLs) was observed for the group treated with MSBT and standard psychiatric inpatient care on the Katz Index of Activities of Daily Living (KI-ADL; P = 0.05) than standard psychiatric inpatient care alone. The combination treatment of MSBT and standard psychiatric care also reduced agitation and apathy greater than standard psychiatric inpatient care alone as measured with the Pittsburgh Agitation Scale and the Scale for the Assessment of Negative Symptoms in Alzheimer's Disease (P = 0.05). Multiple regression analysis predicted that within the multi-sensory group, activities of daily living (KI-ADL) increased as apathy and agitation reduced (R2 = 0.42; p = 0.03). These data suggest that utilizing MSBT with standard psychiatric inpatient care may reduce apathy and agitation and additionally improve activities of daily living in hospitalized people with moderate to severe dementia more than standard care alone.
ERIC Educational Resources Information Center
Li, Spencer D.
2011-01-01
Mediation analysis in child and adolescent development research is possible using large secondary data sets. This article provides an overview of two statistical methods commonly used to test mediated effects in secondary analysis: multiple regression and structural equation modeling (SEM). Two empirical studies are presented to illustrate the…
A Simple and Convenient Method of Multiple Linear Regression to Calculate Iodine Molecular Constants
ERIC Educational Resources Information Center
Cooper, Paul D.
2010-01-01
A new procedure using a student-friendly least-squares multiple linear-regression technique utilizing a function within Microsoft Excel is described that enables students to calculate molecular constants from the vibronic spectrum of iodine. This method is advantageous pedagogically as it calculates molecular constants for ground and excited…
Conjoint Analysis: A Study of the Effects of Using Person Variables.
ERIC Educational Resources Information Center
Fraas, John W.; Newman, Isadore
Three statistical techniques--conjoint analysis, a multiple linear regression model, and a multiple linear regression model with a surrogate person variable--were used to estimate the relative importance of five university attributes for students in the process of selecting a college. The five attributes include: availability and variety of…
An Exploratory Study of Face-to-Face and Cyberbullying in Sixth Grade Students
ERIC Educational Resources Information Center
Accordino, Denise B.; Accordino, Michael P.
2011-01-01
In a pilot study, sixth grade students (N = 124) completed a questionnaire assessing students' experience with bullying and cyberbullying, demographic information, quality of parent-child relationship, and ways they have dealt with bullying/cyberbullying in the past. Two multiple regression analyses were conducted. The multiple regression analysis…
ERIC Educational Resources Information Center
Campbell, S. Duke; Greenberg, Barry
The development of a predictive equation capable of explaining a significant percentage of enrollment variability at Florida International University is described. A model utilizing trend analysis and a multiple regression approach to enrollment forecasting was adapted to investigate enrollment dynamics at the university. Four independent…
ERIC Educational Resources Information Center
Fraas, John W.; Newman, Isadore
1996-01-01
In a conjoint-analysis consumer-preference study, researchers must determine whether the product factor estimates, which measure consumer preferences, should be calculated and interpreted for each respondent or collectively. Multiple regression models can determine whether to aggregate data by examining factor-respondent interaction effects. This…
Double Cross-Validation in Multiple Regression: A Method of Estimating the Stability of Results.
ERIC Educational Resources Information Center
Rowell, R. Kevin
In multiple regression analysis, where resulting predictive equation effectiveness is subject to shrinkage, it is especially important to evaluate result replicability. Double cross-validation is an empirical method by which an estimate of invariance or stability can be obtained from research data. A procedure for double cross-validation is…
Is the standard SF-12 health survey valid and equivalent for a Chinese population?
Lam, Cindy L K; Tse, Eileen Y Y; Gandek, Barbara
2005-03-01
Chinese is the world's largest ethnic group but few health-related quality of life (HRQoL) measures have been tested on them. The aim of this study was to determine if the standard SF-12 was valid and equivalent for a Chinese population. The SF-36 data of 2410 Chinese adults randomly selected from the general population of Hong Kong (HK) were analysed. The Chinese (HK) specific SF-12 items and scoring algorithm were derived from the HK Chinese population data by multiple regressions. The SF-36 PCS and MCS scores were used as criteria to assess the content and criterion validity of the SF-12. The standard and Chinese (HK) specific SF-12 PCS and MCS scores were compared for equivalence. The standard SF-12 explained 82% and 89% of the variance of the SF-36 PCS and MCS scores, respectively, and the effect size differences between the standard SF-36 and SF-12 scores were less than 0.3. Six of the Chinese (HK) specific SF-12 items were different from those of the standard SF-12, but the effect size differences between the Chinese (HK) specific and standard SF-12 scores were mostly less than 0.3. The standard SF-12 was valid and equivalent for the Chinese, which would enable more Chinese to be included in clinical trials that measure HRQoL.
Perry, Charles A.; Wolock, David M.; Artman, Joshua C.
2004-01-01
Streamflow statistics of flow duration and peak-discharge frequency were estimated for 4,771 individual locations on streams listed on the 1999 Kansas Surface Water Register. These statistics included the flow-duration values of 90, 75, 50, 25, and 10 percent, as well as the mean flow value. Peak-discharge frequency values were estimated for the 2-, 5-, 10-, 25-, 50-, and 100-year floods. Least-squares multiple regression techniques were used, along with Tobit analyses, to develop equations for estimating flow-duration values of 90, 75, 50, 25, and 10 percent and the mean flow for uncontrolled flow stream locations. The contributing-drainage areas of 149 U.S. Geological Survey streamflow-gaging stations in Kansas and parts of surrounding States that had flow uncontrolled by Federal reservoirs and used in the regression analyses ranged from 2.06 to 12,004 square miles. Logarithmic transformations of climatic and basin data were performed to yield the best linear relation for developing equations to compute flow durations and mean flow. In the regression analyses, the significant climatic and basin characteristics, in order of importance, were contributing-drainage area, mean annual precipitation, mean basin permeability, and mean basin slope. The analyses yielded a model standard error of prediction range of 0.43 logarithmic units for the 90-percent duration analysis to 0.15 logarithmic units for the 10-percent duration analysis. The model standard error of prediction was 0.14 logarithmic units for the mean flow. Regression equations used to estimate peak-discharge frequency values were obtained from a previous report, and estimates for the 2-, 5-, 10-, 25-, 50-, and 100-year floods were determined for this report. The regression equations and an interpolation procedure were used to compute flow durations, mean flow, and estimates of peak-discharge frequency for locations along uncontrolled flow streams on the 1999 Kansas Surface Water Register. Flow durations, mean flow, and peak-discharge frequency values determined at available gaging stations were used to interpolate the regression-estimated flows for the stream locations where available. Streamflow statistics for locations that had uncontrolled flow were interpolated using data from gaging stations weighted according to the drainage area and the bias between the regression-estimated and gaged flow information. On controlled reaches of Kansas streams, the streamflow statistics were interpolated between gaging stations using only gaged data weighted by drainage area.
A normative set of 98 pairs of nonsensical pictures (droodles).
Nishimoto, Takehiko; Ueda, Takashi; Miyawaki, Kaori; Une, Yuko; Takahashi, Masaru
2010-08-01
Our purpose in the present study is to provide a normative set of nonsensical pictures known as droodles and to demonstrate the role of semantic comprehension in facilitating recall of pictorial stimuli. The set consists of 98 pairs of droodles. Experiment 1 standardized these pictorial stimuli with respect to several variables, such as appropriateness of verbal labels, relationship between two droodles, and correct recall. Appropriateness of verbal labels was rated higher for pictures presented in pairs than for pictures presented singly. Experiment 2 used the standardized set of droodles in a recall experiment similar to those of Bower, Karlin, and Dueck (1975) and others. As we expected, semantic interpretation can strongly facilitate recall. Multiple regression analysis showed that several measures had significant power of explanation for recall performance. The full set of norms and pictures from this article may be downloaded from http://brm.psychonomic-journals.org/content/supplemental.
Studies in Software Cost Model Behavior: Do We Really Understand Cost Model Performance?
NASA Technical Reports Server (NTRS)
Lum, Karen; Hihn, Jairus; Menzies, Tim
2006-01-01
While there exists extensive literature on software cost estimation techniques, industry practice continues to rely upon standard regression-based algorithms. These software effort models are typically calibrated or tuned to local conditions using local data. This paper cautions that current approaches to model calibration often produce sub-optimal models because of the large variance problem inherent in cost data and by including far more effort multipliers than the data supports. Building optimal models requires that a wider range of models be considered while correctly calibrating these models requires rejection rules that prune variables and records and use multiple criteria for evaluating model performance. The main contribution of this paper is to document a standard method that integrates formal model identification, estimation, and validation. It also documents what we call the large variance problem that is a leading cause of cost model brittleness or instability.
Nutrient intake and use of dietary supplements among US adults with disabilities.
An, Ruopeng; Chiu, Chung-Yi; Andrade, Flavia
2015-04-01
Physical, mental, social, and financial hurdles in adults with disabilities may limit their access to adequate nutrition. To examine the impact of dietary supplement use on daily total nutrient intake levels among US adults 20 years and older with disabilities. Study sample came from 2007-2008 and 2009-2010 waves of the National Health and Nutrition Examination Survey, a nationally representative repeated cross-sectional survey. Disability was classified into 5 categories using standardized indices. Nutrient intakes from foods and dietary supplements were calculated from 2 nonconsecutive 24-hour dietary recalls. Two-sample proportion tests and multiple logistic regressions were used to examine the adherence rates to the recommended daily nutrient intake levels between dietary supplement users and nonusers in each disability category. The association between sociodemographic characteristics and dietary supplement use was assessed using multiple logistic regressions, accounting for complex survey design. A substantial proportion of the US adult population with disabilities failed to meet dietary guidelines, with insufficient intakes of multiple nutrients. Over half of the US adults with disabilities used dietary supplements. Dietary supplement use was associated with higher adherence rates for vitamin A, vitamin B1, vitamin B2, vitamin B6, vitamin B12, vitamin C, vitamin D, vitamin E, calcium, copper, iron, magnesium, and zinc intake among adults with disabilities. Women, non-Hispanic Whites, older age, higher education, and higher household income were found to predict dietary supplement use. Proper use of dietary supplements under the guidance of health care providers may improve the nutritional status among adults with disabilities. Copyright © 2015 Elsevier Inc. All rights reserved.
Quality of search strategies reported in systematic reviews published in stereotactic radiosurgery.
Faggion, Clovis M; Wu, Yun-Chun; Tu, Yu-Kang; Wasiak, Jason
2016-06-01
Systematic reviews require comprehensive literature search strategies to avoid publication bias. This study aimed to assess and evaluate the reporting quality of search strategies within systematic reviews published in the field of stereotactic radiosurgery (SRS). Three electronic databases (Ovid MEDLINE(®), Ovid EMBASE(®) and the Cochrane Library) were searched to identify systematic reviews addressing SRS interventions, with the last search performed in October 2014. Manual searches of the reference lists of included systematic reviews were conducted. The search strategies of the included systematic reviews were assessed using a standardized nine-question form based on the Cochrane Collaboration guidelines and Assessment of Multiple Systematic Reviews checklist. Multiple linear regression analyses were performed to identify the important predictors of search quality. A total of 85 systematic reviews were included. The median quality score of search strategies was 2 (interquartile range = 2). Whilst 89% of systematic reviews reported the use of search terms, only 14% of systematic reviews reported searching the grey literature. Multiple linear regression analyses identified publication year (continuous variable), meta-analysis performance and journal impact factor (continuous variable) as predictors of higher mean quality scores. This study identified the urgent need to improve the quality of search strategies within systematic reviews published in the field of SRS. This study is the first to address how authors performed searches to select clinical studies for inclusion in their systematic reviews. Comprehensive and well-implemented search strategies are pivotal to reduce the chance of publication bias and consequently generate more reliable systematic review findings.
A Systematic Review of Global Drivers of Ant Elevational Diversity
Szewczyk, Tim; McCain, Christy M.
2016-01-01
Ant diversity shows a variety of patterns across elevational gradients, though the patterns and drivers have not been evaluated comprehensively. In this systematic review and reanalysis, we use published data on ant elevational diversity to detail the observed patterns and to test the predictions and interactions of four major diversity hypotheses: thermal energy, the mid-domain effect, area, and the elevational climate model. Of sixty-seven published datasets from the literature, only those with standardized, comprehensive sampling were used. Datasets included both local and regional ant diversity and spanned 80° in latitude across six biogeographical provinces. We used a combination of simulations, linear regressions, and non-parametric statistics to test multiple quantitative predictions of each hypothesis. We used an environmentally and geometrically constrained model as well as multiple regression to test their interactions. Ant diversity showed three distinct patterns across elevations: most common were hump-shaped mid-elevation peaks in diversity, followed by low-elevation plateaus and monotonic decreases in the number of ant species. The elevational climate model, which proposes that temperature and precipitation jointly drive diversity, and area were partially supported as independent drivers. Thermal energy and the mid-domain effect were not supported as primary drivers of ant diversity globally. The interaction models supported the influence of multiple drivers, though not a consistent set. In contrast to many vertebrate taxa, global ant elevational diversity patterns appear more complex, with the best environmental model contingent on precipitation levels. Differences in ecology and natural history among taxa may be crucial to the processes influencing broad-scale diversity patterns. PMID:27175999
Ayaz, Sevin; Ayaz, Ümit Yaşar
2016-01-01
We aimed to present unusual cranial FDG PET/CT findings of a 56-year-old female with multiple myeloma (MM). Plain CT images revealed a lytic lesion in the right parietal bone, filled with an oval-shaped, large, extra-axial, extradural, intracranial mass which measured 75×75×40 mm and had smooth borders. The right parietal lobe was compressed by the mass. The maximum standardized uptake value (SUV max ) of the mass lesion was 8.94 on FDG PET/CT images. Multiple lytic lesions with an increased uptake were also detected in other calvarial bones, in several vertebras and in the proximal left femur. After seven months, a control FDG PET/CT following radiotherapy and chemotherapy revealed almost complete regression of the right parietal extra-axial mass lesion. The number, size and metabolism of lytic lesions in other bones also decreased. FDG PET/CT was useful for an initial evaluation of MM lesions and was effective in monitoring the response of these lesions to therapy.
Singh, Jagmahender; Pathak, R K; Chavali, Krishnadutt H
2011-03-20
Skeletal height estimation from regression analysis of eight sternal lengths in the subjects of Chandigarh zone of Northwest India is the topic of discussion in this study. Analysis of eight sternal lengths (length of manubrium, length of mesosternum, combined length of manubrium and mesosternum, total sternal length and first four intercostals lengths of mesosternum) measured from 252 male and 91 female sternums obtained at postmortems revealed that mean cadaver stature and sternal lengths were more in North Indians and males than the South Indians and females. Except intercostal lengths, all the sternal lengths were positively correlated with stature of the deceased in both sexes (P < 0.001). The multiple regression analysis of sternal lengths was found more useful than the linear regression for stature estimation. Using multivariate regression analysis, the combined length of manubrium and mesosternum in both sexes and the length of manubrium along with 2nd and 3rd intercostal lengths of mesosternum in males were selected as best estimators of stature. Nonetheless, the stature of males can be predicted with SEE of 6.66 (R(2) = 0.16, r = 0.318) from combination of MBL+BL_3+LM+BL_2, and in females from MBL only, it can be estimated with SEE of 6.65 (R(2) = 0.10, r = 0.318), whereas from the multiple regression analysis of pooled data, stature can be known with SEE of 6.97 (R(2) = 0.387, r = 575) from the combination of MBL+LM+BL_2+TSL+BL_3. The R(2) and F-ratio were found to be statistically significant for almost all the variables in both the sexes, except 4th intercostal length in males and 2nd to 4th intercostal lengths in females. The 'major' sternal lengths were more useful than the 'minor' ones for stature estimation The universal regression analysis used by Kanchan et al. [39] when applied to sternal lengths, gave satisfactory estimates of stature for males only but female stature was comparatively better estimated from simple linear regressions. But they are not proposed for the subjects of known sex, as they underestimate the male and overestimate female stature. However, intercostal lengths were found to be the poor estimators of stature (P < 0.05). And also sternal lengths exhibit weaker correlation coefficients and higher standard errors of estimate. Copyright © 2010 Elsevier Ireland Ltd. All rights reserved.
Ridge: a computer program for calculating ridge regression estimates
Donald E. Hilt; Donald W. Seegrist
1977-01-01
Least-squares coefficients for multiple-regression models may be unstable when the independent variables are highly correlated. Ridge regression is a biased estimation procedure that produces stable estimates of the coefficients. Ridge regression is discussed, and a computer program for calculating the ridge coefficients is presented.
Zhu, Xiang; Stephens, Matthew
2017-01-01
Bayesian methods for large-scale multiple regression provide attractive approaches to the analysis of genome-wide association studies (GWAS). For example, they can estimate heritability of complex traits, allowing for both polygenic and sparse models; and by incorporating external genomic data into the priors, they can increase power and yield new biological insights. However, these methods require access to individual genotypes and phenotypes, which are often not easily available. Here we provide a framework for performing these analyses without individual-level data. Specifically, we introduce a “Regression with Summary Statistics” (RSS) likelihood, which relates the multiple regression coefficients to univariate regression results that are often easily available. The RSS likelihood requires estimates of correlations among covariates (SNPs), which also can be obtained from public databases. We perform Bayesian multiple regression analysis by combining the RSS likelihood with previously proposed prior distributions, sampling posteriors by Markov chain Monte Carlo. In a wide range of simulations RSS performs similarly to analyses using the individual data, both for estimating heritability and detecting associations. We apply RSS to a GWAS of human height that contains 253,288 individuals typed at 1.06 million SNPs, for which analyses of individual-level data are practically impossible. Estimates of heritability (52%) are consistent with, but more precise, than previous results using subsets of these data. We also identify many previously unreported loci that show evidence for association with height in our analyses. Software is available at https://github.com/stephenslab/rss. PMID:29399241
Sawle, Leanne; Freeman, Jennifer; Marsden, Jonathan
2017-04-01
Balance is a complex construct, affected by multiple components such as strength and co-ordination. However, whilst assessing an athlete's dynamic balance is an important part of clinical examination, there is no gold standard measure. The multiple single-leg hop-stabilization test is a functional test which may offer a method of evaluating the dynamic attributes of balance, but it needs to show adequate intra-tester reliability. The purpose of this study was to assess the intra-rater reliability of a dynamic balance test, the multiple single-leg hop-stabilization test on the dominant and non-dominant legs. Intra-rater reliability study. Fifteen active participants were tested twice with a 10-minute break between tests. The outcome measure was the multiple single-leg hop-stabilization test score, based on a clinically assessed numerical scoring system. Results were analysed using an Intraclass Correlations Coefficient (ICC 2,1 ) and Bland-Altman plots. Regression analyses explored relationships between test scores, leg dominance, age and training (an alpha level of p = 0.05 was selected). ICCs for intra-rater reliability were 0.85 for the dominant and non-dominant legs (confidence intervals = 0.62-0.95 and 0.61-0.95 respectively). Bland-Altman plots showed scores within two standard deviations. A significant correlation was observed between the dominant and non-dominant leg on balance scores (R 2 =0.49, p<0.05), and better balance was associated with younger participants in their non-dominant leg (R 2 =0.28, p<0.05) and their dominant leg (R 2 =0.39, p<0.05), and a higher number of hours spent training for the non-dominant leg R 2 =0.37, p<0.05). The multiple single-leg hop-stabilisation test demonstrated strong intra-tester reliability with active participants. Younger participants who trained more, have better balance scores. This test may be a useful measure for evaluating the dynamic attributes of balance. 3.
An empirical study using permutation-based resampling in meta-regression
2012-01-01
Background In meta-regression, as the number of trials in the analyses decreases, the risk of false positives or false negatives increases. This is partly due to the assumption of normality that may not hold in small samples. Creation of a distribution from the observed trials using permutation methods to calculate P values may allow for less spurious findings. Permutation has not been empirically tested in meta-regression. The objective of this study was to perform an empirical investigation to explore the differences in results for meta-analyses on a small number of trials using standard large sample approaches verses permutation-based methods for meta-regression. Methods We isolated a sample of randomized controlled clinical trials (RCTs) for interventions that have a small number of trials (herbal medicine trials). Trials were then grouped by herbal species and condition and assessed for methodological quality using the Jadad scale, and data were extracted for each outcome. Finally, we performed meta-analyses on the primary outcome of each group of trials and meta-regression for methodological quality subgroups within each meta-analysis. We used large sample methods and permutation methods in our meta-regression modeling. We then compared final models and final P values between methods. Results We collected 110 trials across 5 intervention/outcome pairings and 5 to 10 trials per covariate. When applying large sample methods and permutation-based methods in our backwards stepwise regression the covariates in the final models were identical in all cases. The P values for the covariates in the final model were larger in 78% (7/9) of the cases for permutation and identical for 22% (2/9) of the cases. Conclusions We present empirical evidence that permutation-based resampling may not change final models when using backwards stepwise regression, but may increase P values in meta-regression of multiple covariates for relatively small amount of trials. PMID:22587815
Köhler, Thorsten; Janssen, Christian; Plath, Sven-Christoph; Reese, Jens Peter; Lay, Jann; Steinhausen, Simone; Gloede, Tristan; Kowalski, Christoph; Schulz-Nieswandt, Frank; Pfaff, Holger
2010-12-01
The present study aims to measure the determinants of the innovative climate in German banks with a focus on workplace health management (WHM). We analyze the determinants of innovative climate with multiple regressions using a dataset based on standardized telephone interviews conducted with health promotion experts from 198 randomly selected German banks. The regression analysis provided a good explanation of the variance in the dependent variable (R² = 55%). Communication climate (β = 0.55; p < 0.001), social capital (β = 0.21; p < 0.01), the establishment of a WHM program (β = 0.13; p < 0.05) as well as company size (β = 0.15; p < 0.01) were found to have a significant impact on an organization's innovative climate. In order to foster an innovation-friendly climate, organizations should establish shared values. An active step in this direction involves strengthening the organizations' social capital and communication climate through trustworthy management decisions such as the implementation of a WHM program.
Maggin, Daniel M; Swaminathan, Hariharan; Rogers, Helen J; O'Keeffe, Breda V; Sugai, George; Horner, Robert H
2011-06-01
A new method for deriving effect sizes from single-case designs is proposed. The strategy is applicable to small-sample time-series data with autoregressive errors. The method uses Generalized Least Squares (GLS) to model the autocorrelation of the data and estimate regression parameters to produce an effect size that represents the magnitude of treatment effect from baseline to treatment phases in standard deviation units. In this paper, the method is applied to two published examples using common single case designs (i.e., withdrawal and multiple-baseline). The results from these studies are described, and the method is compared to ten desirable criteria for single-case effect sizes. Based on the results of this application, we conclude with observations about the use of GLS as a support to visual analysis, provide recommendations for future research, and describe implications for practice. Copyright © 2011 Society for the Study of School Psychology. Published by Elsevier Ltd. All rights reserved.
Estimating a child's age from an image using whole body proportions.
Lucas, Teghan; Henneberg, Maciej
2017-09-01
The use and distribution of child pornography is an increasing problem. Forensic anthropologists are often asked to estimate a child's age from a photograph. Previous studies have attempted to estimate the age of children from photographs using ratios of the face. Here, we propose to include body measurement ratios into age estimates. A total of 1603 boys and 1833 girls aged 5-16 years were measured over a 10-year period. They are 'Cape Coloured' children from South Africa. Their age was regressed on ratios derived from anthropometric measurements of the head as well as the body. Multiple regression equations including four ratios for each sex (head height to shoulder and hip width, knee width, leg length and trunk length) have a standard error of 1.6-1.7 years. The error is of the same order as variation of differences between biological and chronological ages of the children. Thus, the error cannot be minimised any further as it is a direct reflection of a naturally occurring phenomenon.
Meng, Yilin; Roux, Benoît
2015-08-11
The weighted histogram analysis method (WHAM) is a standard protocol for postprocessing the information from biased umbrella sampling simulations to construct the potential of mean force with respect to a set of order parameters. By virtue of the WHAM equations, the unbiased density of state is determined by satisfying a self-consistent condition through an iterative procedure. While the method works very effectively when the number of order parameters is small, its computational cost grows rapidly in higher dimension. Here, we present a simple and efficient alternative strategy, which avoids solving the self-consistent WHAM equations iteratively. An efficient multivariate linear regression framework is utilized to link the biased probability densities of individual umbrella windows and yield an unbiased global free energy landscape in the space of order parameters. It is demonstrated with practical examples that free energy landscapes that are comparable in accuracy to WHAM can be generated at a small fraction of the cost.
Yadav, Dharmendra Kumar; Kalani, Komal; Khan, Feroz; Srivastava, Santosh Kumar
2013-12-01
For the prediction of anticancer activity of glycyrrhetinic acid (GA-1) analogs against the human lung cancer cell line (A-549), a QSAR model was developed by forward stepwise multiple linear regression methodology. The regression coefficient (r(2)) and prediction accuracy (rCV(2)) of the QSAR model were taken 0.94 and 0.82, respectively in terms of correlation. The QSAR study indicates that the dipole moments, size of smallest ring, amine counts, hydroxyl and nitro functional groups are correlated well with cytotoxic activity. The docking studies showed high binding affinity of the predicted active compounds against the lung cancer target EGFR. These active glycyrrhetinic acid derivatives were then semi-synthesized, characterized and in-vitro tested for anticancer activity. The experimental results were in agreement with the predicted values and the ethyl oxalyl derivative of GA-1 (GA-3) showed equal cytotoxic activity to that of standard anticancer drug paclitaxel.
2015-01-01
The weighted histogram analysis method (WHAM) is a standard protocol for postprocessing the information from biased umbrella sampling simulations to construct the potential of mean force with respect to a set of order parameters. By virtue of the WHAM equations, the unbiased density of state is determined by satisfying a self-consistent condition through an iterative procedure. While the method works very effectively when the number of order parameters is small, its computational cost grows rapidly in higher dimension. Here, we present a simple and efficient alternative strategy, which avoids solving the self-consistent WHAM equations iteratively. An efficient multivariate linear regression framework is utilized to link the biased probability densities of individual umbrella windows and yield an unbiased global free energy landscape in the space of order parameters. It is demonstrated with practical examples that free energy landscapes that are comparable in accuracy to WHAM can be generated at a small fraction of the cost. PMID:26574437
Thermal sensation and comfort during exposure to local airflow to face or legs.
Yamashita, Kazuaki; Matsuo, Juntaro; Tochihara, Yutaka; Kondo, Youichiro; Takayama, Shizuka; Nagayama, Hiroki
2005-01-01
The present study examined the contribution of local airflow temperature to thermal sensation and comfort in humans. Eight healthy male students were exposed to local airflow to their faces (summer condition) or legs (winter condition) for 30 minutes. Local airflow temperature (Tf) was maintained at 18 degrees C to 36 degrees C, and ambient temperature (Ta) was maintained at 17.4 degrees C to 31.4 degrees C. Each subject was exposed to 16 conditions chosen from the combination of Tf and Ta. Based on the results of multiple regression analysis, the standardized partial regression coefficient of Tf and Ta were determined to be 0.93 and 0.13 in the summer condition, and 0.71 and 0.36 in the winter condition at the end of the exposure. Also, thermal comfort was observed to depend closely on the interrelation between Tf and Ta. The present data suggested that local airflow temperature is an important thermal factor regarding thermal sensation and comfort.
NASA Astrophysics Data System (ADS)
Kiss, I.; Cioată, V. G.; Ratiu, S. A.; Rackov, M.; Penčić, M.
2018-01-01
Multivariate research is important in areas of cast-iron brake shoes manufacturing, because many variables interact with each other simultaneously. This article focuses on expressing the multiple linear regression model related to the hardness assurance by the chemical composition of the phosphorous cast irons destined to the brake shoes, having in view that the regression coefficients will illustrate the unrelated contributions of each independent variable towards predicting the dependent variable. In order to settle the multiple correlations between the hardness of the cast-iron brake shoes, and their chemical compositions several regression equations has been proposed. Is searched a mathematical solution which can determine the optimum chemical composition for the hardness desirable values. Starting from the above-mentioned affirmations two new statistical experiments are effectuated related to the values of Phosphorus [P], Manganese [Mn] and Silicon [Si]. Therefore, the regression equations, which describe the mathematical dependency between the above-mentioned elements and the hardness, are determined. As result, several correlation charts will be revealed.
Enhanced ID Pit Sizing Using Multivariate Regression Algorithm
NASA Astrophysics Data System (ADS)
Krzywosz, Kenji
2007-03-01
EPRI is funding a program to enhance and improve the reliability of inside diameter (ID) pit sizing for balance-of plant heat exchangers, such as condensers and component cooling water heat exchangers. More traditional approaches to ID pit sizing involve the use of frequency-specific amplitude or phase angles. The enhanced multivariate regression algorithm for ID pit depth sizing incorporates three simultaneous input parameters of frequency, amplitude, and phase angle. A set of calibration data sets consisting of machined pits of various rounded and elongated shapes and depths was acquired in the frequency range of 100 kHz to 1 MHz for stainless steel tubing having nominal wall thickness of 0.028 inch. To add noise to the acquired data set, each test sample was rotated and test data acquired at 3, 6, 9, and 12 o'clock positions. The ID pit depths were estimated using a second order and fourth order regression functions by relying on normalized amplitude and phase angle information from multiple frequencies. Due to unique damage morphology associated with the microbiologically-influenced ID pits, it was necessary to modify the elongated calibration standard-based algorithms by relying on the algorithm developed solely from the destructive sectioning results. This paper presents the use of transformed multivariate regression algorithm to estimate ID pit depths and compare the results with the traditional univariate phase angle analysis. Both estimates were then compared with the destructive sectioning results.
Koch, Cosima; Posch, Andreas E; Goicoechea, Héctor C; Herwig, Christoph; Lendl, Bernhard
2014-01-07
This paper presents the quantification of Penicillin V and phenoxyacetic acid, a precursor, inline during Pencillium chrysogenum fermentations by FTIR spectroscopy and partial least squares (PLS) regression and multivariate curve resolution - alternating least squares (MCR-ALS). First, the applicability of an attenuated total reflection FTIR fiber optic probe was assessed offline by measuring standards of the analytes of interest and investigating matrix effects of the fermentation broth. Then measurements were performed inline during four fed-batch fermentations with online HPLC for the determination of Penicillin V and phenoxyacetic acid as reference analysis. PLS and MCR-ALS models were built using these data and validated by comparison of single analyte spectra with the selectivity ratio of the PLS models and the extracted spectral traces of the MCR-ALS models, respectively. The achieved root mean square errors of cross-validation for the PLS regressions were 0.22 g L(-1) for Penicillin V and 0.32 g L(-1) for phenoxyacetic acid and the root mean square errors of prediction for MCR-ALS were 0.23 g L(-1) for Penicillin V and 0.15 g L(-1) for phenoxyacetic acid. A general work-flow for building and assessing chemometric regression models for the quantification of multiple analytes in bioprocesses by FTIR spectroscopy is given. Copyright © 2013 The Authors. Published by Elsevier B.V. All rights reserved.
ERIC Educational Resources Information Center
Porter, Kristin E.; Reardon, Sean F.; Unlu, Fatih; Bloom, Howard S.; Robinson-Cimpian, Joseph P.
2014-01-01
A valuable extension of the single-rating regression discontinuity design (RDD) is a multiple-rating RDD (MRRDD). To date, four main methods have been used to estimate average treatment effects at the multiple treatment frontiers of an MRRDD: the "surface" method, the "frontier" method, the "binding-score" method, and…
ERIC Educational Resources Information Center
Muller, Veronica; Brooks, Jessica; Tu, Wei-Mo; Moser, Erin; Lo, Chu-Ling; Chan, Fong
2015-01-01
Purpose: The main objective of this study was to determine the extent to which physical and cognitive-affective factors are associated with fibromyalgia (FM) fatigue. Method: A quantitative descriptive design using correlation techniques and multiple regression analysis. The participants consisted of 302 members of the National Fibromyalgia &…
ERIC Educational Resources Information Center
Choi, Kilchan
2011-01-01
This report explores a new latent variable regression 4-level hierarchical model for monitoring school performance over time using multisite multiple-cohorts longitudinal data. This kind of data set has a 4-level hierarchical structure: time-series observation nested within students who are nested within different cohorts of students. These…
ERIC Educational Resources Information Center
Richter, Tobias
2006-01-01
Most reading time studies using naturalistic texts yield data sets characterized by a multilevel structure: Sentences (sentence level) are nested within persons (person level). In contrast to analysis of variance and multiple regression techniques, hierarchical linear models take the multilevel structure of reading time data into account. They…
Some Applied Research Concerns Using Multiple Linear Regression Analysis.
ERIC Educational Resources Information Center
Newman, Isadore; Fraas, John W.
The intention of this paper is to provide an overall reference on how a researcher can apply multiple linear regression in order to utilize the advantages that it has to offer. The advantages and some concerns expressed about the technique are examined. A number of practical ways by which researchers can deal with such concerns as…
A Spreadsheet Tool for Learning the Multiple Regression F-Test, T-Tests, and Multicollinearity
ERIC Educational Resources Information Center
Martin, David
2008-01-01
This note presents a spreadsheet tool that allows teachers the opportunity to guide students towards answering on their own questions related to the multiple regression F-test, the t-tests, and multicollinearity. The note demonstrates approaches for using the spreadsheet that might be appropriate for three different levels of statistics classes,…
ERIC Educational Resources Information Center
Anderson, Joan L.
2006-01-01
Data from graduate student applications at a large Western university were used to determine which factors were the best predictors of success in graduate school, as defined by cumulative graduate grade point average. Two statistical models were employed and compared: artificial neural networking and simultaneous multiple regression. Both models…
ERIC Educational Resources Information Center
Preacher, Kristopher J.; Curran, Patrick J.; Bauer, Daniel J.
2006-01-01
Simple slopes, regions of significance, and confidence bands are commonly used to evaluate interactions in multiple linear regression (MLR) models, and the use of these techniques has recently been extended to multilevel or hierarchical linear modeling (HLM) and latent curve analysis (LCA). However, conducting these tests and plotting the…
Regression Models for the Analysis of Longitudinal Gaussian Data from Multiple Sources
O’Brien, Liam M.; Fitzmaurice, Garrett M.
2006-01-01
We present a regression model for the joint analysis of longitudinal multiple source Gaussian data. Longitudinal multiple source data arise when repeated measurements are taken from two or more sources, and each source provides a measure of the same underlying variable and on the same scale. This type of data generally produces a relatively large number of observations per subject; thus estimation of an unstructured covariance matrix often may not be possible. We consider two methods by which parsimonious models for the covariance can be obtained for longitudinal multiple source data. The methods are illustrated with an example of multiple informant data arising from a longitudinal interventional trial in psychiatry. PMID:15726666
Interpretation of commonly used statistical regression models.
Kasza, Jessica; Wolfe, Rory
2014-01-01
A review of some regression models commonly used in respiratory health applications is provided in this article. Simple linear regression, multiple linear regression, logistic regression and ordinal logistic regression are considered. The focus of this article is on the interpretation of the regression coefficients of each model, which are illustrated through the application of these models to a respiratory health research study. © 2013 The Authors. Respirology © 2013 Asian Pacific Society of Respirology.
Applied Multiple Linear Regression: A General Research Strategy
ERIC Educational Resources Information Center
Smith, Brandon B.
1969-01-01
Illustrates some of the basic concepts and procedures for using regression analysis in experimental design, analysis of variance, analysis of covariance, and curvilinear regression. Applications to evaluation of instruction and vocational education programs are illustrated. (GR)
Wood, Molly S.; Fosness, Ryan L.; Skinner, Kenneth D.; Veilleux, Andrea G.
2016-06-27
The U.S. Geological Survey, in cooperation with the Idaho Transportation Department, updated regional regression equations to estimate peak-flow statistics at ungaged sites on Idaho streams using recent streamflow (flow) data and new statistical techniques. Peak-flow statistics with 80-, 67-, 50-, 43-, 20-, 10-, 4-, 2-, 1-, 0.5-, and 0.2-percent annual exceedance probabilities (1.25-, 1.50-, 2.00-, 2.33-, 5.00-, 10.0-, 25.0-, 50.0-, 100-, 200-, and 500-year recurrence intervals, respectively) were estimated for 192 streamgages in Idaho and bordering States with at least 10 years of annual peak-flow record through water year 2013. The streamgages were selected from drainage basins with little or no flow diversion or regulation. The peak-flow statistics were estimated by fitting a log-Pearson type III distribution to records of annual peak flows and applying two additional statistical methods: (1) the Expected Moments Algorithm to help describe uncertainty in annual peak flows and to better represent missing and historical record; and (2) the generalized Multiple Grubbs Beck Test to screen out potentially influential low outliers and to better fit the upper end of the peak-flow distribution. Additionally, a new regional skew was estimated for the Pacific Northwest and used to weight at-station skew at most streamgages. The streamgages were grouped into six regions (numbered 1_2, 3, 4, 5, 6_8, and 7, to maintain consistency in region numbering with a previous study), and the estimated peak-flow statistics were related to basin and climatic characteristics to develop regional regression equations using a generalized least squares procedure. Four out of 24 evaluated basin and climatic characteristics were selected for use in the final regional peak-flow regression equations.Overall, the standard error of prediction for the regional peak-flow regression equations ranged from 22 to 132 percent. Among all regions, regression model fit was best for region 4 in west-central Idaho (average standard error of prediction=46.4 percent; pseudo-R2>92 percent) and region 5 in central Idaho (average standard error of prediction=30.3 percent; pseudo-R2>95 percent). Regression model fit was poor for region 7 in southern Idaho (average standard error of prediction=103 percent; pseudo-R2<78 percent) compared to other regions because few streamgages in region 7 met the criteria for inclusion in the study, and the region’s semi-arid climate and associated variability in precipitation patterns causes substantial variability in peak flows.A drainage area ratio-adjustment method, using ratio exponents estimated using generalized least-squares regression, was presented as an alternative to the regional regression equations if peak-flow estimates are desired at an ungaged site that is close to a streamgage selected for inclusion in this study. The alternative drainage area ratio-adjustment method is appropriate for use when the drainage area ratio between the ungaged and gaged sites is between 0.5 and 1.5.The updated regional peak-flow regression equations had lower total error (standard error of prediction) than all regression equations presented in a 1982 study and in four of six regions presented in 2002 and 2003 studies in Idaho. A more extensive streamgage screening process used in the current study resulted in fewer streamgages used in the current study than in the 1982, 2002, and 2003 studies. Fewer streamgages used and the selection of different explanatory variables were likely causes of increased error in some regions compared to previous studies, but overall, regional peak‑flow regression model fit was generally improved for Idaho. The revised statistical procedures and increased streamgage screening applied in the current study most likely resulted in a more accurate representation of natural peak-flow conditions.The updated, regional peak-flow regression equations will be integrated in the U.S. Geological Survey StreamStats program to allow users to estimate basin and climatic characteristics and peak-flow statistics at ungaged locations of interest. StreamStats estimates peak-flow statistics with quantifiable certainty only when used at sites with basin and climatic characteristics within the range of input variables used to develop the regional regression equations. Both the regional regression equations and StreamStats should be used to estimate peak-flow statistics only in naturally flowing, relatively unregulated streams without substantial local influences to flow, such as large seeps, springs, or other groundwater-surface water interactions that are not widespread or characteristic of the respective region.
The impact of depression on fatigue in patients with haemodialysis: a correlational study.
Bai, Yu-Ling; Lai, Liu-Yuan; Lee, Bih-O; Chang, Yong-Yuan; Chiou, Chou-Ping
2015-07-01
To investigate the fatigue levels and important fatigue predictors for patients undergoing haemodialysis. Fatigue is a common symptom for haemodialysis patients. With its debilitating and distressing effects, it impacts patients in terms of their quality of life while also increasing their mortality rate. A descriptive correlational study. Convenience sampling was conducted at six chosen haemodialysis centres in Southern Taiwan. Data were collected via a structured questionnaire from 193 haemodialysis patients. The scales involved in this study were socio-demographic details, the Center for Epidemiologic Studies Depression Scale, and the Fatigue Scale for haemodialysis patients. Data analysis included percentages, means, standard deviations and hierarchical multiple regression analysis. The fatigue level for haemodialysis patients was in the moderate range. Results from the hierarchical multiple regression analysis indicated that age, employment status, types of medications, physical activity and depression were significant. Of those variables, depression had the greatest impact on the patients' fatigue level, accounting for up to 30·6% of the explanatory power. The total explanatory power of the regression model was 64·2%. This study determined that for haemodialysis patients, unemployment, increased age, taking more medications or lower exercise frequencies resulted in more severe depression, which translated in turn to higher levels of fatigue. Among all these factors, depression had the greatest impact on the patients' fatigue levels. Not only is this finding beneficial to future studies on fatigue as a source of reference, it is also helpful in our understanding of important predictors relating to fatigue in the everyday lives of haemodialysis patients. It is recommended that when caring for fatigued patients, more care should be dedicated to their psychological states, and assistance should be provided in a timely way so as to reduce the amount of fatigue suffered. © 2015 John Wiley & Sons Ltd.
Yiming, Gulinuer; Zhou, Xianhui; Lv, Wenkui; Peng, Yi; Zhang, Wenhui; Cheng, Xinchun; Li, Yaodong; Xing, Qiang; Zhang, Jianghua; Zhou, Qina; Zhang, Ling; Lu, Yanmei; Wang, Hongli; Tang, Baopeng
2017-01-01
Brachial-ankle pulse wave velocity (baPWV), a direct measure of aortic stiffness, has increasingly become an important assessment for cardiovascular risk. The present study established the reference and normal values of baPWV in a Central Asia population in Xinjiang, China. We recruited participants from a central Asia population in Xinjiang, China. We performed multiple regression analysis to investigate the determinants of baPWV. The median and 10th-90th percentiles were calculated to establish the reference and normal values based on these categories. In total, 5,757 Han participants aged 15-88 years were included in the present study. Spearman correlation analysis showed that age (r = 0.587, p < 0.001) and mean blood pressure (MBP, r = 0.599, p <0.001) were the major factors influencing the values of baPWV in the reference population. Furthermore, in the multiple linear regression analysis, the standardized regression coefficients of age (0.445) and MBP (0.460) were much higher than those of body mass index, triglyceride, and glycemia (-0.054, 0.035, and 0.033, respectively). In the covariance analysis, after adjustment for age and MBP, only diabetes was the significant independent determinant of baPWV (p = 0.009). Thus, participants with diabetes were excluded from the reference value population. The reference values ranged from 14.3 to 25.2 m/s, and the normal values ranged from 13.9 to 21.2 m/s. This is the first study that has established the reference and normal values for baPWV according to age and blood pressure in a Central Asia population.
Takagi, Yukinori; Sumi, Misa; Nakamura, Hideki; Sato, Shuntaro; Kawakami, Atsushi; Nakamura, Takashi
2016-02-01
To evaluate ultrasonography (US) grading of salivary gland disease as a predictor of treatment efficacy for impaired salivary function in xerostomia patients with or without Sjögren's syndrome (SS). We retrospectively analysed the prognostic importance of salivary US grading in 317 patients (168 with SS and 149 without SS). US images of the parotid and submandibular glands in each patient were individually categorized into grades 0-4 based on the extent of damage to the gland; and the sum total grade of the two gland types on either side was assigned a US score of 0-8 for each patient. The relative importance of US score and demographic and clinical variables was assessed using stepwise multiple regression analysis after various durations of xerostomia treatment. Multiple regression analysis indicated that the baseline US score before treatment was the most important factor [standardized regression coefficient (β) = -0.523, t-statistic (t) = -7.967, P < 0.001] in predicting negative outcomes in SS patients. Treatment duration (β = 0.277, t = 4.225, P < 0.001) was also a significant but less important positive variable. On the other hand, US grading did not effectively predict treatment outcomes in non-SS patients, with treatment duration (β = 0.199, t = 2.486, P = 0.014) and baseline salivary flow rate before treatment (β = -0.172, t = -2.159, P = 0.032) being significant but weak predictors of positive and negative outcome, respectively. Salivary gland US grading may help to predict outcomes of treatment for impaired salivary function in patients with SS. © The Author 2015. Published by Oxford University Press on behalf of the British Society for Rheumatology. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ozaki, Toshiro, E-mail: ganronbun@amail.plala.or.jp; Seki, Hiroshi; Shiina, Makoto
2009-09-15
The purpose of the present study was to elucidate a method for predicting the intrahepatic arteriovenous shunt rate from computed tomography (CT) images and biochemical data, instead of from arterial perfusion scintigraphy, because adverse exacerbated systemic effects may be induced in cases where a high shunt rate exists. CT and arterial perfusion scintigraphy were performed in patients with liver metastases from gastric or colorectal cancer. Biochemical data and tumor marker levels of 33 enrolled patients were measured. The results were statistically verified by multiple regression analysis. The total metastatic hepatic tumor volume (V{sub metastasized}), residual hepatic parenchyma volume (V{sub residual};more » calculated from CT images), and biochemical data were treated as independent variables; the intrahepatic arteriovenous (IHAV) shunt rate (calculated from scintigraphy) was treated as a dependent variable. The IHAV shunt rate was 15.1 {+-} 11.9%. Based on the correlation matrixes, the best correlation coefficient of 0.84 was established between the IHAV shunt rate and V{sub metastasized} (p < 0.01). In the multiple regression analysis with the IHAV shunt rate as the dependent variable, the coefficient of determination (R{sup 2}) was 0.75, which was significant at the 0.1% level with two significant independent variables (V{sub metastasized} and V{sub residual}). The standardized regression coefficients ({beta}) of V{sub metastasized} and V{sub residual} were significant at the 0.1 and 5% levels, respectively. Based on this result, we can obtain a predicted value of IHAV shunt rate (p < 0.001) using CT images. When a high shunt rate was predicted, beneficial and consistent clinical monitoring can be initiated in, for example, hepatic arterial infusion chemotherapy.« less
Botto, Fernando; Obregon, Sebastian; Rubinstein, Fernando; Scuteri, Angelo; Nilsson, Peter M; Kotliar, Carol
2018-03-01
The main objective was to estimate the frequency of early vascular aging (EVA) in a sample of subjects from Latin America, with emphasis in young adults. We included 1416 subjects from 12 countries in Latin America who provided information about lifestyle, cardiovascular risk factors (CVRF), and anthropometrics. We measured pulse wave velocity (PWV) as a marker of arterial stiffness, and blood pressure (BP) using an oscillometric device (Mobil-O-Graph). To determine the frequency of EVA, we used multiple linear regression to estimate each subject's PWV expected for his/her age and systolic BP, and compared with observed values to obtain standardized residuals (z-scores). We defined EVA when z-score was ≥1.96. Finally, a multivariable logistic regression analysis was performed to determine baseline characteristics associated with EVA. Mean age was 49.9 ± 15.5 years, male gender was 50.3%. Mean PWV was 7.52 m/s (SD 1.97), mean systolic BP was 125.3 mmHg (SD 16.7) and mean diastolic BP was 78.9 mmHg (SD 12.2). The frequency of EVA was 5.7% in the total population, 9.8% in adults of 40 years or less and 18.7% in those 30 years or less. In these young adults, multiple logistic regression analyses demonstrated that dyslipidemia and hypertension showed an independent association with EVA, and smoking a borderline association (p = 0.07). In conclusion, the frequency of EVA in a sample from Latin America was around 6%, with higher rates in young adults. These results would support the search of CVRF and EVA during early adulthood.
Kikui, Miki; Kida, Momoyo; Kosaka, Takayuki; Yamamoto, Masaaki; Yoshimuta, Yoko; Yasui, Sakae; Nokubi, Takashi; Maeda, Yoshinobu; Kokubo, Yoshihiro; Watanabe, Makoto; Miyamoto, Yoshihiro
2015-01-01
Abstract There are numerous reports on the relationship between regular utilization of dental care services and oral health, but most are based on questionnaires and subjective evaluation. Few have objectively evaluated masticatory performance and its relationship to utilization of dental care services. The purpose of this study was to identify the effect of regular utilization of dental services on masticatory performance. The subjects consisted of 1804 general residents of Suita City, Osaka Prefecture (760 men and 1044 women, mean age 66.5 ± 7.9 years). Regular utilization of dental services and oral hygiene habits (frequency of toothbrushing and use of interdental aids) was surveyed, and periodontal status, occlusal support, and masticatory performance were measured. Masticatory performance was evaluated by a chewing test using gummy jelly. The correlation between age, sex, regular dental utilization, oral hygiene habits, periodontal status or occlusal support, and masticatory performance was analyzed using Spearman's correlation test and t‐test. In addition, multiple linear regression analysis was carried out to investigate the relationship of regular dental utilization with masticatory performance after controlling for other factors. Masticatory performance was significantly correlated to age when using Spearman's correlation test, and to regular dental utilization, periodontal status, or occlusal support with t‐test. Multiple linear regression analysis showed that regular utilization of dental services was significantly related to masticatory performance even after adjusting for age, sex, oral hygiene habits, periodontal status, and occlusal support (standardized partial regression coefficient β = 0.055). These findings suggested that the regular utilization of dental care services is an important factor influencing masticatory performance in a Japanese urban population. PMID:29744141
Occupational injuries in Italy: risk factors and long term trend (1951-98)
Fabiano, B; Curro, F; Pastorino, R
2001-01-01
OBJECTIVES—Trends in the rates of total injuries and fatal accidents in the different sectors of Italian industries were explored during the period 1951-98. Causes and dynamics of injury were also studied for setting priorities for improving safety standards. METHODS—Data on occupational injuries from the National Organisation for Labour Injury Insurance were combined with data from the State Statistics Institute to highlight the interaction between the injury frequency index trend and the production cycle—that is, the evolution of industrial production throughout the years. Multiple regression with log transformed rates was adopted to model the trends of occupational fatalities for each industrial group. RESULTS—The ratios between the linked indices of injury frequency and industrial production showed a good correlation over the whole period. A general decline in injuries was found across all sectors, with values ranging from 79.86% in the energy group to 23.32% in the textile group. In analysing fatalities, the trend seemed to be more clearly decreasing than the trend of total injuries, including temporary and permanent disabilities; the fatalities showed an exponential decrease according to multiple regression, with an annual decline equal to 4.42%. CONCLUSIONS—The overall probability of industrial fatal accidents in Italy tended to decrease exponentially by year. The most effective actions in preventing injuries were directed towards fatal accidents. By analysing the rates of fatal accident in the different sectors, appropriate targets and priorities for increased strategies to prevent injuries can be suggested. The analysis of the dynamics and the material causes of injuries showed that still more consideration should be given to human and organisational factors. Keywords: labour injuries; severity; regression model PMID:11303083
Kikui, Miki; Ono, Takahiro; Kida, Momoyo; Kosaka, Takayuki; Yamamoto, Masaaki; Yoshimuta, Yoko; Yasui, Sakae; Nokubi, Takashi; Maeda, Yoshinobu; Kokubo, Yoshihiro; Watanabe, Makoto; Miyamoto, Yoshihiro
2015-12-01
There are numerous reports on the relationship between regular utilization of dental care services and oral health, but most are based on questionnaires and subjective evaluation. Few have objectively evaluated masticatory performance and its relationship to utilization of dental care services. The purpose of this study was to identify the effect of regular utilization of dental services on masticatory performance. The subjects consisted of 1804 general residents of Suita City, Osaka Prefecture (760 men and 1044 women, mean age 66.5 ± 7.9 years). Regular utilization of dental services and oral hygiene habits (frequency of toothbrushing and use of interdental aids) was surveyed, and periodontal status, occlusal support, and masticatory performance were measured. Masticatory performance was evaluated by a chewing test using gummy jelly. The correlation between age, sex, regular dental utilization, oral hygiene habits, periodontal status or occlusal support, and masticatory performance was analyzed using Spearman's correlation test and t -test. In addition, multiple linear regression analysis was carried out to investigate the relationship of regular dental utilization with masticatory performance after controlling for other factors. Masticatory performance was significantly correlated to age when using Spearman's correlation test, and to regular dental utilization, periodontal status, or occlusal support with t -test. Multiple linear regression analysis showed that regular utilization of dental services was significantly related to masticatory performance even after adjusting for age, sex, oral hygiene habits, periodontal status, and occlusal support (standardized partial regression coefficient β = 0.055). These findings suggested that the regular utilization of dental care services is an important factor influencing masticatory performance in a Japanese urban population.
Correlates and predictors of missed nursing care in hospitals.
Bragadóttir, Helga; Kalisch, Beatrice J; Tryggvadóttir, Gudný Bergthora
2017-06-01
To identify the contribution of hospital, unit, staff characteristics, staffing adequacy and teamwork to missed nursing care in Iceland hospitals. A recently identified quality indicator for nursing care and patient safety is missed nursing care defined as any standard, required nursing care omitted or significantly delayed, indicating an error of omission. Former studies point to contributing factors to missed nursing care regarding hospital, unit and staff characteristics, perceptions of staffing adequacy as well as nursing teamwork, displayed in the Missed Nursing Care Model. This was a quantitative cross-sectional survey study. The samples were all registered nurses and practical nurses (n = 864) working on 27 medical, surgical and intensive care inpatient units in eight hospitals throughout Iceland. Response rate was 69·3%. Data were collected in March-April 2012 using the combined MISSCARE Survey-Icelandic and the Nursing Teamwork Survey-Icelandic. Descriptive, correlational and regression statistics were used for data analysis. Missed nursing care was significantly related to hospital and unit type, participants' age and role and their perception of adequate staffing and level of teamwork. The multiple regression testing of Model 1 indicated unit type, role, age and staffing adequacy to predict 16% of the variance in missed nursing care. Controlling for unit type, role, age and perceptions of staffing adequacy, the multiple regression testing of Model 2 showed that nursing teamwork predicted an additional 14% of the variance in missed nursing care. The results shed light on the correlates and predictors of missed nursing care in hospitals. This study gives direction as to the development of strategies for decreasing missed nursing care, including ensuring appropriate staffing levels and enhanced teamwork. By identifying contributing factors to missed nursing care, appropriate interventions can be developed and tested. © 2016 John Wiley & Sons Ltd.
Taljaard, Monica; McKenzie, Joanne E; Ramsay, Craig R; Grimshaw, Jeremy M
2014-06-19
An interrupted time series design is a powerful quasi-experimental approach for evaluating effects of interventions introduced at a specific point in time. To utilize the strength of this design, a modification to standard regression analysis, such as segmented regression, is required. In segmented regression analysis, the change in intercept and/or slope from pre- to post-intervention is estimated and used to test causal hypotheses about the intervention. We illustrate segmented regression using data from a previously published study that evaluated the effectiveness of a collaborative intervention to improve quality in pre-hospital ambulance care for acute myocardial infarction (AMI) and stroke. In the original analysis, a standard regression model was used with time as a continuous variable. We contrast the results from this standard regression analysis with those from segmented regression analysis. We discuss the limitations of the former and advantages of the latter, as well as the challenges of using segmented regression in analysing complex quality improvement interventions. Based on the estimated change in intercept and slope from pre- to post-intervention using segmented regression, we found insufficient evidence of a statistically significant effect on quality of care for stroke, although potential clinically important effects for AMI cannot be ruled out. Segmented regression analysis is the recommended approach for analysing data from an interrupted time series study. Several modifications to the basic segmented regression analysis approach are available to deal with challenges arising in the evaluation of complex quality improvement interventions.
Dulin, Patrick L; Gavala, Jhanitra; Stephens, Christine; Kostick, Marylynne; McDonald, Jennifer
2012-01-01
This study sought to understand the relationship between volunteer activity and happiness among a sample of older adult New Zealanders. It specifically sought to determine if ethnicity (Māori vs. non-Māori) and economic living standards (ELS) functioned as moderators of the relationship between volunteering and happiness. Data were garnered from the 2008 administration of the New Zealand Health, Work, and Retirement Longitudinal Study. Correlational and multiple regression procedures were employed to examine study hypotheses. Results from multiple regression analyses showed that the amount of volunteering per week was a unique predictor of the overall level of happiness. Moderation analyses indicated that ethnicity did not function as a moderator of the relationship between volunteering and happiness, but ELS did. Those with low ELS evidenced a stronger relationship between volunteering and happiness than those with high ELS. Results also indicated that Maori and those with low ELS volunteered more frequently than non-Māori and those with high ELS. This study provides evidence that volunteering is related to increased happiness, irrespective of ethnicity. It also provides further evidence that the relationship between volunteering and happiness is moderated by economic resources. Older individuals at the low end of the economic spectrum are likely to benefit more from volunteering than those at the high end.
Relationship between pelvic incidence and osteoarthritis of the hip
Weinberg, D. S.; Bohl, M. S.; Liu, R. W.
2016-01-01
Objectives Sagittal alignment of the lumbosacral spine, and specifically pelvic incidence (PI), has been implicated in the development of spine pathology, but generally ignored with regards to diseases of the hip. We aimed to determine if increased PI is correlated with higher rates of hip osteoarthritis (HOA). The effect of PI on the development of knee osteoarthritis (KOA) was used as a negative control. Methods We studied 400 well-preserved cadaveric skeletons ranging from 50 to 79 years of age at death. Each specimen’s OA of the hip and knee were graded using a previously described method. PI was measured from standardised lateral photographs of reconstructed pelvises. Multiple regression analysis was performed to determine the relationship between age and PI with HOA and KOA. Results The mean age was 60.2 years (standard deviation (sd) 8.1), and the mean PI was 46.7° (sd 10.7°). Multiple regression analysis demonstrated a significant correlation between increased PI and HOA (standardised beta = 0.103, p = 0.017). There was no correlation between PI and KOA (standardised beta = 0.003, p = 0.912). Conclusion Higher PI in the younger individual may contribute to the development of HOA in later life. Cite this article: Dr J. J. Gebhart. Relationship between pelvic incidence and osteoarthritis of the hip. Bone Joint Res 2016;5:66–72. DOI: 10.1302/2046-3758.52.2000552. PMID:26912384
Biomechanical, anthropometric, and psychological determinants of barbell back squat strength.
Vigotsky, Andrew D; Bryanton, Megan A; Nuckols, Greg; Beardsley, Chris; Contreras, Bret; Evans, Jessica; Schoenfeld, Brad J
2018-02-27
Previous investigations of strength have only focused on biomechanical or psychological determinants, while ignoring the potential interplay and relative contributions of these variables. The purpose of this study was to investigate the relative contributions of biomechanical, anthropometric, and psychological variables to the prediction of maximum parallel barbell back squat strength. Twenty-one college-aged participants (male = 14; female = 7; age = 23 ± 3 years) reported to the laboratory for two visits. The first visit consisted of anthropometric, psychometric, and parallel barbell back squat one-repetition maximum (1RM) testing. On the second visit, participants performed isometric dynamometry testing for the knee, hip, and spinal extensors in a sticking point position-specific manner. Multiple linear regression and correlations were used to investigate the combined and individual relationships between biomechanical, anthropometric, and psychological variables and squat 1RM. Multiple regression revealed only one statistically predictive determinant: fat free mass normalized to height (standardized estimate ± SE = 0.6 ± 0.3; t(16) = 2.28; p = 0.037). Correlation coefficients for individual variables and squat 1RM ranged from r = -0.79-0.83, with biomechanical, anthropometric, experiential, and sex predictors showing the strongest relationships, and psychological variables displaying the weakest relationships. These data suggest that back squat strength in a heterogeneous population is multifactorial and more related to physical rather than psychological variables.
Martin, Richard W; Head, Andrew J; René, Jonathan; Swartz, Timothy J; Fiechtner, Justus J; McIntosh, Barbara A; Holmes-Rovner, Margaret
2008-04-01
To explore how rheumatoid arthritis (RA) antirheumatic drug-specific knowledge and numeric literacy, patient trust in physician, and demographic and disease-related factors relate to the confidence of patient decision-making related to disease modifying antirheumatic drugs (DMARD). Data were analyzed from 628 randomly selected patients with RA receiving care in community rheumatology practices, who responded to a multicenter, cross-sectional mail survey. We used multiple regression models to predict patient confidence in DMARD decision-making related to their most recently initiated DMARD. Significant positive correlation was found between confidence in DMARD decision and trust in physician, DMARD-specific knowledge, and disease duration, but not risk-related numeric literacy, sex, or education. Negative correlations were found with disease severity and current bother with DMARD side effects. A multiple linear regression model of confidence in DMARD decision had an overall R = 0.788, R2 = 0.620 (p < 0.001). The 4 dependent variables contributing significantly to the model were female sex, Medicaid insurance status, satisfaction with RA disease control, and trust in physician, with standardized beta = 0.077, -0.089, 0.147, and 0.687, respectively. In this sample of community patients with RA, the patient trust in physician had substantially greater effect on confidence in DMARD decision than DMARD-specific knowledge, disease-related factors, or demographic characteristics.
MMR immunisation status among Dublin paediatric A&E attenders.
Murphy, A W; Power, R; Kinlen, D M; Johnson, Z
1994-01-01
The objectives of this study were to establish the need for opportunistic MMR immunisation among paediatric A&E attenders to the three Dublin paediatric hospitals and to examine the relationship between immunisation status and socioeconomic factors. Design was that of a two month cross sectional study. Survey data was then compared with information on the Eastern Health Board (EHB) records system. Small area and multiple regression analysis of socioeconomic factors derived from participants addresses was also performed. Subjects were 337 children who attended these departments and were aged between fifteen months and five years. For 66% of cases there was a history of MMR immunisation, 30% gave a negative history and 4% did not know. Of those giving a negative history, one third said immunisation had been omitted for no specific reason. EHB records suggested that 39% were immunised, 41% were not and 20% were not on file. Eligibility for the GMS was not associated with failure to immunise. Small area and multiple regression analysis showed little association between immunisation uptake and socioeconomic factors. An opportunistic MMR immunisation policy in A&E Departments would make an important contribution to increasing overall uptake figures. Parental knowledge of the implications of measles and the effectiveness of immunisation needs to be improved. Computerised child health systems must have high data quality standards and access to these systems should be made available in A&E departments.
Extending the Distributed Lag Model framework to handle chemical mixtures.
Bello, Ghalib A; Arora, Manish; Austin, Christine; Horton, Megan K; Wright, Robert O; Gennings, Chris
2017-07-01
Distributed Lag Models (DLMs) are used in environmental health studies to analyze the time-delayed effect of an exposure on an outcome of interest. Given the increasing need for analytical tools for evaluation of the effects of exposure to multi-pollutant mixtures, this study attempts to extend the classical DLM framework to accommodate and evaluate multiple longitudinally observed exposures. We introduce 2 techniques for quantifying the time-varying mixture effect of multiple exposures on an outcome of interest. Lagged WQS, the first technique, is based on Weighted Quantile Sum (WQS) regression, a penalized regression method that estimates mixture effects using a weighted index. We also introduce Tree-based DLMs, a nonparametric alternative for assessment of lagged mixture effects. This technique is based on the Random Forest (RF) algorithm, a nonparametric, tree-based estimation technique that has shown excellent performance in a wide variety of domains. In a simulation study, we tested the feasibility of these techniques and evaluated their performance in comparison to standard methodology. Both methods exhibited relatively robust performance, accurately capturing pre-defined non-linear functional relationships in different simulation settings. Further, we applied these techniques to data on perinatal exposure to environmental metal toxicants, with the goal of evaluating the effects of exposure on neurodevelopment. Our methods identified critical neurodevelopmental windows showing significant sensitivity to metal mixtures. Copyright © 2017 Elsevier Inc. All rights reserved.
Percy, Andrew J; Yang, Juncong; Chambers, Andrew G; Borchers, Christoph H
2016-01-01
Absolute quantitative strategies are emerging as a powerful and preferable means of deriving concentrations in biological samples for systems biology applications. Method development is driven by the need to establish new-and validate current-protein biomarkers of high-to-low abundance for clinical utility. In this chapter, we describe a methodology involving two-dimensional (2D) reversed-phase liquid chromatography (RPLC), operated under alkaline and acidic pH conditions, combined with multiple reaction monitoring (MRM)-mass spectrometry (MS) (also called selected reaction monitoring (SRM)-MS) and a complex mixture of stable isotope-labeled standard (SIS) peptides, to quantify a broad and diverse panel of 253 proteins in human blood plasma. The quantitation range spans 8 orders of magnitude-from 15 mg/mL (for vitamin D-binding protein) to 450 pg/mL (for protein S100-B)-and includes 31 low-abundance proteins (defined as being <10 ng/mL) of potential disease relevance. The method is designed to assess candidates at the discovery and/or verification phases of the biomarker pipeline and can be adapted to examine smaller or alternate panels of proteins for higher sample throughput. Also detailed here is the application of our recently developed software tool-Qualis-SIS-for protein quantitation (via regression analysis of standard curves) and quality assessment of the resulting data. Overall, this chapter provides the blueprint for the replication of this quantitative proteomic method by proteomic scientists of all skill levels.
Brown, Ted; Mapleston, Jennifer; Nairn, Allison; Molloy, Andrew
2013-03-01
Most individuals who have had a stroke present with some degree of residual cognitive and/or perceptual impairment. Occupational therapists often utilize standardized cognitive and perceptual assessments with clients to establish a baseline of skill performance as well as to inform goal setting and intervention planning. Being able to predict the functional independence of individuals who have had a stroke based on cognitive and perceptual impairments would assist with appropriate discharge planning and follow-up resource allocation. The study objective was to investigate the ability of the Developmental Test of Visual Perception - Adolescents and Adults (DTVP-A) and the Neurobehavioural Cognitive Status Exam (Cognistat) to predict the functional performance as measured by the Barthel Index of individuals who have had a stroke. Data was collected using the DTVP-A, Cognistat and the Barthal Index from 32 adults recovering from stroke. Two standard multiple regression models were used to determine predictive variables of the functional independence dependent variable. Both the Cognistat and DTVP-A had a statistically significant ability to predict functional performance (as measured by the Barthel Index) accounting for 64.4% and 27.9% of each regression model, respectively. Two Cognistat subscales (Comprehension [beta = 0.48; p < 0.001)] and Repetition [beta = 0.45; p < 0.004]) and one DTVP-A subscale (Copying [beta = 0.46; p < 0.014]) made statistically significant contributions to the regression models as independent variables. On the basis of the regression model findings, it appears that DTVP-A's Copying and the Cognistat's Comprehension and Repetition subscales are useful in predicting the functional independence (as measured by the Barthel Index) in those individuals who have had a stroke. Given the fundamental importance that cognition and perception has for one's ability to function independently, further investigation is warranted to determine other predictors of functional performance of individuals with a stroke. Copyright © 2012 John Wiley & Sons, Ltd.
ERIC Educational Resources Information Center
Porter, Kristin E.; Reardon, Sean F.; Unlu, Fatih; Bloom, Howard S.; Cimpian, Joseph R.
2017-01-01
A valuable extension of the single-rating regression discontinuity design (RDD) is a multiple-rating RDD (MRRDD). To date, four main methods have been used to estimate average treatment effects at the multiple treatment frontiers of an MRRDD: the "surface" method, the "frontier" method, the "binding-score" method, and…
ERIC Educational Resources Information Center
Woolley, Kristin K.
Many researchers are unfamiliar with suppressor variables and how they operate in multiple regression analyses. This paper describes the role suppressor variables play in a multiple regression model and provides practical examples that explain how they can change research results. A variable that when added as another predictor increases the total…
ERIC Educational Resources Information Center
Martz, Erin
2004-01-01
Because the onset of a spinal cord injury may involve a brush with death and because serious injury and disability can act as a reminder of death, death anxiety was examined as a predictor of posttraumatic stress levels among individuals with disabilities. This cross-sectional study used multiple regression and multivariate multiple regression to…
McClelland, Gary H; Irwin, Julie R; Disatnik, David; Sivan, Liron
2017-02-01
Multicollinearity is irrelevant to the search for moderator variables, contrary to the implications of Iacobucci, Schneider, Popovich, and Bakamitsos (Behavior Research Methods, 2016, this issue). Multicollinearity is like the red herring in a mystery novel that distracts the statistical detective from the pursuit of a true moderator relationship. We show multicollinearity is completely irrelevant for tests of moderator variables. Furthermore, readers of Iacobucci et al. might be confused by a number of their errors. We note those errors, but more positively, we describe a variety of methods researchers might use to test and interpret their moderated multiple regression models, including two-stage testing, mean-centering, spotlighting, orthogonalizing, and floodlighting without regard to putative issues of multicollinearity. We cite a number of recent studies in the psychological literature in which the researchers used these methods appropriately to test, to interpret, and to report their moderated multiple regression models. We conclude with a set of recommendations for the analysis and reporting of moderated multiple regression that should help researchers better understand their models and facilitate generalizations across studies.
Sex Differences in Diabetes Mellitus Mortality Trends in Brazil, 1980-2012.
Malhão, Thainá Alves; Brito, Alexandre Dos Santos; Pinheiro, Rejane Sobrino; Cabral, Cristiane da Silva; Camargo, Thais Medina Coeli Rochel de; Coeli, Claudia Medina
2016-01-01
To investigate the hypothesis that the change from the female predominance of diabetes mellitus to a standard of equality or even male preponderance can already be observed in Brazilian mortality statistics. Data on deaths for which diabetes mellitus was listed as the underlying cause were obtained from the Brazilian Mortality Information System for the years 1980 to 2012. The mortality data were also analyzed according to the multiple causes of death approach from 2001 to 2012. The population data came from the Brazilian Institute of Geography and Statistics. The mortality rates were standardized to the world population. We used a log-linear joinpoint regression to evaluate trends in age-standardized mortality rates (ASMR). From 1980 to 2012, we found a marked increment in the diabetes ASMR among Brazilian men and a less sharp increase in the rate among women, with the latter period (2003-2012) showing a slight decrease among women, though it was not statistically significant. The results of this study suggest that diabetes mellitus in Brazil has changed from a pattern of higher mortality among women compared to men to equality or even male predominance.
Vilella, Karina Duarte; Fraiz, Fabian Calixto; Benelli, Elaine Machado; Assunção, Luciana Reichert da Silva
This study evaluated the effect of oral health literacy (OHL) on the retention of health information in pregnant women. A total of 175 pregnant women were randomly assigned to standard oral (spoken), written and control intervention groups. With the exception of the control group, the interventions investigated the eating habits and oral hygiene among children under 2 years of age. The participants' answers before the interventions (pre-test), 15 min after the interventions (post-test) and 4 weeks after the interventions (follow-up test) were used to estimate the knowledge score (KS). Information acquisition was determined by comparing pre-test and post-test results, while retention of information was based comparing pre-test and follow-up test results. OHL was analysed by BREALD-30. The data were assessed by nonparametric tests and Poisson regression models with robust variance (α = 0.05). By the end of the follow-up period, 162 pregnant women had been assessed. The BREALD-30 mean was 22.3 (SD = 4.80). Regardless of the type of intervention, pregnant women with low OHL had lower knowledge scores in the three assessments. Participants with low OHL showed higher acquisition and retention of information in the standard oral health intervention. Multiple regression models demonstrated that OHL was independently associated with KS, age, socioeconomic status and type of intervention. The results suggest a negative effect of low OHL on retention of information. Only the standard, spoken oral health intervention could address the differences in literacy levels.
Optimal estimation of suspended-sediment concentrations in streams
Holtschlag, D.J.
2001-01-01
Optimal estimators are developed for computation of suspended-sediment concentrations in streams. The estimators are a function of parameters, computed by use of generalized least squares, which simultaneously account for effects of streamflow, seasonal variations in average sediment concentrations, a dynamic error component, and the uncertainty in concentration measurements. The parameters are used in a Kalman filter for on-line estimation and an associated smoother for off-line estimation of suspended-sediment concentrations. The accuracies of the optimal estimators are compared with alternative time-averaging interpolators and flow-weighting regression estimators by use of long-term daily-mean suspended-sediment concentration and streamflow data from 10 sites within the United States. For sampling intervals from 3 to 48 days, the standard errors of on-line and off-line optimal estimators ranged from 52.7 to 107%, and from 39.5 to 93.0%, respectively. The corresponding standard errors of linear and cubic-spline interpolators ranged from 48.8 to 158%, and from 50.6 to 176%, respectively. The standard errors of simple and multiple regression estimators, which did not vary with the sampling interval, were 124 and 105%, respectively. Thus, the optimal off-line estimator (Kalman smoother) had the lowest error characteristics of those evaluated. Because suspended-sediment concentrations are typically measured at less than 3-day intervals, use of optimal estimators will likely result in significant improvements in the accuracy of continuous suspended-sediment concentration records. Additional research on the integration of direct suspended-sediment concentration measurements and optimal estimators applied at hourly or shorter intervals is needed.
Junoy Montolio, Francisco G; Wesselink, Christiaan; Gordijn, Marijke; Jansonius, Nomdo M
2012-10-09
To determine the influence of several factors on standard automated perimetry test results in glaucoma. Longitudinal Humphrey field analyzer 30-2 Swedish interactive threshold algorithm data from 160 eyes of 160 glaucoma patients were used. The influence of technician experience, time of day, and season on the mean deviation (MD) was determined by performing linear regression analysis of MD against time on a series of visual fields and subsequently performing a multiple linear regression analysis with the MD residuals as dependent variable and the factors mentioned above as independent variables. Analyses were performed with and without adjustment for the test reliability (fixation losses and false-positive and false-negative answers) and with and without stratification according to disease stage (baseline MD). Mean follow-up was 9.4 years, with on average 10.8 tests per patient. Technician experience, time of day, and season were associated with the MD. Approximately 0.2 dB lower MD values were found for inexperienced technicians (P < 0.001), tests performed after lunch (P < 0.001), and tests performed in the summer or autumn (P < 0.001). The effects of time of day and season appeared to depend on disease stage. Independent of these effects, the percentage of false-positive answers strongly influenced the MD with a 1 dB increase in MD per 10% increase in false-positive answers. Technician experience, time of day, season, and the percentage of false-positive answers have a significant influence on the MD of standard automated perimetry.
Stingone, Jeanette A.; McVeigh, Katharine H.; Claudio, Luz
2016-01-01
The objective of this research was to determine if prenatal exposure to two common urban air pollutants, diesel and perchloroethylene, affects children's 3rd grade standardized test scores in mathematics and English language arts (ELA). Exposure estimates consisted of annual average ambient concentrations of diesel particulate matter and perchloroethylene obtained from the Environmental Protection Agency's 1996 National Air Toxics Assessment for the residential census tract at birth. Outcome data consisted of linked birth and educational records for 201,559 singleton, non-anomalous children born between 1994-1998 who attended New York City public schools. Quantile regression models were used to estimate the effects of these exposures on multiple points within the continuous distribution of standardized test scores. Modified Poisson regression models were used to calculate risk ratios (RR) and 95% confidence intervals (CI) of failing to meet curricula standards, an indicator derived from test scores. Models were adjusted for a number of maternal, neighborhood and childhood factors. Results showed that math scores were approximately 6% of a standard deviation lower for children exposed to the highest levels of both pollutants as compared to children with low levels of both pollutants. Children exposed to high levels of both pollutants also had the largest risk of failing to meet math test standards when compared to children with low levels of exposure to the pollutants (RR 1.10 95%CI 1.07,1.12 RR high perchloroethylene only 1.03 95%CI 1.00,1.06; RR high diesel PM only 1.02 95%CI 0.99,1.06). There was no association observed between exposure to only one of the pollutants and failing to meet ELA standards. This study provides preliminary evidence of associations between prenatal exposure to urban air pollutants and lower academic outcomes. Additionally, these findings suggest that individual pollutants may additively impact health and point to the need to study the collective effects of air pollutant mixtures. Key Words: air toxics, academic outcomes, urban health, tetrachloroethylene, air pollutant mixtures PMID:27058443
Stingone, Jeanette A; McVeigh, Katharine H; Claudio, Luz
2016-07-01
The objective of this research was to determine if prenatal exposure to two common urban air pollutants, diesel and perchloroethylene, affects children's 3rd grade standardized test scores in mathematics and English language arts (ELA). Exposure estimates consisted of annual average ambient concentrations of diesel particulate matter and perchloroethylene obtained from the Environmental Protection Agency's 1996 National Air Toxics Assessment for the residential census tract at birth. Outcome data consisted of linked birth and educational records for 201,559 singleton, non-anomalous children born between 1994 and 1998 who attended New York City public schools. Quantile regression models were used to estimate the effects of these exposures on multiple points within the continuous distribution of standardized test scores. Modified Poisson regression models were used to calculate risk ratios (RR) and 95% confidence intervals (CI) of failing to meet curricula standards, an indicator derived from test scores. Models were adjusted for a number of maternal, neighborhood and childhood factors. Results showed that math scores were approximately 6% of a standard deviation lower for children exposed to the highest levels of both pollutants as compared to children with low levels of both pollutants. Children exposed to high levels of both pollutants also had the largest risk of failing to meet math test standards when compared to children with low levels of exposure to the pollutants (RR 1.10 95%CI 1.07,1.12 RR high perchloroethylene only 1.03 95%CI 1.00,1.06; RR high diesel PM only 1.02 95%CI 0.99,1.06). There was no association observed between exposure to the pollutants and failing to meet ELA standards. This study provides preliminary evidence of associations between prenatal exposure to urban air pollutants and lower academic outcomes. Additionally, these findings suggest that individual pollutants may additively impact health and point to the need to study the collective effects of air pollutant mixtures. air toxics, academic outcomes, urban health, tetrachloroethylene, air pollutant mixtures. Copyright © 2016 Elsevier Inc. All rights reserved.
Brown, C. Erwin
1993-01-01
Correlation analysis in conjunction with principal-component and multiple-regression analyses were applied to laboratory chemical and petrographic data to assess the usefulness of these techniques in evaluating selected physical and hydraulic properties of carbonate-rock aquifers in central Pennsylvania. Correlation and principal-component analyses were used to establish relations and associations among variables, to determine dimensions of property variation of samples, and to filter the variables containing similar information. Principal-component and correlation analyses showed that porosity is related to other measured variables and that permeability is most related to porosity and grain size. Four principal components are found to be significant in explaining the variance of data. Stepwise multiple-regression analysis was used to see how well the measured variables could predict porosity and (or) permeability for this suite of rocks. The variation in permeability and porosity is not totally predicted by the other variables, but the regression is significant at the 5% significance level. ?? 1993.
Liu, Qi; Wu, Youcong; Yuan, Youhua; Bai, Li; Niu, Kun
2011-12-01
To research the relationship between the virulence factors of Saccharomyces albicans (S. albicans) and the random amplified polymorphic DNA (RAPD) bands of them, and establish the regression model by multiple regression analysis. Extracellular phospholipase, secreted proteinase, ability to generate germ tubes and adhere to oral mucosal cells of 92 strains of S. albicans were measured in vitro; RAPD-polymerase chain reaction (RAPD-PCR) was used to get their bands. Multiple regression for virulence factors of S. albicans and RAPD-PCR bands was established. The extracellular phospholipase activity was associated with 4 RAPD bands: 350, 450, 650 and 1 300 bp (P < 0.05); secreted proteinase activity of S. albicans was associated with 2 bands: 350 and 1 200 bp (P < 0.05); the ability of germ tube produce was associated with 2 bands: 400 and 550 bp (P < 0.05). Some RAPD bands will reflect the virulence factors of S. albicans indirectly. These bands would contain some important messages for regulation of S. albicans virulence factors.
Simultaneous multiple non-crossing quantile regression estimation using kernel constraints
Liu, Yufeng; Wu, Yichao
2011-01-01
Quantile regression (QR) is a very useful statistical tool for learning the relationship between the response variable and covariates. For many applications, one often needs to estimate multiple conditional quantile functions of the response variable given covariates. Although one can estimate multiple quantiles separately, it is of great interest to estimate them simultaneously. One advantage of simultaneous estimation is that multiple quantiles can share strength among them to gain better estimation accuracy than individually estimated quantile functions. Another important advantage of joint estimation is the feasibility of incorporating simultaneous non-crossing constraints of QR functions. In this paper, we propose a new kernel-based multiple QR estimation technique, namely simultaneous non-crossing quantile regression (SNQR). We use kernel representations for QR functions and apply constraints on the kernel coefficients to avoid crossing. Both unregularised and regularised SNQR techniques are considered. Asymptotic properties such as asymptotic normality of linear SNQR and oracle properties of the sparse linear SNQR are developed. Our numerical results demonstrate the competitive performance of our SNQR over the original individual QR estimation. PMID:22190842
Bounthavong, Mark; Watanabe, Jonathan H; Sullivan, Kevin M
2015-04-01
The complete capture of all values for each variable of interest in pharmacy research studies remains aspirational. The absence of these possibly influential values is a common problem for pharmacist investigators. Failure to account for missing data may translate to biased study findings and conclusions. Our goal in this analysis was to apply validated statistical methods for missing data to a previously analyzed data set and compare results when missing data methods were implemented versus standard analytics that ignore missing data effects. Using data from a retrospective cohort study, the statistical method of multiple imputation was used to provide regression-based estimates of the missing values to improve available data usable for study outcomes measurement. These findings were then contrasted with a complete-case analysis that restricted estimation to subjects in the cohort that had no missing values. Odds ratios were compared to assess differences in findings of the analyses. A nonadjusted regression analysis ("crude analysis") was also performed as a reference for potential bias. Veterans Integrated Systems Network that includes VA facilities in the Southern California and Nevada regions. New statin users between November 30, 2006, and December 2, 2007, with a diagnosis of dyslipidemia. We compared the odds ratios (ORs) and 95% confidence intervals (CIs) for the crude, complete-case, and multiple imputation analyses for the end points of a 25% or greater reduction in atherogenic lipids. Data were missing for 21.5% of identified patients (1665 subjects of 7739). Regression model results were similar for the crude, complete-case, and multiple imputation analyses with overlap of 95% confidence limits at each end point. The crude, complete-case, and multiple imputation ORs (95% CIs) for a 25% or greater reduction in low-density lipoprotein cholesterol were 3.5 (95% CI 3.1-3.9), 4.3 (95% CI 3.8-4.9), and 4.1 (95% CI 3.7-4.6), respectively. The crude, complete-case, and multiple imputation ORs (95% CIs) for a 25% or greater reduction in non-high-density lipoprotein cholesterol were 3.5 (95% CI 3.1-3.9), 4.5 (95% CI 4.0-5.2), and 4.4 (95% CI 3.9-4.9), respectively. The crude, complete-case, and multiple imputation ORs (95% CIs) for 25% or greater reduction in TGs were 3.1 (95% CI 2.8-3.6), 4.0 (95% CI 3.5-4.6), and 4.1 (95% CI 3.6-4.6), respectively. The use of the multiple imputation method to account for missing data did not alter conclusions based on a complete-case analysis. Given the frequency of missing data in research using electronic health records and pharmacy claims data, multiple imputation may play an important role in the validation of study findings. © 2015 Pharmacotherapy Publications, Inc.
Malau-Aduli, Bunmi Sherifat; Teague, Peta-Ann; D'Souza, Karen; Heal, Clare; Turner, Richard; Garne, David L; van der Vleuten, Cees
2017-12-01
A key issue underpinning the usefulness of the OSCE assessment to medical education is standard setting, but the majority of standard-setting methods remain challenging for performance assessment because they produce varying passing marks. Several studies have compared standard-setting methods; however, most of these studies are limited by their experimental scope, or use data on examinee performance at a single OSCE station or from a single medical school. This collaborative study between 10 Australian medical schools investigated the effect of standard-setting methods on OSCE cut scores and failure rates. This research used 5256 examinee scores from seven shared OSCE stations to calculate cut scores and failure rates using two different compromise standard-setting methods, namely the Borderline Regression and Cohen's methods. The results of this study indicate that Cohen's method yields similar outcomes to the Borderline Regression method, particularly for large examinee cohort sizes. However, with lower examinee numbers on a station, the Borderline Regression method resulted in higher cut scores and larger difference margins in the failure rates. Cohen's method yields similar outcomes as the Borderline Regression method and its application for benchmarking purposes and in resource-limited settings is justifiable, particularly with large examinee numbers.
Determinants of ambulance response time: A study in Sabah, Malaysia
NASA Astrophysics Data System (ADS)
Chin, Su Na; Cheah, Phee Kheng; Arifin, Muhamad Yaakub; Wong, Boh Leng; Omar, Zaturrawiah; Yassin, Fouziah Md; Gabda, Darmesah
2017-04-01
Ambulance response time (ART) is one of the standard key performance indicators (KPI) in measuring the emergency medical services (EMS) delivery performances. When the mean time of ART of EMS system reaches the KPI target, it shows that the EMS system performs well. This paper considers the determinants of ART, using data sampled from 967 ambulance runs in a government hospital in Sabah. Multiple regression analysis with backward elimination was proposed for the identification of significant factors. Amongst the underlying factors, travel distance, age of patients, type of treatment and peak hours were identified to be significantly affecting ART. Identifying factors that influence ART helps the development of strategic improvement planning for reducing the ART.
Cost effectiveness of the stream-gaging program in northeastern California
Hoffard, S.H.; Pearce, V.F.; Tasker, Gary D.; Doyle, W.H.
1984-01-01
Results are documented of a study of the cost effectiveness of the stream-gaging program in northeastern California. Data uses and funding sources were identified for the 127 continuous stream gages currently being operated in the study area. One stream gage was found to have insufficient data use to warrant cooperative Federal funding. Flow-routing and multiple-regression models were used to simulate flows at selected gaging stations. The models may be sufficiently accurate to replace two of the stations. The average standard error of estimate of streamflow records is 12.9 percent. This overall level of accuracy could be reduced to 12.0 percent using computer-recommended service routes and visit frequencies. (USGS)
Mambet Doue, Constance; Roussiau, Nicolas
2016-12-01
This research investigates the indirect effects of religiosity (practice and belief) on therapeutic compliance in 81 HIV-positive patients who are migrants from sub-Saharan Africa (23 men and 58 women). Using analyses of mediation and standard multiple regression, including a resampling procedure by bootstrapping, the role of these mediators (magical-religious beliefs and nonuse of toxic substances) was tested. The results show that, through magical-religious beliefs, religiosity has a negative indirect effect, while with the nonuse of toxic substances, religious practice has a positive indirect effect. Beyond religiosity, the role of mediators is highlighted in the interaction with therapeutic compliance.
DiPrete, Thomas A.; Burik, Casper A. P.; Koellinger, Philipp D.
2018-01-01
Identifying causal effects in nonexperimental data is an enduring challenge. One proposed solution that recently gained popularity is the idea to use genes as instrumental variables [i.e., Mendelian randomization (MR)]. However, this approach is problematic because many variables of interest are genetically correlated, which implies the possibility that many genes could affect both the exposure and the outcome directly or via unobserved confounding factors. Thus, pleiotropic effects of genes are themselves a source of bias in nonexperimental data that would also undermine the ability of MR to correct for endogeneity bias from nongenetic sources. Here, we propose an alternative approach, genetic instrumental variable (GIV) regression, that provides estimates for the effect of an exposure on an outcome in the presence of pleiotropy. As a valuable byproduct, GIV regression also provides accurate estimates of the chip heritability of the outcome variable. GIV regression uses polygenic scores (PGSs) for the outcome of interest which can be constructed from genome-wide association study (GWAS) results. By splitting the GWAS sample for the outcome into nonoverlapping subsamples, we obtain multiple indicators of the outcome PGSs that can be used as instruments for each other and, in combination with other methods such as sibling fixed effects, can address endogeneity bias from both pleiotropy and the environment. In two empirical applications, we demonstrate that our approach produces reasonable estimates of the chip heritability of educational attainment (EA) and show that standard regression and MR provide upwardly biased estimates of the effect of body height on EA. PMID:29686100
DiPrete, Thomas A; Burik, Casper A P; Koellinger, Philipp D
2018-05-29
Identifying causal effects in nonexperimental data is an enduring challenge. One proposed solution that recently gained popularity is the idea to use genes as instrumental variables [i.e., Mendelian randomization (MR)]. However, this approach is problematic because many variables of interest are genetically correlated, which implies the possibility that many genes could affect both the exposure and the outcome directly or via unobserved confounding factors. Thus, pleiotropic effects of genes are themselves a source of bias in nonexperimental data that would also undermine the ability of MR to correct for endogeneity bias from nongenetic sources. Here, we propose an alternative approach, genetic instrumental variable (GIV) regression, that provides estimates for the effect of an exposure on an outcome in the presence of pleiotropy. As a valuable byproduct, GIV regression also provides accurate estimates of the chip heritability of the outcome variable. GIV regression uses polygenic scores (PGSs) for the outcome of interest which can be constructed from genome-wide association study (GWAS) results. By splitting the GWAS sample for the outcome into nonoverlapping subsamples, we obtain multiple indicators of the outcome PGSs that can be used as instruments for each other and, in combination with other methods such as sibling fixed effects, can address endogeneity bias from both pleiotropy and the environment. In two empirical applications, we demonstrate that our approach produces reasonable estimates of the chip heritability of educational attainment (EA) and show that standard regression and MR provide upwardly biased estimates of the effect of body height on EA. Copyright © 2018 the Author(s). Published by PNAS.
Futia, Gregory L; Schlaepfer, Isabel R; Qamar, Lubna; Behbakht, Kian; Gibson, Emily A
2017-07-01
Detection of circulating tumor cells (CTCs) in a blood sample is limited by the sensitivity and specificity of the biomarker panel used to identify CTCs over other blood cells. In this work, we present Bayesian theory that shows how test sensitivity and specificity set the rarity of cell that a test can detect. We perform our calculation of sensitivity and specificity on our image cytometry biomarker panel by testing on pure disease positive (D + ) populations (MCF7 cells) and pure disease negative populations (D - ) (leukocytes). In this system, we performed multi-channel confocal fluorescence microscopy to image biomarkers of DNA, lipids, CD45, and Cytokeratin. Using custom software, we segmented our confocal images into regions of interest consisting of individual cells and computed the image metrics of total signal, second spatial moment, spatial frequency second moment, and the product of the spatial-spatial frequency moments. We present our analysis of these 16 features. The best performing of the 16 features produced an average separation of three standard deviations between D + and D - and an average detectable rarity of ∼1 in 200. We performed multivariable regression and feature selection to combine multiple features for increased performance and showed an average separation of seven standard deviations between the D + and D - populations making our average detectable rarity of ∼1 in 480. Histograms and receiver operating characteristics (ROC) curves for these features and regressions are presented. We conclude that simple regression analysis holds promise to further improve the separation of rare cells in cytometry applications. © 2017 International Society for Advancement of Cytometry. © 2017 International Society for Advancement of Cytometry.
Gómez Campos, Rossana; Pacheco Carrillo, Jaime; Almonacid Fierro, Alejandro; Urra Albornoz, Camilo; Cossío-Bolaños, Marco
2018-03-01
(i) To propose regression equations based on anthropometric measures to estimate fat mass (FM) using dual energy X-ray absorptiometry (DXA) as reference method, and (ii)to establish population reference standards for equation-derived FM. A cross-sectional study on 6,713 university students (3,354 males and 3,359 females) from Chile aged 17.0 to 27.0years. Anthropometric measures (weight, height, waist circumference) were taken in all participants. Whole body DXA was performed in 683 subjects. A total of 478 subjects were selected to develop regression equations, and 205 for their cross-validation. Data from 6,030 participants were used to develop reference standards for FM. Equations were generated using stepwise multiple regression analysis. Percentiles were developed using the LMS method. Equations for men were: (i) FM=-35,997.486 +232.285 *Weight +432.216 *CC (R 2 =0.73, SEE=4.1); (ii)FM=-37,671.303 +309.539 *Weight +66,028.109 *ICE (R2=0.76, SEE=3.8), while equations for women were: (iii)FM=-13,216.917 +461,302 *Weight+91.898 *CC (R 2 =0.70, SEE=4.6), and (iv) FM=-14,144.220 +464.061 *Weight +16,189.297 *ICE (R 2 =0.70, SEE=4.6). Percentiles proposed included p10, p50, p85, and p95. The developed equations provide valid and accurate estimation of FM in both sexes. The values obtained using the equations may be analyzed from percentiles that allow for categorizing body fat levels by age and sex. Copyright © 2017 SEEN y SED. Publicado por Elsevier España, S.L.U. All rights reserved.
Monitoring heavy metal Cr in soil based on hyperspectral data using regression analysis
NASA Astrophysics Data System (ADS)
Zhang, Ningyu; Xu, Fuyun; Zhuang, Shidong; He, Changwei
2016-10-01
Heavy metal pollution in soils is one of the most critical problems in the global ecology and environment safety nowadays. Hyperspectral remote sensing and its application is capable of high speed, low cost, less risk and less damage, and provides a good method for detecting heavy metals in soil. This paper proposed a new idea of applying regression analysis of stepwise multiple regression between the spectral data and monitoring the amount of heavy metal Cr by sample points in soil for environmental protection. In the measurement, a FieldSpec HandHeld spectroradiometer is used to collect reflectance spectra of sample points over the wavelength range of 325-1075 nm. Then the spectral data measured by the spectroradiometer is preprocessed to reduced the influence of the external factors, and the preprocessed methods include first-order differential equation, second-order differential equation and continuum removal method. The algorithms of stepwise multiple regression are established accordingly, and the accuracy of each equation is tested. The results showed that the accuracy of first-order differential equation works best, which makes it feasible to predict the content of heavy metal Cr by using stepwise multiple regression.
Multicollinearity and Regression Analysis
NASA Astrophysics Data System (ADS)
Daoud, Jamal I.
2017-12-01
In regression analysis it is obvious to have a correlation between the response and predictor(s), but having correlation among predictors is something undesired. The number of predictors included in the regression model depends on many factors among which, historical data, experience, etc. At the end selection of most important predictors is something objective due to the researcher. Multicollinearity is a phenomena when two or more predictors are correlated, if this happens, the standard error of the coefficients will increase [8]. Increased standard errors means that the coefficients for some or all independent variables may be found to be significantly different from In other words, by overinflating the standard errors, multicollinearity makes some variables statistically insignificant when they should be significant. In this paper we focus on the multicollinearity, reasons and consequences on the reliability of the regression model.
Near Infrared Spectrometry of Clinically Significant Fatty Acids Using Multicomponent Regression
NASA Astrophysics Data System (ADS)
Kalinin, A. V.; Krasheninnikov, V. N.; Sviridov, A. P.; Titov, V. N.
2016-11-01
We have developed methods for determining the content of clinically important fatty acids (FAs), primarily saturated palmitic acid, monounsaturated oleic acid, and the sum of polyenoic fatty acids (eicosapentaenoic + docosahexaenoic), in oily media (food products and supplements, fish oils) using different types of near infrared (NIR) spectrometers: Fourier-transform, linear photodiode array, and Raman. Based on a calibration method (regression) by means of projections to latent structures, using standard samples of oil and fat mixtures, we have confirmed the feasibility of reliable and selective quantitative analysis of the above-indicated fatty acids. As a result of comparing the calibration models for Fourier-transform spectrometers in different parts of the NIR range (based on different overtones and combinations of fatty acid absorption), we have provided a basis for selection of the spectral range for a portable linear InGaAs-photodiode array spectrometer. In testing the calibrations of a linear InGaAs-photodiode array spectrometer which is a prototype for a portable instrument, for palmitic and oleic acids and also the sum of the polyenoic fatty acids we have achieved a multiple correlation coefficient of 0.89, 0.85, and 0.96 and a standard error of 0.53%, 1.43%, and 0.39% respectively. We have confirmed the feasibility of using Raman spectra to determine the content of the above-indicated fatty acids in media where water is present.
Forecasting USAF JP-8 Fuel Needs
2009-03-01
versus complex ones. When we consider long -term forecasts, 5-years in this case, multiple regression outperforms ANN modeling within the specified...with more simple and easy-to-implement methods, versus complex ones. When we consider long -term 5-year forecasts, our multiple regression model...effort. The insight and experience was certainly appreciated. Special thanks to my Turkish peers for their continuous support and help during this long
ERIC Educational Resources Information Center
Le, Huy; Marcus, Justin
2012-01-01
This study used Monte Carlo simulation to examine the properties of the overall odds ratio (OOR), which was recently introduced as an index for overall effect size in multiple logistic regression. It was found that the OOR was relatively independent of study base rate and performed better than most commonly used R-square analogs in indexing model…
ERIC Educational Resources Information Center
Pecorella, Patricia A.; Bowers, David G.
Multiple regression in a double cross-validated design was used to predict two performance measures (total variable expense and absence rate) by multi-month period in five industrial firms. The regressions do cross-validate, and produce multiple coefficients which display both concurrent and predictive effects, peaking 18 months to two years…
USDA-ARS?s Scientific Manuscript database
A technique of using multiple calibration sets in partial least squares regression (PLS) was proposed to improve the quantitative determination of ammonia from open-path Fourier transform infrared spectra. The spectra were measured near animal farms, and the path-integrated concentration of ammonia...
Exercise as an adjunct to nicotine gum in treating tobacco dependence among women.
Kinnunen, Taru; Leeman, Robert F; Korhonen, Tellervo; Quiles, Zandra N; Terwal, Donna M; Garvey, Arthur J; Hartley, Howard L
2008-04-01
This was the first randomized, controlled smoking cessation trial assessing the efficacy of an exercise intervention as an adjunct to nicotine gum therapy in comparison with both equal contact control and standard care control conditions. Sedentary female smokers aged 18-55 years were provided with nicotine gum treatment along with brief behavioral counseling and were randomized into one of these three behavioral adjunct conditions. In the "intent-to-treat" sample (N = 182), at end of treatment and at 1-year follow-up, there were clear, but nonsignificant, trends in univariate analyses in which the exercise and equal contact control conditions both had higher rates of abstinence than the standard care control. However, when adjusting for other predictors of relapse in a multiple logistic regression, both exercise and equal contact control showed an advantage over standard care control in avoiding early relapse (i.e., after 1 week). In a multivariate survival model adjusting for other predictors, the equal contact condition had a significantly lower likelihood of relapse compared with the standard care condition and there was a near significant trend in which exercise offered an advantage over standard care as well. While these findings suggest a slightly improved likelihood of abstinence with exercise compared with standard care, exercise did not differ from equal contact control in its efficacy. Potential explanations for these equivalent levels of efficacy and implications for the findings are discussed.
Robust geographically weighted regression of modeling the Air Polluter Standard Index (APSI)
NASA Astrophysics Data System (ADS)
Warsito, Budi; Yasin, Hasbi; Ispriyanti, Dwi; Hoyyi, Abdul
2018-05-01
The Geographically Weighted Regression (GWR) model has been widely applied to many practical fields for exploring spatial heterogenity of a regression model. However, this method is inherently not robust to outliers. Outliers commonly exist in data sets and may lead to a distorted estimate of the underlying regression model. One of solution to handle the outliers in the regression model is to use the robust models. So this model was called Robust Geographically Weighted Regression (RGWR). This research aims to aid the government in the policy making process related to air pollution mitigation by developing a standard index model for air polluter (Air Polluter Standard Index - APSI) based on the RGWR approach. In this research, we also consider seven variables that are directly related to the air pollution level, which are the traffic velocity, the population density, the business center aspect, the air humidity, the wind velocity, the air temperature, and the area size of the urban forest. The best model is determined by the smallest AIC value. There are significance differences between Regression and RGWR in this case, but Basic GWR using the Gaussian kernel is the best model to modeling APSI because it has smallest AIC.
Kesselmeier, Miriam; Lorenzo Bermejo, Justo
2017-11-01
Logistic regression is the most common technique used for genetic case-control association studies. A disadvantage of standard maximum likelihood estimators of the genotype relative risk (GRR) is their strong dependence on outlier subjects, for example, patients diagnosed at unusually young age. Robust methods are available to constrain outlier influence, but they are scarcely used in genetic studies. This article provides a non-intimidating introduction to robust logistic regression, and investigates its benefits and limitations in genetic association studies. We applied the bounded Huber and extended the R package 'robustbase' with the re-descending Hampel functions to down-weight outlier influence. Computer simulations were carried out to assess the type I error rate, mean squared error (MSE) and statistical power according to major characteristics of the genetic study and investigated markers. Simulations were complemented with the analysis of real data. Both standard and robust estimation controlled type I error rates. Standard logistic regression showed the highest power but standard GRR estimates also showed the largest bias and MSE, in particular for associated rare and recessive variants. For illustration, a recessive variant with a true GRR=6.32 and a minor allele frequency=0.05 investigated in a 1000 case/1000 control study by standard logistic regression resulted in power=0.60 and MSE=16.5. The corresponding figures for Huber-based estimation were power=0.51 and MSE=0.53. Overall, Hampel- and Huber-based GRR estimates did not differ much. Robust logistic regression may represent a valuable alternative to standard maximum likelihood estimation when the focus lies on risk prediction rather than identification of susceptibility variants. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Nakamura, Ryo; Nakano, Kumiko; Tamura, Hiroyasu; Mizunuma, Masaki; Fushiki, Tohru; Hirata, Dai
2017-08-01
Many factors contribute to palatability. In order to evaluate the palatability of Japanese alcohol sake paired with certain dishes by integrating multiple factors, here we applied an evaluation method previously reported for palatability of cheese by multiple regression analysis based on 3 subdomain factors (rewarding, cultural, and informational). We asked 94 Japanese participants/subjects to evaluate the palatability of sake (1st evaluation/E1 for the first cup, 2nd/E2 and 3rd/E3 for the palatability with aftertaste/afterglow of certain dishes) and to respond to a questionnaire related to 3 subdomains. In E1, 3 factors were extracted by a factor analysis, and the subsequent multiple regression analyses indicated that the palatability of sake was interpreted by mainly the rewarding. Further, the results of attribution-dissections in E1 indicated that 2 factors (rewarding and informational) contributed to the palatability. Finally, our results indicated that the palatability of sake was influenced by the dish eaten just before drinking.
Gieswein, Alexander; Hering, Daniel; Feld, Christian K
2017-09-01
Freshwater ecosystems are impacted by a range of stressors arising from diverse human-caused land and water uses. Identifying the relative importance of single stressors and understanding how multiple stressors interact and jointly affect biology is crucial for River Basin Management. This study addressed multiple human-induced stressors and their effects on the aquatic flora and fauna based on data from standard WFD monitoring schemes. For altogether 1095 sites within a mountainous catchment, we used 12 stressor variables covering three different stressor groups: riparian land use, physical habitat quality and nutrient enrichment. Twenty-one biological metrics calculated from taxa lists of three organism groups (fish, benthic invertebrates and aquatic macrophytes) served as response variables. Stressor and response variables were subjected to Boosted Regression Tree (BRT) analysis to identify stressor hierarchy and stressor interactions and subsequently to Generalised Linear Regression Modelling (GLM) to quantify the stressors standardised effect size. Our results show that riverine habitat degradation was the dominant stressor group for the river fauna, notably the bed physical habitat structure. Overall, the explained variation in benthic invertebrate metrics was higher than it was in fish and macrophyte metrics. In particular, general integrative (aggregate) metrics such as % Ephemeroptera, Plecoptera and Trichoptera (EPT) taxa performed better than ecological traits (e.g. % feeding types). Overall, additive stressor effects dominated, while significant and meaningful stressor interactions were generally rare and weak. We concluded that given the type of stressor and ecological response variables addressed in this study, river basin managers do not need to bother much about complex stressor interactions, but can focus on the prevailing stressors according to the hierarchy identified. Copyright © 2017 Elsevier B.V. All rights reserved.
Zheng, Jie; Rodriguez, Santiago; Laurin, Charles; Baird, Denis; Trela-Larsen, Lea; Erzurumluoglu, Mesut A; Zheng, Yi; White, Jon; Giambartolomei, Claudia; Zabaneh, Delilah; Morris, Richard; Kumari, Meena; Casas, Juan P; Hingorani, Aroon D; Evans, David M; Gaunt, Tom R; Day, Ian N M
2017-01-01
Fine mapping is a widely used approach for identifying the causal variant(s) at disease-associated loci. Standard methods (e.g. multiple regression) require individual level genotypes. Recent fine mapping methods using summary-level data require the pairwise correlation coefficients ([Formula: see text]) of the variants. However, haplotypes rather than pairwise [Formula: see text], are the true biological representation of linkage disequilibrium (LD) among multiple loci. In this article, we present an empirical iterative method, HAPlotype Regional Association analysis Program (HAPRAP), that enables fine mapping using summary statistics and haplotype information from an individual-level reference panel. Simulations with individual-level genotypes show that the results of HAPRAP and multiple regression are highly consistent. In simulation with summary-level data, we demonstrate that HAPRAP is less sensitive to poor LD estimates. In a parametric simulation using Genetic Investigation of ANthropometric Traits height data, HAPRAP performs well with a small training sample size (N < 2000) while other methods become suboptimal. Moreover, HAPRAP's performance is not affected substantially by single nucleotide polymorphisms (SNPs) with low minor allele frequencies. We applied the method to existing quantitative trait and binary outcome meta-analyses (human height, QTc interval and gallbladder disease); all previous reported association signals were replicated and two additional variants were independently associated with human height. Due to the growing availability of summary level data, the value of HAPRAP is likely to increase markedly for future analyses (e.g. functional prediction and identification of instruments for Mendelian randomization). The HAPRAP package and documentation are available at http://apps.biocompute.org.uk/haprap/ CONTACT: : jie.zheng@bristol.ac.uk or tom.gaunt@bristol.ac.ukSupplementary information: Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press.
[Multiple roles and health among Korean women].
Cho, Su-Jin; Jang, Soong-Nang; Cho, Sung-Il
2008-09-01
Most studies about multiple roles and women's health suggested that combining with paid job, being married and having children was more likely to improve health status than in case of single or traditional roles. We investigated whether there was better health outcome in multiple roles among Korean women coinciding with previous studies of other nations. Data were from the 2005 Korea National Health & Nutritional Examination Survey, a subsample of women aged 25-59 years (N=2,943). Health status was assessed for self-rated poor health, perceived stress and depression, respectively based on one questionnaire item. The age-standardized prevalence of all health outcomes were calculated by role categories and socioeconomic status. Multiple logistic regression was used to assess the association of self rated health, perceived stress, and depression with multiple roles adjusted for age, education, household income, number of children and age of children. Having multiple roles with working role was not associated with better health and psychological wellbeing. Compared to those with traditional roles, employed women more frequently experienced perceived stress, with marital and/or parental roles. Non-working single mothers suffered depression more often than women with traditional roles or other role occupancy. Socioeconomic status indicators were potent independent correlates of self-rated health and perceived stress. Employment of women with other roles did not confer additional health benefit to traditional family responsibility. Juggling of work and family responsibility appeared more stressful than traditional unemployed parental and marital role in Korean women.
Correlation and simple linear regression.
Eberly, Lynn E
2007-01-01
This chapter highlights important steps in using correlation and simple linear regression to address scientific questions about the association of two continuous variables with each other. These steps include estimation and inference, assessing model fit, the connection between regression and ANOVA, and study design. Examples in microbiology are used throughout. This chapter provides a framework that is helpful in understanding more complex statistical techniques, such as multiple linear regression, linear mixed effects models, logistic regression, and proportional hazards regression.
London Measure of Unplanned Pregnancy: guidance for its use as an outcome measure
Hall, Jennifer A; Barrett, Geraldine; Copas, Andrew; Stephenson, Judith
2017-01-01
Background The London Measure of Unplanned Pregnancy (LMUP) is a psychometrically validated measure of the degree of intention of a current or recent pregnancy. The LMUP is increasingly being used worldwide, and can be used to evaluate family planning or preconception care programs. However, beyond recommending the use of the full LMUP scale, there is no published guidance on how to use the LMUP as an outcome measure. Ordinal logistic regression has been recommended informally, but studies published to date have all used binary logistic regression and dichotomized the scale at different cut points. There is thus a need for evidence-based guidance to provide a standardized methodology for multivariate analysis and to enable comparison of results. This paper makes recommendations for the regression method for analysis of the LMUP as an outcome measure. Materials and methods Data collected from 4,244 pregnant women in Malawi were used to compare five regression methods: linear, logistic with two cut points, and ordinal logistic with either the full or grouped LMUP score. The recommendations were then tested on the original UK LMUP data. Results There were small but no important differences in the findings across the regression models. Logistic regression resulted in the largest loss of information, and assumptions were violated for the linear and ordinal logistic regression. Consequently, robust standard errors were used for linear regression and a partial proportional odds ordinal logistic regression model attempted. The latter could only be fitted for grouped LMUP score. Conclusion We recommend the linear regression model with robust standard errors to make full use of the LMUP score when analyzed as an outcome measure. Ordinal logistic regression could be considered, but a partial proportional odds model with grouped LMUP score may be required. Logistic regression is the least-favored option, due to the loss of information. For logistic regression, the cut point for un/planned pregnancy should be between nine and ten. These recommendations will standardize the analysis of LMUP data and enhance comparability of results across studies. PMID:28435343
Garcia-Huidobro, Diego; Michael Oakes, J
2017-04-01
Randomised controlled trials (RCTs) are typically viewed as the gold standard for causal inference. This is because effects of interest can be identified with the fewest assumptions, especially imbalance in background characteristics. Yet because conducting RCTs are expensive, time consuming and sometimes unethical, observational studies are frequently used to study causal associations. In these studies, imbalance, or confounding, is usually controlled with multiple regression, which entails strong assumptions. The purpose of this manuscript is to describe strengths and weaknesses of several methods to control for confounding in observational studies, and to demonstrate their use in cross-sectional dataset that use patient registration data from the Juan Pablo II Primary Care Clinic in La Pintana-Chile. The dataset contains responses from 5855 families who provided complete information on family socio-demographics, family functioning and health problems among their family members. We employ regression adjustment, stratification, restriction, matching, propensity score matching, standardisation and inverse probability weighting to illustrate the approaches to better causal inference in non-experimental data and compare results. By applying study design and data analysis techniques that control for confounding in different ways than regression adjustment, researchers may strengthen the scientific relevance of observational studies. © 2016 International Union of Psychological Science.
Poisson Mixture Regression Models for Heart Disease Prediction.
Mufudza, Chipo; Erol, Hamza
2016-01-01
Early heart disease control can be achieved by high disease prediction and diagnosis efficiency. This paper focuses on the use of model based clustering techniques to predict and diagnose heart disease via Poisson mixture regression models. Analysis and application of Poisson mixture regression models is here addressed under two different classes: standard and concomitant variable mixture regression models. Results show that a two-component concomitant variable Poisson mixture regression model predicts heart disease better than both the standard Poisson mixture regression model and the ordinary general linear Poisson regression model due to its low Bayesian Information Criteria value. Furthermore, a Zero Inflated Poisson Mixture Regression model turned out to be the best model for heart prediction over all models as it both clusters individuals into high or low risk category and predicts rate to heart disease componentwise given clusters available. It is deduced that heart disease prediction can be effectively done by identifying the major risks componentwise using Poisson mixture regression model.
Poisson Mixture Regression Models for Heart Disease Prediction
Erol, Hamza
2016-01-01
Early heart disease control can be achieved by high disease prediction and diagnosis efficiency. This paper focuses on the use of model based clustering techniques to predict and diagnose heart disease via Poisson mixture regression models. Analysis and application of Poisson mixture regression models is here addressed under two different classes: standard and concomitant variable mixture regression models. Results show that a two-component concomitant variable Poisson mixture regression model predicts heart disease better than both the standard Poisson mixture regression model and the ordinary general linear Poisson regression model due to its low Bayesian Information Criteria value. Furthermore, a Zero Inflated Poisson Mixture Regression model turned out to be the best model for heart prediction over all models as it both clusters individuals into high or low risk category and predicts rate to heart disease componentwise given clusters available. It is deduced that heart disease prediction can be effectively done by identifying the major risks componentwise using Poisson mixture regression model. PMID:27999611
Factor analysis and multiple regression between topography and precipitation on Jeju Island, Korea
NASA Astrophysics Data System (ADS)
Um, Myoung-Jin; Yun, Hyeseon; Jeong, Chang-Sam; Heo, Jun-Haeng
2011-11-01
SummaryIn this study, new factors that influence precipitation were extracted from geographic variables using factor analysis, which allow for an accurate estimation of orographic precipitation. Correlation analysis was also used to examine the relationship between nine topographic variables from digital elevation models (DEMs) and the precipitation in Jeju Island. In addition, a spatial analysis was performed in order to verify the validity of the regression model. From the results of the correlation analysis, it was found that all of the topographic variables had a positive correlation with the precipitation. The relations between the variables also changed in accordance with a change in the precipitation duration. However, upon examining the correlation matrix, no significant relationship between the latitude and the aspect was found. According to the factor analysis, eight topographic variables (latitude being the exception) were found to have a direct influence on the precipitation. Three factors were then extracted from the eight topographic variables. By directly comparing the multiple regression model with the factors (model 1) to the multiple regression model with the topographic variables (model 3), it was found that model 1 did not violate the limits of statistical significance and multicollinearity. As such, model 1 was considered to be appropriate for estimating the precipitation when taking into account the topography. In the study of model 1, the multiple regression model using factor analysis was found to be the best method for estimating the orographic precipitation on Jeju Island.
Hypothesis Testing Using Factor Score Regression
Devlieger, Ines; Mayer, Axel; Rosseel, Yves
2015-01-01
In this article, an overview is given of four methods to perform factor score regression (FSR), namely regression FSR, Bartlett FSR, the bias avoiding method of Skrondal and Laake, and the bias correcting method of Croon. The bias correcting method is extended to include a reliable standard error. The four methods are compared with each other and with structural equation modeling (SEM) by using analytic calculations and two Monte Carlo simulation studies to examine their finite sample characteristics. Several performance criteria are used, such as the bias using the unstandardized and standardized parameterization, efficiency, mean square error, standard error bias, type I error rate, and power. The results show that the bias correcting method, with the newly developed standard error, is the only suitable alternative for SEM. While it has a higher standard error bias than SEM, it has a comparable bias, efficiency, mean square error, power, and type I error rate. PMID:29795886
Weather Impact on Airport Arrival Meter Fix Throughput
NASA Technical Reports Server (NTRS)
Wang, Yao
2017-01-01
Time-based flow management provides arrival aircraft schedules based on arrival airport conditions, airport capacity, required spacing, and weather conditions. In order to meet a scheduled time at which arrival aircraft can cross an airport arrival meter fix prior to entering the airport terminal airspace, air traffic controllers make regulations on air traffic. Severe weather may create an airport arrival bottleneck if one or more of airport arrival meter fixes are partially or completely blocked by the weather and the arrival demand has not been reduced accordingly. Under these conditions, aircraft are frequently being put in holding patterns until they can be rerouted. A model that predicts the weather impacted meter fix throughput may help air traffic controllers direct arrival flows into the airport more efficiently, minimizing arrival meter fix congestion. This paper presents an analysis of air traffic flows across arrival meter fixes at the Newark Liberty International Airport (EWR). Several scenarios of weather impacted EWR arrival fix flows are described. Furthermore, multiple linear regression and regression tree ensemble learning approaches for translating multiple sector Weather Impacted Traffic Indexes (WITI) to EWR arrival meter fix throughputs are examined. These weather translation models are developed and validated using the EWR arrival flight and weather data for the period of April-September in 2014. This study also compares the performance of the regression tree ensemble with traditional multiple linear regression models for estimating the weather impacted throughputs at each of the EWR arrival meter fixes. For all meter fixes investigated, the results from the regression tree ensemble weather translation models show a stronger correlation between model outputs and observed meter fix throughputs than that produced from multiple linear regression method.
Quantitative imaging biomarkers: Effect of sample size and bias on confidence interval coverage.
Obuchowski, Nancy A; Bullen, Jennifer
2017-01-01
Introduction Quantitative imaging biomarkers (QIBs) are being increasingly used in medical practice and clinical trials. An essential first step in the adoption of a quantitative imaging biomarker is the characterization of its technical performance, i.e. precision and bias, through one or more performance studies. Then, given the technical performance, a confidence interval for a new patient's true biomarker value can be constructed. Estimating bias and precision can be problematic because rarely are both estimated in the same study, precision studies are usually quite small, and bias cannot be measured when there is no reference standard. Methods A Monte Carlo simulation study was conducted to assess factors affecting nominal coverage of confidence intervals for a new patient's quantitative imaging biomarker measurement and for change in the quantitative imaging biomarker over time. Factors considered include sample size for estimating bias and precision, effect of fixed and non-proportional bias, clustered data, and absence of a reference standard. Results Technical performance studies of a quantitative imaging biomarker should include at least 35 test-retest subjects to estimate precision and 65 cases to estimate bias. Confidence intervals for a new patient's quantitative imaging biomarker measurement constructed under the no-bias assumption provide nominal coverage as long as the fixed bias is <12%. For confidence intervals of the true change over time, linearity must hold and the slope of the regression of the measurements vs. true values should be between 0.95 and 1.05. The regression slope can be assessed adequately as long as fixed multiples of the measurand can be generated. Even small non-proportional bias greatly reduces confidence interval coverage. Multiple lesions in the same subject can be treated as independent when estimating precision. Conclusion Technical performance studies of quantitative imaging biomarkers require moderate sample sizes in order to provide robust estimates of bias and precision for constructing confidence intervals for new patients. Assumptions of linearity and non-proportional bias should be assessed thoroughly.
Miller, Suellen; Fathalla, Mohamed M F; Ojengbede, Oladosu A; Camlin, Carol; Mourad-Youssif, Mohammed; Morhason-Bello, Imran O; Galadanci, Hadiza; Nsima, David; Butrick, Elizabeth; Al Hussaini, Tarek; Turan, Janet; Meyer, Carinne; Martin, Hilarie; Mohammed, Aminu I
2010-10-18
Obstetric hemorrhage is the leading cause of maternal mortality globally. The Non-pneumatic Anti-Shock Garment (NASG) is a low-technology, first-aid compression device which, when added to standard hypovolemic shock protocols, may improve outcomes for women with hypovolemic shock secondary to obstetric hemorrhage in tertiary facilities in low-resource settings. This study employed a pre-intervention/intervention design in four facilities in Nigeria and two in Egypt. Primary outcomes were measured mean and median blood loss, severe end-organ failure morbidity (renal failure, pulmonary failure, cardiac failure, or CNS dysfunctions), mortality, and emergency hysterectomy for 1442 women with ≥750 mL blood loss and at least one sign of hemodynamic instability. Comparisons of outcomes by study phase were assessed with rank sum tests, relative risks (RR), number needed to treat for benefit (NNTb), and multiple logistic regression. Women in the NASG phase (n = 835) were in worse condition on study entry, 38.5% with mean arterial pressure <60 mmHg vs. 29.9% in the pre-intervention phase (p = 0.001). Despite this, negative outcomes were significantly reduced in the NASG phase: mean measured blood loss decreased from 444 mL to 240 mL (p < 0.001), maternal mortality decreased from 6.3% to 3.5% (RR 0.56, 95% CI 0.35-0.89), severe morbidities from 3.7% to 0.7% (RR 0.20, 95% CI 0.08-0.50), and emergency hysterectomy from 8.9% to 4.0% (RR 0.44, 0.23-0.86). In multiple logistic regression, there was a 55% reduced odds of mortality during the NASG phase (aOR 0.45, 0.27-0.77). The NNTb to prevent either mortality or severe morbidity was 18 (12-36). Adding the NASG to standard shock and hemorrhage management may significantly improve maternal outcomes from hypovolemic shock secondary to obstetric hemorrhage at tertiary care facilities in low-resource settings.
Polak, Michel; Blair, Jo; Kotnik, Primoz; Pournara, Effie; Pedersen, Birgitte Tønnes; Rohrer, Tilman R
2017-11-01
To investigate the effect of age at growth hormone (GH) treatment start on near adult height (NAH) in children with isolated GH deficiency (GHD). NordiNet® International Outcome Study (IOS) (Nbib960128), a non-interventional, multicentre study, evaluates the long-term effectiveness and safety of Norditropin® (somatropin) (Novo Nordisk A/S) in the real-life clinical setting. Patients ( n = 172) treated to NAH (height at ≥18 years, or height velocity <2 cm/year at ≥16 (boys) or ≥15 (girls) years) were grouped by age (years) at treatment start (early (girls, <8; boys, <9), intermediate (girls, 8-10; boys, 9-11) or late (girls, >10; boys, >11)) and GHD severity (<3 ng/mL or 3 to ≤10 ng/mL). Multiple regression analysis was used to evaluate the effect of age at treatment start (as a categorical and continuous variable) on NAH standard deviation score (SDS). Age at treatment start had a marked effect on NAH SDS; NAH SDS achieved by patients starting treatment early ( n = 40 (boys, 70.0%); least squares mean (standard error) -0.76 (0.14)) exceeded that achieved by those starting later (intermediate, n = 42 (boys, 57.1%); -1.14 (0.15); late, n = 90 (boys, 68.9%); -1.21 (0.10)). Multiple regression analysis showed a significant association between NAH SDS and age at treatment start ( P < 0.0242), baseline height SDS (HSDS) ( P < 0.0001), target HSDS ( P < 0.0001), and GHD severity ( P = 0.0012). Most (78.5%) patients achieved a normal NAH irrespective of age at treatment start. Early initiation of GH treatment in children with isolated GHD improves their chance of achieving their genetic height potential. © 2017 The authors.
A Statistical Multimodel Ensemble Approach to Improving Long-Range Forecasting in Pakistan
2012-03-01
Impact of global warming on monsoon variability in Pakistan. J. Anim. Pl. Sci., 21, no. 1, 107–110. Gillies, S., T. Murphree, and D. Meyer, 2012...are generated by multiple regression models that relate globally distributed oceanic and atmospheric predictors to local predictands. The...generated by multiple regression models that relate globally distributed oceanic and atmospheric predictors to local predictands. The predictands are
NASA Astrophysics Data System (ADS)
Yoshida, Kenichiro; Nishidate, Izumi; Ojima, Nobutoshi; Iwata, Kayoko
2014-01-01
To quantitatively evaluate skin chromophores over a wide region of curved skin surface, we propose an approach that suppresses the effect of the shading-derived error in the reflectance on the estimation of chromophore concentrations, without sacrificing the accuracy of that estimation. In our method, we use multiple regression analysis, assuming the absorbance spectrum as the response variable and the extinction coefficients of melanin, oxygenated hemoglobin, and deoxygenated hemoglobin as the predictor variables. The concentrations of melanin and total hemoglobin are determined from the multiple regression coefficients using compensation formulae (CF) based on the diffuse reflectance spectra derived from a Monte Carlo simulation. To suppress the shading-derived error, we investigated three different combinations of multiple regression coefficients for the CF. In vivo measurements with the forearm skin demonstrated that the proposed approach can reduce the estimation errors that are due to shading-derived errors in the reflectance. With the best combination of multiple regression coefficients, we estimated that the ratio of the error to the chromophore concentrations is about 10%. The proposed method does not require any measurements or assumptions about the shape of the subjects; this is an advantage over other studies related to the reduction of shading-derived errors.
Byun, Bo-Ram; Kim, Yong-Il; Yamaguchi, Tetsutaro; Maki, Koutaro; Son, Woo-Sung
2015-01-01
This study was aimed to examine the correlation between skeletal maturation status and parameters from the odontoid process/body of the second vertebra and the bodies of third and fourth cervical vertebrae and simultaneously build multiple regression models to be able to estimate skeletal maturation status in Korean girls. Hand-wrist radiographs and cone beam computed tomography (CBCT) images were obtained from 74 Korean girls (6-18 years of age). CBCT-generated cervical vertebral maturation (CVM) was used to demarcate the odontoid process and the body of the second cervical vertebra, based on the dentocentral synchondrosis. Correlation coefficient analysis and multiple linear regression analysis were used for each parameter of the cervical vertebrae (P < 0.05). Forty-seven of 64 parameters from CBCT-generated CVM (independent variables) exhibited statistically significant correlations (P < 0.05). The multiple regression model with the greatest R (2) had six parameters (PH2/W2, UW2/W2, (OH+AH2)/LW2, UW3/LW3, D3, and H4/W4) as independent variables with a variance inflation factor (VIF) of <2. CBCT-generated CVM was able to include parameters from the second cervical vertebral body and odontoid process, respectively, for the multiple regression models. This suggests that quantitative analysis might be used to estimate skeletal maturation status.
Multi-model ensemble combinations of the water budget in the East/Japan Sea
NASA Astrophysics Data System (ADS)
HAN, S.; Hirose, N.; Usui, N.; Miyazawa, Y.
2016-02-01
The water balance of East/Japan Sea is determined mainly by inflow and outflow through the Korea/Tsushima, Tsugaru and Soya/La Perouse Straits. However, the volume transports measured at three straits remain quantitatively unbalanced. This study examined the seasonal variation of the volume transport using the multiple linear regression and ridge regression of multi-model ensemble (MME) methods to estimate physically consistent circulation in East/Japan Sea by using four different data assimilation models. The MME outperformed all of the single models by reducing uncertainties, especially the multicollinearity problem with the ridge regression. However, the regression constants turned out to be inconsistent with each other if the MME was applied separately for each strait. The MME for a connected system was thus performed to find common constants for these straits. The estimation of this MME was found to be similar to the MME result of sea level difference (SLD). The estimated mean transport (2.42 Sv) was smaller than the measurement data at the Korea/Tsushima Strait, but the calibrated transport of the Tsugaru Strait (1.63 Sv) was larger than the observed data. The MME results of transport and SLD also suggested that the standard deviation (STD) of the Korea/Tsushima Strait is larger than the STD of the observation, whereas the estimated results were almost identical to that observed for the Tsugaru and Soya/La Perouse Straits. The similarity between MME results enhances the reliability of the present MME estimation.
Multi-model ensemble estimation of volume transport through the straits of the East/Japan Sea
NASA Astrophysics Data System (ADS)
Han, Sooyeon; Hirose, Naoki; Usui, Norihisa; Miyazawa, Yasumasa
2016-01-01
The volume transports measured at the Korea/Tsushima, Tsugaru, and Soya/La Perouse Straits remain quantitatively inconsistent. However, data assimilation models at least provide a self-consistent budget despite subtle differences among the models. This study examined the seasonal variation of the volume transport using the multiple linear regression and ridge regression of multi-model ensemble (MME) methods to estimate more accurately transport at these straits by using four different data assimilation models. The MME outperformed all of the single models by reducing uncertainties, especially the multicollinearity problem with the ridge regression. However, the regression constants turned out to be inconsistent with each other if the MME was applied separately for each strait. The MME for a connected system was thus performed to find common constants for these straits. The estimation of this MME was found to be similar to the MME result of sea level difference (SLD). The estimated mean transport (2.43 Sv) was smaller than the measurement data at the Korea/Tsushima Strait, but the calibrated transport of the Tsugaru Strait (1.63 Sv) was larger than the observed data. The MME results of transport and SLD also suggested that the standard deviation (STD) of the Korea/Tsushima Strait is larger than the STD of the observation, whereas the estimated results were almost identical to that observed for the Tsugaru and Soya/La Perouse Straits. The similarity between MME results enhances the reliability of the present MME estimation.
ERIC Educational Resources Information Center
Crawford, John R.; Garthwaite, Paul H.; Denham, Annie K.; Chelune, Gordon J.
2012-01-01
Regression equations have many useful roles in psychological assessment. Moreover, there is a large reservoir of published data that could be used to build regression equations; these equations could then be employed to test a wide variety of hypotheses concerning the functioning of individual cases. This resource is currently underused because…
Behavioral and psychosocial factors associated with suicidal ideation among adolescents.
Lee, GyuYoung; Ham, Ok Kyung
2018-04-10
Suicidal ideation poses a serious threat to the well-being of adolescents and is the strongest risk factor for suicide. Indeed, Korea ranks first among Organisation for Economic Cooperation and Development countries regarding the age-standardized suicide rates. In the present study, we examined multiple levels of factors associated with the suicidal ideation of adolescents in Korea by applying the Ecological Models of Health Behavior. A cross-sectional study was conducted with a convenience sample of 860 adolescents. The instruments included the Beck Depression Inventory and the Adolescent Mental Health and Problem Behavior Questionnaire. The data were analyzed using hierarchical multiple regression. Sixteen percent of participants reported suicidal ideation. Intrapersonal (sleep disturbance, Internet game addiction, destructive behavior, and depressive symptoms) and interpersonal factors (family conflicts and peer victimization) were associated with suicidal ideation. Because multiple factors were associated with suicidal ideation among adolescents, both intrapersonal (sleep disturbance, Internet game addiction, and depression) and interpersonal factors (family conflicts and peer problems) should be considered in the development of suicide-prevention programs. These programs could include campaigns changing the norms (permissive attitudes toward school violence) and the development of strict and rigorous school non-violence policies. © 2018 John Wiley & Sons Australia, Ltd.
Multiple traumatic brain injury and concussive symptoms among deployed military personnel.
Bryan, Craig J
2013-01-01
To identify if concussive symptoms occur with greater frequency among military personnel with multiple lifetime TBIs and if a history of TBI increases risk for subsequent TBI. One hundred and sixty-one military personnel referred to a TBI clinic for evaluation and treatment of suspected head injury at a military clinic in Iraq. Military patients completed standardized self-report measures of concussion, depression and post-traumatic stress symptoms; clinical interview; and physical examination. Group comparisons were made according to number of lifetime TBIs and logistic regression was utilized to determine the association of past TBIs on current TBI. Patients with one or more previous TBIs were more likely to report concussion symptoms immediately following a recent injury and during the evaluation. Although differences between single and multiple TBI groups were observed, these did not reach the level of statistical significance. A history of any TBI increased the likelihood of current TBI diagnosis, but this relationship was no longer significant when adjusting for injury mechanism, depression and post-traumatic stress symptoms. Among deployed military personnel, the relationship of previous TBI with recent TBI and concussive symptoms may be largely explained by the presence of psychological symptoms.
Kahnert, Kathrin; Alter, Peter; Welte, Tobias; Huber, Rudolf M; Behr, Jürgen; Biertz, Frank; Watz, Henrik; Bals, Robert; Vogelmeier, Claus F; Jörres, Rudolf A
2018-06-04
Recent investigations showed single associations between uric acid levels, functional parameters, exacerbations and mortality in COPD patients. The aim of this study was to describe the role of uric acid within the network of multiple relationships between function, exacerbation and comorbidities. We used baseline data from the German COPD cohort COSYCONET which were evaluated by standard multiple regression analyses as well as path analysis to quantify the network of relations between parameters, particularly uric acid. Data from 1966 patients were analyzed. Uric acid was significantly associated with reduced FEV 1 , reduced 6-MWD, higher burden of exacerbations (GOLD criteria) and cardiovascular comorbidities, in addition to risk factors such as BMI and packyears. These associations remained significant after taking into account their multiple interdependences. Compared to uric acid levels the diagnosis of hyperuricemia and its medication played a minor role. Within the limits of a cross-sectional approach, our results strongly suggest that uric acid is a biomarker of high impact in COPD and plays a genuine role for relevant outcomes such as physical capacity and exacerbations. These findings suggest that more attention should be paid to uric acid in the evaluation of COPD disease status.
Functional Regression Models for Epistasis Analysis of Multiple Quantitative Traits.
Zhang, Futao; Xie, Dan; Liang, Meimei; Xiong, Momiao
2016-04-01
To date, most genetic analyses of phenotypes have focused on analyzing single traits or analyzing each phenotype independently. However, joint epistasis analysis of multiple complementary traits will increase statistical power and improve our understanding of the complicated genetic structure of the complex diseases. Despite their importance in uncovering the genetic structure of complex traits, the statistical methods for identifying epistasis in multiple phenotypes remains fundamentally unexplored. To fill this gap, we formulate a test for interaction between two genes in multiple quantitative trait analysis as a multiple functional regression (MFRG) in which the genotype functions (genetic variant profiles) are defined as a function of the genomic position of the genetic variants. We use large-scale simulations to calculate Type I error rates for testing interaction between two genes with multiple phenotypes and to compare the power with multivariate pairwise interaction analysis and single trait interaction analysis by a single variate functional regression model. To further evaluate performance, the MFRG for epistasis analysis is applied to five phenotypes of exome sequence data from the NHLBI's Exome Sequencing Project (ESP) to detect pleiotropic epistasis. A total of 267 pairs of genes that formed a genetic interaction network showed significant evidence of epistasis influencing five traits. The results demonstrate that the joint interaction analysis of multiple phenotypes has a much higher power to detect interaction than the interaction analysis of a single trait and may open a new direction to fully uncovering the genetic structure of multiple phenotypes.
Raj, Retheep; Sivanandan, K S
2017-01-01
Estimation of elbow dynamics has been the object of numerous investigations. In this work a solution is proposed for estimating elbow movement velocity and elbow joint angle from Surface Electromyography (SEMG) signals. Here the Surface Electromyography signals are acquired from the biceps brachii muscle of human hand. Two time-domain parameters, Integrated EMG (IEMG) and Zero Crossing (ZC), are extracted from the Surface Electromyography signal. The relationship between the time domain parameters, IEMG and ZC with elbow angular displacement and elbow angular velocity during extension and flexion of the elbow are studied. A multiple input-multiple output model is derived for identifying the kinematics of elbow. A Nonlinear Auto Regressive with eXogenous inputs (NARX) structure based multiple layer perceptron neural network (MLPNN) model is proposed for the estimation of elbow joint angle and elbow angular velocity. The proposed NARX MLPNN model is trained using Levenberg-marquardt based algorithm. The proposed model is estimating the elbow joint angle and elbow movement angular velocity with appreciable accuracy. The model is validated using regression coefficient value (R). The average regression coefficient value (R) obtained for elbow angular displacement prediction is 0.9641 and for the elbow anglular velocity prediction is 0.9347. The Nonlinear Auto Regressive with eXogenous inputs (NARX) structure based multiple layer perceptron neural networks (MLPNN) model can be used for the estimation of angular displacement and movement angular velocity of the elbow with good accuracy.
Williams, Rachel E; Arabi, Mazdak; Loftis, Jim; Elmund, G Keith
2014-09-01
Implementation of numeric nutrient standards in Colorado has prompted a need for greater understanding of human impacts on ambient nutrient levels. This study explored the variability of annual nutrient concentrations due to upstream anthropogenic influences and developed a mathematical expression for the number of samples required to estimate median concentrations for standard compliance. A procedure grounded in statistical hypothesis testing was developed to estimate the number of annual samples required at monitoring locations while taking into account the difference between the median concentrations and the water quality standard for a lognormal population. For the Cache La Poudre River in northern Colorado, the relationship between the median and standard deviation of total N (TN) and total P (TP) concentrations and the upstream point and nonpoint concentrations and general hydrologic descriptors was explored using multiple linear regression models. Very strong relationships were evident between the upstream anthropogenic influences and annual medians for TN and TP ( > 0.85, < 0.001) and corresponding standard deviations ( > 0.7, < 0.001). Sample sizes required to demonstrate (non)compliance with the standard depend on the measured water quality conditions. When the median concentration differs from the standard by >20%, few samples are needed to reach a 95% confidence level. When the median is within 20% of the corresponding water quality standard, however, the required sample size increases rapidly, and hundreds of samples may be required. Copyright © by the American Society of Agronomy, Crop Science Society of America, and Soil Science Society of America, Inc.
On the Occurrence of Standardized Regression Coefficients Greater than One.
ERIC Educational Resources Information Center
Deegan, John, Jr.
1978-01-01
It is demonstrated here that standardized regression coefficients greater than one can legitimately occur. Furthermore, the relationship between the occurrence of such coefficients and the extent of multicollinearity present among the set of predictor variables in an equation is examined. Comments on the interpretation of these coefficients are…
Regression Commonality Analysis: A Technique for Quantitative Theory Building
ERIC Educational Resources Information Center
Nimon, Kim; Reio, Thomas G., Jr.
2011-01-01
When it comes to multiple linear regression analysis (MLR), it is common for social and behavioral science researchers to rely predominately on beta weights when evaluating how predictors contribute to a regression model. Presenting an underutilized statistical technique, this article describes how organizational researchers can use commonality…
Precision Efficacy Analysis for Regression.
ERIC Educational Resources Information Center
Brooks, Gordon P.
When multiple linear regression is used to develop a prediction model, sample size must be large enough to ensure stable coefficients. If the derivation sample size is inadequate, the model may not predict well for future subjects. The precision efficacy analysis for regression (PEAR) method uses a cross- validity approach to select sample sizes…
Automated Assessment of Child Vocalization Development Using LENA.
Richards, Jeffrey A; Xu, Dongxin; Gilkerson, Jill; Yapanel, Umit; Gray, Sharmistha; Paul, Terrance
2017-07-12
To produce a novel, efficient measure of children's expressive vocal development on the basis of automatic vocalization assessment (AVA), child vocalizations were automatically identified and extracted from audio recordings using Language Environment Analysis (LENA) System technology. Assessment was based on full-day audio recordings collected in a child's unrestricted, natural language environment. AVA estimates were derived using automatic speech recognition modeling techniques to categorize and quantify the sounds in child vocalizations (e.g., protophones and phonemes). These were expressed as phone and biphone frequencies, reduced to principal components, and inputted to age-based multiple linear regression models to predict independently collected criterion-expressive language scores. From these models, we generated vocal development AVA estimates as age-standardized scores and development age estimates. AVA estimates demonstrated strong statistical reliability and validity when compared with standard criterion expressive language assessments. Automated analysis of child vocalizations extracted from full-day recordings in natural settings offers a novel and efficient means to assess children's expressive vocal development. More research remains to identify specific mechanisms of operation.
Body Composition Predicts Growth in Infants and Toddlers With Chronic Liver Disease.
Hurtado-López, Erika F; Vásquez-Garibay, Edgar M; Trujillo, Xóchitl; Larrosa-Haro, Alfredo
2017-12-01
This cross-sectional study was conducted on 15 infants and toddlers with chronic liver disease to validate arm anthropometry as an accurate measure of body composition (BC) compared to dual-energy x-ray absorptiometry and to predict growth from BC. The z score means of the anthropometric indicators were <-2 standard deviation, except for body fat index and subscapular skinfold, which were between -2 and +2 standard deviation. Fat mass was predicted by arm adiposity indicators and fat-free mass by arm muscle area. Bone mineral content explained 87% of variation in length. Two multiple regression models predicted length: 1 with fat mass plus fat-free mass; and the second with fat mass and bone mineral content. These observations suggest that arm anthropometry is a useful tool to estimate BC and the nutritional status in infants and toddlers with chronic liver disease. Length and head circumference can be predicted by fat mass, fat-free mass, and bone mineral content.
The impact of social and organizational factors on workers' coping with musculoskeletal symptoms.
Torp, S; Riise, T; Moen, B E
2001-07-01
Workers with musculoskeletal symptoms are often advised to cope with their symptoms by changing their working technique and by using lifting equipment. The main objective of this study was to test the hypothesis that negative social and organizational factors where people are employed may prevent workers from implementing these coping strategies. A total of 1,567 automobile garage workers (72%) returned a questionnaire concerning coping with musculoskeletal symptoms and social and organizational factors. When job demands, decision authority, social support, and management support related to health, environment, and safety (HES) were used as predictor variables in a multiple regression model, coping as the outcome variable was correlated with decision authority, social support, and HES-related management support (standardized beta=.079,.12, and.13, respectively). When an index for health-related support and control was added to the model, it correlated with coping (standardized beta=.36), whereas the other relationships disappeared. Decision authority and social support entail health-related support and control that, in turn, influences coping.
Ham, Young-Sik; Kobori, Hiromi; Takasago, Masahisa
2009-05-01
The indicator bacteria (standard plate count, total coliform, and fecal coliform bacteria) concentrations have been investigated using six ambient habitats (population density, percent sewer penetration, stream flow rate (m(3)/sec), percent residential area, percent forest area and percent agricultural area) in the Tama River basin in Tokyo, Japan during June 2003 to January 2005. The downstream and tributary Tama River showed higher concentrations of TC and FC bacteria than the upstream waters, which exceeded an environmental quality standard for rivers and a bathing water quality criterion. It was estimated that combined sewer overflow (CSO) and stormwater effluents contributed -4-23% to the indicator bacteria concentrations of the Tama River. The results of multiple regression analyses show that the indicator bacteria concentrations of Tama River basin are significantly affected by population density. It is concluded that the Tama River received a significant bacterial contamination load originating from the anthropogenic source.
Gotvald, Anthony J.; Barth, Nancy A.; Veilleux, Andrea G.; Parrett, Charles
2012-01-01
Methods for estimating the magnitude and frequency of floods in California that are not substantially affected by regulation or diversions have been updated. Annual peak-flow data through water year 2006 were analyzed for 771 streamflow-gaging stations (streamgages) in California having 10 or more years of data. Flood-frequency estimates were computed for the streamgages by using the expected moments algorithm to fit a Pearson Type III distribution to logarithms of annual peak flows for each streamgage. Low-outlier and historic information were incorporated into the flood-frequency analysis, and a generalized Grubbs-Beck test was used to detect multiple potentially influential low outliers. Special methods for fitting the distribution were developed for streamgages in the desert region in southeastern California. Additionally, basin characteristics for the streamgages were computed by using a geographical information system. Regional regression analysis, using generalized least squares regression, was used to develop a set of equations for estimating flows with 50-, 20-, 10-, 4-, 2-, 1-, 0.5-, and 0.2-percent annual exceedance probabilities for ungaged basins in California that are outside of the southeastern desert region. Flood-frequency estimates and basin characteristics for 630 streamgages were combined to form the final database used in the regional regression analysis. Five hydrologic regions were developed for the area of California outside of the desert region. The final regional regression equations are functions of drainage area and mean annual precipitation for four of the five regions. In one region, the Sierra Nevada region, the final equations are functions of drainage area, mean basin elevation, and mean annual precipitation. Average standard errors of prediction for the regression equations in all five regions range from 42.7 to 161.9 percent. For the desert region of California, an analysis of 33 streamgages was used to develop regional estimates of all three parameters (mean, standard deviation, and skew) of the log-Pearson Type III distribution. The regional estimates were then used to develop a set of equations for estimating flows with 50-, 20-, 10-, 4-, 2-, 1-, 0.5-, and 0.2-percent annual exceedance probabilities for ungaged basins. The final regional regression equations are functions of drainage area. Average standard errors of prediction for these regression equations range from 214.2 to 856.2 percent. Annual peak-flow data through water year 2006 were analyzed for eight streamgages in California having 10 or more years of data considered to be affected by urbanization. Flood-frequency estimates were computed for the urban streamgages by fitting a Pearson Type III distribution to logarithms of annual peak flows for each streamgage. Regression analysis could not be used to develop flood-frequency estimation equations for urban streams because of the limited number of sites. Flood-frequency estimates for the eight urban sites were graphically compared to flood-frequency estimates for 630 non-urban sites. The regression equations developed from this study will be incorporated into the U.S. Geological Survey (USGS) StreamStats program. The StreamStats program is a Web-based application that provides streamflow statistics and basin characteristics for USGS streamgages and ungaged sites of interest. StreamStats can also compute basin characteristics and provide estimates of streamflow statistics for ungaged sites when users select the location of a site along any stream in California.
Assessment of the spatial scaling behaviour of floods in the United Kingdom
NASA Astrophysics Data System (ADS)
Formetta, Giuseppe; Stewart, Elizabeth; Bell, Victoria
2017-04-01
Floods are among the most dangerous natural hazards, causing loss of life and significant damage to private and public property. Regional flood-frequency analysis (FFA) methods are essential tools to assess the flood hazard and plan interventions for its mitigation. FFA methods are often based on the well-known index flood method that assumes the invariance of the coefficient of variation of floods with drainage area. This assumption is equivalent to the simple scaling or self-similarity assumption for peak floods, i.e. their spatial structure remains similar in a particular, relatively simple, way to itself over a range of scales. Spatial scaling of floods has been evaluated at national scale for different countries such as Canada, USA, and Australia. According our knowledge. Such a study has not been conducted for the United Kingdom even though the standard FFA method there is based on the index flood assumption. In this work we present an integrated approach to assess of the spatial scaling behaviour of floods in the United Kingdom using three different methods: product moments (PM), probability weighted moments (PWM), and quantile analysis (QA). We analyse both instantaneous and daily annual observed maximum floods and performed our analysis both across the entire country and in its sub-climatic regions as defined in the Flood Studies Report (NERC, 1975). To evaluate the relationship between the k-th moments or quantiles and the drainage area we used both regression with area alone and multiple regression considering other explanatory variables to account for the geomorphology, amount of rainfall, and soil type of the catchments. The latter multiple regression approach was only recently demonstrated being more robust than the traditional regression with area alone that can lead to biased estimates of scaling exponents and misinterpretation of spatial scaling behaviour. We tested our framework on almost 600 rural catchments in UK considered as entire region and split in 11 sub-regions with 50 catchments per region on average. Preliminary results from the three different spatial scaling methods are generally in agreement and indicate that: i) only some of the peak flow variability is explained by area alone (approximately 50% for the entire country and ranging between the 40% and 70% for the sub-regions); ii) this percentage increases to 90% for the entire country and ranges between 80% and 95% for the sub-regions when the multiple regression is used; iii) the simple scaling hypothesis holds in all sub-regions with the exception of weak multi-scaling found in the regions 2 (North), and 5 and 6 (South East). We hypothesize that these deviations can be explained by heterogeneity in large scale precipitation and by the influence of the soil type (predominantly chalk) on the flood formation process in regions 5 and 6.
NASA Astrophysics Data System (ADS)
Dama, Madhukar Shivajirao
2013-06-01
Human populations show remarkable variation in the sex ratio at birth which is believed to be related to the parental condition. In the present study, the global variation of sex ratio at birth (SRB, proportion of male offspring born) was analyzed with respect to indirect measure of condition, the intelligence quotient (IQ). IQ correlates strongly with lifespan across nations, which makes it a good indicator of health of the large populations. Relation between three standard measures of average national IQ and SRB was studied using multiple linear regression models. Average national IQ was positively correlated with SRB ( r = 0.54 to 0.57, p < 0.001). Further, IQ emerged as a powerful predictor of SRB after controlling for the effects of all the known covariates like fertility, maternal age, polygyny prevalence, wealth, son preference, latitude, low birth weight, and neonatal mortality in the regression models. These results suggest that the striking variation of offspring sex ratio across nations could be caused in part by the difference in general condition of populations.
Waber, Deborah P; Bryce, Cyralene P; Girard, Jonathan M; Zichlin, Miriam; Fitzmaurice, Garrett M; Galler, Janina R
2014-02-01
To evaluate IQ and academic skills in adults who experienced an episode of moderate-to-severe infantile malnutrition and a healthy control group, all followed since childhood in the Barbados Nutrition Study. IQ and academic skills were assessed in 77 previously malnourished adults (mean age = 38.4 years; 53% male) and 59 controls (mean age = 38.1 years; 54% male). Group comparisons were carried out by multiple regression and logistic regression, adjusted for childhood socioeconomic factors. The previously malnourished group showed substantial deficits on all outcomes relative to healthy controls (P < 0.0001). IQ scores in the intellectual disability range (< 70) were nine times more prevalent in the previously malnourished group (odds ratio = 9.18; 95% confidence interval = 3.50-24.13). Group differences in IQ of approximately one standard deviation were stable from adolescence through mid-life. Moderate-to-severe malnutrition during infancy is associated with a significantly elevated incidence of impaired IQ in adulthood, even when physical growth is completely rehabilitated. An episode of malnutrition during the first year of life carries risk for significant lifelong functional morbidity.
Life-space mobility and social support in elderly adults with orthopaedic disorders.
Suzuki, Tomoko; Kitaike, Tadashi; Ikezaki, Sumie
2014-03-01
The purpose of this cross-sectional survey was to explore relationships between life-space mobility and the related factors in elderly Japanese people who attend orthopaedic clinics. The study measures included surveys of life-space mobility (Life-space Assessment (LSA) score), social support (social network diversity and social ties), physical ability (instrumental self-maintenance, intellectual activity, social role), orthopaedic factors (diseases and symptoms) and demographic information. The questionnaire was distributed to 156 subjects; 152 persons responded, yielding 140 valid responses. Mean age of the sample was 76.0 ± 6.4 (range, 65-96 years), with 57.9% women (n = 81). In a multiple regression analysis, the six factors were significantly associated with LSA. Standardized partial regression coefficients (β) were gender (0.342), instrumental self-maintenance (0.297), social network diversity (0.217), age (-0.170), difficulty of motion (-0.156) and intellectual activity (0.150), with an adjusted R(2) = 0.488. These results suggest that outpatient health-care providers need to intervene in not only addressing orthopaedic factors but also promoting social support among elderly Japanese. © 2014 Wiley Publishing Asia Pty Ltd.
Optimization of fixture layouts of glass laser optics using multiple kernel regression.
Su, Jianhua; Cao, Enhua; Qiao, Hong
2014-05-10
We aim to build an integrated fixturing model to describe the structural properties and thermal properties of the support frame of glass laser optics. Therefore, (a) a near global optimal set of clamps can be computed to minimize the surface shape error of the glass laser optic based on the proposed model, and (b) a desired surface shape error can be obtained by adjusting the clamping forces under various environmental temperatures based on the model. To construct the model, we develop a new multiple kernel learning method and call it multiple kernel support vector functional regression. The proposed method uses two layer regressions to group and order the data sources by the weights of the kernels and the factors of the layers. Because of that, the influences of the clamps and the temperature can be evaluated by grouping them into different layers.
Prediction of anthropometric foot characteristics in children.
Morrison, Stewart C; Durward, Brian R; Watt, Gordon F; Donaldson, Malcolm D C
2009-01-01
The establishment of growth reference values is needed in pediatric practice where pathologic conditions can have a detrimental effect on the growth and development of the pediatric foot. This study aims to use multiple regression to evaluate the effects of multiple predictor variables (height, age, body mass, and gender) on anthropometric characteristics of the peripubescent foot. Two hundred children aged 9 to 12 years were recruited, and three anthropometric measurements of the pediatric foot were recorded (foot length, forefoot width, and navicular height). Multiple regression analysis was conducted, and coefficients for gender, height, and body mass all had significant relationships for the prediction of forefoot width and foot length (P < or = .05, r > or = 0.7). The coefficients for gender and body mass were not significant for the prediction of navicular height (P > or = .05), whereas height was (P < or = .05). Normative growth reference values and prognostic regression equations are presented for the peripubescent foot.
Birthweight Related Factors in Northwestern Iran: Using Quantile Regression Method.
Fallah, Ramazan; Kazemnejad, Anoshirvan; Zayeri, Farid; Shoghli, Alireza
2015-11-18
Birthweight is one of the most important predicting indicators of the health status in adulthood. Having a balanced birthweight is one of the priorities of the health system in most of the industrial and developed countries. This indicator is used to assess the growth and health status of the infants. The aim of this study was to assess the birthweight of the neonates by using quantile regression in Zanjan province. This analytical descriptive study was carried out using pre-registered (March 2010 - March 2012) data of neonates in urban/rural health centers of Zanjan province using multiple-stage cluster sampling. Data were analyzed using multiple linear regressions andquantile regression method and SAS 9.2 statistical software. From 8456 newborn baby, 4146 (49%) were female. The mean age of the mothers was 27.1±5.4 years. The mean birthweight of the neonates was 3104 ± 431 grams. Five hundred and seventy-three patients (6.8%) of the neonates were less than 2500 grams. In all quantiles, gestational age of neonates (p<0.05), weight and educational level of the mothers (p<0.05) showed a linear significant relationship with the i of the neonates. However, sex and birth rank of the neonates, mothers age, place of residence (urban/rural) and career were not significant in all quantiles (p>0.05). This study revealed the results of multiple linear regression and quantile regression were not identical. We strictly recommend the use of quantile regression when an asymmetric response variable or data with outliers is available.
Birthweight Related Factors in Northwestern Iran: Using Quantile Regression Method
Fallah, Ramazan; Kazemnejad, Anoshirvan; Zayeri, Farid; Shoghli, Alireza
2016-01-01
Introduction: Birthweight is one of the most important predicting indicators of the health status in adulthood. Having a balanced birthweight is one of the priorities of the health system in most of the industrial and developed countries. This indicator is used to assess the growth and health status of the infants. The aim of this study was to assess the birthweight of the neonates by using quantile regression in Zanjan province. Methods: This analytical descriptive study was carried out using pre-registered (March 2010 - March 2012) data of neonates in urban/rural health centers of Zanjan province using multiple-stage cluster sampling. Data were analyzed using multiple linear regressions andquantile regression method and SAS 9.2 statistical software. Results: From 8456 newborn baby, 4146 (49%) were female. The mean age of the mothers was 27.1±5.4 years. The mean birthweight of the neonates was 3104 ± 431 grams. Five hundred and seventy-three patients (6.8%) of the neonates were less than 2500 grams. In all quantiles, gestational age of neonates (p<0.05), weight and educational level of the mothers (p<0.05) showed a linear significant relationship with the i of the neonates. However, sex and birth rank of the neonates, mothers age, place of residence (urban/rural) and career were not significant in all quantiles (p>0.05). Conclusion: This study revealed the results of multiple linear regression and quantile regression were not identical. We strictly recommend the use of quantile regression when an asymmetric response variable or data with outliers is available. PMID:26925889
Office and 24-hour heart rate and target organ damage in hypertensive patients
2012-01-01
Background We investigated the association between heart rate and its variability with the parameters that assess vascular, renal and cardiac target organ damage. Methods A cross-sectional study was performed including a consecutive sample of 360 hypertensive patients without heart rate lowering drugs (aged 56 ± 11 years, 64.2% male). Heart rate (HR) and its standard deviation (HRV) in clinical and 24-hour ambulatory monitoring were evaluated. Renal damage was assessed by glomerular filtration rate and albumin/creatinine ratio; vascular damage by carotid intima-media thickness and ankle/brachial index; and cardiac damage by the Cornell voltage-duration product and left ventricular mass index. Results There was a positive correlation between ambulatory, but not clinical, heart rate and its standard deviation with glomerular filtration rate, and a negative correlation with carotid intima-media thickness, and night/day ratio of systolic and diastolic blood pressure. There was no correlation with albumin/creatinine ratio, ankle/brachial index, Cornell voltage-duration product or left ventricular mass index. In the multiple linear regression analysis, after adjusting for age, the association of glomerular filtration rate and intima-media thickness with ambulatory heart rate and its standard deviation was lost. According to the logistic regression analysis, the predictors of any target organ damage were age (OR = 1.034 and 1.033) and night/day systolic blood pressure ratio (OR = 1.425 and 1.512). Neither 24 HR nor 24 HRV reached statistical significance. Conclusions High ambulatory heart rate and its variability, but not clinical HR, are associated with decreased carotid intima-media thickness and a higher glomerular filtration rate, although this is lost after adjusting for age. Trial Registration ClinicalTrials.gov: NCT01325064 PMID:22439900
A framework for evaluating student perceptions of health policy training in medical school.
Patel, Mitesh S; Lypson, Monica L; Miller, D Douglas; Davis, Matthew M
2014-10-01
Nearly half of graduating medical students in the United States report that medical school provides inadequate instruction in topics related to health policy. Although most medical schools report some form of policy education, there lacks a standard for teaching core concepts and evaluating student satisfaction. Responses to the Association of American Medical College's Medical School Graduation Questionnaire were obtained for the years 2007-2008 and 2011-2012 and mapped to domains of training in health policy curricula for four domains: systems and principles; value and equity; quality and safety; and politics and law. Chi-square tests were used to test differences among unadjusted temporal trends. Multiple logistic regression models were fit to the outcome variables and adjusted for student characteristics, student preferences, and medical school characteristics. Compared with 2007-2008, students' perceptions of training in 2011-2012 increased on a relative basis by 11.7% for components within systems and principles, 2.8% for quality and safety, and 6.8% for value and equity. Components within politics and law had a composite decline of 4.8%. Multiple logistic regression models found higher odds of reporting satisfaction with training over time for all components within the domains of systems and principles, quality and safety, and value and equity (P < .01), with the exception of medical economics. Medical student perceptions of training in health policy improved over time. Causal factors for these trends require further study. Despite improvement, nearly 40% of graduating medical students still report inadequate instruction in health policy.
Age estimation by pulp/tooth ratio in lower premolars by orthopantomography.
Cameriere, Roberto; De Luca, Stefano; Alemán, Inmaculada; Ferrante, Luigi; Cingolani, Mariano
2012-01-10
Accurate age estimation has always been a problem for forensic scientists, and apposition of secondary dentine is often used as an indicator of age. Since 2004, in order to examine patterns of secondary dentine apposition, Cameriere et al. have been extensively studying the pulp/tooth area ratio of the canines by panoramic and peri-apical X-ray images. The main aim of this paper is to examine the relationship between age and age-related changes in the pulp/tooth area ratio in monoradicular teeth, with the exception of canines, by orthopantomography. A total of 606 orthopantomograms of Spanish white Caucasian patients (289 women and 317 men), aged between 18 and 75 years and coming from Bilbao and Granada (Spain), was analysed. Regression analysis of age of monoradicular teeth indicated that the lower premolars were the most closely correlated with age. An ANCOVA did not show significant differences between men and women. Multiple regression analysis, with age as dependent variable and pulp/tooth area ratio as predictor, yielded several formulae. R(2) ranged from 0.69 to 0.75 for a single lower premolar tooth and from 0.79 to 0.86 for multiple lower premolar teeth. Depending on the available number of premolar teeth, the mean of the absolute values of residual standard error, at 95% confidence interval, ranged between 4.34 and 6.02 years, showing that the pulp/tooth area ratio is a useful variable for assessing age with reasonable accuracy. Copyright © 2011 Elsevier Ireland Ltd. All rights reserved.
Paiva, M B; Saxton, R E; Letts, G A; Chung, P S; Soudant, J; Vanderwerf, Q; Castro, D J
1995-10-01
Photodynamic therapy (PDT) with lasers and new dyes has gained popularity in recent years as a minimally invasive technique with high tumoricidal effects in vitro and in some cancer patients. However, because new laser dyes are not FDA approved at present, the clinical evaluation of PDT may be years away. During the past 6 years we have used laser alone for photothermal ablation in both preclinical studies and in a large number of patients with an observed 60% tumor response rate. The 40% treatment failure led us to explore the possibility of combined therapy with lasers and standard chemotherapeutic drugs. We have recently tested a promising preclinical alternative using implantation of a bare 600-microns KTP 532 laser fiberoptic in multiple tumor sites 30 min after intratumor injection of the anthrapyrazole DUP-941. As a control, this drug was injected in 3 sites of P3 human squamous cell tumor transplants in nude mice, which led to tumor stasis without regression. Similar 400-600 mm3 tumors exposed to laser illumination alone (0.8 W for 5 sec) at multiple sites resulted in tumor regrowth after 10 weeks in 80% of the animals. However, combining interstitial laser illumination with intratumor DUP-941 injections led to complete tumor regression in 85% of the mice. We propose that intratumor drug injection followed by interstitial laser fiberoptic treatment represents a potentially useful new method for tumor ablation in advanced cancer patients.
Sexual Experiences of Chinese Patients Living With an Ostomy.
Zhu, Xiaomei; Chen, Yongyi; Tang, Xinhui; Chen, Yupan; Liu, Yangyu; Guo, Wei; Liu, Aizhong
The purpose of this study was to examine the sexual experience of Chinese patients with ostomy and associated factors. A prospective descriptive study using self-report questionnaires. Seventy-five Chinese participants who underwent ostomy surgery in a large cancer specialist hospital in the Hunan province between 2008 and 2013. Data were collected face-to-face by the investigators in an outpatient setting from 75 participants who completed the Arizona Sexual Experience Inventory Scale (ASEX). The t test was used to compare variances between sexual function and dysfunction subgroups. A multiple linear regression model was used to analyze factors influencing sexual life after ostomy surgery. The mean ASEX score was 20.56 (5.378) years, which is higher than the standard for sexual dysfunction. The main subsection of sexual dysfunction included sexual arousal, orgasm ability, vaginal lubrication/penile erection, and sexual satisfaction. Significant differences in the ASEX score were observed in subgroups of age, gender, educational level, family relations, operation modes, stoma type, operation time, complications, supporters, self-care ability, and sexual life guidance. Multiple stepwise regression analysis indicated that family relations, operation modes, ostomy type, complications, and sexual life guidance affected sexual experience. The findings of this study demonstrate that patients with ostomy experience sexual dysfunction and many factors influence their quality of sexual life. WOC nurses and other healthcare providers should consider providing sexual health education for both the patient and spouse to improve the self-care capacity and quality of sexual life following ostomy surgery.
Diabetes and Risk of Surgical Site Infection: A systematic review and meta-analysis
Kaye, Keith S.; Knott, Caitlin; Nguyen, Huong; Santarossa, Maressa; Evans, Richard; Bertran, Elizabeth; Jaber, Linda
2016-01-01
Objective To determine the independent association between diabetes and SSI across multiple surgical procedures. Design Systematic review and meta-analysis. Methods Studies indexed in PubMed published between December 1985 and through July 2015 were identified through the search terms “risk factors” or “glucose” and “surgical site infection”. A total of 3,631 abstracts were identified through the initial search terms. Full texts were reviewed for 522 articles. Of these, 94 articles met the criteria for inclusion. Standardized data collection forms were used to extract study-specific estimates for diabetes, blood glucose levels, and body mass index (BMI). Random-effects meta-analysis was used to generate pooled estimates and meta-regression was used to evaluate specific hypothesized sources of heterogeneity. Results The primary outcome was SSI, as defined by the Centers for Disease Control and Prevention surveillance criteria. The overall effect size for the association between diabetes and SSI was OR=1.53 (95% Predictive Interval 1.11, 2.12, I2: 57.2%). SSI class, study design, or patient BMI did not significantly impact study results in a meta-regression model. The association was higher for cardiac surgery 2.03 (95% Predictive Interval 1.13, 4.05) compared to surgeries of other types (p=0.001). Conclusion These results support the consideration of diabetes as an independent risk factor for SSIs for multiple surgical procedure types. Continued efforts are needed to improve surgical outcomes for diabetic patients. PMID:26503187
NASA Astrophysics Data System (ADS)
Baasch, Benjamin; Müller, Hendrik; von Dobeneck, Tilo; Oberle, Ferdinand K. J.
2017-05-01
The electric conductivity and magnetic susceptibility of sediments are fundamental parameters in environmental geophysics. Both can be derived from marine electromagnetic profiling, a novel, fast and non-invasive seafloor mapping technique. Here we present statistical evidence that electric conductivity and magnetic susceptibility can help to determine physical grain-size characteristics (size, sorting and mud content) of marine surficial sediments. Electromagnetic data acquired with the bottom-towed electromagnetic profiler MARUM NERIDIS III were analysed and compared with grain size data from 33 samples across the NW Iberian continental shelf. A negative correlation between mean grain size and conductivity (R=-0.79) as well as mean grain size and susceptibility (R=-0.78) was found. Simple and multiple linear regression analyses were carried out to predict mean grain size, mud content and the standard deviation of the grain-size distribution from conductivity and susceptibility. The comparison of both methods showed that multiple linear regression models predict the grain-size distribution characteristics better than the simple models. This exemplary study demonstrates that electromagnetic benthic profiling is capable to estimate mean grain size, sorting and mud content of marine surficial sediments at a very high significance level. Transfer functions can be calibrated using grains-size data from a few reference samples and extrapolated along shelf-wide survey lines. This study suggests that electromagnetic benthic profiling should play a larger role for coastal zone management, seafloor contamination and sediment provenance studies in worldwide continental shelf systems.
Broadband external cavity quantum cascade laser based sensor for gasoline detection
NASA Astrophysics Data System (ADS)
Ding, Junya; He, Tianbo; Zhou, Sheng; Li, Jinsong
2018-02-01
A new type of tunable diode spectroscopy sensor based on an external cavity quantum cascade laser (ECQCL) and a quartz crystal tuning fork (QCTF) were used for quantitative analysis of volatile organic compounds. In this work, the sensor system had been tested on different gasoline sample analysis. For signal processing, the self-established interpolation algorithm and multiple linear regression algorithm model were used for quantitative analysis of major volatile organic compounds in gasoline samples. The results were very consistent with that of the standard spectra taken from the Pacific Northwest National Laboratory (PNNL) database. In future, The ECQCL sensor will be used for trace explosive, chemical warfare agent, and toxic industrial chemical detection and spectroscopic analysis, etc.
Multiple imputation of missing fMRI data in whole brain analysis
Vaden, Kenneth I.; Gebregziabher, Mulugeta; Kuchinsky, Stefanie E.; Eckert, Mark A.
2012-01-01
Whole brain fMRI analyses rarely include the entire brain because of missing data that result from data acquisition limits and susceptibility artifact, in particular. This missing data problem is typically addressed by omitting voxels from analysis, which may exclude brain regions that are of theoretical interest and increase the potential for Type II error at cortical boundaries or Type I error when spatial thresholds are used to establish significance. Imputation could significantly expand statistical map coverage, increase power, and enhance interpretations of fMRI results. We examined multiple imputation for group level analyses of missing fMRI data using methods that leverage the spatial information in fMRI datasets for both real and simulated data. Available case analysis, neighbor replacement, and regression based imputation approaches were compared in a general linear model framework to determine the extent to which these methods quantitatively (effect size) and qualitatively (spatial coverage) increased the sensitivity of group analyses. In both real and simulated data analysis, multiple imputation provided 1) variance that was most similar to estimates for voxels with no missing data, 2) fewer false positive errors in comparison to mean replacement, and 3) fewer false negative errors in comparison to available case analysis. Compared to the standard analysis approach of omitting voxels with missing data, imputation methods increased brain coverage in this study by 35% (from 33,323 to 45,071 voxels). In addition, multiple imputation increased the size of significant clusters by 58% and number of significant clusters across statistical thresholds, compared to the standard voxel omission approach. While neighbor replacement produced similar results, we recommend multiple imputation because it uses an informed sampling distribution to deal with missing data across subjects that can include neighbor values and other predictors. Multiple imputation is anticipated to be particularly useful for 1) large fMRI data sets with inconsistent missing voxels across subjects and 2) addressing the problem of increased artifact at ultra-high field, which significantly limit the extent of whole brain coverage and interpretations of results. PMID:22500925
Weighted regression analysis and interval estimators
Donald W. Seegrist
1974-01-01
A method for deriving the weighted least squares estimators for the parameters of a multiple regression model. Confidence intervals for expected values, and prediction intervals for the means of future samples are given.
NASA Astrophysics Data System (ADS)
Cannon, Alex
2017-04-01
Estimating historical trends in short-duration rainfall extremes at regional and local scales is challenging due to low signal-to-noise ratios and the limited availability of homogenized observational data. In addition to being of scientific interest, trends in rainfall extremes are of practical importance, as their presence calls into question the stationarity assumptions that underpin traditional engineering and infrastructure design practice. Even with these fundamental challenges, increasingly complex questions are being asked about time series of extremes. For instance, users may not only want to know whether or not rainfall extremes have changed over time, they may also want information on the modulation of trends by large-scale climate modes or on the nonstationarity of trends (e.g., identifying hiatus periods or periods of accelerating positive trends). Efforts have thus been devoted to the development and application of more robust and powerful statistical estimators for regional and local scale trends. While a standard nonparametric method like the regional Mann-Kendall test, which tests for the presence of monotonic trends (i.e., strictly non-decreasing or non-increasing changes), makes fewer assumptions than parametric methods and pools information from stations within a region, it is not designed to visualize detected trends, include information from covariates, or answer questions about the rate of change in trends. As a remedy, monotone quantile regression (MQR) has been developed as a nonparametric alternative that can be used to estimate a common monotonic trend in extremes at multiple stations. Quantile regression makes efficient use of data by directly estimating conditional quantiles based on information from all rainfall data in a region, i.e., without having to precompute the sample quantiles. The MQR method is also flexible and can be used to visualize and analyze the nonlinearity of the detected trend. However, it is fundamentally a univariate technique, and cannot incorporate information from additional covariates, for example ENSO state or physiographic controls on extreme rainfall within a region. Here, the univariate MQR model is extended to allow the use of multiple covariates. Multivariate monotone quantile regression (MMQR) is based on a single hidden-layer feedforward network with the quantile regression error function and partial monotonicity constraints. The MMQR model is demonstrated via Monte Carlo simulations and the estimation and visualization of regional trends in moderate rainfall extremes based on homogenized sub-daily precipitation data at stations in Canada.
Fenske, Nora; Burns, Jacob; Hothorn, Torsten; Rehfuess, Eva A.
2013-01-01
Background Most attempts to address undernutrition, responsible for one third of global child deaths, have fallen behind expectations. This suggests that the assumptions underlying current modelling and intervention practices should be revisited. Objective We undertook a comprehensive analysis of the determinants of child stunting in India, and explored whether the established focus on linear effects of single risks is appropriate. Design Using cross-sectional data for children aged 0–24 months from the Indian National Family Health Survey for 2005/2006, we populated an evidence-based diagram of immediate, intermediate and underlying determinants of stunting. We modelled linear, non-linear, spatial and age-varying effects of these determinants using additive quantile regression for four quantiles of the Z-score of standardized height-for-age and logistic regression for stunting and severe stunting. Results At least one variable within each of eleven groups of determinants was significantly associated with height-for-age in the 35% Z-score quantile regression. The non-modifiable risk factors child age and sex, and the protective factors household wealth, maternal education and BMI showed the largest effects. Being a twin or multiple birth was associated with dramatically decreased height-for-age. Maternal age, maternal BMI, birth order and number of antenatal visits influenced child stunting in non-linear ways. Findings across the four quantile and two logistic regression models were largely comparable. Conclusions Our analysis confirms the multifactorial nature of child stunting. It emphasizes the need to pursue a systems-based approach and to consider non-linear effects, and suggests that differential effects across the height-for-age distribution do not play a major role. PMID:24223839
Fenske, Nora; Burns, Jacob; Hothorn, Torsten; Rehfuess, Eva A
2013-01-01
Most attempts to address undernutrition, responsible for one third of global child deaths, have fallen behind expectations. This suggests that the assumptions underlying current modelling and intervention practices should be revisited. We undertook a comprehensive analysis of the determinants of child stunting in India, and explored whether the established focus on linear effects of single risks is appropriate. Using cross-sectional data for children aged 0-24 months from the Indian National Family Health Survey for 2005/2006, we populated an evidence-based diagram of immediate, intermediate and underlying determinants of stunting. We modelled linear, non-linear, spatial and age-varying effects of these determinants using additive quantile regression for four quantiles of the Z-score of standardized height-for-age and logistic regression for stunting and severe stunting. At least one variable within each of eleven groups of determinants was significantly associated with height-for-age in the 35% Z-score quantile regression. The non-modifiable risk factors child age and sex, and the protective factors household wealth, maternal education and BMI showed the largest effects. Being a twin or multiple birth was associated with dramatically decreased height-for-age. Maternal age, maternal BMI, birth order and number of antenatal visits influenced child stunting in non-linear ways. Findings across the four quantile and two logistic regression models were largely comparable. Our analysis confirms the multifactorial nature of child stunting. It emphasizes the need to pursue a systems-based approach and to consider non-linear effects, and suggests that differential effects across the height-for-age distribution do not play a major role.
NASA Astrophysics Data System (ADS)
Gholizadeh, H.; Robeson, S. M.
2015-12-01
Empirical models have been widely used to estimate global chlorophyll content from remotely sensed data. Here, we focus on the standard NASA empirical models that use blue-green band ratios. These band ratio ocean color (OC) algorithms are in the form of fourth-order polynomials and the parameters of these polynomials (i.e. coefficients) are estimated from the NASA bio-Optical Marine Algorithm Data set (NOMAD). Most of the points in this data set have been sampled from tropical and temperate regions. However, polynomial coefficients obtained from this data set are used to estimate chlorophyll content in all ocean regions with different properties such as sea-surface temperature, salinity, and downwelling/upwelling patterns. Further, the polynomial terms in these models are highly correlated. In sum, the limitations of these empirical models are as follows: 1) the independent variables within the empirical models, in their current form, are correlated (multicollinear), and 2) current algorithms are global approaches and are based on the spatial stationarity assumption, so they are independent of location. Multicollinearity problem is resolved by using partial least squares (PLS). PLS, which transforms the data into a set of independent components, can be considered as a combined form of principal component regression (PCR) and multiple regression. Geographically weighted regression (GWR) is also used to investigate the validity of spatial stationarity assumption. GWR solves a regression model over each sample point by using the observations within its neighbourhood. PLS results show that the empirical method underestimates chlorophyll content in high latitudes, including the Southern Ocean region, when compared to PLS (see Figure 1). Cluster analysis of GWR coefficients also shows that the spatial stationarity assumption in empirical models is not likely a valid assumption.
Tay, Cheryl Sihui; Sterzing, Thorsten; Lim, Chen Yen; Ding, Rui; Kong, Pui Wah
2017-05-01
This study examined (a) the strength of four individual footwear perception factors to influence the overall preference of running shoes and (b) whether these perception factors satisfied the nonmulticollinear assumption in a regression model. Running footwear must fulfill multiple functional criteria to satisfy its potential users. Footwear perception factors, such as fit and cushioning, are commonly used to guide shoe design and development, but it is unclear whether running-footwear users are able to differentiate one factor from another. One hundred casual runners assessed four running shoes on a 15-cm visual analogue scale for four footwear perception factors (fit, cushioning, arch support, and stability) as well as for overall preference during a treadmill running protocol. Diagnostic tests showed an absence of multicollinearity between factors, where values for tolerance ranged from .36 to .72, corresponding to variance inflation factors of 2.8 to 1.4. The multiple regression model of these four footwear perception variables accounted for 77.7% to 81.6% of variance in overall preference, with each factor explaining a unique part of the total variance. Casual runners were able to rate each footwear perception factor separately, thus assigning each factor a true potential to improve overall preference for the users. The results also support the use of a multiple regression model of footwear perception factors to predict overall running shoe preference. Regression modeling is a useful tool for running-shoe manufacturers to more precisely evaluate how individual factors contribute to the subjective assessment of running footwear.
A population-based study on the association between rheumatoid arthritis and voice problems.
Hah, J Hun; An, Soo-Youn; Sim, Songyong; Kim, So Young; Oh, Dong Jun; Park, Bumjung; Kim, Sung-Gyun; Choi, Hyo Geun
2016-07-01
The objective of this study was to investigate whether rheumatoid arthritis increases the frequency of organic laryngeal lesions and the subjective voice complaint rate in those with no organic laryngeal lesion. We performed a cross-sectional study using the data from 19,368 participants (418 rheumatoid arthritis patients and 18,950 controls) of the 2008-2011 Korea National Health and Nutrition Examination Survey. The associations between rheumatoid arthritis and organic laryngeal lesions/subjective voice complaints were analyzed using simple/multiple logistic regression analysis with complex sample adjusting for confounding factors, including age, sex, smoking status, stress level, and body mass index, which could provoke voice problems. Vocal nodules, vocal polyp, and vocal palsy were not associated with rheumatoid arthritis in a multiple regression analysis, and only laryngitis showed a positive association (adjusted odds ratio, 1.59; 95 % confidence interval, 1.01-2.52; P = 0.047). Rheumatoid arthritis was associated with subjective voice discomfort in a simple regression analysis, but not in a multiple regression analysis. Participants with rheumatoid arthritis were older, more often female, and had higher stress levels than those without rheumatoid arthritis. These factors were associated with subjective voice complaints in both simple and multiple regression analyses. Rheumatoid arthritis was not associated with organic laryngeal diseases except laryngitis. Rheumatoid arthritis did not increase the odds ratio for subjective voice complaints. Voice problems in participants with rheumatoid arthritis originated from the characteristics of the rheumatoid arthritis group (higher mean age, female sex, and stress level) rather than rheumatoid arthritis itself.
Predicting MHC-II binding affinity using multiple instance regression
EL-Manzalawy, Yasser; Dobbs, Drena; Honavar, Vasant
2011-01-01
Reliably predicting the ability of antigen peptides to bind to major histocompatibility complex class II (MHC-II) molecules is an essential step in developing new vaccines. Uncovering the amino acid sequence correlates of the binding affinity of MHC-II binding peptides is important for understanding pathogenesis and immune response. The task of predicting MHC-II binding peptides is complicated by the significant variability in their length. Most existing computational methods for predicting MHC-II binding peptides focus on identifying a nine amino acids core region in each binding peptide. We formulate the problems of qualitatively and quantitatively predicting flexible length MHC-II peptides as multiple instance learning and multiple instance regression problems, respectively. Based on this formulation, we introduce MHCMIR, a novel method for predicting MHC-II binding affinity using multiple instance regression. We present results of experiments using several benchmark datasets that show that MHCMIR is competitive with the state-of-the-art methods for predicting MHC-II binding peptides. An online web server that implements the MHCMIR method for MHC-II binding affinity prediction is freely accessible at http://ailab.cs.iastate.edu/mhcmir. PMID:20855923
Burgette, Lane F; Reiter, Jerome P
2013-06-01
Multinomial outcomes with many levels can be challenging to model. Information typically accrues slowly with increasing sample size, yet the parameter space expands rapidly with additional covariates. Shrinking all regression parameters towards zero, as often done in models of continuous or binary response variables, is unsatisfactory, since setting parameters equal to zero in multinomial models does not necessarily imply "no effect." We propose an approach to modeling multinomial outcomes with many levels based on a Bayesian multinomial probit (MNP) model and a multiple shrinkage prior distribution for the regression parameters. The prior distribution encourages the MNP regression parameters to shrink toward a number of learned locations, thereby substantially reducing the dimension of the parameter space. Using simulated data, we compare the predictive performance of this model against two other recently-proposed methods for big multinomial models. The results suggest that the fully Bayesian, multiple shrinkage approach can outperform these other methods. We apply the multiple shrinkage MNP to simulating replacement values for areal identifiers, e.g., census tract indicators, in order to protect data confidentiality in public use datasets.
Hwang, Bosun; Han, Jonghee; Choi, Jong Min; Park, Kwang Suk
2008-11-01
The purpose of this study was to develop an unobtrusive energy expenditure (EE) measurement system using an infrared (IR) sensor-based activity monitoring system to measure indoor activities and to estimate individual quantitative EE. IR-sensor activation counts were measured with a Bluetooth-based monitoring system and the standard EE was calculated using an established regression equation. Ten male subjects participated in the experiment and three different EE measurement systems (gas analyzer, accelerometer, IR sensor) were used simultaneously in order to determine the regression equation and evaluate the performance. As a standard measurement, oxygen consumption was simultaneously measured by a portable metabolic system (Metamax 3X, Cortex, Germany). A single room experiment was performed to develop a regression model of the standard EE measurement from the proposed IR sensor-based measurement system. In addition, correlation and regression analyses were done to compare the performance of the IR system with that of the Actigraph system. We determined that our proposed IR-based EE measurement system shows a similar correlation to the Actigraph system with the standard measurement system.
NASA Astrophysics Data System (ADS)
ul-Haq, Zia; Rana, Asim Daud; Tariq, Salman; Mahmood, Khalid; Ali, Muhammad; Bashir, Iqra
2018-03-01
We have applied regression analyses for the modeling of tropospheric NO2 (tropo-NO2) as the function of anthropogenic nitrogen oxides (NOx) emissions, aerosol optical depth (AOD), and some important meteorological parameters such as temperature (Temp), precipitation (Preci), relative humidity (RH), wind speed (WS), cloud fraction (CLF) and outgoing long-wave radiation (OLR) over different climatic zones and land use/land cover types in South Asia during October 2004-December 2015. Simple linear regression shows that, over South Asia, tropo-NO2 variability is significantly linked to AOD, WS, NOx, Preci and CLF. Also zone-5, consisting of tropical monsoon areas of eastern India and Myanmar, is the only study zone over which all the selected parameters show their influence on tropo-NO2 at statistical significance levels. In stepwise multiple linear modeling, tropo-NO2 column over landmass of South Asia, is significantly predicted by the combination of RH (standardized regression coefficient, β = - 49), AOD (β = 0.42) and NOx (β = 0.25). The leading predictors of tropo-NO2 columns over zones 1-5 are OLR, AOD, Temp, OLR, and RH respectively. Overall, as revealed by the higher correlation coefficients (r), the multiple regressions provide reasonable models for tropo-NO2 over South Asia (r = 0.82), zone-4 (r = 0.90) and zone-5 (r = 0.93). The lowest r (of 0.66) has been found for hot semi-arid region in northwestern Indus-Ganges Basin (zone-2). The highest value of β for urban area AOD (of 0.42) is observed for megacity Lahore, located in warm semi-arid zone-2 with large scale crop-residue burning, indicating strong influence of aerosols on the modeled tropo-NO2 column. A statistical significant correlation (r = 0.22) at the 0.05 level is found between tropo-NO2 and AOD over Lahore. Also NOx emissions appear as the highest contributor (β = 0.59) for modeled tropo-NO2 column over megacity Dhaka.
Poursafa, Parinaz; Baradaran-Mahdavi, Sadegh; Moradi, Bita; Haghjooy Javanmard, Shaghayegh; Tajadini, Mohammadhasan; Mehrabian, Ferdous; Kelishadi, Roya
2016-04-01
This study aims to investigate the association of exposure to ambient air pollution during pregnancy with cord blood concentrations of surrogate markers of endothelial dysfunction. This population-based cohort was conducted from March 2014 to March 2015 among 250 mother-neonate pairs in urban areas of Isfahan, the second large and air-polluted city in Iran. We analyzed the association between the ambient carbon monoxide (CO), ozone (O3), nitrogen dioxide (NO2), sulfur dioxide (SO2), particular matter 10 (PM10), and air quality index (AQI) with cord blood levels of endothelin-1, vascular adhesion molecule (VCAM), and intercellular adhesion molecule (ICAM). Multiple regression analysis was conducted after adjustment for potential confounding factors and covariates. The regression coefficient (beta), standard error of the estimate (SE), and 95% confidence intervals for each regression coefficient (95% CI) are reported. Data of 233 mother-neonate pairs were complete, and included in the analysis. Multiple regression analyses showed that AQI, CO and O3 had significant correlation with cord blood ICAM-1 [Beta (SE), 95%CI: 2.93 (0.72), 1.33,5.54; 2.28(1.44), 1.56,5.12; and 2.02(0.01), 1.03,2.04, respectively] as well as with VCAM-1 [2.78(0.91), 1.69,4.57; 2.47(1.47), 1.43,5.37; and 2.01(0.01),1.07,2.04, respectively]. AQI, PM10, and SO2 were significantly associated with Endothelin-1 concentrations [Beta (SE), 95%CI: 10.16(5.08),7.61,14.28; 9.70(3.46), 2.88,16.52; and 1.07(0.02), 1.03,2.11, respectively]. The significant associations of air pollutants with markers of endothelial dysfunction during fetal period may provide another evidence on the adverse health effects of air pollutants on early stages of atherosclerosis from fetal period. Our findings underscore the importance of considering environmental factors in primordial prevention of chronic diseases. Copyright © 2015 Elsevier Inc. All rights reserved.
Quantile Regression in the Study of Developmental Sciences
ERIC Educational Resources Information Center
Petscher, Yaacov; Logan, Jessica A. R.
2014-01-01
Linear regression analysis is one of the most common techniques applied in developmental research, but only allows for an estimate of the average relations between the predictor(s) and the outcome. This study describes quantile regression, which provides estimates of the relations between the predictor(s) and outcome, but across multiple points of…
Maintenance Operations in Mission Oriented Protective Posture Level IV (MOPPIV)
1987-10-01
Repair FADAC Printed Circuit Board ............. 6 3. Data Analysis Techniques ............................. 6 a. Multiple Linear Regression... ANALYSIS /DISCUSSION ............................... 12 1. Exa-ple of Regression Analysis ..................... 12 S2. Regression results for all tasks...6 * TABLE 9. Task Grouping for Analysis ........................ 7 "TABXLE 10. Remove/Replace H60A3 Power Pack................. 8 TABLE
Curcic, Marijana; Buha, Aleksandra; Stankovic, Sanja; Milovanovic, Vesna; Bulat, Zorica; Đukić-Ćosić, Danijela; Antonijević, Evica; Vučinić, Slavica; Matović, Vesna; Antonijevic, Biljana
2017-02-01
The objective of this study was to assess toxicity of Cd and BDE-209 mixture on haematological parameters in subacutely exposed rats and to determine the presence and type of interactions between these two chemicals using multiple factorial regression analysis. Furthermore, for the assessment of interaction type, an isobologram based methodology was applied and compared with multiple factorial regression analysis. Chemicals were given by oral gavage to the male Wistar rats weighing 200-240g for 28days. Animals were divided in 16 groups (8/group): control vehiculum group, three groups of rats were treated with 2.5, 7.5 or 15mg Cd/kg/day. These doses were chosen on the bases of literature data and reflect relatively high Cd environmental exposure, three groups of rats were treated with 1000, 2000 or 4000mg BDE-209/kg/bw/day, doses proved to induce toxic effects in rats. Furthermore, nine groups of animals were treated with different mixtures of Cd and BDE-209 containing doses of Cd and BDE-209 stated above. Blood samples were taken at the end of experiment and red blood cells, white blood cells and platelets counts were determined. For interaction assessment multiple factorial regression analysis and fitted isobologram approach were used. In this study, we focused on multiple factorial regression analysis as a method for interaction assessment. We also investigated the interactions between Cd and BDE-209 by the derived model for the description of the obtained fitted isobologram curves. Current study indicated that co-exposure to Cd and BDE-209 can result in significant decrease in RBC count, increase in WBC count and decrease in PLT count, when compared with controls. Multiple factorial regression analysis used for the assessment of interactions type between Cd and BDE-209 indicated synergism for the effect on RBC count and no interactions i.e. additivity for the effects on WBC and PLT counts. On the other hand, isobologram based approach showed slight antagonism for the effects on RBC and WBC while no interactions were proved for the joint effect on PLT count. These results confirm that the assessment of interactions between chemicals in the mixture greatly depends on the concept or method used for this evaluation. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Kanada, Yoshikiyo; Sakurai, Hiroaki; Sugiura, Yoshito; Arai, Tomoaki; Koyama, Soichiro; Tanabe, Shigeo
2017-11-01
[Purpose] To create a regression formula in order to estimate 1RM for knee extensors, based on the maximal isometric muscle strength measured using a hand-held dynamometer and data regarding the body composition. [Subjects and Methods] Measurement was performed in 21 healthy males in their twenties to thirties. Single regression analysis was performed, with measurement values representing 1RM and the maximal isometric muscle strength as dependent and independent variables, respectively. Furthermore, multiple regression analysis was performed, with data regarding the body composition incorporated as another independent variable, in addition to the maximal isometric muscle strength. [Results] Through single regression analysis with the maximal isometric muscle strength as an independent variable, the following regression formula was created: 1RM (kg)=0.714 + 0.783 × maximal isometric muscle strength (kgf). On multiple regression analysis, only the total muscle mass was extracted. [Conclusion] A highly accurate regression formula to estimate 1RM was created based on both the maximal isometric muscle strength and body composition. Using a hand-held dynamometer and body composition analyzer, it was possible to measure these items in a short time, and obtain clinically useful results.
The repeatability of mean defect with size III and size V standard automated perimetry.
Wall, Michael; Doyle, Carrie K; Zamba, K D; Artes, Paul; Johnson, Chris A
2013-02-15
The mean defect (MD) of the visual field is a global statistical index used to monitor overall visual field change over time. Our goal was to investigate the relationship of MD and its variability for two clinically used strategies (Swedish Interactive Threshold Algorithm [SITA] standard size III and full threshold size V) in glaucoma patients and controls. We tested one eye, at random, for 46 glaucoma patients and 28 ocularly healthy subjects with Humphrey program 24-2 SITA standard for size III and full threshold for size V each five times over a 5-week period. The standard deviation of MD was regressed against the MD for the five repeated tests, and quantile regression was used to show the relationship of variability and MD. A Wilcoxon test was used to compare the standard deviations of the two testing methods following quantile regression. Both types of regression analysis showed increasing variability with increasing visual field damage. Quantile regression showed modestly smaller MD confidence limits. There was a 15% decrease in SD with size V in glaucoma patients (P = 0.10) and a 12% decrease in ocularly healthy subjects (P = 0.08). The repeatability of size V MD appears to be slightly better than size III SITA testing. When using MD to determine visual field progression, a change of 1.5 to 4 decibels (dB) is needed to be outside the normal 95% confidence limits, depending on the size of the stimulus and the amount of visual field damage.
de Souza E Silva, Christina G; Kaminsky, Leonard A; Arena, Ross; Christle, Jeffrey W; Araújo, Claudio Gil S; Lima, Ricardo M; Ashley, Euan A; Myers, Jonathan
2018-05-01
Background Maximal oxygen uptake (VO 2 max) is a powerful predictor of health outcomes. Valid and portable reference values are integral to interpreting measured VO 2 max; however, available reference standards lack validation and are specific to exercise mode. This study was undertaken to develop and validate a single equation for normal standards for VO 2 max for the treadmill or cycle ergometer in men and women. Methods Healthy individuals ( N = 10,881; 67.8% men, 20-85 years) who performed a maximal cardiopulmonary exercise test on either a treadmill or a cycle ergometer were studied. Of these, 7617 and 3264 individuals were randomly selected for development and validation of the equation, respectively. A Brazilian sample (1619 individuals) constituted a second validation cohort. The prediction equation was determined using multiple regression analysis, and comparisons were made with the widely-used Wasserman and European equations. Results Age, sex, weight, height and exercise mode were significant predictors of VO 2 max. The regression equation was: VO 2 max (ml kg -1 min -1 ) = 45.2 - 0.35*Age - 10.9*Sex (male = 1; female = 2) - 0.15*Weight (pounds) + 0.68*Height (inches) - 0.46*Exercise Mode (treadmill = 1; bike = 2) ( R = 0.79, R 2 = 0.62, standard error of the estimate = 6.6 ml kg -1 min -1 ). Percentage predicted VO 2 max for the US and Brazilian validation cohorts were 102.8% and 95.8%, respectively. The new equation performed better than traditional equations, particularly among women and individuals ≥60 years old. Conclusion A combined equation was developed for normal standards for VO 2 max for different exercise modes derived from a US national registry. The equation provided a lower average error between measured and predicted VO 2 max than traditional equations even when applied to an independent cohort. Additional studies are needed to determine its portability.
Measuring the lesion load of multiple sclerosis patients within the corticospinal tract
NASA Astrophysics Data System (ADS)
Klein, Jan; Hanken, Katrin; Koceva, Jasna; Hildebrandt, Helmut; Hahn, Horst K.
2015-03-01
In this paper we present a framework for reliable determination of the lesion load within the corticospinal tract (CST) of multiple sclerosis patients. The basis constitutes a probabilistic fiber tracking approach which checks possible parameter intervals on the fly using an anatomical brain atlas. By exploiting the range of those intervals, the algorithm is able to resolve fiber crossings and to determine the CST in its full entity although it can use a simple diffusion tensor model. Another advantage is its short running time, tracking the CST takes less than a minute. For segmenting the lesions we developed a semi-automatic approach. First, a trained classifier is applied to multimodal MRI data (T1/FLAIR) where the spectrum of lesions has been determined in advance by a clustering algorithm. This leads to an automatic detection of the lesions which can be manually corrected afterwards using a threshold-based approach. For evaluation we scanned 46 MS patients and 16 healthy controls. Fiber tracking has been performed using our novel fiber tracking and a standard defection based algorithm. Regression analysis of the old and new version of the algorithm showed a highly significant superiority of the new algorithm for disease duration. Additionally, a low correlation between old and new approach supports the observation that standard DTI fiber tracking is not always able to track and quantify the CST reliably.
Yokoo, Takeshi; Bydder, Mark; Hamilton, Gavin; Middleton, Michael S.; Gamst, Anthony C.; Wolfson, Tanya; Hassanein, Tarek; Patton, Heather M.; Lavine, Joel E.; Schwimmer, Jeffrey B.; Sirlin, Claude B.
2009-01-01
Purpose: To assess the accuracy of four fat quantification methods at low-flip-angle multiecho gradient-recalled-echo (GRE) magnetic resonance (MR) imaging in nonalcoholic fatty liver disease (NAFLD) by using MR spectroscopy as the reference standard. Materials and Methods: In this institutional review board–approved, HIPAA-compliant prospective study, 110 subjects (29 with biopsy-confirmed NAFLD, 50 overweight and at risk for NAFLD, and 31 healthy volunteers) (mean age, 32.6 years ± 15.6 [standard deviation]; range, 8–66 years) gave informed consent and underwent MR spectroscopy and GRE MR imaging of the liver. Spectroscopy involved a long repetition time (to suppress T1 effects) and multiple echo times (to estimate T2 effects); the reference fat fraction (FF) was calculated from T2-corrected fat and water spectral peak areas. Imaging involved a low flip angle (to suppress T1 effects) and multiple echo times (to estimate T2* effects); imaging FF was calculated by using four analysis methods of progressive complexity: dual echo, triple echo, multiecho, and multiinterference. All methods except dual echo corrected for T2* effects. The multiinterference method corrected for multiple spectral interference effects of fat. For each method, the accuracy for diagnosis of fatty liver, as defined with a spectroscopic threshold, was assessed by estimating sensitivity and specificity; fat-grading accuracy was assessed by comparing imaging and spectroscopic FF values by using linear regression. Results: Dual-echo, triple-echo, multiecho, and multiinterference methods had a sensitivity of 0.817, 0.967, 0.950, and 0.983 and a specificity of 1.000, 0.880, 1.000, and 0.880, respectively. On the basis of regression slope and intercept, the multiinterference (slope, 0.98; intercept, 0.91%) method had high fat-grading accuracy without statistically significant error (P > .05). Dual-echo (slope, 0.98; intercept, −2.90%), triple-echo (slope, 0.94; intercept, 1.42%), and multiecho (slope, 0.85; intercept, −0.15%) methods had statistically significant error (P < .05). Conclusion: Relaxation- and interference-corrected fat quantification at low-flip-angle multiecho GRE MR imaging provides high diagnostic and fat-grading accuracy in NAFLD. © RSNA, 2009 PMID:19221054
Teh, Benjamin W; Worth, Leon J; Harrison, Simon J; Thursky, Karin A; Slavin, Monica A
2015-07-01
Infections are a leading cause of morbidity and mortality in patients with multiple myeloma. The epidemiology, risk factors and outcomes of viral respiratory tract infections (vRTI) are not well described in patients with multiple myeloma managed with novel agents, the current standard of care. Patients with myeloma from 2009 to 2012 who tested positive on respiratory virus multiplex polymerase chain reaction had clinical, radiological and microbiological records reviewed. The Fourth European Conference on Infections in Leukaemia (ECIL-4) definitions of RTI were applied. Univariate and multivariate regression analysis of risk factors was performed using vRTI as the evaluable outcome. Of 330 patients, 75 (22.7%) tested positive for a total of 100 vRTI episodes. All patients received thalidomide, lenalidomide or bortezomib in combination with myeloma therapies (median of three treatment regimens). vRTI occurred most commonly in patients with progressive disease, and receipt of more than three lines of myeloma therapy was associated with an increased risk of vRTI (p < 0.01). Amongst key respiratory pathogens, influenza was associated with the highest hospital admission rate (66.7%), ICU admission rate (41.6%) and mortality (33.3%) whilst RSV was associated with prolonged hospital stay. Patients with multiple myeloma and advanced disease managed with multiple lines of therapy are at risk for vRTI, and targeted interventions for prevention/treatment are required.
Cross Validation of Selection of Variables in Multiple Regression.
1979-12-01
55 vii CROSS VALIDATION OF SELECTION OF VARIABLES IN MULTIPLE REGRESSION I Introduction Background Long term DoD planning gcals...028545024 .31109000 BF * SS - .008700618 .0471961 Constant - .70977903 85.146786 55 had adequate predictive capabilities; the other two models (the...71ZCO F111D Control 54 73EGO FlIID Computer, General Purpose 55 73EPO FII1D Converter-Multiplexer 56 73HAO flllD Stabilizer Platform 57 73HCO F1ID
Byun, Bo-Ram; Kim, Yong-Il; Maki, Koutaro; Son, Woo-Sung
2015-01-01
This study was aimed to examine the correlation between skeletal maturation status and parameters from the odontoid process/body of the second vertebra and the bodies of third and fourth cervical vertebrae and simultaneously build multiple regression models to be able to estimate skeletal maturation status in Korean girls. Hand-wrist radiographs and cone beam computed tomography (CBCT) images were obtained from 74 Korean girls (6–18 years of age). CBCT-generated cervical vertebral maturation (CVM) was used to demarcate the odontoid process and the body of the second cervical vertebra, based on the dentocentral synchondrosis. Correlation coefficient analysis and multiple linear regression analysis were used for each parameter of the cervical vertebrae (P < 0.05). Forty-seven of 64 parameters from CBCT-generated CVM (independent variables) exhibited statistically significant correlations (P < 0.05). The multiple regression model with the greatest R 2 had six parameters (PH2/W2, UW2/W2, (OH+AH2)/LW2, UW3/LW3, D3, and H4/W4) as independent variables with a variance inflation factor (VIF) of <2. CBCT-generated CVM was able to include parameters from the second cervical vertebral body and odontoid process, respectively, for the multiple regression models. This suggests that quantitative analysis might be used to estimate skeletal maturation status. PMID:25878721
NASA Technical Reports Server (NTRS)
Hollyday, E. F. (Principal Investigator)
1975-01-01
The author has identified the following significant results. Streamflow characteristics in the Delmarva Peninsula derived from the records of daily discharge of 20 gaged basins are representative of the full range in flow conditions and include all of those commonly used for design or planning purposes. They include annual flood peaks with recurrence intervals of 2, 5, 10, 25, and 50 years, mean annual discharge, standard deviation of the mean annual discharge, mean monthly discharges, standard deviation of the mean monthly discharges, low-flow characteristics, flood volume characteristics, and the discharge equalled or exceeded 50 percent of the time. Streamflow and basin characteristics were related by a technique of multiple regression using a digital computer. A control group of equations was computed using basin characteristics derived from maps and climatological records. An experimental group of equations was computed using basin characteristics derived from LANDSAT imagery as well as from maps and climatological records. Based on a reduction in standard error of estimate equal to or greater than 10 percent, the equations for 12 stream flow characteristics were substantially improved by adding to the analyses basin characteristics derived from LANDSAT imagery.
Sex Differences in Diabetes Mellitus Mortality Trends in Brazil, 1980-2012
Brito, Alexandre dos Santos; Pinheiro, Rejane Sobrino; Cabral, Cristiane da Silva; de Camargo, Thais Medina Coeli Rochel
2016-01-01
Aims To investigate the hypothesis that the change from the female predominance of diabetes mellitus to a standard of equality or even male preponderance can already be observed in Brazilian mortality statistics. Methods Data on deaths for which diabetes mellitus was listed as the underlying cause were obtained from the Brazilian Mortality Information System for the years 1980 to 2012. The mortality data were also analyzed according to the multiple causes of death approach from 2001 to 2012. The population data came from the Brazilian Institute of Geography and Statistics. The mortality rates were standardized to the world population. We used a log-linear joinpoint regression to evaluate trends in age-standardized mortality rates (ASMR). Results From 1980 to 2012, we found a marked increment in the diabetes ASMR among Brazilian men and a less sharp increase in the rate among women, with the latter period (2003–2012) showing a slight decrease among women, though it was not statistically significant. Conclusions The results of this study suggest that diabetes mellitus in Brazil has changed from a pattern of higher mortality among women compared to men to equality or even male predominance. PMID:27275600
Ranking of factors determining potassium mass balance in bicarbonate haemodialysis.
Basile, Carlo; Libutti, Pasquale; Lisi, Piero; Teutonico, Annalisa; Vernaglione, Luigi; Casucci, Francesco; Lomonte, Carlo
2015-03-01
One of the most important pathogenetic factors involved in the onset of intradialysis arrhytmias is the alteration in electrolyte concentration, particularly potassium (K(+)). Two studies were performed: Study A was designed to investigate above all the isolated effect of the factor time t on intradialysis K(+) mass balance (K(+)MB): 11 stable prevalent Caucasian anuric patients underwent one standard (∼4 h) and one long-hour (∼8 h) bicarbonate haemodialysis (HD) session. The latter were pair-matched as far as the dialysate and blood volume processed (90 L) and volume of ultrafiltration are concerned. Study B was designed to identify and rank the other factors determining intradialysis K(+)MB: 63 stable prevalent Caucasian anuric patients underwent one 4-h standard bicarbonate HD session. Dialysate K(+) concentration was 2.0 mmol/L in both studies. Blood samples were obtained from the inlet blood tubing immediately before the onset of dialysis and at t60, t120, t180 min and at end of the 4- and 8-h sessions for the measurement of plasma K(+), blood bicarbonates and blood pH. Additional blood samples were obtained at t360 min for the 8 h sessions. Direct dialysate quantification was utilized for K(+)MBs. Direct potentiometry with an ion-selective electrode was used for K(+) measurements. Study A: mean K(+)MBs were significantly higher in the 8-h sessions (4 h: -88.4 ± 23.2 SD mmol versus 8 h: -101.9 ± 32.2 mmol; P = 0.02). Bivariate linear regression analyses showed that only mean plasma K(+), area under the curve (AUC) of the hourly inlet dialyser diffusion concentration gradient of K(+) (hcgAUCK(+)) and AUC of blood bicarbonates and mean blood bicarbonates were significantly related to K(+)MB in both 4- and 8-h sessions. A multiple linear regression output with K(+)MB as dependent variable showed that only mean plasma K(+), hcgAUCK(+) and duration of HD sessions per se remained statistically significant. Study B: mean K(+)MBs were -86.7 ± 22.6 mmol. Bivariate linear regression analyses showed that only mean plasma K(+), hcgAUCK(+) and mean blood bicarbonates were significantly related to K(+)MB. Again, only mean plasma K(+) and hcgAUCK(+) predicted K(+)MB at the multiple linear regression analysis. Our studies enabled to establish the ranking of factors determining intradialysis K(+)MB: plasma K(+) → dialysate K(+) gradient is the main determinant; acid-base balance plays a much less important role. The duration of HD session per se is an independent determinant of K(+)MB. © The Author 2014. Published by Oxford University Press on behalf of ERA-EDTA. All rights reserved.
NeCamp, Timothy; Kilbourne, Amy; Almirall, Daniel
2017-08-01
Cluster-level dynamic treatment regimens can be used to guide sequential treatment decision-making at the cluster level in order to improve outcomes at the individual or patient-level. In a cluster-level dynamic treatment regimen, the treatment is potentially adapted and re-adapted over time based on changes in the cluster that could be impacted by prior intervention, including aggregate measures of the individuals or patients that compose it. Cluster-randomized sequential multiple assignment randomized trials can be used to answer multiple open questions preventing scientists from developing high-quality cluster-level dynamic treatment regimens. In a cluster-randomized sequential multiple assignment randomized trial, sequential randomizations occur at the cluster level and outcomes are observed at the individual level. This manuscript makes two contributions to the design and analysis of cluster-randomized sequential multiple assignment randomized trials. First, a weighted least squares regression approach is proposed for comparing the mean of a patient-level outcome between the cluster-level dynamic treatment regimens embedded in a sequential multiple assignment randomized trial. The regression approach facilitates the use of baseline covariates which is often critical in the analysis of cluster-level trials. Second, sample size calculators are derived for two common cluster-randomized sequential multiple assignment randomized trial designs for use when the primary aim is a between-dynamic treatment regimen comparison of the mean of a continuous patient-level outcome. The methods are motivated by the Adaptive Implementation of Effective Programs Trial which is, to our knowledge, the first-ever cluster-randomized sequential multiple assignment randomized trial in psychiatry.
NASA Astrophysics Data System (ADS)
Sahabiev, I. A.; Ryazanov, S. S.; Kolcova, T. G.; Grigoryan, B. R.
2018-03-01
The three most common techniques to interpolate soil properties at a field scale—ordinary kriging (OK), regression kriging with multiple linear regression drift model (RK + MLR), and regression kriging with principal component regression drift model (RK + PCR)—were examined. The results of the performed study were compiled into an algorithm of choosing the most appropriate soil mapping technique. Relief attributes were used as the auxiliary variables. When spatial dependence of a target variable was strong, the OK method showed more accurate interpolation results, and the inclusion of the auxiliary data resulted in an insignificant improvement in prediction accuracy. According to the algorithm, the RK + PCR method effectively eliminates multicollinearity of explanatory variables. However, if the number of predictors is less than ten, the probability of multicollinearity is reduced, and application of the PCR becomes irrational. In that case, the multiple linear regression should be used instead.
Genetic Programming Transforms in Linear Regression Situations
NASA Astrophysics Data System (ADS)
Castillo, Flor; Kordon, Arthur; Villa, Carlos
The chapter summarizes the use of Genetic Programming (GP) inMultiple Linear Regression (MLR) to address multicollinearity and Lack of Fit (LOF). The basis of the proposed method is applying appropriate input transforms (model respecification) that deal with these issues while preserving the information content of the original variables. The transforms are selected from symbolic regression models with optimal trade-off between accuracy of prediction and expressional complexity, generated by multiobjective Pareto-front GP. The chapter includes a comparative study of the GP-generated transforms with Ridge Regression, a variant of ordinary Multiple Linear Regression, which has been a useful and commonly employed approach for reducing multicollinearity. The advantages of GP-generated model respecification are clearly defined and demonstrated. Some recommendations for transforms selection are given as well. The application benefits of the proposed approach are illustrated with a real industrial application in one of the broadest empirical modeling areas in manufacturing - robust inferential sensors. The chapter contributes to increasing the awareness of the potential of GP in statistical model building by MLR.
A Solution to Separation and Multicollinearity in Multiple Logistic Regression
Shen, Jianzhao; Gao, Sujuan
2010-01-01
In dementia screening tests, item selection for shortening an existing screening test can be achieved using multiple logistic regression. However, maximum likelihood estimates for such logistic regression models often experience serious bias or even non-existence because of separation and multicollinearity problems resulting from a large number of highly correlated items. Firth (1993, Biometrika, 80(1), 27–38) proposed a penalized likelihood estimator for generalized linear models and it was shown to reduce bias and the non-existence problems. The ridge regression has been used in logistic regression to stabilize the estimates in cases of multicollinearity. However, neither solves the problems for each other. In this paper, we propose a double penalized maximum likelihood estimator combining Firth’s penalized likelihood equation with a ridge parameter. We present a simulation study evaluating the empirical performance of the double penalized likelihood estimator in small to moderate sample sizes. We demonstrate the proposed approach using a current screening data from a community-based dementia study. PMID:20376286
A Solution to Separation and Multicollinearity in Multiple Logistic Regression.
Shen, Jianzhao; Gao, Sujuan
2008-10-01
In dementia screening tests, item selection for shortening an existing screening test can be achieved using multiple logistic regression. However, maximum likelihood estimates for such logistic regression models often experience serious bias or even non-existence because of separation and multicollinearity problems resulting from a large number of highly correlated items. Firth (1993, Biometrika, 80(1), 27-38) proposed a penalized likelihood estimator for generalized linear models and it was shown to reduce bias and the non-existence problems. The ridge regression has been used in logistic regression to stabilize the estimates in cases of multicollinearity. However, neither solves the problems for each other. In this paper, we propose a double penalized maximum likelihood estimator combining Firth's penalized likelihood equation with a ridge parameter. We present a simulation study evaluating the empirical performance of the double penalized likelihood estimator in small to moderate sample sizes. We demonstrate the proposed approach using a current screening data from a community-based dementia study.
Haynos, Ann F.; Snipes, Cassandra; Guarda, Angela; Mayer, Laurel E.; Attia, Evelyn
2015-01-01
Objective Sparse research informs how caloric prescriptions should be advanced during nutritional rehabilitation of inpatients with anorexia nervosa (AN). This study compared the impact of a standardized caloric increase approach, in which increases occurred on a predetermined schedule, to an individualized approach, in which increases occurred only following insufficient weight gain, on rate, pattern, and cumulative amount of weight gain and other weight restoration outcomes. Method This study followed a natural experiment design comparing AN inpatients consecutively admitted before (n = 35) and after (n = 35) an institutional change from individualized to standardized caloric prescriptions. Authors examined the impact of prescription plan on weekly weight gain in the first treatment month using multilevel modeling. Within a subsample remaining inpatient through weight restoration (n = 40), multiple regressions examined the impact of caloric prescription plan on time to weight restoration, length of hospitalization, maximum caloric prescription, discharge BMI, and incidence of activity restriction and edema. Results There were significant interactions between prescription plan and quadratic time on average weekly weight gain (p = .03) and linear time on cumulative weekly weight gain (p < .001). Under the standardized plan, patients gained in an accelerated curvilinear pattern (p = .04) and, therefore, gained cumulatively greater amounts of weight over time (p < .001). Additionally, 30% fewer patients required activity restriction under the standardized plan. Discussion Standardized caloric prescriptions may confer advantage by facilitating accelerated early weight gain and lower incidence of bed rest without increasing the incidence of refeeding syndrome. PMID:26769581
NASA Technical Reports Server (NTRS)
Whitlock, C. H.; Kuo, C. Y.
1979-01-01
The objective of this paper is to define optical physics and/or environmental conditions under which the linear multiple-regression should be applicable. An investigation of the signal-response equations is conducted and the concept is tested by application to actual remote sensing data from a laboratory experiment performed under controlled conditions. Investigation of the signal-response equations shows that the exact solution for a number of optical physics conditions is of the same form as a linearized multiple-regression equation, even if nonlinear contributions from surface reflections, atmospheric constituents, or other water pollutants are included. Limitations on achieving this type of solution are defined.
NASA Astrophysics Data System (ADS)
Whitmyer, Charnita P.
This dissertation uses Bolman and Deal's Four Framework approach to reframing an organization to examine science teachers' beliefs on teacher preparation and reform practices for diverse learners. Despite the national emphasis on "science for all students" in the National Science Education Standards (NRC, 2011), some traditionally underserved groups tend to underperform on standardized measures of science learning (Kober, 2001; Darling-Hammond, 2010; Bracey, 2009; Kozol, 2009, 2007; PCAST, 2012); and teachers struggle to meet the needs of these students (Hira, 2010). The literature is replete with calls for a better understanding of teacher quality as an entry point into increased student achievement in science. In the current study, the 2012 National Survey of Science and Mathematics Education (NSSME) was used to gain an understanding of science teacher quality in the United States, and SPSS 22.0 software was used to evaluate descriptive and inferential statistics, including bivariate correlation analysis, simple linear regression, and a multiple regression of the survey responses. The findings indicated that professional development was the most salient predictor of teachers' preparedness to teach diverse learners. Findings further showed that teachers who held favorable perceptions of preparedness to teach diverse learners were more likely to use reform-oriented practices. This study contributes to an emerging area of research on science teacher quality and its influence on instructional reform for diverse learners. The study concludes with a discussion of supports and obstacles that may enable or inhibit the development of these relationships.
Food Polyamine and Cardiovascular Disease -An Epidemiological Study-
Soda, Kuniyasu; Kano, Yoshihiko; Chiba, Fumihiro
2012-01-01
The purpose of this study was to examine the contribution of dietary polyamines toward preventing cardiovascular disease (CVD). Age-standardized mortality rates as well as other relevant information regarding individuals with CVD were gathered from the World Health Organization and the International Monetary Fund in 48 different European and other Western countries. Food supply data were collected from the database of the United Nations, and the amount of dietary polyamines was estimated by using polyamine concentrations in foods from published sources. The association between CVD mortality and the amount of polyamines was investigated by performing a series of multiple linear regression analyses. Analyses using factors known to modulate the risk of CVD including: Gross Domestic Product (GDP) (standardized regression coefficient (r) = -0.786, p < 0.001) and the amount of fruits, vegetable, nuts, and beans (r = -0.183, p = 0.001) but not including polyamines, showed negative associations with CVD, while smoking rate (r = 0.139, p = 0.041) and whole milk amount (r = 0.131, p = 0.028) showed positive associations with CVD. When the amount of polyamines was added to the analyses as a covariate, GDP (r = -0.864, p < 0.001) and polyamines (r = -0.355, p = 0.007) showed negative associations with CVD, while smoking rate (r = 0.183, p = 0.006) and whole milk (r = 0.113, p = 0.041) showed positive associations with CVD. The inverse association between dietary polyamines and CVD mortality revealed by the present study merits further evaluation. PMID:23121753
Food polyamine and cardiovascular disease--an epidemiological study.
Soda, Kuniyasu; Kano, Yoshihiko; Chiba, Fumihiro
2012-09-28
The purpose of this study was to examine the contribution of dietary polyamines toward preventing cardiovascular disease (CVD). Age-standardized mortality rates as well as other relevant information regarding individuals with CVD were gathered from the World Health Organization and the International Monetary Fund in 48 different European and other Western countries. Food supply data were collected from the database of the United Nations, and the amount of dietary polyamines was estimated by using polyamine concentrations in foods from published sources. The association between CVD mortality and the amount of polyamines was investigated by performing a series of multiple linear regression analyses. Analyses using factors known to modulate the risk of CVD including: Gross Domestic Product (GDP) (standardized regression coefficient (r) = -0.786, p < 0.001) and the amount of fruits, vegetable, nuts, and beans (r = -0.183, p = 0.001) but not including polyamines, showed negative associations with CVD, while smoking rate (r = 0.139, p = 0.041) and whole milk amount (r = 0.131, p = 0.028) showed positive associations with CVD. When the amount of polyamines was added to the analyses as a covariate, GDP (r = -0.864, p < 0.001) and polyamines (r = -0.355, p = 0.007) showed negative associations with CVD, while smoking rate (r = 0.183, p = 0.006) and whole milk (r = 0.113, p = 0.041) showed positive associations with CVD. The inverse association between dietary polyamines and CVD mortality revealed by the present study merits further evaluation.
Methodology for the development of normative data for Spanish-speaking pediatric populations.
Rivera, D; Arango-Lasprilla, J C
2017-01-01
To describe the methodology utilized to calculate reliability and the generation of norms for 10 neuropsychological tests for children in Spanish-speaking countries. The study sample consisted of over 4,373 healthy children from nine countries in Latin America (Chile, Cuba, Ecuador, Guatemala, Honduras, Mexico, Paraguay, Peru, and Puerto Rico) and Spain. Inclusion criteria for all countries were to have between 6 to 17 years of age, an Intelligence Quotient of≥80 on the Test of Non-Verbal Intelligence (TONI-2), and score of <19 on the Children's Depression Inventory. Participants completed 10 neuropsychological tests. Reliability and norms were calculated for all tests. Test-retest analysis showed excellent or good- reliability on all tests (r's>0.55; p's<0.001) except M-WCST perseverative errors whose coefficient magnitude was fair. All scores were normed using multiple linear regressions and standard deviations of residual values. Age, age2, sex, and mean level of parental education (MLPE) were included as predictors in the models by country. The non-significant variables (p > 0.05) were removed and the analysis were run again. This is the largest Spanish-speaking children and adolescents normative study in the world. For the generation of normative data, the method based on linear regression models and the standard deviation of residual values was used. This method allows determination of the specific variables that predict test scores, helps identify and control for collinearity of predictive variables, and generates continuous and more reliable norms than those of traditional methods.
Development of a mobbing short scale in the Gutenberg Health Study.
Garthus-Niegel, Susan; Nübling, Matthias; Letzel, Stephan; Hegewald, Janice; Wagner, Mandy; Wild, Philipp S; Blettner, Maria; Zwiener, Isabella; Latza, Ute; Jankowiak, Sylvia; Liebers, Falk; Seidler, Andreas
2016-01-01
Despite its highly detrimental potential, most standard questionnaires assessing psychosocial stress at work do not include mobbing as a risk factor. In the German standard version of COPSOQ, mobbing is assessed with a single item. In the Gutenberg Health Study, this version was used together with a newly developed short scale based on the Leymann Inventory of Psychological Terror. The purpose of the present study was to evaluate the psychometric properties of these two measures, to compare them and to test their differential impact on relevant outcome parameters. This analysis is based on a population-based sample of 1441 employees participating in the Gutenberg Health Study. Exploratory and confirmatory factor analyses and reliability analyses were used to assess the mobbing scale. To determine their predictive validities, multiple linear regression analyses with six outcome parameters and log-binomial regression models for two of the outcome aspects were run. Factor analyses of the five-item scale confirmed a one-factor solution, reliability was α = 0.65. Both the single-item and the five-item scales were associated with all six outcome scales. Effect sizes were similar for both mobbing measures. Mobbing is an important risk factor for health-related outcomes. For the purpose of psychosocial risk assessment in the workplace, both the single-item and the five-item constructs were psychometrically appropriate. Associations with outcomes were about equivalent. However, the single item has the advantage of parsimony, whereas the five-item construct depicts several distinct forms of mobbing.
Lim, Pooi Khoon; Ng, Siew-Cheok; Jassim, Wissam A.; Redmond, Stephen J.; Zilany, Mohammad; Avolio, Alberto; Lim, Einly; Tan, Maw Pin; Lovell, Nigel H.
2015-01-01
We present a novel approach to improve the estimation of systolic (SBP) and diastolic blood pressure (DBP) from oscillometric waveform data using variable characteristic ratios between SBP and DBP with mean arterial pressure (MAP). This was verified in 25 healthy subjects, aged 28 ± 5 years. The multiple linear regression (MLR) and support vector regression (SVR) models were used to examine the relationship between the SBP and the DBP ratio with ten features extracted from the oscillometric waveform envelope (OWE). An automatic algorithm based on relative changes in the cuff pressure and neighbouring oscillometric pulses was proposed to remove outlier points caused by movement artifacts. Substantial reduction in the mean and standard deviation of the blood pressure estimation errors were obtained upon artifact removal. Using the sequential forward floating selection (SFFS) approach, we were able to achieve a significant reduction in the mean and standard deviation of differences between the estimated SBP values and the reference scoring (MLR: mean ± SD = −0.3 ± 5.8 mmHg; SVR and −0.6 ± 5.4 mmHg) with only two features, i.e., Ratio2 and Area3, as compared to the conventional maximum amplitude algorithm (MAA) method (mean ± SD = −1.6 ± 8.6 mmHg). Comparing the performance of both MLR and SVR models, our results showed that the MLR model was able to achieve comparable performance to that of the SVR model despite its simplicity. PMID:26087370
Chaitoff, Alexander; Sun, Bob; Windover, Amy; Bokar, Daniel; Featherall, Joseph; Rothberg, Michael B; Misra-Hebert, Anita D
2017-10-01
To identify correlates of physician empathy and determine whether physician empathy is related to standardized measures of patient experience. Demographic, professional, and empathy data were collected during 2013-2015 from Cleveland Clinic Health System physicians prior to participation in mandatory communication skills training. Empathy was assessed using the Jefferson Scale of Empathy. Data were also collected for seven measures (six provider communication items and overall provider rating) from the visit-specific and 12-month Consumer Assessment of Healthcare Providers and Systems Clinician and Group (CG-CAHPS) surveys. Associations between empathy and provider characteristics were assessed by linear regression, ANOVA, or a nonparametric equivalent. Significant predictors were included in a multivariable linear regression model. Correlations between empathy and CG-CAHPS scores were assessed using Spearman rank correlation coefficients. In bivariable analysis (n = 847 physicians), female sex (P < .001), specialty (P < .01), outpatient practice setting (P < .05), and DO degree (P < .05) were associated with higher empathy scores. In multivariable analysis, female sex (P < .001) and four specialties (obstetrics-gynecology, pediatrics, psychiatry, and thoracic surgery; all P < .05) were significantly associated with higher empathy scores. Of the seven CG-CAHPS measures, scores on five for the 583 physicians with visit-specific data and on three for the 277 physicians with 12-month data were positively correlated with empathy. Specialty and sex were independently associated with physician empathy. Empathy was correlated with higher scores on multiple CG-CAHPS items, suggesting improving physician empathy might play a role in improving patient experience.
Prediction equations for maximal respiratory pressures of Brazilian adolescents.
Mendes, Raquel E F; Campos, Tania F; Macêdo, Thalita M F; Borja, Raíssa O; Parreira, Verônica F; Mendonça, Karla M P P
2013-01-01
The literature emphasizes the need for studies to provide reference values and equations able to predict respiratory muscle strength of Brazilian subjects at different ages and from different regions of Brazil. To develop prediction equations for maximal respiratory pressures (MRP) of Brazilian adolescents. In total, 182 healthy adolescents (98 boys and 84 girls) aged between 12 and 18 years, enrolled in public and private schools in the city of Natal-RN, were evaluated using an MVD300 digital manometer (Globalmed®) according to a standardized protocol. Statistical analysis was performed using SPSS Statistics 17.0 software, with a significance level of 5%. Data normality was verified using the Kolmogorov-Smirnov test, and descriptive analysis results were expressed as the mean and standard deviation. To verify the correlation between the MRP and the independent variables (age, weight, height and sex), the Pearson correlation test was used. To obtain the prediction equations, stepwise multiple linear regression was used. The variables height, weight and sex were correlated to MRP. However, weight and sex explained part of the variability of MRP, and the regression analysis in this study indicated that these variables contributed significantly in predicting maximal inspiratory pressure, and only sex contributed significantly to maximal expiratory pressure. This study provides reference values and two models of prediction equations for maximal inspiratory and expiratory pressures and sets the necessary normal lower limits for the assessment of the respiratory muscle strength of Brazilian adolescents.
NASA Astrophysics Data System (ADS)
Chiu, Tina
This dissertation includes three studies that analyze a new set of assessment tasks developed by the Learning Progressions in Middle School Science (LPS) Project. These assessment tasks were designed to measure science content knowledge on the structure of matter domain and scientific argumentation, while following the goals from the Next Generation Science Standards (NGSS). The three studies focus on the evidence available for the success of this design and its implementation, generally labelled as "validity" evidence. I use explanatory item response models (EIRMs) as the overarching framework to investigate these assessment tasks. These models can be useful when gathering validity evidence for assessments as they can help explain student learning and group differences. In the first study, I explore the dimensionality of the LPS assessment by comparing the fit of unidimensional, between-item multidimensional, and Rasch testlet models to see which is most appropriate for this data. By applying multidimensional item response models, multiple relationships can be investigated, and in turn, allow for a more substantive look into the assessment tasks. The second study focuses on person predictors through latent regression and differential item functioning (DIF) models. Latent regression models show the influence of certain person characteristics on item responses, while DIF models test whether one group is differentially affected by specific assessment items, after conditioning on latent ability. Finally, the last study applies the linear logistic test model (LLTM) to investigate whether item features can help explain differences in item difficulties.
ERIC Educational Resources Information Center
Rocconi, Louis M.
2011-01-01
Hierarchical linear models (HLM) solve the problems associated with the unit of analysis problem such as misestimated standard errors, heterogeneity of regression and aggregation bias by modeling all levels of interest simultaneously. Hierarchical linear modeling resolves the problem of misestimated standard errors by incorporating a unique random…
Korany, Mohamed A; Maher, Hadir M; Galal, Shereen M; Ragab, Marwa A A
2013-05-01
This manuscript discusses the application and the comparison between three statistical regression methods for handling data: parametric, nonparametric, and weighted regression (WR). These data were obtained from different chemometric methods applied to the high-performance liquid chromatography response data using the internal standard method. This was performed on a model drug Acyclovir which was analyzed in human plasma with the use of ganciclovir as internal standard. In vivo study was also performed. Derivative treatment of chromatographic response ratio data was followed by convolution of the resulting derivative curves using 8-points sin x i polynomials (discrete Fourier functions). This work studies and also compares the application of WR method and Theil's method, a nonparametric regression (NPR) method with the least squares parametric regression (LSPR) method, which is considered the de facto standard method used for regression. When the assumption of homoscedasticity is not met for analytical data, a simple and effective way to counteract the great influence of the high concentrations on the fitted regression line is to use WR method. WR was found to be superior to the method of LSPR as the former assumes that the y-direction error in the calibration curve will increase as x increases. Theil's NPR method was also found to be superior to the method of LSPR as the former assumes that errors could occur in both x- and y-directions and that might not be normally distributed. Most of the results showed a significant improvement in the precision and accuracy on applying WR and NPR methods relative to LSPR.
Bowden, Jack; Del Greco M, Fabiola; Minelli, Cosetta; Davey Smith, George; Sheehan, Nuala A; Thompson, John R
2016-12-01
: MR-Egger regression has recently been proposed as a method for Mendelian randomization (MR) analyses incorporating summary data estimates of causal effect from multiple individual variants, which is robust to invalid instruments. It can be used to test for directional pleiotropy and provides an estimate of the causal effect adjusted for its presence. MR-Egger regression provides a useful additional sensitivity analysis to the standard inverse variance weighted (IVW) approach that assumes all variants are valid instruments. Both methods use weights that consider the single nucleotide polymorphism (SNP)-exposure associations to be known, rather than estimated. We call this the `NO Measurement Error' (NOME) assumption. Causal effect estimates from the IVW approach exhibit weak instrument bias whenever the genetic variants utilized violate the NOME assumption, which can be reliably measured using the F-statistic. The effect of NOME violation on MR-Egger regression has yet to be studied. An adaptation of the I2 statistic from the field of meta-analysis is proposed to quantify the strength of NOME violation for MR-Egger. It lies between 0 and 1, and indicates the expected relative bias (or dilution) of the MR-Egger causal estimate in the two-sample MR context. We call it IGX2 . The method of simulation extrapolation is also explored to counteract the dilution. Their joint utility is evaluated using simulated data and applied to a real MR example. In simulated two-sample MR analyses we show that, when a causal effect exists, the MR-Egger estimate of causal effect is biased towards the null when NOME is violated, and the stronger the violation (as indicated by lower values of IGX2 ), the stronger the dilution. When additionally all genetic variants are valid instruments, the type I error rate of the MR-Egger test for pleiotropy is inflated and the causal effect underestimated. Simulation extrapolation is shown to substantially mitigate these adverse effects. We demonstrate our proposed approach for a two-sample summary data MR analysis to estimate the causal effect of low-density lipoprotein on heart disease risk. A high value of IGX2 close to 1 indicates that dilution does not materially affect the standard MR-Egger analyses for these data. : Care must be taken to assess the NOME assumption via the IGX2 statistic before implementing standard MR-Egger regression in the two-sample summary data context. If IGX2 is sufficiently low (less than 90%), inferences from the method should be interpreted with caution and adjustment methods considered. © The Author 2016. Published by Oxford University Press on behalf of the International Epidemiological Association.
Louys, Julien; Meloro, Carlo; Elton, Sarah; Ditchfield, Peter; Bishop, Laura C
2015-01-01
We test the performance of two models that use mammalian communities to reconstruct multivariate palaeoenvironments. While both models exploit the correlation between mammal communities (defined in terms of functional groups) and arboreal heterogeneity, the first uses a multiple multivariate regression of community structure and arboreal heterogeneity, while the second uses a linear regression of the principal components of each ecospace. The success of these methods means the palaeoenvironment of a particular locality can be reconstructed in terms of the proportions of heavy, moderate, light, and absent tree canopy cover. The linear regression is less biased, and more precisely and accurately reconstructs heavy tree canopy cover than the multiple multivariate model. However, the multiple multivariate model performs better than the linear regression for all other canopy cover categories. Both models consistently perform better than randomly generated reconstructions. We apply both models to the palaeocommunity of the Upper Laetolil Beds, Tanzania. Our reconstructions indicate that there was very little heavy tree cover at this site (likely less than 10%), with the palaeo-landscape instead comprising a mixture of light and absent tree cover. These reconstructions help resolve the previous conflicting palaeoecological reconstructions made for this site. Copyright © 2014 Elsevier Ltd. All rights reserved.
Cruz, Antonio M; Barr, Cameron; Puñales-Pozo, Elsa
2008-01-01
This research's main goals were to build a predictor for a turnaround time (TAT) indicator for estimating its values and use a numerical clustering technique for finding possible causes of undesirable TAT values. The following stages were used: domain understanding, data characterisation and sample reduction and insight characterisation. Building the TAT indicator multiple linear regression predictor and clustering techniques were used for improving corrective maintenance task efficiency in a clinical engineering department (CED). The indicator being studied was turnaround time (TAT). Multiple linear regression was used for building a predictive TAT value model. The variables contributing to such model were clinical engineering department response time (CE(rt), 0.415 positive coefficient), stock service response time (Stock(rt), 0.734 positive coefficient), priority level (0.21 positive coefficient) and service time (0.06 positive coefficient). The regression process showed heavy reliance on Stock(rt), CE(rt) and priority, in that order. Clustering techniques revealed the main causes of high TAT values. This examination has provided a means for analysing current technical service quality and effectiveness. In doing so, it has demonstrated a process for identifying areas and methods of improvement and a model against which to analyse these methods' effectiveness.
Pratt, Bethany; Chang, Heejun
2012-03-30
The relationship among land cover, topography, built structure and stream water quality in the Portland Metro region of Oregon and Clark County, Washington areas, USA, is analyzed using ordinary least squares (OLS) and geographically weighted (GWR) multiple regression models. Two scales of analysis, a sectional watershed and a buffer, offered a local and a global investigation of the sources of stream pollutants. Model accuracy, measured by R(2) values, fluctuated according to the scale, season, and regression method used. While most wet season water quality parameters are associated with urban land covers, most dry season water quality parameters are related topographic features such as elevation and slope. GWR models, which take into consideration local relations of spatial autocorrelation, had stronger results than OLS regression models. In the multiple regression models, sectioned watershed results were consistently better than the sectioned buffer results, except for dry season pH and stream temperature parameters. This suggests that while riparian land cover does have an effect on water quality, a wider contributing area needs to be included in order to account for distant sources of pollutants. Copyright © 2012 Elsevier B.V. All rights reserved.
Mohr, David C; Eaton, Jennifer Lipkowitz; McPhaul, Kathleen M; Hodgson, Michael J
2015-04-22
We examined relationships between employee safety climate and patient safety culture. Because employee safety may be a precondition for the development of patient safety, we hypothesized that employee safety culture would be strongly and positively related to patient safety culture. An employee safety climate survey was administered in 2010 and assessed employees' views and experiences of safety for employees. The patient safety survey administered in 2011 assessed the safety culture for patients. We performed Pearson correlations and multiple regression analysis to examine the relationships between a composite measure of employee safety with subdimensions of patient safety culture. The regression models controlled for size, geographic characteristics, and teaching affiliation. Analyses were conducted at the group level using data from 132 medical centers. Higher employee safety climate composite scores were positively associated with all 9 patient safety culture measures examined. Standardized multivariate regression coefficients ranged from 0.44 to 0.64. Medical facilities where staff have more positive perceptions of health care workplace safety climate tended to have more positive assessments of patient safety culture. This suggests that patient safety culture and employee safety climate could be mutually reinforcing, such that investments and improvements in one domain positively impacts the other. Further research is needed to better understand the nexus between health care employee and patient safety to generalize and act upon findings.
Application of near-infrared spectroscopy for the rapid quality assessment of Radix Paeoniae Rubra
NASA Astrophysics Data System (ADS)
Zhan, Hao; Fang, Jing; Tang, Liying; Yang, Hongjun; Li, Hua; Wang, Zhuju; Yang, Bin; Wu, Hongwei; Fu, Meihong
2017-08-01
Near-infrared (NIR) spectroscopy with multivariate analysis was used to quantify gallic acid, catechin, albiflorin, and paeoniflorin in Radix Paeoniae Rubra, and the feasibility to classify the samples originating from different areas was investigated. A new high-performance liquid chromatography method was developed and validated to analyze gallic acid, catechin, albiflorin, and paeoniflorin in Radix Paeoniae Rubra as the reference. Partial least squares (PLS), principal component regression (PCR), and stepwise multivariate linear regression (SMLR) were performed to calibrate the regression model. Different data pretreatments such as derivatives (1st and 2nd), multiplicative scatter correction, standard normal variate, Savitzky-Golay filter, and Norris derivative filter were applied to remove the systematic errors. The performance of the model was evaluated according to the root mean square of calibration (RMSEC), root mean square error of prediction (RMSEP), root mean square error of cross-validation (RMSECV), and correlation coefficient (r). The results show that compared to PCR and SMLR, PLS had a lower RMSEC, RMSECV, and RMSEP and higher r for all the four analytes. PLS coupled with proper pretreatments showed good performance in both the fitting and predicting results. Furthermore, the original areas of Radix Paeoniae Rubra samples were partly distinguished by principal component analysis. This study shows that NIR with PLS is a reliable, inexpensive, and rapid tool for the quality assessment of Radix Paeoniae Rubra.
1981-09-01
corresponds to the same square footage that consumed the electrical energy. 3. The basic assumptions of multiple linear regres- sion, as enumerated in...7. Data related to the sample of bases is assumed to be representative of bases in the population. Limitations Basic limitations on this research were... Ratemaking --Overview. Rand Report R-5894, Santa Monica CA, May 1977. Chatterjee, Samprit, and Bertram Price. Regression Analysis by Example. New York: John
Estimates of Median Flows for Streams on the 1999 Kansas Surface Water Register
Perry, Charles A.; Wolock, David M.; Artman, Joshua C.
2004-01-01
The Kansas State Legislature, by enacting Kansas Statute KSA 82a?2001 et. seq., mandated the criteria for determining which Kansas stream segments would be subject to classification by the State. One criterion for the selection as a classified stream segment is based on the statistic of median flow being equal to or greater than 1 cubic foot per second. As specified by KSA 82a?2001 et. seq., median flows were determined from U.S. Geological Survey streamflow-gaging-station data by using the most-recent 10 years of gaged data (KSA) for each streamflow-gaging station. Median flows also were determined by using gaged data from the entire period of record (all-available hydrology, AAH). Least-squares multiple regression techniques were used, along with Tobit analyses, to develop equations for estimating median flows for uncontrolled stream segments. The drainage area of the gaging stations on uncontrolled stream segments used in the regression analyses ranged from 2.06 to 12,004 square miles. A logarithmic transformation of the data was needed to develop the best linear relation for computing median flows. In the regression analyses, the significant climatic and basin characteristics, in order of importance, were drainage area, mean annual precipitation, mean basin permeability, and mean basin slope. Tobit analyses of KSA data yielded a model standard error of prediction of 0.285 logarithmic units, and the best equations using Tobit analyses of AAH data had a model standard error of prediction of 0.250 logarithmic units. These regression equations and an interpolation procedure were used to compute median flows for the uncontrolled stream segments on the 1999 Kansas Surface Water Register. Measured median flows from gaging stations were incorporated into the regression-estimated median flows along the stream segments where available. The segments that were uncontrolled were interpolated using gaged data weighted according to the drainage area and the bias between the regression-estimated and gaged flow information. On controlled segments of Kansas streams, the median flow information was interpolated between gaging stations using only gaged data weighted by drainage area. Of the 2,232 total stream segments on the Kansas Surface Water Register, 34.5 percent of the segments had an estimated median streamflow of less than 1 cubic foot per second when the KSA analysis was used. When the AAH analysis was used, 36.2 percent of the segments had an estimated median streamflow of less than 1 cubic foot per second. This report supercedes U.S. Geological Survey Water-Resources Investigations Report 02?4292.
David, Ingrid; Garreau, Hervé; Balmisse, Elodie; Billon, Yvon; Canario, Laurianne
2017-01-20
Some genetic studies need to take into account correlations between traits that are repeatedly measured over time. Multiple-trait random regression models are commonly used to analyze repeated traits but suffer from several major drawbacks. In the present study, we developed a multiple-trait extension of the structured antedependence model (SAD) to overcome this issue and validated its usefulness by modeling the association between litter size (LS) and average birth weight (ABW) over parities in pigs and rabbits. The single-trait SAD model assumes that a random effect at time [Formula: see text] can be explained by the previous values of the random effect (i.e. at previous times). The proposed multiple-trait extension of the SAD model consists in adding a cross-antedependence parameter to the single-trait SAD model. This model can be easily fitted using ASReml and the OWN Fortran program that we have developed. In comparison with the random regression model, we used our multiple-trait SAD model to analyze the LS and ABW of 4345 litters from 1817 Large White sows and 8706 litters from 2286 L-1777 does over a maximum of five successive parities. For both species, the multiple-trait SAD fitted the data better than the random regression model. The difference between AIC of the two models (AIC_random regression-AIC_SAD) were equal to 7 and 227 for pigs and rabbits, respectively. A similar pattern of heritability and correlation estimates was obtained for both species. Heritabilities were lower for LS (ranging from 0.09 to 0.29) than for ABW (ranging from 0.23 to 0.39). The general trend was a decrease of the genetic correlation for a given trait between more distant parities. Estimates of genetic correlations between LS and ABW were negative and ranged from -0.03 to -0.52 across parities. No correlation was observed between the permanent environmental effects, except between the permanent environmental effects of LS and ABW of the same parity, for which the estimate of the correlation was strongly negative (ranging from -0.57 to -0.67). We demonstrated that application of our multiple-trait SAD model is feasible for studying several traits with repeated measurements and showed that it provided a better fit to the data than the random regression model.
Determination of streamflow of the Arkansas River near Bentley in south-central Kansas
Perry, Charles A.
2012-01-01
The Kansas Department of Agriculture, Division of Water Resources, requires that the streamflow of the Arkansas River just upstream from Bentley in south-central Kansas be measured or calculated before groundwater can be pumped from the well field. When the daily streamflow of the Arkansas River near Bentley is less than 165 cubic feet per second (ft3/s), pumping must be curtailed. Daily streamflow near Bentley was calculated by determining the relations between streamflow data from two reference streamgages with a concurrent record of 24 years, one located 17.2 miles (mi) upstream and one located 10.9 mi downstream, and streamflow at a temporary gage located just upstream from Bentley (Arkansas River near Bentley, Kansas). Flow-duration curves for the two reference streamgages indicate that during 1988?2011, the mean daily streamflow was less than 165 ft3/s 30 to 35 percent of the time. During extreme low-flow (drought) conditions, the reach of the Arkansas River between Hutchinson and Maize can lose flow to the adjacent alluvial aquifer, with streamflow losses as much as 1.6 cubic feet per second per mile. Three models were developed to calculate the streamflow of the Arkansas River near Bentley, Kansas. The model chosen depends on the data available and on whether the reach of the Arkansas River between Hutchinson and Maize is gaining or losing groundwater from or to the adjacent alluvial aquifer. The first model was a pair of equations developed from linear regressions of the relation between daily streamflow data from the Bentley streamgage and daily streamflow data from either the Arkansas River near Hutchinson, Kansas, station (station number 07143330) or the Arkansas River near Maize, Kansas, station (station number 07143375). The standard error of the Hutchinson-only equation was 22.8 ft3/s, and the standard error of the Maize-only equation was 22.3 ft3/s. The single-station model would be used if only one streamgage was available. In the second model, the flow gradient between the streamflow near Hutchinson and the streamflow near Maize was used to calculate the streamflow at the Bentley streamgage. This equation resulted in a standard error of 26.7 ft3/s. In the third model, a multiple regression analysis between both the daily streamflow of the Arkansas River near Hutchinson, Kansas, and the daily streamflow of the Arkansas River near Maize, Kansas, was used to calculate the streamflow at the Bentley streamgage. The multiple regression equation had a standard error of 21.2 ft3/s, which was the smallest of the standard errors for all the models. An analysis of the number of low-flow days and the number of days when the reach between Hutchinson and Maize loses flow to the adjacent alluvial aquifer indicates that the long-term trend is toward fewer days of losing conditions. This trend may indicate a long-term increase in water levels in the alluvial aquifer, which could be caused by one or more of several conditions, including an increase in rainfall, a decrease in pumping, a decrease in temperature, and an increase in streamflow upstream from the Hutchinson-to-Maize reach of the Arkansas River.
Yang, Xiaowei; Nie, Kun
2008-03-15
Longitudinal data sets in biomedical research often consist of large numbers of repeated measures. In many cases, the trajectories do not look globally linear or polynomial, making it difficult to summarize the data or test hypotheses using standard longitudinal data analysis based on various linear models. An alternative approach is to apply the approaches of functional data analysis, which directly target the continuous nonlinear curves underlying discretely sampled repeated measures. For the purposes of data exploration, many functional data analysis strategies have been developed based on various schemes of smoothing, but fewer options are available for making causal inferences regarding predictor-outcome relationships, a common task seen in hypothesis-driven medical studies. To compare groups of curves, two testing strategies with good power have been proposed for high-dimensional analysis of variance: the Fourier-based adaptive Neyman test and the wavelet-based thresholding test. Using a smoking cessation clinical trial data set, this paper demonstrates how to extend the strategies for hypothesis testing into the framework of functional linear regression models (FLRMs) with continuous functional responses and categorical or continuous scalar predictors. The analysis procedure consists of three steps: first, apply the Fourier or wavelet transform to the original repeated measures; then fit a multivariate linear model in the transformed domain; and finally, test the regression coefficients using either adaptive Neyman or thresholding statistics. Since a FLRM can be viewed as a natural extension of the traditional multiple linear regression model, the development of this model and computational tools should enhance the capacity of medical statistics for longitudinal data.
5 CFR 591.219 - How does OPM compute shelter price indexes?
Code of Federal Regulations, 2014 CFR
2014-01-01
... estimates in hedonic regressions (a type of multiple regression) to compute for each COLA survey area the price index for rental and/or rental equivalent units of comparable quality and size between the COLA...
5 CFR 591.219 - How does OPM compute shelter price indexes?
Code of Federal Regulations, 2011 CFR
2011-01-01
... estimates in hedonic regressions (a type of multiple regression) to compute for each COLA survey area the price index for rental and/or rental equivalent units of comparable quality and size between the COLA...
5 CFR 591.219 - How does OPM compute shelter price indexes?
Code of Federal Regulations, 2013 CFR
2013-01-01
... estimates in hedonic regressions (a type of multiple regression) to compute for each COLA survey area the price index for rental and/or rental equivalent units of comparable quality and size between the COLA...
5 CFR 591.219 - How does OPM compute shelter price indexes?
Code of Federal Regulations, 2012 CFR
2012-01-01
... estimates in hedonic regressions (a type of multiple regression) to compute for each COLA survey area the price index for rental and/or rental equivalent units of comparable quality and size between the COLA...
Steen, Paul J.; Passino-Reader, Dora R.; Wiley, Michael J.
2006-01-01
As a part of the Great Lakes Regional Aquatic Gap Analysis Project, we evaluated methodologies for modeling associations between fish species and habitat characteristics at a landscape scale. To do this, we created brook trout Salvelinus fontinalis presence and absence models based on four different techniques: multiple linear regression, logistic regression, neural networks, and classification trees. The models were tested in two ways: by application to an independent validation database and cross-validation using the training data, and by visual comparison of statewide distribution maps with historically recorded occurrences from the Michigan Fish Atlas. Although differences in the accuracy of our models were slight, the logistic regression model predicted with the least error, followed by multiple regression, then classification trees, then the neural networks. These models will provide natural resource managers a way to identify habitats requiring protection for the conservation of fish species.