Standards for Standardized Logistic Regression Coefficients
ERIC Educational Resources Information Center
Menard, Scott
2011-01-01
Standardized coefficients in logistic regression analysis have the same utility as standardized coefficients in linear regression analysis. Although there has been no consensus on the best way to construct standardized logistic regression coefficients, there is now sufficient evidence to suggest a single best approach to the construction of a…
Biases and Standard Errors of Standardized Regression Coefficients
ERIC Educational Resources Information Center
Yuan, Ke-Hai; Chan, Wai
2011-01-01
The paper obtains consistent standard errors (SE) and biases of order O(1/n) for the sample standardized regression coefficients with both random and given predictors. Analytical results indicate that the formulas for SEs given in popular text books are consistent only when the population value of the regression coefficient is zero. The sample…
On the Occurrence of Standardized Regression Coefficients Greater than One.
ERIC Educational Resources Information Center
Deegan, John, Jr.
1978-01-01
It is demonstrated here that standardized regression coefficients greater than one can legitimately occur. Furthermore, the relationship between the occurrence of such coefficients and the extent of multicollinearity present among the set of predictor variables in an equation is examined. Comments on the interpretation of these coefficients are…
Population heterogeneity in the salience of multiple risk factors for adolescent delinquency.
Lanza, Stephanie T; Cooper, Brittany R; Bray, Bethany C
2014-03-01
To present mixture regression analysis as an alternative to more standard regression analysis for predicting adolescent delinquency. We demonstrate how mixture regression analysis allows for the identification of population subgroups defined by the salience of multiple risk factors. We identified population subgroups (i.e., latent classes) of individuals based on their coefficients in a regression model predicting adolescent delinquency from eight previously established risk indices drawn from the community, school, family, peer, and individual levels. The study included N = 37,763 10th-grade adolescents who participated in the Communities That Care Youth Survey. Standard, zero-inflated, and mixture Poisson and negative binomial regression models were considered. Standard and mixture negative binomial regression models were selected as optimal. The five-class regression model was interpreted based on the class-specific regression coefficients, indicating that risk factors had varying salience across classes of adolescents. Standard regression showed that all risk factors were significantly associated with delinquency. Mixture regression provided more nuanced information, suggesting a unique set of risk factors that were salient for different subgroups of adolescents. Implications for the design of subgroup-specific interventions are discussed. Copyright © 2014 Society for Adolescent Health and Medicine. Published by Elsevier Inc. All rights reserved.
NASA Technical Reports Server (NTRS)
Kalton, G.
1983-01-01
A number of surveys were conducted to study the relationship between the level of aircraft or traffic noise exposure experienced by people living in a particular area and their annoyance with it. These surveys generally employ a clustered sample design which affects the precision of the survey estimates. Regression analysis of annoyance on noise measures and other variables is often an important component of the survey analysis. Formulae are presented for estimating the standard errors of regression coefficients and ratio of regression coefficients that are applicable with a two- or three-stage clustered sample design. Using a simple cost function, they also determine the optimum allocation of the sample across the stages of the sample design for the estimation of a regression coefficient.
SPSS macros to compare any two fitted values from a regression model.
Weaver, Bruce; Dubois, Sacha
2012-12-01
In regression models with first-order terms only, the coefficient for a given variable is typically interpreted as the change in the fitted value of Y for a one-unit increase in that variable, with all other variables held constant. Therefore, each regression coefficient represents the difference between two fitted values of Y. But the coefficients represent only a fraction of the possible fitted value comparisons that might be of interest to researchers. For many fitted value comparisons that are not captured by any of the regression coefficients, common statistical software packages do not provide the standard errors needed to compute confidence intervals or carry out statistical tests-particularly in more complex models that include interactions, polynomial terms, or regression splines. We describe two SPSS macros that implement a matrix algebra method for comparing any two fitted values from a regression model. The !OLScomp and !MLEcomp macros are for use with models fitted via ordinary least squares and maximum likelihood estimation, respectively. The output from the macros includes the standard error of the difference between the two fitted values, a 95% confidence interval for the difference, and a corresponding statistical test with its p-value.
Multicollinearity and Regression Analysis
NASA Astrophysics Data System (ADS)
Daoud, Jamal I.
2017-12-01
In regression analysis it is obvious to have a correlation between the response and predictor(s), but having correlation among predictors is something undesired. The number of predictors included in the regression model depends on many factors among which, historical data, experience, etc. At the end selection of most important predictors is something objective due to the researcher. Multicollinearity is a phenomena when two or more predictors are correlated, if this happens, the standard error of the coefficients will increase [8]. Increased standard errors means that the coefficients for some or all independent variables may be found to be significantly different from In other words, by overinflating the standard errors, multicollinearity makes some variables statistically insignificant when they should be significant. In this paper we focus on the multicollinearity, reasons and consequences on the reliability of the regression model.
2014-01-01
Background Support vector regression (SVR) and Gaussian process regression (GPR) were used for the analysis of electroanalytical experimental data to estimate diffusion coefficients. Results For simulated cyclic voltammograms based on the EC, Eqr, and EqrC mechanisms these regression algorithms in combination with nonlinear kernel/covariance functions yielded diffusion coefficients with higher accuracy as compared to the standard approach of calculating diffusion coefficients relying on the Nicholson-Shain equation. The level of accuracy achieved by SVR and GPR is virtually independent of the rate constants governing the respective reaction steps. Further, the reduction of high-dimensional voltammetric signals by manual selection of typical voltammetric peak features decreased the performance of both regression algorithms compared to a reduction by downsampling or principal component analysis. After training on simulated data sets, diffusion coefficients were estimated by the regression algorithms for experimental data comprising voltammetric signals for three organometallic complexes. Conclusions Estimated diffusion coefficients closely matched the values determined by the parameter fitting method, but reduced the required computational time considerably for one of the reaction mechanisms. The automated processing of voltammograms according to the regression algorithms yields better results than the conventional analysis of peak-related data. PMID:24987463
The Geometry of Enhancement in Multiple Regression
ERIC Educational Resources Information Center
Waller, Niels G.
2011-01-01
In linear multiple regression, "enhancement" is said to occur when R[superscript 2] = b[prime]r greater than r[prime]r, where b is a p x 1 vector of standardized regression coefficients and r is a p x 1 vector of correlations between a criterion y and a set of standardized regressors, x. When p = 1 then b [is congruent to] r and…
The Use of Structure Coefficients to Address Multicollinearity in Sport and Exercise Science
ERIC Educational Resources Information Center
Yeatts, Paul E.; Barton, Mitch; Henson, Robin K.; Martin, Scott B.
2017-01-01
A common practice in general linear model (GLM) analyses is to interpret regression coefficients (e.g., standardized ß weights) as indicators of variable importance. However, focusing solely on standardized beta weights may provide limited or erroneous information. For example, ß weights become increasingly unreliable when predictor variables are…
Estimation of octanol/water partition coefficients using LSER parameters
Luehrs, Dean C.; Hickey, James P.; Godbole, Kalpana A.; Rogers, Tony N.
1998-01-01
The logarithms of octanol/water partition coefficients, logKow, were regressed against the linear solvation energy relationship (LSER) parameters for a training set of 981 diverse organic chemicals. The standard deviation for logKow was 0.49. The regression equation was then used to estimate logKow for a test of 146 chemicals which included pesticides and other diverse polyfunctional compounds. Thus the octanol/water partition coefficient may be estimated by LSER parameters without elaborate software but only moderate accuracy should be expected.
Standardized Regression Coefficients as Indices of Effect Sizes in Meta-Analysis
ERIC Educational Resources Information Center
Kim, Rae Seon
2011-01-01
When conducting a meta-analysis, it is common to find many collected studies that report regression analyses, because multiple regression analysis is widely used in many fields. Meta-analysis uses effect sizes drawn from individual studies as a means of synthesizing a collection of results. However, indices of effect size from regression analyses…
Bootstrap Methods: A Very Leisurely Look.
ERIC Educational Resources Information Center
Hinkle, Dennis E.; Winstead, Wayland H.
The Bootstrap method, a computer-intensive statistical method of estimation, is illustrated using a simple and efficient Statistical Analysis System (SAS) routine. The utility of the method for generating unknown parameters, including standard errors for simple statistics, regression coefficients, discriminant function coefficients, and factor…
Design of experiments enhanced statistical process control for wind tunnel check standard testing
NASA Astrophysics Data System (ADS)
Phillips, Ben D.
The current wind tunnel check standard testing program at NASA Langley Research Center is focused on increasing data quality, uncertainty quantification and overall control and improvement of wind tunnel measurement processes. The statistical process control (SPC) methodology employed in the check standard testing program allows for the tracking of variations in measurements over time as well as an overall assessment of facility health. While the SPC approach can and does provide researchers with valuable information, it has certain limitations in the areas of process improvement and uncertainty quantification. It is thought by utilizing design of experiments methodology in conjunction with the current SPC practices that one can efficiently and more robustly characterize uncertainties and develop enhanced process improvement procedures. In this research, methodologies were developed to generate regression models for wind tunnel calibration coefficients, balance force coefficients and wind tunnel flow angularities. The coefficients of these regression models were then tracked in statistical process control charts, giving a higher level of understanding of the processes. The methodology outlined is sufficiently generic such that this research can be applicable to any wind tunnel check standard testing program.
ERIC Educational Resources Information Center
Kane, Michael T.; Mroch, Andrew A.
2010-01-01
In evaluating the relationship between two measures across different groups (i.e., in evaluating "differential validity") it is necessary to examine differences in correlation coefficients and in regression lines. Ordinary least squares (OLS) regression is the standard method for fitting lines to data, but its criterion for optimal fit…
NASA Astrophysics Data System (ADS)
Setiyorini, Anis; Suprijadi, Jadi; Handoko, Budhi
2017-03-01
Geographically Weighted Regression (GWR) is a regression model that takes into account the spatial heterogeneity effect. In the application of the GWR, inference on regression coefficients is often of interest, as is estimation and prediction of the response variable. Empirical research and studies have demonstrated that local correlation between explanatory variables can lead to estimated regression coefficients in GWR that are strongly correlated, a condition named multicollinearity. It later results on a large standard error on estimated regression coefficients, and, hence, problematic for inference on relationships between variables. Geographically Weighted Lasso (GWL) is a method which capable to deal with spatial heterogeneity and local multicollinearity in spatial data sets. GWL is a further development of GWR method, which adds a LASSO (Least Absolute Shrinkage and Selection Operator) constraint in parameter estimation. In this study, GWL will be applied by using fixed exponential kernel weights matrix to establish a poverty modeling of Java Island, Indonesia. The results of applying the GWL to poverty datasets show that this method stabilizes regression coefficients in the presence of multicollinearity and produces lower prediction and estimation error of the response variable than GWR does.
Teaching Students Not to Dismiss the Outermost Observations in Regressions
ERIC Educational Resources Information Center
Kasprowicz, Tomasz; Musumeci, Jim
2015-01-01
One econometric rule of thumb is that greater dispersion in observations of the independent variable improves estimates of regression coefficients and therefore produces better results, i.e., lower standard errors of the estimates. Nevertheless, students often seem to mistrust precisely the observations that contribute the most to this greater…
Testing for gene-environment interaction under exposure misspecification.
Sun, Ryan; Carroll, Raymond J; Christiani, David C; Lin, Xihong
2017-11-09
Complex interplay between genetic and environmental factors characterizes the etiology of many diseases. Modeling gene-environment (GxE) interactions is often challenged by the unknown functional form of the environment term in the true data-generating mechanism. We study the impact of misspecification of the environmental exposure effect on inference for the GxE interaction term in linear and logistic regression models. We first examine the asymptotic bias of the GxE interaction regression coefficient, allowing for confounders as well as arbitrary misspecification of the exposure and confounder effects. For linear regression, we show that under gene-environment independence and some confounder-dependent conditions, when the environment effect is misspecified, the regression coefficient of the GxE interaction can be unbiased. However, inference on the GxE interaction is still often incorrect. In logistic regression, we show that the regression coefficient is generally biased if the genetic factor is associated with the outcome directly or indirectly. Further, we show that the standard robust sandwich variance estimator for the GxE interaction does not perform well in practical GxE studies, and we provide an alternative testing procedure that has better finite sample properties. © 2017, The International Biometric Society.
Regression Simulation Model. Appendix X. Users Manual,
1981-03-01
change as the prediction equations become refined. Whereas no notice will be provided when the changes are made, the programs will be modified such that...NATIONAL BUREAU Of STANDARDS 1963 A ___,_ __ _ __ _ . APPENDIX X ( R4/ EGRESSION IMULATION ’jDEL. Ape’A ’) 7 USERS MANUA submitted to The Great River...regression analysis and to establish a prediction equation (model). The prediction equation contains the partial regression coefficients (B-weights) which
NASA Technical Reports Server (NTRS)
Stolzer, Alan J.; Halford, Carl
2007-01-01
In a previous study, multiple regression techniques were applied to Flight Operations Quality Assurance-derived data to develop parsimonious model(s) for fuel consumption on the Boeing 757 airplane. The present study examined several data mining algorithms, including neural networks, on the fuel consumption problem and compared them to the multiple regression results obtained earlier. Using regression methods, parsimonious models were obtained that explained approximately 85% of the variation in fuel flow. In general data mining methods were more effective in predicting fuel consumption. Classification and Regression Tree methods reported correlation coefficients of .91 to .92, and General Linear Models and Multilayer Perceptron neural networks reported correlation coefficients of about .99. These data mining models show great promise for use in further examining large FOQA databases for operational and safety improvements.
Effect of Contact Damage on the Strength of Ceramic Materials.
1982-10-01
variables that are important to erosion, and a multivariate , linear regression analysis is used to fit the data to the dimensional analysis. The...of Equations 7 and 8 by a multivariable regression analysis (room tem- perature data) Exponent Regression Standard error Computed coefficient of...1980) 593. WEAVER, Proc. Brit. Ceram. Soc. 22 (1973) 125. 39. P. W. BRIDGMAN, "Dimensional Analaysis ", (Yale 18. R. W. RICE, S. W. FREIMAN and P. F
Wang, Anxin; Li, Zhifang; Yang, Yuling; Chen, Guojuan; Wang, Chunxue; Wu, Yuntao; Ruan, Chunyu; Liu, Yan; Wang, Yilong; Wu, Shouling
2016-01-01
To investigate the relationship between baseline systolic blood pressure (SBP) and visit-to-visit blood pressure variability in a general population. This is a prospective longitudinal cohort study on cardiovascular risk factors and cardiovascular or cerebrovascular events. Study participants attended a face-to-face interview every 2 years. Blood pressure variability was defined using the standard deviation and coefficient of variation of all SBP values at baseline and follow-up visits. The coefficient of variation is the ratio of the standard deviation to the mean SBP. We used multivariate linear regression models to test the relationships between SBP and standard deviation, and between SBP and coefficient of variation. Approximately 43,360 participants (mean age: 48.2±11.5 years) were selected. In multivariate analysis, after adjustment for potential confounders, baseline SBPs <120 mmHg were inversely related to standard deviation (P<0.001) and coefficient of variation (P<0.001). In contrast, baseline SBPs ≥140 mmHg were significantly positively associated with standard deviation (P<0.001) and coefficient of variation (P<0.001). Baseline SBPs of 120-140 mmHg were associated with the lowest standard deviation and coefficient of variation. The associations between baseline SBP and standard deviation, and between SBP and coefficient of variation during follow-ups showed a U curve. Both lower and higher baseline SBPs were associated with increased blood pressure variability. To control blood pressure variability, a good target SBP range for a general population might be 120-139 mmHg.
Chen, Gang; Wu, Yulian; Wang, Tao; Liang, Jixing; Lin, Wei; Li, Liantao; Wen, Junping; Lin, Lixiang; Huang, Huibin
2012-10-01
The role of the endogenous secretory receptor for advanced glycation end products (esRAGE) in depression of diabetes patients and its clinical significance are unclear. This study investigated the role of serum esRAGE in patients with type 2 diabetes mellitus with depression in the Chinese population. One hundred nineteen hospitalized patients with type 2 diabetes were recruited at Fujian Provincial Hospital (Fuzhou, China) from February 2010 to January 2011. All selected subjects were assessed with the Hamilton Rating Scale for Depression (HAMD). Among them, 71 patients with both type 2 diabetes and depression were included. All selected subjects were examined for the following: esRAGE concentration, glycosylated hemoglobin (HbA1c), blood lipids, C-reactive protein, trace of albumin in urine, and carotid artery intima-media thickness (IMT). Association between serum esRAGE levels and risk of type 2 diabetes mellitus with depression was also analyzed. There were statistically significant differences in gender, age, body mass index, waist circumference, and treatment methods between the group with depression and the group without depression (P<0.05). Multiple linear regression analysis showed that HAMD scores were negatively correlated with esRAGE levels (standard regression coefficient -0.270, P<0.01). HAMD-17 scores were positively correlated with IMT (standard regression coefficient 0.183, P<0.05) and with HbA1c (standard regression coefficient 0.314, P<0.01). Female gender, younger age, obesity, poor glycemic control, complications, and insulin therapy are all risk factors of type 2 diabetes mellitus with combined depression in the Chinese population. Inflammation and atherosclerosis play an important role in the pathogenesis of depression. esRAGE is a protective factor of depression among patients who have type 2 diabetes.
Cerebrospinal fluid norepinephrine and cognition in subjects across the adult age span
Wang, Lucy Y.; Murphy, Richard R.; Hanscom, Brett; Li, Ge; Millard, Steven P.; Petrie, Eric C.; Galasko, Douglas R.; Sikkema, Carl; Raskind, Murray A.; Wilkinson, Charles W.; Peskind, Elaine R.
2013-01-01
Adequate central nervous system noradrenergic activity enhances cognition, but excessive noradrenergic activity may have adverse effects on cognition. Previous studies have also demonstrated that noradrenergic activity is higher in older than younger adults. We aimed to determine relationships between cerebrospinal fluid (CSF) norepinephrine (NE) concentration and cognitive performance by using data from a CSF bank that includes samples from 258 cognitively normal participants aged 21–100 years. After adjusting for age, gender, education, and ethnicity, higher CSF NE levels (units of 100 pg/mL) are associated with poorer performance on tests of attention, processing speed, and executive function (Trail Making A: regression coefficient 1.5, standard error [SE] 0.77, p = 0.046; Trail Making B: regression coefficient 5.0, SE 2.2, p = 0.024; Stroop Word-Color Interference task: regression coefficient 6.1, SE 2.0, p = 0.003). Findings are consistent with the earlier literature relating excess noradrenergic activity with cognitive impairment. PMID:23639207
Cerebrospinal fluid norepinephrine and cognition in subjects across the adult age span.
Wang, Lucy Y; Murphy, Richard R; Hanscom, Brett; Li, Ge; Millard, Steven P; Petrie, Eric C; Galasko, Douglas R; Sikkema, Carl; Raskind, Murray A; Wilkinson, Charles W; Peskind, Elaine R
2013-10-01
Adequate central nervous system noradrenergic activity enhances cognition, but excessive noradrenergic activity may have adverse effects on cognition. Previous studies have also demonstrated that noradrenergic activity is higher in older than younger adults. We aimed to determine relationships between cerebrospinal fluid (CSF) norepinephrine (NE) concentration and cognitive performance by using data from a CSF bank that includes samples from 258 cognitively normal participants aged 21-100 years. After adjusting for age, gender, education, and ethnicity, higher CSF NE levels (units of 100 pg/mL) are associated with poorer performance on tests of attention, processing speed, and executive function (Trail Making A: regression coefficient 1.5, standard error [SE] 0.77, p = 0.046; Trail Making B: regression coefficient 5.0, SE 2.2, p = 0.024; Stroop Word-Color Interference task: regression coefficient 6.1, SE 2.0, p = 0.003). Findings are consistent with the earlier literature relating excess noradrenergic activity with cognitive impairment. Published by Elsevier Inc.
Interquantile Shrinkage in Regression Models
Jiang, Liewen; Wang, Huixia Judy; Bondell, Howard D.
2012-01-01
Conventional analysis using quantile regression typically focuses on fitting the regression model at different quantiles separately. However, in situations where the quantile coefficients share some common feature, joint modeling of multiple quantiles to accommodate the commonality often leads to more efficient estimation. One example of common features is that a predictor may have a constant effect over one region of quantile levels but varying effects in other regions. To automatically perform estimation and detection of the interquantile commonality, we develop two penalization methods. When the quantile slope coefficients indeed do not change across quantile levels, the proposed methods will shrink the slopes towards constant and thus improve the estimation efficiency. We establish the oracle properties of the two proposed penalization methods. Through numerical investigations, we demonstrate that the proposed methods lead to estimations with competitive or higher efficiency than the standard quantile regression estimation in finite samples. Supplemental materials for the article are available online. PMID:24363546
Standardization of domestic frying processes by an engineering approach.
Franke, K; Strijowski, U
2011-05-01
An approach was developed to enable a better standardization of domestic frying of potato products. For this purpose, 5 domestic fryers differing in heating power and oil capacity were used. A very defined frying process using a highly standardized model product and a broad range of frying conditions was carried out in these fryers and the development of browning representing an important quality parameter was measured. Product-to-oil ratio, oil temperature, and frying time were varied. Quite different color changes were measured in the different fryers although the same frying process parameters were applied. The specific energy consumption for water evaporation (spECWE) during frying related to product amount was determined for all frying processes to define an engineering parameter for characterizing the frying process. A quasi-linear regression approach was applied to calculate this parameter from frying process settings and fryer properties. The high significance of the regression coefficients and a coefficient of determination close to unity confirmed the suitability of this approach. Based on this regression equation, curves for standard frying conditions (SFC curves) were calculated which describe the frying conditions required to obtain the same level of spECWE in the different domestic fryers. Comparison of browning results from the different fryers operated at conditions near the SFC curves confirmed the applicability of the approach. © 2011 Institute of Food Technologists®
Klaeboe, Ronny
2005-09-01
When Gardermoen replaced Fornebu as the main airport for Oslo, aircraft noise levels increased in recreational areas near Gardermoen and decreased in areas near Fornebu. Krog and Engdahl [J. Acoust. Soc. Am. 116, 323-333 (2004)] estimate that recreationists' annoyance from aircraft noise in these areas changed more than would be anticipated from the actual noise changes. However, the sizes of their estimated "situation" effects are not credible. One possible reason for the anomalous results is that standard regression assumptions become violated when motivational factors are inserted into the regression model. Standardized regression coefficients (beta values) should also not be utilized for comparisons across equations.
Zhang, Hualing
2014-03-01
To learn characteristics and their mutual relations of self-esteem, self-harmony and interpersonal-harmony of university students, in order to provide the basis for mental health education. With a stratified cluster random sampling method, a questionnaire survey was conducted in 820 university students from 16 classes of four universities, chosen from 30 universities in Anhui Province. Meanwhile, Rosenberg Self-esteem Scale, Self-harmony Scale and Interpersonal-harmony Diagnostic Scale were used for assessment. Self-esteem of university students has an average score of (30.71 +/- 4.77), higher than median thoery 25, and there existed statistical significance in the dimensions of gender (P = 0.004), origin (P = 0.038) and only-child (P = 0.005). University students' self-harmony has an average score of (98.66 +/- 8.69), among which there were 112 students in the group of low score, counting for 13.7%, 442 in that of middle score, counting for 53.95%, 265 in that of high score, counting for 32.33%. And there existed no statistical significance in the total-score of self-harmony and score differences from most of subscales in the dimention of gender and origin, but satistical significance did exist in the dimention of only-child (P = 0.004). It was statistically significant (P = 0.006) on the "stereotype" subscales, on the differences between university students from urban areas and rural areas. Every dimension of self-esteem and self -harmony and interpersonal harmony was correlated and statistically significant. Multiple regression analysis found that when there was a variable in self-esteem, the amount of the variable of self-harmony for explaination of interpersonal conversation dropped from 22.6% to 12%, and standard regression coefficient changing from 0.087 to 0.035. The trouble of interpersonal dating fell from 27.6% to 13.1%, the standard regression coefficient changing from 0.104 to 0.019. The bother of treating people fell from 30.9% to 15%, and the standard regression coefficient changing from 0.079 to 0.020. The problem of heterosexual contact fell from 23.4% to 17.3%, and the standard regression coefficient changing from 0.095 to 0.024. Self-esteem was a mediator variable between self-harmony and interpersonal-harmony. By cultivating university students' level of self-esteem to achieve their self-harmony and interpersonal-harmony, university students' mental health level can be improved.
Model averaging and muddled multimodel inferences.
Cade, Brian S
2015-09-01
Three flawed practices associated with model averaging coefficients for predictor variables in regression models commonly occur when making multimodel inferences in analyses of ecological data. Model-averaged regression coefficients based on Akaike information criterion (AIC) weights have been recommended for addressing model uncertainty but they are not valid, interpretable estimates of partial effects for individual predictors when there is multicollinearity among the predictor variables. Multicollinearity implies that the scaling of units in the denominators of the regression coefficients may change across models such that neither the parameters nor their estimates have common scales, therefore averaging them makes no sense. The associated sums of AIC model weights recommended to assess relative importance of individual predictors are really a measure of relative importance of models, with little information about contributions by individual predictors compared to other measures of relative importance based on effects size or variance reduction. Sometimes the model-averaged regression coefficients for predictor variables are incorrectly used to make model-averaged predictions of the response variable when the models are not linear in the parameters. I demonstrate the issues with the first two practices using the college grade point average example extensively analyzed by Burnham and Anderson. I show how partial standard deviations of the predictor variables can be used to detect changing scales of their estimates with multicollinearity. Standardizing estimates based on partial standard deviations for their variables can be used to make the scaling of the estimates commensurate across models, a necessary but not sufficient condition for model averaging of the estimates to be sensible. A unimodal distribution of estimates and valid interpretation of individual parameters are additional requisite conditions. The standardized estimates or equivalently the t statistics on unstandardized estimates also can be used to provide more informative measures of relative importance than sums of AIC weights. Finally, I illustrate how seriously compromised statistical interpretations and predictions can be for all three of these flawed practices by critiquing their use in a recent species distribution modeling technique developed for predicting Greater Sage-Grouse (Centrocercus urophasianus) distribution in Colorado, USA. These model averaging issues are common in other ecological literature and ought to be discontinued if we are to make effective scientific contributions to ecological knowledge and conservation of natural resources.
Model averaging and muddled multimodel inferences
Cade, Brian S.
2015-01-01
Three flawed practices associated with model averaging coefficients for predictor variables in regression models commonly occur when making multimodel inferences in analyses of ecological data. Model-averaged regression coefficients based on Akaike information criterion (AIC) weights have been recommended for addressing model uncertainty but they are not valid, interpretable estimates of partial effects for individual predictors when there is multicollinearity among the predictor variables. Multicollinearity implies that the scaling of units in the denominators of the regression coefficients may change across models such that neither the parameters nor their estimates have common scales, therefore averaging them makes no sense. The associated sums of AIC model weights recommended to assess relative importance of individual predictors are really a measure of relative importance of models, with little information about contributions by individual predictors compared to other measures of relative importance based on effects size or variance reduction. Sometimes the model-averaged regression coefficients for predictor variables are incorrectly used to make model-averaged predictions of the response variable when the models are not linear in the parameters. I demonstrate the issues with the first two practices using the college grade point average example extensively analyzed by Burnham and Anderson. I show how partial standard deviations of the predictor variables can be used to detect changing scales of their estimates with multicollinearity. Standardizing estimates based on partial standard deviations for their variables can be used to make the scaling of the estimates commensurate across models, a necessary but not sufficient condition for model averaging of the estimates to be sensible. A unimodal distribution of estimates and valid interpretation of individual parameters are additional requisite conditions. The standardized estimates or equivalently the tstatistics on unstandardized estimates also can be used to provide more informative measures of relative importance than sums of AIC weights. Finally, I illustrate how seriously compromised statistical interpretations and predictions can be for all three of these flawed practices by critiquing their use in a recent species distribution modeling technique developed for predicting Greater Sage-Grouse (Centrocercus urophasianus) distribution in Colorado, USA. These model averaging issues are common in other ecological literature and ought to be discontinued if we are to make effective scientific contributions to ecological knowledge and conservation of natural resources.
Simple and multiple linear regression: sample size considerations.
Hanley, James A
2016-11-01
The suggested "two subjects per variable" (2SPV) rule of thumb in the Austin and Steyerberg article is a chance to bring out some long-established and quite intuitive sample size considerations for both simple and multiple linear regression. This article distinguishes two of the major uses of regression models that imply very different sample size considerations, neither served well by the 2SPV rule. The first is etiological research, which contrasts mean Y levels at differing "exposure" (X) values and thus tends to focus on a single regression coefficient, possibly adjusted for confounders. The second research genre guides clinical practice. It addresses Y levels for individuals with different covariate patterns or "profiles." It focuses on the profile-specific (mean) Y levels themselves, estimating them via linear compounds of regression coefficients and covariates. By drawing on long-established closed-form variance formulae that lie beneath the standard errors in multiple regression, and by rearranging them for heuristic purposes, one arrives at quite intuitive sample size considerations for both research genres. Copyright © 2016 Elsevier Inc. All rights reserved.
Daily magnesium intake and serum magnesium concentration among Japanese people.
Akizawa, Yoriko; Koizumi, Sadayuki; Itokawa, Yoshinori; Ojima, Toshiyuki; Nakamura, Yosikazu; Tamura, Tarou; Kusaka, Yukinori
2008-01-01
The vitamins and minerals that are deficient in the daily diet of a normal adult remain unknown. To answer this question, we conducted a population survey focusing on the relationship between dietary magnesium intake and serum magnesium level. The subjects were 62 individuals from Fukui Prefecture who participated in the 1998 National Nutrition Survey. The survey investigated the physical status, nutritional status, and dietary data of the subjects. Holidays and special occasions were avoided, and a day when people are most likely to be on an ordinary diet was selected as the survey date. The mean (+/-standard deviation) daily magnesium intake was 322 (+/-132), 323 (+/-163), and 322 (+/-147) mg/day for men, women, and the entire group, respectively. The mean (+/-standard deviation) serum magnesium concentration was 20.69 (+/-2.83), 20.69 (+/-2.88), and 20.69 (+/-2.83) ppm for men, women, and the entire group, respectively. The distribution of serum magnesium concentration was normal. Dietary magnesium intake showed a log-normal distribution, which was then transformed by logarithmic conversion for examining the regression coefficients. The slope of the regression line between the serum magnesium concentration (Y ppm) and daily magnesium intake (X mg) was determined using the formula Y = 4.93 (log(10)X) + 8.49. The coefficient of correlation (r) was 0.29. A regression line (Y = 14.65X + 19.31) was observed between the daily intake of magnesium (Y mg) and serum magnesium concentration (X ppm). The coefficient of correlation was 0.28. The daily magnesium intake correlated with serum magnesium concentration, and a linear regression model between them was proposed.
Modified Regression Correlation Coefficient for Poisson Regression Model
NASA Astrophysics Data System (ADS)
Kaengthong, Nattacha; Domthong, Uthumporn
2017-09-01
This study gives attention to indicators in predictive power of the Generalized Linear Model (GLM) which are widely used; however, often having some restrictions. We are interested in regression correlation coefficient for a Poisson regression model. This is a measure of predictive power, and defined by the relationship between the dependent variable (Y) and the expected value of the dependent variable given the independent variables [E(Y|X)] for the Poisson regression model. The dependent variable is distributed as Poisson. The purpose of this research was modifying regression correlation coefficient for Poisson regression model. We also compare the proposed modified regression correlation coefficient with the traditional regression correlation coefficient in the case of two or more independent variables, and having multicollinearity in independent variables. The result shows that the proposed regression correlation coefficient is better than the traditional regression correlation coefficient based on Bias and the Root Mean Square Error (RMSE).
Neither fixed nor random: weighted least squares meta-regression.
Stanley, T D; Doucouliagos, Hristos
2017-03-01
Our study revisits and challenges two core conventional meta-regression estimators: the prevalent use of 'mixed-effects' or random-effects meta-regression analysis and the correction of standard errors that defines fixed-effects meta-regression analysis (FE-MRA). We show how and explain why an unrestricted weighted least squares MRA (WLS-MRA) estimator is superior to conventional random-effects (or mixed-effects) meta-regression when there is publication (or small-sample) bias that is as good as FE-MRA in all cases and better than fixed effects in most practical applications. Simulations and statistical theory show that WLS-MRA provides satisfactory estimates of meta-regression coefficients that are practically equivalent to mixed effects or random effects when there is no publication bias. When there is publication selection bias, WLS-MRA always has smaller bias than mixed effects or random effects. In practical applications, an unrestricted WLS meta-regression is likely to give practically equivalent or superior estimates to fixed-effects, random-effects, and mixed-effects meta-regression approaches. However, random-effects meta-regression remains viable and perhaps somewhat preferable if selection for statistical significance (publication bias) can be ruled out and when random, additive normal heterogeneity is known to directly affect the 'true' regression coefficient. Copyright © 2016 John Wiley & Sons, Ltd. Copyright © 2016 John Wiley & Sons, Ltd.
Practical Session: Simple Linear Regression
NASA Astrophysics Data System (ADS)
Clausel, M.; Grégoire, G.
2014-12-01
Two exercises are proposed to illustrate the simple linear regression. The first one is based on the famous Galton's data set on heredity. We use the lm R command and get coefficients estimates, standard error of the error, R2, residuals …In the second example, devoted to data related to the vapor tension of mercury, we fit a simple linear regression, predict values, and anticipate on multiple linear regression. This pratical session is an excerpt from practical exercises proposed by A. Dalalyan at EPNC (see Exercises 1 and 2 of http://certis.enpc.fr/~dalalyan/Download/TP_ENPC_4.pdf).
Investigating bias in squared regression structure coefficients
Nimon, Kim F.; Zientek, Linda R.; Thompson, Bruce
2015-01-01
The importance of structure coefficients and analogs of regression weights for analysis within the general linear model (GLM) has been well-documented. The purpose of this study was to investigate bias in squared structure coefficients in the context of multiple regression and to determine if a formula that had been shown to correct for bias in squared Pearson correlation coefficients and coefficients of determination could be used to correct for bias in squared regression structure coefficients. Using data from a Monte Carlo simulation, this study found that squared regression structure coefficients corrected with Pratt's formula produced less biased estimates and might be more accurate and stable estimates of population squared regression structure coefficients than estimates with no such corrections. While our findings are in line with prior literature that identified multicollinearity as a predictor of bias in squared regression structure coefficients but not coefficients of determination, the findings from this study are unique in that the level of predictive power, number of predictors, and sample size were also observed to contribute bias in squared regression structure coefficients. PMID:26217273
Guo, Changning; Doub, William H; Kauffman, John F
2010-08-01
Monte Carlo simulations were applied to investigate the propagation of uncertainty in both input variables and response measurements on model prediction for nasal spray product performance design of experiment (DOE) models in the first part of this study, with an initial assumption that the models perfectly represent the relationship between input variables and the measured responses. In this article, we discard the initial assumption, and extended the Monte Carlo simulation study to examine the influence of both input variable variation and product performance measurement variation on the uncertainty in DOE model coefficients. The Monte Carlo simulations presented in this article illustrate the importance of careful error propagation during product performance modeling. Our results show that the error estimates based on Monte Carlo simulation result in smaller model coefficient standard deviations than those from regression methods. This suggests that the estimated standard deviations from regression may overestimate the uncertainties in the model coefficients. Monte Carlo simulations provide a simple software solution to understand the propagation of uncertainty in complex DOE models so that design space can be specified with statistically meaningful confidence levels. (c) 2010 Wiley-Liss, Inc. and the American Pharmacists Association
Prediction models for clustered data: comparison of a random intercept and standard regression model
2013-01-01
Background When study data are clustered, standard regression analysis is considered inappropriate and analytical techniques for clustered data need to be used. For prediction research in which the interest of predictor effects is on the patient level, random effect regression models are probably preferred over standard regression analysis. It is well known that the random effect parameter estimates and the standard logistic regression parameter estimates are different. Here, we compared random effect and standard logistic regression models for their ability to provide accurate predictions. Methods Using an empirical study on 1642 surgical patients at risk of postoperative nausea and vomiting, who were treated by one of 19 anesthesiologists (clusters), we developed prognostic models either with standard or random intercept logistic regression. External validity of these models was assessed in new patients from other anesthesiologists. We supported our results with simulation studies using intra-class correlation coefficients (ICC) of 5%, 15%, or 30%. Standard performance measures and measures adapted for the clustered data structure were estimated. Results The model developed with random effect analysis showed better discrimination than the standard approach, if the cluster effects were used for risk prediction (standard c-index of 0.69 versus 0.66). In the external validation set, both models showed similar discrimination (standard c-index 0.68 versus 0.67). The simulation study confirmed these results. For datasets with a high ICC (≥15%), model calibration was only adequate in external subjects, if the used performance measure assumed the same data structure as the model development method: standard calibration measures showed good calibration for the standard developed model, calibration measures adapting the clustered data structure showed good calibration for the prediction model with random intercept. Conclusion The models with random intercept discriminate better than the standard model only if the cluster effect is used for predictions. The prediction model with random intercept had good calibration within clusters. PMID:23414436
Bouwmeester, Walter; Twisk, Jos W R; Kappen, Teus H; van Klei, Wilton A; Moons, Karel G M; Vergouwe, Yvonne
2013-02-15
When study data are clustered, standard regression analysis is considered inappropriate and analytical techniques for clustered data need to be used. For prediction research in which the interest of predictor effects is on the patient level, random effect regression models are probably preferred over standard regression analysis. It is well known that the random effect parameter estimates and the standard logistic regression parameter estimates are different. Here, we compared random effect and standard logistic regression models for their ability to provide accurate predictions. Using an empirical study on 1642 surgical patients at risk of postoperative nausea and vomiting, who were treated by one of 19 anesthesiologists (clusters), we developed prognostic models either with standard or random intercept logistic regression. External validity of these models was assessed in new patients from other anesthesiologists. We supported our results with simulation studies using intra-class correlation coefficients (ICC) of 5%, 15%, or 30%. Standard performance measures and measures adapted for the clustered data structure were estimated. The model developed with random effect analysis showed better discrimination than the standard approach, if the cluster effects were used for risk prediction (standard c-index of 0.69 versus 0.66). In the external validation set, both models showed similar discrimination (standard c-index 0.68 versus 0.67). The simulation study confirmed these results. For datasets with a high ICC (≥15%), model calibration was only adequate in external subjects, if the used performance measure assumed the same data structure as the model development method: standard calibration measures showed good calibration for the standard developed model, calibration measures adapting the clustered data structure showed good calibration for the prediction model with random intercept. The models with random intercept discriminate better than the standard model only if the cluster effect is used for predictions. The prediction model with random intercept had good calibration within clusters.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Brink, Carsten, E-mail: carsten.brink@rsyd.dk; Laboratory of Radiation Physics, Odense University Hospital; Bernchou, Uffe
2014-07-15
Purpose: Large interindividual variations in volume regression of non-small cell lung cancer (NSCLC) are observable on standard cone beam computed tomography (CBCT) during fractionated radiation therapy. Here, a method for automated assessment of tumor volume regression is presented and its potential use in response adapted personalized radiation therapy is evaluated empirically. Methods and Materials: Automated deformable registration with calculation of the Jacobian determinant was applied to serial CBCT scans in a series of 99 patients with NSCLC. Tumor volume at the end of treatment was estimated on the basis of the first one third and two thirds of the scans.more » The concordance between estimated and actual relative volume at the end of radiation therapy was quantified by Pearson's correlation coefficient. On the basis of the estimated relative volume, the patients were stratified into 2 groups having volume regressions below or above the population median value. Kaplan-Meier plots of locoregional disease-free rate and overall survival in the 2 groups were used to evaluate the predictive value of tumor regression during treatment. Cox proportional hazards model was used to adjust for other clinical characteristics. Results: Automatic measurement of the tumor regression from standard CBCT images was feasible. Pearson's correlation coefficient between manual and automatic measurement was 0.86 in a sample of 9 patients. Most patients experienced tumor volume regression, and this could be quantified early into the treatment course. Interestingly, patients with pronounced volume regression had worse locoregional tumor control and overall survival. This was significant on patient with non-adenocarcinoma histology. Conclusions: Evaluation of routinely acquired CBCT images during radiation therapy provides biological information on the specific tumor. This could potentially form the basis for personalized response adaptive therapy.« less
The problem of natural funnel asymmetries: a simulation analysis of meta-analysis in macroeconomics.
Callot, Laurent; Paldam, Martin
2011-06-01
Effect sizes in macroeconomic are estimated by regressions on data published by statistical agencies. Funnel plots are a representation of the distribution of the resulting regression coefficients. They are normally much wider than predicted by the t-ratio of the coefficients and often asymmetric. The standard method of meta-analysts in economics assumes that the asymmetries are because of publication bias causing censoring and adjusts the average accordingly. The paper shows that some funnel asymmetries may be 'natural' so that they occur without censoring. We investigate such asymmetries by simulating funnels by pairs of data generating processes (DGPs) and estimating models (EMs), in which the EM has the problem that it disregards a property of the DGP. The problems are data dependency, structural breaks, non-normal residuals, non-linearity, and omitted variables. We show that some of these problems generate funnel asymmetries. When they do, the standard method often fails. Copyright © 2011 John Wiley & Sons, Ltd. Copyright © 2011 John Wiley & Sons, Ltd.
NASA Technical Reports Server (NTRS)
Clark, P. E.; Andre, C. G.; Adler, I.; Weidner, J.; Podwysocki, M.
1976-01-01
The positive correlation between Al/Si X-ray fluorescence intensity ratios determined during the Apollo 15 lunar mission and a broad-spectrum visible albedo of the moon is quantitatively established. Linear regression analysis performed on 246 1 degree geographic cells of X-ray fluorescence intensity and visible albedo data points produced a statistically significant correlation coefficient of .78. Three distinct distributions of data were identified as (1) within one standard deviation of the regression line, (2) greater than one standard deviation below the line, and (3) greater than one standard deviation above the line. The latter two distributions of data were found to occupy distinct geographic areas in the Palus Somni region.
Daily Magnesium Intake and Serum Magnesium Concentration among Japanese People
Akizawa, Yoriko; Koizumi, Sadayuki; Itokawa, Yoshinori; Ojima, Toshiyuki; Nakamura, Yosikazu; Tamura, Tarou; Kusaka, Yukinori
2008-01-01
Background The vitamins and minerals that are deficient in the daily diet of a normal adult remain unknown. To answer this question, we conducted a population survey focusing on the relationship between dietary magnesium intake and serum magnesium level. Methods The subjects were 62 individuals from Fukui Prefecture who participated in the 1998 National Nutrition Survey. The survey investigated the physical status, nutritional status, and dietary data of the subjects. Holidays and special occasions were avoided, and a day when people are most likely to be on an ordinary diet was selected as the survey date. Results The mean (±standard deviation) daily magnesium intake was 322 (±132), 323 (±163), and 322 (±147) mg/day for men, women, and the entire group, respectively. The mean (±standard deviation) serum magnesium concentration was 20.69 (±2.83), 20.69 (±2.88), and 20.69 (±2.83) ppm for men, women, and the entire group, respectively. The distribution of serum magnesium concentration was normal. Dietary magnesium intake showed a log-normal distribution, which was then transformed by logarithmic conversion for examining the regression coefficients. The slope of the regression line between the serum magnesium concentration (Y ppm) and daily magnesium intake (X mg) was determined using the formula Y = 4.93 (log10X) + 8.49. The coefficient of correlation (r) was 0.29. A regression line (Y = 14.65X + 19.31) was observed between the daily intake of magnesium (Y mg) and serum magnesium concentration (X ppm). The coefficient of correlation was 0.28. Conclusion The daily magnesium intake correlated with serum magnesium concentration, and a linear regression model between them was proposed. PMID:18635902
Inequality in Maternal Mortality in Iran: An Ecologic Study
Tajik, Parvin; Nedjat, Saharnaz; Afshar, Nozhat Emami; Changizi, Nasrin; Yazdizadeh, Bahareh; Azemikhah, Arash; Aamrolalaei, Sima; Majdzadeh, Reza
2012-01-01
Background: Maternal mortality (MM) is an avoidable death and there is national, international and political commitment to reduce it. The objective of this study is to examine the relation of MM to socioeconomic factors and its inequality in Iran's provinces at an ecologic level. Methods: The overall MM from each province was considered for 3 years from 2004 to 2006. The five independent variables whose relations were studied included the literacy rate among men and women in each province, mean annual household income per capita, Gini coefficients in each province, and Human Development Index (HDI). The correlation of Maternal Mortality Ratio (MMR) to the above five variables was evaluated through Pearson's correlation coefficient (simple and weighted for each province's population) and linear regression – by considering MMR as the dependent variable and the Gini coefficient, HDI, and difference in literacy rate among men and women as the independent variables. Results: The mean MMR in the years 2004–2006 was 24.7 in 100,000 live births. The correlation coefficients between MMR and literacy rate among women, literacy rate among men, the mean annual household income per capita, Gini coefficient and HDI were 0.82, 0.90, –0.61, 0.52 and –0.77, respectively. Based on multivariate regression, MMR was significantly associated with HDI (standardized B=–0.93) and difference in literacy rate among men and women (standardized B=–0.47). However, MMR was not significantly associated with the Gini coefficient. Conclusion: This study shows the association between socioeconomic variables and their inequalities with MMR in Iran's provinces at an ecologic level. In addition to the other direct interventions performed to reduce MM, it seems essential to especially focus on more distal factors influencing MMR. PMID:22347608
Panel regressions to estimate low-flow response to rainfall variability in ungaged basins
Bassiouni, Maoya; Vogel, Richard M.; Archfield, Stacey A.
2016-01-01
Multicollinearity and omitted-variable bias are major limitations to developing multiple linear regression models to estimate streamflow characteristics in ungaged areas and varying rainfall conditions. Panel regression is used to overcome limitations of traditional regression methods, and obtain reliable model coefficients, in particular to understand the elasticity of streamflow to rainfall. Using annual rainfall and selected basin characteristics at 86 gaged streams in the Hawaiian Islands, regional regression models for three stream classes were developed to estimate the annual low-flow duration discharges. Three panel-regression structures (random effects, fixed effects, and pooled) were compared to traditional regression methods, in which space is substituted for time. Results indicated that panel regression generally was able to reproduce the temporal behavior of streamflow and reduce the standard errors of model coefficients compared to traditional regression, even for models in which the unobserved heterogeneity between streams is significant and the variance inflation factor for rainfall is much greater than 10. This is because both spatial and temporal variability were better characterized in panel regression. In a case study, regional rainfall elasticities estimated from panel regressions were applied to ungaged basins on Maui, using available rainfall projections to estimate plausible changes in surface-water availability and usable stream habitat for native species. The presented panel-regression framework is shown to offer benefits over existing traditional hydrologic regression methods for developing robust regional relations to investigate streamflow response in a changing climate.
Panel regressions to estimate low-flow response to rainfall variability in ungaged basins
NASA Astrophysics Data System (ADS)
Bassiouni, Maoya; Vogel, Richard M.; Archfield, Stacey A.
2016-12-01
Multicollinearity and omitted-variable bias are major limitations to developing multiple linear regression models to estimate streamflow characteristics in ungaged areas and varying rainfall conditions. Panel regression is used to overcome limitations of traditional regression methods, and obtain reliable model coefficients, in particular to understand the elasticity of streamflow to rainfall. Using annual rainfall and selected basin characteristics at 86 gaged streams in the Hawaiian Islands, regional regression models for three stream classes were developed to estimate the annual low-flow duration discharges. Three panel-regression structures (random effects, fixed effects, and pooled) were compared to traditional regression methods, in which space is substituted for time. Results indicated that panel regression generally was able to reproduce the temporal behavior of streamflow and reduce the standard errors of model coefficients compared to traditional regression, even for models in which the unobserved heterogeneity between streams is significant and the variance inflation factor for rainfall is much greater than 10. This is because both spatial and temporal variability were better characterized in panel regression. In a case study, regional rainfall elasticities estimated from panel regressions were applied to ungaged basins on Maui, using available rainfall projections to estimate plausible changes in surface-water availability and usable stream habitat for native species. The presented panel-regression framework is shown to offer benefits over existing traditional hydrologic regression methods for developing robust regional relations to investigate streamflow response in a changing climate.
Poor methodological quality and reporting standards of systematic reviews in burn care management.
Wasiak, Jason; Tyack, Zephanie; Ware, Robert; Goodwin, Nicholas; Faggion, Clovis M
2017-10-01
The methodological and reporting quality of burn-specific systematic reviews has not been established. The aim of this study was to evaluate the methodological quality of systematic reviews in burn care management. Computerised searches were performed in Ovid MEDLINE, Ovid EMBASE and The Cochrane Library through to February 2016 for systematic reviews relevant to burn care using medical subject and free-text terms such as 'burn', 'systematic review' or 'meta-analysis'. Additional studies were identified by hand-searching five discipline-specific journals. Two authors independently screened papers, extracted and evaluated methodological quality using the 11-item A Measurement Tool to Assess Systematic Reviews (AMSTAR) tool and reporting quality using the 27-item Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) checklist. Characteristics of systematic reviews associated with methodological and reporting quality were identified. Descriptive statistics and linear regression identified features associated with improved methodological quality. A total of 60 systematic reviews met the inclusion criteria. Six of the 11 AMSTAR items reporting on 'a priori' design, duplicate study selection, grey literature, included/excluded studies, publication bias and conflict of interest were reported in less than 50% of the systematic reviews. Of the 27 items listed for PRISMA, 13 items reporting on introduction, methods, results and the discussion were addressed in less than 50% of systematic reviews. Multivariable analyses showed that systematic reviews associated with higher methodological or reporting quality incorporated a meta-analysis (AMSTAR regression coefficient 2.1; 95% CI: 1.1, 3.1; PRISMA regression coefficient 6·3; 95% CI: 3·8, 8·7) were published in the Cochrane library (AMSTAR regression coefficient 2·9; 95% CI: 1·6, 4·2; PRISMA regression coefficient 6·1; 95% CI: 3·1, 9·2) and included a randomised control trial (AMSTAR regression coefficient 1·4; 95%CI: 0·4, 2·4; PRISMA regression coefficient 3·4; 95% CI: 0·9, 5·8). The methodological and reporting quality of systematic reviews in burn care requires further improvement with stricter adherence by authors to the PRISMA checklist and AMSTAR tool. © 2016 Medicalhelplines.com Inc and John Wiley & Sons Ltd.
Linear regression metamodeling as a tool to summarize and present simulation model results.
Jalal, Hawre; Dowd, Bryan; Sainfort, François; Kuntz, Karen M
2013-10-01
Modelers lack a tool to systematically and clearly present complex model results, including those from sensitivity analyses. The objective was to propose linear regression metamodeling as a tool to increase transparency of decision analytic models and better communicate their results. We used a simplified cancer cure model to demonstrate our approach. The model computed the lifetime cost and benefit of 3 treatment options for cancer patients. We simulated 10,000 cohorts in a probabilistic sensitivity analysis (PSA) and regressed the model outcomes on the standardized input parameter values in a set of regression analyses. We used the regression coefficients to describe measures of sensitivity analyses, including threshold and parameter sensitivity analyses. We also compared the results of the PSA to deterministic full-factorial and one-factor-at-a-time designs. The regression intercept represented the estimated base-case outcome, and the other coefficients described the relative parameter uncertainty in the model. We defined simple relationships that compute the average and incremental net benefit of each intervention. Metamodeling produced outputs similar to traditional deterministic 1-way or 2-way sensitivity analyses but was more reliable since it used all parameter values. Linear regression metamodeling is a simple, yet powerful, tool that can assist modelers in communicating model characteristics and sensitivity analyses.
Measurement of reaeration coefficients for selected Florida streams
Hampson, P.S.; Coffin, J.E.
1989-01-01
A total of 29 separate reaeration coefficient determinations were performed on 27 subreaches of 12 selected Florida streams between October 1981 and May 1985. Measurements performed prior to June 1984 were made using the peak and area methods with ethylene and propane as the tracer gases. Later measurements utilized the steady-state method with propane as the only tracer gas. The reaeration coefficients ranged from 1.07 to 45.9 days with a mean estimated probable error of +/16.7%. Ten predictive equations (compiled from the literature) were also evaluated using the measured coefficients. The most representative equation was one of the energy dissipation type with a standard error of 60.3%. Seven of the 10 predictive additional equations were modified using the measured coefficients and nonlinear regression techniques. The most accurate of the developed equations was also of the energy dissipation form and had a standard error of 54.9%. For 5 of the 13 subreaches in which both ethylene and propane were used, the ethylene data resulted in substantially larger reaeration coefficient values which were rejected. In these reaches, ethylene concentrations were probably significantly affected by one or more electrophilic addition reactions known to occur in aqueous media. (Author 's abstract)
Moderation analysis using a two-level regression model.
Yuan, Ke-Hai; Cheng, Ying; Maxwell, Scott
2014-10-01
Moderation analysis is widely used in social and behavioral research. The most commonly used model for moderation analysis is moderated multiple regression (MMR) in which the explanatory variables of the regression model include product terms, and the model is typically estimated by least squares (LS). This paper argues for a two-level regression model in which the regression coefficients of a criterion variable on predictors are further regressed on moderator variables. An algorithm for estimating the parameters of the two-level model by normal-distribution-based maximum likelihood (NML) is developed. Formulas for the standard errors (SEs) of the parameter estimates are provided and studied. Results indicate that, when heteroscedasticity exists, NML with the two-level model gives more efficient and more accurate parameter estimates than the LS analysis of the MMR model. When error variances are homoscedastic, NML with the two-level model leads to essentially the same results as LS with the MMR model. Most importantly, the two-level regression model permits estimating the percentage of variance of each regression coefficient that is due to moderator variables. When applied to data from General Social Surveys 1991, NML with the two-level model identified a significant moderation effect of race on the regression of job prestige on years of education while LS with the MMR model did not. An R package is also developed and documented to facilitate the application of the two-level model.
The Bayesian group lasso for confounded spatial data
Hefley, Trevor J.; Hooten, Mevin B.; Hanks, Ephraim M.; Russell, Robin E.; Walsh, Daniel P.
2017-01-01
Generalized linear mixed models for spatial processes are widely used in applied statistics. In many applications of the spatial generalized linear mixed model (SGLMM), the goal is to obtain inference about regression coefficients while achieving optimal predictive ability. When implementing the SGLMM, multicollinearity among covariates and the spatial random effects can make computation challenging and influence inference. We present a Bayesian group lasso prior with a single tuning parameter that can be chosen to optimize predictive ability of the SGLMM and jointly regularize the regression coefficients and spatial random effect. We implement the group lasso SGLMM using efficient Markov chain Monte Carlo (MCMC) algorithms and demonstrate how multicollinearity among covariates and the spatial random effect can be monitored as a derived quantity. To test our method, we compared several parameterizations of the SGLMM using simulated data and two examples from plant ecology and disease ecology. In all examples, problematic levels multicollinearity occurred and influenced sampling efficiency and inference. We found that the group lasso prior resulted in roughly twice the effective sample size for MCMC samples of regression coefficients and can have higher and less variable predictive accuracy based on out-of-sample data when compared to the standard SGLMM.
The use of gas chromatographic-mass spectrometric-computer systems in pharmacokinetic studies.
Horning, M G; Nowlin, J; Stafford, M; Lertratanangkoon, K; Sommer, K R; Hill, R M; Stillwell, R N
1975-10-29
Pharmacokinetic studies involving plasma, urine, breast milk, saliva and liver homogenates have been carried out by selective ion detection with a gas chromatographic-mass spectrometric-computer system operated in the chemical ionization mode. Stable isotope labeled drugs were used as internal standards for quantification. The half-lives, the concentration at zero time, the slope (regression coefficient), the maximum velocity of the reaction and the apparent Michaelis constant of the reaction were determined by regression analysis, and also by graphic means.
Bowen, Stephen R; Chappell, Richard J; Bentzen, Søren M; Deveau, Michael A; Forrest, Lisa J; Jeraj, Robert
2012-01-01
Purpose To quantify associations between pre-radiotherapy and post-radiotherapy PET parameters via spatially resolved regression. Materials and methods Ten canine sinonasal cancer patients underwent PET/CT scans of [18F]FDG (FDGpre), [18F]FLT (FLTpre), and [61Cu]Cu-ATSM (Cu-ATSMpre). Following radiotherapy regimens of 50 Gy in 10 fractions, veterinary patients underwent FDG PET/CT scans at three months (FDGpost). Regression of standardized uptake values in baseline FDGpre, FLTpre and Cu-ATSMpre tumour voxels to those in FDGpost images was performed for linear, log-linear, generalized-linear and mixed-fit linear models. Goodness-of-fit in regression coefficients was assessed by R2. Hypothesis testing of coefficients over the patient population was performed. Results Multivariate linear model fits of FDGpre to FDGpost were significantly positive over the population (FDGpost~0.17 FDGpre, p=0.03), and classified slopes of RECIST non-responders and responders to be different (0.37 vs. 0.07, p=0.01). Generalized-linear model fits related FDGpre to FDGpost by a linear power law (FDGpost~FDGpre0.93, p<0.001). Univariate mixture model fits of FDGpre improved R2 from 0.17 to 0.52. Neither baseline FLT PET nor Cu-ATSM PET uptake contributed statistically significant multivariate regression coefficients. Conclusions Spatially resolved regression analysis indicates that pre-treatment FDG PET uptake is most strongly associated with three-month post-treatment FDG PET uptake in this patient population, though associations are histopathology-dependent. PMID:22682748
Background stratified Poisson regression analysis of cohort data.
Richardson, David B; Langholz, Bryan
2012-03-01
Background stratified Poisson regression is an approach that has been used in the analysis of data derived from a variety of epidemiologically important studies of radiation-exposed populations, including uranium miners, nuclear industry workers, and atomic bomb survivors. We describe a novel approach to fit Poisson regression models that adjust for a set of covariates through background stratification while directly estimating the radiation-disease association of primary interest. The approach makes use of an expression for the Poisson likelihood that treats the coefficients for stratum-specific indicator variables as 'nuisance' variables and avoids the need to explicitly estimate the coefficients for these stratum-specific parameters. Log-linear models, as well as other general relative rate models, are accommodated. This approach is illustrated using data from the Life Span Study of Japanese atomic bomb survivors and data from a study of underground uranium miners. The point estimate and confidence interval obtained from this 'conditional' regression approach are identical to the values obtained using unconditional Poisson regression with model terms for each background stratum. Moreover, it is shown that the proposed approach allows estimation of background stratified Poisson regression models of non-standard form, such as models that parameterize latency effects, as well as regression models in which the number of strata is large, thereby overcoming the limitations of previously available statistical software for fitting background stratified Poisson regression models.
Intermittent nocturnal hypoxia and metabolic risk in obese adolescents with obstructive sleep apnea.
Narang, Indra; McCrindle, Brian W; Manlhiot, Cedric; Lu, Zihang; Al-Saleh, Suhail; Birken, Catherine S; Hamilton, Jill
2018-01-22
There is conflicting data regarding the independent associations of obstructive sleep apnea (OSA) with metabolic risk in obese youth. Previous studies have not consistently addressed central adiposity, specifically elevated waist to height ratio (WHtR), which is associated with metabolic risk independent of body mass index. The objective of this study was to determine the independent effects of the obstructive apnea-hypopnea index (OAHI) and associated indices of nocturnal hypoxia on metabolic function in obese youth after adjusting for WHtR. Subjects had standardized anthropometric measurements. Fasting blood included insulin, glucose, glycated hemoglobin, alanine transferase, and aspartate transaminase. Insulin resistance was quantified with the homeostatic model assessment. Overnight polysomnography determined the OAHI and nocturnal oxygenation indices. Of the 75 recruited subjects, 23% were diagnosed with OSA. Adjusting for age, gender, and WHtR in multivariable linear regression models, a higher oxygen desaturation index was associated with a higher fasting insulin (coefficient [standard error] = 48.076 [11.255], p < 0.001), higher glycated hemoglobin (coefficient [standard error] = 0.097 [0.041], p = 0.02), higher insulin resistance (coefficient [standard error] = 1.516 [0.364], p < 0.001), elevated alanine transferase (coefficient [standard error] = 11.631 [2.770], p < 0.001), and aspartate transaminase (coefficient [standard error] = 4.880 [1.444], p = 0.001). However, there were no significant associations between OAHI, glucose metabolism, and liver enzymes. Intermittent nocturnal hypoxia rather than the OAHI was associated with metabolic risk in obese youth after adjusting for WHtR. Measures of abdominal adiposity such as WHtR should be considered in future studies that evaluate the impact of OSA on metabolic health.
Chen, Ying-Jen; Ho, Meng-Yang; Chen, Kwan-Ju; Hsu, Chia-Fen; Ryu, Shan-Jin
2009-08-01
The aims of the present study were to (i) investigate if traditional Chinese word reading ability can be used for estimating premorbid general intelligence; and (ii) to provide multiple regression equations for estimating premorbid performance on Raven's Standard Progressive Matrices (RSPM), using age, years of education and Chinese Graded Word Reading Test (CGWRT) scores as predictor variables. Four hundred and twenty-six healthy volunteers (201 male, 225 female), aged 16-93 years (mean +/- SD, 41.92 +/- 18.19 years) undertook the tests individually under supervised conditions. Seventy percent of subjects were randomly allocated to the derivation group (n = 296), and the rest to the validation group (n = 130). RSPM score was positively correlated with CGWRT score and years of education. RSPM and CGWRT scores and years of education were also inversely correlated with age, but the declining trend for RSPM performance against age was steeper than that for CGWRT performance. Separate multiple regression equations were derived for estimating RSPM scores using different combinations of age, years of education, and CGWRT score for both groups. The multiple regression coefficient of each equation ranged from 0.71 to 0.80 with the standard error of estimate between 7 and 8 RSPM points. When fitting the data of one group to the equations derived from its counterpart group, the cross-validation multiple regression coefficients ranged from 0.71 to 0.79. There were no significant differences in the 'predicted-obtained' RSPM discrepancies between any equations. The regression equations derived in the present study may provide a basis for estimating premorbid RSPM performance.
ERIC Educational Resources Information Center
Tuncer, Murat
2013-01-01
Present research investigates reciprocal relations amidst computer self-efficacy, scientific research and information literacy self-efficacy. Research findings have demonstrated that according to standardized regression coefficients, computer self-efficacy has a positive effect on information literacy self-efficacy. Likewise it has been detected…
Zhang, Lifang; Sui, Minghong; Yan, Tiebin; You, Liming; Li, Kun; Gao, Yan
2017-03-01
To explore the impacts of social participation and the environment on depression among people with stroke. Cross-sectional survey. Structured interviews in the participants' homes. Community-dwelling persons with stroke in the rural areas of China ( N = 639). Not applicable. Depression (Hamilton Rating Scale for Depression-6), activity and social participation (Chinese version of the World Health Organization's Disability Assessment Schedule 2.0), environmental barriers (Craig Hospital Inventory of Environmental Factors), neurological function (Canadian Neurological Scale). A total of 42% of the variance in depression was explained by the environmental barriers, neurological function, activity, and social participation factors studied. Social participation, services/assistance, and attitudes/support were directly related to depression; their standardized regression coefficients were 0.530, 0.162, and 0.092, respectively ( p ⩽ 0.01). The physical environment, policies, and neurological function indirectly impacted depression. Depression influences social participation in turn, with a standardized regression coefficient of 0.29 ( p ⩽ 0.01). Depression and social participation are inversely related. The physical environment, services/assistance, attitudes/support, and policies all impact post-stroke depression.
Determinations of cloud liquid water in the tropics from the SSM/I
NASA Technical Reports Server (NTRS)
Alishouse, John C.; Swift, Calvin; Ruf, Christopher; Snyder, Sheila; Vongsathorn, Jennifer
1989-01-01
Upward-looking microwave radiometric observations were used to validate the SSM/I determinations, and also as a basis for the determination of new coefficients. Due to insufficiency of the initial four channel algorithm for cloud liquid water, the improved algorithm was derived from the CORRAD (the University of Massachusetts autocorrelation radiometer) measurements of cloud liquid water and the matching SSM/I brightness temperatures using the standard linear regression. The correlation coefficients for the possible four channel combinations, and subsequently the best and the worst combinations were determined.
Ridge: a computer program for calculating ridge regression estimates
Donald E. Hilt; Donald W. Seegrist
1977-01-01
Least-squares coefficients for multiple-regression models may be unstable when the independent variables are highly correlated. Ridge regression is a biased estimation procedure that produces stable estimates of the coefficients. Ridge regression is discussed, and a computer program for calculating the ridge coefficients is presented.
Sabetghadam, Samaneh; Ahmadi-Givi, Farhang
2014-01-01
Light extinction, which is the extent of attenuation of light signal for every distance traveled by light in the absence of special weather conditions (e.g., fog and rain), can be expressed as the sum of scattering and absorption effects of aerosols. In this paper, diurnal and seasonal variations of the extinction coefficient are investigated for the urban areas of Tehran from 2007 to 2009. Cases of visibility impairment that were concurrent with reports of fog, mist, precipitation, or relative humidity above 90% are filtered. The mean value and standard deviation of daily extinction are 0.49 and 0.39 km(-1), respectively. The average is much higher than that in many other large cities in the world, indicating the rather poor air quality over Tehran. The extinction coefficient shows obvious diurnal variations in each season, with a peak in the morning that is more pronounced in the wintertime. Also, there is a very slight increasing trend in the annual variations of atmospheric extinction coefficient, which suggests that air quality has regressed since 2007. The horizontal extinction coefficient decreased from January to July in each year and then increased between July and December, with the maximum value in the winter. Diurnal variation of extinction is often associated with small values for low relative humidity (RH), but increases significantly at higher RH. Annual correlation analysis shows that there is a positive correlation between the extinction coefficient and RH, CO, PM10, SO2, and NO2 concentration, while negative correlation exists between the extinction and T, WS, and O3, implying their unfavorable impact on extinction variation. The extinction budget was derived from multiple regression equations using the regression coefficients. On average, 44% of the extinction is from suspended particles, 3% is from air molecules, about 5% is from NO2 absorption, 0.35% is from RH, and approximately 48% is unaccounted for, which may represent errors in the data as well as contribution of other atmospheric constituents omitted from the analysis. Stronger regression equation is achieved in the summer, meaning that the extinction is more predictable in this season using pollutant concentrations.
Waller, Niels G
2016-01-01
For a fixed set of standardized regression coefficients and a fixed coefficient of determination (R-squared), an infinite number of predictor correlation matrices will satisfy the implied quadratic form. I call such matrices fungible correlation matrices. In this article, I describe an algorithm for generating positive definite (PD), positive semidefinite (PSD), or indefinite (ID) fungible correlation matrices that have a random or fixed smallest eigenvalue. The underlying equations of this algorithm are reviewed from both algebraic and geometric perspectives. Two simulation studies illustrate that fungible correlation matrices can be profitably used in Monte Carlo research. The first study uses PD fungible correlation matrices to compare penalized regression algorithms. The second study uses ID fungible correlation matrices to compare matrix-smoothing algorithms. R code for generating fungible correlation matrices is presented in the supplemental materials.
Optimization of Regression Models of Experimental Data Using Confirmation Points
NASA Technical Reports Server (NTRS)
Ulbrich, N.
2010-01-01
A new search metric is discussed that may be used to better assess the predictive capability of different math term combinations during the optimization of a regression model of experimental data. The new search metric can be determined for each tested math term combination if the given experimental data set is split into two subsets. The first subset consists of data points that are only used to determine the coefficients of the regression model. The second subset consists of confirmation points that are exclusively used to test the regression model. The new search metric value is assigned after comparing two values that describe the quality of the fit of each subset. The first value is the standard deviation of the PRESS residuals of the data points. The second value is the standard deviation of the response residuals of the confirmation points. The greater of the two values is used as the new search metric value. This choice guarantees that both standard deviations are always less or equal to the value that is used during the optimization. Experimental data from the calibration of a wind tunnel strain-gage balance is used to illustrate the application of the new search metric. The new search metric ultimately generates an optimized regression model that was already tested at regression model independent confirmation points before it is ever used to predict an unknown response from a set of regressors.
Nakatsuka, Haruo; Chiba, Keiko; Watanabe, Takao; Sawatari, Hideyuki; Seki, Takako
2016-11-01
Iodine intake by adults in farming districts in Northeastern Japan was evaluated by two methods: (1) government-approved food composition tables based calculation and (2) instrumental measurement. The correlation between these two values and a regression model for the calibration of calculated values was presented. Iodine intake was calculated, using the values in the Japan Standard Tables of Food Composition (FCT), through the analysis of duplicate samples of complete 24-h food consumption for 90 adult subjects. In cases where the value for iodine content was not available in the FCT, it was assumed to be zero for that food item (calculated values). Iodine content was also measured by ICP-MS (measured values). Calculated and measured values rendered geometric means (GM) of 336 and 279 μg/day, respectively. There was no statistically significant (p > 0.05) difference between calculated and measured values. The correlation coefficient was 0.646 (p < 0.05). With this high correlation coefficient, a simple regression line can be applied to estimate measured value from calculated value. A survey of the literature suggests that the values in this study were similar to values that have been reported to date for Japan, and higher than those for other countries in Asia. Iodine intake of Japanese adults was 336 μg/day (GM, calculated) and 279 μg/day (GM, measured). Both values correlated so well, with a correlation coefficient of 0.646, that a regression model (Y = 130.8 + 1.9479X, where X and Y are measured and calculated values, respectively) could be used to calibrate calculated values.
Hearing loss screening tool (COBRA score) for newborns in primary care setting
Poonual, Watcharapol; Navacharoen, Niramon; Kangsanarak, Jaran; Namwongprom, Sirianong
2017-01-01
Purpose To develop and evaluate a simple screening tool to assess hearing loss in newborns. A derived score was compared with the standard clinical practice tool. Methods This cohort study was designed to screen the hearing of newborns using transiently evoked otoacoustic emission and auditory brain stem response, and to determine the risk factors associated with hearing loss of newborns in 3 tertiary hospitals in Northern Thailand. Data were prospectively collected from November 1, 2010 to May 31, 2012. To develop the risk score, clinical-risk indicators were measured by Poisson risk regression. The regression coefficients were transformed into item scores dividing each regression-coefficient with the smallest coefficient in the model, rounding the number to its nearest integer, and adding up to a total score. Results Five clinical risk factors (Craniofacial anomaly, Ototoxicity, Birth weight, family history [Relative] of congenital sensorineural hearing loss, and Apgar score) were included in our COBRA score. The screening tool detected, by area under the receiver operating characteristic curve, more than 80% of existing hearing loss. The positive-likelihood ratio of hearing loss in patients with scores of 4, 6, and 8 were 25.21 (95% confidence interval [CI], 14.69–43.26), 58.52 (95% CI, 36.26–94.44), and 51.56 (95% CI, 33.74–78.82), respectively. This result was similar to the standard tool (The Joint Committee on Infant Hearing) of 26.72 (95% CI, 20.59–34.66). Conclusion A simple screening tool of five predictors provides good prediction indices for newborn hearing loss, which may motivate parents to bring children for further appropriate testing and investigations. PMID:29234358
Li, Min; Zhang, Lu; Yao, Xiaolong; Jiang, Xingyu
2017-01-01
The emerging membrane introduction mass spectrometry technique has been successfully used to detect benzene, toluene, ethyl benzene and xylene (BTEX), while overlapped spectra have unfortunately hindered its further application to the analysis of mixtures. Multivariate calibration, an efficient method to analyze mixtures, has been widely applied. In this paper, we compared univariate and multivariate analyses for quantification of the individual components of mixture samples. The results showed that the univariate analysis creates poor models with regression coefficients of 0.912, 0.867, 0.440 and 0.351 for BTEX, respectively. For multivariate analysis, a comparison to the partial-least squares (PLS) model shows that the orthogonal partial-least squares (OPLS) regression exhibits an optimal performance with regression coefficients of 0.995, 0.999, 0.980 and 0.976, favorable calibration parameters (RMSEC and RMSECV) and a favorable validation parameter (RMSEP). Furthermore, the OPLS exhibits a good recovery of 73.86 - 122.20% and relative standard deviation (RSD) of the repeatability of 1.14 - 4.87%. Thus, MIMS coupled with the OPLS regression provides an optimal approach for a quantitative BTEX mixture analysis in monitoring and predicting water pollution.
2014-12-01
Approx) A N OVA R~gresslon R esidual Total d.f. ss 1 . 0 .00018 1. 0 .00518 2. 0 .00536 Coefficients Standard Error Intercept 0.04 234 0...150.3230 + 27.9237 • "Mission" Rating (1-10) AN OVA Regression Residual To Cal d.f. 1 . 1 . 2 . ss 2,469.15829 1,630.59238 4 .099.75068...per Foot ($M) =· 1.8590 + 0.0091 • Length (ft) AN OVA d . fc ss Regres sion 1 . 4.22352 Residual 4 . 7.26194 Total 5 . 11.48546 Coefficients
Middleton, Michael S; Haufe, William; Hooker, Jonathan; Borga, Magnus; Dahlqvist Leinhard, Olof; Romu, Thobias; Tunón, Patrik; Hamilton, Gavin; Wolfson, Tanya; Gamst, Anthony; Loomba, Rohit; Sirlin, Claude B
2017-05-01
Purpose To determine the repeatability and accuracy of a commercially available magnetic resonance (MR) imaging-based, semiautomated method to quantify abdominal adipose tissue and thigh muscle volume and hepatic proton density fat fraction (PDFF). Materials and Methods This prospective study was institutional review board- approved and HIPAA compliant. All subjects provided written informed consent. Inclusion criteria were age of 18 years or older and willingness to participate. The exclusion criterion was contraindication to MR imaging. Three-dimensional T1-weighted dual-echo body-coil images were acquired three times. Source images were reconstructed to generate water and calibrated fat images. Abdominal adipose tissue and thigh muscle were segmented, and their volumes were estimated by using a semiautomated method and, as a reference standard, a manual method. Hepatic PDFF was estimated by using a confounder-corrected chemical shift-encoded MR imaging method with hybrid complex-magnitude reconstruction and, as a reference standard, MR spectroscopy. Tissue volume and hepatic PDFF intra- and interexamination repeatability were assessed by using intraclass correlation and coefficient of variation analysis. Tissue volume and hepatic PDFF accuracy were assessed by means of linear regression with the respective reference standards. Results Adipose and thigh muscle tissue volumes of 20 subjects (18 women; age range, 25-76 years; body mass index range, 19.3-43.9 kg/m 2 ) were estimated by using the semiautomated method. Intra- and interexamination intraclass correlation coefficients were 0.996-0.998 and coefficients of variation were 1.5%-3.6%. For hepatic MR imaging PDFF, intra- and interexamination intraclass correlation coefficients were greater than or equal to 0.994 and coefficients of variation were less than or equal to 7.3%. In the regression analyses of manual versus semiautomated volume and spectroscopy versus MR imaging, PDFF slopes and intercepts were close to the identity line, and correlations of determination at multivariate analysis (R 2 ) ranged from 0.744 to 0.994. Conclusion This MR imaging-based, semiautomated method provides high repeatability and accuracy for estimating abdominal adipose tissue and thigh muscle volumes and hepatic PDFF. © RSNA, 2017.
ERIC Educational Resources Information Center
Dolan, Conor V.; Wicherts, Jelte M.; Molenaar, Peter C. M.
2004-01-01
We consider the question of how variation in the number and reliability of indicators affects the power to reject the hypothesis that the regression coefficients are zero in latent linear regression analysis. We show that power remains constant as long as the coefficient of determination remains unchanged. Any increase in the number of indicators…
The regionalization of national-scale SPARROW models for stream nutrients
Schwarz, Gregory E.; Alexander, Richard B.; Smith, Richard A.; Preston, Stephen D.
2011-01-01
This analysis modifies the parsimonious specification of recently published total nitrogen (TN) and total phosphorus (TP) national-scale SPAtially Referenced Regressions On Watershed attributes models to allow each model coefficient to vary geographically among three major river basins of the conterminous United States. Regionalization of the national models reduces the standard errors in the prediction of TN and TP loads, expressed as a percentage of the predicted load, by about 6 and 7%. We develop and apply a method for combining national-scale and regional-scale information to estimate a hybrid model that imposes cross-region constraints that limit regional variation in model coefficients, effectively reducing the number of free model parameters as compared to a collection of independent regional models. The hybrid TN and TP regional models have improved model fit relative to the respective national models, reducing the standard error in the prediction of loads, expressed as a percentage of load, by about 5 and 4%. Only 19% of the TN hybrid model coefficients and just 2% of the TP hybrid model coefficients show evidence of substantial regional specificity (more than ±100% deviation from the national model estimate). The hybrid models have much greater precision in the estimated coefficients than do the unconstrained regional models, demonstrating the efficacy of pooling information across regions to improve regional models.
NASA Astrophysics Data System (ADS)
Gholizadeh, H.; Robeson, S. M.
2015-12-01
Empirical models have been widely used to estimate global chlorophyll content from remotely sensed data. Here, we focus on the standard NASA empirical models that use blue-green band ratios. These band ratio ocean color (OC) algorithms are in the form of fourth-order polynomials and the parameters of these polynomials (i.e. coefficients) are estimated from the NASA bio-Optical Marine Algorithm Data set (NOMAD). Most of the points in this data set have been sampled from tropical and temperate regions. However, polynomial coefficients obtained from this data set are used to estimate chlorophyll content in all ocean regions with different properties such as sea-surface temperature, salinity, and downwelling/upwelling patterns. Further, the polynomial terms in these models are highly correlated. In sum, the limitations of these empirical models are as follows: 1) the independent variables within the empirical models, in their current form, are correlated (multicollinear), and 2) current algorithms are global approaches and are based on the spatial stationarity assumption, so they are independent of location. Multicollinearity problem is resolved by using partial least squares (PLS). PLS, which transforms the data into a set of independent components, can be considered as a combined form of principal component regression (PCR) and multiple regression. Geographically weighted regression (GWR) is also used to investigate the validity of spatial stationarity assumption. GWR solves a regression model over each sample point by using the observations within its neighbourhood. PLS results show that the empirical method underestimates chlorophyll content in high latitudes, including the Southern Ocean region, when compared to PLS (see Figure 1). Cluster analysis of GWR coefficients also shows that the spatial stationarity assumption in empirical models is not likely a valid assumption.
Yamazaki, Takeshi; Takeda, Hisato; Hagiya, Koichi; Yamaguchi, Satoshi; Sasaki, Osamu
2018-03-13
Because lactation periods in dairy cows lengthen with increasing total milk production, it is important to predict individual productivities after 305 days in milk (DIM) to determine the optimal lactation period. We therefore examined whether the random regression (RR) coefficient from 306 to 450 DIM (M2) can be predicted from those during the first 305 DIM (M1) by using a random regression model. We analyzed test-day milk records from 85690 Holstein cows in their first lactations and 131727 cows in their later (second to fifth) lactations. Data in M1 and M2 were analyzed separately by using different single-trait RR animal models. We then performed a multiple regression analysis of the RR coefficients of M2 on those of M1 during the first and later lactations. The first-order Legendre polynomials were practical covariates of random regression for the milk yields of M2. All RR coefficients for the additive genetic (AG) effect and the intercept for the permanent environmental (PE) effect of M2 had moderate to strong correlations with the intercept for the AG effect of M1. The coefficients of determination for multiple regression of the combined intercepts for the AG and PE effects of M2 on the coefficients for the AG effect of M1 were moderate to high. The daily milk yields of M2 predicted by using the RR coefficients for the AG effect of M1 were highly correlated with those obtained by using the coefficients of M2. Milk production after 305 DIM can be predicted by using the RR coefficient estimates of the AG effect during the first 305 DIM.
Thompson, Ronald E.; Hoffman, Scott A.
2006-01-01
A suite of 28 streamflow statistics, ranging from extreme low to high flows, was computed for 17 continuous-record streamflow-gaging stations and predicted for 20 partial-record stations in Monroe County and contiguous counties in north-eastern Pennsylvania. The predicted statistics for the partial-record stations were based on regression analyses relating inter-mittent flow measurements made at the partial-record stations indexed to concurrent daily mean flows at continuous-record stations during base-flow conditions. The same statistics also were predicted for 134 ungaged stream locations in Monroe County on the basis of regression analyses relating the statistics to GIS-determined basin characteristics for the continuous-record station drainage areas. The prediction methodology for developing the regression equations used to estimate statistics was developed for estimating low-flow frequencies. This study and a companion study found that the methodology also has application potential for predicting intermediate- and high-flow statistics. The statistics included mean monthly flows, mean annual flow, 7-day low flows for three recurrence intervals, nine flow durations, mean annual base flow, and annual mean base flows for two recurrence intervals. Low standard errors of prediction and high coefficients of determination (R2) indicated good results in using the regression equations to predict the statistics. Regression equations for the larger flow statistics tended to have lower standard errors of prediction and higher coefficients of determination (R2) than equations for the smaller flow statistics. The report discusses the methodologies used in determining the statistics and the limitations of the statistics and the equations used to predict the statistics. Caution is indicated in using the predicted statistics for small drainage area situations. Study results constitute input needed by water-resource managers in Monroe County for planning purposes and evaluation of water-resources availability.
Interpreting Regression Results: beta Weights and Structure Coefficients are Both Important.
ERIC Educational Resources Information Center
Thompson, Bruce
Various realizations have led to less frequent use of the "OVA" methods (analysis of variance--ANOVA--among others) and to more frequent use of general linear model approaches such as regression. However, too few researchers understand all the various coefficients produced in regression. This paper explains these coefficients and their…
1990-05-01
0.759 0.744 0.768 0.753 106 (THUMBBR) THUMB BREADTH -0.652 -0.673 -0.539 -0.663 217 (LIPLGTHH) LIP LENGTH HEADBOARD 0.017 0.019 0.020 51 (FTBRHOR) FOOT...DEPENDENT VARIABLE: (106) THUMB BREADTH (THUBBR) MODEL INDEPENDENT VARIABLE 1 2 3 4 5 INTERCEPT 6.621 5.016 6.267 5.697 4.528 59 (HANDCIRC) HAND...95 (SLLSPEL) SLEEVE LENGTH: SPINE-ELBOW -0.020 -0.019 -C.018 9 (BLFTCIRC) BALL OF FOOT CIRCUMFERENCE -0.032 -0.039 106 (THUMBBR) THUMB BREADTH 0.228
NASA Astrophysics Data System (ADS)
Pradhan, Biswajeet
2010-05-01
This paper presents the results of the cross-validation of a multivariate logistic regression model using remote sensing data and GIS for landslide hazard analysis on the Penang, Cameron, and Selangor areas in Malaysia. Landslide locations in the study areas were identified by interpreting aerial photographs and satellite images, supported by field surveys. SPOT 5 and Landsat TM satellite imagery were used to map landcover and vegetation index, respectively. Maps of topography, soil type, lineaments and land cover were constructed from the spatial datasets. Ten factors which influence landslide occurrence, i.e., slope, aspect, curvature, distance from drainage, lithology, distance from lineaments, soil type, landcover, rainfall precipitation, and normalized difference vegetation index (ndvi), were extracted from the spatial database and the logistic regression coefficient of each factor was computed. Then the landslide hazard was analysed using the multivariate logistic regression coefficients derived not only from the data for the respective area but also using the logistic regression coefficients calculated from each of the other two areas (nine hazard maps in all) as a cross-validation of the model. For verification of the model, the results of the analyses were then compared with the field-verified landslide locations. Among the three cases of the application of logistic regression coefficient in the same study area, the case of Selangor based on the Selangor logistic regression coefficients showed the highest accuracy (94%), where as Penang based on the Penang coefficients showed the lowest accuracy (86%). Similarly, among the six cases from the cross application of logistic regression coefficient in other two areas, the case of Selangor based on logistic coefficient of Cameron showed highest (90%) prediction accuracy where as the case of Penang based on the Selangor logistic regression coefficients showed the lowest accuracy (79%). Qualitatively, the cross application model yields reasonable results which can be used for preliminary landslide hazard mapping.
Hierarchical Bayesian Logistic Regression to forecast metabolic control in type 2 DM patients.
Dagliati, Arianna; Malovini, Alberto; Decata, Pasquale; Cogni, Giulia; Teliti, Marsida; Sacchi, Lucia; Cerra, Carlo; Chiovato, Luca; Bellazzi, Riccardo
2016-01-01
In this work we present our efforts in building a model able to forecast patients' changes in clinical conditions when repeated measurements are available. In this case the available risk calculators are typically not applicable. We propose a Hierarchical Bayesian Logistic Regression model, which allows taking into account individual and population variability in model parameters estimate. The model is used to predict metabolic control and its variation in type 2 diabetes mellitus. In particular we have analyzed a population of more than 1000 Italian type 2 diabetic patients, collected within the European project Mosaic. The results obtained in terms of Matthews Correlation Coefficient are significantly better than the ones gathered with standard logistic regression model, based on data pooling.
Qidwai, Tabish; Yadav, Dharmendra K; Khan, Feroz; Dhawan, Sangeeta; Bhakuni, R S
2012-01-01
This work presents the development of quantitative structure activity relationship (QSAR) model to predict the antimalarial activity of artemisinin derivatives. The structures of the molecules are represented by chemical descriptors that encode topological, geometric, and electronic structure features. Screening through QSAR model suggested that compounds A24, A24a, A53, A54, A62 and A64 possess significant antimalarial activity. Linear model is developed by the multiple linear regression method to link structures to their reported antimalarial activity. The correlation in terms of regression coefficient (r(2)) was 0.90 and prediction accuracy of model in terms of cross validation regression coefficient (rCV(2)) was 0.82. This study indicates that chemical properties viz., atom count (all atoms), connectivity index (order 1, standard), ring count (all rings), shape index (basic kappa, order 2), and solvent accessibility surface area are well correlated with antimalarial activity. The docking study showed high binding affinity of predicted active compounds against antimalarial target Plasmepsins (Plm-II). Further studies for oral bioavailability, ADMET and toxicity risk assessment suggest that compound A24, A24a, A53, A54, A62 and A64 exhibits marked antimalarial activity comparable to standard antimalarial drugs. Later one of the predicted active compound A64 was chemically synthesized, structure elucidated by NMR and in vivo tested in multidrug resistant strain of Plasmodium yoelii nigeriensis infected mice. The experimental results obtained agreed well with the predicted values.
The Outlier Detection for Ordinal Data Using Scalling Technique of Regression Coefficients
NASA Astrophysics Data System (ADS)
Adnan, Arisman; Sugiarto, Sigit
2017-06-01
The aims of this study is to detect the outliers by using coefficients of Ordinal Logistic Regression (OLR) for the case of k category responses where the score from 1 (the best) to 8 (the worst). We detect them by using the sum of moduli of the ordinal regression coefficients calculated by jackknife technique. This technique is improved by scalling the regression coefficients to their means. R language has been used on a set of ordinal data from reference distribution. Furthermore, we compare this approach by using studentised residual plots of jackknife technique for ANOVA (Analysis of Variance) and OLR. This study shows that the jackknifing technique along with the proper scaling may lead us to reveal outliers in ordinal regression reasonably well.
Restoration of Monotonicity Respecting in Dynamic Regression
Huang, Yijian
2017-01-01
Dynamic regression models, including the quantile regression model and Aalen’s additive hazards model, are widely adopted to investigate evolving covariate effects. Yet lack of monotonicity respecting with standard estimation procedures remains an outstanding issue. Advances have recently been made, but none provides a complete resolution. In this article, we propose a novel adaptive interpolation method to restore monotonicity respecting, by successively identifying and then interpolating nearest monotonicity-respecting points of an original estimator. Under mild regularity conditions, the resulting regression coefficient estimator is shown to be asymptotically equivalent to the original. Our numerical studies have demonstrated that the proposed estimator is much more smooth and may have better finite-sample efficiency than the original as well as, when available as only in special cases, other competing monotonicity-respecting estimators. Illustration with a clinical study is provided. PMID:29430068
Gíslason, Magnús; Sigurðsson, Sigurður; Guðnason, Vilmundur; Harris, Tamara; Carraro, Ugo; Gargiulo, Paolo
2018-01-01
Sarcopenic muscular degeneration has been consistently identified as an independent risk factor for mortality in aging populations. Recent investigations have realized the quantitative potential of computed tomography (CT) image analysis to describe skeletal muscle volume and composition; however, the optimum approach to assessing these data remains debated. Current literature reports average Hounsfield unit (HU) values and/or segmented soft tissue cross-sectional areas to investigate muscle quality. However, standardized methods for CT analyses and their utility as a comorbidity index remain undefined, and no existing studies compare these methods to the assessment of entire radiodensitometric distributions. The primary aim of this study was to present a comparison of nonlinear trimodal regression analysis (NTRA) parameters of entire radiodensitometric muscle distributions against extant CT metrics and their correlation with lower extremity function (LEF) biometrics (normal/fast gait speed, timed up-and-go, and isometric leg strength) and biochemical and nutritional parameters, such as total solubilized cholesterol (SCHOL) and body mass index (BMI). Data were obtained from 3,162 subjects, aged 66–96 years, from the population-based AGES-Reykjavik Study. 1-D k-means clustering was employed to discretize each biometric and comorbidity dataset into twelve subpopulations, in accordance with Sturges’ Formula for Class Selection. Dataset linear regressions were performed against eleven NTRA distribution parameters and standard CT analyses (fat/muscle cross-sectional area and average HU value). Parameters from NTRA and CT standards were analogously assembled by age and sex. Analysis of specific NTRA parameters with standard CT results showed linear correlation coefficients greater than 0.85, but multiple regression analysis of correlative NTRA parameters yielded a correlation coefficient of 0.99 (P<0.005). These results highlight the specificities of each muscle quality metric to LEF biometrics, SCHOL, and BMI, and particularly highlight the value of the connective tissue regime in this regard. PMID:29513690
Edmunds, Kyle; Gíslason, Magnús; Sigurðsson, Sigurður; Guðnason, Vilmundur; Harris, Tamara; Carraro, Ugo; Gargiulo, Paolo
2018-01-01
Sarcopenic muscular degeneration has been consistently identified as an independent risk factor for mortality in aging populations. Recent investigations have realized the quantitative potential of computed tomography (CT) image analysis to describe skeletal muscle volume and composition; however, the optimum approach to assessing these data remains debated. Current literature reports average Hounsfield unit (HU) values and/or segmented soft tissue cross-sectional areas to investigate muscle quality. However, standardized methods for CT analyses and their utility as a comorbidity index remain undefined, and no existing studies compare these methods to the assessment of entire radiodensitometric distributions. The primary aim of this study was to present a comparison of nonlinear trimodal regression analysis (NTRA) parameters of entire radiodensitometric muscle distributions against extant CT metrics and their correlation with lower extremity function (LEF) biometrics (normal/fast gait speed, timed up-and-go, and isometric leg strength) and biochemical and nutritional parameters, such as total solubilized cholesterol (SCHOL) and body mass index (BMI). Data were obtained from 3,162 subjects, aged 66-96 years, from the population-based AGES-Reykjavik Study. 1-D k-means clustering was employed to discretize each biometric and comorbidity dataset into twelve subpopulations, in accordance with Sturges' Formula for Class Selection. Dataset linear regressions were performed against eleven NTRA distribution parameters and standard CT analyses (fat/muscle cross-sectional area and average HU value). Parameters from NTRA and CT standards were analogously assembled by age and sex. Analysis of specific NTRA parameters with standard CT results showed linear correlation coefficients greater than 0.85, but multiple regression analysis of correlative NTRA parameters yielded a correlation coefficient of 0.99 (P<0.005). These results highlight the specificities of each muscle quality metric to LEF biometrics, SCHOL, and BMI, and particularly highlight the value of the connective tissue regime in this regard.
Analysis of a Split-Plot Experimental Design Applied to a Low-Speed Wind Tunnel Investigation
NASA Technical Reports Server (NTRS)
Erickson, Gary E.
2013-01-01
A procedure to analyze a split-plot experimental design featuring two input factors, two levels of randomization, and two error structures in a low-speed wind tunnel investigation of a small-scale model of a fighter airplane configuration is described in this report. Standard commercially-available statistical software was used to analyze the test results obtained in a randomization-restricted environment often encountered in wind tunnel testing. The input factors were differential horizontal stabilizer incidence and the angle of attack. The response variables were the aerodynamic coefficients of lift, drag, and pitching moment. Using split-plot terminology, the whole plot, or difficult-to-change, factor was the differential horizontal stabilizer incidence, and the subplot, or easy-to-change, factor was the angle of attack. The whole plot and subplot factors were both tested at three levels. Degrees of freedom for the whole plot error were provided by replication in the form of three blocks, or replicates, which were intended to simulate three consecutive days of wind tunnel facility operation. The analysis was conducted in three stages, which yielded the estimated mean squares, multiple regression function coefficients, and corresponding tests of significance for all individual terms at the whole plot and subplot levels for the three aerodynamic response variables. The estimated regression functions included main effects and two-factor interaction for the lift coefficient, main effects, two-factor interaction, and quadratic effects for the drag coefficient, and only main effects for the pitching moment coefficient.
Bayesian Estimation of Multivariate Latent Regression Models: Gauss versus Laplace
ERIC Educational Resources Information Center
Culpepper, Steven Andrew; Park, Trevor
2017-01-01
A latent multivariate regression model is developed that employs a generalized asymmetric Laplace (GAL) prior distribution for regression coefficients. The model is designed for high-dimensional applications where an approximate sparsity condition is satisfied, such that many regression coefficients are near zero after accounting for all the model…
Smith, S. Jerrod; Lewis, Jason M.; Graves, Grant M.
2015-09-28
Generalized-least-squares multiple-linear regression analysis was used to formulate regression relations between peak-streamflow frequency statistics and basin characteristics. Contributing drainage area was the only basin characteristic determined to be statistically significant for all percentage of annual exceedance probabilities and was the only basin characteristic used in regional regression equations for estimating peak-streamflow frequency statistics on unregulated streams in and near the Oklahoma Panhandle. The regression model pseudo-coefficient of determination, converted to percent, for the Oklahoma Panhandle regional regression equations ranged from about 38 to 63 percent. The standard errors of prediction and the standard model errors for the Oklahoma Panhandle regional regression equations ranged from about 84 to 148 percent and from about 76 to 138 percent, respectively. These errors were comparable to those reported for regional peak-streamflow frequency regression equations for the High Plains areas of Texas and Colorado. The root mean square errors for the Oklahoma Panhandle regional regression equations (ranging from 3,170 to 92,000 cubic feet per second) were less than the root mean square errors for the Oklahoma statewide regression equations (ranging from 18,900 to 412,000 cubic feet per second); therefore, the Oklahoma Panhandle regional regression equations produce more accurate peak-streamflow statistic estimates for the irrigated period of record in the Oklahoma Panhandle than do the Oklahoma statewide regression equations. The regression equations developed in this report are applicable to streams that are not substantially affected by regulation, impoundment, or surface-water withdrawals. These regression equations are intended for use for stream sites with contributing drainage areas less than or equal to about 2,060 square miles, the maximum value for the independent variable used in the regression analysis.
Viability estimation of pepper seeds using time-resolved photothermal signal characterization
NASA Astrophysics Data System (ADS)
Kim, Ghiseok; Kim, Geon-Hee; Lohumi, Santosh; Kang, Jum-Soon; Cho, Byoung-Kwan
2014-11-01
We used infrared thermal signal measurement system and photothermal signal and image reconstruction techniques for viability estimation of pepper seeds. Photothermal signals from healthy and aged seeds were measured for seven periods (24, 48, 72, 96, 120, 144, and 168 h) using an infrared camera and analyzed by a regression method. The photothermal signals were regressed using a two-term exponential decay curve with two amplitudes and two time variables (lifetime) as regression coefficients. The regression coefficients of the fitted curve showed significant differences for each seed groups, depending on the aging times. In addition, the viability of a single seed was estimated by imaging of its regression coefficient, which was reconstructed from the measured photothermal signals. The time-resolved photothermal characteristics, along with the regression coefficient images, can be used to discriminate the aged or dead pepper seeds from the healthy seeds.
Parameter estimation in Cox models with missing failure indicators and the OPPERA study.
Brownstein, Naomi C; Cai, Jianwen; Slade, Gary D; Bair, Eric
2015-12-30
In a prospective cohort study, examining all participants for incidence of the condition of interest may be prohibitively expensive. For example, the "gold standard" for diagnosing temporomandibular disorder (TMD) is a physical examination by a trained clinician. In large studies, examining all participants in this manner is infeasible. Instead, it is common to use questionnaires to screen for incidence of TMD and perform the "gold standard" examination only on participants who screen positively. Unfortunately, some participants may leave the study before receiving the "gold standard" examination. Within the framework of survival analysis, this results in missing failure indicators. Motivated by the Orofacial Pain: Prospective Evaluation and Risk Assessment (OPPERA) study, a large cohort study of TMD, we propose a method for parameter estimation in survival models with missing failure indicators. We estimate the probability of being an incident case for those lacking a "gold standard" examination using logistic regression. These estimated probabilities are used to generate multiple imputations of case status for each missing examination that are combined with observed data in appropriate regression models. The variance introduced by the procedure is estimated using multiple imputation. The method can be used to estimate both regression coefficients in Cox proportional hazard models as well as incidence rates using Poisson regression. We simulate data with missing failure indicators and show that our method performs as well as or better than competing methods. Finally, we apply the proposed method to data from the OPPERA study. Copyright © 2015 John Wiley & Sons, Ltd.
Regression-based adaptive sparse polynomial dimensional decomposition for sensitivity analysis
NASA Astrophysics Data System (ADS)
Tang, Kunkun; Congedo, Pietro; Abgrall, Remi
2014-11-01
Polynomial dimensional decomposition (PDD) is employed in this work for global sensitivity analysis and uncertainty quantification of stochastic systems subject to a large number of random input variables. Due to the intimate structure between PDD and Analysis-of-Variance, PDD is able to provide simpler and more direct evaluation of the Sobol' sensitivity indices, when compared to polynomial chaos (PC). Unfortunately, the number of PDD terms grows exponentially with respect to the size of the input random vector, which makes the computational cost of the standard method unaffordable for real engineering applications. In order to address this problem of curse of dimensionality, this work proposes a variance-based adaptive strategy aiming to build a cheap meta-model by sparse-PDD with PDD coefficients computed by regression. During this adaptive procedure, the model representation by PDD only contains few terms, so that the cost to resolve repeatedly the linear system of the least-square regression problem is negligible. The size of the final sparse-PDD representation is much smaller than the full PDD, since only significant terms are eventually retained. Consequently, a much less number of calls to the deterministic model is required to compute the final PDD coefficients.
Ono, Tomohiro; Nakamura, Mitsuhiro; Hirose, Yoshinori; Kitsuda, Kenji; Ono, Yuka; Ishigaki, Takashi; Hiraoka, Masahiro
2017-09-01
To estimate the lung tumor position from multiple anatomical features on four-dimensional computed tomography (4D-CT) data sets using single regression analysis (SRA) and multiple regression analysis (MRA) approach and evaluate an impact of the approach on internal target volume (ITV) for stereotactic body radiotherapy (SBRT) of the lung. Eleven consecutive lung cancer patients (12 cases) underwent 4D-CT scanning. The three-dimensional (3D) lung tumor motion exceeded 5 mm. The 3D tumor position and anatomical features, including lung volume, diaphragm, abdominal wall, and chest wall positions, were measured on 4D-CT images. The tumor position was estimated by SRA using each anatomical feature and MRA using all anatomical features. The difference between the actual and estimated tumor positions was defined as the root-mean-square error (RMSE). A standard partial regression coefficient for the MRA was evaluated. The 3D lung tumor position showed a high correlation with the lung volume (R = 0.92 ± 0.10). Additionally, ITVs derived from SRA and MRA approaches were compared with ITV derived from contouring gross tumor volumes on all 10 phases of the 4D-CT (conventional ITV). The RMSE of the SRA was within 3.7 mm in all directions. Also, the RMSE of the MRA was within 1.6 mm in all directions. The standard partial regression coefficient for the lung volume was the largest and had the most influence on the estimated tumor position. Compared with conventional ITV, average percentage decrease of ITV were 31.9% and 38.3% using SRA and MRA approaches, respectively. The estimation accuracy of lung tumor position was improved by the MRA approach, which provided smaller ITV than conventional ITV. © 2017 The Authors. Journal of Applied Clinical Medical Physics published by Wiley Periodicals, Inc. on behalf of American Association of Physicists in Medicine.
Qing, Si-han; Chang, Yun-feng; Dong, Xiao-ai; Li, Yuan; Chen, Xiao-gang; Shu, Yong-kang; Deng, Zhen-hua
2013-10-01
To establish the mathematical models of stature estimation for Sichuan Han female with measurement of lumbar vertebrae by X-ray to provide essential data for forensic anthropology research. The samples, 206 Sichuan Han females, were divided into three groups including group A, B and C according to the ages. Group A (206 samples) consisted of all ages, group B (116 samples) were 20-45 years old and 90 samples over 45 years old were group C. All the samples were examined lumbar vertebrae through CR technology, including the parameters of five centrums (L1-L5) as anterior border, posterior border and central heights (x1-x15), total central height of lumbar spine (x16), and the real height of every sample. The linear regression analysis was produced using the parameters to establish the mathematical models of stature estimation. Sixty-two trained subjects were tested to verify the accuracy of the mathematical models. The established mathematical models by hypothesis test of linear regression equation model were statistically significant (P<0.05). The standard errors of the equation were 2.982-5.004 cm, while correlation coefficients were 0.370-0.779 and multiple correlation coefficients were 0.533-0.834. The return tests of the highest correlation coefficient and multiple correlation coefficient of each group showed that the highest accuracy of the multiple regression equation, y = 100.33 + 1.489 x3 - 0.548 x6 + 0.772 x9 + 0.058 x12 + 0.645 x15, in group A were 80.6% (+/- lSE) and 100% (+/- 2SE). The established mathematical models in this study could be applied for the stature estimation for Sichuan Han females.
Liu, Chang-Fu; He, Xing-Yuan; Chen, Wei; Zhao, Gui-Ling; Xue, Wen-Duo
2008-06-01
Based on the fractal theory of forest growth, stepwise regression was employed to pursue a convenient and efficient method of measuring the three-dimensional green biomass (TGB) of urban forests in small area. A total of thirteen simulation equations of TGB of urban forests in Shenyang City were derived, with the factors affecting the TGB analyzed. The results showed that the coefficients of determination (R2) of the 13 simulation equations ranged from 0.612 to 0.842. No evident pattern was shown in residual analysis, and the precisions were all higher than 87% (alpha = 0.05) and 83% (alpha = 0.01). The most convenient simulation equation was ln Y = 7.468 + 0.926 lnx1, where Y was the simulated TGB and x1 was basal area at breast height per hectare (SDB). The correlations between the standard regression coefficients of the simulation equations and 16 tree characteristics suggested that SDB was the main factor affecting the TGB of urban forests in Shenyang.
Penalized spline estimation for functional coefficient regression models.
Cao, Yanrong; Lin, Haiqun; Wu, Tracy Z; Yu, Yan
2010-04-01
The functional coefficient regression models assume that the regression coefficients vary with some "threshold" variable, providing appreciable flexibility in capturing the underlying dynamics in data and avoiding the so-called "curse of dimensionality" in multivariate nonparametric estimation. We first investigate the estimation, inference, and forecasting for the functional coefficient regression models with dependent observations via penalized splines. The P-spline approach, as a direct ridge regression shrinkage type global smoothing method, is computationally efficient and stable. With established fixed-knot asymptotics, inference is readily available. Exact inference can be obtained for fixed smoothing parameter λ, which is most appealing for finite samples. Our penalized spline approach gives an explicit model expression, which also enables multi-step-ahead forecasting via simulations. Furthermore, we examine different methods of choosing the important smoothing parameter λ: modified multi-fold cross-validation (MCV), generalized cross-validation (GCV), and an extension of empirical bias bandwidth selection (EBBS) to P-splines. In addition, we implement smoothing parameter selection using mixed model framework through restricted maximum likelihood (REML) for P-spline functional coefficient regression models with independent observations. The P-spline approach also easily allows different smoothness for different functional coefficients, which is enabled by assigning different penalty λ accordingly. We demonstrate the proposed approach by both simulation examples and a real data application.
Kaambwa, Billingsley; Bryan, Stirling; Billingham, Lucinda
2012-06-27
Missing data is a common statistical problem in healthcare datasets from populations of older people. Some argue that arbitrarily assuming the mechanism responsible for the missingness and therefore the method for dealing with this missingness is not the best option-but is this always true? This paper explores what happens when extra information that suggests that a particular mechanism is responsible for missing data is disregarded and methods for dealing with the missing data are chosen arbitrarily. Regression models based on 2,533 intermediate care (IC) patients from the largest evaluation of IC done and published in the UK to date were used to explain variation in costs, EQ-5D and Barthel index. Three methods for dealing with missingness were utilised, each assuming a different mechanism as being responsible for the missing data: complete case analysis (assuming missing completely at random-MCAR), multiple imputation (assuming missing at random-MAR) and Heckman selection model (assuming missing not at random-MNAR). Differences in results were gauged by examining the signs of coefficients as well as the sizes of both coefficients and associated standard errors. Extra information strongly suggested that missing cost data were MCAR. The results show that MCAR and MAR-based methods yielded similar results with sizes of most coefficients and standard errors differing by less than 3.4% while those based on MNAR-methods were statistically different (up to 730% bigger). Significant variables in all regression models also had the same direction of influence on costs. All three mechanisms of missingness were shown to be potential causes of the missing EQ-5D and Barthel data. The method chosen to deal with missing data did not seem to have any significant effect on the results for these data as they led to broadly similar conclusions with sizes of coefficients and standard errors differing by less than 54% and 322%, respectively. Arbitrary selection of methods to deal with missing data should be avoided. Using extra information gathered during the data collection exercise about the cause of missingness to guide this selection would be more appropriate.
Hidden Connections between Regression Models of Strain-Gage Balance Calibration Data
NASA Technical Reports Server (NTRS)
Ulbrich, Norbert
2013-01-01
Hidden connections between regression models of wind tunnel strain-gage balance calibration data are investigated. These connections become visible whenever balance calibration data is supplied in its design format and both the Iterative and Non-Iterative Method are used to process the data. First, it is shown how the regression coefficients of the fitted balance loads of a force balance can be approximated by using the corresponding regression coefficients of the fitted strain-gage outputs. Then, data from the manual calibration of the Ames MK40 six-component force balance is chosen to illustrate how estimates of the regression coefficients of the fitted balance loads can be obtained from the regression coefficients of the fitted strain-gage outputs. The study illustrates that load predictions obtained by applying the Iterative or the Non-Iterative Method originate from two related regression solutions of the balance calibration data as long as balance loads are given in the design format of the balance, gage outputs behave highly linear, strict statistical quality metrics are used to assess regression models of the data, and regression model term combinations of the fitted loads and gage outputs can be obtained by a simple variable exchange.
Wrong Signs in Regression Coefficients
NASA Technical Reports Server (NTRS)
McGee, Holly
1999-01-01
When using parametric cost estimation, it is important to note the possibility of the regression coefficients having the wrong sign. A wrong sign is defined as a sign on the regression coefficient opposite to the researcher's intuition and experience. Some possible causes for the wrong sign discussed in this paper are a small range of x's, leverage points, missing variables, multicollinearity, and computational error. Additionally, techniques for determining the cause of the wrong sign are given.
NASA Astrophysics Data System (ADS)
Kelly, B.; Chelsky, A.; Bulygina, E.; Roberts, B. J.
2017-12-01
Remote sensing techniques have become valuable tools to researchers, providing the capability to measure and visualize important parameters without the need for time or resource intensive sampling trips. Relationships between dissolved organic carbon (DOC), colored dissolved organic matter (CDOM) and spectral data have been used to remotely sense DOC concentrations in riverine systems, however, this approach has not been applied to the northern Gulf of Mexico (GoM) and needs to be tested to determine how accurate these relationships are in riverine-dominated shelf systems. In April, July, and October 2017 we sampled surface water from 80+ sites over an area of 100,000 km2 along the Louisiana-Texas shelf in the northern GoM. DOC concentrations were measured on filtered water samples using a Shimadzu TOC-VCSH analyzer using standard techniques. Additionally, DOC concentrations were estimated from CDOM absorption coefficients of filtered water samples on a UV-Vis spectrophotometer using a modification of the methods of Fichot and Benner (2011). These values were regressed against Landsat visible band spectral data for those same locations to establish a relationship between the spectral data, CDOM absorption coefficients. This allowed us to spatially map CDOM absorption coefficients in the Gulf of Mexico using the Landsat spectral data in GIS. We then used a multiple linear regressions model to derive DOC concentrations from the CDOM absorption coefficients and applied those to our map. This study provides an evaluation of the viability of scaling up CDOM absorption coefficient and remote-sensing derived estimates of DOC concentrations to the scale of the LA-TX shelf ecosystem.
NASA Astrophysics Data System (ADS)
Nishidate, Izumi; Wiswadarma, Aditya; Hase, Yota; Tanaka, Noriyuki; Maeda, Takaaki; Niizeki, Kyuichi; Aizu, Yoshihisa
2011-08-01
In order to visualize melanin and blood concentrations and oxygen saturation in human skin tissue, a simple imaging technique based on multispectral diffuse reflectance images acquired at six wavelengths (500, 520, 540, 560, 580 and 600nm) was developed. The technique utilizes multiple regression analysis aided by Monte Carlo simulation for diffuse reflectance spectra. Using the absorbance spectrum as a response variable and the extinction coefficients of melanin, oxygenated hemoglobin, and deoxygenated hemoglobin as predictor variables, multiple regression analysis provides regression coefficients. Concentrations of melanin and total blood are then determined from the regression coefficients using conversion vectors that are deduced numerically in advance, while oxygen saturation is obtained directly from the regression coefficients. Experiments with a tissue-like agar gel phantom validated the method. In vivo experiments with human skin of the human hand during upper limb occlusion and of the inner forearm exposed to UV irradiation demonstrated the ability of the method to evaluate physiological reactions of human skin tissue.
Adjusted variable plots for Cox's proportional hazards regression model.
Hall, C B; Zeger, S L; Bandeen-Roche, K J
1996-01-01
Adjusted variable plots are useful in linear regression for outlier detection and for qualitative evaluation of the fit of a model. In this paper, we extend adjusted variable plots to Cox's proportional hazards model for possibly censored survival data. We propose three different plots: a risk level adjusted variable (RLAV) plot in which each observation in each risk set appears, a subject level adjusted variable (SLAV) plot in which each subject is represented by one point, and an event level adjusted variable (ELAV) plot in which the entire risk set at each failure event is represented by a single point. The latter two plots are derived from the RLAV by combining multiple points. In each point, the regression coefficient and standard error from a Cox proportional hazards regression is obtained by a simple linear regression through the origin fit to the coordinates of the pictured points. The plots are illustrated with a reanalysis of a dataset of 65 patients with multiple myeloma.
Michienzi, Alissa; Kron, Tomas; Callahan, Jason; Plumridge, Nikki; Ball, David; Everitt, Sarah
2017-04-01
Cone-beam computed tomography (CBCT) is a valuable image-guidance tool in radiation therapy (RT). This study was initiated to assess the accuracy of CBCT for quantifying non-small cell lung cancer (NSCLC) tumour volumes compared to the anatomical 'gold standard', CT. Tumour regression or progression on CBCT was also analysed. Patients with Stage I-III NSCLC, prescribed 60 Gy in 30 fractions RT with concurrent platinum-based chemotherapy, routine CBCT and enrolled in a prospective study of serial PET/CT (baseline, weeks two and four) were eligible. Time-matched CBCT and CT gross tumour volumes (GTVs) were manually delineated by a single observer on MIM software, and were analysed descriptively and using Pearson's correlation coefficient (r) and linear regression (R 2 ). Of 94 CT/CBCT pairs, 30 patients were eligible for inclusion. The mean (± SD) CT GTV vs CBCT GTV on the four time-matched pairs were 95 (±182) vs 98.8 (±160.3), 73.6 (±132.4) vs 70.7 (±96.6), 54.7 (±92.9) vs 61.0 (±98.8) and 61.3 (±53.3) vs 62.1 (±47.9) respectively. Pearson's correlation coefficient (r) was 0.98 (95% CI 0.97-0.99, ρ < 0.001). The mean (±SD) CT/CBCT Dice's similarity coefficient was 0.66 (±0.16). Of 289 CBCT scans, tumours in 27 (90%) patients regressed by a mean (±SD) rate of 1.5% (±0.75) per fraction. The mean (±SD) GTV regression was 43.1% (±23.1) from the first to final CBCT. Primary lung tumour volumes observed on CBCT and time-matched CT are highly correlated (although not identical), thereby validating observations of GTV regression on CBCT in NSCLC. © 2016 The Royal Australian and New Zealand College of Radiologists.
Determining the response of sea level to atmospheric pressure forcing using TOPEX/POSEIDON data
NASA Technical Reports Server (NTRS)
Fu, Lee-Lueng; Pihos, Greg
1994-01-01
The static response of sea level to the forcing of atmospheric pressure, the so-called inverted barometer (IB) effect, is investigated using TOPEX/POSEIDON data. This response, characterized by the rise and fall of sea level to compensate for the change of atmospheric pressure at a rate of -1 cm/mbar, is not associated with any ocean currents and hence is normally treated as an error to be removed from sea level observation. Linear regression and spectral transfer function analyses are applied to sea level and pressure to examine the validity of the IB effect. In regions outside the tropics, the regression coefficient is found to be consistently close to the theoretical value except for the regions of western boundary currents, where the mesoscale variability interferes with the IB effect. The spectral transfer function shows near IB response at periods of 30 degrees is -0.84 +/- 0.29 cm/mbar (1 standard deviation). The deviation from = 1 cm /mbar is shown to be caused primarily by the effect of wind forcing on sea level, based on multivariate linear regression model involving both pressure and wind forcing. The regression coefficient for pressure resulting from the multivariate analysis is -0.96 +/- 0.32 cm/mbar. In the tropics the multivariate analysis fails because sea level in the tropics is primarily responding to remote wind forcing. However, after removing from the data the wind-forced sea level estimated by a dynamic model of the tropical Pacific, the pressure regression coefficient improves from -1.22 +/- 0.69 cm/mbar to -0.99 +/- 0.46 cm/mbar, clearly revealing an IB response. The result of the study suggests that with a proper removal of the effect of wind forcing the IB effect is valid in most of the open ocean at periods longer than 20 days and spatial scales larger than 500 km.
The Regionalization of National-Scale SPARROW Models for Stream Nutrients
Schwarz, G.E.; Alexander, R.B.; Smith, R.A.; Preston, S.D.
2011-01-01
This analysis modifies the parsimonious specification of recently published total nitrogen (TN) and total phosphorus (TP) national-scale SPAtially Referenced Regressions On Watershed attributes models to allow each model coefficient to vary geographically among three major river basins of the conterminous United States. Regionalization of the national models reduces the standard errors in the prediction of TN and TP loads, expressed as a percentage of the predicted load, by about 6 and 7%. We develop and apply a method for combining national-scale and regional-scale information to estimate a hybrid model that imposes cross-region constraints that limit regional variation in model coefficients, effectively reducing the number of free model parameters as compared to a collection of independent regional models. The hybrid TN and TP regional models have improved model fit relative to the respective national models, reducing the standard error in the prediction of loads, expressed as a percentage of load, by about 5 and 4%. Only 19% of the TN hybrid model coefficients and just 2% of the TP hybrid model coefficients show evidence of substantial regional specificity (more than ??100% deviation from the national model estimate). The hybrid models have much greater precision in the estimated coefficients than do the unconstrained regional models, demonstrating the efficacy of pooling information across regions to improve regional models. ?? 2011 American Water Resources Association. This article is a U.S. Government work and is in the public domain in the USA.
Morioka, Noriko; Tomio, Jun; Seto, Toshikazu; Kobayashi, Yasuki
2017-01-01
In Japan, the revision of the fee schedules in 2006 introduced a new category of general care ward for more advanced care, with a higher staffing standard, a patient-to-nurse ratio of 7:1. Previous studies have suggested that these changes worsened inequalities in the geographic distribution of nurses, but there have been few quantitative studies evaluating this effect. This study aimed to investigate the association between the distribution of 7:1 beds and the geographic distribution of hospital nursing staffs. We conducted a secondary data analysis of hospital reimbursement reports in 2012 in Japan. The study units were secondary medical areas (SMAs) in Japan, which are roughly comparable to hospital service areas in the United States. The outcome variable was the nurse density per 100,000 population in each SMA. The 7:1 bed density per 100,000 population was the main independent variable. To investigate the association between the nurse density and 7:1 bed density, adjusting for other variables, we applied a multiple linear regression model, with nurse density as an outcome variable, and the bed densities by functional category of inpatient ward as independent variables, adding other variables related to socio-economic status and nurse workforce. To investigate whether 7:1 bed density made the largest contribution to the nurse density, compared to other bed densities, we estimated the standardized regression coefficients. There were 344 SMAs in the study period, of which 343 were used because of data availability. There were approximately 553,600 full time equivalent nurses working in inpatient wards in hospitals. The mean (standard deviation) of the full time equivalent nurse density was 426.4 (147.5) and for 7:1 bed density, the figures were 271.9 (185.9). The 7:1 bed density ranged from 0.0 to 1,295.5. After adjusting for the possible confounders, there were more hospital nurses in the areas with higher densities of 7:1 beds (standardized regression coefficient 0.62, 95% confidence interval 0.56-0.68). We found that the 7:1 nurse staffing standard made the largest contribution to the geographic distribution of hospital nurses, adjusted for socio-economic status and nurse workforce-related factors.
Huang, Shi; MacKinnon, David P.; Perrino, Tatiana; Gallo, Carlos; Cruden, Gracelyn; Brown, C Hendricks
2016-01-01
Mediation analysis often requires larger sample sizes than main effect analysis to achieve the same statistical power. Combining results across similar trials may be the only practical option for increasing statistical power for mediation analysis in some situations. In this paper, we propose a method to estimate: 1) marginal means for mediation path a, the relation of the independent variable to the mediator; 2) marginal means for path b, the relation of the mediator to the outcome, across multiple trials; and 3) the between-trial level variance-covariance matrix based on a bivariate normal distribution. We present the statistical theory and an R computer program to combine regression coefficients from multiple trials to estimate a combined mediated effect and confidence interval under a random effects model. Values of coefficients a and b, along with their standard errors from each trial are the input for the method. This marginal likelihood based approach with Monte Carlo confidence intervals provides more accurate inference than the standard meta-analytic approach. We discuss computational issues, apply the method to two real-data examples and make recommendations for the use of the method in different settings. PMID:28239330
Chaurasia, Ashok; Harel, Ofer
2015-02-10
Tests for regression coefficients such as global, local, and partial F-tests are common in applied research. In the framework of multiple imputation, there are several papers addressing tests for regression coefficients. However, for simultaneous hypothesis testing, the existing methods are computationally intensive because they involve calculation with vectors and (inversion of) matrices. In this paper, we propose a simple method based on the scalar entity, coefficient of determination, to perform (global, local, and partial) F-tests with multiply imputed data. The proposed method is evaluated using simulated data and applied to suicide prevention data. Copyright © 2014 John Wiley & Sons, Ltd.
Mortamais, Marion; Chevrier, Cécile; Philippat, Claire; Petit, Claire; Calafat, Antonia M; Ye, Xiaoyun; Silva, Manori J; Brambilla, Christian; Eijkemans, Marinus J C; Charles, Marie-Aline; Cordier, Sylvaine; Slama, Rémy
2012-04-26
Environmental epidemiology and biomonitoring studies typically rely on biological samples to assay the concentration of non-persistent exposure biomarkers. Between-participant variations in sampling conditions of these biological samples constitute a potential source of exposure misclassification. Few studies attempted to correct biomarker levels for this error. We aimed to assess the influence of sampling conditions on concentrations of urinary biomarkers of select phenols and phthalates, two widely-produced families of chemicals, and to standardize biomarker concentrations on sampling conditions. Urine samples were collected between 2002 and 2006 among 287 pregnant women from Eden and Pélagie cohorts, from which phthalates and phenols metabolites levels were assayed. We applied a 2-step standardization method based on regression residuals. First, the influence of sampling conditions (including sampling hour, duration of storage before freezing) and of creatinine levels on biomarker concentrations were characterized using adjusted linear regression models. In the second step, the model estimates were used to remove the variability in biomarker concentrations due to sampling conditions and to standardize concentrations as if all samples had been collected under the same conditions (e.g., same hour of urine collection). Sampling hour was associated with concentrations of several exposure biomarkers. After standardization for sampling conditions, median concentrations differed by--38% for 2,5-dichlorophenol to +80 % for a metabolite of diisodecyl phthalate. However, at the individual level, standardized biomarker levels were strongly correlated (correlation coefficients above 0.80) with unstandardized measures. Sampling conditions, such as sampling hour, should be systematically collected in biomarker-based studies, in particular when the biomarker half-life is short. The 2-step standardization method based on regression residuals that we proposed in order to limit the impact of heterogeneity in sampling conditions could be further tested in studies describing levels of biomarkers or their influence on health.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Rivkin, R.B.; Seliger, H.H.
1981-07-01
Short term rates of /sup 14/C uptake for single cells and small numbers of isolated algal cells of five phytoplankton species from natural populations were measured by liquid scintillation counting. Regression analysis of uptake rates per cell for cells isolated from unialgal cultures of seven species of dinoflagellates, ranging in volume from ca. 10/sup 3/ to 10/sup 7/ ..mu..m/sup 3/, gave results identical to uptake rates per cell measured by conventional /sup 14/C techniques. Relative standard errors or regression coefficients ranged between 3 and 10%, indicating that for any species there was little variation in photosynthesis per cell.
[Comparison of two methods for rapid determination of C-reactive protein with the Tina-quant].
Oremek, G M; Luksaite, R; Bretschneider, I
2008-03-01
C-reactive protein (CRP) as an acute phase protein is an important diagnostic marker for the presence and course of human processes. Out of the acute phase proteins it is one of those the concentrations increase most rapidly with its sensitivity being superior to other markers of inflammation, such as leukocytosis, erythrocytic sedimentation rate, and fever. This study compared two-point-of-care assays with the standard laboratory method Tina-quant CRP processed on a Hitachi 917: the immunofiltration assay NycoCard CRP Whole Blood and the turbidimetric immunoassay Micros CRP. Both methods are carried in the presence of a patient, by using capillary or venous blood. Seventy-eight blood samples were analyzed first in the standard laboratory routine and then by both rapid test assays. The precision of both assays was determined from the confidence interval. The results were statistically analyzed by arithmetic standard deviation mean method, variation coefficient, Spearman correlation index, Wilcoxon and Bland-Altman tests, and Passing-Bablock regression. NycoCard CRP Whole Blood showed a correlation coefficient of R = 0.9838; the precision had a coefficient of variation of CV = 1.8759% while As compared with Tina-quant CRP had R = 0.9934 and CV = 0.9160%. Both assays indicated the same results as Tina-quant CRP. Both Tina-quant CRP and NycoCard CRP Whole Blood give the best fit for the rapid determination of CRP.
Roth, Philip L; Le, Huy; Oh, In-Sue; Van Iddekinge, Chad H; Bobko, Philip
2018-06-01
Meta-analysis has become a well-accepted method for synthesizing empirical research about a given phenomenon. Many meta-analyses focus on synthesizing correlations across primary studies, but some primary studies do not report correlations. Peterson and Brown (2005) suggested that researchers could use standardized regression weights (i.e., beta coefficients) to impute missing correlations. Indeed, their beta estimation procedures (BEPs) have been used in meta-analyses in a wide variety of fields. In this study, the authors evaluated the accuracy of BEPs in meta-analysis. We first examined how use of BEPs might affect results from a published meta-analysis. We then developed a series of Monte Carlo simulations that systematically compared the use of existing correlations (that were not missing) to data sets that incorporated BEPs (that impute missing correlations from corresponding beta coefficients). These simulations estimated ρ̄ (mean population correlation) and SDρ (true standard deviation) across a variety of meta-analytic conditions. Results from both the existing meta-analysis and the Monte Carlo simulations revealed that BEPs were associated with potentially large biases when estimating ρ̄ and even larger biases when estimating SDρ. Using only existing correlations often substantially outperformed use of BEPs and virtually never performed worse than BEPs. Overall, the authors urge a return to the standard practice of using only existing correlations in meta-analysis. (PsycINFO Database Record (c) 2018 APA, all rights reserved).
2016-03-01
regression models that yield hedonic price indexes is closely related to standard techniques for developing cost estimating relationships ( CERs ...October 2014). iii analysis) and derives a price index from the coefficients on variables reflecting the year of purchase. In CER development, the...index. The relevant cost metric in both cases is unit recurring flyaway (URF) costs. For the current project, we develop a “Baseline” CER model, taking
Lu, Hsueh-Kuan; Chen, Yu-Yawn; Yeh, Chinagwen; Chuang, Chih-Lin; Chiang, Li-Ming; Lai, Chung-Liang; Casebolt, Kevin M; Huang, Ai-Chun; Lin, Wen-Long; Hsieh, Kuen-Chang
2017-08-22
The aim of this study was to evaluate leg-to-leg bioelectrical impedance analysis (LBIA) using a four-contact electrode system for measuring abdominal visceral fat area (VFA). The present study recruited 381 (240 male and 141 female) Chinese participants to compare VFA measurements estimated by a standing LBIA system (VFALBIA) with computerized tomography (CT) scanned at the L4-L5 vertebrae (VFA CT ). The total mean body mass index (BMI) was 24.7 ± 4.2 kg/m 2 . Correlation analysis, regression analysis, Bland-Altman plot, and paired sample t-tests were used to analyze the accuracy of the VFA LBIA . For the total subjects, the regression line was VFA LBIA = 0.698 VFA CT + 29.521, (correlation coefficient (r) = 0.789, standard estimate of error (SEE) = 24.470 cm 2 , p < 0.001), Lin's correlation coefficient (CCC) was 0.785; and the limit of agreement (LOA; mean difference ±2 standard deviation) ranged from -43.950 to 67.951 cm 2 , LOA% (given as a percentage of mean value measured by the CT) was 48.2%. VFA LBIA and VFA CT showed significant difference (p < 0.001). Collectively, the current study indicates that LBIA has limited potential to accurately estimate visceral fat in a clinical setting.
Rogers, Paul; Stoner, Julie
2016-01-01
Regression models for correlated binary outcomes are commonly fit using a Generalized Estimating Equations (GEE) methodology. GEE uses the Liang and Zeger sandwich estimator to produce unbiased standard error estimators for regression coefficients in large sample settings even when the covariance structure is misspecified. The sandwich estimator performs optimally in balanced designs when the number of participants is large, and there are few repeated measurements. The sandwich estimator is not without drawbacks; its asymptotic properties do not hold in small sample settings. In these situations, the sandwich estimator is biased downwards, underestimating the variances. In this project, a modified form for the sandwich estimator is proposed to correct this deficiency. The performance of this new sandwich estimator is compared to the traditional Liang and Zeger estimator as well as alternative forms proposed by Morel, Pan and Mancl and DeRouen. The performance of each estimator was assessed with 95% coverage probabilities for the regression coefficient estimators using simulated data under various combinations of sample sizes and outcome prevalence values with an Independence (IND), Autoregressive (AR) and Compound Symmetry (CS) correlation structure. This research is motivated by investigations involving rare-event outcomes in aviation data. PMID:26998504
ERIC Educational Resources Information Center
Yan, Jun; Aseltine, Robert H., Jr.; Harel, Ofer
2013-01-01
Comparing regression coefficients between models when one model is nested within another is of great practical interest when two explanations of a given phenomenon are specified as linear models. The statistical problem is whether the coefficients associated with a given set of covariates change significantly when other covariates are added into…
Applicability of Cameriere's and Drusini's age estimation methods to a sample of Turkish adults.
Hatice, Boyacioglu Dogru; Nihal, Avcu; Nursel, Akkaya; Humeyra Ozge, Yilanci; Goksuluk, Dincer
2017-10-01
The aim of this study was to investigate the applicability of Drusini's and Cameriere's methods to a sample of Turkish people. Panoramic images of 200 individuals were allocated into two groups as study and test groups and examined by two observers. Tooth coronal indexes (TCI), which is the ratio between coronal pulp cavity height and crown height, were calculated in the mandibular first and second premolars and molars. Pulp/tooth area ratios (ARs) were calculated in the maxillary and mandibular canine teeth. Study group measurements were used to derive a regression model. Test group measurements were used to evaluate the accuracy of the regression model. Pearson's correlation coefficients and regression analysis were used. The correlations between TCIs and age were -0.230, -0.301, -0.344 and -0.257 for mandibular first premolar, second premolar, first molar and second molar, respectively. Those for the maxillary canine (MX) and mandibular canine (MN) ARs were -0.716 and -0.514, respectively. The MX ARs were used to build the linear regression model that explained 51.2% of the total variation, with a standard error of 9.23 years. The mean error of the estimates in test group was 8 years and age of 64% of the individuals were estimated with an error of <±10 years which is acceptable in forensic age prediction. The low correlation coefficients between age and TCI indicate that Drusini's method was not applicable to the estimation of age in a Turkish population. Using Cameriere's method, we derived a regression model.
Tools to Support Interpreting Multiple Regression in the Face of Multicollinearity
Kraha, Amanda; Turner, Heather; Nimon, Kim; Zientek, Linda Reichwein; Henson, Robin K.
2012-01-01
While multicollinearity may increase the difficulty of interpreting multiple regression (MR) results, it should not cause undue problems for the knowledgeable researcher. In the current paper, we argue that rather than using one technique to investigate regression results, researchers should consider multiple indices to understand the contributions that predictors make not only to a regression model, but to each other as well. Some of the techniques to interpret MR effects include, but are not limited to, correlation coefficients, beta weights, structure coefficients, all possible subsets regression, commonality coefficients, dominance weights, and relative importance weights. This article will review a set of techniques to interpret MR effects, identify the elements of the data on which the methods focus, and identify statistical software to support such analyses. PMID:22457655
Tools to support interpreting multiple regression in the face of multicollinearity.
Kraha, Amanda; Turner, Heather; Nimon, Kim; Zientek, Linda Reichwein; Henson, Robin K
2012-01-01
While multicollinearity may increase the difficulty of interpreting multiple regression (MR) results, it should not cause undue problems for the knowledgeable researcher. In the current paper, we argue that rather than using one technique to investigate regression results, researchers should consider multiple indices to understand the contributions that predictors make not only to a regression model, but to each other as well. Some of the techniques to interpret MR effects include, but are not limited to, correlation coefficients, beta weights, structure coefficients, all possible subsets regression, commonality coefficients, dominance weights, and relative importance weights. This article will review a set of techniques to interpret MR effects, identify the elements of the data on which the methods focus, and identify statistical software to support such analyses.
Adjusting for Confounding in Early Postlaunch Settings: Going Beyond Logistic Regression Models.
Schmidt, Amand F; Klungel, Olaf H; Groenwold, Rolf H H
2016-01-01
Postlaunch data on medical treatments can be analyzed to explore adverse events or relative effectiveness in real-life settings. These analyses are often complicated by the number of potential confounders and the possibility of model misspecification. We conducted a simulation study to compare the performance of logistic regression, propensity score, disease risk score, and stabilized inverse probability weighting methods to adjust for confounding. Model misspecification was induced in the independent derivation dataset. We evaluated performance using relative bias confidence interval coverage of the true effect, among other metrics. At low events per coefficient (1.0 and 0.5), the logistic regression estimates had a large relative bias (greater than -100%). Bias of the disease risk score estimates was at most 13.48% and 18.83%. For the propensity score model, this was 8.74% and >100%, respectively. At events per coefficient of 1.0 and 0.5, inverse probability weighting frequently failed or reduced to a crude regression, resulting in biases of -8.49% and 24.55%. Coverage of logistic regression estimates became less than the nominal level at events per coefficient ≤5. For the disease risk score, inverse probability weighting, and propensity score, coverage became less than nominal at events per coefficient ≤2.5, ≤1.0, and ≤1.0, respectively. Bias of misspecified disease risk score models was 16.55%. In settings with low events/exposed subjects per coefficient, disease risk score methods can be useful alternatives to logistic regression models, especially when propensity score models cannot be used. Despite better performance of disease risk score methods than logistic regression and propensity score models in small events per coefficient settings, bias, and coverage still deviated from nominal.
Hospitalizations for primary care-sensitive conditions in Pelotas, Brazil: 1998 to 2012.
Costa, Juvenal Soares Dias da; Teixeira, Ana Maria Ferreira Borges; Moraes, Mauricio; Strauch, Eliane Schneider; Silveira, Denise Silva da; Carret, Maria Laura Vidal; Fantinel, Everton
2017-01-01
To verify the hospitalization trend for primary care sensitive-conditions in Pelotas, Rio Grande do Sul, Brazil from 1998 to 2012. An ecological study compared hospitalizations rates of the city of Pelotas with the rest of state of Rio Grande do Sul. Analysis was conducted using direct standardization of rates, coefficients were stratified by sex and the Poisson regression was used. Hospitalizations for sensitive conditions decreased in Pelotas and Rio Grande do Sul. In Pelotas, a 63.8% decrease was detected in the period observed, and there was a 43.1% decrease in the state of Rio Grande do Sul. Poisson regression coefficients showed a decrease of 7% in Pelotas and of 4% in the rest of Rio Grande do Sul each year. During the study period, several changes were introduced in the Brazilian Unified Health System ("Sistema Único de Saúde") that may have influenced the results, including changes in administration, health funding, and a complete reworking of primary care through the creation of the Family Health Strategy program ("Estratégia Saúde da Família").
Yoneoka, Daisuke; Henmi, Masayuki
2017-11-30
Recently, the number of clinical prediction models sharing the same regression task has increased in the medical literature. However, evidence synthesis methodologies that use the results of these regression models have not been sufficiently studied, particularly in meta-analysis settings where only regression coefficients are available. One of the difficulties lies in the differences between the categorization schemes of continuous covariates across different studies. In general, categorization methods using cutoff values are study specific across available models, even if they focus on the same covariates of interest. Differences in the categorization of covariates could lead to serious bias in the estimated regression coefficients and thus in subsequent syntheses. To tackle this issue, we developed synthesis methods for linear regression models with different categorization schemes of covariates. A 2-step approach to aggregate the regression coefficient estimates is proposed. The first step is to estimate the joint distribution of covariates by introducing a latent sampling distribution, which uses one set of individual participant data to estimate the marginal distribution of covariates with categorization. The second step is to use a nonlinear mixed-effects model with correction terms for the bias due to categorization to estimate the overall regression coefficients. Especially in terms of precision, numerical simulations show that our approach outperforms conventional methods, which only use studies with common covariates or ignore the differences between categorization schemes. The method developed in this study is also applied to a series of WHO epidemiologic studies on white blood cell counts. Copyright © 2017 John Wiley & Sons, Ltd.
McAuley, Paul A; Hsu, Fang-Chi; Loman, Kurt K; Carr, J Jeffrey; Budoff, Matthew J; Szklo, Moyses; Sharrett, A Richey; Ding, Jingzhong
2011-09-01
Insulin resistance is linked to general and abdominal obesity, but its relation to hepatic lipid content and pericardial adipose tissue is less clear. The purpose of this study was to examine cross-sectional associations of liver attenuation, pericardial adipose tissue, BMI, and waist circumference with insulin resistance. We measured liver attenuation and pericardial adipose tissue using the existing cardiac computed tomography scans in 5,291 individuals free of clinical cardiovascular disease and diabetes in the Multi-Ethnic Study of Atherosclerosis (MESA) during the study's baseline visit (2000-2002). Low liver attenuation was defined as the lowest quartile and high pericardial adipose tissue as the upper quartile of volume (cm(3)). We used standard clinical definitions for obesity and abdominal obesity. Insulin resistance was assessed by the homeostasis model assessment of insulin resistance (HOMA(IR)) index. In multivariate linear regression with all adiposity measures in the model simultaneously, all adiposity measures were significantly (P < 0.0001) associated with insulin resistance: regression coefficients (±s.e.) were 0.31 (±0.02) for low liver attenuation, 0.27 (±0.02) for high pericardial adipose tissue, 0.27 (±0.02) for obesity, and 0.32 (±0.02) for abdominal obesity. We found significant differences (P = 0.003) between standardized liver attenuation and insulin resistance by ethnicity: regression coefficients per 1 s.d. increment were 0.10 ± 0.01 for whites, 0.11 ± 0.02 for Chinese, 0.08 ± 0.2 for blacks, and 0.14 ± 0.01 for Hispanics. Liver attenuation and pericardial adipose tissue were associated with insulin resistance, independent of BMI and waist circumference.
Ghoreishi, Mohammad; Abdi-Shahshahani, Mehdi; Peyman, Alireza; Pourazizi, Mohsen
2018-02-21
The aim of this study was to determine the correlation between ocular biometric parameters and sulcus-to-sulcus (STS) diameter. This was a cross-sectional study of preoperative ocular biometry data of patients who were candidates for phakic intraocular lens (IOL) surgery. Subjects underwent ocular biometry analysis, including refraction error evaluation using an autorefractor and Orbscan topography for white-to-white (WTW) corneal diameter and measurement. Pentacam was used to perform WTW corneal diameter and measurements of minimum and maximum keratometry (K). Measurements of STS and angle-to-angle (ATA) were obtained using a 50-MHz B-mode ultrasound device. Anterior optical coherence tomography was performed for anterior chamber depth measurement. Pearson's correlation test and stepwise linear regression analysis were used to find a model to predict STS. Fifty-eight eyes of 58 patients were enrolled. Mean age ± standard deviation of sample was 28.95 ± 6.04 years. The Pearson's correlation coefficient between STS with WTW, ATA, mean K was 0.383, 0.492, and - 0.353, respectively, which was statistically significant (all P < 0.001). Using stepwise linear regression analysis, there is a statistically significant association between STS with WTW (P = 0.011) and mean K (P = 0.025). The standardized coefficient was 0.323 and - 0.284 for WTW and mean K, respectively. The stepwise linear regression analysis equation was: (STS = 9.549 + 0.518 WTW - 0.083 mean K). Based on our result, given the correlation of STS with WTW and mean K and potential of direct and essay measurement of WTW and mean K, it seems that current IOL sizing protocols could be estimating with WTW and mean K.
Barrett, Bruce; Brown, Roger; Mundt, Marlon
2008-02-01
Evaluative health-related quality-of-life instruments used in clinical trials should be able to detect small but important changes in health status. Several approaches to minimal important difference (MID) and responsiveness have been developed. To compare anchor-based and distributional approaches to important difference and responsiveness for the Wisconsin Upper Respiratory Symptom Survey (WURSS), an illness-specific quality of life outcomes instrument. Participants with community-acquired colds self-reported daily using the WURSS-44. Distribution-based methods calculated standardized effect size (ES) and standard error of measurement (SEM). Anchor-based methods compared daily interval changes to global ratings of change, using: (1) standard MID methods based on correspondence to ratings of "a little better" or "somewhat better," and (2) two-level multivariate regression models. About 150 adults were monitored throughout their colds (1,681 sick days.): 88% were white, 69% were women, and 50% had completed college. The mean age was 35.5 years (SD = 14.7). WURSS scores increased 2.2 points from the first to second day, and then dropped by an average of 8.2 points per day from days 2 to 7. The SEM averaged 9.1 during these 7 days. Standard methods yielded a between day MID of 22 points. Regression models of MID projected 11.3-point daily changes. Dividing these estimates of small-but-important-difference by pooled SDs yielded coefficients of .425 for standard MID, .218 for regression model, .177 for SEM, and .157 for ES. These imply per-group sample sizes of 870 using ES, 616 for SEM, 302 for regression model, and 89 for standard MID, assuming alpha = .05, beta = .20 (80% power), and two-tailed testing. Distribution and anchor-based approaches provide somewhat different estimates of small but important difference, which in turn can have substantial impact on trial design.
Method of estimating flood-frequency parameters for streams in Idaho
Kjelstrom, L.C.; Moffatt, R.L.
1981-01-01
Skew coefficients for the log-Pearson type III distribution are generalized on the basis of some similarity of floods in the Snake River basin and other parts of Idaho. Generalized skew coefficients aid in shaping flood-frequency curves because skew coefficients computed from gaging stations having relatively short periods of peak flow records can be unreliable. Generalized skew coefficients can be obtained for a gaging station from one of three maps in this report. The map to be used depends on whether (1) snowmelt floods are domiant (generally when more than 20 percent of the drainage area is above 6,000 feet altitude), (2) rainstorm floods are dominant (generally when the mean altitude is less than 3,000 feet), or (3) either snowmelt or rainstorm floods can be the annual miximum discharge. For the latter case, frequency curves constructed using separate arrays of each type of runoff can be combined into one curve, which, for some stations, is significantly different than the frequency curve constructed using only annual maximum discharges. For 269 gaging stations, flood-frequency curves that include the generalized skew coefficients in the computation of the log-Pearson type III equation tend to fit the data better than previous analyses. Frequency curves for ungaged sites can be derived by estimating three statistics of the log-Pearson type III distribution. The mean and standard deviation of logarithms of annual maximum discharges are estimated by regression equations that use basin characteristics as independent variables. Skew coefficient estimates are the generalized skews. The log-Pearson type III equation is then applied with the three estimated statistics to compute the discharge at selected exceedance probabilities. Standard errors at the 2-percent exceedance probability range from 41 to 90 percent. (USGS)
Considerations for monitoring raptor population trends based on counts of migrants
Titus, K.; Fuller, M.R.; Ruos, J.L.; Meyburg, B-U.; Chancellor, R.D.
1989-01-01
Various problems were identified with standardized hawk count data as annually collected at six sites. Some of the hawk lookouts increased their hours of observation from 1979-1985, thereby confounding the total counts. Data recording and missing data hamper coding of data and their use with modern analytical techniques. Coefficients of variation among years in counts averaged about 40%. The advantages and disadvantages of various analytical techniques are discussed including regression, non-parametric rank correlation trend analysis, and moving averages.
ERIC Educational Resources Information Center
Quinino, Roberto C.; Reis, Edna A.; Bessegato, Lupercio F.
2013-01-01
This article proposes the use of the coefficient of determination as a statistic for hypothesis testing in multiple linear regression based on distributions acquired by beta sampling. (Contains 3 figures.)
Two SPSS programs for interpreting multiple regression results.
Lorenzo-Seva, Urbano; Ferrando, Pere J; Chico, Eliseo
2010-02-01
When multiple regression is used in explanation-oriented designs, it is very important to determine both the usefulness of the predictor variables and their relative importance. Standardized regression coefficients are routinely provided by commercial programs. However, they generally function rather poorly as indicators of relative importance, especially in the presence of substantially correlated predictors. We provide two user-friendly SPSS programs that implement currently recommended techniques and recent developments for assessing the relevance of the predictors. The programs also allow the user to take into account the effects of measurement error. The first program, MIMR-Corr.sps, uses a correlation matrix as input, whereas the second program, MIMR-Raw.sps, uses the raw data and computes bootstrap confidence intervals of different statistics. The SPSS syntax, a short manual, and data files related to this article are available as supplemental materials from http://brm.psychonomic-journals.org/content/supplemental.
A Regression Framework for Effect Size Assessments in Longitudinal Modeling of Group Differences
Feingold, Alan
2013-01-01
The use of growth modeling analysis (GMA)--particularly multilevel analysis and latent growth modeling--to test the significance of intervention effects has increased exponentially in prevention science, clinical psychology, and psychiatry over the past 15 years. Model-based effect sizes for differences in means between two independent groups in GMA can be expressed in the same metric (Cohen’s d) commonly used in classical analysis and meta-analysis. This article first reviews conceptual issues regarding calculation of d for findings from GMA and then introduces an integrative framework for effect size assessments that subsumes GMA. The new approach uses the structure of the linear regression model, from which effect sizes for findings from diverse cross-sectional and longitudinal analyses can be calculated with familiar statistics, such as the regression coefficient, the standard deviation of the dependent measure, and study duration. PMID:23956615
Deriving Hounsfield units using grey levels in cone beam computed tomography
Mah, P; Reeves, T E; McDavid, W D
2010-01-01
Objectives An in vitro study was performed to investigate the relationship between grey levels in dental cone beam CT (CBCT) and Hounsfield units (HU) in CBCT scanners. Methods A phantom containing 8 different materials of known composition and density was imaged with 11 different dental CBCT scanners and 2 medical CT scanners. The phantom was scanned under three conditions: phantom alone and phantom in a small and large water container. The reconstructed data were exported as Digital Imaging and Communications in Medicine (DICOM) and analysed with On Demand 3D® by Cybermed, Seoul, Korea. The relationship between grey levels and linear attenuation coefficients was investigated. Results It was demonstrated that a linear relationship between the grey levels and the attenuation coefficients of each of the materials exists at some “effective” energy. From the linear regression equation of the reference materials, attenuation coefficients were obtained for each of the materials and CT numbers in HU were derived using the standard equation. Conclusions HU can be derived from the grey levels in dental CBCT scanners using linear attenuation coefficients as an intermediate step. PMID:20729181
An improved multiple linear regression and data analysis computer program package
NASA Technical Reports Server (NTRS)
Sidik, S. M.
1972-01-01
NEWRAP, an improved version of a previous multiple linear regression program called RAPIER, CREDUC, and CRSPLT, allows for a complete regression analysis including cross plots of the independent and dependent variables, correlation coefficients, regression coefficients, analysis of variance tables, t-statistics and their probability levels, rejection of independent variables, plots of residuals against the independent and dependent variables, and a canonical reduction of quadratic response functions useful in optimum seeking experimentation. A major improvement over RAPIER is that all regression calculations are done in double precision arithmetic.
NASA Astrophysics Data System (ADS)
Keat, Sim Chong; Chun, Beh Boon; San, Lim Hwee; Jafri, Mohd Zubir Mat
2015-04-01
Climate change due to carbon dioxide (CO2) emissions is one of the most complex challenges threatening our planet. This issue considered as a great and international concern that primary attributed from different fossil fuels. In this paper, regression model is used for analyzing the causal relationship among CO2 emissions based on the energy consumption in Malaysia using time series data for the period of 1980-2010. The equations were developed using regression model based on the eight major sources that contribute to the CO2 emissions such as non energy, Liquefied Petroleum Gas (LPG), diesel, kerosene, refinery gas, Aviation Turbine Fuel (ATF) and Aviation Gasoline (AV Gas), fuel oil and motor petrol. The related data partly used for predict the regression model (1980-2000) and partly used for validate the regression model (2001-2010). The results of the prediction model with the measured data showed a high correlation coefficient (R2=0.9544), indicating the model's accuracy and efficiency. These results are accurate and can be used in early warning of the population to comply with air quality standards.
Fukushima, Romualdo S; Kerley, Monty S
2011-04-27
A nongravimetric acetyl bromide lignin (ABL) method was evaluated to quantify lignin concentration in a variety of plant materials. The traditional approach to lignin quantification required extraction of lignin with acidic dioxane and its isolation from each plant sample to construct a standard curve via spectrophotometric analysis. Lignin concentration was then measured in pre-extracted plant cell walls. However, this presented a methodological complexity because extraction and isolation procedures are lengthy and tedious, particularly if there are many samples involved. This work was targeted to simplify lignin quantification. Our hypothesis was that any lignin, regardless of its botanical origin, could be used to construct a standard curve for the purpose of determining lignin concentration in a variety of plants. To test our hypothesis, lignins were isolated from a range of diverse plants and, along with three commercial lignins, standard curves were built and compared among them. Slopes and intercepts derived from these standard curves were close enough to allow utilization of a mean extinction coefficient in the regression equation to estimate lignin concentration in any plant, independent of its botanical origin. Lignin quantification by use of a common regression equation obviates the steps of lignin extraction, isolation, and standard curve construction, which substantially expedites the ABL method. Acetyl bromide lignin method is a fast, convenient analytical procedure that may routinely be used to quantify lignin.
Prediction of ethanol in bottled Chinese rice wine by NIR spectroscopy
NASA Astrophysics Data System (ADS)
Ying, Yibin; Yu, Haiyan; Pan, Xingxiang; Lin, Tao
2006-10-01
To evaluate the applicability of non-invasive visible and near infrared (VIS-NIR) spectroscopy for determining ethanol concentration of Chinese rice wine in square brown glass bottle, transmission spectra of 100 bottled Chinese rice wine samples were collected in the spectral range of 350-1200 nm. Statistical equations were established between the reference data and VIS-NIR spectra by partial least squares (PLS) regression method. Performance of three kinds of mathematical treatment of spectra (original spectra, first derivative spectra and second derivative spectra) were also discussed. The PLS models of original spectra turned out better results, with higher correlation coefficient in calibration (R cal) of 0.89, lower root mean standard error of calibration (RMSEC) of 0.165, and lower root mean standard error of cross validation (RMSECV) of 0.179. Using original spectra, PLS models for ethanol concentration prediction were developed. The R cal and the correlation coefficient in validation (R val) were 0.928 and 0.875, respectively; and the RMSEC and the root mean standard error of validation (RMSEP) were 0.135 (%, v v -1) and 0.177 (%, v v -1), respectively. The results demonstrated that VIS-NIR spectroscopy could be used to predict ethanol concentration in bottled Chinese rice wine.
Grieve, Richard; Nixon, Richard; Thompson, Simon G
2010-01-01
Cost-effectiveness analyses (CEA) may be undertaken alongside cluster randomized trials (CRTs) where randomization is at the level of the cluster (for example, the hospital or primary care provider) rather than the individual. Costs (and outcomes) within clusters may be correlated so that the assumption made by standard bivariate regression models, that observations are independent, is incorrect. This study develops a flexible modeling framework to acknowledge the clustering in CEA that use CRTs. The authors extend previous Bayesian bivariate models for CEA of multicenter trials to recognize the specific form of clustering in CRTs. They develop new Bayesian hierarchical models (BHMs) that allow mean costs and outcomes, and also variances, to differ across clusters. They illustrate how each model can be applied using data from a large (1732 cases, 70 primary care providers) CRT evaluating alternative interventions for reducing postnatal depression. The analyses compare cost-effectiveness estimates from BHMs with standard bivariate regression models that ignore the data hierarchy. The BHMs show high levels of cost heterogeneity across clusters (intracluster correlation coefficient, 0.17). Compared with standard regression models, the BHMs yield substantially increased uncertainty surrounding the cost-effectiveness estimates, and altered point estimates. The authors conclude that ignoring clustering can lead to incorrect inferences. The BHMs that they present offer a flexible modeling framework that can be applied more generally to CEA that use CRTs.
NASA Astrophysics Data System (ADS)
Zhan, Liwei; Li, Chengwei
2017-02-01
A hybrid PSO-SVM-based model is proposed to predict the friction coefficient between aircraft tire and coating. The presented hybrid model combines a support vector machine (SVM) with particle swarm optimization (PSO) technique. SVM has been adopted to solve regression problems successfully. Its regression accuracy is greatly related to optimizing parameters such as the regularization constant C , the parameter gamma γ corresponding to RBF kernel and the epsilon parameter \\varepsilon in the SVM training procedure. However, the friction coefficient which is predicted based on SVM has yet to be explored between aircraft tire and coating. The experiment reveals that drop height and tire rotational speed are the factors affecting friction coefficient. Bearing in mind, the friction coefficient can been predicted using the hybrid PSO-SVM-based model by the measured friction coefficient between aircraft tire and coating. To compare regression accuracy, a grid search (GS) method and a genetic algorithm (GA) are used to optimize the relevant parameters (C , γ and \\varepsilon ), respectively. The regression accuracy could be reflected by the coefficient of determination ({{R}2} ). The result shows that the hybrid PSO-RBF-SVM-based model has better accuracy compared with the GS-RBF-SVM- and GA-RBF-SVM-based models. The agreement of this model (PSO-RBF-SVM) with experiment data confirms its good performance.
Zhao, Yu Xi; Xie, Ping; Sang, Yan Fang; Wu, Zi Yi
2018-04-01
Hydrological process evaluation is temporal dependent. Hydrological time series including dependence components do not meet the data consistency assumption for hydrological computation. Both of those factors cause great difficulty for water researches. Given the existence of hydrological dependence variability, we proposed a correlationcoefficient-based method for significance evaluation of hydrological dependence based on auto-regression model. By calculating the correlation coefficient between the original series and its dependence component and selecting reasonable thresholds of correlation coefficient, this method divided significance degree of dependence into no variability, weak variability, mid variability, strong variability, and drastic variability. By deducing the relationship between correlation coefficient and auto-correlation coefficient in each order of series, we found that the correlation coefficient was mainly determined by the magnitude of auto-correlation coefficient from the 1 order to p order, which clarified the theoretical basis of this method. With the first-order and second-order auto-regression models as examples, the reasonability of the deduced formula was verified through Monte-Carlo experiments to classify the relationship between correlation coefficient and auto-correlation coefficient. This method was used to analyze three observed hydrological time series. The results indicated the coexistence of stochastic and dependence characteristics in hydrological process.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Ozaki, Toshiro, E-mail: ganronbun@amail.plala.or.jp; Seki, Hiroshi; Shiina, Makoto
2009-09-15
The purpose of the present study was to elucidate a method for predicting the intrahepatic arteriovenous shunt rate from computed tomography (CT) images and biochemical data, instead of from arterial perfusion scintigraphy, because adverse exacerbated systemic effects may be induced in cases where a high shunt rate exists. CT and arterial perfusion scintigraphy were performed in patients with liver metastases from gastric or colorectal cancer. Biochemical data and tumor marker levels of 33 enrolled patients were measured. The results were statistically verified by multiple regression analysis. The total metastatic hepatic tumor volume (V{sub metastasized}), residual hepatic parenchyma volume (V{sub residual};more » calculated from CT images), and biochemical data were treated as independent variables; the intrahepatic arteriovenous (IHAV) shunt rate (calculated from scintigraphy) was treated as a dependent variable. The IHAV shunt rate was 15.1 {+-} 11.9%. Based on the correlation matrixes, the best correlation coefficient of 0.84 was established between the IHAV shunt rate and V{sub metastasized} (p < 0.01). In the multiple regression analysis with the IHAV shunt rate as the dependent variable, the coefficient of determination (R{sup 2}) was 0.75, which was significant at the 0.1% level with two significant independent variables (V{sub metastasized} and V{sub residual}). The standardized regression coefficients ({beta}) of V{sub metastasized} and V{sub residual} were significant at the 0.1 and 5% levels, respectively. Based on this result, we can obtain a predicted value of IHAV shunt rate (p < 0.001) using CT images. When a high shunt rate was predicted, beneficial and consistent clinical monitoring can be initiated in, for example, hepatic arterial infusion chemotherapy.« less
The relationship between body mass index and uric acid: a study on Japanese adult twins.
Tanaka, Kentaro; Ogata, Soshiro; Tanaka, Haruka; Omura, Kayoko; Honda, Chika; Hayakawa, Kazuo
2015-09-01
The present study aimed to investigate the association between body mass index (BMI) and uric acid (UA) using the twin study methodology to adjust for genetic factors. The association between BMI and UA was investigated in a cross-sectional study using data from both monozygotic and dizygotic twins registered at the Osaka University Center for Twin Research and the Osaka University Graduate School of Medicine. From January 2011 to March 2014, 422 individuals participated in the health examination. We measured height, weight, age, BMI, lifestyle habits (Breslow's Health Practice Index), serum UA, and serum creatinine. To investigate the association between UA and BMI with adjustment for the clustering of a twin within a pair, individual-level analyses were performed using generalized linear mixed models (GLMMs). To investigate an association with adjustment for genetic and family environmental factors, twin-pair difference values analyses were performed. In all analysis, BMI was associated with UA in men and women. Using the GLMMs, standardized regression coefficients were 0.194 (95 % confidence interval: 0.016-0.373) in men and 0.186 (95 % confidence interval: 0.071-0.302) in women. Considering twin-pair difference value analyses, standardized regression coefficients were 0.333 (95 % confidence interval: 0.072-0.594) in men and 0.314 (95 % confidence interval: 0.151-0.477) in women. The present study shows that BMI was significantly associated with UA, after adjusting for both genetic and familial environment factors in both men and women.
[Analysis of Cr in soil by LIBS based on conical spatial confinement of plasma].
Lin, Yong-Zeng; Yao, Ming-Yin; Chen, Tian-Bing; Li, Wen-Bing; Zheng, Mei-Lan; Xu, Xue-Hong; Tu, Jian-Ping; Liu, Mu-Hua
2013-11-01
The present study is to improve the sensitivity of detection and reduce the limit of detection in detecting heavy metal of soil by laser induced breakdown spectroscopy (LIBS). The Cr element of national standard soil was regarded as the research object. In the experiment, a conical cavity with small diameter end of 20 mm and large diameter end of 45 mm respectively was installed below the focusing lens near the experiment sample to mainly confine the signal transmitted by plasma and to some extent to confine the plasma itself in the LIBS setup. In detecting Cr I 425.44 nm, the beast delay time gained from experiment is 1.3 micros, and the relative standard deviation is below 10%. Compared with the setup of non-spatial confinement, the spectral intensity of Cr in the soil sample was enhanced more than 7%. Calibration curve was established in the Cr concentration range from 60 to 400 microg x g(-1). Under the condition of spatial confinement, the liner regression coefficient and the limit of detection were 0.997 71 and 18.85 microg x g(-1) respectively, however, the regression coefficient and the limit of detection were 0.991 22 and 36.99 microg x g(-1) without spatial confinement. So, this shows that conical spatial confinement can/improve the sensitivity of detection and enhance the spectral intensity. And it is a good auxiliary function in detecting Cr in the soil by laser induced breakdown spectroscopy.
Detection of Cutting Tool Wear using Statistical Analysis and Regression Model
NASA Astrophysics Data System (ADS)
Ghani, Jaharah A.; Rizal, Muhammad; Nuawi, Mohd Zaki; Haron, Che Hassan Che; Ramli, Rizauddin
2010-10-01
This study presents a new method for detecting the cutting tool wear based on the measured cutting force signals. A statistical-based method called Integrated Kurtosis-based Algorithm for Z-Filter technique, called I-kaz was used for developing a regression model and 3D graphic presentation of I-kaz 3D coefficient during machining process. The machining tests were carried out using a CNC turning machine Colchester Master Tornado T4 in dry cutting condition. A Kistler 9255B dynamometer was used to measure the cutting force signals, which were transmitted, analyzed, and displayed in the DasyLab software. Various force signals from machining operation were analyzed, and each has its own I-kaz 3D coefficient. This coefficient was examined and its relationship with flank wear lands (VB) was determined. A regression model was developed due to this relationship, and results of the regression model shows that the I-kaz 3D coefficient value decreases as tool wear increases. The result then is used for real time tool wear monitoring.
Zhi, Ruicong; Zhao, Lei; Xie, Nan; Wang, Houyin; Shi, Bolin; Shi, Jingye
2016-01-13
A framework of establishing standard reference scale (texture) is proposed by multivariate statistical analysis according to instrumental measurement and sensory evaluation. Multivariate statistical analysis is conducted to rapidly select typical reference samples with characteristics of universality, representativeness, stability, substitutability, and traceability. The reasonableness of the framework method is verified by establishing standard reference scale of texture attribute (hardness) with Chinese well-known food. More than 100 food products in 16 categories were tested using instrumental measurement (TPA test), and the result was analyzed with clustering analysis, principal component analysis, relative standard deviation, and analysis of variance. As a result, nine kinds of foods were determined to construct the hardness standard reference scale. The results indicate that the regression coefficient between the estimated sensory value and the instrumentally measured value is significant (R(2) = 0.9765), which fits well with Stevens's theory. The research provides reliable a theoretical basis and practical guide for quantitative standard reference scale establishment on food texture characteristics.
SCI model structure determination program (OSR) user's guide. [optimal subset regression
NASA Technical Reports Server (NTRS)
1979-01-01
The computer program, OSR (Optimal Subset Regression) which estimates models for rotorcraft body and rotor force and moment coefficients is described. The technique used is based on the subset regression algorithm. Given time histories of aerodynamic coefficients, aerodynamic variables, and control inputs, the program computes correlation between various time histories. The model structure determination is based on these correlations. Inputs and outputs of the program are given.
Ecotoxicology of phenylphosphonothioates.
Francis, B M; Hansen, L G; Fukuto, T R; Lu, P Y; Metcalf, R L
1980-01-01
The phenylphosphonothioate insecticides EPN and leptophos, and several analogs, were evaluated with respect to their delayed neurotoxic effects in hens and their environmental behavior in a terrestrial-aquatic model ecosystem. Acute toxicity to insects was highly correlated with sigma sigma of the substituted phenyl group (regression coefficient r = -0.91) while acute toxicity to mammals was slightly less well correlated (regression coefficient r = -0.71), and neurotoxicity was poorly correlated with sigma sigma (regression coefficient r = -0.35). Both EPN and leptophos were markedly more persistent and bioaccumulative in the model ecosystem than parathion. Desbromoleptophos, a contaminant and metabolite of leptophos, was seen to be a highly stable and persistent terminal residue of leptophos. PMID:6159210
Kitagawa, Yasuhisa; Teramoto, Tamio; Daida, Hiroyuki
2012-01-01
We evaluated the impact of adherence to preferable behavior on serum lipid control assessed by a self-reported questionnaire in high-risk patients taking pravastatin for primary prevention of coronary artery disease. High-risk patients taking pravastatin were followed for 2 years. Questionnaire surveys comprising 21 questions, including 18 questions concerning awareness of health, and current status of diet, exercise, and drug therapy, were conducted at baseline and after 1 year. Potential domains were established by factor analysis from the results of questionnaires, and adherence scores were calculated in each domain. The relationship between adherence scores and lipid values during the 1-year treatment period was analyzed by each domain using multiple regression analysis. A total of 5,792 patients taking pravastatin were included in the analysis. Multiple regression analysis showed a significant correlation in terms of "Intake of high fat/cholesterol/sugar foods" (regression coefficient -0.58, p=0.0105) and "Adherence to instructions for drug therapy" (regression coefficient -6.61, p<0.0001). Low-density lipoprotein cholesterol (LDL-C) values were significantly lower in patients who had an increase in the adherence score in the "Awareness of health" domain compared with those with a decreased score. There was a significant correlation between high-density lipoprotein (HDL-C) values and "Awareness of health" (regression coefficient 0.26; p= 0.0037), "Preferable dietary behaviors" (regression coefficient 0.75; p<0.0001), and "Exercise" (regression coefficient 0.73; p= 0.0002). Similar relations were seen with triglycerides. In patients who have a high awareness of their health, a positive attitude toward lipid-lowering treatment including diet, exercise, and high adherence to drug therapy, is related with favorable overall lipid control even in patients under treatment with pravastatin.
Ham, Joo-ho; Park, Hun-Young; Kim, Youn-ho; Bae, Sang-kon; Ko, Byung-hoon
2017-01-01
[Purpose] The purpose of this study was to develop a regression model to estimate the heart rate at the lactate threshold (HRLT) and the heart rate at the ventilatory threshold (HRVT) using the heart rate threshold (HRT), and to test the validity of the regression model. [Methods] We performed a graded exercise test with a treadmill in 220 normal individuals (men: 112, women: 108) aged 20–59 years. HRT, HRLT, and HRVT were measured in all subjects. A regression model was developed to estimate HRLT and HRVT using HRT with 70% of the data (men: 79, women: 76) through randomization (7:3), with the Bernoulli trial. The validity of the regression model developed with the remaining 30% of the data (men: 33, women: 32) was also examined. [Results] Based on the regression coefficient, we found that the independent variable HRT was a significant variable in all regression models. The adjusted R2 of the developed regression models averaged about 70%, and the standard error of estimation of the validity test results was 11 bpm, which is similar to that of the developed model. [Conclusion] These results suggest that HRT is a useful parameter for predicting HRLT and HRVT. PMID:29036765
Ham, Joo-Ho; Park, Hun-Young; Kim, Youn-Ho; Bae, Sang-Kon; Ko, Byung-Hoon; Nam, Sang-Seok
2017-09-30
The purpose of this study was to develop a regression model to estimate the heart rate at the lactate threshold (HRLT) and the heart rate at the ventilatory threshold (HRVT) using the heart rate threshold (HRT), and to test the validity of the regression model. We performed a graded exercise test with a treadmill in 220 normal individuals (men: 112, women: 108) aged 20-59 years. HRT, HRLT, and HRVT were measured in all subjects. A regression model was developed to estimate HRLT and HRVT using HRT with 70% of the data (men: 79, women: 76) through randomization (7:3), with the Bernoulli trial. The validity of the regression model developed with the remaining 30% of the data (men: 33, women: 32) was also examined. Based on the regression coefficient, we found that the independent variable HRT was a significant variable in all regression models. The adjusted R2 of the developed regression models averaged about 70%, and the standard error of estimation of the validity test results was 11 bpm, which is similar to that of the developed model. These results suggest that HRT is a useful parameter for predicting HRLT and HRVT. ©2017 The Korean Society for Exercise Nutrition
DOE Office of Scientific and Technical Information (OSTI.GOV)
Tang, Kunkun, E-mail: ktg@illinois.edu; Inria Bordeaux – Sud-Ouest, Team Cardamom, 200 avenue de la Vieille Tour, 33405 Talence; Congedo, Pietro M.
The Polynomial Dimensional Decomposition (PDD) is employed in this work for the global sensitivity analysis and uncertainty quantification (UQ) of stochastic systems subject to a moderate to large number of input random variables. Due to the intimate connection between the PDD and the Analysis of Variance (ANOVA) approaches, PDD is able to provide a simpler and more direct evaluation of the Sobol' sensitivity indices, when compared to the Polynomial Chaos expansion (PC). Unfortunately, the number of PDD terms grows exponentially with respect to the size of the input random vector, which makes the computational cost of standard methods unaffordable formore » real engineering applications. In order to address the problem of the curse of dimensionality, this work proposes essentially variance-based adaptive strategies aiming to build a cheap meta-model (i.e. surrogate model) by employing the sparse PDD approach with its coefficients computed by regression. Three levels of adaptivity are carried out in this paper: 1) the truncated dimensionality for ANOVA component functions, 2) the active dimension technique especially for second- and higher-order parameter interactions, and 3) the stepwise regression approach designed to retain only the most influential polynomials in the PDD expansion. During this adaptive procedure featuring stepwise regressions, the surrogate model representation keeps containing few terms, so that the cost to resolve repeatedly the linear systems of the least-squares regression problem is negligible. The size of the finally obtained sparse PDD representation is much smaller than the one of the full expansion, since only significant terms are eventually retained. Consequently, a much smaller number of calls to the deterministic model is required to compute the final PDD coefficients.« less
Zhao, Yang; Zhang, Xue Qing; Bian, Xiao Dong
2018-01-01
To investigate the early supplementary processes of fishre sources in the Bohai Sea, the geographically weighted regression (GWR) was introduced to the habitat suitability index (HSI) model. The Bohai Sea larval Japanese Halfbeak HSI GWR model was established with four environmental variables, including sea surface temperature (SST), sea surface salinity (SSS), water depth (DEP), and chlorophyll a concentration (Chl a). Results of the simulation showed that the four variables had different performances in August 2015. SST and Chl a were global variables, and had little impacts on HSI, with the regression coefficients of -0.027 and 0.006, respectively. SSS and DEP were local variables, and had larger impacts on HSI, while the average values of absolute values of their regression coefficients were 0.075 and 0.129, respectively. In the central Bohai Sea, SSS showed a negative correlation with HSI, and the most negative correlation coefficient was -0.3. In contrast, SSS was correlated positively but weakly with HSI in the three bays of Bohai Sea, and the largest correlation coefficient was 0.1. In particular, DEP and HSI were negatively correlated in the entire Bohai Sea, while they were more negatively correlated in the three bays of Bohai than in the central Bohai Sea, and the most negative correlation coefficient was -0.16 in the three bays. The Poisson regression coefficient of the HSI GWR model was 0.705, consistent with field measurements. Therefore, it could provide a new method for the research on fish habitats in the future.
Portable visible and near-infrared spectrophotometer for triglyceride measurements.
Kobayashi, Takanori; Kato, Yukiko Hakariya; Tsukamoto, Megumi; Ikuta, Kazuyoshi; Sakudo, Akikazu
2009-01-01
An affordable and portable machine is required for the practical use of visible and near-infrared (Vis-NIR) spectroscopy. A portable fruit tester comprising a Vis-NIR spectrophotometer was modified for use in the transmittance mode and employed to quantify triglyceride levels in serum in combination with a chemometric analysis. Transmittance spectra collected in the 600- to 1100-nm region were subjected to a partial least-squares regression analysis and leave-out cross-validation to develop a chemometrics model for predicting triglyceride concentrations in serum. The model yielded a coefficient of determination in cross-validation (R2VAL) of 0.7831 with a standard error of cross-validation (SECV) of 43.68 mg/dl. The detection limit of the model was 148.79 mg/dl. Furthermore, masked samples predicted by the model yielded a coefficient of determination in prediction (R2PRED) of 0.6856 with a standard error of prediction (SEP) and detection limit of 61.54 and 159.38 mg/dl, respectively. The portable Vis-NIR spectrophotometer may prove convenient for the measurement of triglyceride concentrations in serum, although before practical use there remain obstacles, which are discussed.
Estimated Probability of a Cervical Spine Injury During an ISS Mission
NASA Technical Reports Server (NTRS)
Brooker, John E.; Weaver, Aaron S.; Myers, Jerry G.
2013-01-01
Introduction: The Integrated Medical Model (IMM) utilizes historical data, cohort data, and external simulations as input factors to provide estimates of crew health, resource utilization and mission outcomes. The Cervical Spine Injury Module (CSIM) is an external simulation designed to provide the IMM with parameter estimates for 1) a probability distribution function (PDF) of the incidence rate, 2) the mean incidence rate, and 3) the standard deviation associated with the mean resulting from injury/trauma of the neck. Methods: An injury mechanism based on an idealized low-velocity blunt impact to the superior posterior thorax of an ISS crewmember was used as the simulated mission environment. As a result of this impact, the cervical spine is inertially loaded from the mass of the head producing an extension-flexion motion deforming the soft tissues of the neck. A multibody biomechanical model was developed to estimate the kinematic and dynamic response of the head-neck system from a prescribed acceleration profile. Logistic regression was performed on a dataset containing AIS1 soft tissue neck injuries from rear-end automobile collisions with published Neck Injury Criterion values producing an injury transfer function (ITF). An injury event scenario (IES) was constructed such that crew 1 is moving through a primary or standard translation path transferring large volume equipment impacting stationary crew 2. The incidence rate for this IES was estimated from in-flight data and used to calculate the probability of occurrence. The uncertainty in the model input factors were estimated from representative datasets and expressed in terms of probability distributions. A Monte Carlo Method utilizing simple random sampling was employed to propagate both aleatory and epistemic uncertain factors. Scatterplots and partial correlation coefficients (PCC) were generated to determine input factor sensitivity. CSIM was developed in the SimMechanics/Simulink environment with a Monte Carlo wrapper (MATLAB) used to integrate the components of the module. Results: The probability of generating an AIS1 soft tissue neck injury from the extension/flexion motion induced by a low-velocity blunt impact to the superior posterior thorax was fitted with a lognormal PDF with mean 0.26409, standard deviation 0.11353, standard error of mean 0.00114, and 95% confidence interval [0.26186, 0.26631]. Combining the probability of an AIS1 injury with the probability of IES occurrence was fitted with a Johnson SI PDF with mean 0.02772, standard deviation 0.02012, standard error of mean 0.00020, and 95% confidence interval [0.02733, 0.02812]. The input factor sensitivity analysis in descending order was IES incidence rate, ITF regression coefficient 1, impactor initial velocity, ITF regression coefficient 2, and all others (equipment mass, crew 1 body mass, crew 2 body mass) insignificant. Verification and Validation (V&V): The IMM V&V, based upon NASA STD 7009, was implemented which included an assessment of the data sets used to build CSIM. The documentation maintained includes source code comments and a technical report. The software code and documentation is under Subversion configuration management. Kinematic validation was performed by comparing the biomechanical model output to established corridors.
The NIST Quantitative Infrared Database
Chu, P. M.; Guenther, F. R.; Rhoderick, G. C.; Lafferty, W. J.
1999-01-01
With the recent developments in Fourier transform infrared (FTIR) spectrometers it is becoming more feasible to place these instruments in field environments. As a result, there has been enormous increase in the use of FTIR techniques for a variety of qualitative and quantitative chemical measurements. These methods offer the possibility of fully automated real-time quantitation of many analytes; therefore FTIR has great potential as an analytical tool. Recently, the U.S. Environmental Protection Agency (U.S.EPA) has developed protocol methods for emissions monitoring using both extractive and open-path FTIR measurements. Depending upon the analyte, the experimental conditions and the analyte matrix, approximately 100 of the hazardous air pollutants (HAPs) listed in the 1990 U.S.EPA Clean Air Act amendment (CAAA) can be measured. The National Institute of Standards and Technology (NIST) has initiated a program to provide quality-assured infrared absorption coefficient data based on NIST prepared primary gas standards. Currently, absorption coefficient data has been acquired for approximately 20 of the HAPs. For each compound, the absorption coefficient spectrum was calculated using nine transmittance spectra at 0.12 cm−1 resolution and the Beer’s law relationship. The uncertainties in the absorption coefficient data were estimated from the linear regressions of the transmittance data and considerations of other error sources such as the nonlinear detector response. For absorption coefficient values greater than 1 × 10−4 μmol/mol)−1 m−1 the average relative expanded uncertainty is 2.2 %. This quantitative infrared database is currently an ongoing project at NIST. Additional spectra will be added to the database as they are acquired. Our current plans include continued data acquisition of the compounds listed in the CAAA, as well as the compounds that contribute to global warming and ozone depletion.
DFT study on oxidation of HS(CH2) m SH ( m = 1-8) in oxidative desulfurization
NASA Astrophysics Data System (ADS)
Song, Y. Z.; Song, J. J.; Zhao, T. T.; Chen, C. Y.; He, M.; Du, J.
2016-06-01
Density functional theory was employed for calculation of HS(CH2) m SH ( m = 1-8) and its derivatives at B3LYP method at 6-31++g ( d, p) level. Using eigenvalues of LUMO and HOMO for HS(CH2) m SH, the standard electrode potentials were estimated by a stepwise multiple regression techniques (MLR), and obtained as E° = 1.500 + 7.167 × 10-3 HOMO-0.229 LUMO with high correlation coefficients of 0.973 and F values of 43.973.
Mixed conditional logistic regression for habitat selection studies.
Duchesne, Thierry; Fortin, Daniel; Courbin, Nicolas
2010-05-01
1. Resource selection functions (RSFs) are becoming a dominant tool in habitat selection studies. RSF coefficients can be estimated with unconditional (standard) and conditional logistic regressions. While the advantage of mixed-effects models is recognized for standard logistic regression, mixed conditional logistic regression remains largely overlooked in ecological studies. 2. We demonstrate the significance of mixed conditional logistic regression for habitat selection studies. First, we use spatially explicit models to illustrate how mixed-effects RSFs can be useful in the presence of inter-individual heterogeneity in selection and when the assumption of independence from irrelevant alternatives (IIA) is violated. The IIA hypothesis states that the strength of preference for habitat type A over habitat type B does not depend on the other habitat types also available. Secondly, we demonstrate the significance of mixed-effects models to evaluate habitat selection of free-ranging bison Bison bison. 3. When movement rules were homogeneous among individuals and the IIA assumption was respected, fixed-effects RSFs adequately described habitat selection by simulated animals. In situations violating the inter-individual homogeneity and IIA assumptions, however, RSFs were best estimated with mixed-effects regressions, and fixed-effects models could even provide faulty conclusions. 4. Mixed-effects models indicate that bison did not select farmlands, but exhibited strong inter-individual variations in their response to farmlands. Less than half of the bison preferred farmlands over forests. Conversely, the fixed-effect model simply suggested an overall selection for farmlands. 5. Conditional logistic regression is recognized as a powerful approach to evaluate habitat selection when resource availability changes. This regression is increasingly used in ecological studies, but almost exclusively in the context of fixed-effects models. Fitness maximization can imply differences in trade-offs among individuals, which can yield inter-individual differences in selection and lead to departure from IIA. These situations are best modelled with mixed-effects models. Mixed-effects conditional logistic regression should become a valuable tool for ecological research.
Spatial Autocorrelation Approaches to Testing Residuals from Least Squares Regression.
Chen, Yanguang
2016-01-01
In geo-statistics, the Durbin-Watson test is frequently employed to detect the presence of residual serial correlation from least squares regression analyses. However, the Durbin-Watson statistic is only suitable for ordered time or spatial series. If the variables comprise cross-sectional data coming from spatial random sampling, the test will be ineffectual because the value of Durbin-Watson's statistic depends on the sequence of data points. This paper develops two new statistics for testing serial correlation of residuals from least squares regression based on spatial samples. By analogy with the new form of Moran's index, an autocorrelation coefficient is defined with a standardized residual vector and a normalized spatial weight matrix. Then by analogy with the Durbin-Watson statistic, two types of new serial correlation indices are constructed. As a case study, the two newly presented statistics are applied to a spatial sample of 29 China's regions. These results show that the new spatial autocorrelation models can be used to test the serial correlation of residuals from regression analysis. In practice, the new statistics can make up for the deficiencies of the Durbin-Watson test.
Olson, Scott A.; with a section by Veilleux, Andrea G.
2014-01-01
This report provides estimates of flood discharges at selected annual exceedance probabilities (AEPs) for streamgages in and adjacent to Vermont and equations for estimating flood discharges at AEPs of 50-, 20-, 10-, 4-, 2-, 1-, 0.5-, and 0.2-percent (recurrence intervals of 2-, 5-, 10-, 25-, 50-, 100-, 200-, and 500-years, respectively) for ungaged, unregulated, rural streams in Vermont. The equations were developed using generalized least-squares regression. Flood-frequency and drainage-basin characteristics from 145 streamgages were used in developing the equations. The drainage-basin characteristics used as explanatory variables in the regression equations include drainage area, percentage of wetland area, and the basin-wide mean of the average annual precipitation. The average standard errors of prediction for estimating the flood discharges at the 50-, 20-, 10-, 4-, 2-, 1-, 0.5-, and 0.2-percent AEP with these equations are 34.9, 36.0, 38.7, 42.4, 44.9, 47.3, 50.7, and 55.1 percent, respectively. Flood discharges at selected AEPs for streamgages were computed by using the Expected Moments Algorithm. To improve estimates of the flood discharges for given exceedance probabilities at streamgages in Vermont, a new generalized skew coefficient was developed. The new generalized skew for the region is a constant, 0.44. The mean square error of the generalized skew coefficient is 0.078. This report describes a technique for using results from the regression equations to adjust an AEP discharge computed from a streamgage record. This report also describes a technique for using a drainage-area adjustment to estimate flood discharge at a selected AEP for an ungaged site upstream or downstream from a streamgage. The final regression equations and the flood-discharge frequency data used in this study will be available in StreamStats. StreamStats is a World Wide Web application providing automated regression-equation solutions for user-selected sites on streams.
Sensitivity Analysis of the Integrated Medical Model for ISS Programs
NASA Technical Reports Server (NTRS)
Goodenow, D. A.; Myers, J. G.; Arellano, J.; Boley, L.; Garcia, Y.; Saile, L.; Walton, M.; Kerstman, E.; Reyes, D.; Young, M.
2016-01-01
Sensitivity analysis estimates the relative contribution of the uncertainty in input values to the uncertainty of model outputs. Partial Rank Correlation Coefficient (PRCC) and Standardized Rank Regression Coefficient (SRRC) are methods of conducting sensitivity analysis on nonlinear simulation models like the Integrated Medical Model (IMM). The PRCC method estimates the sensitivity using partial correlation of the ranks of the generated input values to each generated output value. The partial part is so named because adjustments are made for the linear effects of all the other input values in the calculation of correlation between a particular input and each output. In SRRC, standardized regression-based coefficients measure the sensitivity of each input, adjusted for all the other inputs, on each output. Because the relative ranking of each of the inputs and outputs is used, as opposed to the values themselves, both methods accommodate the nonlinear relationship of the underlying model. As part of the IMM v4.0 validation study, simulations are available that predict 33 person-missions on ISS and 111 person-missions on STS. These simulated data predictions feed the sensitivity analysis procedures. The inputs to the sensitivity procedures include the number occurrences of each of the one hundred IMM medical conditions generated over the simulations and the associated IMM outputs: total quality time lost (QTL), number of evacuations (EVAC), and number of loss of crew lives (LOCL). The IMM team will report the results of using PRCC and SRRC on IMM v4.0 predictions of the ISS and STS missions created as part of the external validation study. Tornado plots will assist in the visualization of the condition-related input sensitivities to each of the main outcomes. The outcomes of this sensitivity analysis will drive review focus by identifying conditions where changes in uncertainty could drive changes in overall model output uncertainty. These efforts are an integral part of the overall verification, validation, and credibility review of IMM v4.0.
Guo, Yin; Liu, Li Juan; Tang, Ping; Feng, Yi; Lv, Yan Yun; Wu, Min; Xu, Liang; Jonas, Jost B
2018-03-01
To assess the development and enlargement of the parapapillary gamma zone in school children. This school-based prospective longitudinal study included Chinese children attending grade 1 in 2011 and returning for yearly follow-up examinations until 2016. These examinations consisted of a comprehensive ocular examination with biometry and color fundus photographs. The parents underwent a standardized interview. The parapapillary gamma zone was defined as the area with visible sclera at the temporal optic disc margin, and the optic disc itself was measured on fundus photographs. The study included 294 children (mean age in 2016, 11.4 ± 0.5 years [range, 10-13 years]; mean axial length, 24.1 ± 1.1 mm [range, 21.13-27.29 mm]). In multivariate analysis, larger increases in the gamma zone area during the study period were correlated (coefficient of determination for bivariate analysis [r2], r2 = 0.69) with larger increases in the vertical-to-horizontal disc diameter ratios (P < 0.001; standardized regression coefficient beta [beta], 0.53; nonstandardized regression coefficient B [B], 4.05; 95% confidence intervals [CI], 3.37-4.73), larger axial elongation (P < 0.001; beta, 0.32; B, 0.37; 95% CI, 0.26-0.47), a larger vertical disc diameter at baseline (P < 0.001; beta, 0.22; B, 0.98; 95% CI, 0.62-1.33), a larger gamma zone area at baseline (P < 0.001; beta, 0.14; B, 0.41; 95% CI, 0.17-0.64), and more time spent indoors studying (P = 0.015; beta, 0.10; B, 0.09; 95% CI, 0.02-0.17). The development and enlargement of the gamma zone in the temporal parapapillary region were associated with an optic disc rotation around the vertical disc axis as indicated by an increasing vertical-to-horizontal disc diameter ratio. These morphologic findings fit with the notion of a backward pull of the temporal peripapillary sclera through the optic nerve dura mater in axially elongated eyes.
Confidence Intervals for Squared Semipartial Correlation Coefficients: The Effect of Nonnormality
ERIC Educational Resources Information Center
Algina, James; Keselman, H. J.; Penfield, Randall D.
2010-01-01
The increase in the squared multiple correlation coefficient ([delta]R[superscript 2]) associated with a variable in a regression equation is a commonly used measure of importance in regression analysis. Algina, Keselman, and Penfield found that intervals based on asymptotic principles were typically very inaccurate, even though the sample size…
Thermal requirements of Dermanyssus gallinae (De Geer, 1778) (Acari: Dermanyssidae).
Tucci, Edna Clara; do Prado, Angelo P; de Araújo, Raquel Pires
2008-01-01
The thermal requirements for development of Dermanyssus gallinae were studied under laboratory conditions at 15, 20, 25, 30 and 35 degrees C, a 12h photoperiod and 60-85% RH. The thermal requirements for D. gallinae were as follows. Preoviposition: base temperature 3.4 degrees C, thermal constant (k) 562.85 degree-hours, determination coefficient (R(2)) 0.59, regression equation: Y= -0.006035 + 0.001777x. Egg: base temperature 10.60 degrees C, thermal constant (k) 689.65 degree-hours, determination coefficient (R(2)) 0.94, regression equation: Y= -0.015367 + 0.001450x. Larva: base temperature 9.82 degrees C, thermal constant (k) 464.91 degree-hours, determination coefficient (R(2)) 0.87, regression equation: Y= -0.021123 + 0.002151x. Protonymph: base temperature 10.17 degrees C, thermal constant (k) 504.49 degree-hours, determination coefficient (R(2)) 0.90, regression equation: Y= -0.020152 + 0.001982x. Deutonymph: base temperature 11.80 degrees C, thermal constant (k) 501.11 degree-hours, determination coefficient (R(2)) 0.99, regression equation: Y= -0.023555 + 0.001996x. The results obtained showed that 15 to 42 generations of Dermanyssus gallinae may occur during the year in the State of São Paulo, as estimated based on isotherm charts. Dermanyssus gallinae may develop continually in the State of São Paulo, with a population decrease in the winter. There were differences between the developmental stages of D. gallinae in relation to thermal requirements.
Biostatistics Series Module 6: Correlation and Linear Regression.
Hazra, Avijit; Gogtay, Nithya
2016-01-01
Correlation and linear regression are the most commonly used techniques for quantifying the association between two numeric variables. Correlation quantifies the strength of the linear relationship between paired variables, expressing this as a correlation coefficient. If both variables x and y are normally distributed, we calculate Pearson's correlation coefficient ( r ). If normality assumption is not met for one or both variables in a correlation analysis, a rank correlation coefficient, such as Spearman's rho (ρ) may be calculated. A hypothesis test of correlation tests whether the linear relationship between the two variables holds in the underlying population, in which case it returns a P < 0.05. A 95% confidence interval of the correlation coefficient can also be calculated for an idea of the correlation in the population. The value r 2 denotes the proportion of the variability of the dependent variable y that can be attributed to its linear relation with the independent variable x and is called the coefficient of determination. Linear regression is a technique that attempts to link two correlated variables x and y in the form of a mathematical equation ( y = a + bx ), such that given the value of one variable the other may be predicted. In general, the method of least squares is applied to obtain the equation of the regression line. Correlation and linear regression analysis are based on certain assumptions pertaining to the data sets. If these assumptions are not met, misleading conclusions may be drawn. The first assumption is that of linear relationship between the two variables. A scatter plot is essential before embarking on any correlation-regression analysis to show that this is indeed the case. Outliers or clustering within data sets can distort the correlation coefficient value. Finally, it is vital to remember that though strong correlation can be a pointer toward causation, the two are not synonymous.
Biostatistics Series Module 6: Correlation and Linear Regression
Hazra, Avijit; Gogtay, Nithya
2016-01-01
Correlation and linear regression are the most commonly used techniques for quantifying the association between two numeric variables. Correlation quantifies the strength of the linear relationship between paired variables, expressing this as a correlation coefficient. If both variables x and y are normally distributed, we calculate Pearson's correlation coefficient (r). If normality assumption is not met for one or both variables in a correlation analysis, a rank correlation coefficient, such as Spearman's rho (ρ) may be calculated. A hypothesis test of correlation tests whether the linear relationship between the two variables holds in the underlying population, in which case it returns a P < 0.05. A 95% confidence interval of the correlation coefficient can also be calculated for an idea of the correlation in the population. The value r2 denotes the proportion of the variability of the dependent variable y that can be attributed to its linear relation with the independent variable x and is called the coefficient of determination. Linear regression is a technique that attempts to link two correlated variables x and y in the form of a mathematical equation (y = a + bx), such that given the value of one variable the other may be predicted. In general, the method of least squares is applied to obtain the equation of the regression line. Correlation and linear regression analysis are based on certain assumptions pertaining to the data sets. If these assumptions are not met, misleading conclusions may be drawn. The first assumption is that of linear relationship between the two variables. A scatter plot is essential before embarking on any correlation-regression analysis to show that this is indeed the case. Outliers or clustering within data sets can distort the correlation coefficient value. Finally, it is vital to remember that though strong correlation can be a pointer toward causation, the two are not synonymous. PMID:27904175
Martínez Gila, Diego Manuel; Cano Marchal, Pablo; Gómez Ortega, Juan; Gámez García, Javier
2018-03-25
Normally the olive oil quality is assessed by chemical analysis according to international standards. These norms define chemical and organoleptic markers, and depending on the markers, the olive oil can be labelled as lampante, virgin, or extra virgin olive oil (EVOO), the last being an indicator of top quality. The polyphenol content is related to EVOO organoleptic features, and different scientific works have studied the positive influence that these compounds have on human health. The works carried out in this paper are focused on studying relations between the polyphenol content in olive oil samples and its spectral response in the near infrared spectra. In this context, several acquisition parameters have been assessed to optimize the measurement process within the virgin olive oil production process. The best regression model reached a mean error value of 156.14 mg/kg in leave one out cross validation, and the higher regression coefficient was 0.81 through holdout validation.
Cano Marchal, Pablo; Gómez Ortega, Juan; Gámez García, Javier
2018-01-01
Normally the olive oil quality is assessed by chemical analysis according to international standards. These norms define chemical and organoleptic markers, and depending on the markers, the olive oil can be labelled as lampante, virgin, or extra virgin olive oil (EVOO), the last being an indicator of top quality. The polyphenol content is related to EVOO organoleptic features, and different scientific works have studied the positive influence that these compounds have on human health. The works carried out in this paper are focused on studying relations between the polyphenol content in olive oil samples and its spectral response in the near infrared spectra. In this context, several acquisition parameters have been assessed to optimize the measurement process within the virgin olive oil production process. The best regression model reached a mean error value of 156.14 mg/kg in leave one out cross validation, and the higher regression coefficient was 0.81 through holdout validation. PMID:29587403
Harada, Sei; Hirayama, Akiyoshi; Chan, Queenie; Kurihara, Ayako; Fukai, Kota; Iida, Miho; Kato, Suzuka; Sugiyama, Daisuke; Kuwabara, Kazuyo; Takeuchi, Ayano; Akiyama, Miki; Okamura, Tomonori; Ebbels, Timothy M D; Elliott, Paul; Tomita, Masaru; Sato, Asako; Suzuki, Chizuru; Sugimoto, Masahiro; Soga, Tomoyoshi; Takebayashi, Toru
2018-01-01
Cohort studies with metabolomics data are becoming more widespread, however, large-scale studies involving 10,000s of participants are still limited, especially in Asian populations. Therefore, we started the Tsuruoka Metabolomics Cohort Study enrolling 11,002 community-dwelling adults in Japan, and using capillary electrophoresis-mass spectrometry (CE-MS) and liquid chromatography-mass spectrometry. The CE-MS method is highly amenable to absolute quantification of polar metabolites, however, its reliability for large-scale measurement is unclear. The aim of this study is to examine reproducibility and validity of large-scale CE-MS measurements. In addition, the study presents absolute concentrations of polar metabolites in human plasma, which can be used in future as reference ranges in a Japanese population. Metabolomic profiling of 8,413 fasting plasma samples were completed using CE-MS, and 94 polar metabolites were structurally identified and quantified. Quality control (QC) samples were injected every ten samples and assessed throughout the analysis. Inter- and intra-batch coefficients of variation of QC and participant samples, and technical intraclass correlation coefficients were estimated. Passing-Bablok regression of plasma concentrations by CE-MS on serum concentrations by standard clinical chemistry assays was conducted for creatinine and uric acid. In QC samples, coefficient of variation was less than 20% for 64 metabolites, and less than 30% for 80 metabolites out of the 94 metabolites. Inter-batch coefficient of variation was less than 20% for 81 metabolites. Estimated technical intraclass correlation coefficient was above 0.75 for 67 metabolites. The slope of Passing-Bablok regression was estimated as 0.97 (95% confidence interval: 0.95, 0.98) for creatinine and 0.95 (0.92, 0.96) for uric acid. Compared to published data from other large cohort measurement platforms, reproducibility of metabolites common to the platforms was similar to or better than in the other studies. These results show that our CE-MS platform is suitable for conducting large-scale epidemiological studies.
Factor Scores, Structure Coefficients, and Communality Coefficients
ERIC Educational Resources Information Center
Goodwyn, Fara
2012-01-01
This paper presents heuristic explanations of factor scores, structure coefficients, and communality coefficients. Common misconceptions regarding these topics are clarified. In addition, (a) the regression (b) Bartlett, (c) Anderson-Rubin, and (d) Thompson methods for calculating factor scores are reviewed. Syntax necessary to execute all four…
Garabedian, Stephen P.
1986-01-01
A nonlinear, least-squares regression technique for the estimation of ground-water flow model parameters was applied to the regional aquifer underlying the eastern Snake River Plain, Idaho. The technique uses a computer program to simulate two-dimensional, steady-state ground-water flow. Hydrologic data for the 1980 water year were used to calculate recharge rates, boundary fluxes, and spring discharges. Ground-water use was estimated from irrigated land maps and crop consumptive-use figures. These estimates of ground-water withdrawal, recharge rates, and boundary flux, along with leakance, were used as known values in the model calibration of transmissivity. Leakance values were adjusted between regression solutions by comparing model-calculated to measured spring discharges. In other simulations, recharge and leakance also were calibrated as prior-information regression parameters, which limits the variation of these parameters using a normalized standard error of estimate. Results from a best-fit model indicate a wide areal range in transmissivity from about 0.05 to 44 feet squared per second and in leakance from about 2.2x10 -9 to 6.0 x 10 -8 feet per second per foot. Along with parameter values, model statistics also were calculated, including the coefficient of correlation between calculated and observed head (0.996), the standard error of the estimates for head (40 feet), and the parameter coefficients of variation (about 10-40 percent). Additional boundary flux was added in some areas during calibration to achieve proper fit to ground-water flow directions. Model fit improved significantly when areas that violated model assumptions were removed. It also improved slightly when y-direction (northwest-southeast) transmissivity values were larger than x-direction (northeast-southwest) transmissivity values. The model was most sensitive to changes in recharge, and in some areas, to changes in transmissivity, particularly near the spring discharge area from Milner Dam to King Hill.
Jaworski, Mariusz; Panczyk, Mariusz; Cedro, Małgorzata; Kucharska, Alicja
2018-01-01
Adherence by diabetic patients to dietary recommendations is important for effective therapy. Considering patients' expectations in case of diet is significant in this regard. The aim of this paper was to analyze the relationship between selected independent variables (eg, regular blood glucose testing) and patients' adherence to dietary recommendations, bearing in mind that the degree of disease acceptance might play a mediation role. A cross-sectional study was conducted in 91 patients treated for type 2 diabetes mellitus in a public medical facility. Paper-and-pencil interviewing was administered ahead of the planned visit with a diabetes specialist. Two measures were applied in the study: the Acceptance and Action Diabetes Questionnaire and the Patient Diet Adherence in Diabetes Scale. Additionally, data related to sociodemographic characteristics, lifestyle-related factors, and the course of the disease (management, incidence of complications, and dietician's supervision) were also collected. The regression method was used in the analysis, and Cohen's methodology was used to estimate partial mediation. Significance of the mediation effect was assessed by the Goodman test. P -values of <0.05 were considered statistically significant. Patients' non-adherence to dietary recommendations was related to a low level of disease acceptance (standardized regression coefficient =-0.266; P =0.010). Moreover, failure to perform regular blood glucose testing was associated with a lack of disease acceptance (standardized regression coefficient =-0.455; P =0.000). However, the lack of regular blood glucose testing and low level of acceptance had only partially negative impacts on adherence to dietary recommendations (Goodman mediation test, Z =1.939; P =0.054). This dependence was not seen in patients treated with diet and concomitant oral medicines and/or insulin therapy. Effective dietary education should include activities promoting a more positive attitude toward the disease. This may be obtained by individual counseling, respecting the patient's needs, and focus on regular blood glucose testing.
Liu, Yi; Luo, Bi-Ru
2016-11-20
To analyze the factors affecting maternal physical activities at different stages among pregnant women. Self-designed questionnaires were used to investigate the physical activities of women in different stages, including 650 in the first, 650 in the second, and 750 in the third trimester of pregnancy. The factors affecting maternal physical activities were analyzed using the structural equation model that comprised 4 latent variables (attitude, norm, behavioral attention and behavior) with observed variables that matched the latent variables. The participants ranged from 18 to 35 years of age. The women and their husbands, but not their mothers or mothers-in-law, were all well educated. The caregiver during pregnancy was mostly the mother followed by the husband. For traveling, the women in the first, second and third trimesters preferred walking, bus, and personal escort, respectively; the main physical activity was walking in all trimesters, and the women in different trimester were mostly sedentary, a greater intensity of exercise was associated with less exercise time. Structural equation modeling (SEM) analysis showed that the physical activities of pregnant women was affected by behavioral intention (with standardized regression coefficient of 0.372); attitude and subjective norms affected physical activity by indirectly influencing the behavior intention (standardized regression coefficients of 0.140 and 0.669). The pregnant women in different stages have inappropriate physical activities with insufficient exercise time and intensity. The subjective norms affects the physical activities of the pregnant women by influencing their attitudes and behavior intention indirectly, suggesting the need of health education of the caregivers during pregnancy.
Multiple linear regression analysis
NASA Technical Reports Server (NTRS)
Edwards, T. R.
1980-01-01
Program rapidly selects best-suited set of coefficients. User supplies only vectors of independent and dependent data and specifies confidence level required. Program uses stepwise statistical procedure for relating minimal set of variables to set of observations; final regression contains only most statistically significant coefficients. Program is written in FORTRAN IV for batch execution and has been implemented on NOVA 1200.
Estimation of stature using hand and foot dimensions in Slovak adults.
Uhrová, Petra; Beňuš, Radoslav; Masnicová, Soňa; Obertová, Zuzana; Kramárová, Daniela; Kyselicová, Klaudia; Dörnhöferová, Michaela; Bodoriková, Silvia; Neščáková, Eva
2015-03-01
Hand and foot dimensions used for stature estimation help to formulate a biological profile in the process of personal identification. Morphological variability of hands and feet shows the importance of generating population-specific equations to estimate stature. The stature, hand length, hand breadth, foot length and foot breadth of 250 young Slovak males and females, aged 18-24 years, were measured according to standard anthropometric procedures. The data were statistically analyzed using independent t-test for sex and bilateral differences. Pearson correlation coefficient was used for assessing relationship between stature and hand/foot parameters, and subsequently linear regression analysis was used to estimate stature. The results revealed significant sex differences in hand and foot dimensions as well as in stature (p<0.05). There was a positive and statistically significant correlation between stature and all measurements in both sexes (p<0.01). The highest correlation coefficient was found for foot length in males (r=0.71) as well as in females (r=0.63). Regression equations were computed separately for each sex. The accuracy of stature prediction ranged from ±4.6 to ±6.1cm. The results of this study indicate that hand and foot dimension can be used to estimate stature for Slovak for the purpose of forensic field. The regression equations can be of use for stature estimation particularly in cases of dismembered bodies. Copyright © 2014 Elsevier Ireland Ltd. All rights reserved.
Lack of transferability between two automated immunoassays for serum IGF-I measurement.
Gomez-Gomez, Carolina; Iglesias, Eva M; Barallat, Jaume; Moreno, Fernando; Biosca, Carme; Pastor, Mari-Cruz; Granada, Maria-Luisa
2014-01-01
IGF-I is a clinically relevant protein in the diagnosis and monitoring of treatment of growth disor- ders. The Growth Hormone Research Society and the International IGF Research Society have encouraged the adoption of a universal calibration for immunoassays to improve standardization of IGF-I measurements, but currently commercial assays are calibrated either against the old WHO IRR 87/518 or the new WHO 02/254. We compared two IGF-I immunochemiluminescent assays: IMMULITE® 2000 (Siemens) and LIAISON® (DiaSorin), which differ in their standardization, and verified their precision according to quality specifications based on biological variation and their linear range. 62 patient serum samples were analyzed for both assays and compared according to standards of the Clinical and Laboratory Standards Institute (CLSI), EP9-A2-IR. Precision was verified according to CLSI EP15- A2. Optimal coefficient of variation (CVo) and desirable coefficient of variation (CVd) for IGF-I assays were calculated as quality specifications based on the biological variability, in order to assess if the interassay analytical CV (CVa1) in the two methods were appropriate. Two dilution series using the 1st WHO International Standard (WHO IS) for IGF-I 02/254 were used to verify and compare the linearity range. The regression analysis showed constant and proportional differences for serum samples (slope b = 0.8115 (CI 95% CI; 0.7575-0.8556); intercept a = 33.6873 (95% CI: 23.3613-44.0133) between assays and similar pro- portional differences for WHO IS 02/254 standard dilutions series (slope b = 0.8024 (CI 95% CI; 0.7560-0.8616); intercept a = 6.9623 (95% CI: -2.0819-18.4383) between assays. Within-laboratory coefficients of variation for low and high levels were 2.82% and 3.80% for IMMULITE® 2000 and 3.58% and 2.14% for LIAISON®, respecttively. IGF-I concentrations measured by both assays are not transferable. The results emphasize the need to express IGF-I concentrations in standard deviation score (SDS) according to a matched normal population of the same age and gender. Within-laboratory precision in both methods met quality specifications derived from biological variation.
Estimating the effects of wages on obesity.
Kim, DaeHwan; Leigh, John Paul
2010-05-01
To estimate the effects of wages on obesity and body mass. Data on household heads, aged 20 to 65 years, with full-time jobs, were drawn from the Panel Study of Income Dynamics for 2003 to 2007. The Panel Study of Income Dynamics is a nationally representative sample. Instrumental variables (IV) for wages were created using knowledge of computer software and state legal minimum wages. Least squares (linear regression) with corrected standard errors were used to estimate the equations. Statistical tests revealed both instruments were strong and tests for over-identifying restrictions were favorable. Wages were found to be predictive (P < 0.05) of obesity and body mass in regressions both before and after applying IVs. Coefficient estimates suggested stronger effects in the IV models. Results are consistent with the hypothesis that low wages increase obesity prevalence and body mass.
Determination of the optimal level for combining area and yield estimates
NASA Technical Reports Server (NTRS)
Bauer, M. E. (Principal Investigator); Hixson, M. M.; Jobusch, C. D.
1981-01-01
Several levels of obtaining both area and yield estimates of corn and soybeans in Iowa were considered: county, refined strata, refined/split strata, crop reporting district, and state. Using the CCEA model form and smoothed weather data, regression coefficients at each level were derived to compute yield and its variance. Variances were also computed with stratum level. The variance of the yield estimates was largest at the state and smallest at the county level for both crops. The refined strata had somewhat larger variances than those associated with the refined/split strata and CRD. For production estimates, the difference in standard deviations among levels was not large for corn, but for soybeans the standard deviation at the state level was more than 50% greater than for the other levels. The refined strata had the smallest standard deviations. The county level was not considered in evaluation of production estimates due to lack of county area variances.
Kumar, K Vasanth; Porkodi, K; Rocha, F
2008-01-15
A comparison of linear and non-linear regression method in selecting the optimum isotherm was made to the experimental equilibrium data of basic red 9 sorption by activated carbon. The r(2) was used to select the best fit linear theoretical isotherm. In the case of non-linear regression method, six error functions namely coefficient of determination (r(2)), hybrid fractional error function (HYBRID), Marquardt's percent standard deviation (MPSD), the average relative error (ARE), sum of the errors squared (ERRSQ) and sum of the absolute errors (EABS) were used to predict the parameters involved in the two and three parameter isotherms and also to predict the optimum isotherm. Non-linear regression was found to be a better way to obtain the parameters involved in the isotherms and also the optimum isotherm. For two parameter isotherm, MPSD was found to be the best error function in minimizing the error distribution between the experimental equilibrium data and predicted isotherms. In the case of three parameter isotherm, r(2) was found to be the best error function to minimize the error distribution structure between experimental equilibrium data and theoretical isotherms. The present study showed that the size of the error function alone is not a deciding factor to choose the optimum isotherm. In addition to the size of error function, the theory behind the predicted isotherm should be verified with the help of experimental data while selecting the optimum isotherm. A coefficient of non-determination, K(2) was explained and was found to be very useful in identifying the best error function while selecting the optimum isotherm.
NASA Astrophysics Data System (ADS)
Tang, Kunkun; Congedo, Pietro M.; Abgrall, Rémi
2016-06-01
The Polynomial Dimensional Decomposition (PDD) is employed in this work for the global sensitivity analysis and uncertainty quantification (UQ) of stochastic systems subject to a moderate to large number of input random variables. Due to the intimate connection between the PDD and the Analysis of Variance (ANOVA) approaches, PDD is able to provide a simpler and more direct evaluation of the Sobol' sensitivity indices, when compared to the Polynomial Chaos expansion (PC). Unfortunately, the number of PDD terms grows exponentially with respect to the size of the input random vector, which makes the computational cost of standard methods unaffordable for real engineering applications. In order to address the problem of the curse of dimensionality, this work proposes essentially variance-based adaptive strategies aiming to build a cheap meta-model (i.e. surrogate model) by employing the sparse PDD approach with its coefficients computed by regression. Three levels of adaptivity are carried out in this paper: 1) the truncated dimensionality for ANOVA component functions, 2) the active dimension technique especially for second- and higher-order parameter interactions, and 3) the stepwise regression approach designed to retain only the most influential polynomials in the PDD expansion. During this adaptive procedure featuring stepwise regressions, the surrogate model representation keeps containing few terms, so that the cost to resolve repeatedly the linear systems of the least-squares regression problem is negligible. The size of the finally obtained sparse PDD representation is much smaller than the one of the full expansion, since only significant terms are eventually retained. Consequently, a much smaller number of calls to the deterministic model is required to compute the final PDD coefficients.
Grijalva-Eternod, Carlos S; Wells, Jonathan C K; Girma, Tsinuel; Kæstel, Pernille; Admassu, Bitiya; Friis, Henrik; Andersen, Gregers S
2015-09-01
A midupper arm circumference (MUAC) <115 mm and weight-for-height z score (WHZ) or weight-for-length z score (WLZ) less than -3, all of which are recommended to identify severe wasting in children, often identify different children. The reasons behind this poor agreement are not well understood. We investigated the association between these 2 anthropometric indexes and body composition to help understand why they identify different children as wasted. We analyzed weight, length, MUAC, fat-mass (FM), and fat-free mass (FFM) data from 2470 measurements from 595 healthy Ethiopian infants obtained at birth and at 1.5, 2.5, 3.5, 4.5, and 6 mo of age. We derived WLZs by using 2006 WHO growth standards. We derived length-adjusted FM and FFM values as unexplained residuals after regressing each FM and FFM against length. We used a correlation analysis to assess associations between length, FFM, and FM (adjusted and nonadjusted for length) and the MUAC and WLZ and a multivariable regression analysis to assess the independent variability of length and length-adjusted FM and FFM with either the MUAC or the WLZ as the outcome. At all ages, length showed consistently strong positive correlations with the MUAC but not with the WLZ. Adjustment for length reduced observed correlation coefficients of FM and FFM with the MUAC but increased those for the WLZ. At all ages, both length-adjusted FM and FFM showed an independent association with the WLZ and MUAC with higher regression coefficients for the WLZ. Conversely, length showed greater regression coefficients for the MUAC. At all ages, the MUAC was shown to be more influenced than was the WLZ by the FM variability relative to the FFM variability. The MUAC and WLZ have different associations with body composition, and length influences these associations differently. Our results suggest that the WLZ is a good marker of tissue masses independent of length. The MUAC acts more as a composite index of poor growth indexing jointly tissue masses and length. This trial was registered at www.controlled-trials.com as ISRCTN46718296. © 2015 American Society for Nutrition.
Liu, Jinbao; Han, Jichang; Zhang, Yang; Wang, Huanyuan; Kong, Hui; Shi, Lei
2018-06-05
The storage of soil organic carbon (SOC) should improve soil fertility. Conventional determination of SOC is expensive and tedious. Visible-near infrared reflectance spectroscopy is a practical and cost-effective approach that has been successfully used SOC concentration. Soil spectral inversion model could quickly and efficiently determine SOC content. This paper presents a study dealing with SOC estimation through the combination of soil spectroscopy and stepwise multiple linear regression (SMLR), partial least squares regression (PLSR), principal component regression (PCR). Spectral measurements for 106 soil samples were acquired using an ASD FieldSpec 4 standard-res spectroradiometer (350-2500 nm). Six types of transformations and three regression methods were applied to build for the quantification of different parent materials development soil. The results show that (1)the basaltic volcanic clastics development of SOC spectral response bands located in 500 nm, 800 nm; Trachyte spectral response of the soil quality, and the volcanic clastics development at 405 nm, 465 nm, 575 nm, 1105 nm. (2) Basaltic volcanic debris soil development, first deviation of maximum correlation coefficient is 0.8898; thick surface soil of the development of rocky volcanic debris from bottom reflectivity logarithm of first deviation of maximum correlation coefficient is 0.9029. (3) Soil organic matter content of basaltic volcanic clastics development optimal prediction model based on spectral reflectance inverse logarithms of first deviation of SMLR. Independent variable number is 7, Rv 2 = 0.9720, RMSEP = 2.0590, sig = 0.003. Trachyte qualitative volcanic clastics developed soil organic matter content of the optimal prediction model based on spectral reflectance inverse logarithms of first deviation of PLSR. Model number of the independent variables Pc = 5, Rc = 0.9872, Rc 2 = 0.9745, RMSEC = 0.4821, SEC = 0.4906, forecasts determine coefficient Rv 2 = 0.9702, RMSEP = 0.9563, SEP = 0.9711, Bias = 0.0637. Copyright © 2018 Elsevier B.V. All rights reserved.
Ahearn, Elizabeth A.
2010-01-01
Multiple linear regression equations for determining flow-duration statistics were developed to estimate select flow exceedances ranging from 25- to 99-percent for six 'bioperiods'-Salmonid Spawning (November), Overwinter (December-February), Habitat Forming (March-April), Clupeid Spawning (May), Resident Spawning (June), and Rearing and Growth (July-October)-in Connecticut. Regression equations also were developed to estimate the 25- and 99-percent flow exceedances without reference to a bioperiod. In total, 32 equations were developed. The predictive equations were based on regression analyses relating flow statistics from streamgages to GIS-determined basin and climatic characteristics for the drainage areas of those streamgages. Thirty-nine streamgages (and an additional 6 short-term streamgages and 28 partial-record sites for the non-bioperiod 99-percent exceedance) in Connecticut and adjacent areas of neighboring States were used in the regression analysis. Weighted least squares regression analysis was used to determine the predictive equations; weights were assigned based on record length. The basin characteristics-drainage area, percentage of area with coarse-grained stratified deposits, percentage of area with wetlands, mean monthly precipitation (November), mean seasonal precipitation (December, January, and February), and mean basin elevation-are used as explanatory variables in the equations. Standard errors of estimate of the 32 equations ranged from 10.7 to 156 percent with medians of 19.2 and 55.4 percent to predict the 25- and 99-percent exceedances, respectively. Regression equations to estimate high and median flows (25- to 75-percent exceedances) are better predictors (smaller variability of the residual values around the regression line) than the equations to estimate low flows (less than 75-percent exceedance). The Habitat Forming (March-April) bioperiod had the smallest standard errors of estimate, ranging from 10.7 to 20.9 percent. In contrast, the Rearing and Growth (July-October) bioperiod had the largest standard errors, ranging from 30.9 to 156 percent. The adjusted coefficient of determination of the equations ranged from 77.5 to 99.4 percent with medians of 98.5 and 90.6 percent to predict the 25- and 99-percent exceedances, respectively. Descriptive information on the streamgages used in the regression, measured basin and climatic characteristics, and estimated flow-duration statistics are provided in this report. Flow-duration statistics and the 32 regression equations for estimating flow-duration statistics in Connecticut are stored on the U.S. Geological Survey World Wide Web application ?StreamStats? (http://water.usgs.gov/osw/streamstats/index.html). The regression equations developed in this report can be used to produce unbiased estimates of select flow exceedances statewide.
ERIC Educational Resources Information Center
Mugrage, Beverly; And Others
Three ridge regression solutions are compared with ordinary least squares regression and with principal components regression using all components. Ridge regression, particularly the Lawless-Wang solution, out-performed ordinary least squares regression and the principal components solution on the criteria of stability of coefficient and closeness…
Li, Dan; Jiang, Jia; Han, Dandan; Yu, Xinyu; Wang, Kun; Zang, Shuang; Lu, Dayong; Yu, Aimin; Zhang, Ziwei
2016-04-05
A new method is proposed for measuring the antioxidant capacity by electron spin resonance spectroscopy based on the loss of electron spin resonance signal after Cu(2+) is reduced to Cu(+) with antioxidant. Cu(+) was removed by precipitation in the presence of SCN(-). The remaining Cu(2+) was coordinated with diethyldithiocarbamate, extracted into n-butanol and determined by electron spin resonance spectrometry. Eight standards widely used in antioxidant capacity determination, including Trolox, ascorbic acid, ferulic acid, rutin, caffeic acid, quercetin, chlorogenic acid, and gallic acid were investigated. The standard curves for determining the eight standards were plotted, and results showed that the linear regression correlation coefficients were all high enough (r > 0.99). Trolox equivalent antioxidant capacity values for the antioxidant standards were calculated, and a good correlation (r > 0.94) between the values obtained by the present method and cupric reducing antioxidant capacity method was observed. The present method was applied to the analysis of real fruit samples and the evaluation of the antioxidant capacity of these fruits.
Cruz, Antonio M; Barr, Cameron; Puñales-Pozo, Elsa
2008-01-01
This research's main goals were to build a predictor for a turnaround time (TAT) indicator for estimating its values and use a numerical clustering technique for finding possible causes of undesirable TAT values. The following stages were used: domain understanding, data characterisation and sample reduction and insight characterisation. Building the TAT indicator multiple linear regression predictor and clustering techniques were used for improving corrective maintenance task efficiency in a clinical engineering department (CED). The indicator being studied was turnaround time (TAT). Multiple linear regression was used for building a predictive TAT value model. The variables contributing to such model were clinical engineering department response time (CE(rt), 0.415 positive coefficient), stock service response time (Stock(rt), 0.734 positive coefficient), priority level (0.21 positive coefficient) and service time (0.06 positive coefficient). The regression process showed heavy reliance on Stock(rt), CE(rt) and priority, in that order. Clustering techniques revealed the main causes of high TAT values. This examination has provided a means for analysing current technical service quality and effectiveness. In doing so, it has demonstrated a process for identifying areas and methods of improvement and a model against which to analyse these methods' effectiveness.
Shrinkage regression-based methods for microarray missing value imputation.
Wang, Hsiuying; Chiu, Chia-Chun; Wu, Yi-Ching; Wu, Wei-Sheng
2013-01-01
Missing values commonly occur in the microarray data, which usually contain more than 5% missing values with up to 90% of genes affected. Inaccurate missing value estimation results in reducing the power of downstream microarray data analyses. Many types of methods have been developed to estimate missing values. Among them, the regression-based methods are very popular and have been shown to perform better than the other types of methods in many testing microarray datasets. To further improve the performances of the regression-based methods, we propose shrinkage regression-based methods. Our methods take the advantage of the correlation structure in the microarray data and select similar genes for the target gene by Pearson correlation coefficients. Besides, our methods incorporate the least squares principle, utilize a shrinkage estimation approach to adjust the coefficients of the regression model, and then use the new coefficients to estimate missing values. Simulation results show that the proposed methods provide more accurate missing value estimation in six testing microarray datasets than the existing regression-based methods do. Imputation of missing values is a very important aspect of microarray data analyses because most of the downstream analyses require a complete dataset. Therefore, exploring accurate and efficient methods for estimating missing values has become an essential issue. Since our proposed shrinkage regression-based methods can provide accurate missing value estimation, they are competitive alternatives to the existing regression-based methods.
The microcomputer scientific software series 2: general linear model--regression.
Harold M. Rauscher
1983-01-01
The general linear model regression (GLMR) program provides the microcomputer user with a sophisticated regression analysis capability. The output provides a regression ANOVA table, estimators of the regression model coefficients, their confidence intervals, confidence intervals around the predicted Y-values, residuals for plotting, a check for multicollinearity, a...
Residualization is not the answer: Rethinking how to address multicollinearity.
York, Richard
2012-11-01
Here I show that a commonly used procedure to address problems stemming from collinearity and multicollinearity among independent variables in regression analysis, "residualization", leads to biased coefficient and standard error estimates and does not address the fundamental problem of collinearity, which is a lack of information. I demonstrate this using visual representations of collinearity, hypothetical experimental designs, and analyses of both artificial and real world data. I conclude by noting the importance of examining methodological practices to ensure that their validity can be established based on rational criteria. Copyright © 2012 Elsevier Inc. All rights reserved.
Specifics of soil temperature under winter oilseed rape canopy
NASA Astrophysics Data System (ADS)
Krčmářová, Jana; Středa, Tomáš; Pokorný, Radovan
2014-09-01
The aim of this study was to evaluate the course of soil temperature under the winter oilseed rape canopy and to determine relationships between soil temperature, air temperature and partly soil moisture. In addition, the aim was to describe the dependence by means of regression equations usable for pests and pathogens prediction, crop development, and yields models. The measurement of soil and near the ground air temperatures was performed at the experimental field Žabiče (South Moravia, the Czech Republic). The course of temperature was determined under or in the winter oilseed rape canopy during spring growth season in the course of four years (2010 - 2012 and 2014). In all years, the standard varieties (Petrol, Sherpa) were grown, in 2014 the semi-dwarf variety PX104 was added. Automatic soil sensors were positioned at three depths (0.05, 0.10 and 0.20 m) under soil surface, air temperature sensors in 0.05 m above soil surfaces. The course of soil temperature differs significantly between standard (Sherpa and Petrol) and semi-dwarf (PX104) varieties. Results of the cross correlation analysis showed, that the best interrelationships between air and soil temperature were achieved in 2 hours delay for the soil temperature in 0.05 m, 4 hour delay for 0.10 m and 7 hour delay for 0.20 m for standard varieties. For semi-dwarf variety, this delay reached 6 hour for the soil temperature in 0.05 m, 7 hour delay for 0.10 m and 11 hour for 0.20 m. After the time correction, the determination coefficient (R2) reached values from 0.67 to 0.95 for 0.05 m, 0.50 to 0.84 for 0.10 m in variety Sherpa during all experimental years. For variety PX104 this coefficient reached values from 0.51 to 0.72 in 0.05 m depth and from 0.39 to 0.67 in 0.10 m depth in the year 2014. The determination coefficient in the 0.20 m depth was lower for both varieties; its values were from 0.15 to 0.65 in variety Sherpa. In variety PX104 the values of R2 from 0.23 to 0.57 were determined. When using multiple regressions with quadratic spacing (modelling of hourly soil temperature based on the hourly near surface air temperature and hourly soil moisture in the 0.10-0.40 m profile), the difference between the measured and modelled soil temperatures in the depth of 0.05 m was -3.92 to 3.99°C. The regression equation paired with alternative agrometeorological instruments enables relatively accurate modelling of soil temperatures (R2 = 0.95).
Bell, Michelle L.; de Sousa Zanotti Stagliorio Coelho, Micheline; Leon Guo, Yue-Liang; Guo, Yuming; Goodman, Patrick; Hashizume, Masahiro; Honda, Yasushi; Kim, Ho; Lavigne, Eric; Michelozzi, Paola; Hilario Nascimento Saldiva, Paulo; Schwartz, Joel; Scortichini, Matteo; Sera, Francesco; Tobias, Aurelio; Tong, Shilu; Wu, Chang-fu; Zanobetti, Antonella; Zeka, Ariana; Gasparrini, Antonio
2017-01-01
Background: In many places, daily mortality has been shown to increase after days with particularly high or low temperatures, but such daily time-series studies cannot identify whether such increases reflect substantial life shortening or short-term displacement of deaths (harvesting). Objectives: To clarify this issue, we estimated the association between annual mortality and annual summaries of heat and cold in 278 locations from 12 countries. Methods: Indices of annual heat and cold were used as predictors in regressions of annual mortality in each location, allowing for trends over time and clustering of annual count anomalies by country and pooling estimates using meta-regression. We used two indices of annual heat and cold based on preliminary standard daily analyses: a) mean annual degrees above/below minimum mortality temperature (MMT), and b) estimated fractions of deaths attributed to heat and cold. The first index was simpler and matched previous related research; the second was added because it allowed the interpretation that coefficients equal to 0 and 1 are consistent with none (0) or all (1) of the deaths attributable in daily analyses being displaced by at least 1 y. Results: On average, regression coefficients of annual mortality on heat and cold mean degrees were 1.7% [95% confidence interval (CI): 0.3, 3.1] and 1.1% (95% CI: 0.6, 1.6) per degree, respectively, and daily attributable fractions were 0.8 (95% CI: 0.2, 1.3) and 1.1 (95% CI: 0.9, 1.4). The proximity of the latter coefficients to 1.0 provides evidence that most deaths found attributable to heat and cold in daily analyses were brought forward by at least 1 y. Estimates were broadly robust to alternative model assumptions. Conclusions: These results provide strong evidence that most deaths associated in daily analyses with heat and cold are displaced by at least 1 y. https://doi.org/10.1289/EHP1756 PMID:29084393
Dennison, Jessica L; Stack, Jim; Beatty, Stephen; Nolan, John M
2013-11-01
This study compares in vivo measurements of macular pigment (MP) obtained using customized heterochromatic flicker photometry (cHFP; Macular Metrics Densitometer(™)), dual-wavelength fundus autofluorescence (Heidelberg Spectralis(®) HRA + OCT MultiColor) and single-wavelength fundus reflectance (Zeiss Visucam(®) 200). MP was measured in one eye of 62 subjects on each device. Data from 49 subjects (79%) was suitable for analysis. Agreement between the Densitometer and Spectralis was investigated at various eccentricities using a variety of quantitative and graphical methods, including: Pearson correlation coefficient to measure degree of scatter (precision), accuracy coefficient, concordance correlation coefficient (ccc), paired t-test, scatter and Bland-Altman plots. The relationship between max MP from the Visucam and central MP from the Spectralis and Densitometer was investigated using regression methods. Agreement was strong between the Densitometer and Spectralis at all central eccentricities (e.g. at 0.25° eccentricity: accuracy = 0.97, precision = 0.90, ccc = 0.87). Regression analysis showed a very weak relationship between the Visucam and Densitometer (e.g. Visucam max on Densitometer central MP: R(2) = 0.008, p = 0.843). Regression analysis also demonstrated a weak relationship between MP measured by the Spectralis and Visucam (e.g. Visucam max on Spectralis central MP: R(2) = 0.047, p = 0.348). MP values obtained using the Heidelberg Spectralis are comparable to MP values obtained using the Densitometer. In contrast, MP values obtained using the Zeiss Visucam are not comparable with either the Densitometer or the Spectralis MP measuring devices. Taking cHFP as the current standard to which other MP measuring devices should be compared, the Spectralis is suitable for use in a clinical and research setting, whereas the Visucam is not. Copyright © 2013 Elsevier Ltd. All rights reserved.
NASA Astrophysics Data System (ADS)
Sanchez Rivera, Yamil
The purpose of this study is to add to what we know about the affective domain and to create a valid instrument for future studies. The Motivation to Learn Science (MLS) Inventory is based on Krathwohl's Taxonomy of Affective Behaviors (Krathwohl et al., 1964). The results of the Exploratory Factor Analysis (EFA) and Confirmatory Factor Analysis (CFA) demonstrated that the MLS Inventory is a valid and reliable instrument. Therefore, the MLS Inventory is a uni-dimensional instrument composed of 9 items with convergent validity (no divergence). The instrument had a high Chronbach Alpha value of .898 during the EFA analysis and .919 with the CFA analysis. Factor loadings on the 9 items ranged from .617 to .800. Standardized regression weights ranged from .639 to .835 in the CFA analysis. Various indices (RMSEA = .033; NFI = .987; GFI = .985; CFI = 1.000) demonstrated a good fitness of the proposed model. Hierarchical linear modeling was used to statistical analyze data where students' motivation to learn science scores (level-1) were nested within teachers (level-2). The analysis was geared toward identifying if teachers' use of affective behavior (a level-2 classroom variable) was significantly related with students' MLS scores (level-1 criterion variable). Model testing proceeded in three phases: intercept-only model, means-as-outcome model, and a random-regression coefficient model. The intercept-only model revealed an intra-class correlation coefficient of .224 with an estimated reliability of .726. Therefore, data suggested that only 22.4% of the variance in MLS scores is between-classes and the remaining 77.6% is at the student-level. Due to the significant variance in MLS scores, X2(62.756, p<.0001), teachers' TAB scores were added as a level-2 predictor. The regression coefficient was non-significant (p>.05). Therefore, the teachers' self-reported use of affective behaviors was not a significant predictor of students' motivation to learn science.
Dental age assessment of young Iranian adults using third molars: A multivariate regression study.
Bagherpour, Ali; Anbiaee, Najmeh; Partovi, Parnia; Golestani, Shayan; Afzalinasab, Shakiba
2012-10-01
In recent years, a noticeable increase in forensic age estimations of living individuals has been observed. Radiologic assessment of the mineralisation stage of third molars is of particular importance, with regard to the relevant age group. To attain a referral database and regression equations for dental age estimation of unaccompanied minors in an Iranian population was the goal of this study. Moreover, determination was made concerning the probability of an individual being over the age of 18 in case of full third molar(s) development. Using the scoring system of Gleiser and Hunt, modified by Köhler, an investigation of a cross-sectional sample of 1274 orthopantomograms of 885 females and 389 males aged between 15 and 22 years was carried out. Using kappa statistics, intra-observer reliability was tested. With Spearman correlation coefficient, correlation between the scores of all four wisdom teeth, was evaluated. We also carried out the Wilcoxon signed-rank test on asymmetry and calculated the regression formulae. A strong intra-observer agreement was displayed by the kappa value. No significant difference (p-value for upper and lower jaws were 0.07 and 0.59, respectively) was discovered by Wilcoxon signed-rank test for left and right asymmetry. The developmental stage of upper right and upper left third molars yielded the greatest correlation coefficient. The probability of an individual being over the age of 18 is 95.6% for males and 100.0% for females in case four fully developed third molars are present. Taking into consideration gender, location and number of wisdom teeth, regression formulae were arrived at. Use of population-specific standards is recommended as a means of improving the accuracy of forensic age estimates based on third molars mineralisation. To obtain more exact regression formulae, wider age range studies are recommended. Copyright © 2012 Elsevier Ltd and Faculty of Forensic and Legal Medicine. All rights reserved.
Ristić-Medić, Danijela; Dullemeijer, Carla; Tepsić, Jasna; Petrović-Oggiano, Gordana; Popović, Tamara; Arsić, Aleksandra; Glibetić, Marija; Souverein, Olga W; Collings, Rachel; Cavelaars, Adriënne; de Groot, Lisette; van't Veer, Pieter; Gurinović, Mirjana
2014-03-01
The objective of this systematic review was to identify studies investigating iodine intake and biomarkers of iodine status, to assess the data of the selected studies, and to estimate dose-response relationships using meta-analysis. All randomized controlled trials, prospective cohort studies, nested case-control studies, and cross-sectional studies that supplied or measured dietary iodine and measured iodine biomarkers were included. The overall pooled regression coefficient (β) and the standard error of β were calculated by random-effects meta-analysis on a double-log scale, using the calculated intake-status regression coefficient (β) for each individual study. The results of pooled randomized controlled trials indicated that the doubling of dietary iodine intake increased urinary iodine concentrations by 14% in children and adolescents, by 57% in adults and the elderly, and by 81% in pregnant women. The dose-response relationship between iodine intake and biomarkers of iodine status indicated a 12% decrease in thyroid-stimulating hormone and a 31% decrease in thyroglobulin in pregnant women. The model of dose-response quantification used to describe the relationship between iodine intake and biomarkers of iodine status may be useful for providing complementary evidence to support recommendations for iodine intake in different population groups.
Kowalkowska, Joanna; Slowinska, Malgorzata A.; Slowinski, Dariusz; Dlugosz, Anna; Niedzwiedzka, Ewa; Wadolowska, Lidia
2013-01-01
The food frequency questionnaire (FFQ) and the food record (FR) are among the most common methods used in dietary research. It is important to know that is it possible to use both methods simultaneously in dietary assessment and prepare a single, comprehensive interpretation. The aim of this study was to compare the energy and nutritional value of diets, determined by the FFQ and by the three-day food records of young women. The study involved 84 female students aged 21–26 years (mean of 22.2 ± 0.8 years). Completing the FFQ was preceded by obtaining unweighted food records covering three consecutive days. Energy and nutritional value of diets was assessed for both methods (FFQ-crude, FR-crude). Data obtained for FFQ-crude were adjusted with beta-coefficient equaling 0.5915 (FFQ-adjusted) and regression analysis (FFQ-regressive). The FFQ-adjusted was calculated as FR-crude/FFQ-crude ratio of mean daily energy intake. FFQ-regressive was calculated for energy and each nutrient separately using regression equation, including FFQ-crude and FR-crude as covariates. For FR-crude and FFQ-crude the energy value of diets was standardized to 2000 kcal (FR-standardized, FFQ-standardized). Methods of statistical comparison included a dependent samples t-test, a chi-square test, and the Bland-Altman method. The mean energy intake in FFQ-crude was significantly higher than FR-crude (2740.5 kcal vs. 1621.0 kcal, respectively). For FR-standardized and FFQ-standardized, significance differences were found in the mean intake of 18 out of 31 nutrients, for FR-crude and FFQ-adjusted in 13 out of 31 nutrients and FR-crude and FFQ-regressive in 11 out of 31 nutrients. The Bland-Altman method showed an overestimation of energy and nutrient intake by FFQ-crude in comparison to FR-crude, e.g., total protein was overestimated by 34.7 g/day (95% Confidence Interval, CI: −29.6, 99.0 g/day) and fat by 48.6 g/day (95% CI: −36.4, 133.6 g/day). After regressive transformation of FFQ, the absolute difference between FFQ-regressive and FR-crude equaled 0.0 g/day and 95% CI were much better (e.g., for total protein 95% CI: −32.7, 32.7 g/day, for fat 95% CI: −49.6, 49.6 g/day). In conclusion, differences in nutritional value of diets resulted from overestimating energy intake by the FFQ in comparison to the three-day unweighted food records. Adjustment of energy and nutrient intake applied for the FFQ using various methods, particularly regression equations, significantly improved the agreement between results obtained by both methods and dietary assessment. To obtain the most accurate results in future studies using this FFQ, energy and nutrient intake should be adjusted by the regression equations presented in this paper. PMID:23877089
Innovating patient care delivery: DSRIP's interrupted time series analysis paradigm.
Shenoy, Amrita G; Begley, Charles E; Revere, Lee; Linder, Stephen H; Daiger, Stephen P
2017-12-08
Adoption of Medicaid Section 1115 waiver is one of the many ways of innovating healthcare delivery system. The Delivery System Reform Incentive Payment (DSRIP) pool, one of the two funding pools of the waiver has four categories viz. infrastructure development, program innovation and redesign, quality improvement reporting and lastly, bringing about population health improvement. A metric of the fourth category, preventable hospitalization (PH) rate was analyzed in the context of eight conditions for two time periods, pre-reporting years (2010-2012) and post-reporting years (2013-2015) for two hospital cohorts, DSRIP participating and non-participating hospitals. The study explains how DSRIP impacted Preventable Hospitalization (PH) rates of eight conditions for both hospital cohorts within two time periods. Eight PH rates were regressed as the dependent variable with time, intervention and post-DSRIP Intervention as independent variables. PH rates of eight conditions were then consolidated into one rate for regressing with the above independent variables to evaluate overall impact of DSRIP. An interrupted time series regression was performed after accounting for auto-correlation, stationarity and seasonality in the dataset. In the individual regression model, PH rates showed statistically significant coefficients for seven out of eight conditions in DSRIP participating hospitals. In the combined regression model, the coefficient of the PH rate showed a statistically significant decrease with negative p-values for regression coefficients in DSRIP participating hospitals compared to positive/increased p-values for regression coefficients in DSRIP non-participating hospitals. Several macro- and micro-level factors may have likely contributed DSRIP hospitals outperforming DSRIP non-participating hospitals. Healthcare organization/provider collaboration, support from healthcare professionals, DSRIP's design, state reimbursement and coordination in care delivery methods may have led to likely success of DSRIP. IV, a retrospective cohort study based on longitudinal data. Copyright © 2017 Elsevier Inc. All rights reserved.
Evaluation of different methods for determining growing degree-day thresholds in apricot cultivars
NASA Astrophysics Data System (ADS)
Ruml, Mirjana; Vuković, Ana; Milatović, Dragan
2010-07-01
The aim of this study was to examine different methods for determining growing degree-day (GDD) threshold temperatures for two phenological stages (full bloom and harvest) and select the optimal thresholds for a greater number of apricot ( Prunus armeniaca L.) cultivars grown in the Belgrade region. A 10-year data series were used to conduct the study. Several commonly used methods to determine the threshold temperatures from field observation were evaluated: (1) the least standard deviation in GDD; (2) the least standard deviation in days; (3) the least coefficient of variation in GDD; (4) regression coefficient; (5) the least standard deviation in days with a mean temperature above the threshold; (6) the least coefficient of variation in days with a mean temperature above the threshold; and (7) the smallest root mean square error between the observed and predicted number of days. In addition, two methods for calculating daily GDD, and two methods for calculating daily mean air temperatures were tested to emphasize the differences that can arise by different interpretations of basic GDD equation. The best agreement with observations was attained by method (7). The lower threshold temperature obtained by this method differed among cultivars from -5.6 to -1.7°C for full bloom, and from -0.5 to 6.6°C for harvest. However, the “Null” method (lower threshold set to 0°C) and “Fixed Value” method (lower threshold set to -2°C for full bloom and to 3°C for harvest) gave very good results. The limitations of the widely used method (1) and methods (5) and (6), which generally performed worst, are discussed in the paper.
The role of stress sensitization in progression of posttraumatic distress following deployment.
Smid, Geert E; Kleber, Rolf J; Rademaker, Arthur R; van Zuiden, Mirjam; Vermetten, Eric
2013-11-01
Military personnel exposed to combat are at risk for experiencing post-traumatic distress that can progress over time following deployment. We hypothesized that progression of post-traumatic distress may be related to enhanced susceptibility to post-deployment stressors. This study aimed at examining the concept of stress sensitization prospectively in a sample of Dutch military personnel deployed in support of the conflicts in Afghanistan. In a cohort of soldiers (N = 814), symptoms of post-traumatic stress disorder (PTSD) were assessed before deployment as well as 2, 7, 14, and 26 months (N = 433; 53 %) after their return. Data were analyzed using latent growth modeling. Using multiple group analysis, we examined whether high combat stress exposure during deployment moderated the relation between post-deployment stressors and linear change in post-traumatic distress after deployment. A higher baseline level of post-traumatic distress was associated with more early life stressors (standardized regression coefficient = 0.30, p < 0.001). In addition, a stronger increase in posttraumatic distress during deployment was associated with more deployment stressors (standardized coefficient = 0.21, p < 0.001). A steeper linear increase in posttraumatic distress post-deployment (from 2 to 26 months) was predicted by more post-deployment stressors (standardized coefficient = 0.29, p < 0.001) in high combat stress exposed soldiers, but not in a less combat stress exposed group. The group difference in the predictive effect of post-deployment stressors on progression of post-traumatic distress was significant (χ²(1) = 7.85, p = 0.005). Progression of post-traumatic distress following combat exposure may be related to sensitization to the effects of post-deployment stressors during the first year following return from deployment.
Thackeray, J F; Dykes, S
2016-02-01
Thackeray has previously explored the possibility of using a morphometric approach to quantify the "amount" of variation within species and to assess probabilities of conspecificity when two fossil specimens are compared, instead of "pigeon-holing" them into discrete species. In an attempt to obtain a statistical (probabilistic) definition of a species, Thackeray has recognized an approximation of a biological species constant (T=-1.61) based on the log-transformed standard error of the coefficient m (log sem) in regression analysis of cranial and other data from pairs of specimens of conspecific extant species, associated with regression equations of the form y=mx+c where m is the slope and c is the intercept, using measurements of any specimen A (x axis), and any specimen B of the same species (y axis). The log-transformed standard error of the co-efficient m (log sem) is a measure of the degree of similarity between pairs of specimens, and in this study shows central tendency around a mean value of -1.61 and standard deviation 0.10 for modern conspecific specimens. In this paper we focus attention on the need to take into account the range of difference in log sem values (Δlog sem or "delta log sem") obtained from comparisons when specimen A (x axis) is compared to B (y axis), and secondly when specimen A (y axis) is compared to B (x axis). Thackeray's approach can be refined to focus on high probabilities of conspecificity for pairs of specimens for which log sem is less than -1.61 and for which Δlog sem is less than 0.03. We appeal for the adoption of a concept here called "sigma taxonomy" (as opposed to "alpha taxonomy"), recognizing that boundaries between species are not always well defined. Copyright © 2015 Elsevier GmbH. All rights reserved.
NASA Astrophysics Data System (ADS)
Cambra-López, María; Winkel, Albert; Mosquera, Julio; Ogink, Nico W. M.; Aarnink, André J. A.
2015-06-01
The objective of this study was to compare co-located real-time light scattering devices and equivalent gravimetric samplers in poultry and pig houses for PM10 mass concentration, and to develop animal-specific calibration factors for light scattering samplers. These results will contribute to evaluate the comparability of different sampling instruments for PM10 concentrations. Paired DustTrak light scattering device (DustTrak aerosol monitor, TSI, U.S.) and PM10 gravimetric cyclone sampler were used for measuring PM10 mass concentrations during 24 h periods (from noon to noon) inside animal houses. Sampling was conducted in 32 animal houses in the Netherlands, including broilers, broiler breeders, layers in floor and in aviary system, turkeys, piglets, growing-finishing pigs in traditional and low emission housing with dry and liquid feed, and sows in individual and group housing. A total of 119 pairs of 24 h measurements (55 for poultry and 64 for pigs) were recorded and analyzed using linear regression analysis. Deviations between samplers were calculated and discussed. In poultry, cyclone sampler and DustTrak data fitted well to a linear regression, with a regression coefficient equal to 0.41, an intercept of 0.16 mg m-3 and a correlation coefficient of 0.91 (excluding turkeys). Results in turkeys showed a regression coefficient equal to 1.1 (P = 0.49), an intercept of 0.06 mg m-3 (P < 0.0001) and a correlation coefficient of 0.98. In pigs, we found a regression coefficient equal to 0.61, an intercept of 0.05 mg m-3 and a correlation coefficient of 0.84. Measured PM10 concentrations using DustTraks were clearly underestimated (approx. by a factor 2) in both poultry and pig housing systems compared with cyclone pre-separators. Absolute, relative, and random deviations increased with concentration. DustTrak light scattering devices should be self-calibrated to investigate PM10 mass concentrations accurately in animal houses. We recommend linear regression equations as animal-specific calibration factors for DustTraks instead of manufacturer calibration factors, especially in heavily dusty environments such as animal houses.
Forster, Jeri E.; MaWhinney, Samantha; Ball, Erika L.; Fairclough, Diane
2011-01-01
Dropout is common in longitudinal clinical trials and when the probability of dropout depends on unobserved outcomes even after conditioning on available data, it is considered missing not at random and therefore nonignorable. To address this problem, mixture models can be used to account for the relationship between a longitudinal outcome and dropout. We propose a Natural Spline Varying-coefficient mixture model (NSV), which is a straightforward extension of the parametric Conditional Linear Model (CLM). We assume that the outcome follows a varying-coefficient model conditional on a continuous dropout distribution. Natural cubic B-splines are used to allow the regression coefficients to semiparametrically depend on dropout and inference is therefore more robust. Additionally, this method is computationally stable and relatively simple to implement. We conduct simulation studies to evaluate performance and compare methodologies in settings where the longitudinal trajectories are linear and dropout time is observed for all individuals. Performance is assessed under conditions where model assumptions are both met and violated. In addition, we compare the NSV to the CLM and a standard random-effects model using an HIV/AIDS clinical trial with probable nonignorable dropout. The simulation studies suggest that the NSV is an improvement over the CLM when dropout has a nonlinear dependence on the outcome. PMID:22101223
Liu, Cong; Kolarik, Barbara; Gunnarsen, Lars; Zhang, Yinping
2015-10-20
Polychlorinated biphenyls (PCBs) have been found to be persistent in the environment and possibly harmful. Many buildings are characterized with high PCB concentrations. Knowledge about partitioning between primary sources and building materials is critical for exposure assessment and practical remediation of PCB contamination. This study develops a C-depth method to determine diffusion coefficient (D) and partition coefficient (K), two key parameters governing the partitioning process. For concrete, a primary material studied here, relative standard deviations of results among five data sets are 5%-22% for K and 42-66% for D. Compared with existing methods, C-depth method overcomes the inability to obtain unique estimation for nonlinear regression and does not require assumed correlations for D and K among congeners. Comparison with a more sophisticated two-term approach implies significant uncertainty for D, and smaller uncertainty for K. However, considering uncertainties associated with sampling and chemical analysis, and impact of environmental factors, the results are acceptable for engineering applications. This was supported by good agreement between model prediction and measurement. Sensitivity analysis indicated that effective diffusion distance, contacting time of materials with primary sources, and depth of measured concentrations are critical for determining D, and PCB concentration in primary sources is critical for K.
Higher-order Multivariable Polynomial Regression to Estimate Human Affective States
NASA Astrophysics Data System (ADS)
Wei, Jie; Chen, Tong; Liu, Guangyuan; Yang, Jiemin
2016-03-01
From direct observations, facial, vocal, gestural, physiological, and central nervous signals, estimating human affective states through computational models such as multivariate linear-regression analysis, support vector regression, and artificial neural network, have been proposed in the past decade. In these models, linear models are generally lack of precision because of ignoring intrinsic nonlinearities of complex psychophysiological processes; and nonlinear models commonly adopt complicated algorithms. To improve accuracy and simplify model, we introduce a new computational modeling method named as higher-order multivariable polynomial regression to estimate human affective states. The study employs standardized pictures in the International Affective Picture System to induce thirty subjects’ affective states, and obtains pure affective patterns of skin conductance as input variables to the higher-order multivariable polynomial model for predicting affective valence and arousal. Experimental results show that our method is able to obtain efficient correlation coefficients of 0.98 and 0.96 for estimation of affective valence and arousal, respectively. Moreover, the method may provide certain indirect evidences that valence and arousal have their brain’s motivational circuit origins. Thus, the proposed method can serve as a novel one for efficiently estimating human affective states.
Spatial Autocorrelation Approaches to Testing Residuals from Least Squares Regression
Chen, Yanguang
2016-01-01
In geo-statistics, the Durbin-Watson test is frequently employed to detect the presence of residual serial correlation from least squares regression analyses. However, the Durbin-Watson statistic is only suitable for ordered time or spatial series. If the variables comprise cross-sectional data coming from spatial random sampling, the test will be ineffectual because the value of Durbin-Watson’s statistic depends on the sequence of data points. This paper develops two new statistics for testing serial correlation of residuals from least squares regression based on spatial samples. By analogy with the new form of Moran’s index, an autocorrelation coefficient is defined with a standardized residual vector and a normalized spatial weight matrix. Then by analogy with the Durbin-Watson statistic, two types of new serial correlation indices are constructed. As a case study, the two newly presented statistics are applied to a spatial sample of 29 China’s regions. These results show that the new spatial autocorrelation models can be used to test the serial correlation of residuals from regression analysis. In practice, the new statistics can make up for the deficiencies of the Durbin-Watson test. PMID:26800271
Saunders, Christina T; Blume, Jeffrey D
2017-10-26
Mediation analysis explores the degree to which an exposure's effect on an outcome is diverted through a mediating variable. We describe a classical regression framework for conducting mediation analyses in which estimates of causal mediation effects and their variance are obtained from the fit of a single regression model. The vector of changes in exposure pathway coefficients, which we named the essential mediation components (EMCs), is used to estimate standard causal mediation effects. Because these effects are often simple functions of the EMCs, an analytical expression for their model-based variance follows directly. Given this formula, it is instructive to revisit the performance of routinely used variance approximations (e.g., delta method and resampling methods). Requiring the fit of only one model reduces the computation time required for complex mediation analyses and permits the use of a rich suite of regression tools that are not easily implemented on a system of three equations, as would be required in the Baron-Kenny framework. Using data from the BRAIN-ICU study, we provide examples to illustrate the advantages of this framework and compare it with the existing approaches. © The Author 2017. Published by Oxford University Press.
Higher-order Multivariable Polynomial Regression to Estimate Human Affective States
Wei, Jie; Chen, Tong; Liu, Guangyuan; Yang, Jiemin
2016-01-01
From direct observations, facial, vocal, gestural, physiological, and central nervous signals, estimating human affective states through computational models such as multivariate linear-regression analysis, support vector regression, and artificial neural network, have been proposed in the past decade. In these models, linear models are generally lack of precision because of ignoring intrinsic nonlinearities of complex psychophysiological processes; and nonlinear models commonly adopt complicated algorithms. To improve accuracy and simplify model, we introduce a new computational modeling method named as higher-order multivariable polynomial regression to estimate human affective states. The study employs standardized pictures in the International Affective Picture System to induce thirty subjects’ affective states, and obtains pure affective patterns of skin conductance as input variables to the higher-order multivariable polynomial model for predicting affective valence and arousal. Experimental results show that our method is able to obtain efficient correlation coefficients of 0.98 and 0.96 for estimation of affective valence and arousal, respectively. Moreover, the method may provide certain indirect evidences that valence and arousal have their brain’s motivational circuit origins. Thus, the proposed method can serve as a novel one for efficiently estimating human affective states. PMID:26996254
Comparison of different functional EIT approaches to quantify tidal ventilation distribution.
Zhao, Zhanqi; Yun, Po-Jen; Kuo, Yen-Liang; Fu, Feng; Dai, Meng; Frerichs, Inez; Möller, Knut
2018-01-30
The aim of the study was to examine the pros and cons of different types of functional EIT (fEIT) to quantify tidal ventilation distribution in a clinical setting. fEIT images were calculated with (1) standard deviation of pixel time curve, (2) regression coefficients of global and local impedance time curves, or (3) mean tidal variations. To characterize temporal heterogeneity of tidal ventilation distribution, another fEIT image of pixel inspiration times is also proposed. fEIT-regression is very robust to signals with different phase information. When the respiratory signal should be distinguished from the heart-beat related signal, or during high-frequency oscillatory ventilation, fEIT-regression is superior to other types. fEIT-tidal variation is the most stable image type regarding the baseline shift. We recommend using this type of fEIT image for preliminary evaluation of the acquired EIT data. However, all these fEITs would be misleading in their assessment of ventilation distribution in the presence of temporal heterogeneity. The analysis software provided by the currently available commercial EIT equipment only offers either fEIT of standard deviation or tidal variation. Considering the pros and cons of each fEIT type, we recommend embedding more types into the analysis software to allow the physicians dealing with more complex clinical applications with on-line EIT measurements.
Solid harmonic wavelet scattering for predictions of molecule properties
NASA Astrophysics Data System (ADS)
Eickenberg, Michael; Exarchakis, Georgios; Hirn, Matthew; Mallat, Stéphane; Thiry, Louis
2018-06-01
We present a machine learning algorithm for the prediction of molecule properties inspired by ideas from density functional theory (DFT). Using Gaussian-type orbital functions, we create surrogate electronic densities of the molecule from which we compute invariant "solid harmonic scattering coefficients" that account for different types of interactions at different scales. Multilinear regressions of various physical properties of molecules are computed from these invariant coefficients. Numerical experiments show that these regressions have near state-of-the-art performance, even with relatively few training examples. Predictions over small sets of scattering coefficients can reach a DFT precision while being interpretable.
Nguyen, Quynh C.; Osypuk, Theresa L.; Schmidt, Nicole M.; Glymour, M. Maria; Tchetgen Tchetgen, Eric J.
2015-01-01
Despite the recent flourishing of mediation analysis techniques, many modern approaches are difficult to implement or applicable to only a restricted range of regression models. This report provides practical guidance for implementing a new technique utilizing inverse odds ratio weighting (IORW) to estimate natural direct and indirect effects for mediation analyses. IORW takes advantage of the odds ratio's invariance property and condenses information on the odds ratio for the relationship between the exposure (treatment) and multiple mediators, conditional on covariates, by regressing exposure on mediators and covariates. The inverse of the covariate-adjusted exposure-mediator odds ratio association is used to weight the primary analytical regression of the outcome on treatment. The treatment coefficient in such a weighted regression estimates the natural direct effect of treatment on the outcome, and indirect effects are identified by subtracting direct effects from total effects. Weighting renders treatment and mediators independent, thereby deactivating indirect pathways of the mediators. This new mediation technique accommodates multiple discrete or continuous mediators. IORW is easily implemented and is appropriate for any standard regression model, including quantile regression and survival analysis. An empirical example is given using data from the Moving to Opportunity (1994–2002) experiment, testing whether neighborhood context mediated the effects of a housing voucher program on obesity. Relevant Stata code (StataCorp LP, College Station, Texas) is provided. PMID:25693776
Effects of mining-associated lead and zinc soil contamination on native floristic quality.
Struckhoff, Matthew A; Stroh, Esther D; Grabner, Keith W
2013-04-15
We assessed the quality of plant communities across a range of lead (Pb) and zinc (Zn) soil concentrations at a variety of sites associated with Pb mining in southeast Missouri, USA. In a novel application, two standard floristic quality measures, Mean Coefficient of Conservatism (Mean C) and Floristic Quality Index (FQI), were examined in relation to concentrations of Pb and Zn, soil nutrients, and other soil characteristics. Nonmetric Multidimensional Scaling and Regression Tree Analyses identified soil Pb and Zn concentrations as primary explanatory variables for plant community composition and indicated negative relationships between soil metals concentrations and both Mean C and FQI. Univariate regression also demonstrated significant negative relationships between metals concentrations and floristic quality. The negative effects of metals in native soils with otherwise relatively undisturbed conditions indicate that elevated soil metals concentrations adversely affect native floristic quality where no other human disturbance is evident. Published by Elsevier Ltd.
Prediction of heat capacities of solid inorganic salts from group contributions
DOE Office of Scientific and Technical Information (OSTI.GOV)
Mostafa, A.T.M.G.; Eakman, J.M.; Yarbro, S.L.
1997-01-01
A group contribution technique is proposed to predict the coefficients in the heat capacity correlation, C{sub p} = a + bT + c/T{sup 2} + dT{sup 2}, for solid inorganic salts. The results from this work are compared with fits to experimental data from the literature. It is shown to give good predictions for both simple and complex solid inorganic salts. Literature heat capacities for a large number (664) of solid inorganic salts covering a broad range of cations (129), anions (17) and ligands (2) have been used in regressions to obtain group contributions for the parameters in the heatmore » capacity temperature function. A mean error of 3.18% is found when predicted values are compared with literature values for heat capacity at 298{degrees} K. Estimates of the error standard deviation from the regression for each additivity constant are also determined.« less
NASA Technical Reports Server (NTRS)
Johnson, R. W.; Bahn, G. S.
1977-01-01
Statistical analysis techniques were applied to develop quantitative relationships between in situ river measurements and the remotely sensed data that were obtained over the James River in Virginia on 28 May 1974. The remotely sensed data were collected with a multispectral scanner and with photographs taken from an aircraft platform. Concentration differences among water quality parameters such as suspended sediment, chlorophyll a, and nutrients indicated significant spectral variations. Calibrated equations from the multiple regression analysis were used to develop maps that indicated the quantitative distributions of water quality parameters and the dispersion characteristics of a pollutant plume entering the turbid river system. Results from further analyses that use only three preselected multispectral scanner bands of data indicated that regression coefficients and standard errors of estimate were not appreciably degraded compared with results from the 10-band analysis.
NASA Astrophysics Data System (ADS)
Yong, Yan Ling; Tan, Li Kuo; McLaughlin, Robert A.; Chee, Kok Han; Liew, Yih Miin
2017-12-01
Intravascular optical coherence tomography (OCT) is an optical imaging modality commonly used in the assessment of coronary artery diseases during percutaneous coronary intervention. Manual segmentation to assess luminal stenosis from OCT pullback scans is challenging and time consuming. We propose a linear-regression convolutional neural network to automatically perform vessel lumen segmentation, parameterized in terms of radial distances from the catheter centroid in polar space. Benchmarked against gold-standard manual segmentation, our proposed algorithm achieves average locational accuracy of the vessel wall of 22 microns, and 0.985 and 0.970 in Dice coefficient and Jaccard similarity index, respectively. The average absolute error of luminal area estimation is 1.38%. The processing rate is 40.6 ms per image, suggesting the potential to be incorporated into a clinical workflow and to provide quantitative assessment of vessel lumen in an intraoperative time frame.
Effects of mining-associated lead and zinc soil contamination on native floristic quality
Struckhoff, Matthew A.; Stroh, Esther D.; Grabner, Keith W.
2013-01-01
We assessed the quality of plant communities across a range of lead (Pb) and zinc (Zn) soil concentrations at a variety of sites associated with Pb mining in southeast Missouri, USA. In a novel application, two standard floristic quality measures, Mean Coefficient of Conservatism (Mean C) and Floristic Quality Index (FQI), were examined in relation to concentrations of Pb and Zn, soil nutrients, and other soil characteristics. Nonmetric Multidimensional Scaling and Regression Tree Analyses identified soil Pb and Zn concentrations as primary explanatory variables for plant community composition and indicated negative relationships between soil metals concentrations and both Mean C and FQI. Univariate regression also demonstrated significant negative relationships between metals concentrations and floristic quality. The negative effects of metals in native soils with otherwise relatively undisturbed conditions indicate that elevated soil metals concentrations adversely affect native floristic quality where no other human disturbance is evident.
Mancia, G; Ferrari, A; Gregorini, L; Parati, G; Pomidossi, G; Bertinieri, G; Grassi, G; Zanchetti, A
1980-12-01
1. Intra-arterial blood pressure and heart rate were recorded for 24 h in ambulant hospitalized patients of variable age who had normal blood pressure or essential hypertension. Mean 24 h values, standard deviations and variation coefficient were obtained as the averages of values separately analysed for 48 consecutive half-hour periods. 2. In older subjects standard deviation and variation coefficient for mean arterial pressure were greater than in younger subjects with similar pressure values, whereas standard deviation and variation coefficient for mean arterial pressure were greater than in younger subjects with similar pressure values, whereas standard deviation aations and variation coefficient were obtained as the averages of values separately analysed for 48 consecurive half-hour periods. 2. In older subjects standard deviation and variation coefficient for mean arterial pressure were greater than in younger subjects with similar pressure values, whereas standard deviation and variation coefficient for heart rate were smaller. 3. In hypertensive subjects standard deviation for mean arterial pressure was greater than in normotensive subjects of similar ages, but this was not the case for variation coefficient, which was slightly smaller in the former than in the latter group. Normotensive and hypertensive subjects showed no difference in standard deviation and variation coefficient for heart rate. 4. In both normotensive and hypertensive subjects standard deviation and even more so variation coefficient were slightly or not related to arterial baroreflex sensitivity as measured by various methods (phenylephrine, neck suction etc.). 5. It is concluded that blood pressure variability increases and heart rate variability decreases with age, but that changes in variability are not so obvious in hypertension. Also, differences in variability among subjects are only marginally explained by differences in baroreflex function.
Rice, Stephen B; Chan, Christopher; Brown, Scott C; Eschbach, Peter; Han, Li; Ensor, David S; Stefaniak, Aleksandr B; Bonevich, John; Vladár, András E; Hight Walker, Angela R; Zheng, Jiwen; Starnes, Catherine; Stromberg, Arnold; Ye, Jia; Grulke, Eric A
2015-01-01
This paper reports an interlaboratory comparison that evaluated a protocol for measuring and analysing the particle size distribution of discrete, metallic, spheroidal nanoparticles using transmission electron microscopy (TEM). The study was focused on automated image capture and automated particle analysis. NIST RM8012 gold nanoparticles (30 nm nominal diameter) were measured for area-equivalent diameter distributions by eight laboratories. Statistical analysis was used to (1) assess the data quality without using size distribution reference models, (2) determine reference model parameters for different size distribution reference models and non-linear regression fitting methods and (3) assess the measurement uncertainty of a size distribution parameter by using its coefficient of variation. The interlaboratory area-equivalent diameter mean, 27.6 nm ± 2.4 nm (computed based on a normal distribution), was quite similar to the area-equivalent diameter, 27.6 nm, assigned to NIST RM8012. The lognormal reference model was the preferred choice for these particle size distributions as, for all laboratories, its parameters had lower relative standard errors (RSEs) than the other size distribution reference models tested (normal, Weibull and Rosin–Rammler–Bennett). The RSEs for the fitted standard deviations were two orders of magnitude higher than those for the fitted means, suggesting that most of the parameter estimate errors were associated with estimating the breadth of the distributions. The coefficients of variation for the interlaboratory statistics also confirmed the lognormal reference model as the preferred choice. From quasi-linear plots, the typical range for good fits between the model and cumulative number-based distributions was 1.9 fitted standard deviations less than the mean to 2.3 fitted standard deviations above the mean. Automated image capture, automated particle analysis and statistical evaluation of the data and fitting coefficients provide a framework for assessing nanoparticle size distributions using TEM for image acquisition. PMID:26361398
Busch, Robyn M.; Lineweaver, Tara T.; Ferguson, Lisa; Haut, Jennifer S.
2015-01-01
Reliable change index scores (RCIs) and standardized regression-based change score norms (SRBs) permit evaluation of meaningful changes in test scores following treatment interventions, like epilepsy surgery, while accounting for test-retest reliability, practice effects, score fluctuations due to error, and relevant clinical and demographic factors. Although these methods are frequently used to assess cognitive change after epilepsy surgery in adults, they have not been widely applied to examine cognitive change in children with epilepsy. The goal of the current study was to develop RCIs and SRBs for use in children with epilepsy. Sixty-three children with epilepsy (age range 6–16; M=10.19, SD=2.58) underwent comprehensive neuropsychological evaluations at two time points an average of 12 months apart. Practice adjusted RCIs and SRBs were calculated for all cognitive measures in the battery. Practice effects were quite variable across the neuropsychological measures, with the greatest differences observed among older children, particularly on the Children’s Memory Scale and Wisconsin Card Sorting Test. There was also notable variability in test-retest reliabilities across measures in the battery, with coefficients ranging from 0.14 to 0.92. RCIs and SRBs for use in assessing meaningful cognitive change in children following epilepsy surgery are provided for measures with reliability coefficients above 0.50. This is the first study to provide RCIs and SRBs for a comprehensive neuropsychological battery based on a large sample of children with epilepsy. Tables to aid in evaluating cognitive changes in children who have undergone epilepsy surgery are provided for clinical use. An excel sheet to perform all relevant calculations is also available to interested clinicians or researchers. PMID:26043163
Tanaka, Haruka; Ogata, Soshiro; Omura, Kayoko; Honda, Chika; Kamide, Kei; Hayakawa, Kazuo
2016-03-01
The aim of this study was to investigate the association between subjective memory complaints (SMCs) and depressive symptoms, with and without adjustment for genetic and family environmental factors. We conducted a cross-sectional study using twins and measured SMCs and depressive symptoms as outcomes and explanatory variables, respectively. First, we performed regression analyses using generalized estimating equations to investigate the associations between SMCs and depressive symptoms without adjustment for genetic and family environmental factors (individual-level analyses). We then performed regression analyses for within-pair differences using monozygotic (MZ) and dizygotic (DZ) twin pairs and MZ twin pairs to investigate these associations with adjustment for genetic and family environmental factors by subtracting the values of one twin from those of co-twin variables (within-pair level analyses). Therefore, differences between the associations at individual- and within-pair level analyses suggested confounding by genetic factors. We included 556 twins aged ≥ 20 years. In the individual-level analyses, SMCs were significantly associated with depressive symptoms in both males and females [standardized coefficients: males, 0.23 (95% CI 0.08-0.38); females, 0.35 (95% CI 0.23-0.46)]. In the within-pair level analyses using MZ and same-sex DZ twin pairs, SMCs were significantly associated with depressive symptoms. In the within-pair level analyses using the MZ twin pairs, SMCs were significantly associated with depressive symptoms [standardized coefficients: males, 0.32 (95% CI 0.08-0.56); females, 0.24 (95% CI 0.13-0.42)]. This study suggested that SMCs were significantly associated with depressive symptoms after adjustment for genetic and family environmental factors.
Relationship between self-esteem and living conditions among stroke survivors at home.
Shida, Junko; Sugawara, Kyoko; Goto, Junko; Sekito, Yoshiko
2014-10-01
To clarify the relationship between self-esteem of stroke survivors at home and their living conditions. Study participants were stroke survivors who lived at home and commuted to one of two medical facilities in the Tohoku region of Japan. Stroke survivors were recruited for the present study when they came to the hospital for a routine visit. The researcher or research assistant explained the study objective and methods to the stroke survivor, and the questionnaire survey was conducted. Survey contents included the Japanese version of the Rosenberg Self-Esteem Scale (RSE) and questions designed to assess living conditions. A total of 65 participants with complete RSE data were included in the analysis. The mean (standard deviation) age of participants was 70.9 years (± 11.1), with a mean RSE score of 32.12 (± 8.32). Only a minor decrease in participant self-esteem was observed, even after having experienced a stroke. Factors associated with self-esteem, including "independent bathing" (standardized partial regression coefficient, β = 0.405, P < 0.001), "being needed by family members" (β = 0.389, P < 0.001), "independent grooming" (β = 0.292, P = 0.009), and "sleep satisfaction" (β = 0.237, P = 0.017), were analyzed by stepwise multiple regression analysis. The multiple correlation coefficient adjusted for the degrees of freedom was 0.738 (P < 0.001). Our analysis revealed that the maintenance of activities of daily living, and the presence of a suitable environment that enhances physical function recovery and promotes activity and participation, are necessary to improve self-esteem in stroke survivors living at home. © 2013 The Authors. Japan Journal of Nursing Science © 2013 Japan Academy of Nursing Science.
Risk factors for polyuria in a cross-section of community psychiatric lithium-treated patients.
Kinahan, James Conor; NiChorcorain, Aoife; Cunningham, Sean; Freyne, Aideen; Cooney, Colm; Barry, Siobhan; Kelly, Brendan D
2015-02-01
Polyuria increases the risk of dehydration and lithium toxicity in lithium-treated patients. Risk factors have been inconsistently described and the variance of this adverse effect remains poorly understood. This study aimed to establish independent risk factors for polyuria in a community, secondary-level lithium-treated sample of patients. This was a cross-sectional study of the lithium-treated patients attending a general adult and an old age psychiatry service. Participants completed a 24-hour urine collection. Urine volume and the presence of polyuria were the outcomes of interest. The relationship between outcome and the participant's demographic and clinical characteristics was explored with univariable and multivariable analysis. A total of 122 participants were included in the analysis, with 38% being diagnosed with polyuria. Female gender and increased body weight independently predicted the presence of polyuria (standardized regression coefficient 1.01 and 0.94, respectively; p = 0.002 and p = 0.003, respectively). Female gender and increased body weight, lithium dose, and duration of lithium treatment independently predicted higher 24-hour urine volumes (standardized regression coefficients 0.693, p < 0.0005; 0.791, p < 0.0005; 0.276, p = 0.043; 0.181, p = 0.034, respectively). Of three different weight metrics, lean body weight was the most predictive. Female gender and increased body weight explain part of the variance of this adverse effect. Both risk factors offer fresh insights into the pathophysiology of this potentially reversible and dangerous adverse effect of lithium treatment. Future research should focus on understanding the differences between the genders and between different body compositions in terms of lithium pharmacokinetics and pharmacodynamics. © 2014 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Busch, Robyn M; Lineweaver, Tara T; Ferguson, Lisa; Haut, Jennifer S
2015-06-01
Reliable change indices (RCIs) and standardized regression-based (SRB) change score norms permit evaluation of meaningful changes in test scores following treatment interventions, like epilepsy surgery, while accounting for test-retest reliability, practice effects, score fluctuations due to error, and relevant clinical and demographic factors. Although these methods are frequently used to assess cognitive change after epilepsy surgery in adults, they have not been widely applied to examine cognitive change in children with epilepsy. The goal of the current study was to develop RCIs and SRB change score norms for use in children with epilepsy. Sixty-three children with epilepsy (age range: 6-16; M=10.19, SD=2.58) underwent comprehensive neuropsychological evaluations at two time points an average of 12 months apart. Practice effect-adjusted RCIs and SRB change score norms were calculated for all cognitive measures in the battery. Practice effects were quite variable across the neuropsychological measures, with the greatest differences observed among older children, particularly on the Children's Memory Scale and Wisconsin Card Sorting Test. There was also notable variability in test-retest reliabilities across measures in the battery, with coefficients ranging from 0.14 to 0.92. Reliable change indices and SRB change score norms for use in assessing meaningful cognitive change in children following epilepsy surgery are provided for measures with reliability coefficients above 0.50. This is the first study to provide RCIs and SRB change score norms for a comprehensive neuropsychological battery based on a large sample of children with epilepsy. Tables to aid in evaluating cognitive changes in children who have undergone epilepsy surgery are provided for clinical use. An Excel sheet to perform all relevant calculations is also available to interested clinicians or researchers. Copyright © 2015 Elsevier Inc. All rights reserved.
Galloway, Joel M.
2014-01-01
The Red River of the North (hereafter referred to as “Red River”) Basin is an important hydrologic region where water is a valuable resource for the region’s economy. Continuous water-quality monitors have been operated by the U.S. Geological Survey, in cooperation with the North Dakota Department of Health, Minnesota Pollution Control Agency, City of Fargo, City of Moorhead, City of Grand Forks, and City of East Grand Forks at the Red River at Fargo, North Dakota, from 2003 through 2012 and at Grand Forks, N.Dak., from 2007 through 2012. The purpose of the monitoring was to provide a better understanding of the water-quality dynamics of the Red River and provide a way to track changes in water quality. Regression equations were developed that can be used to estimate concentrations and loads for dissolved solids, sulfate, chloride, nitrate plus nitrite, total phosphorus, and suspended sediment using explanatory variables such as streamflow, specific conductance, and turbidity. Specific conductance was determined to be a significant explanatory variable for estimating dissolved solids concentrations at the Red River at Fargo and Grand Forks. The regression equations provided good relations between dissolved solid concentrations and specific conductance for the Red River at Fargo and at Grand Forks, with adjusted coefficients of determination of 0.99 and 0.98, respectively. Specific conductance, log-transformed streamflow, and a seasonal component were statistically significant explanatory variables for estimating sulfate in the Red River at Fargo and Grand Forks. Regression equations provided good relations between sulfate concentrations and the explanatory variables, with adjusted coefficients of determination of 0.94 and 0.89, respectively. For the Red River at Fargo and Grand Forks, specific conductance, streamflow, and a seasonal component were statistically significant explanatory variables for estimating chloride. For the Red River at Grand Forks, a time component also was a statistically significant explanatory variable for estimating chloride. The regression equations for chloride at the Red River at Fargo provided a fair relation between chloride concentrations and the explanatory variables, with an adjusted coefficient of determination of 0.66 and the equation for the Red River at Grand Forks provided a relatively good relation between chloride concentrations and the explanatory variables, with an adjusted coefficient of determination of 0.77. Turbidity and streamflow were statistically significant explanatory variables for estimating nitrate plus nitrite concentrations at the Red River at Fargo and turbidity was the only statistically significant explanatory variable for estimating nitrate plus nitrite concentrations at Grand Forks. The regression equation for the Red River at Fargo provided a relatively poor relation between nitrate plus nitrite concentrations, turbidity, and streamflow, with an adjusted coefficient of determination of 0.46. The regression equation for the Red River at Grand Forks provided a fair relation between nitrate plus nitrite concentrations and turbidity, with an adjusted coefficient of determination of 0.73. Some of the variability that was not explained by the equations might be attributed to different sources contributing nitrates to the stream at different times. Turbidity, streamflow, and a seasonal component were statistically significant explanatory variables for estimating total phosphorus at the Red River at Fargo and Grand Forks. The regression equation for the Red River at Fargo provided a relatively fair relation between total phosphorus concentrations, turbidity, streamflow, and season, with an adjusted coefficient of determination of 0.74. The regression equation for the Red River at Grand Forks provided a good relation between total phosphorus concentrations, turbidity, streamflow, and season, with an adjusted coefficient of determination of 0.87. For the Red River at Fargo, turbidity and streamflow were statistically significant explanatory variables for estimating suspended-sediment concentrations. For the Red River at Grand Forks, turbidity was the only statistically significant explanatory variable for estimating suspended-sediment concentration. The regression equation at the Red River at Fargo provided a good relation between suspended-sediment concentration, turbidity, and streamflow, with an adjusted coefficient of determination of 0.95. The regression equation for the Red River at Grand Forks provided a good relation between suspended-sediment concentration and turbidity, with an adjusted coefficient of determination of 0.96.
Deep learning for biomarker regression: application to osteoporosis and emphysema on chest CT scans
NASA Astrophysics Data System (ADS)
González, Germán.; Washko, George R.; San José Estépar, Raúl
2018-03-01
Introduction: Biomarker computation using deep-learning often relies on a two-step process, where the deep learning algorithm segments the region of interest and then the biomarker is measured. We propose an alternative paradigm, where the biomarker is estimated directly using a regression network. We showcase this image-tobiomarker paradigm using two biomarkers: the estimation of bone mineral density (BMD) and the estimation of lung percentage of emphysema from CT scans. Materials and methods: We use a large database of 9,925 CT scans to train, validate and test the network for which reference standard BMD and percentage emphysema have been already computed. First, the 3D dataset is reduced to a set of canonical 2D slices where the organ of interest is visible (either spine for BMD or lungs for emphysema). This data reduction is performed using an automatic object detector. Second, The regression neural network is composed of three convolutional layers, followed by a fully connected and an output layer. The network is optimized using a momentum optimizer with an exponential decay rate, using the root mean squared error as cost function. Results: The Pearson correlation coefficients obtained against the reference standards are r = 0.940 (p < 0.00001) and r = 0.976 (p < 0.00001) for BMD and percentage emphysema respectively. Conclusions: The deep-learning regression architecture can learn biomarkers from images directly, without indicating the structures of interest. This approach simplifies the development of biomarker extraction algorithms. The proposed data reduction based on object detectors conveys enough information to compute the biomarkers of interest.
Bao, Jie; Hou, Zhangshuan; Huang, Maoyi; ...
2015-12-04
Here, effective sensitivity analysis approaches are needed to identify important parameters or factors and their uncertainties in complex Earth system models composed of multi-phase multi-component phenomena and multiple biogeophysical-biogeochemical processes. In this study, the impacts of 10 hydrologic parameters in the Community Land Model on simulations of runoff and latent heat flux are evaluated using data from a watershed. Different metrics, including residual statistics, the Nash-Sutcliffe coefficient, and log mean square error, are used as alternative measures of the deviations between the simulated and field observed values. Four sensitivity analysis (SA) approaches, including analysis of variance based on the generalizedmore » linear model, generalized cross validation based on the multivariate adaptive regression splines model, standardized regression coefficients based on a linear regression model, and analysis of variance based on support vector machine, are investigated. Results suggest that these approaches show consistent measurement of the impacts of major hydrologic parameters on response variables, but with differences in the relative contributions, particularly for the secondary parameters. The convergence behaviors of the SA with respect to the number of sampling points are also examined with different combinations of input parameter sets and output response variables and their alternative metrics. This study helps identify the optimal SA approach, provides guidance for the calibration of the Community Land Model parameters to improve the model simulations of land surface fluxes, and approximates the magnitudes to be adjusted in the parameter values during parametric model optimization.« less
NASA Astrophysics Data System (ADS)
Yoshida, Kenichiro; Nishidate, Izumi; Ojima, Nobutoshi; Iwata, Kayoko
2014-01-01
To quantitatively evaluate skin chromophores over a wide region of curved skin surface, we propose an approach that suppresses the effect of the shading-derived error in the reflectance on the estimation of chromophore concentrations, without sacrificing the accuracy of that estimation. In our method, we use multiple regression analysis, assuming the absorbance spectrum as the response variable and the extinction coefficients of melanin, oxygenated hemoglobin, and deoxygenated hemoglobin as the predictor variables. The concentrations of melanin and total hemoglobin are determined from the multiple regression coefficients using compensation formulae (CF) based on the diffuse reflectance spectra derived from a Monte Carlo simulation. To suppress the shading-derived error, we investigated three different combinations of multiple regression coefficients for the CF. In vivo measurements with the forearm skin demonstrated that the proposed approach can reduce the estimation errors that are due to shading-derived errors in the reflectance. With the best combination of multiple regression coefficients, we estimated that the ratio of the error to the chromophore concentrations is about 10%. The proposed method does not require any measurements or assumptions about the shape of the subjects; this is an advantage over other studies related to the reduction of shading-derived errors.
Correlation and simple linear regression.
Zou, Kelly H; Tuncali, Kemal; Silverman, Stuart G
2003-06-01
In this tutorial article, the concepts of correlation and regression are reviewed and demonstrated. The authors review and compare two correlation coefficients, the Pearson correlation coefficient and the Spearman rho, for measuring linear and nonlinear relationships between two continuous variables. In the case of measuring the linear relationship between a predictor and an outcome variable, simple linear regression analysis is conducted. These statistical concepts are illustrated by using a data set from published literature to assess a computed tomography-guided interventional technique. These statistical methods are important for exploring the relationships between variables and can be applied to many radiologic studies.
A Note on the Estimator of the Alpha Coefficient for Standardized Variables Under Normality
ERIC Educational Resources Information Center
Hayashi, Kentaro; Kamata, Akihito
2005-01-01
The asymptotic standard deviation (SD) of the alpha coefficient with standardized variables is derived under normality. The research shows that the SD of the standardized alpha coefficient becomes smaller as the number of examinees and/or items increase. Furthermore, this research shows that the degree of the dependence of the SD on the number of…
Exact Analysis of Squared Cross-Validity Coefficient in Predictive Regression Models
ERIC Educational Resources Information Center
Shieh, Gwowen
2009-01-01
In regression analysis, the notion of population validity is of theoretical interest for describing the usefulness of the underlying regression model, whereas the presumably more important concept of population cross-validity represents the predictive effectiveness for the regression equation in future research. It appears that the inference…
Determining Directional Dependency in Causal Associations
Pornprasertmanit, Sunthud; Little, Todd D.
2014-01-01
Directional dependency is a method to determine the likely causal direction of effect between two variables. This article aims to critique and improve upon the use of directional dependency as a technique to infer causal associations. We comment on several issues raised by von Eye and DeShon (2012), including: encouraging the use of the signs of skewness and excessive kurtosis of both variables, discouraging the use of D’Agostino’s K2, and encouraging the use of directional dependency to compare variables only within time points. We offer improved steps for determining directional dependency that fix the problems we note. Next, we discuss how to integrate directional dependency into longitudinal data analysis with two variables. We also examine the accuracy of directional dependency evaluations when several regression assumptions are violated. Directional dependency can suggest the direction of a relation if (a) the regression error in population is normal, (b) an unobserved explanatory variable correlates with any variables equal to or less than .2, (c) a curvilinear relation between both variables is not strong (standardized regression coefficient ≤ .2), (d) there are no bivariate outliers, and (e) both variables are continuous. PMID:24683282
NASA Astrophysics Data System (ADS)
Liu, Ronghua; Sun, Qiaofeng; Hu, Tian; Li, Lian; Nie, Lei; Wang, Jiayue; Zhou, Wanhui; Zang, Hengchang
2018-03-01
As a powerful process analytical technology (PAT) tool, near infrared (NIR) spectroscopy has been widely used in real-time monitoring. In this study, NIR spectroscopy was applied to monitor multi-parameters of traditional Chinese medicine (TCM) Shenzhiling oral liquid during the concentration process to guarantee the quality of products. Five lab scale batches were employed to construct quantitative models to determine five chemical ingredients and physical change (samples density) during concentration process. The paeoniflorin, albiflorin, liquiritin and samples density were modeled by partial least square regression (PLSR), while the content of the glycyrrhizic acid and cinnamic acid were modeled by support vector machine regression (SVMR). Standard normal variate (SNV) and/or Savitzkye-Golay (SG) smoothing with derivative methods were adopted for spectra pretreatment. Variable selection methods including correlation coefficient (CC), competitive adaptive reweighted sampling (CARS) and interval partial least squares regression (iPLS) were performed for optimizing the models. The results indicated that NIR spectroscopy was an effective tool to successfully monitoring the concentration process of Shenzhiling oral liquid.
NASA Astrophysics Data System (ADS)
Liu, Yande; Ying, Yibin; Lu, Huishan; Fu, Xiaping
2004-12-01
This work evaluates the feasibility of Fourier transform near infrared (FT-NIR) spectrometry for rapid determining the total soluble solids content and acidity of apple fruit. Intact apple fruit were measured by reflectance FT-NIR in 800-2500 nm range. FT-NIR models were developed based on partial least square (PLS) regression and principal component regress (PCR) with respect to the reflectance and its first derivative, the logarithms of the reflectance reciprocal and its second derivative. The above regression models, related the FT-NIR spectra to soluble solids content (SSC), titratable acidity (TA) and available acidity (pH). The best combination, based on the prediction results, was PLS models with respect to the logarithms of the reflectance reciprocal. Predictions with PLS models resulted standard errors of prediction (SEP) of 0.455, 0.044 and 0.068, and correlation coefficients of 0.968, 0.728 and 0.831 for SSC, TA and pH, respectively. It was concluded that by using the FT-NIR spectrometry measurement system, in the appropriate spectral range, it is possible to nondestructively assess the maturity factors of apple fruit.
NASA Astrophysics Data System (ADS)
Wheeler, David C.; Waller, Lance A.
2009-03-01
In this paper, we compare and contrast a Bayesian spatially varying coefficient process (SVCP) model with a geographically weighted regression (GWR) model for the estimation of the potentially spatially varying regression effects of alcohol outlets and illegal drug activity on violent crime in Houston, Texas. In addition, we focus on the inherent coefficient shrinkage properties of the Bayesian SVCP model as a way to address increased coefficient variance that follows from collinearity in GWR models. We outline the advantages of the Bayesian model in terms of reducing inflated coefficient variance, enhanced model flexibility, and more formal measuring of model uncertainty for prediction. We find spatially varying effects for alcohol outlets and drug violations, but the amount of variation depends on the type of model used. For the Bayesian model, this variation is controllable through the amount of prior influence placed on the variance of the coefficients. For example, the spatial pattern of coefficients is similar for the GWR and Bayesian models when a relatively large prior variance is used in the Bayesian model.
Sirisomboon, Panmanas; Chowbankrang, Rawiphan; Williams, Phil
2012-05-01
Near-infrared spectroscopy in diffuse reflection mode was used to evaluate the apparent viscosity of Para rubber field latex and concentrated latex over the wavelength range of 1100 to 2500 nm, using partial least square regression (PLSR). The model with ten principal components (PCs) developed using the raw spectra accurately predicted the apparent viscosity with correlation coefficient (r), standard error of prediction (SEP), and bias of 0.974, 8.6 cP, and -0.4 cP, respectively. The ratio of the SEP to the standard deviation (RPD) and the ratio of the SEP to the range (RER) for the prediction were 4.4 and 16.7, respectively. Therefore, the model can be used for measurement of the apparent viscosity of field latex and concentrated latex in quality assurance and process control in the factory.
Random effects coefficient of determination for mixed and meta-analysis models
Demidenko, Eugene; Sargent, James; Onega, Tracy
2011-01-01
The key feature of a mixed model is the presence of random effects. We have developed a coefficient, called the random effects coefficient of determination, Rr2, that estimates the proportion of the conditional variance of the dependent variable explained by random effects. This coefficient takes values from 0 to 1 and indicates how strong the random effects are. The difference from the earlier suggested fixed effects coefficient of determination is emphasized. If Rr2 is close to 0, there is weak support for random effects in the model because the reduction of the variance of the dependent variable due to random effects is small; consequently, random effects may be ignored and the model simplifies to standard linear regression. The value of Rr2 apart from 0 indicates the evidence of the variance reduction in support of the mixed model. If random effects coefficient of determination is close to 1 the variance of random effects is very large and random effects turn into free fixed effects—the model can be estimated using the dummy variable approach. We derive explicit formulas for Rr2 in three special cases: the random intercept model, the growth curve model, and meta-analysis model. Theoretical results are illustrated with three mixed model examples: (1) travel time to the nearest cancer center for women with breast cancer in the U.S., (2) cumulative time watching alcohol related scenes in movies among young U.S. teens, as a risk factor for early drinking onset, and (3) the classic example of the meta-analysis model for combination of 13 studies on tuberculosis vaccine. PMID:23750070
Cho, Yeoungjee; Büchel, Janine; Steppan, Sonja; Passlick-Deetjen, Jutta; Hawley, Carmel M.; Dimeski, Goce; Clarke, Margaret; Johnson, David W.
2016-01-01
♦ Background: The longitudinal trends of lipid parameters and the impact of biocompatible peritoneal dialysis (PD) solutions on these levels remain to be fully defined. The present study aimed to a) evaluate the influence of neutral pH, low glucose degradation product (GDP) PD solutions on serum lipid parameters, and b) explore the capacity of lipid parameters (total cholesterol [TC], triglyceride [TG], high density lipoprotein [HDL], TC/HDL, low density lipoprotein [LDL], very low density lipoprotein [VLDL]) to predict cardiovascular events (CVE) and mortality in PD patients. ♦ Methods: The study included 175 incident participants from the balANZ trial with at least 1 stored serum sample. A composite CVE score was used as a primary clinical outcome measure. Multilevel linear regression and Poisson regression models were fitted to describe the trend of lipid parameters over time and its ability to predict composite CVE, respectively. ♦ Results: Small but statistically significant increases in serum TG (coefficient 0.006, p < 0.001), TC/HDL (coefficient 0.004, p = 0.001), and VLDL cholesterol (coefficient 0.005, p = 0.001) levels and a decrease in the serum HDL cholesterol levels (coefficient −0.004, p = 0.009) were observed with longer time on PD, whilst the type of PD solution (biocompatible vs standard) received had no significant effect on these levels. Peritoneal dialysis glucose exposure was significantly associated with trends in TG, TC/HDL, HDL and VLDL levels. Baseline lipid parameter levels were not predictive of composite CVEs or all-cause mortality. ♦ Conclusion: Serum TG, TC/HDL, and VLDL levels increased and the serum HDL levels decreased with increasing PD duration. None of the lipid parameters were significantly modified by biocompatible PD solution use over the time period studied or predictive of composite CVE or mortality. PMID:26429421
NASA Astrophysics Data System (ADS)
Wilson, Barry T.; Knight, Joseph F.; McRoberts, Ronald E.
2018-03-01
Imagery from the Landsat Program has been used frequently as a source of auxiliary data for modeling land cover, as well as a variety of attributes associated with tree cover. With ready access to all scenes in the archive since 2008 due to the USGS Landsat Data Policy, new approaches to deriving such auxiliary data from dense Landsat time series are required. Several methods have previously been developed for use with finer temporal resolution imagery (e.g. AVHRR and MODIS), including image compositing and harmonic regression using Fourier series. The manuscript presents a study, using Minnesota, USA during the years 2009-2013 as the study area and timeframe. The study examined the relative predictive power of land cover models, in particular those related to tree cover, using predictor variables based solely on composite imagery versus those using estimated harmonic regression coefficients. The study used two common non-parametric modeling approaches (i.e. k-nearest neighbors and random forests) for fitting classification and regression models of multiple attributes measured on USFS Forest Inventory and Analysis plots using all available Landsat imagery for the study area and timeframe. The estimated Fourier coefficients developed by harmonic regression of tasseled cap transformation time series data were shown to be correlated with land cover, including tree cover. Regression models using estimated Fourier coefficients as predictor variables showed a two- to threefold increase in explained variance for a small set of continuous response variables, relative to comparable models using monthly image composites. Similarly, the overall accuracies of classification models using the estimated Fourier coefficients were approximately 10-20 percentage points higher than the models using the image composites, with corresponding individual class accuracies between six and 45 percentage points higher.
SPSS and SAS programs for comparing Pearson correlations and OLS regression coefficients.
Weaver, Bruce; Wuensch, Karl L
2013-09-01
Several procedures that use summary data to test hypotheses about Pearson correlations and ordinary least squares regression coefficients have been described in various books and articles. To our knowledge, however, no single resource describes all of the most common tests. Furthermore, many of these tests have not yet been implemented in popular statistical software packages such as SPSS and SAS. In this article, we describe all of the most common tests and provide SPSS and SAS programs to perform them. When they are applicable, our code also computes 100 × (1 - α)% confidence intervals corresponding to the tests. For testing hypotheses about independent regression coefficients, we demonstrate one method that uses summary data and another that uses raw data (i.e., Potthoff analysis). When the raw data are available, the latter method is preferred, because use of summary data entails some loss of precision due to rounding.
NASA Astrophysics Data System (ADS)
Zhai, Mengting; Chen, Yan; Li, Jing; Zhou, Jun
2017-12-01
The molecular electrongativity distance vector (MEDV-13) was used to describe the molecular structure of benzyl ether diamidine derivatives in this paper, Based on MEDV-13, The three-parameter (M 3, M 15, M 47) QSAR model of insecticidal activity (pIC 50) for 60 benzyl ether diamidine derivatives was constructed by leaps-and-bounds regression (LBR) . The traditional correlation coefficient (R) and the cross-validation correlation coefficient (R CV ) were 0.975 and 0.971, respectively. The robustness of the regression model was validated by Jackknife method, the correlation coefficient R were between 0.971 and 0.983. Meanwhile, the independent variables in the model were tested to be no autocorrelation. The regression results indicate that the model has good robust and predictive capabilities. The research would provide theoretical guidance for the development of new generation of anti African trypanosomiasis drugs with efficiency and low toxicity.
Zheng, Qi; Peng, Limin
2016-01-01
Quantile regression provides a flexible platform for evaluating covariate effects on different segments of the conditional distribution of response. As the effects of covariates may change with quantile level, contemporaneously examining a spectrum of quantiles is expected to have a better capacity to identify variables with either partial or full effects on the response distribution, as compared to focusing on a single quantile. Under this motivation, we study a general adaptively weighted LASSO penalization strategy in the quantile regression setting, where a continuum of quantile index is considered and coefficients are allowed to vary with quantile index. We establish the oracle properties of the resulting estimator of coefficient function. Furthermore, we formally investigate a BIC-type uniform tuning parameter selector and show that it can ensure consistent model selection. Our numerical studies confirm the theoretical findings and illustrate an application of the new variable selection procedure. PMID:28008212
Giacomelli, Giovanni; Virgili, Gianni; Giansanti, Fabrizio; Sato, Giovanni; Cappello, Ezio; Cruciani, Filippo; Varano, Monica; Menchini, Ugo
2013-06-27
To investigate the simultaneous association of several psychophysical measures with reading ability in patients with mild and moderate low vision attending rehabilitation services. Standard measurements of reading ability (Minnesota Reading [MNREAD] charts), visual acuity (Early Treatment of Diabetic Retinopathy Study [ETDRS] charts), contrast sensitivity (Pelli-Robson charts), reading contrast threshold (Reading Explorer [REX] charts), retinal sensitivity, and fixation stability and localization (Micro Perimeter 1 [MP1] fundus perimetry) were obtained in 160 low vision patients with better eye visual acuity ranging from 0.3 to 1.0 logarithm of the minimum angle of resolution and affected by either age-related macular degeneration or diabetic retinopathy. All variables were moderately associated with reading performance measures (MNREAD reading speed and reading acuity and REX reading contrast threshold), as well as among each other. In a structural equation model, REX reading contrast threshold was highly associated with MNREAD reading speed (standardized coefficient, 0.63) and moderately associated with reading acuity (standardized coefficient, -0.30). REX test also mediated the effects of Pelli-Robson contrast sensitivity (standardized coefficient, 0.44), MP1 fixation eccentricity (standardized coefficient, -0.19), and the mean retinal sensitivity (standardized coefficient, 0.23) on reading performance. The MP1 fixation stability was associated with both MNREAD reading acuity (standardized coefficient, -0.24) and MNREAD reading speed (standardized coefficient, 0.23), while ETDRS visual acuity only affected reading acuity (standardized coefficient, 0.44). Fixation instability and contrast sensitivity loss are key factors limiting reading performance of patients with mild or moderate low vision. REX charts directly assess the impact of text contrast on letter recognition and text navigation and may be a useful aid in reading rehabilitation.
Ramsthaler, Frank; Kettner, Mattias; Verhoff, Marcel A
2014-01-01
In forensic anthropological casework, estimating age-at-death is key to profiling unknown skeletal remains. The aim of this study was to examine the reliability of a new, simple, fast, and inexpensive digital odontological method for age-at-death estimation. The method is based on the original Lamendin method, which is a widely used technique in the repertoire of odontological aging methods in forensic anthropology. We examined 129 single root teeth employing a digital camera and imaging software for the measurement of the luminance of the teeth's translucent root zone. Variability in luminance detection was evaluated using statistical technical error of measurement analysis. The method revealed stable values largely unrelated to observer experience, whereas requisite formulas proved to be camera-specific and should therefore be generated for an individual recording setting based on samples of known chronological age. Multiple regression analysis showed a highly significant influence of the coefficients of the variables "arithmetic mean" and "standard deviation" of luminance for the regression formula. For the use of this primer multivariate equation for age-at-death estimation in casework, a standard error of the estimate of 6.51 years was calculated. Step-by-step reduction of the number of embedded variables to linear regression analysis employing the best contributor "arithmetic mean" of luminance yielded a regression equation with a standard error of 6.72 years (p < 0.001). The results of this study not only support the premise of root translucency as an age-related phenomenon, but also demonstrate that translucency reflects a number of other influencing factors in addition to age. This new digital measuring technique of the zone of dental root luminance can broaden the array of methods available for estimating chronological age, and furthermore facilitate measurement and age classification due to its low dependence on observer experience.
Testing a single regression coefficient in high dimensional linear models
Zhong, Ping-Shou; Li, Runze; Wang, Hansheng; Tsai, Chih-Ling
2017-01-01
In linear regression models with high dimensional data, the classical z-test (or t-test) for testing the significance of each single regression coefficient is no longer applicable. This is mainly because the number of covariates exceeds the sample size. In this paper, we propose a simple and novel alternative by introducing the Correlated Predictors Screening (CPS) method to control for predictors that are highly correlated with the target covariate. Accordingly, the classical ordinary least squares approach can be employed to estimate the regression coefficient associated with the target covariate. In addition, we demonstrate that the resulting estimator is consistent and asymptotically normal even if the random errors are heteroscedastic. This enables us to apply the z-test to assess the significance of each covariate. Based on the p-value obtained from testing the significance of each covariate, we further conduct multiple hypothesis testing by controlling the false discovery rate at the nominal level. Then, we show that the multiple hypothesis testing achieves consistent model selection. Simulation studies and empirical examples are presented to illustrate the finite sample performance and the usefulness of the proposed method, respectively. PMID:28663668
Testing a single regression coefficient in high dimensional linear models.
Lan, Wei; Zhong, Ping-Shou; Li, Runze; Wang, Hansheng; Tsai, Chih-Ling
2016-11-01
In linear regression models with high dimensional data, the classical z -test (or t -test) for testing the significance of each single regression coefficient is no longer applicable. This is mainly because the number of covariates exceeds the sample size. In this paper, we propose a simple and novel alternative by introducing the Correlated Predictors Screening (CPS) method to control for predictors that are highly correlated with the target covariate. Accordingly, the classical ordinary least squares approach can be employed to estimate the regression coefficient associated with the target covariate. In addition, we demonstrate that the resulting estimator is consistent and asymptotically normal even if the random errors are heteroscedastic. This enables us to apply the z -test to assess the significance of each covariate. Based on the p -value obtained from testing the significance of each covariate, we further conduct multiple hypothesis testing by controlling the false discovery rate at the nominal level. Then, we show that the multiple hypothesis testing achieves consistent model selection. Simulation studies and empirical examples are presented to illustrate the finite sample performance and the usefulness of the proposed method, respectively.
Yoneoka, Daisuke; Henmi, Masayuki
2017-06-01
Recently, the number of regression models has dramatically increased in several academic fields. However, within the context of meta-analysis, synthesis methods for such models have not been developed in a commensurate trend. One of the difficulties hindering the development is the disparity in sets of covariates among literature models. If the sets of covariates differ across models, interpretation of coefficients will differ, thereby making it difficult to synthesize them. Moreover, previous synthesis methods for regression models, such as multivariate meta-analysis, often have problems because covariance matrix of coefficients (i.e. within-study correlations) or individual patient data are not necessarily available. This study, therefore, proposes a brief explanation regarding a method to synthesize linear regression models under different covariate sets by using a generalized least squares method involving bias correction terms. Especially, we also propose an approach to recover (at most) threecorrelations of covariates, which is required for the calculation of the bias term without individual patient data. Copyright © 2016 John Wiley & Sons, Ltd. Copyright © 2016 John Wiley & Sons, Ltd.
Retro-regression--another important multivariate regression improvement.
Randić, M
2001-01-01
We review the serious problem associated with instabilities of the coefficients of regression equations, referred to as the MRA (multivariate regression analysis) "nightmare of the first kind". This is manifested when in a stepwise regression a descriptor is included or excluded from a regression. The consequence is an unpredictable change of the coefficients of the descriptors that remain in the regression equation. We follow with consideration of an even more serious problem, referred to as the MRA "nightmare of the second kind", arising when optimal descriptors are selected from a large pool of descriptors. This process typically causes at different steps of the stepwise regression a replacement of several previously used descriptors by new ones. We describe a procedure that resolves these difficulties. The approach is illustrated on boiling points of nonanes which are considered (1) by using an ordered connectivity basis; (2) by using an ordering resulting from application of greedy algorithm; and (3) by using an ordering derived from an exhaustive search for optimal descriptors. A novel variant of multiple regression analysis, called retro-regression (RR), is outlined showing how it resolves the ambiguities associated with both "nightmares" of the first and the second kind of MRA.
[From clinical judgment to linear regression model.
Palacios-Cruz, Lino; Pérez, Marcela; Rivas-Ruiz, Rodolfo; Talavera, Juan O
2013-01-01
When we think about mathematical models, such as linear regression model, we think that these terms are only used by those engaged in research, a notion that is far from the truth. Legendre described the first mathematical model in 1805, and Galton introduced the formal term in 1886. Linear regression is one of the most commonly used regression models in clinical practice. It is useful to predict or show the relationship between two or more variables as long as the dependent variable is quantitative and has normal distribution. Stated in another way, the regression is used to predict a measure based on the knowledge of at least one other variable. Linear regression has as it's first objective to determine the slope or inclination of the regression line: Y = a + bx, where "a" is the intercept or regression constant and it is equivalent to "Y" value when "X" equals 0 and "b" (also called slope) indicates the increase or decrease that occurs when the variable "x" increases or decreases in one unit. In the regression line, "b" is called regression coefficient. The coefficient of determination (R 2 ) indicates the importance of independent variables in the outcome.
Interpretation of commonly used statistical regression models.
Kasza, Jessica; Wolfe, Rory
2014-01-01
A review of some regression models commonly used in respiratory health applications is provided in this article. Simple linear regression, multiple linear regression, logistic regression and ordinal logistic regression are considered. The focus of this article is on the interpretation of the regression coefficients of each model, which are illustrated through the application of these models to a respiratory health research study. © 2013 The Authors. Respirology © 2013 Asian Pacific Society of Respirology.
Verification of spectrophotometric method for nitrate analysis in water samples
NASA Astrophysics Data System (ADS)
Kurniawati, Puji; Gusrianti, Reny; Dwisiwi, Bledug Bernanti; Purbaningtias, Tri Esti; Wiyantoko, Bayu
2017-12-01
The aim of this research was to verify the spectrophotometric method to analyze nitrate in water samples using APHA 2012 Section 4500 NO3-B method. The verification parameters used were: linearity, method detection limit, level of quantitation, level of linearity, accuracy and precision. Linearity was obtained by using 0 to 50 mg/L nitrate standard solution and the correlation coefficient of standard calibration linear regression equation was 0.9981. The method detection limit (MDL) was defined as 0,1294 mg/L and limit of quantitation (LOQ) was 0,4117 mg/L. The result of a level of linearity (LOL) was 50 mg/L and nitrate concentration 10 to 50 mg/L was linear with a level of confidence was 99%. The accuracy was determined through recovery value was 109.1907%. The precision value was observed using % relative standard deviation (%RSD) from repeatability and its result was 1.0886%. The tested performance criteria showed that the methodology was verified under the laboratory conditions.
A SEMIPARAMETRIC BAYESIAN MODEL FOR CIRCULAR-LINEAR REGRESSION
We present a Bayesian approach to regress a circular variable on a linear predictor. The regression coefficients are assumed to have a nonparametric distribution with a Dirichlet process prior. The semiparametric Bayesian approach gives added flexibility to the model and is usefu...
Yao, Xin; Niu, Yandong; Li, Youzhi; Zou, Dongsheng; Ding, Xiaohui; Bian, Hualin
2018-05-09
Bioaccumulation of five heavy metals (Cd, Cu, Mn, Pb, and Zn) in six plant organs (panicle, leaf, stem, root, rhizome, and bud) of the emergent and perennial plant species, Miscanthus sacchariflorus, were investigated to estimate the plant's potential for accumulating heavy metals in the wetlands of Dongting Lake. We found the highest Cd concentrations in the panicles and leaves; while the highest Cu and Mn were observed in the roots, the highest Pb in the panicles, and the highest Zn in the panicles and buds. In contrast, the lowest Cd concentrations were detected in the stem, roots, and buds; the lowest Cu concentrations in the leaves and stems; the lowest Mn concentrations in the panicles, rhizomes, and buds; the lowest Pb concentrations in the stems; and the lowest Zn concentrations in the leaves, stems, and rhizomes. Mean Cu concentration in the plant showed a positive regression coefficient with plot elevation, soil organic matter content, and soil Cu concentration, whereas it showed a negative regression coefficient with soil moisture and electrolyte leakage. Mean Mn concentration showed positive and negative regression coefficients with soil organic matter and soil moisture, respectively. Mean Pb concentration exhibited positive regression coefficient with plot elevation and soil total P concentration, and Zn concentration showed a positive regression coefficient with soil available P and total P concentrations. However, there was no significant regression coefficient between mean Cd concentration in the plant and the investigated environmental parameters. Stems and roots were the main organs involved in heavy metal accumulation from the environment. The mean quantities of heavy metals accumulated in the plant tissues were 2.2 mg Cd, 86.7 mg Cu, 290.3 mg Mn, 15.9 mg Pb, and 307 mg Zn per square meter. In the Dongting Lake wetlands, 0.7 × 10 3 kg Cd, 22.9 × 10 3 kg Cu, 77.5 × 10 3 kg Mn, 3.1 × 10 3 kg Pb, and 95.9 × 10 3 kg Zn per year were accumulated by aboveground organs and removed from the lake through harvesting for paper manufacture.
NASA Astrophysics Data System (ADS)
Hammud, Hassan H.; Ghannoum, Amer; Masoud, Mamdouh S.
2006-02-01
Sixteen Schiff bases obtained from the condensation of benzaldehyde or salicylaldehyde with various amines (aniline, 4-carboxyaniline, phenylhydrazine, 2,4-dinitrophenylhydrazine, ethylenediamine, hydrazine, o-phenylenediamine and 2,6-pyridinediamine) are studied with UV-vis spectroscopy to observe the effect of solvents, substituents and other structural factors on the spectra. The bands involving different electronic transitions are interpreted. Computerized analysis and multiple regression techniques were applied to calculate the regression and correlation coefficients based on the equation that relates peak position λmax to the solvent parameters that depend on the H-bonding ability, refractive index and dielectric constant of solvents.
Fischer, A; Friggens, N C; Berry, D P; Faverdin, P
2018-07-01
The ability to properly assess and accurately phenotype true differences in feed efficiency among dairy cows is key to the development of breeding programs for improving feed efficiency. The variability among individuals in feed efficiency is commonly characterised by the residual intake approach. Residual feed intake is represented by the residuals of a linear regression of intake on the corresponding quantities of the biological functions that consume (or release) energy. However, the residuals include both, model fitting and measurement errors as well as any variability in cow efficiency. The objective of this study was to isolate the individual animal variability in feed efficiency from the residual component. Two separate models were fitted, in one the standard residual energy intake (REI) was calculated as the residual of a multiple linear regression of lactation average net energy intake (NEI) on lactation average milk energy output, average metabolic BW, as well as lactation loss and gain of body condition score. In the other, a linear mixed model was used to simultaneously fit fixed linear regressions and random cow levels on the biological traits and intercept using fortnight repeated measures for the variables. This method split the predicted NEI in two parts: one quantifying the population mean intercept and coefficients, and one quantifying cow-specific deviations in the intercept and coefficients. The cow-specific part of predicted NEI was assumed to isolate true differences in feed efficiency among cows. NEI and associated energy expenditure phenotypes were available for the first 17 fortnights of lactation from 119 Holstein cows; all fed a constant energy-rich diet. Mixed models fitting cow-specific intercept and coefficients to different combinations of the aforementioned energy expenditure traits, calculated on a fortnightly basis, were compared. The variance of REI estimated with the lactation average model represented only 8% of the variance of measured NEI. Among all compared mixed models, the variance of the cow-specific part of predicted NEI represented between 53% and 59% of the variance of REI estimated from the lactation average model or between 4% and 5% of the variance of measured NEI. The remaining 41% to 47% of the variance of REI estimated with the lactation average model may therefore reflect model fitting errors or measurement errors. In conclusion, the use of a mixed model framework with cow-specific random regressions seems to be a promising method to isolate the cow-specific component of REI in dairy cows.
Morikawa, Go; Suzuka, Chihiro; Shoji, Atsushi; Shibusawa, Yoichi; Yanagida, Akio
2016-01-05
A high-throughput method for determining the octanol/water partition coefficient (P(o/w)) of a large variety of compounds exhibiting a wide range in hydrophobicity was established. The method combines a simple shake-flask method with a novel two-phase solvent system comprising an acetonitrile-phosphate buffer (0.1 M, pH 7.4)-1-octanol (25:25:4, v/v/v; AN system). The AN system partition coefficients (K(AN)) of 51 standard compounds for which log P(o/w) (at pH 7.4; log D) values had been reported were determined by single two-phase partitioning in test tubes, followed by measurement of the solute concentration in both phases using an automatic flow injection-ultraviolet detection system. The log K(AN) values were closely related to reported log D values, and the relationship could be expressed by the following linear regression equation: log D=2.8630 log K(AN) -0.1497(n=51). The relationship reveals that log D values (+8 to -8) for a large variety of highly hydrophobic and/or hydrophilic compounds can be estimated indirectly from the narrow range of log K(AN) values (+3 to -3) determined using the present method. Furthermore, log K(AN) values for highly polar compounds for which no log D values have been reported, such as amino acids, peptides, proteins, nucleosides, and nucleotides, can be estimated using the present method. The wide-ranging log D values (+5.9 to -7.5) of these molecules were estimated for the first time from their log K(AN) values and the above regression equation. Copyright © 2015 Elsevier B.V. All rights reserved.
Chen, Hong; Wang, Wen-jun; Chen, Yu-zhen; Mai, Mei-qi; Ouyang, Neng-yong; Chen, Jing-hua; Tuo, Ping
2010-05-01
To investigate the impacts of body mass index (BMI) and age on in vitro fertilization-embryo transfer (IVF) and intracytoplasmic sperm injection (ICSI) treatment in infertile patients without polycystic ovary syndrome (PCOS). A retrospective study of 1426 patients during Jun. 2001 - Nov. 2009 was carried out. Multiple regression was used to analyze the effects of BMI (low weight: BMI < 18.5 kg/m(2), normal weight: BMI 18.5 - 23.99 kg/m(2) and over weight-obesity: BMI ≥ 24 kg/m(2)) and age (young: 20 - 34 years old, eld: 35 - 45 years old) on controlled ovarian stimulation (COH) [including: dose and duration of Gn, E2 level on day of human chorionic gonadotropin (HCG) administration, number of oocytes collected and full-grown follicles], number of fertilization, cleavage, two-pronucleus, normal embryos and cryopreserved embryos and clinical pregnancy outcome. (1) Gn dose for the patients whose age were 35 and the above, had a positive correlation with age (P < 0.001), 12.70% of the total variation of Gn dose was related to age (standardized partial regression coefficient was 0.343). (2) Estradiol level on day of HCG administration had a negative correlation with BMI in overweight-obesity patients, and so were the patients whose age were 35 and above (P value respectively lower than 0.037 and 0.018). 0.80% of the total variation of estradiol (HCG day) is related to age and overweight-obesity while age took greater proportion (standardized partial regression coefficients were 0.066 and 0.058 respectively). (3) For older patients, age appeared to have negative relationships with duration of Gn and number of oocytes collected, full-grown follicles, fertilization, cleavage, two-pronucleus, normal embryos and cryopreserved embryos (P < 0.05). (4) Compared to young-normal weight patients, the odds ratio of pregnancy in eld-low weight and eld-overweight-obesity patients were 0.482 and 0.529 (P < 0.05) respectively. Age, but not the BMI, had significant effects on IVF/ICSI treatment. It seems that factors as losing weight before IVF or ICSI treatment effective in reducing the dose of Gn.
Monitoring Energy Balance in Breast Cancer Survivors Using a Mobile App: Reliability Study
Lozano-Lozano, Mario; Galiano-Castillo, Noelia; Martín-Martín, Lydia; Pace-Bedetti, Nicolás; Fernández-Lao, Carolina; Cantarero-Villanueva, Irene
2018-01-01
Background The majority of breast cancer survivors do not meet recommendations in terms of diet and physical activity. To address this problem, we developed a mobile health (mHealth) app for assessing and monitoring healthy lifestyles in breast cancer survivors, called the Energy Balance on Cancer (BENECA) mHealth system. The BENECA mHealth system is a novel and interactive mHealth app, which allows breast cancer survivors to engage themselves in their energy balance monitoring. BENECA was designed to facilitate adherence to healthy lifestyles in an easy and intuitive way. Objective The objective of the study was to assess the concurrent validity and test-retest reliability between the BENECA mHealth system and the gold standard assessment methods for diet and physical activity. Methods A reliability study was conducted with 20 breast cancer survivors. In the study, tri-axial accelerometers (ActiGraphGT3X+) were used as gold standard for 8 consecutive days, in addition to 2, 24-hour dietary recalls, 4 dietary records, and sociodemographic questionnaires. Two-way random effect intraclass correlation coefficients, a linear regression-analysis, and a Passing-Bablok regression were calculated. Results The reliability estimates were very high for all variables (alpha≥.90). The lowest reliability was found in fruit and vegetable intakes (alpha=.94). The reliability between the accelerometer and the dietary assessment instruments against the BENECA system was very high (intraclass correlation coefficient=.90). We found a mean match rate of 93.51% between instruments and a mean phantom rate of 3.35%. The Passing-Bablok regression analysis did not show considerable bias in fat percentage, portions of fruits and vegetables, or minutes of moderate to vigorous physical activity. Conclusions The BENECA mHealth app could be a new tool to measure energy balance in breast cancer survivors in a reliable and simple way. Our results support the use of this technology to not only to encourage changes in breast cancer survivors' lifestyles, but also to remotely monitor energy balance. Trial Registration ClinicalTrials.gov NCT02817724; https://clinicaltrials.gov/ct2/show/NCT02817724 (Archived by WebCite at http://www.webcitation.org/6xVY1buCc) PMID:29588273
Kim, Jae-Hwan; Park, Saet-Byul; Roh, Hyo-Jeong; Shin, Min-Ki; Moon, Gui-Im; Hong, Jin-Hwan; Kim, Hae-Yeong
2017-07-01
One novel standard reference plasmid, namely pUC-RICE5, was constructed as a positive control and calibrator for event-specific qualitative and quantitative detection of genetically modified (GM) rice (Bt63, Kemingdao1, Kefeng6, Kefeng8, and LLRice62). pUC-RICE5 contained fragments of a rice-specific endogenous reference gene (sucrose phosphate synthase) as well as the five GM rice events. An existing qualitative PCR assay approach was modified using pUC-RICE5 to create a quantitative method with limits of detection correlating to approximately 1-10 copies of rice haploid genomes. In this quantitative PCR assay, the square regression coefficients ranged from 0.993 to 1.000. The standard deviation and relative standard deviation values for repeatability ranged from 0.02 to 0.22 and 0.10% to 0.67%, respectively. The Ministry of Food and Drug Safety (Korea) validated the method and the results suggest it could be used routinely to identify five GM rice events. Copyright © 2017 Elsevier Ltd. All rights reserved.
ERIC Educational Resources Information Center
Coskuntuncel, Orkun
2013-01-01
The purpose of this study is two-fold; the first aim being to show the effect of outliers on the widely used least squares regression estimator in social sciences. The second aim is to compare the classical method of least squares with the robust M-estimator using the "determination of coefficient" (R[superscript 2]). For this purpose,…
Stress Optical Coefficient, Test Methodology, and Glass Standard Evaluation
2016-05-01
identifying and mapping flaw size distributions on glass surfaces for predicting mechanical response. International Journal of Applied Glass ...ARL-TN-0756 ● MAY 2016 US Army Research Laboratory Stress Optical Coefficient, Test Methodology, and Glass Standard Evaluation...Stress Optical Coefficient, Test Methodology, and Glass Standard Evaluation by Clayton M Weiss Oak Ridge Institute for Science and Education
QSAR modeling of flotation collectors using principal components extracted from topological indices.
Natarajan, R; Nirdosh, Inderjit; Basak, Subhash C; Mills, Denise R
2002-01-01
Several topological indices were calculated for substituted-cupferrons that were tested as collectors for the froth flotation of uranium. The principal component analysis (PCA) was used for data reduction. Seven principal components (PC) were found to account for 98.6% of the variance among the computed indices. The principal components thus extracted were used in stepwise regression analyses to construct regression models for the prediction of separation efficiencies (Es) of the collectors. A two-parameter model with a correlation coefficient of 0.889 and a three-parameter model with a correlation coefficient of 0.913 were formed. PCs were found to be better than partition coefficient to form regression equations, and inclusion of an electronic parameter such as Hammett sigma or quantum mechanically derived electronic charges on the chelating atoms did not improve the correlation coefficient significantly. The method was extended to model the separation efficiencies of mercaptobenzothiazoles (MBT) and aminothiophenols (ATP) used in the flotation of lead and zinc ores, respectively. Five principal components were found to explain 99% of the data variability in each series. A three-parameter equation with correlation coefficient of 0.985 and a two-parameter equation with correlation coefficient of 0.926 were obtained for MBT and ATP, respectively. The amenability of separation efficiencies of chelating collectors to QSAR modeling using PCs based on topological indices might lead to the selection of collectors for synthesis and testing from a virtual database.
Correlation of track irregularities and vehicle responses based on measured data
NASA Astrophysics Data System (ADS)
Karis, Tomas; Berg, Mats; Stichel, Sebastian; Li, Martin; Thomas, Dirk; Dirks, Babette
2018-06-01
Track geometry quality and dynamic vehicle response are closely related, but do not always correspond with each other in terms of maximum values and standard deviations. This can often be seen to give poor results in analyses with correlation coefficients or regression analysis. Measured data from both the EU project DynoTRAIN and the Swedish Green Train (Gröna Tåget) research programme is used in this paper to evaluate track-vehicle response for three vehicles. A single degree of freedom model is used as an inspiration to divide track-vehicle interaction into three parts, which are analysed in terms of correlation. One part, the vertical axle box acceleration divided by vehicle speed squared (?) and the second spatial derivative of the vertical track irregularities (?), is shown to be the weak link with lower correlation coefficients than the other parts. Future efforts should therefore be directed towards investigating the relation between axle box accelerations and track irregularity second derivatives.
Mathematical modeling of tetrahydroimidazole benzodiazepine-1-one derivatives as an anti HIV agent
NASA Astrophysics Data System (ADS)
Ojha, Lokendra Kumar
2017-07-01
The goal of the present work is the study of drug receptor interaction via QSAR (Quantitative Structure-Activity Relationship) analysis for 89 set of TIBO (Tetrahydroimidazole Benzodiazepine-1-one) derivatives. MLR (Multiple Linear Regression) method is utilized to generate predictive models of quantitative structure-activity relationships between a set of molecular descriptors and biological activity (IC50). The best QSAR model was selected having a correlation coefficient (r) of 0.9299 and Standard Error of Estimation (SEE) of 0.5022, Fisher Ratio (F) of 159.822 and Quality factor (Q) of 1.852. This model is statistically significant and strongly favours the substitution of sulphur atom, IS i.e. indicator parameter for -Z position of the TIBO derivatives. Two other parameter logP (octanol-water partition coefficient) and SAG (Surface Area Grid) also played a vital role in the generation of best QSAR model. All three descriptor shows very good stability towards data variation in leave-one-out (LOO).
Pace, M.N.; Rosentreter, J.J.; Bartholomay, R.C.
2001-01-01
Idaho State University and the US Geological Survey, in cooperation with the US Department of Energy, conducted a study to determine and evaluate strontium distribution coefficients (Kds) of subsurface materials at the Idaho National Engineering and Environmental Laboratory (INEEL). The Kds were determined to aid in assessing the variability of strontium Kds and their effects on chemical transport of strontium-90 in the Snake River Plain aquifer system. Data from batch experiments done to determine strontium Kds of five sediment-infill samples and six standard reference material samples were analyzed by using multiple linear regression analysis and the stepwise variable-selection method in the statistical program, Statistical Product and Service Solutions, to derive an equation of variables that can be used to predict strontium Kds of sediment-infill samples. The sediment-infill samples were from basalt vesicles and fractures from a selected core at the INEEL; strontium Kds ranged from ???201 to 356 ml g-1. The standard material samples consisted of clay minerals and calcite. The statistical analyses of the batch-experiment results showed that the amount of strontium in the initial solution, the amount of manganese oxide in the sample material, and the amount of potassium in the initial solution are the most important variables in predicting strontium Kds of sediment-infill samples.
Prediction of soil organic carbon partition coefficients by soil column liquid chromatography.
Guo, Rongbo; Liang, Xinmiao; Chen, Jiping; Wu, Wenzhong; Zhang, Qing; Martens, Dieter; Kettrup, Antonius
2004-04-30
To avoid the limitation of the widely used prediction methods of soil organic carbon partition coefficients (KOC) from hydrophobic parameters, e.g., the n-octanol/water partition coefficients (KOW) and the reversed phase high performance liquid chromatographic (RP-HPLC) retention factors, the soil column liquid chromatographic (SCLC) method was developed for KOC prediction. The real soils were used as the packing materials of RP-HPLC columns, and the correlations between the retention factors of organic compounds on soil columns (ksoil) and KOC measured by batch equilibrium method were studied. Good correlations were achieved between ksoil and KOC for three types of soils with different properties. All the square of the correlation coefficients (R2) of the linear regression between log ksoil and log KOC were higher than 0.89 with standard deviations of less than 0.21. In addition, the prediction of KOC from KOW and the RP-HPLC retention factors on cyanopropyl (CN) stationary phase (kCN) was comparatively evaluated for the three types of soils. The results show that the prediction of KOC from kCN and KOW is only applicable to some specific types of soils. The results obtained in the present study proved that the SCLC method is appropriate for the KOC prediction for different types of soils, however the applicability of using hydrophobic parameters to predict KOC largely depends on the properties of soil concerned.
Parametric regression model for survival data: Weibull regression model as an example
2016-01-01
Weibull regression model is one of the most popular forms of parametric regression model that it provides estimate of baseline hazard function, as well as coefficients for covariates. Because of technical difficulties, Weibull regression model is seldom used in medical literature as compared to the semi-parametric proportional hazard model. To make clinical investigators familiar with Weibull regression model, this article introduces some basic knowledge on Weibull regression model and then illustrates how to fit the model with R software. The SurvRegCensCov package is useful in converting estimated coefficients to clinical relevant statistics such as hazard ratio (HR) and event time ratio (ETR). Model adequacy can be assessed by inspecting Kaplan-Meier curves stratified by categorical variable. The eha package provides an alternative method to model Weibull regression model. The check.dist() function helps to assess goodness-of-fit of the model. Variable selection is based on the importance of a covariate, which can be tested using anova() function. Alternatively, backward elimination starting from a full model is an efficient way for model development. Visualization of Weibull regression model after model development is interesting that it provides another way to report your findings. PMID:28149846
Guo, Ying; Little, Roderick J; McConnell, Daniel S
2012-01-01
Covariate measurement error is common in epidemiologic studies. Current methods for correcting measurement error with information from external calibration samples are insufficient to provide valid adjusted inferences. We consider the problem of estimating the regression of an outcome Y on covariates X and Z, where Y and Z are observed, X is unobserved, but a variable W that measures X with error is observed. Information about measurement error is provided in an external calibration sample where data on X and W (but not Y and Z) are recorded. We describe a method that uses summary statistics from the calibration sample to create multiple imputations of the missing values of X in the regression sample, so that the regression coefficients of Y on X and Z and associated standard errors can be estimated using simple multiple imputation combining rules, yielding valid statistical inferences under the assumption of a multivariate normal distribution. The proposed method is shown by simulation to provide better inferences than existing methods, namely the naive method, classical calibration, and regression calibration, particularly for correction for bias and achieving nominal confidence levels. We also illustrate our method with an example using linear regression to examine the relation between serum reproductive hormone concentrations and bone mineral density loss in midlife women in the Michigan Bone Health and Metabolism Study. Existing methods fail to adjust appropriately for bias due to measurement error in the regression setting, particularly when measurement error is substantial. The proposed method corrects this deficiency.
NASA Astrophysics Data System (ADS)
de Oliveira, Isadora R. N.; Roque, Jussara V.; Maia, Mariza P.; Stringheta, Paulo C.; Teófilo, Reinaldo F.
2018-04-01
A new method was developed to determine the antioxidant properties of red cabbage extract (Brassica oleracea) by mid (MID) and near (NIR) infrared spectroscopies and partial least squares (PLS) regression. A 70% (v/v) ethanolic extract of red cabbage was concentrated to 9° Brix and further diluted (12 to 100%) in water. The dilutions were used as external standards for the building of PLS models. For the first time, this strategy was applied for building multivariate regression models. Reference analyses and spectral data were obtained from diluted extracts. The determinate properties were total and monomeric anthocyanins, total polyphenols and antioxidant capacity by ABTS (2,2-azino-bis(3-ethyl-benzothiazoline-6-sulfonate)) and DPPH (2,2-diphenyl-1-picrylhydrazyl) methods. Ordered predictors selection (OPS) and genetic algorithm (GA) were used for feature selection before PLS regression (PLS-1). In addition, a PLS-2 regression was applied to all properties simultaneously. PLS-1 models provided more predictive models than did PLS-2 regression. PLS-OPS and PLS-GA models presented excellent prediction results with a correlation coefficient higher than 0.98. However, the best models were obtained using PLS and variable selection with the OPS algorithm and the models based on NIR spectra were considered more predictive for all properties. Then, these models provided a simple, rapid and accurate method for determination of red cabbage extract antioxidant properties and its suitability for use in the food industry.
The solar wind effect on cosmic rays and solar activity
NASA Technical Reports Server (NTRS)
Fujimoto, K.; Kojima, H.; Murakami, K.
1985-01-01
The relation of cosmic ray intensity to solar wind velocity is investigated, using neutron monitor data from Kiel and Deep River. The analysis shows that the regression coefficient of the average intensity for a time interval to the corresponding average velocity is negative and that the absolute effect increases monotonously with the interval of averaging, tau, that is, from -0.5% per 100km/s for tau = 1 day to -1.1% per 100km/s for tau = 27 days. For tau 27 days the coefficient becomes almost constant independently of the value of tau. The analysis also shows that this tau-dependence of the regression coefficiently is varying with the solar activity.
Gao, Yu; Shi, Lu
2015-08-21
To better understand the documented link between mindfulness and longevity, we examine the association between mindfulness and conscious avoidance of secondhand smoke (SHS), as well as the association between mindfulness and physical activity. In Shanghai University of Finance and Economics (SUFE) we surveyed a convenience sample of 1516 college freshmen. We measured mindfulness, weekly physical activity, and conscious avoidance of secondhand smoke, along with demographic and behavioral covariates. We used a multilevel logistic regression to test the association between mindfulness and conscious avoidance of secondhand smoke, and used a Tobit regression model to test the association between mindfulness and metabolic equivalent hours per week. In both models the home province of the student respondent was used as the cluster variable, and demographic and behavioral covariates, such as age, gender, smoking history, household registration status (urban vs. rural), the perceived smog frequency in their home towns, and the asthma diagnosis. The logistic regression of consciously avoiding SHS shows that a higher level of mindfulness was associated with an increase in the odds ratio of conscious SHS avoidance (logged odds: 0.22, standard error: 0.07, p < 0.01). The Tobit regression shows that a higher level of mindfulness was associated with more metabolic equivalent hours per week (Tobit coefficient: 4.09, standard error: 1.13, p < 0.001). This study is an innovative attempt to study the behavioral issue of secondhand smoke from the perspective of the potential victim, rather than the active smoker. The observed associational patterns here are consistent with previous findings that mindfulness is associated with healthier behaviors in obesity prevention and substance use. Research designs with interventions are needed to test the causal link between mindfulness and these healthy behaviors.
Gao, Yu; Shi, Lu
2015-01-01
Introduction: To better understand the documented link between mindfulness and longevity, we examine the association between mindfulness and conscious avoidance of secondhand smoke (SHS), as well as the association between mindfulness and physical activity. Method: In Shanghai University of Finance and Economics (SUFE) we surveyed a convenience sample of 1516 college freshmen. We measured mindfulness, weekly physical activity, and conscious avoidance of secondhand smoke, along with demographic and behavioral covariates. We used a multilevel logistic regression to test the association between mindfulness and conscious avoidance of secondhand smoke, and used a Tobit regression model to test the association between mindfulness and metabolic equivalent hours per week. In both models the home province of the student respondent was used as the cluster variable, and demographic and behavioral covariates, such as age, gender, smoking history, household registration status (urban vs. rural), the perceived smog frequency in their home towns, and the asthma diagnosis. Results: The logistic regression of consciously avoiding SHS shows that a higher level of mindfulness was associated with an increase in the odds ratio of conscious SHS avoidance (logged odds: 0.22, standard error: 0.07, p < 0.01). The Tobit regression shows that a higher level of mindfulness was associated with more metabolic equivalent hours per week (Tobit coefficient: 4.09, standard error: 1.13, p < 0.001). Discussion: This study is an innovative attempt to study the behavioral issue of secondhand smoke from the perspective of the potential victim, rather than the active smoker. The observed associational patterns here are consistent with previous findings that mindfulness is associated with healthier behaviors in obesity prevention and substance use. Research designs with interventions are needed to test the causal link between mindfulness and these healthy behaviors. PMID:26308029
Interpretation of the Coefficients in the Fit y = at + bx + c
ERIC Educational Resources Information Center
Farnsworth, David L.
2006-01-01
The goals of this note are to derive formulas for the coefficients a and b in the least-squares regression plane y = at + bx + c for observations (t[subscript]i,x[subscript]i,y[subscript]i), i = 1, 2, ..., n, and to present meanings for the coefficients a and b. In this note, formulas for the coefficients a and b in the least-squares fit are…
NASA Astrophysics Data System (ADS)
Jing, Ran; Gong, Zhaoning; Zhao, Wenji; Pu, Ruiliang; Deng, Lei
2017-12-01
Above-bottom biomass (ABB) is considered as an important parameter for measuring the growth status of aquatic plants, and is of great significance for assessing health status of wetland ecosystems. In this study, Structure from Motion (SfM) technique was used to rebuild the study area with high overlapped images acquired by an unmanned aerial vehicle (UAV). We generated orthoimages and SfM dense point cloud data, from which vegetation indices (VIs) and SfM point cloud variables including average height (HAVG), standard deviation of height (HSD) and coefficient of variation of height (HCV) were extracted. These VIs and SfM point cloud variables could effectively characterize the growth status of aquatic plants, and thus they could be used to develop a simple linear regression model (SLR) and a stepwise linear regression model (SWL) with field measured ABB samples of aquatic plants. We also utilized a decision tree method to discriminate different types of aquatic plants. The experimental results indicated that (1) the SfM technique could effectively process high overlapped UAV images and thus be suitable for the reconstruction of fine texture feature of aquatic plant canopy structure; and (2) an SWL model based on point cloud variables: HAVG, HSD, HCV and two VIs: NGRDI, ExGR as independent variables has produced the best predictive result of ABB of aquatic plants in the study area, with a coefficient of determination of 0.84 and a relative root mean square error of 7.13%. In this analysis, a novel method for the quantitative inversion of a growth parameter (i.e., ABB) of aquatic plants in wetlands was demonstrated.
Ozone and sulfur dioxide effects on three tall fescue cultivars
DOE Office of Scientific and Technical Information (OSTI.GOV)
Flagler, R.B.; Youngner, V.B.
Although many reports have been published concerning differential susceptibility of various crops and/or cultivars to air pollutants, most have used foliar injury instead of the marketable yield as the factor that determined susceptibility for the crop. In an examination of screening in terms of marketable yield, three cultivars of tall fescue (Festuca arundinacea Schreb.), 'Alta,' 'Fawn,' and 'Kentucky 31,' were exposed to 0-0.40 ppm O/sub 3/ or 0-0.50 ppm SO/sub 2/ 6 h/d, once a week, for 7 and 9 weeks, respectively. Experimental design was a randomized complete block with three replications. Statistical analysis was by standard analysis of variancemore » and regression techniques. Three variables were analyzed: top dry weight (yield), tiller number, and weight per tiller. Ozone had a significant effect on all three variables. Significant linear decreases in yield and weight per tiller occurred with increasing O/sub 3/ concentrations. Linear regressions of these variables on O/sub 3/ concentration produced significantly different regression coefficients. The coefficient for Kentucky 31 was significantly greater than Alta or Fawn, which did not differ from each other. This indicated that Kentucky 31 was more susceptible to O/sub 3/ than either of the other cultivars. Percent reductions in dry weight for the three cultivars at highest O/sub 3/ level were 35, 44, and 53%, respectively, for Fawn, Alta, and Kentucky 31. For weight per tiller, Kentucky 31 had a higher percent reduction than the other cultivars (59 vs. 46 and 44%). Tiller number was generally increased by O/sub 3/, but this variable was not useful for determining differential susceptibility to the pollutant. Sulfur dioxide treatments produced no significant effects on any of the variables analyzed.« less
Metal ion levels and lymphocyte counts: ASR hip resurfacing prosthesis vs. standard THA
2013-01-01
Background and purpose Wear particles from metal–on–metal arthroplasties are under suspicion for adverse effects both locally and systemically, and the DePuy ASR Hip Resurfacing System (RHA) has above–average failure rates. We compared lymphocyte counts in RHA and total hip arthroplasty (THA) and investigated whether cobalt and chromium ions affected the lymphocyte counts. Method In a randomized controlled trial, we followed 19 RHA patients and 19 THA patients. Lymphocyte subsets and chromium and cobalt ion concentrations were measured at baseline, at 8 weeks, at 6 months, and at 1 and 2 years. Results The T–lymphocyte counts for both implant types declined over the 2–year period. This decline was statistically significant for CD3+CD8+ in the THA group, with a regression coefficient of –0.04 × 109cells/year (95% CI: –0.08 to –0.01). Regression analysis indicated a depressive effect of cobalt ions in particular on T–cells with 2–year whole–blood cobalt regression coefficients for CD3+ of –0.10 (95% CI: –0.16 to –0.04) × 109 cells/parts per billion (ppb), for CD3+CD4+ of –0.06 (–0.09 to –0.03) × 109 cells/ppb, and for CD3+CD8+ of –0.02 (–0.03 to –0.00) × 109 cells/ppb. Interpretation Circulating T–lymphocyte levels may decline after surgery, regardless of implant type. Metal ions—particularly cobalt—may have a general depressive effect on T– and B–lymphocyte levels. Registered with ClinicalTrials.gov under # NCT01113762 PMID:23597114
Misyura, Maksym; Sukhai, Mahadeo A; Kulasignam, Vathany; Zhang, Tong; Kamel-Reid, Suzanne; Stockley, Tracy L
2018-01-01
Aims A standard approach in test evaluation is to compare results of the assay in validation to results from previously validated methods. For quantitative molecular diagnostic assays, comparison of test values is often performed using simple linear regression and the coefficient of determination (R2), using R2 as the primary metric of assay agreement. However, the use of R2 alone does not adequately quantify constant or proportional errors required for optimal test evaluation. More extensive statistical approaches, such as Bland-Altman and expanded interpretation of linear regression methods, can be used to more thoroughly compare data from quantitative molecular assays. Methods We present the application of Bland-Altman and linear regression statistical methods to evaluate quantitative outputs from next-generation sequencing assays (NGS). NGS-derived data sets from assay validation experiments were used to demonstrate the utility of the statistical methods. Results Both Bland-Altman and linear regression were able to detect the presence and magnitude of constant and proportional error in quantitative values of NGS data. Deming linear regression was used in the context of assay comparison studies, while simple linear regression was used to analyse serial dilution data. Bland-Altman statistical approach was also adapted to quantify assay accuracy, including constant and proportional errors, and precision where theoretical and empirical values were known. Conclusions The complementary application of the statistical methods described in this manuscript enables more extensive evaluation of performance characteristics of quantitative molecular assays, prior to implementation in the clinical molecular laboratory. PMID:28747393
Lenselink, Eelke B; Ten Dijke, Niels; Bongers, Brandon; Papadatos, George; van Vlijmen, Herman W T; Kowalczyk, Wojtek; IJzerman, Adriaan P; van Westen, Gerard J P
2017-08-14
The increase of publicly available bioactivity data in recent years has fueled and catalyzed research in chemogenomics, data mining, and modeling approaches. As a direct result, over the past few years a multitude of different methods have been reported and evaluated, such as target fishing, nearest neighbor similarity-based methods, and Quantitative Structure Activity Relationship (QSAR)-based protocols. However, such studies are typically conducted on different datasets, using different validation strategies, and different metrics. In this study, different methods were compared using one single standardized dataset obtained from ChEMBL, which is made available to the public, using standardized metrics (BEDROC and Matthews Correlation Coefficient). Specifically, the performance of Naïve Bayes, Random Forests, Support Vector Machines, Logistic Regression, and Deep Neural Networks was assessed using QSAR and proteochemometric (PCM) methods. All methods were validated using both a random split validation and a temporal validation, with the latter being a more realistic benchmark of expected prospective execution. Deep Neural Networks are the top performing classifiers, highlighting the added value of Deep Neural Networks over other more conventional methods. Moreover, the best method ('DNN_PCM') performed significantly better at almost one standard deviation higher than the mean performance. Furthermore, Multi-task and PCM implementations were shown to improve performance over single task Deep Neural Networks. Conversely, target prediction performed almost two standard deviations under the mean performance. Random Forests, Support Vector Machines, and Logistic Regression performed around mean performance. Finally, using an ensemble of DNNs, alongside additional tuning, enhanced the relative performance by another 27% (compared with unoptimized 'DNN_PCM'). Here, a standardized set to test and evaluate different machine learning algorithms in the context of multi-task learning is offered by providing the data and the protocols. Graphical Abstract .
Liang, Han; Cheng, Jing; Shen, Xingrong; Chen, Penglai; Tong, Guixian; Chai, Jing; Li, Kaichun; Xie, Shaoyu; Shi, Yong; Wang, Debin; Sun, Yehuan
2015-02-01
This study aims at examining the effects of stressful life events on risk of impaired fasting glucose among left-behind farmers in rural China. The study collected data about stressful life events, family history of diabetes, lifestyle, demographics and minimum anthropometrics from left-behind famers aged 40-70 years. Calculated life event index was applied to assess the combined effects of stressful life events experienced by the left-behind farmers and its association with impaired fasting glucose was estimated using binary logistic regression models. The prevalence of abnormal fasting glucose was 61.4% by American Diabetes Association (ADA) standard and 32.4% by World Health Organization (WHO) standard. Binary logistic regression analysis revealed a coefficient of 0.033 (P<.001) by ADA standard or 0.028 (P<.001) by WHO standard between impaired fasting glucose and life event index. The overall odds ratios of impaired glucose for the second, third and fourth (highest) versus the first (lowest) quartile of life event index were 1.419 [95% CI=(1.173, 1.717)], 1.711 [95% CI=(1.413, 2.071)] and 1.957 [95% CI=(1.606, 2.385)] respectively by ADA standard. When more and more confounding factors were controlled for, these odds ratios remained statistically significant though decreased to a small extent. The left-behind farmers showed over two-fold prevalence rate of pre-diabetes than that of the nation's average and their risk of impaired fasting glucose was positively associated with stressful life events in a dose-dependent way. Both the population studied and their life events merit special attention. Copyright © 2014 Elsevier Inc. All rights reserved.
ERIC Educational Resources Information Center
Waller, Niels; Jones, Jeff
2011-01-01
We describe methods for assessing all possible criteria (i.e., dependent variables) and subsets of criteria for regression models with a fixed set of predictors, x (where x is an n x 1 vector of independent variables). Our methods build upon the geometry of regression coefficients (hereafter called regression weights) in n-dimensional space. For a…
Prediction of oxygen consumption in cardiac rehabilitation patients performing leg ergometry
NASA Astrophysics Data System (ADS)
Alvarez, John Gershwin
The purpose of this study was two-fold. First, to determine the validity of the ACSM leg ergometry equation in the prediction of steady-state oxygen consumption (VO2) in a heterogeneous population of cardiac patients. Second, to determine whether a more accurate prediction equation could be developed for use in the cardiac population. Thirty-one cardiac rehabilitation patients participated in the study of which 24 were men and 7 were women. Biometric variables (mean +/- sd) of the participants were as follows: age = 61.9 +/- 9.5 years; height = 172.6 +/- 1.6 cm; and body mass = 82.3 +/- 10.6 kg. Subjects exercised on a MonarchTM cycle ergometer at 0, 180, 360, 540 and 720 kgm ˙ min-1. The length of each stage was five minutes. Heart rate, ECG, and VO2 were continuously monitored. Blood pressure and heart rate were collected at the end of each stage. Steady state VO 2 was calculated for each stage using the average of the last two minutes. Correlation coefficients, standard error of estimate, coefficient of determination, total error, and mean bias were used to determine the accuracy of the ACSM equation (1995). The analysis found the ACSM equation to be a valid means of estimating VO2 in cardiac patients. Simple linear regression was used to develop a new equation. Regression analysis found workload to be a significant predictor of VO2. The following equation is the result: VO2 = (1.6 x kgm ˙ min-1) + 444 ml ˙ min-1. The r of the equation was .78 (p < .05) and the standard error of estimate was 211 ml ˙ min-1. Analysis of variance was used to determine significant differences between means for actual and predicted VO2 values for each equation. The analysis found the ACSM and new equation to significantly (p < .05) under predict VO2 during unloaded pedaling. Furthermore, the ACSM equation was found to significantly (p < .05) under predict VO 2 during the first loaded stage of exercise. When the accuracy of the ACSM and new equations were compared based on correlation coefficients, coefficients of determinations, SEEs, total error, and mean bias the new equation was found to have equal or better accuracy at all workloads. The final form of the new equation is: VO2 (ml ˙ min-1) = (kgm ˙ min-1 x 1.6 ml ˙ kgm-1) + (3.5 ml ˙ kg-1 ˙ min-1 x body mass in kg) + 156 ml ˙ min-1.
Random effects coefficient of determination for mixed and meta-analysis models.
Demidenko, Eugene; Sargent, James; Onega, Tracy
2012-01-01
The key feature of a mixed model is the presence of random effects. We have developed a coefficient, called the random effects coefficient of determination, [Formula: see text], that estimates the proportion of the conditional variance of the dependent variable explained by random effects. This coefficient takes values from 0 to 1 and indicates how strong the random effects are. The difference from the earlier suggested fixed effects coefficient of determination is emphasized. If [Formula: see text] is close to 0, there is weak support for random effects in the model because the reduction of the variance of the dependent variable due to random effects is small; consequently, random effects may be ignored and the model simplifies to standard linear regression. The value of [Formula: see text] apart from 0 indicates the evidence of the variance reduction in support of the mixed model. If random effects coefficient of determination is close to 1 the variance of random effects is very large and random effects turn into free fixed effects-the model can be estimated using the dummy variable approach. We derive explicit formulas for [Formula: see text] in three special cases: the random intercept model, the growth curve model, and meta-analysis model. Theoretical results are illustrated with three mixed model examples: (1) travel time to the nearest cancer center for women with breast cancer in the U.S., (2) cumulative time watching alcohol related scenes in movies among young U.S. teens, as a risk factor for early drinking onset, and (3) the classic example of the meta-analysis model for combination of 13 studies on tuberculosis vaccine.
Seyfart, Tom; Friedrich, Nele; Kische, Hanna; Bülow, Robin; Wallaschofski, Henri; Völzke, Henry; Nauck, Matthias; Keevil, Brian G; Haring, Robin
2018-01-01
The aim of this study was to evaluate the association of sex hormones with anthropometry in a large population-based cohort, with liquid chromatography-mass spectrometry (LCMS)-based sex hormone measurements and imaging markers. Cross-sectional data from 957 men and women from the population-based Study of Health in Pomerania (SHIP) were used. Associations of a comprehensive panel of LCMS-measured sex hormones with anthropometric parameters, laboratory, and imaging markers were analyzed in multivariable regression models for the full sample and stratified by sex. Sex hormone measures included total testosterone (TT), free testosterone (fT), estrone and estradiol, androstenedione (ASD), dehydroepiandrosterone sulfate (DHEAS), and sex hormone-binding globulin (SHBG). Domains of anthropometry included physical measures (body-mass-index (BMI), waist circumference, waist-to-height-ratio, waist-to-hip-ratio, and hip circumference), laboratory measures of adipokines (leptin and vaspin), and magnet resonance imaging-based measures (visceral and subcutaneous adipose tissue). In men, inverse associations between all considered anthropometric parameters with TT were found: BMI (β-coefficient, standard error (SE): -0.159, 0.037), waist-circumference (β-coefficient, SE: -0.892, 0.292), subcutaneous adipose tissue (β-coefficient, SE: -0.156, 0.023), and leptin (β-coefficient, SE: -0.046, 0.009). In women TT (β-coefficient, SE: 1.356, 0.615) and estrone (β-coefficient, SE: 0.014, 0.005) were positively associated with BMI. In analyses of variance, BMI and leptin were inversely associated with TT, ASD, and DHEAS in men, but positively associated with estrone. In women, BMI and leptin were positively associated with all sex hormones. The present population-based study confirmed and extended previously reported sex-specific associations between sex hormones and various anthropometric markers of overweight and obesity.
Regression rate study of porous axial-injection, endburning hybrid fuel grains
NASA Astrophysics Data System (ADS)
Hitt, Matthew A.
This experimental and theoretical work examines the effects of gaseous oxidizer flow rates and pressure on the regression rates of porous fuels for hybrid rocket applications. Testing was conducted using polyethylene as the porous fuel and both gaseous oxygen and nitrous oxide as the oxidizer. Nominal test articles were tested using 200, 100, 50, and 15 micron fuel pore sizes. Pressures tested ranged from atmospheric to 1160 kPa for the gaseous oxygen tests and from 207 kPa to 1054 kPa for the nitrous oxide tests, and oxidizer injection velocities ranged from 35 m/s to 80 m/s for the gaseous oxygen tests and from 7.5 m/s to 16.8 m/s for the nitrous oxide tests. Regression rates were determined using pretest and posttest length measurements of the solid fuel. Experimental results demonstrated that the regression rate of the porous axial-injection, end-burning hybrid was a function of the chamber pressure, as opposed to the oxidizer mass flux typical in conventional hybrids. Regression rates ranged from approximately 0.75 mm/s at atmospheric pressure to 8.89 mm/s at 1160 kPa for the gaseous oxygen tests and 0.21 mm/s at 207 kPa to 1.44 mm/s at 1054 kPa for the nitrous oxide tests. The analytical model was developed based on a standard ablative model modified to include oxidizer flow through the grain. The heat transfer from the flame was primarily modeled using an empirically determined flame coefficient that included all heat transfer mechanisms in one term. An exploratory flame model based on the Granular Diffusion Flame model used for solid rocket motors was also adapted for comparison with the empirical flame coefficient. This model was then evaluated quantitatively using the experimental results of the gaseous oxygen tests as well as qualitatively using the experimental results of the nitrous oxide tests. The model showed agreement with the experimental results indicating it has potential for giving insight into the flame structure in this motor configuration. Results from the model suggested that both kinetic and diffusion processes could be relevant to the combustion depending on the chamber pressure.
Harmonic regression based multi-temporal cloud filtering algorithm for Landsat 8
NASA Astrophysics Data System (ADS)
Joshi, P.
2015-12-01
Landsat data archive though rich is seen to have missing dates and periods owing to the weather irregularities and inconsistent coverage. The satellite images are further subject to cloud cover effects resulting in erroneous analysis and observations of ground features. In earlier studies the change detection algorithm using statistical control charts on harmonic residuals of multi-temporal Landsat 5 data have been shown to detect few prominent remnant clouds [Brooks, Evan B., et al, 2014]. So, in this work we build on this harmonic regression approach to detect and filter clouds using a multi-temporal series of Landsat 8 images. Firstly, we compute the harmonic coefficients using the fitting models on annual training data. This time series of residuals is further subjected to Shewhart X-bar control charts which signal the deviations of cloud points from the fitted multi-temporal fourier curve. For the process with standard deviation σ we found the second and third order harmonic regression with a x-bar chart control limit [Lσ] ranging between [0.5σ < Lσ < σ] as most efficient in detecting clouds. By implementing second order harmonic regression with successive x-bar chart control limits of L and 0.5 L on the NDVI, NDSI and haze optimized transformation (HOT), and utilizing the seasonal physical properties of these parameters, we have designed a novel multi-temporal algorithm for filtering clouds from Landsat 8 images. The method is applied to Virginia and Alabama in Landsat8 UTM zones 17 and 16 respectively. Our algorithm efficiently filters all types of cloud cover with an overall accuracy greater than 90%. As a result of the multi-temporal operation and the ability to recreate the multi-temporal database of images using only the coefficients of the fourier regression, our algorithm is largely storage and time efficient. The results show a good potential for this multi-temporal approach for cloud detection as a timely and targeted solution for the Landsat 8 research community, catering to the need for innovative processing solutions in the infant stage of the satellite.
Interpreting Bivariate Regression Coefficients: Going beyond the Average
ERIC Educational Resources Information Center
Halcoussis, Dennis; Phillips, G. Michael
2010-01-01
Statistics, econometrics, investment analysis, and data analysis classes often review the calculation of several types of averages, including the arithmetic mean, geometric mean, harmonic mean, and various weighted averages. This note shows how each of these can be computed using a basic regression framework. By recognizing when a regression model…
Beyond Multiple Regression: Using Commonality Analysis to Better Understand R[superscript 2] Results
ERIC Educational Resources Information Center
Warne, Russell T.
2011-01-01
Multiple regression is one of the most common statistical methods used in quantitative educational research. Despite the versatility and easy interpretability of multiple regression, it has some shortcomings in the detection of suppressor variables and for somewhat arbitrarily assigning values to the structure coefficients of correlated…
Precision Efficacy Analysis for Regression.
ERIC Educational Resources Information Center
Brooks, Gordon P.
When multiple linear regression is used to develop a prediction model, sample size must be large enough to ensure stable coefficients. If the derivation sample size is inadequate, the model may not predict well for future subjects. The precision efficacy analysis for regression (PEAR) method uses a cross- validity approach to select sample sizes…
40 CFR 53.34 - Test procedure for methods for PM10 and Class I methods for PM2.5.
Code of Federal Regulations, 2011 CFR
2011-07-01
... linear regression parameters (slope, intercept, and correlation coefficient) describing the relationship... correlation coefficient. (2) To pass the test for comparability, the slope, intercept, and correlation...
ERIC Educational Resources Information Center
Berry, Kenneth J.; And Others
1977-01-01
A FORTRAN program, GAMMA, computes Goodman and Kruskal's coefficient of ordinal association, gamma, and Somer's coefficient. The program also provides associated standard errors, standard scores, and probability values. (Author/JKS)
Su, Liyun; Zhao, Yanyong; Yan, Tianshun; Li, Fenglan
2012-01-01
Multivariate local polynomial fitting is applied to the multivariate linear heteroscedastic regression model. Firstly, the local polynomial fitting is applied to estimate heteroscedastic function, then the coefficients of regression model are obtained by using generalized least squares method. One noteworthy feature of our approach is that we avoid the testing for heteroscedasticity by improving the traditional two-stage method. Due to non-parametric technique of local polynomial estimation, it is unnecessary to know the form of heteroscedastic function. Therefore, we can improve the estimation precision, when the heteroscedastic function is unknown. Furthermore, we verify that the regression coefficients is asymptotic normal based on numerical simulations and normal Q-Q plots of residuals. Finally, the simulation results and the local polynomial estimation of real data indicate that our approach is surely effective in finite-sample situations.
NASA Astrophysics Data System (ADS)
Mitra, Ashis; Majumdar, Prabal Kumar; Bannerjee, Debamalya
2013-03-01
This paper presents a comparative analysis of two modeling methodologies for the prediction of air permeability of plain woven handloom cotton fabrics. Four basic fabric constructional parameters namely ends per inch, picks per inch, warp count and weft count have been used as inputs for artificial neural network (ANN) and regression models. Out of the four regression models tried, interaction model showed very good prediction performance with a meager mean absolute error of 2.017 %. However, ANN models demonstrated superiority over the regression models both in terms of correlation coefficient and mean absolute error. The ANN model with 10 nodes in the single hidden layer showed very good correlation coefficient of 0.982 and 0.929 and mean absolute error of only 0.923 and 2.043 % for training and testing data respectively.
Improvement of Storm Forecasts Using Gridded Bayesian Linear Regression for Northeast United States
NASA Astrophysics Data System (ADS)
Yang, J.; Astitha, M.; Schwartz, C. S.
2017-12-01
Bayesian linear regression (BLR) is a post-processing technique in which regression coefficients are derived and used to correct raw forecasts based on pairs of observation-model values. This study presents the development and application of a gridded Bayesian linear regression (GBLR) as a new post-processing technique to improve numerical weather prediction (NWP) of rain and wind storm forecasts over northeast United States. Ten controlled variables produced from ten ensemble members of the National Center for Atmospheric Research (NCAR) real-time prediction system are used for a GBLR model. In the GBLR framework, leave-one-storm-out cross-validation is utilized to study the performances of the post-processing technique in a database composed of 92 storms. To estimate the regression coefficients of the GBLR, optimization procedures that minimize the systematic and random error of predicted atmospheric variables (wind speed, precipitation, etc.) are implemented for the modeled-observed pairs of training storms. The regression coefficients calculated for meteorological stations of the National Weather Service are interpolated back to the model domain. An analysis of forecast improvements based on error reductions during the storms will demonstrate the value of GBLR approach. This presentation will also illustrate how the variances are optimized for the training partition in GBLR and discuss the verification strategy for grid points where no observations are available. The new post-processing technique is successful in improving wind speed and precipitation storm forecasts using past event-based data and has the potential to be implemented in real-time.
Correlation and prediction of dynamic human isolated joint strength from lean body mass
NASA Technical Reports Server (NTRS)
Pandya, Abhilash K.; Hasson, Scott M.; Aldridge, Ann M.; Maida, James C.; Woolford, Barbara J.
1992-01-01
A relationship between a person's lean body mass and the amount of maximum torque that can be produced with each isolated joint of the upper extremity was investigated. The maximum dynamic isolated joint torque (upper extremity) on 14 subjects was collected using a dynamometer multi-joint testing unit. These data were reduced to a table of coefficients of second degree polynomials, computed using a least squares regression method. All the coefficients were then organized into look-up tables, a compact and convenient storage/retrieval mechanism for the data set. Data from each joint, direction and velocity, were normalized with respect to that joint's average and merged into files (one for each curve for a particular joint). Regression was performed on each one of these files to derive a table of normalized population curve coefficients for each joint axis, direction, and velocity. In addition, a regression table which included all upper extremity joints was built which related average torque to lean body mass for an individual. These two tables are the basis of the regression model which allows the prediction of dynamic isolated joint torques from an individual's lean body mass.
Li, Zhenghua; Cheng, Fansheng; Xia, Zhining
2011-01-01
The chemical structures of 114 polycyclic aromatic sulfur heterocycles (PASHs) have been studied by molecular electronegativity-distance vector (MEDV). The linear relationships between gas chromatographic retention index and the MEDV have been established by a multiple linear regression (MLR) model. The results of variable selection by stepwise multiple regression (SMR) and the powerful predictive abilities of the optimization model appraised by leave-one-out cross-validation showed that the optimization model with the correlation coefficient (R) of 0.994 7 and the cross-validated correlation coefficient (Rcv) of 0.994 0 possessed the best statistical quality. Furthermore, when the 114 PASHs compounds were divided into calibration and test sets in the ratio of 2:1, the statistical analysis showed our models possesses almost equal statistical quality, the very similar regression coefficients and the good robustness. The quantitative structure-retention relationship (QSRR) model established may provide a convenient and powerful method for predicting the gas chromatographic retention of PASHs.
Inter-annual and spatial variability of Hamon potential evapotranspiration model coefficients
McCabe, Gregory J.; Hay, Lauren E.; Bock, Andy; Markstrom, Steven L.; Atkinson, R. Dwight
2015-01-01
Monthly calibrated values of the Hamon PET coefficient (C) are determined for 109,951 hydrologic response units (HRUs) across the conterminous United States (U.S.). The calibrated coefficient values are determined by matching calculated mean monthly Hamon PET to mean monthly free-water surface evaporation. For most locations and months the calibrated coefficients are larger than the standard value reported by Hamon. The largest changes in the coefficients were for the late winter/early spring and fall months, whereas the smallest changes were for the summer months. Comparisons of PET computed using the standard value of C and computed using calibrated values of C indicate that for most of the conterminous U.S. PET is underestimated using the standard Hamon PET coefficient, except for the southeastern U.S.
Yong, Yan Ling; Tan, Li Kuo; McLaughlin, Robert A; Chee, Kok Han; Liew, Yih Miin
2017-12-01
Intravascular optical coherence tomography (OCT) is an optical imaging modality commonly used in the assessment of coronary artery diseases during percutaneous coronary intervention. Manual segmentation to assess luminal stenosis from OCT pullback scans is challenging and time consuming. We propose a linear-regression convolutional neural network to automatically perform vessel lumen segmentation, parameterized in terms of radial distances from the catheter centroid in polar space. Benchmarked against gold-standard manual segmentation, our proposed algorithm achieves average locational accuracy of the vessel wall of 22 microns, and 0.985 and 0.970 in Dice coefficient and Jaccard similarity index, respectively. The average absolute error of luminal area estimation is 1.38%. The processing rate is 40.6 ms per image, suggesting the potential to be incorporated into a clinical workflow and to provide quantitative assessment of vessel lumen in an intraoperative time frame. (2017) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE).
Performance evaluation of spectral vegetation indices using a statistical sensitivity function
Ji, Lei; Peters, Albert J.
2007-01-01
A great number of spectral vegetation indices (VIs) have been developed to estimate biophysical parameters of vegetation. Traditional techniques for evaluating the performance of VIs are regression-based statistics, such as the coefficient of determination and root mean square error. These statistics, however, are not capable of quantifying the detailed relationship between VIs and biophysical parameters because the sensitivity of a VI is usually a function of the biophysical parameter instead of a constant. To better quantify this relationship, we developed a “sensitivity function” for measuring the sensitivity of a VI to biophysical parameters. The sensitivity function is defined as the first derivative of the regression function, divided by the standard error of the dependent variable prediction. The function elucidates the change in sensitivity over the range of the biophysical parameter. The Student's t- or z-statistic can be used to test the significance of VI sensitivity. Additionally, we developed a “relative sensitivity function” that compares the sensitivities of two VIs when the biophysical parameters are unavailable.
Yadav, Dharmendra Kumar; Kalani, Komal; Khan, Feroz; Srivastava, Santosh Kumar
2013-12-01
For the prediction of anticancer activity of glycyrrhetinic acid (GA-1) analogs against the human lung cancer cell line (A-549), a QSAR model was developed by forward stepwise multiple linear regression methodology. The regression coefficient (r(2)) and prediction accuracy (rCV(2)) of the QSAR model were taken 0.94 and 0.82, respectively in terms of correlation. The QSAR study indicates that the dipole moments, size of smallest ring, amine counts, hydroxyl and nitro functional groups are correlated well with cytotoxic activity. The docking studies showed high binding affinity of the predicted active compounds against the lung cancer target EGFR. These active glycyrrhetinic acid derivatives were then semi-synthesized, characterized and in-vitro tested for anticancer activity. The experimental results were in agreement with the predicted values and the ethyl oxalyl derivative of GA-1 (GA-3) showed equal cytotoxic activity to that of standard anticancer drug paclitaxel.
Multivariate meta-analysis for non-linear and other multi-parameter associations
Gasparrini, A; Armstrong, B; Kenward, M G
2012-01-01
In this paper, we formalize the application of multivariate meta-analysis and meta-regression to synthesize estimates of multi-parameter associations obtained from different studies. This modelling approach extends the standard two-stage analysis used to combine results across different sub-groups or populations. The most straightforward application is for the meta-analysis of non-linear relationships, described for example by regression coefficients of splines or other functions, but the methodology easily generalizes to any setting where complex associations are described by multiple correlated parameters. The modelling framework of multivariate meta-analysis is implemented in the package mvmeta within the statistical environment R. As an illustrative example, we propose a two-stage analysis for investigating the non-linear exposure–response relationship between temperature and non-accidental mortality using time-series data from multiple cities. Multivariate meta-analysis represents a useful analytical tool for studying complex associations through a two-stage procedure. Copyright © 2012 John Wiley & Sons, Ltd. PMID:22807043
Thermal sensation and comfort during exposure to local airflow to face or legs.
Yamashita, Kazuaki; Matsuo, Juntaro; Tochihara, Yutaka; Kondo, Youichiro; Takayama, Shizuka; Nagayama, Hiroki
2005-01-01
The present study examined the contribution of local airflow temperature to thermal sensation and comfort in humans. Eight healthy male students were exposed to local airflow to their faces (summer condition) or legs (winter condition) for 30 minutes. Local airflow temperature (Tf) was maintained at 18 degrees C to 36 degrees C, and ambient temperature (Ta) was maintained at 17.4 degrees C to 31.4 degrees C. Each subject was exposed to 16 conditions chosen from the combination of Tf and Ta. Based on the results of multiple regression analysis, the standardized partial regression coefficient of Tf and Ta were determined to be 0.93 and 0.13 in the summer condition, and 0.71 and 0.36 in the winter condition at the end of the exposure. Also, thermal comfort was observed to depend closely on the interrelation between Tf and Ta. The present data suggested that local airflow temperature is an important thermal factor regarding thermal sensation and comfort.
NASA Technical Reports Server (NTRS)
Colwell, R. N. (Principal Investigator)
1984-01-01
The geometric quality of TM film and digital products is evaluated by making selective photomeasurements and by measuring the coordinates of known features on both the TM products and map products. These paired observations are related using a standard linear least squares regression approach. Using regression equations and coefficients developed from 225 (TM film product) and 20 (TM digital product) control points, map coordinates of test points are predicted. The residual error vectors and analysis of variance (ANOVA) were performed on the east and north residual using nine image segments (blocks) as treatments. Based on the root mean square error of the 223 (TM film product) and 22 (TM digital product) test points, users of TM data expect the planimetric accuracy of mapped points to be within 91 meters and within 117 meters for the film products, and to be within 12 meters and within 14 meters for the digital products.
Infrared microspectroscopic determination of collagen cross-links in articular cartilage
NASA Astrophysics Data System (ADS)
Rieppo, Lassi; Kokkonen, Harri T.; Kulmala, Katariina A. M.; Kovanen, Vuokko; Lammi, Mikko J.; Töyräs, Juha; Saarakkala, Simo
2017-03-01
Collagen forms an organized network in articular cartilage to give tensile stiffness to the tissue. Due to its long half-life, collagen is susceptible to cross-links caused by advanced glycation end-products. The current standard method for determination of cross-link concentrations in tissues is the destructive high-performance liquid chromatography (HPLC). The aim of this study was to analyze the cross-link concentrations nondestructively from standard unstained histological articular cartilage sections by using Fourier transform infrared (FTIR) microspectroscopy. Half of the bovine articular cartilage samples (n=27) were treated with threose to increase the collagen cross-linking while the other half (n=27) served as a control group. Partial least squares (PLS) regression with variable selection algorithms was used to predict the cross-link concentrations from the measured average FTIR spectra of the samples, and HPLC was used as the reference method for cross-link concentrations. The correlation coefficients between the PLS regression models and the biochemical reference values were r=0.84 (p<0.001), r=0.87 (p<0.001) and r=0.92 (p<0.001) for hydroxylysyl pyridinoline (HP), lysyl pyridinoline (LP), and pentosidine (Pent) cross-links, respectively. The study demonstrated that FTIR microspectroscopy is a feasible method for investigating cross-link concentrations in articular cartilage.
NASA Technical Reports Server (NTRS)
Batterson, J. G.
1986-01-01
The successful parametric modeling of the aerodynamics for an airplane operating at high angles of attack or sideslip is performed in two phases. First the aerodynamic model structure must be determined and second the associated aerodynamic parameters (stability and control derivatives) must be estimated for that model. The purpose of this paper is to document two versions of a stepwise regression computer program which were developed for the determination of airplane aerodynamic model structure and to provide two examples of their use on computer generated data. References are provided for the application of the programs to real flight data. The two computer programs that are the subject of this report, STEP and STEPSPL, are written in FORTRAN IV (ANSI l966) compatible with a CDC FTN4 compiler. Both programs are adaptations of a standard forward stepwise regression algorithm. The purpose of the adaptation is to facilitate the selection of a adequate mathematical model of the aerodynamic force and moment coefficients of an airplane from flight test data. The major difference between STEP and STEPSPL is in the basis for the model. The basis for the model in STEP is the standard polynomial Taylor's series expansion of the aerodynamic function about some steady-state trim condition. Program STEPSPL utilizes a set of spline basis functions.
Prediction of adult height in girls: the Beunen-Malina-Freitas method.
Beunen, Gaston P; Malina, Robert M; Freitas, Duarte L; Thomis, Martine A; Maia, José A; Claessens, Albrecht L; Gouveia, Elvio R; Maes, Hermine H; Lefevre, Johan
2011-12-01
The purpose of this study was to validate and cross-validate the Beunen-Malina-Freitas method for non-invasive prediction of adult height in girls. A sample of 420 girls aged 10-15 years from the Madeira Growth Study were measured at yearly intervals and then 8 years later. Anthropometric dimensions (lengths, breadths, circumferences, and skinfolds) were measured; skeletal age was assessed using the Tanner-Whitehouse 3 method and menarcheal status (present or absent) was recorded. Adult height was measured and predicted using stepwise, forward, and maximum R (2) regression techniques. Multiple correlations, mean differences, standard errors of prediction, and error boundaries were calculated. A sample of the Leuven Longitudinal Twin Study was used to cross-validate the regressions. Age-specific coefficients of determination (R (2)) between predicted and measured adult height varied between 0.57 and 0.96, while standard errors of prediction varied between 1.1 and 3.9 cm. The cross-validation confirmed the validity of the Beunen-Malina-Freitas method in girls aged 12-15 years, but at lower ages the cross-validation was less consistent. We conclude that the Beunen-Malina-Freitas method is valid for the prediction of adult height in girls aged 12-15 years. It is applicable to European populations or populations of European ancestry.
Ommen, Oliver; Thuem, Sonja; Pfaff, Holger; Janssen, Christian
2011-06-01
Empirical studies have confirmed that a trusting physician-patient interaction promotes patient satisfaction, adherence to treatment and improved health outcomes. The objective of this analysis was to investigate the relationship between social support, shared decision-making and inpatient's trust in physicians in a hospital setting. A written questionnaire was completed by 2,197 patients who were treated in the year 2000 in six hospitals in Germany. Logistic regression was performed with a dichotomized index for patient's trust in physicians. The logistic regression model identified significant relationships (p < 0.05) in terms of emotional support (standardized effect coefficient [sc], 3.65), informational support (sc, 1.70), shared decision-making (sc, 1.40), age (sc, 1.14), socioeconomic status (sc, 1.15) and gender (sc, 1.15). We found no significant relationship between 'tendency to excuse' and trust. The last regression model accounted for 49.1% of Nagelkerke's R-square. Insufficient physician communication skills can lead to extensive negative effects on the trust of patients in their physicians. Thus, it becomes clear that medical support requires not only biomedical, but also psychosocial skills.
Hayashi, K; Yamada, T; Sawa, T
2015-03-01
The return or Poincaré plot is a non-linear analytical approach in a two-dimensional plane, where a timed signal is plotted against itself after a time delay. Its scatter pattern reflects the randomness and variability in the signals. Quantification of a Poincaré plot of the electroencephalogram has potential to determine anaesthesia depth. We quantified the degree of dispersion (i.e. standard deviation, SD) along the diagonal line of the electroencephalogram-Poincaré plot (named as SD1/SD2), and compared SD1/SD2 values with spectral edge frequency 95 (SEF95) and bispectral index values. The regression analysis showed a tight linear regression equation with a coefficient of determination (R(2) ) value of 0.904 (p < 0.0001) between the Poincaré index (SD1/SD2) and SEF95, and a moderate linear regression equation between SD1/SD2 and bispectral index (R(2) = 0.346, p < 0.0001). Quantification of the Poincaré plot tightly correlates with SEF95, reflecting anaesthesia-dependent changes in electroencephalogram oscillation. © 2014 The Association of Anaesthetists of Great Britain and Ireland.
Ogata, Soshiro; Tanaka, Haruka; Omura, Kayoko; Honda, Chika; Hayakawa, Kazuo
2016-04-01
Previous studies have indicated associations between intake of dairy products and better cognitive function and reduced risk of dementia. However, these studies did not adjust for genetic and family environmental factors that may influence food intake, cognitive function, and metabolism of dairy product nutrients. In the present study, we investigated the association between intake of dairy products and short-term memory with and without adjustment for almost all genetic and family environmental factors using a genetically informative sample of twin pairs. A cross-sectional study was conducted among twin pairs aged between 20 and 74. Short-term memory was assessed as primary outcome variable, intake of dairy products was analyzed as the predictive variable, and sex, age, education level, marital status, current smoking status, body mass index, dietary alcohol intake, and medical history of hypertension or diabetes were included as possible covariates. Generalized estimating equations (GEE) were performed by treating twins as individuals and regression analyses were used to identify within-pair differences of a twin pair to adjust for genetic and family environmental factors. Data are reported as standardized coefficients and 95% confidence intervals (CI). Analyses were performed on data from 78 men and 278 women. Among men, high intake of dairy products was significantly associated with better short-term memory after adjustment for the possible covariates (standardized coefficients = 0.22; 95% CI, 0.06-0.38) and almost all genetic and family environmental factors (standardized coefficients = 0.38; 95% CI, 0.07-0.69). Among women, no significant associations were found between intake of dairy products and short-term memory. Subsequent sensitivity analyses were adjusted for small samples and showed similar results. Intake of dairy product may prevent cognitive declines regardless of genetic and family environmental factors in men. Copyright © 2015 Elsevier Ltd and European Society for Clinical Nutrition and Metabolism. All rights reserved.
Health status convergence at the local level: empirical evidence from Austria
2011-01-01
Introduction Health is an important dimension of welfare comparisons across individuals, regions and states. Particularly from a long-term perspective, within-country convergence of the health status has rarely been investigated by applying methods well established in other scientific fields. In the following paper we study the relation between initial levels of the health status and its improvement at the local community level in Austria in the time period 1969-2004. Methods We use age standardized mortality rates from 2381 Austrian communities as an indicator for the health status and analyze the convergence/divergence of overall mortality for (i) the whole population, (ii) females, (iii) males and (iv) the gender mortality gap. Convergence/Divergence is studied by applying different concepts of cross-regional inequality (weighted standard deviation, coefficient of variation, Theil-Coefficient of inequality). Various econometric techniques (weighted OLS, Quantile Regression, Kendall's Rank Concordance) are used to test for absolute and conditional beta-convergence in mortality. Results Regarding sigma-convergence, we find rather mixed results. While the weighted standard deviation indicates an increase in equality for all four variables, the picture appears less clear when correcting for the decreasing mean in the distribution. However, we find highly significant coefficients for absolute and conditional beta-convergence between the periods. While these results are confirmed by several robustness tests, we also find evidence for the existence of convergence clubs. Conclusions The highly significant beta-convergence across communities might be caused by (i) the efforts to harmonize and centralize the health policy at the federal level in Austria since the 1970s, (ii) the diminishing returns of the input factors in the health production function, which might lead to convergence, as the general conditions (e.g. income, education etc.) improve over time, and (iii) the mobility of people across regions, as people tend to move to regions/communities which exhibit more favorable living conditions. JEL classification: I10, I12, I18 PMID:21864364
Determining Sample Size for Accurate Estimation of the Squared Multiple Correlation Coefficient.
ERIC Educational Resources Information Center
Algina, James; Olejnik, Stephen
2000-01-01
Discusses determining sample size for estimation of the squared multiple correlation coefficient and presents regression equations that permit determination of the sample size for estimating this parameter for up to 20 predictor variables. (SLD)
Grajeda, Laura M; Ivanescu, Andrada; Saito, Mayuko; Crainiceanu, Ciprian; Jaganath, Devan; Gilman, Robert H; Crabtree, Jean E; Kelleher, Dermott; Cabrera, Lilia; Cama, Vitaliano; Checkley, William
2016-01-01
Childhood growth is a cornerstone of pediatric research. Statistical models need to consider individual trajectories to adequately describe growth outcomes. Specifically, well-defined longitudinal models are essential to characterize both population and subject-specific growth. Linear mixed-effect models with cubic regression splines can account for the nonlinearity of growth curves and provide reasonable estimators of population and subject-specific growth, velocity and acceleration. We provide a stepwise approach that builds from simple to complex models, and account for the intrinsic complexity of the data. We start with standard cubic splines regression models and build up to a model that includes subject-specific random intercepts and slopes and residual autocorrelation. We then compared cubic regression splines vis-à-vis linear piecewise splines, and with varying number of knots and positions. Statistical code is provided to ensure reproducibility and improve dissemination of methods. Models are applied to longitudinal height measurements in a cohort of 215 Peruvian children followed from birth until their fourth year of life. Unexplained variability, as measured by the variance of the regression model, was reduced from 7.34 when using ordinary least squares to 0.81 (p < 0.001) when using a linear mixed-effect models with random slopes and a first order continuous autoregressive error term. There was substantial heterogeneity in both the intercept (p < 0.001) and slopes (p < 0.001) of the individual growth trajectories. We also identified important serial correlation within the structure of the data (ρ = 0.66; 95 % CI 0.64 to 0.68; p < 0.001), which we modeled with a first order continuous autoregressive error term as evidenced by the variogram of the residuals and by a lack of association among residuals. The final model provides a parametric linear regression equation for both estimation and prediction of population- and individual-level growth in height. We show that cubic regression splines are superior to linear regression splines for the case of a small number of knots in both estimation and prediction with the full linear mixed effect model (AIC 19,352 vs. 19,598, respectively). While the regression parameters are more complex to interpret in the former, we argue that inference for any problem depends more on the estimated curve or differences in curves rather than the coefficients. Moreover, use of cubic regression splines provides biological meaningful growth velocity and acceleration curves despite increased complexity in coefficient interpretation. Through this stepwise approach, we provide a set of tools to model longitudinal childhood data for non-statisticians using linear mixed-effect models.
Fach, S; Sitzenfrei, R; Rauch, W
2009-01-01
It is state of the art to evaluate and optimise sewer systems with urban drainage models. Since spill flow data is essential in the calibration process of conceptual models it is important to enhance the quality of such data. A wide spread approach is to calculate the spill flow volume by using standard weir equations together with measured water levels. However, these equations are only applicable to combined sewer overflow (CSO) structures, whose weir constructions correspond with the standard weir layout. The objective of this work is to outline an alternative approach to obtain spill flow discharge data based on measurements with a sonic depth finder. The idea is to determine the relation between water level and rate of spill flow by running a detailed 3D computational fluid dynamics (CFD) model. Two real world CSO structures have been chosen due to their complex structure, especially with respect to the weir construction. In a first step the simulation results were analysed to identify flow conditions for discrete steady states. It will be shown that the flow conditions in the CSO structure change after the spill flow pipe acts as a controlled outflow and therefore the spill flow discharge cannot be described with a standard weir equation. In a second step the CFD results will be used to derive rating curves which can be easily applied in everyday practice. Therefore the rating curves are developed on basis of the standard weir equation and the equation for orifice-type outlets. Because the intersection of both equations is not known, the coefficients of discharge are regressed from CFD simulation results. Furthermore, the regression of the CFD simulation results are compared with the one of the standard weir equation by using historic water levels and hydrographs generated with a hydrodynamic model. The uncertainties resulting of the wide spread use of the standard weir equation are demonstrated.
NASA Astrophysics Data System (ADS)
Ben Shabat, Yael; Shitzer, Avraham
2012-07-01
Facial heat exchange convection coefficients were estimated from experimental data in cold and windy ambient conditions applicable to wind chill calculations. Measured facial temperature datasets, that were made available to this study, originated from 3 separate studies involving 18 male and 6 female subjects. Most of these data were for a -10°C ambient environment and wind speeds in the range of 0.2 to 6 m s-1. Additional single experiments were for -5°C, 0°C and 10°C environments and wind speeds in the same range. Convection coefficients were estimated for all these conditions by means of a numerical facial heat exchange model, applying properties of biological tissues and a typical facial diameter of 0.18 m. Estimation was performed by adjusting the guessed convection coefficients in the computed facial temperatures, while comparing them to measured data, to obtain a satisfactory fit ( r 2 > 0.98, in most cases). In one of the studies, heat flux meters were additionally used. Convection coefficients derived from these meters closely approached the estimated values for only the male subjects. They differed significantly, by about 50%, when compared to the estimated female subjects' data. Regression analysis was performed for just the -10°C ambient temperature, and the range of experimental wind speeds, due to the limited availability of data for other ambient temperatures. The regressed equation was assumed in the form of the equation underlying the "new" wind chill chart. Regressed convection coefficients, which closely duplicated the measured data, were consistently higher than those calculated by this equation, except for one single case. The estimated and currently used convection coefficients are shown to diverge exponentially from each other, as wind speed increases. This finding casts considerable doubts on the validity of the convection coefficients that are used in the computation of the "new" wind chill chart and their applicability to humans in cold and windy environments.
Ben Shabat, Yael; Shitzer, Avraham
2012-07-01
Facial heat exchange convection coefficients were estimated from experimental data in cold and windy ambient conditions applicable to wind chill calculations. Measured facial temperature datasets, that were made available to this study, originated from 3 separate studies involving 18 male and 6 female subjects. Most of these data were for a -10°C ambient environment and wind speeds in the range of 0.2 to 6 m s(-1). Additional single experiments were for -5°C, 0°C and 10°C environments and wind speeds in the same range. Convection coefficients were estimated for all these conditions by means of a numerical facial heat exchange model, applying properties of biological tissues and a typical facial diameter of 0.18 m. Estimation was performed by adjusting the guessed convection coefficients in the computed facial temperatures, while comparing them to measured data, to obtain a satisfactory fit (r(2) > 0.98, in most cases). In one of the studies, heat flux meters were additionally used. Convection coefficients derived from these meters closely approached the estimated values for only the male subjects. They differed significantly, by about 50%, when compared to the estimated female subjects' data. Regression analysis was performed for just the -10°C ambient temperature, and the range of experimental wind speeds, due to the limited availability of data for other ambient temperatures. The regressed equation was assumed in the form of the equation underlying the "new" wind chill chart. Regressed convection coefficients, which closely duplicated the measured data, were consistently higher than those calculated by this equation, except for one single case. The estimated and currently used convection coefficients are shown to diverge exponentially from each other, as wind speed increases. This finding casts considerable doubts on the validity of the convection coefficients that are used in the computation of the "new" wind chill chart and their applicability to humans in cold and windy environments.
Buhl, Sussi F; Andersen, Aino L; Andersen, Jens R; Andersen, Ove; Jensen, Jens-Erik B; Rasmussen, Anne Mette L; Pedersen, Mette M; Damkjær, Lars; Gilkes, Hanne; Petersen, Janne
2016-02-01
Stress metabolism is associated with accelerated loss of muscle that has large consequences for the old medical patient. The aim of this study was to investigate if an intervention combining protein and resistance training was more effective in counteracting loss of muscle than standard care. Secondary outcomes were changes in muscle strength, functional ability and body weight. 29 acutely admitted old (>65 years) patients were randomly assigned to the intervention (n = 14) or to standard care (n = 15). The Intervention Group received 1.7 g protein/kg/day during admission and a daily protein supplement (18.8 g protein) and resistance training 3 times per week the 12 weeks following discharge. Muscle mass was assessed by Dual-energy X-ray Absorptiometry. Muscle strength was assessed by Hand Grip Strength and Chair Stand Test. Functional ability was assessed by the de Morton Mobility Index, the Functional Recovery Score and the New Mobility Score. Changes in outcomes from time of admission to three-months after discharge were analysed by linear regression analysis. The intention-to-treat analysis showed no significant effect of the intervention on lean mass (unadjusted: β-coefficient = -1.28 P = 0.32, adjusted for gender: β-coefficient = -0.02 P = 0.99, adjusted for baseline lean mass: β-coefficient = -0.31 P = 0.80). The de Morton Mobility Index significantly increased in the Control Group (β-coefficient = -11.43 CI: 0.72-22.13, P = 0.04). No other differences were found. No significant effect on muscle mass was observed in this group of acutely ill old medical patients. High compliance was achieved with the dietary intervention, but resistance training was challenging. Clinical trials identifier NCT02077491. Copyright © 2015 Elsevier Ltd and European Society for Clinical Nutrition and Metabolism. All rights reserved.
Campbell, J Elliott; Moen, Jeremie C; Ney, Richard A; Schnoor, Jerald L
2008-03-01
Estimates of forest soil organic carbon (SOC) have applications in carbon science, soil quality studies, carbon sequestration technologies, and carbon trading. Forest SOC has been modeled using a regression coefficient methodology that applies mean SOC densities (mass/area) to broad forest regions. A higher resolution model is based on an approach that employs a geographic information system (GIS) with soil databases and satellite-derived landcover images. Despite this advancement, the regression approach remains the basis of current state and federal level greenhouse gas inventories. Both approaches are analyzed in detail for Wisconsin forest soils from 1983 to 2001, applying rigorous error-fixing algorithms to soil databases. Resulting SOC stock estimates are 20% larger when determined using the GIS method rather than the regression approach. Average annual rates of increase in SOC stocks are 3.6 and 1.0 million metric tons of carbon per year for the GIS and regression approaches respectively.
Radon-222 concentrations in ground water and soil gas on Indian reservations in Wisconsin
DeWild, John F.; Krohelski, James T.
1995-01-01
For sites with wells finished in the sand and gravel aquifer, the coefficient of determination (R2) of the regression of concentration of radon-222 in ground water as a function of well depth is 0.003 and the significance level is 0.32, which indicates that there is not a statistically significant relation between radon-222 concentrations in ground water and well depth. The coefficient of determination of the regression of radon-222 in ground water and soil gas is 0.19 and the root mean square error of the regression line is 271 picocuries per liter. Even though the significance level (0.036) indicates a statistical relation, the root mean square error of the regression is so large that the regression equation would not give reliable predictions. Because of an inadequate number of samples, similar statistical analyses could not be performed for sites with wells finished in the crystalline and sedimentary bedrock aquifers.
Incremental Net Effects in Multiple Regression
ERIC Educational Resources Information Center
Lipovetsky, Stan; Conklin, Michael
2005-01-01
A regular problem in regression analysis is estimating the comparative importance of the predictors in the model. This work considers the 'net effects', or shares of the predictors in the coefficient of the multiple determination, which is a widely used characteristic of the quality of a regression model. Estimation of the net effects can be a…
Seismic zoning (first approximation) using data of the main geomagnetic field
NASA Astrophysics Data System (ADS)
Khachikyan, Galina; Zhumabayev, Beibit; Toyshiev, Nursultan; Kairatkyzy, Dina; Seraliyev, Alibek; Khassanov, Eldar
2017-04-01
Seismic zoning is among the most complicated and extremely important problems of modern seismology. In solving this problem, a very important parameter is maximal possible earthquake magnitude (Mmax) which is believed at present depends on horizontal size of geoblocks. At the same time, it was found by Khachikyan et al. [2012, IJG, doi: 10.4236/ijg.2012.35109] that Mmax value in any seismic region may be determined using Z_GSM value that is geomagnetic Z-component in this region estimated in geocentric solar-magnetosphere coordinate system (GSM). On the base of the global seismological catalog NEIC with M≥4.5 for 1973-2010 years, and the International Geomagnetic Reference Field (IGRF) model, an empirical relation was obtained as follows: Mmax= a + b {log[abs(Z_GSM)]}. For the case of the whole planet, obtained empirical coefficients are as follows: a = (5,22 ± 0,17), and b = (0,78 ± 0,06) with correlation coefficient R=0.91, standard deviation SD=0.56, and probability 95%. Further investigations showed that the coefficients of the regression equation are different for different seismically active regions of the planet. For example, to the territory of the San Andreas Fault, defined by the coordinates 30-45N, 105-135W obtained values are as follows: a = (4,04 ± 0.38) and b = (0.7 ± 0.13) with correlation coefficient R = 0.91, standard deviation SD = 0.34, and probability of 95%. For territory of inland seismicity in Eurasia defined by the coordinates 30-45N, 0-110E, a = (12.44 ± 0.48) and b = (1,15 ± 0.2) with correlation coefficient R = 0.87, standard deviation SD = 0.98, and probability of 95%, and for the territory of the strongest seismicity in the world defined by the coordinates 20S-20N, 90-150E, obtained values of a = (- 17.5 ± 1,5) and b = (5,7 ± 0.4) with correlation coefficient R = 0.97, standard deviation SD = 0.4, and probability of 95%. The relationship between the intensity of the main geomagnetic field and released seismic energy is expectable, because both the main geomagnetic field and the tectonic activity of the planet originate from the same source - the convection in the Earth's liquid core. The relationship between earthquake magnitude and geomagnetic Z - component expressed namely in geocentric solar magnetosphere coordinate system (GSM), in which the interaction of the solar wind magnetic field with the geomagnetic field is better ordered, indicates at the external (triggering) earthquake occurrence in the extremely stressed tectonic area. Above empirical relationships may be used (in first approximation) for global seismic zoning and for prediction of possible Mmax, when a place and time of earthquake occurrence are predicted. In report we present global maps of Z_GSM and Mmax estimated for different seasons and different times.
Evaluation of Quality of Life and Safety of Seniors in Golestan Province, Iran
Foroushani, Abbas Rahimi; Badakhshan, Abbas; Gholipour, Mahin; Hosseini, Masoumeh
2015-01-01
This study evaluated the criteria for quality of life (QoL) using standardized short-form health survey with only 36 questions (SF-36; Version 2.0) and Consumer Product Safety Commission (CPSC) questionnaires to study the relationship between QoL and living conditions of seniors in Golestan province in Iran. This was an analytical cross-sectional study with descriptive and analytical parts. The population was individuals above 65 years of age in Golestan province in Iran. The sample size was calculated based on the correlation coefficient; a correlation of .2 or greater was considered statistically significant at 80% for the power of the test at the 95% confidence level. The data on QoL of seniors were collected by interview and observation using the CPSC questionnaire for nursing homes and the SF-36 for QoL health indicators. The reliability of the CPSC questionnaire was estimated using Cronbach’s alpha with a coefficient of .838. The SF-36 questionnaire was validated with Cronbach’s alpha with a coefficient of .95. Chi-square and logistic regression were used to interpret the probability of abnormal QoL between levels of independent predictors. The percentage of seniors in overall poor health as a binary outcome was 43.5, and the percentage of unsafe conditions was 49.8. PMID:28138463
Kinetic modeling of copper biosorption by immobilized biomass
DOE Office of Scientific and Technical Information (OSTI.GOV)
Veglio, F.; Beolchini, F.; Toro, L.
1998-03-01
Biosorption of heavy metals is one of the most promising technologies involved in the removal of toxic metals from industrial waste streams and natural waters. The kinetic modeling of copper biosorption by Arthrobacter sp. immobilized in a hydroxyethyl methacrylate-based matrix is reported in this work. The resin-biomass complex (RBC) has been used for copper biosorption in different conditions according to a factorial experiment. Factors investigated were cross-linker (trimethylolpropane trimethacrylate) concentration, biomass concentration in the solid, and particles` granulometry. A maximum copper specific uptake of abut 7 mg of Cu/g of biomass (dry weight) has been observed, in the case ofmore » a RBC with the following characteristics: 2% (w/w) cross-linker concentration, 8% (w/w) biomass concentration, and 425--750 {micro}m granulometry. The shrinking core model has been used for the fitting of experimental data. A good fit has been found in the case of controlling intraparticle diffusion in all experimental trials. The copper diffusion coefficient in RBC has been estimated from the slope of the regression lines. Values obtained for the diffusion coefficients do not differ from one another with respect to the estimated standard error. An average apparent copper diffusion coefficient of about 3 {times} 10{sup {minus}6} cm{sup 2}/s has been found.« less
Cheng, Dengmiao; Feng, Yao; Liu, Yuanwang; Li, Jinpeng; Xue, Jianming; Li, Zhaojun
2018-09-01
Understanding antibiotic adsorption in livestock manures is crucial to assess the fate and risk of antibiotics in the environment. In this study, three quantitative models developed with swine manure-water distribution coefficients (LgK d ) for oxytetracycline (OTC), ciprofloxacin (CIP) and sulfamerazine (SM1) in swine manures. Physicochemical parameters (n=12) of the swine manure were used as independent variables using partial least-squares (PLSs) analysis. The cumulative cross-validated regression coefficients (Q 2 cum ) values, standard deviations (SDs) and external validation coefficient (Q 2 ext ) ranged from 0.761 to 0.868, 0.027 to 0.064, and 0.743 to 0.827 for the three models; as such, internal and external predictability of the models were strong. The pH, soluble organic carbon (SOC) and nitrogen (SON), and Ca were important explanatory variables for the OTC-Model, pH, SOC, and SON for the CIP-model, and pH, total organic nitrogen (TON), and SOC for the SM1-model. The high VIPs (variable importance in the projections) of pH (1.178-1.396), SOC (0.968-1.034), and SON (0.822 and 0.865) established these physicochemical parameters as likely being dominant (associatively) in affecting transport of antibiotics in swine manures. Copyright © 2018 Elsevier B.V. All rights reserved.
FGWAS: Functional genome wide association analysis.
Huang, Chao; Thompson, Paul; Wang, Yalin; Yu, Yang; Zhang, Jingwen; Kong, Dehan; Colen, Rivka R; Knickmeyer, Rebecca C; Zhu, Hongtu
2017-10-01
Functional phenotypes (e.g., subcortical surface representation), which commonly arise in imaging genetic studies, have been used to detect putative genes for complexly inherited neuropsychiatric and neurodegenerative disorders. However, existing statistical methods largely ignore the functional features (e.g., functional smoothness and correlation). The aim of this paper is to develop a functional genome-wide association analysis (FGWAS) framework to efficiently carry out whole-genome analyses of functional phenotypes. FGWAS consists of three components: a multivariate varying coefficient model, a global sure independence screening procedure, and a test procedure. Compared with the standard multivariate regression model, the multivariate varying coefficient model explicitly models the functional features of functional phenotypes through the integration of smooth coefficient functions and functional principal component analysis. Statistically, compared with existing methods for genome-wide association studies (GWAS), FGWAS can substantially boost the detection power for discovering important genetic variants influencing brain structure and function. Simulation studies show that FGWAS outperforms existing GWAS methods for searching sparse signals in an extremely large search space, while controlling for the family-wise error rate. We have successfully applied FGWAS to large-scale analysis of data from the Alzheimer's Disease Neuroimaging Initiative for 708 subjects, 30,000 vertices on the left and right hippocampal surfaces, and 501,584 SNPs. Copyright © 2017 Elsevier Inc. All rights reserved.
Li, Ji; Gray, B.R.; Bates, D.M.
2008-01-01
Partitioning the variance of a response by design levels is challenging for binomial and other discrete outcomes. Goldstein (2003) proposed four definitions for variance partitioning coefficients (VPC) under a two-level logistic regression model. In this study, we explicitly derived formulae for multi-level logistic regression model and subsequently studied the distributional properties of the calculated VPCs. Using simulations and a vegetation dataset, we demonstrated associations between different VPC definitions, the importance of methods for estimating VPCs (by comparing VPC obtained using Laplace and penalized quasilikehood methods), and bivariate dependence between VPCs calculated at different levels. Such an empirical study lends an immediate support to wider applications of VPC in scientific data analysis.
Lamadrid-Figueroa, Héctor; Téllez-Rojo, Martha M; Angeles, Gustavo; Hernández-Ávila, Mauricio; Hu, Howard
2011-01-01
In-vivo measurement of bone lead by means of K-X-ray fluorescence (KXRF) is the preferred biological marker of chronic exposure to lead. Unfortunately, considerable measurement error associated with KXRF estimations can introduce bias in estimates of the effect of bone lead when this variable is included as the exposure in a regression model. Estimates of uncertainty reported by the KXRF instrument reflect the variance of the measurement error and, although they can be used to correct the measurement error bias, they are seldom used in epidemiological statistical analyzes. Errors-in-variables regression (EIV) allows for correction of bias caused by measurement error in predictor variables, based on the knowledge of the reliability of such variables. The authors propose a way to obtain reliability coefficients for bone lead measurements from uncertainty data reported by the KXRF instrument and compare, by the use of Monte Carlo simulations, results obtained using EIV regression models vs. those obtained by the standard procedures. Results of the simulations show that Ordinary Least Square (OLS) regression models provide severely biased estimates of effect, and that EIV provides nearly unbiased estimates. Although EIV effect estimates are more imprecise, their mean squared error is much smaller than that of OLS estimates. In conclusion, EIV is a better alternative than OLS to estimate the effect of bone lead when measured by KXRF. Copyright © 2010 Elsevier Inc. All rights reserved.
Kupek, Emil
2006-03-15
Structural equation modelling (SEM) has been increasingly used in medical statistics for solving a system of related regression equations. However, a great obstacle for its wider use has been its difficulty in handling categorical variables within the framework of generalised linear models. A large data set with a known structure among two related outcomes and three independent variables was generated to investigate the use of Yule's transformation of odds ratio (OR) into Q-metric by (OR-1)/(OR+1) to approximate Pearson's correlation coefficients between binary variables whose covariance structure can be further analysed by SEM. Percent of correctly classified events and non-events was compared with the classification obtained by logistic regression. The performance of SEM based on Q-metric was also checked on a small (N = 100) random sample of the data generated and on a real data set. SEM successfully recovered the generated model structure. SEM of real data suggested a significant influence of a latent confounding variable which would have not been detectable by standard logistic regression. SEM classification performance was broadly similar to that of the logistic regression. The analysis of binary data can be greatly enhanced by Yule's transformation of odds ratios into estimated correlation matrix that can be further analysed by SEM. The interpretation of results is aided by expressing them as odds ratios which are the most frequently used measure of effect in medical statistics.
Nguyen, Quynh C; Osypuk, Theresa L; Schmidt, Nicole M; Glymour, M Maria; Tchetgen Tchetgen, Eric J
2015-03-01
Despite the recent flourishing of mediation analysis techniques, many modern approaches are difficult to implement or applicable to only a restricted range of regression models. This report provides practical guidance for implementing a new technique utilizing inverse odds ratio weighting (IORW) to estimate natural direct and indirect effects for mediation analyses. IORW takes advantage of the odds ratio's invariance property and condenses information on the odds ratio for the relationship between the exposure (treatment) and multiple mediators, conditional on covariates, by regressing exposure on mediators and covariates. The inverse of the covariate-adjusted exposure-mediator odds ratio association is used to weight the primary analytical regression of the outcome on treatment. The treatment coefficient in such a weighted regression estimates the natural direct effect of treatment on the outcome, and indirect effects are identified by subtracting direct effects from total effects. Weighting renders treatment and mediators independent, thereby deactivating indirect pathways of the mediators. This new mediation technique accommodates multiple discrete or continuous mediators. IORW is easily implemented and is appropriate for any standard regression model, including quantile regression and survival analysis. An empirical example is given using data from the Moving to Opportunity (1994-2002) experiment, testing whether neighborhood context mediated the effects of a housing voucher program on obesity. Relevant Stata code (StataCorp LP, College Station, Texas) is provided. © The Author 2015. Published by Oxford University Press on behalf of the Johns Hopkins Bloomberg School of Public Health. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Remote sensing of PM2.5 from ground-based optical measurements
NASA Astrophysics Data System (ADS)
Li, S.; Joseph, E.; Min, Q.
2014-12-01
Remote sensing of particulate matter concentration with aerodynamic diameter smaller than 2.5 um(PM2.5) by using ground-based optical measurements of aerosols is investigated based on 6 years of hourly average measurements of aerosol optical properties, PM2.5, ceilometer backscatter coefficients and meteorological factors from Howard University Beltsville Campus facility (HUBC). The accuracy of quantitative retrieval of PM2.5 using aerosol optical depth (AOD) is limited due to changes in aerosol size distribution and vertical distribution. In this study, ceilometer backscatter coefficients are used to provide vertical information of aerosol. It is found that the PM2.5-AOD ratio can vary largely for different aerosol vertical distributions. The ratio is also sensitive to mode parameters of bimodal lognormal aerosol size distribution when the geometric mean radius for the fine mode is small. Using two Angstrom exponents calculated at three wavelengths of 415, 500, 860nm are found better representing aerosol size distributions than only using one Angstrom exponent. A regression model is proposed to assess the impacts of different factors on the retrieval of PM2.5. Compared to a simple linear regression model, the new model combining AOD and ceilometer backscatter can prominently improve the fitting of PM2.5. The contribution of further introducing Angstrom coefficients is apparent. Using combined measurements of AOD, ceilometer backscatter, Angstrom coefficients and meteorological parameters in the regression model can get a correlation coefficient of 0.79 between fitted and expected PM2.5.
Factors in Variability of Serial Gabapentin Concentrations in Elderly Patients with Epilepsy.
Conway, Jeannine M; Eberly, Lynn E; Collins, Joseph F; Macias, Flavia M; Ramsay, R Eugene; Leppik, Ilo E; Birnbaum, Angela K
2017-10-01
To characterize and quantify the variability of serial gabapentin concentrations in elderly patients with epilepsy. This study included 83 patients (age ≥ 60 yrs) from an 18-center randomized double-blind double-dummy parallel study from the Veterans Affairs Cooperative 428 Study. All patients were taking 1500 mg/day gabapentin. Within-person coefficient of variation (CV) in gabapentin concentrations, measured weekly to bimonthly for up to 52 weeks, then quarterly, was computed. Impact of patient characteristics on gabapentin concentrations (linear mixed model) and CV (linear regression) were estimated. A total of 482 gabapentin concentration measurements were available for analysis. Gabapentin concentrations and intrapatient CVs ranged from 0.5 to 22.6 μg/ml (mean 7.9 μg/ml, standard deviation [SD] 4.1 μg/ml) and 2% to 79% (mean 27.9%, SD 15.3%), respectively, across all visits. Intrapatient CV was higher by 7.3% for those with a body mass index of ≥ 30 kg/m 2 (coefficient = 7.3, p=0.04). CVs were on average 0.5% higher for each 1-unit higher CV in creatinine clearance (coefficient = 0.5, p=0.03) and 1.2% higher for each 1-hour longer mean time after dose (coefficient = 1.2, p=0.04). Substantial intrapatient variability in serial gabapentin concentration was noted in elderly patients with epilepsy. Creatinine clearance, time of sampling relative to dose, and obesity were found to be positively associated with variability. © 2017 Pharmacotherapy Publications, Inc.
Wong, Ken; Smalarz, Amy; Wu, Ning; Boulanger, Luke; Wogen, Jenifer
2011-01-01
Care management processes (CMP) may be implemented in health systems to improve chronic disease quality of care. The objective of this study was to assess the relationship between the presence of hypertension-specific CMP and blood pressure (BP) control among hypertensive patients within selected physician organizations in the USA-modified version of the Physician Practice Connection Readiness Survey (PPC-RS), developed by The National Committee for Quality Assurance (NCQA), was administered to chief medical officers at 28 US-based physician organizations in 2010. Hypertension-specific survey items were added to the PPC-RS and focused on medication fill compliance, chronic disease management, and patient self-management. Demographic and clinical cross-sectional data from a random sample of 300 hypertensive patients age 18 years or older were collected at each site. Physician site and patient characteristics were reported. Regression models were used to assess the relationship between hypertension-specific physician practices and patient BP control. Eligible patients had at least a 1-year history of care with the physician organization and had an encounter within the past year of data collection. Of the 28 participating sites, most had electronic medical records that handle total functionality (71.4%) and had more than 50 staff members (78.6%). Across all sites, approximately 61% of patients had controlled BP. Regression analyses found that practices that used physician education as an effort to improve medication fill compliance demonstrated improvement in BP control (changes in systolic BP: beta coefficient = -1.366, P = .034; changes in diastolic BP: beta coefficient = -0.859, P = .056). The use of a systematic process to screen or assess patients for hypertension as a risk factor was also found to be associated with improvements in BP control (changes in diastolic BP: beta coefficient = -0.860, P = .006). In addition, physician practices that maintained a list of hypertensive patients along with the patients' associated clinical data demonstrated better BP control (currently controlled BP: beta coefficient = 0.282, P = .034; currently uncontrolled BP: beta coefficient = -0.292, P = .023). However, use of the following practices had a negative correlation with BP control: case management (changes in systolic BP: beta coefficient 1.649, P = .022; changes in diastolic BP: beta coefficient = 0.910, P = .078), follow-up for missed appointments (changes in systolic BP: beta coefficient = 0.937, P = .041; changes in diastolic BP: beta coefficient = 0.165, P = .627), adopted written evidence-based standards of care to treat hypertension (changes in systolic BP: beta coefficient = 0.985, P = .032; changes in diastolic BP: beta coefficient = 0.346, P = .305), and checklists for tests and interventions (changes in systolic BP: beta coefficient = 1.586, P = .004; changes in diastolic BP: beta coefficient = 0.938, P = .019). Findings from this multisite study provide evidence that the presence of some hypertension-specific CMP in physician organizations may be associated with better BP outcomes among hypertensive patients. In particular, patients may benefit from physician efforts to improve medication fill compliance as well as organizational monitoring of hypertensive patients and their clinical data. Further research is warranted to better assess the relationship between CMP and treatment of chronic diseases such as hypertension over time. Copyright © 2011 American Society of Hypertension. Published by Elsevier Inc. All rights reserved.
Fluctuations in air pollution give risk warning signals of asthma hospitalization
NASA Astrophysics Data System (ADS)
Hsieh, Nan-Hung; Liao, Chung-Min
2013-08-01
Recent studies have implicated that air pollution has been associated with asthma exacerbations. However, the key link between specific air pollutant and the consequent impact on asthma has not been shown. The purpose of this study was to quantify the fluctuations in air pollution time-series dynamics to correlate the relationships between statistical indicators and age-specific asthma hospital admissions. An indicators-based regression model was developed to predict the time-trend of asthma hospital admissions in Taiwan in the period 1998-2010. Five major pollutants such as particulate matters with aerodynamic diameter less than 10 μm (PM10), ozone (O3), nitrogen dioxide (NO2), sulfur dioxide (SO2), and carbon monoxide (CO) were included. We used Spearman's rank correlation to detect the relationships between time-series based statistical indicators of standard deviation, coefficient of variation, skewness, and kurtosis and monthly asthma hospitalization. We further used the indicators-guided Poisson regression model to test and predict the impact of target air pollutants on asthma incidence. Here we showed that standard deviation of PM10 data was the most correlated indicators for asthma hospitalization for all age groups, particularly for elderly. The skewness of O3 data gives the highest correlation to adult asthmatics. The proposed regression model shows a better predictability in annual asthma hospitalization trends for pediatrics. Our results suggest that a set of statistical indicators inferred from time-series information of major air pollutants can provide advance risk warning signals in complex air pollution-asthma systems and aid in asthma management that depends heavily on monitoring the dynamics of asthma incidence and environmental stimuli.
Quantifying the uncertainty of regional and national estimates of soil carbon stocks
NASA Astrophysics Data System (ADS)
Papritz, Andreas
2013-04-01
At regional and national scales, carbon (C) stocks are frequently estimated by means of regression models. Such statistical models link measurements of carbons stocks, recorded for a set of soil profiles or soil cores, to covariates that characterize soil formation conditions and land management. A prerequisite is that these covariates are available for any location within a region of interest G because they are used along with the fitted regression coefficients to predict the carbon stocks at the nodes of a fine-meshed grid that is laid over G. The mean C stock in G is then estimated by the arithmetic mean of the stock predictions for the grid nodes. Apart from the mean stock, the precision of the estimate is often also of interest, for example to judge whether the mean C stock has changed significantly between two inventories. The standard error of the estimated mean stock in G can be computed from the regression results as well. Two issues are thereby important: (i) How large is the area of G relative to the support of the measurements? (ii) Are the residuals of the regression model spatially auto-correlated or is the assumption of statistical independence tenable? Both issues are correctly handled if one adopts a geostatistical block kriging approach for estimating the mean C stock within a region and its standard error. In the presentation I shall summarize the main ideas of external drift block kriging. To compute the standard error of the mean stock, one has in principle to sum the elements a potentially very large covariance matrix of point prediction errors, but I shall show that the required term can be approximated very well by Monte Carlo techniques. I shall further illustrated with a few examples how the standard error of the mean stock estimate changes with the size of G and with the strenght of the auto-correlation of the regression residuals. As an application a robust variant of block kriging is used to quantify the mean carbon stock stored in the soils of Swiss forests (Nussbaum et al., 2012). Nussbaum, M., Papritz, A., Baltensweiler, A., and Walthert, L. (2012). Organic carbon stocks of swiss forest soils. Final report, Institute of Terrestrial Ecosystems, ETH Zürich and Swiss Federal Institute for Forest, Snow and Landscape Research (WSL), pp. 51, http://e-collection.library.ethz.ch/eserv/eth:6027/eth-6027-01.pdf
ISO/IEC 17025 Sysmex R-500 hematology reticulocyte analyzer validation.
Dimopoulou, H A; Theodoridis, T; Galea, V; Christopoulou-Cokkinou, V; Spyridaki, M-H E; Georgakopoulos, C G
2007-01-01
The Sysmex R-500 (R-500) Hematology Analyzer is a bench-top system appropriate for the analysis of limited batches of blood samples. The R-500 provides percentage proportional (RET%), absolute reticulocyte (RET#), and absolute red blood cell (RBC#) counts. The system was validated at the Doping Control Laboratory of Athens, according to the International Committee for Standardization in Hematology, International Standards Organization (ISO/IEC) 17025, and World Antidoping Agency (WADA) specifications. The instrument calibration was performed according to the manufacturer and validation parameters comprised linearity, precision, uncertainty (intermediate and long-term precision), comparability, effect of drift, carryover, stability, and accuracy. The linearity and the comparability studies for RET#, RET%, and RBC# were expressed in regression factors (R2) and coefficients of correlation [r(x, y)], respectively. For the precision studies, the coefficients of variation for RET#, RET%, and RBC# were 9.49%, 9.83%, and <1.5%, respectively. For the intermediate precision studies, the coefficients of variation for RET#, RET%, and RBC# were 3.1%, 3.6%, and 0.6%, respectively. Carryover was found to be negligible. Sample stability was demonstrated at both room temperature and at 4 degrees C over a 24-hour period. Comparability studies for the R-500 were performed using a Sysmex SE-9500. The total evaluation led to the conclusion that the R-500 is an accurate and precise analyzer and because of to its relatively limited size, it can be considered a portable instrument, capable to be used in sports competition and training sites, where doping control and health tests are conducted. The analytical methodology of RET% measurement by the R-500 has been incorporated into the Doping Control Laboratory of Athens' Scope of Accreditation according to the ISO/IEC 17025 and WADA specifications.
Quantitative analysis of bayberry juice acidity based on visible and near-infrared spectroscopy
DOE Office of Scientific and Technical Information (OSTI.GOV)
Shao Yongni; He Yong; Mao Jingyuan
Visible and near-infrared (Vis/NIR) reflectance spectroscopy has been investigated for its ability to nondestructively detect acidity in bayberry juice. What we believe to be a new, better mathematic model is put forward, which we have named principal component analysis-stepwise regression analysis-backpropagation neural network (PCA-SRA-BPNN), to build a correlation between the spectral reflectivity data and the acidity of bayberry juice. In this model, the optimum network parameters,such as the number of input nodes, hidden nodes, learning rate, and momentum, are chosen by the value of root-mean-square (rms) error. The results show that its prediction statistical parameters are correlation coefficient (r) ofmore » 0.9451 and root-mean-square error of prediction(RMSEP) of 0.1168. Partial least-squares (PLS) regression is also established to compare with this model. Before doing this, the influences of various spectral pretreatments (standard normal variate, multiplicative scatter correction, S. Golay first derivative, and wavelet package transform) are compared. The PLS approach with wavelet package transform preprocessing spectra is found to provide the best results, and its prediction statistical parameters are correlation coefficient (r) of 0.9061 and RMSEP of 0.1564. Hence, these two models are both desirable to analyze the data from Vis/NIR spectroscopy and to solve the problem of the acidity prediction of bayberry juice. This supplies basal research to ultimately realize the online measurements of the juice's internal quality through this Vis/NIR spectroscopy technique.« less
Aldosterone and glomerular filtration--observations in the general population.
Hannemann, Anke; Rettig, Rainer; Dittmann, Kathleen; Völzke, Henry; Endlich, Karlhans; Nauck, Matthias; Wallaschofski, Henri
2014-03-10
Increasing evidence suggests that aldosterone promotes renal damage. Since data on the association between aldosterone and renal function in the general population are sparse, we chose to address this issue. We investigated the associations between the plasma aldosterone concentration (PAC) or the aldosterone-to-renin ratio (ARR) and the estimated glomerular filtration rate (eGFR) in a sample of adult men and women from Northeast Germany. A study population of 1921 adult men and women who participated in the first follow-up of the Study of Health in Pomerania was selected. None of the subjects used drugs that alter PAC or ARR. The eGFR was calculated according to the four-variable Modification of Diet in Renal Disease formula. Chronic kidney disease (CKD) was defined as an eGFR < 60 ml/min/1.73 m2. Linear regression models, adjusted for sex, age, waist circumference, diabetes mellitus, smoking status, systolic and diastolic blood pressures, serum triglyceride concentrations and time of blood sampling revealed inverse associations of PAC or ARR with eGFR (ß-coefficient for log-transformed PAC -3.12, p < 0.001; ß-coefficient for log-transformed ARR -3.36, p < 0.001). Logistic regression models revealed increased odds for CKD with increasing PAC (odds ratio for a one standard deviation increase in PAC: 1.35, 95% confidence interval: 1.06-1.71). There was no statistically significant association between ARR and CKD. Our study demonstrates that PAC and ARR are inversely associated with the glomerular filtration rate in the general population.
Misyura, Maksym; Sukhai, Mahadeo A; Kulasignam, Vathany; Zhang, Tong; Kamel-Reid, Suzanne; Stockley, Tracy L
2018-02-01
A standard approach in test evaluation is to compare results of the assay in validation to results from previously validated methods. For quantitative molecular diagnostic assays, comparison of test values is often performed using simple linear regression and the coefficient of determination (R 2 ), using R 2 as the primary metric of assay agreement. However, the use of R 2 alone does not adequately quantify constant or proportional errors required for optimal test evaluation. More extensive statistical approaches, such as Bland-Altman and expanded interpretation of linear regression methods, can be used to more thoroughly compare data from quantitative molecular assays. We present the application of Bland-Altman and linear regression statistical methods to evaluate quantitative outputs from next-generation sequencing assays (NGS). NGS-derived data sets from assay validation experiments were used to demonstrate the utility of the statistical methods. Both Bland-Altman and linear regression were able to detect the presence and magnitude of constant and proportional error in quantitative values of NGS data. Deming linear regression was used in the context of assay comparison studies, while simple linear regression was used to analyse serial dilution data. Bland-Altman statistical approach was also adapted to quantify assay accuracy, including constant and proportional errors, and precision where theoretical and empirical values were known. The complementary application of the statistical methods described in this manuscript enables more extensive evaluation of performance characteristics of quantitative molecular assays, prior to implementation in the clinical molecular laboratory. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2018. All rights reserved. No commercial use is permitted unless otherwise expressly granted.
Shen, Wei; Scherzer, Rebecca; Gantz, Madeleine; Chen, Jun; Punyanitya, Mark; Lewis, Cora E; Grunfeld, Carl
2012-04-01
An increasing number of studies suggest that bone marrow adipose tissue (BMAT) might play a role in the pathogenesis of osteoporosis. Our previous study of Caucasian women demonstrated that there is an inverse relationship between BMAT and whole-body bone mineral density (BMD). It is unknown whether visceral adipose tissue (VAT), sc adipose tissue (SAT), and skeletal muscle had an effect on the relationship between BMAT and BMD. In the present study we investigated the relationship between pelvic, hip, and lumbar spine BMAT with hip and lumbar spine BMD in the population-based Coronary Artery Risk Development in Young Adults (CARDIA) sample with adjustment for whole-body magnetic resonance imaging (MRI)-measured VAT, SAT, and skeletal muscle. T1-weighted MRI was acquired for 210 healthy African-American and Caucasian men and women (age 38-52 yr). Hip and lumbar spine BMD were measured by dual-energy x-ray absorptiometry. Pelvic, hip, and lumbar spine BMAT had negative correlations with hip and lumbar spine BMD (r = -0.399 to -0.550, P < 0.001). The inverse associations between BMAT and BMD remained strong after adjusting for demographics, weight, skeletal muscle, SAT, VAT, total adipose tissue (TAT), menopausal status, lifestyle factors, and inflammatory markers (standardized regression coefficients = -0. 296 to -0.549, P < 0.001). Among body composition measures, skeletal muscle was the strongest correlate of BMD after adjusting for BMAT (standardized regression coefficients = 0.268-0.614, P < 0.05), with little additional contribution from weight, SAT, VAT, or total adipose tissue. In this middle-aged population, a negative relationship existed between MRI-measured BMAT and hip and lumbar spine BMD independent of demographics and body composition. These observations support the growing evidence linking BMAT with low bone density.
Scherzer, Rebecca; Gantz, Madeleine; Chen, Jun; Punyanitya, Mark; Lewis, Cora E.; Grunfeld, Carl
2012-01-01
Context: An increasing number of studies suggest that bone marrow adipose tissue (BMAT) might play a role in the pathogenesis of osteoporosis. Our previous study of Caucasian women demonstrated that there is an inverse relationship between BMAT and whole-body bone mineral density (BMD). It is unknown whether visceral adipose tissue (VAT), sc adipose tissue (SAT), and skeletal muscle had an effect on the relationship between BMAT and BMD. Objective: In the present study we investigated the relationship between pelvic, hip, and lumbar spine BMAT with hip and lumbar spine BMD in the population-based Coronary Artery Risk Development in Young Adults (CARDIA) sample with adjustment for whole-body magnetic resonance imaging (MRI)-measured VAT, SAT, and skeletal muscle. Design: T1-weighted MRI was acquired for 210 healthy African-American and Caucasian men and women (age 38–52 yr). Hip and lumbar spine BMD were measured by dual-energy x-ray absorptiometry. Results: Pelvic, hip, and lumbar spine BMAT had negative correlations with hip and lumbar spine BMD (r = −0.399 to −0.550, P < 0.001). The inverse associations between BMAT and BMD remained strong after adjusting for demographics, weight, skeletal muscle, SAT, VAT, total adipose tissue (TAT), menopausal status, lifestyle factors, and inflammatory markers (standardized regression coefficients = −0. 296 to −0.549, P < 0.001). Among body composition measures, skeletal muscle was the strongest correlate of BMD after adjusting for BMAT (standardized regression coefficients = 0.268–0.614, P < 0.05), with little additional contribution from weight, SAT, VAT, or total adipose tissue. Conclusion: In this middle-aged population, a negative relationship existed between MRI-measured BMAT and hip and lumbar spine BMD independent of demographics and body composition. These observations support the growing evidence linking BMAT with low bone density. PMID:22319043
DOE Office of Scientific and Technical Information (OSTI.GOV)
Dierauf, Timothy; Kurtz, Sarah; Riley, Evan
This paper provides a recommended method for evaluating the AC capacity of a photovoltaic (PV) generating station. It also presents companion guidance on setting the facilitys capacity guarantee value. This is a principles-based approach that incorporates plant fundamental design parameters such as loss factors, module coefficients, and inverter constraints. This method has been used to prove contract guarantees for over 700 MW of installed projects. The method is transparent, and the results are deterministic. In contrast, current industry practices incorporate statistical regression where the empirical coefficients may only characterize the collected data. Though these methods may work well when extrapolationmore » is not required, there are other situations where the empirical coefficients may not adequately model actual performance.This proposed Fundamentals Approach method provides consistent results even where regression methods start to lose fidelity.« less
Hunter, Paul R
2009-12-01
Household water treatment (HWT) is being widely promoted as an appropriate intervention for reducing the burden of waterborne disease in poor communities in developing countries. A recent study has raised concerns about the effectiveness of HWT, in part because of concerns over the lack of blinding and in part because of considerable heterogeneity in the reported effectiveness of randomized controlled trials. This study set out to attempt to investigate the causes of this heterogeneity and so identify factors associated with good health gains. Studies identified in an earlier systematic review and meta-analysis were supplemented with more recently published randomized controlled trials. A total of 28 separate studies of randomized controlled trials of HWT with 39 intervention arms were included in the analysis. Heterogeneity was studied using the "metareg" command in Stata. Initial analyses with single candidate predictors were undertaken and all variables significant at the P < 0.2 level were included in a final regression model. Further analyses were done to estimate the effect of the interventions over time by MonteCarlo modeling using @Risk and the parameter estimates from the final regression model. The overall effect size of all unblinded studies was relative risk = 0.56 (95% confidence intervals 0.51-0.63), but after adjusting for bias due to lack of blinding the effect size was much lower (RR = 0.85, 95% CI = 0.76-0.97). Four main variables were significant predictors of effectiveness of intervention in a multipredictor meta regression model: Log duration of study follow-up (regression coefficient of log effect size = 0.186, standard error (SE) = 0.072), whether or not the study was blinded (coefficient 0.251, SE 0.066) and being conducted in an emergency setting (coefficient -0.351, SE 0.076) were all significant predictors of effect size in the final model. Compared to the ceramic filter all other interventions were much less effective (Biosand 0.247, 0.073; chlorine and safe waste storage 0.295, 0.061; combined coagulant-chlorine 0.2349, 0.067; SODIS 0.302, 0.068). A Monte Carlo model predicted that over 12 months ceramic filters were likely to be still effective at reducing disease, whereas SODIS, chlorination, and coagulation-chlorination had little if any benefit. Indeed these three interventions are predicted to have the same or less effect than what may be expected due purely to reporting bias in unblinded studies With the currently available evidence ceramic filters are the most effective form of HWT in the longterm, disinfection-only interventions including SODIS appear to have poor if any longterm public health benefit.
Liyanaarachchi, G V V; Mahanama, K R R; Somasiri, H P P S; Punyasiri, P A N
2018-02-01
The study presents the validation results of the method carried out for analysis of free amino acids (FAAs) in rice using l-theanine as the internal standard (IS) with o-phthalaldehyde (OPA) reagent using high-performance liquid chromatography-fluorescence detection. The detection and quantification limits of the method were in the range 2-16μmol/kg and 3-19μmol/kg respectively. The method had a wide working range from 25 to 600μmol/kg for each individual amino acid, and good linearity with regression coefficients greater than 0.999. Precision measured in terms of repeatability and reproducibility, expressed as percentage relative standard deviation (% RSD) was below 9% for all the amino acids analyzed. The recoveries obtained after fortification at three concentration levels were in the range 75-105%. In comparison to l-norvaline, findings revealed that l-theanine is suitable as an IS and the validated method can be used for FAA determination in rice. Copyright © 2017 Elsevier Ltd. All rights reserved.
A New Test of Linear Hypotheses in OLS Regression under Heteroscedasticity of Unknown Form
ERIC Educational Resources Information Center
Cai, Li; Hayes, Andrew F.
2008-01-01
When the errors in an ordinary least squares (OLS) regression model are heteroscedastic, hypothesis tests involving the regression coefficients can have Type I error rates that are far from the nominal significance level. Asymptotically, this problem can be rectified with the use of a heteroscedasticity-consistent covariance matrix (HCCM)…
Osinga, Rik; Babst, Doris; Bodmer, Elvira S; Link, Bjoern C; Fritsche, Elmar; Hug, Urs
2017-12-01
This work assessed both subjective and objective postoperative parameters after breast reduction surgery and compared between patients and plastic surgeons. After an average postoperative observation period of 6.7 ± 2.7 (2 - 13) years, 159 out of 259 patients (61 %) were examined. The mean age at the time of surgery was 37 ± 14 (15 - 74) years. The postoperative anatomy of the breast and other anthropometric parameters were measured in cm with the patient in an upright position. The visual analogue scale (VAS) values for symmetry, size, shape, type of scar and overall satisfaction both from the patient's and from four plastic surgeons' perspectives were assessed and compared. Patients rated the postoperative result significantly better than surgeons. Good subjective ratings by patients for shape, symmetry and sensitivity correlated with high scores for overall assessment. Shape had the strongest influence on overall satisfaction (regression coefficient 0.357; p < 0.001), followed by symmetry (regression coefficient 0.239; p < 0.001) and sensitivity (regression coefficient 0.109; p = 0.040) of the breast. The better the subjective rating for symmetry by the patient, the smaller the measured difference of the jugulum-mamillary distance between left and right (regression coefficient -0.773; p = 0.002) and the smaller the difference in height of the lowest part of the breast between left and right (regression coefficient -0.465; p = 0.035). There was no significant correlation between age, weight, height, BMI, resected weight of the breast, postoperative breast size or type of scar with overall satisfaction. After breast reduction surgery, long-term outcome is rated significantly better by patients than by plastic surgeons. Good subjective ratings by patients for shape, symmetry and sensitivity correlated with high scores for overall assessment. Shape had the strongest influence on overall satisfaction, followed by symmetry and sensitivity of the breast. Postoperative size of the breast, resection weight, type of scar, age or BMI was not of significant influence. Symmetry was the only assessed subjective parameter of this study that could be objectified by postoperative measurements. Georg Thieme Verlag KG Stuttgart · New York.
Rights, Jason D; Sterba, Sonya K
2016-11-01
Multilevel data structures are common in the social sciences. Often, such nested data are analysed with multilevel models (MLMs) in which heterogeneity between clusters is modelled by continuously distributed random intercepts and/or slopes. Alternatively, the non-parametric multilevel regression mixture model (NPMM) can accommodate the same nested data structures through discrete latent class variation. The purpose of this article is to delineate analytic relationships between NPMM and MLM parameters that are useful for understanding the indirect interpretation of the NPMM as a non-parametric approximation of the MLM, with relaxed distributional assumptions. We define how seven standard and non-standard MLM specifications can be indirectly approximated by particular NPMM specifications. We provide formulas showing how the NPMM can serve as an approximation of the MLM in terms of intraclass correlation, random coefficient means and (co)variances, heteroscedasticity of residuals at level 1, and heteroscedasticity of residuals at level 2. Further, we discuss how these relationships can be useful in practice. The specific relationships are illustrated with simulated graphical demonstrations, and direct and indirect interpretations of NPMM classes are contrasted. We provide an R function to aid in implementing and visualizing an indirect interpretation of NPMM classes. An empirical example is presented and future directions are discussed. © 2016 The British Psychological Society.
Assessment of Uncertainties Related to Seismic Hazard Using Fuzzy Analysis
NASA Astrophysics Data System (ADS)
Jorjiashvili, N.; Yokoi, T.; Javakhishvili, Z.
2013-05-01
Seismic hazard analysis in last few decades has been become very important issue. Recently, new technologies and available data have been improved that helped many scientists to understand where and why earthquakes happen, physics of earthquakes, etc. They have begun to understand the role of uncertainty in Seismic hazard analysis. However, there is still significant problem how to handle existing uncertainty. The same lack of information causes difficulties to quantify uncertainty accurately. Usually attenuation curves are obtained in statistical way: regression analysis. Statistical and probabilistic analysis show overlapped results for the site coefficients. This overlapping takes place not only at the border between two neighboring classes, but also among more than three classes. Although the analysis starts from classifying sites using the geological terms, these site coefficients are not classified at all. In the present study, this problem is solved using Fuzzy set theory. Using membership functions the ambiguities at the border between neighboring classes can be avoided. Fuzzy set theory is performed for southern California by conventional way. In this study standard deviations that show variations between each site class obtained by Fuzzy set theory and classical way are compared. Results on this analysis show that when we have insufficient data for hazard assessment site classification based on Fuzzy set theory shows values of standard deviations less than obtained by classical way which is direct proof of less uncertainty.
Belief in complementary and alternative medicine is related to age and paranormal beliefs in adults.
Van den Bulck, Jan; Custers, Kathleen
2010-04-01
The use of complementary and alternative medicine (CAM) is widespread, even among people who use conventional medicine. Positive beliefs about CAM are common among physicians and medical students. Little is known about the beliefs regarding CAM among the general public. Among science students, belief in CAM was predicted by belief in the paranormal. In a cross-sectional study, 712 randomly selected adults (>18 years old) responded to the CAM Health Belief Questionnaire (CHBQ) and a paranormal beliefs scale. CAM beliefs were very prevalent in this sample of adult Flemish men and women. Zero-order correlations indicated that belief in CAM was associated with age (r = 0.173 P < 0.001) level of education (r = -0.079 P = 0.039) social desirability (r = -0.119 P = 0.002) and paranormal belief (r = 0.365 P < 0.001). In a multivariate model, two variables predicted CAM beliefs. Support for CAM increased with age (regression coefficient: 0.01; 95% confidence interval (CI): 0.006 to 0.014), but the strongest relationship existed between support for CAM and beliefs in the paranormal. Paranormal beliefs accounted for 14% of the variance of the CAM beliefs (regression coefficient: 0.376; 95%: CI 0.30-0.44). The level of education (regression coefficient: 0.06; 95% CI: -0.014-0.129) and social desirability (regression coefficient: -0.023; 95% CI: -0.048-0.026) did not make a significant contribution to the explained variance (<0.1%, P = 0.867). Support of CAM was very prevalent in this Flemish adult population. CAM beliefs were strongly associated with paranormal beliefs.
Mastin, Mark C.; Konrad, Christopher P.; Veilleux, Andrea G.; Tecca, Alison E.
2016-09-20
An investigation into the magnitude and frequency of floods in Washington State computed the annual exceedance probability (AEP) statistics for 648 U.S. Geological Survey unregulated streamgages in and near the borders of Washington using the recorded annual peak flows through water year 2014. This is an updated report from a previous report published in 1998 that used annual peak flows through the water year 1996. New in this report, a regional skew coefficient was developed for the Pacific Northwest region that includes areas in Oregon, Washington, Idaho and western Montana within the Columbia River drainage basin south of the United States-Canada border, the coastal areas of Oregon and western Washington, and watersheds draining into Puget Sound, Washington. The skew coefficient is an important term in the Log Pearson Type III equation used to define the distribution of the log-transformed annual peaks. The Expected Moments Algorithm was used to fit historical and censored peak-flow data to the log Pearson Type III distribution. A Multiple Grubb-Beck test was employed to censor low outliers of annual peak flows to improve on the frequency distribution. This investigation also includes a section on observed trends in annual peak flows that showed significant trends (p-value < 0.05) in 21 of 83 long-term sites, but with small magnitude Kendall tau values suggesting a limited monotonic trend in the time series of annual peaks. Most of the sites with a significant trend in western Washington were positive and all the sites with significant trends (three sites) in eastern Washington were negative.Multivariate regression analysis with measured basin characteristics and the AEP statistics at long-term, unregulated, and un-urbanized (defined as drainage basins with less than 5 percent impervious land cover for this investigation) streamgages within Washington and some in Idaho and Oregon that are near the Washington border was used to develop equations to estimate AEP statistics at ungaged basins. Washington was divided into four regions to improve the accuracy of the regression equations; a set of equations for eight selected AEPs and for each region were constructed. Selected AEP statistics included the annual peak flows that equaled or exceeded 50, 20, 10, 4, 2, 1, 0.5 and 0.2 percent of the time equivalent to peak flows for peaks with a 2-, 5-, 10-, 25-, 50-, 100-, 200-, and 500-year recurrence intervals, respectively. Annual precipitation and drainage area were the significant basin characteristics in the regression equations for all four regression regions in Washington and forest cover was significant for the two regression regions in eastern Washington. Average standard error of prediction for the regional regression equations ranged from 70.19 to 125.72 percent for Regression Regions 1 and 2 on the eastern side of the Cascade Mountains and from 43.22 to 58.04 percent for Regression Regions 3 and 4 on the western side of the Cascade Mountains. The pseudo coefficient of determination (where a value of 100 signifies a perfect regression model) ranged from 68.39 to 90.68 for Regression Regions 1 and 2, and 92.35 to 95.44 for Regions 3 and 4.The calculated AEP statistics for the streamgages and the regional regression equations are expected to be incorporated into StreamStats after the publication of this report. StreamStats is the interactive Web-based map tool created by the U.S. Geological Survey to allow the user to choose a streamgage and obtain published statistics or choose ungaged locations where the program automatically applies the regional regression equations and computes the estimates of the AEP statistics.
Raleigh, Veena; Sizmur, Steve; Tian, Yang; Thompson, James
2015-04-01
To examine the impact of patient-mix on National Health Service (NHS) acute hospital trust scores in two national NHS patient surveys. Secondary analysis of 2012 patient survey data for 57,915 adult inpatients at 142 NHS acute hospital trusts and 45,263 adult emergency department attendees at 146 NHS acute hospital trusts in England. Changes in trust scores for selected questions, ranks, inter-trust variance and score-based performance bands were examined using three methods: no adjustment for case-mix; the current standardization method with weighting for age, sex and, for inpatients only, admission method; and a regression model adjusting in addition for ethnicity, presence of a long-term condition, proxy response (inpatients only) and previous emergency attendances (emergency department survey only). For both surveys, all the variables examined were associated with patients' responses and affected inter-trust variance in scores, although the direction and strength of impact differed between variables. Inter-trust variance was generally greatest for the unadjusted scores and lowest for scores derived from the full regression model. Although trust scores derived from the three methods were highly correlated (Kendall's tau coefficients 0.70-0.94), up to 14% of trusts had discordant ranks of when the standardization and regression methods were compared. Depending on the survey and question, up to 14 trusts changed performance bands when the regression model with its fuller case-mix adjustment was used rather than the current standardization method. More comprehensive case-mix adjustment of patient survey data than the current limited adjustment reduces performance variation between NHS acute hospital trusts and alters the comparative performance bands of some trusts. Given the use of these data for high-impact purposes such as performance assessment, regulation, commissioning, quality improvement and patient choice, a review of the long-standing method for analysing patient survey data would be timely, and could improve rigour and comparability across the NHS. Performance comparisons need to be perceived as fair and scientifically robust to maintain confidence in publicly reported data, and to support their use by both the public and the NHS. © The Author(s) 2014 Reprints and permissions: sagepub.co.uk/journalsPermissions.nav.
Khan, Nazeer; Siddiqui, Junaid S; Baig-Ansari, Naila
2018-01-01
Background Growth charts are essential tools used by pediatricians as well as public health researchers in assessing and monitoring the well-being of pediatric populations. Development of these growth charts, especially for children above five years of age, is challenging and requires current anthropometric data and advanced statistical analysis. These growth charts are generally presented as a series of smooth centile curves. A number of modeling approaches are available for generating growth charts and applying these on national datasets is important for generating country-specific reference growth charts. Objective To demonstrate that quantile regression (QR) as a viable statistical approach to construct growth reference charts and to assess the applicability of the World Health Organization (WHO) 2007 growth standards to a large Pakistani population of school-going children. Methodology This is a secondary data analysis using anthropometric data of 9,515 students from a Pakistani survey conducted between 2007 and 2014 in four cities of Pakistan. Growth reference charts were created using QR as well as the LMS (Box-Cox transformation (L), the median (M), and the generalized coefficient of variation (S)) method and then compared with WHO 2007 growth standards. Results Centile values estimated by the LMS method and QR procedure had few differences. The centile values attained from QR procedure of BMI-for-age, weight-for-age, and height-for-age of Pakistani children were lower than the standard WHO 2007 centile. Conclusion QR should be considered as an alternative method to develop growth charts for its simplicity and lack of necessity to transform data. WHO 2007 standards are not suitable for Pakistani children. PMID:29632748
Iftikhar, Sundus; Khan, Nazeer; Siddiqui, Junaid S; Baig-Ansari, Naila
2018-02-02
Background Growth charts are essential tools used by pediatricians as well as public health researchers in assessing and monitoring the well-being of pediatric populations. Development of these growth charts, especially for children above five years of age, is challenging and requires current anthropometric data and advanced statistical analysis. These growth charts are generally presented as a series of smooth centile curves. A number of modeling approaches are available for generating growth charts and applying these on national datasets is important for generating country-specific reference growth charts. Objective To demonstrate that quantile regression (QR) as a viable statistical approach to construct growth reference charts and to assess the applicability of the World Health Organization (WHO) 2007 growth standards to a large Pakistani population of school-going children. Methodology This is a secondary data analysis using anthropometric data of 9,515 students from a Pakistani survey conducted between 2007 and 2014 in four cities of Pakistan. Growth reference charts were created using QR as well as the LMS (Box-Cox transformation (L), the median (M), and the generalized coefficient of variation (S)) method and then compared with WHO 2007 growth standards. Results Centile values estimated by the LMS method and QR procedure had few differences. The centile values attained from QR procedure of BMI-for-age, weight-for-age, and height-for-age of Pakistani children were lower than the standard WHO 2007 centile. Conclusion QR should be considered as an alternative method to develop growth charts for its simplicity and lack of necessity to transform data. WHO 2007 standards are not suitable for Pakistani children.
Characteristics of low-slope streams that affect O2 transfer rates
Parker, Gene W.; Desimone, Leslie A.
1991-01-01
Multiple-regression techniques were used to derive the reaeration coefficients estimating equation for low sloped streams: K2 = 3.83 MBAS-0.41 SL0.20 H-0.76, where K2 is the reaeration coefficient in base e units per day; MBAS is the methylene blue active substances concentration in milligrams per liter; SL is the water-surface slope in foot per foot; and H is the mean-flow depth in feet. Fourteen hydraulic, physical, and water-quality characteristics were regressed against 29 measured-reaeration coefficients for low-sloped (water surface slopes less than 0.002 foot per foot) streams in Massachusetts and New York. Reaeration coefficients measured from May 1985 to October 1988 ranged from 0.2 to 11.0 base e units per day for 29 low-sloped tracer studies. Concentration of methylene blue active substances is significant because it is thought to be an indicator of concentration of surfactants which could change the surface tension at the air-water interface.
A fluorometric determination of urinary 17-hydroxycorticosteroids using benzamidine.
Yamaguchi, Y; Seki, T
1984-10-01
A fluorometric determination of urinary 17-hydroxycorticosteroids using a reaction of benzamidine with compounds carrying the dihydroxyacetone side chain is described. The fluorescent compounds have excitation and emission maxima at 370 and 480 nm, respectively. The method includes enzymatic hydrolysis with beta-glucuronidase (EC 3.2.1.31, from Escherichia coli) and extraction with methylene chloride and generation of fluorescence in alkaline solution (pH 13.4). The specificity of the reaction was examined and the results were compared with those of an accepted method based on the Porter-Silber reaction (C. C. Porter and R. H. Silber, 1950, J. Biol. Chem. 185, 201-207). The coefficient of correlation was 0.945 with regression line of y = 0.91x + 0.7 mg/day (y, present method; x, Porter-Silber reaction method). Sensitivity of the reaction was 0.5 microgram/ml of standard or sample, mean recovery of cortisol added to five urine samples (5-micrograms addition) was 95%, and the coefficient of variation of the method (five repeated assays of sample with a value of 5.2 mg/liter) was 6.2%.
Schmitt, Neal; Golubovich, Juliya; Leong, Frederick T L
2011-12-01
The impact of measurement invariance and the provision for partial invariance in confirmatory factor analytic models on factor intercorrelations, latent mean differences, and estimates of relations with external variables is investigated for measures of two sets of widely assessed constructs: Big Five personality and the six Holland interests (RIASEC). In comparing models that include provisions for partial invariance with models that do not, the results indicate quite small differences in parameter estimates involving the relations between factors, one relatively large standardized mean difference in factors between the subgroups compared and relatively small differences in the regression coefficients when the factors are used to predict external variables. The results provide support for the use of partially invariant models, but there does not seem to be a great deal of difference between structural coefficients when the measurement model does or does not include separate estimates of subgroup parameters that differ across subgroups. Future research should include simulations in which the impact of various factors related to invariance is estimated.
Measurement of effective air diffusion coefficients for trichloroethene in undisturbed soil cores.
Bartelt-Hunt, Shannon L; Smith, James A
2002-06-01
In this study, we measure effective diffusion coefficients for trichloroethene in undisturbed soil samples taken from Picatinny Arsenal, New Jersey. The measured effective diffusion coefficients ranged from 0.0053 to 0.0609 cm2/s over a range of air-filled porosity of 0.23-0.49. The experimental data were compared to several previously published relations that predict diffusion coefficients as a function of air-filled porosity and porosity. A multiple linear regression analysis was developed to determine if a modification of the exponents in Millington's [Science 130 (1959) 100] relation would better fit the experimental data. The literature relations appeared to generally underpredict the effective diffusion coefficient for the soil cores studied in this work. Inclusion of a particle-size distribution parameter, d10, did not significantly improve the fit of the linear regression equation. The effective diffusion coefficient and porosity data were used to recalculate estimates of diffusive flux through the subsurface made in a previous study performed at the field site. It was determined that the method of calculation used in the previous study resulted in an underprediction of diffusive flux from the subsurface. We conclude that although Millington's [Science 130 (1959) 100] relation works well to predict effective diffusion coefficients in homogeneous soils with relatively uniform particle-size distributions, it may be inaccurate for many natural soils with heterogeneous structure and/or non-uniform particle-size distributions.
Smooth Scalar-on-Image Regression via Spatial Bayesian Variable Selection
Goldsmith, Jeff; Huang, Lei; Crainiceanu, Ciprian M.
2013-01-01
We develop scalar-on-image regression models when images are registered multidimensional manifolds. We propose a fast and scalable Bayes inferential procedure to estimate the image coefficient. The central idea is the combination of an Ising prior distribution, which controls a latent binary indicator map, and an intrinsic Gaussian Markov random field, which controls the smoothness of the nonzero coefficients. The model is fit using a single-site Gibbs sampler, which allows fitting within minutes for hundreds of subjects with predictor images containing thousands of locations. The code is simple and is provided in less than one page in the Appendix. We apply this method to a neuroimaging study where cognitive outcomes are regressed on measures of white matter microstructure at every voxel of the corpus callosum for hundreds of subjects. PMID:24729670
Park, Su San; Lee, Ju Yul; Cho, Sung-Il
2007-07-01
We investigated the validity of the dipstick method (Mossman Associates Inc. USA) and the expired CO method to distinguish between smokers and nonsmokers. We also elucidated the related factors of the two methods. This study included 244 smokers and 50 ex-smokers, recruited from smoking cessation clinics at 4 local public health centers, who had quit for over 4 weeks. We calculated the sensitivity, specificity and Kappa coefficient of each method for validity. We obtained ROC curve, predictive value and agreement to determine the cutoff of expired air CO method. Finally, we elucidated the related factors and compared their effect powers using the standardized regression coefficient. The dipstick method showed a sensitivity of 92.6%, specificity of 96.0% and Kappa coefficient of 0.79. The best cutoff value to distinguish smokers was 5-6 ppm. At 5 ppm, the expired CO method showed a sensitivity of 94.3%, specificity of 82.0% and Kappa coefficient of 0.73. And at 6 ppm, sensitivity, specificity and Kappa coefficient were 88.5%, 86.0% and 0.64, respectively. Therefore, the dipstick method had higher sensitivity and specificity than the expired CO method. The dipstick and expired CO methods were significantly increased with increasing smoking amount. With longer time since the last smoking, expired CO showed a rapid decrease after 4 hours, whereas the dipstick method showed relatively stable levels for more than 4 hours. The dipstick and expired CO methods were both good indicators for assessing smoking status. However, the former showed higher sensitivity and specificity and stable levels over longer hours after smoking, compared to the expired CO method.
Inbreeding coefficients and degree of consanguineous marriages in Spain: a review.
Fuster, Vicente; Colantonio, Sonia Edith
2003-01-01
The contribution of consanguineous marriages corresponding to uncle-niece or aunt-nephew (C12), first cousin (C22), first cousin once removed (C23), and second cousin (C33) to the inbreeding coefficient (alpha) was analyzed from a sample of Spanish areas and periods. Multiple regressions were performed taking as independent variables the different degrees of consanguinity previously selected (C12, C22, C23, and C33) and as dependent variable the inbreeding coefficient (alpha). According to the results obtained for any degree and period, rural frequencies always surpass urban. However, the pattern is similar in both areas. In the period where consanguinity was more elevated (1890-1929) the C22/C33 ratio increased. Its variation is not due to C22 and C33 changes in the same way. In rural areas, this ratio surpasses the expected value by a factor of 2-3, but in urban areas it was 7-10 times larger, in some cases due to migration. While in rural Spain the C33 frequency was approximately 1.5 times C22, in cities C22 was 1.5 times C33. The best fit among the various types of consanguineous matings and alpha involves a lineal relationship. Regardless of the number of variables contributing significantly to alpha, C22 matings are always present. Moreover, their standardized (beta) coefficients are the highest. The above indicates that this consanguineous relationship conditions the inbreeding coefficient the most. In the period of greater consanguinity, close relationships, uncle-niece C12, and first cousin once removed (C23) make a significant contribution to alpha. In rural Spain second cousins (C33) always significantly determined alpha; however, in cities the inbreeding variation was mainly due to C12 and C23. Copyright 2003 Wiley-Liss, Inc.
Development of a traveltime prediction equation for streams in Arkansas
Funkhouser, Jaysson E.; Barks, C. Shane
2004-01-01
During 1971 and 1981 and 2001 and 2003, traveltime measurements were made at 33 sample sites on 18 streams throughout northern and western Arkansas using fluorescent dye. Most measurements were made during steady-state base-flow conditions with the exception of three measurements made during near steady-state medium-flow conditions (for the study described in this report, medium-flow is approximately 100-150 percent of the mean monthly streamflow during the month the dye trace was conducted). These traveltime data were compared to the U.S. Geological Survey?s national traveltime prediction equation and used to develop a specific traveltime prediction equation for Arkansas streams. In general, the national traveltime prediction equation yielded results that over-predicted the velocity of the streams for 29 of the 33 sites measured. The standard error for the national traveltime prediction equation was 105 percent. The coefficient of determination was 0.78. The Arkansas prediction equation developed from a regression analysis of dye-tracing results was a significant improvement over the national prediction equation. This regression analysis yielded a standard error of 46 percent and a coefficient of determination of 0.74. The predicted velocities using this equation compared better to measured velocities. Using the variables in a regression analysis, the Arkansas prediction equation derived for the peak velocity in feet per second was: (Actual Equation Shown in report) In addition to knowing when the peak concentration will arrive at a site, it is of great interest to know when the leading edge of a contaminant plume will arrive. The traveltime of the leading edge of a contaminant plume indicates when a potential problem might first develop and also defines the overall shape of the concentration response function. Previous USGS reports have shown no significant relation between any of the variables and the time from injection to the arrival of the leading edge of the dye plume. For this report, the analysis of the dye-tracing data yielded a significant correlation between traveltime of the leading edge and traveltime of the peak concentration with an R2 value of 0.99. These data indicate that the traveltime of the leading edge can be estimated from: (Actual Equation Shown in Report)
ERIC Educational Resources Information Center
Tong, Fuhui
2006-01-01
Background: An extensive body of researches has favored the use of regression over other parametric analyses that are based on OVA. In case of noteworthy regression results, researchers tend to explore magnitude of beta weights for the respective predictors. Purpose: The purpose of this paper is to examine both beta weights and structure…
NASA Astrophysics Data System (ADS)
Mansouri, Edris; Feizi, Faranak; Jafari Rad, Alireza; Arian, Mehran
2018-03-01
This paper uses multivariate regression to create a mathematical model for iron skarn exploration in the Sarvian area, central Iran, using multivariate regression for mineral prospectivity mapping (MPM). The main target of this paper is to apply multivariate regression analysis (as an MPM method) to map iron outcrops in the northeastern part of the study area in order to discover new iron deposits in other parts of the study area. Two types of multivariate regression models using two linear equations were employed to discover new mineral deposits. This method is one of the reliable methods for processing satellite images. ASTER satellite images (14 bands) were used as unique independent variables (UIVs), and iron outcrops were mapped as dependent variables for MPM. According to the results of the probability value (p value), coefficient of determination value (R2) and adjusted determination coefficient (Radj2), the second regression model (which consistent of multiple UIVs) fitted better than other models. The accuracy of the model was confirmed by iron outcrops map and geological observation. Based on field observation, iron mineralization occurs at the contact of limestone and intrusive rocks (skarn type).
Estimation Methods for Non-Homogeneous Regression - Minimum CRPS vs Maximum Likelihood
NASA Astrophysics Data System (ADS)
Gebetsberger, Manuel; Messner, Jakob W.; Mayr, Georg J.; Zeileis, Achim
2017-04-01
Non-homogeneous regression models are widely used to statistically post-process numerical weather prediction models. Such regression models correct for errors in mean and variance and are capable to forecast a full probability distribution. In order to estimate the corresponding regression coefficients, CRPS minimization is performed in many meteorological post-processing studies since the last decade. In contrast to maximum likelihood estimation, CRPS minimization is claimed to yield more calibrated forecasts. Theoretically, both scoring rules used as an optimization score should be able to locate a similar and unknown optimum. Discrepancies might result from a wrong distributional assumption of the observed quantity. To address this theoretical concept, this study compares maximum likelihood and minimum CRPS estimation for different distributional assumptions. First, a synthetic case study shows that, for an appropriate distributional assumption, both estimation methods yield to similar regression coefficients. The log-likelihood estimator is slightly more efficient. A real world case study for surface temperature forecasts at different sites in Europe confirms these results but shows that surface temperature does not always follow the classical assumption of a Gaussian distribution. KEYWORDS: ensemble post-processing, maximum likelihood estimation, CRPS minimization, probabilistic temperature forecasting, distributional regression models
Guenole, Nigel; Brown, Anna
2014-01-01
We report a Monte Carlo study examining the effects of two strategies for handling measurement non-invariance – modeling and ignoring non-invariant items – on structural regression coefficients between latent variables measured with item response theory models for categorical indicators. These strategies were examined across four levels and three types of non-invariance – non-invariant loadings, non-invariant thresholds, and combined non-invariance on loadings and thresholds – in simple, partial, mediated and moderated regression models where the non-invariant latent variable occupied predictor, mediator, and criterion positions in the structural regression models. When non-invariance is ignored in the latent predictor, the focal group regression parameters are biased in the opposite direction to the difference in loadings and thresholds relative to the referent group (i.e., lower loadings and thresholds for the focal group lead to overestimated regression parameters). With criterion non-invariance, the focal group regression parameters are biased in the same direction as the difference in loadings and thresholds relative to the referent group. While unacceptable levels of parameter bias were confined to the focal group, bias occurred at considerably lower levels of ignored non-invariance than was previously recognized in referent and focal groups. PMID:25278911
Wagner, Daniel M.; Krieger, Joshua D.; Veilleux, Andrea G.
2016-08-04
In 2013, the U.S. Geological Survey initiated a study to update regional skew, annual exceedance probability discharges, and regional regression equations used to estimate annual exceedance probability discharges for ungaged locations on streams in the study area with the use of recent geospatial data, new analytical methods, and available annual peak-discharge data through the 2013 water year. An analysis of regional skew using Bayesian weighted least-squares/Bayesian generalized-least squares regression was performed for Arkansas, Louisiana, and parts of Missouri and Oklahoma. The newly developed constant regional skew of -0.17 was used in the computation of annual exceedance probability discharges for 281 streamgages used in the regional regression analysis. Based on analysis of covariance, four flood regions were identified for use in the generation of regional regression models. Thirty-nine basin characteristics were considered as potential explanatory variables, and ordinary least-squares regression techniques were used to determine the optimum combinations of basin characteristics for each of the four regions. Basin characteristics in candidate models were evaluated based on multicollinearity with other basin characteristics (variance inflation factor < 2.5) and statistical significance at the 95-percent confidence level (p ≤ 0.05). Generalized least-squares regression was used to develop the final regression models for each flood region. Average standard errors of prediction of the generalized least-squares models ranged from 32.76 to 59.53 percent, with the largest range in flood region D. Pseudo coefficients of determination of the generalized least-squares models ranged from 90.29 to 97.28 percent, with the largest range also in flood region D. The regional regression equations apply only to locations on streams in Arkansas where annual peak discharges are not substantially affected by regulation, diversion, channelization, backwater, or urbanization. The applicability and accuracy of the regional regression equations depend on the basin characteristics measured for an ungaged location on a stream being within range of those used to develop the equations.
Quantifying relative importance: Computing standardized effects in models with binary outcomes
Grace, James B.; Johnson, Darren; Lefcheck, Jonathan S.; Byrnes, Jarrett E.K.
2018-01-01
Results from simulation studies show that both the LT and OE methods of standardization support a similarly-broad range of coefficient comparisons. The LT method estimates effects that reflect underlying latent-linear propensities, while the OE method computes a linear approximation for the effects of predictors on binary responses. The contrast between assumptions for the two methods is reflected in persistently weaker standardized effects associated with OE standardization. Reliance on standard deviations for standardization (the traditional approach) is critically examined and shown to introduce substantial biases when predictors are non-Gaussian. The use of relevant ranges in place of standard deviations has the capacity to place LT and OE standardized coefficients on a more comparable scale. As ecologists address increasingly complex hypotheses, especially those that involve comparing the influences of different controlling factors (e.g., top-down versus bottom-up or biotic versus abiotic controls), comparable coefficients become a necessary component for evaluations.
Estimating the magnitude of peak flows for streams in Kentucky for selected recurrence intervals
Hodgkins, Glenn A.; Martin, Gary R.
2003-01-01
This report gives estimates of, and presents techniques for estimating, the magnitude of peak flows for streams in Kentucky for recurrence intervals of 2, 5, 10, 25, 50, 100, 200, and 500 years. A flowchart in this report guides the user to the appropriate estimates and (or) estimating techniques for a site on a specific stream. Estimates of peak flows are given for 222 U.S. Geological Survey streamflow-gaging stations in Kentucky. In the development of the peak-flow estimates at gaging stations, a new generalized skew coefficient was calculated for the State. This single statewide value of 0.011 (with a standard error of prediction of 0.520) is more appropriate for Kentucky than the national skew isoline map in Bulletin 17B of the Interagency Advisory Committee on Water Data. Regression equations are presented for estimating the peak flows on ungaged, unregulated streams in rural drainage basins. The equations were developed by use of generalized-least-squares regression procedures at 187 U.S. Geological Survey gaging stations in Kentucky and 51 stations in surrounding States. Kentucky was divided into seven flood regions. Total drainage area is used in the final regression equations as the sole explanatory variable, except in Regions 1 and 4 where main-channel slope also was used. The smallest average standard errors of prediction were in Region 3 (from -13.1 to +15.0 percent) and the largest average standard errors of prediction were in Region 5 (from -37.6 to +60.3 percent). One section of this report describes techniques for estimating peak flows for ungaged sites on gaged, unregulated streams in rural drainage basins. Another section references two previous U.S. Geological Survey reports for peak-flow estimates on ungaged, unregulated, urban streams. Estimating peak flows at ungaged sites on regulated streams is beyond the scope of this report, because peak flows on regulated streams are dependent upon variable human activities.
van Mierlo, Trevor; Hyatt, Douglas; Ching, Andrew T
2016-01-01
Digital Health Social Networks (DHSNs) are common; however, there are few metrics that can be used to identify participation inequality. The objective of this study was to investigate whether the Gini coefficient, an economic measure of statistical dispersion traditionally used to measure income inequality, could be employed to measure DHSN inequality. Quarterly Gini coefficients were derived from four long-standing DHSNs. The combined data set included 625,736 posts that were generated from 15,181 actors over 18,671 days. The range of actors (8-2323), posts (29-28,684), and Gini coefficients (0.15-0.37) varied. Pearson correlations indicated statistically significant associations between number of actors and number of posts (0.527-0.835, p < .001), and Gini coefficients and number of posts (0.342-0.725, p < .001). However, the association between Gini coefficient and number of actors was only statistically significant for the addiction networks (0.619 and 0.276, p < .036). Linear regression models had positive but mixed R 2 results (0.333-0.527). In all four regression models, the association between Gini coefficient and posts was statistically significant ( t = 3.346-7.381, p < .002). However, unlike the Pearson correlations, the association between Gini coefficient and number of actors was only statistically significant in the two mental health networks ( t = -4.305 and -5.934, p < .000). The Gini coefficient is helpful in measuring shifts in DHSN inequality. However, as a standalone metric, the Gini coefficient does not indicate optimal numbers or ratios of actors to posts, or effective network engagement. Further, mixed-methods research investigating quantitative performance metrics is required.
How to compare cross-lagged associations in a multilevel autoregressive model.
Schuurman, Noémi K; Ferrer, Emilio; de Boer-Sonnenschein, Mieke; Hamaker, Ellen L
2016-06-01
By modeling variables over time it is possible to investigate the Granger-causal cross-lagged associations between variables. By comparing the standardized cross-lagged coefficients, the relative strength of these associations can be evaluated in order to determine important driving forces in the dynamic system. The aim of this study was twofold: first, to illustrate the added value of a multilevel multivariate autoregressive modeling approach for investigating these associations over more traditional techniques; and second, to discuss how the coefficients of the multilevel autoregressive model should be standardized for comparing the strength of the cross-lagged associations. The hierarchical structure of multilevel multivariate autoregressive models complicates standardization, because subject-based statistics or group-based statistics can be used to standardize the coefficients, and each method may result in different conclusions. We argue that in order to make a meaningful comparison of the strength of the cross-lagged associations, the coefficients should be standardized within persons. We further illustrate the bivariate multilevel autoregressive model and the standardization of the coefficients, and we show that disregarding individual differences in dynamics can prove misleading, by means of an empirical example on experienced competence and exhaustion in persons diagnosed with burnout. (PsycINFO Database Record (c) 2016 APA, all rights reserved).
Influence of eye micromotions on spatially resolved refractometry
NASA Astrophysics Data System (ADS)
Chyzh, Igor H.; Sokurenko, Vyacheslav M.; Osipova, Irina Y.
2001-01-01
The influence eye micromotions on the accuracy of estimation of Zernike coefficients form eye transverse aberration measurements was investigated. By computer modeling, the following found eye aberrations have been examined: defocusing, primary astigmatism, spherical aberration of the 3rd and the 5th orders, as well as their combinations. It was determined that the standard deviation of estimated Zernike coefficients is proportional to the standard deviation of angular eye movements. Eye micromotions cause the estimation errors of Zernike coefficients of present aberrations and produce the appearance of Zernike coefficients of aberrations, absent in the eye. When solely defocusing is present, the biggest errors, cased by eye micromotions, are obtained for aberrations like coma and astigmatism. In comparison with other aberrations, spherical aberration of the 3rd and the 5th orders evokes the greatest increase of the standard deviation of other Zernike coefficients.
Waltemeyer, Scott D.
2006-01-01
Estimates of the magnitude and frequency of peak discharges are necessary for the reliable flood-hazard mapping in the Navajo Nation in Arizona, Utah, Colorado, and New Mexico. The Bureau of Indian Affairs, U.S. Army Corps of Engineers, and Navajo Nation requested that the U.S. Geological Survey update estimates of peak discharge magnitude for gaging stations in the region and update regional equations for estimation of peak discharge and frequency at ungaged sites. Equations were developed for estimating the magnitude of peak discharges for recurrence intervals of 2, 5, 10, 25, 50, 100, and 500 years at ungaged sites using data collected through 1999 at 146 gaging stations, an additional 13 years of peak-discharge data since a 1997 investigation, which used gaging-station data through 1986. The equations for estimation of peak discharges at ungaged sites were developed for flood regions 8, 11, high elevation, and 6 and are delineated on the basis of the hydrologic codes from the 1997 investigation. Peak discharges for selected recurrence intervals were determined at gaging stations by fitting observed data to a log-Pearson Type III distribution with adjustments for a low-discharge threshold and a zero skew coefficient. A low-discharge threshold was applied to frequency analysis of 82 of the 146 gaging stations. This application provides an improved fit of the log-Pearson Type III frequency distribution. Use of the low-discharge threshold generally eliminated the peak discharge having a recurrence interval of less than 1.4 years in the probability-density function. Within each region, logarithms of the peak discharges for selected recurrence intervals were related to logarithms of basin and climatic characteristics using stepwise ordinary least-squares regression techniques for exploratory data analysis. Generalized least-squares regression techniques, an improved regression procedure that accounts for time and spatial sampling errors, then was applied to the same data used in the ordinary least-squares regression analyses. The average standard error of prediction for a peak discharge have a recurrence interval of 100-years for region 8 was 53 percent (average) for the 100-year flood. The average standard of prediction, which includes average sampling error and average standard error of regression, ranged from 45 to 83 percent for the 100-year flood. Estimated standard error of prediction for a hybrid method for region 11 was large in the 1997 investigation. No distinction of floods produced from a high-elevation region was presented in the 1997 investigation. Overall, the equations based on generalized least-squares regression techniques are considered to be more reliable than those in the 1997 report because of the increased length of record and improved GIS method. Techniques for transferring flood-frequency relations to ungaged sites on the same stream can be estimated at an ungaged site by a direct application of the regional regression equation or at an ungaged site on a stream that has a gaging station upstream or downstream by using the drainage-area ratio and the drainage-area exponent from the regional regression equation of the respective region.
The Study of Rain Specific Attenuation for the Prediction of Satellite Propagation in Malaysia
NASA Astrophysics Data System (ADS)
Mandeep, J. S.; Ng, Y. Y.; Abdullah, H.; Abdullah, M.
2010-06-01
Specific attenuation is the fundamental quantity in the calculation of rain attenuation for terrestrial path and slant paths representing as rain attenuation per unit distance (dB/km). Specific attenuation is an important element in developing the predicted rain attenuation model. This paper deals with the empirical determination of the power law coefficients which allow calculating the specific attenuation in dB/km from the knowledge of the rain rate in mm/h. The main purpose of the paper is to obtain the coefficients of k and α of power law relationship between specific attenuation. Three years (from 1st January 2006 until 31st December 2008) rain gauge and beacon data taken from USM, Nibong Tebal have been used to do the empirical procedure analysis of rain specific attenuation. The data presented are semi-empirical in nature. A year-to-year variation of the coefficients has been indicated and the empirical measured data was compared with ITU-R provided regression coefficient. The result indicated that the USM empirical measured data was significantly vary from ITU-R predicted value. Hence, ITU-R recommendation for regression coefficients of rain specific attenuation is not suitable for predicting rain attenuation at Malaysia.
Joseph, Mini; Gupta, Riddhi Das; Prema, L; Inbakumari, Mercy; Thomas, Nihal
2017-01-01
The accuracy of existing predictive equations to determine the resting energy expenditure (REE) of professional weightlifters remains scarcely studied. Our study aimed at assessing the REE of male Asian Indian weightlifters with indirect calorimetry and to compare the measured REE (mREE) with published equations. A new equation using potential anthropometric variables to predict REE was also evaluated. REE was measured on 30 male professional weightlifters aged between 17 and 28 years using indirect calorimetry and compared with the eight formulas predicted by Harris-Benedicts, Mifflin-St. Jeor, FAO/WHO/UNU, ICMR, Cunninghams, Owen, Katch-McArdle, and Nelson. Pearson correlation coefficient, intraclass correlation coefficient, and multiple linear regression analysis were carried out to study the agreement between the different methods, association with anthropometric variables, and to formulate a new prediction equation for this population. Pearson correlation coefficients between mREE and the anthropometric variables showed positive significance with suprailiac skinfold thickness, lean body mass (LBM), waist circumference, hip circumference, bone mineral mass, and body mass. All eight predictive equations underestimated the REE of the weightlifters when compared with the mREE. The highest mean difference was 636 kcal/day (Owen, 1986) and the lowest difference was 375 kcal/day (Cunninghams, 1980). Multiple linear regression done stepwise showed that LBM was the only significant determinant of REE in this group of sportspersons. A new equation using LBM as the independent variable for calculating REE was computed. REE for weightlifters = -164.065 + 0.039 (LBM) (confidence interval -1122.984, 794.854]. This new equation reduced the mean difference with mREE by 2.36 + 369.15 kcal/day (standard error = 67.40). The significant finding of this study was that all the prediction equations underestimated the REE. The LBM was the sole determinant of REE in this population. In the absence of indirect calorimetry, the REE equation developed by us using LBM is a better predictor for calculating REE of professional male weightlifters of this region.
Hooper, C; De Souto Barreto, P; Payoux, P; Salabert, A S; Guyonnet, S; Andrieu, S; Sourdet, S; Delrieu, J; Vellas, B
2017-01-01
We examined the relationships between erythrocyte membrane monounsaturated fatty acids (MUFAs) and saturated fatty acids (SFAs) and cortical β-amyloid (Aβ) load in older adults reporting subjective memory complaints. This is a cross-sectional study using data from the Multidomain Alzheimer Preventive Trial (MAPT); a randomised controlled trial. French community dwellers aged 70 or over reporting subjective memory complaints, but free from a diagnosis of clinical dementia. Participants of this study were 61 individuals from the placebo arm of the MAPT trial with data on erythrocyte membrane fatty acid levels and cortical Aβ load. Cortical-to-cerebellar standard uptake value ratios were assessed using [18F] florbetapir positron emission tomography (PET). Fatty acids were measured in erythrocyte cell membranes using gas chromatography. Associations between erythrocyte membrane MUFAs and SFAs and cortical Aβ load were explored using adjusted multiple linear regression models and were considered significant at p ≤ 0.005 (10 comparisons) after correction for multiple testing. We found no significant associations between fatty acids and cortical Aβ load using multiple linear regression adjusted for age, sex, education, cognition, PET-scan to clinical assessment interval, PET-scan to blood collection interval and apolipoprotein E (ApoE) status. The association closest to significance was that between erythrocyte membrane stearic acid and Aβ (B-coefficient 0.03, 95 % CI: 0.00,0.05, p = 0.05). This association, although statistically non-significant, appeared to be stronger amongst ApoE ε4 carriers (B-coefficient 0.04, 95 % CI: -0.01,0.09, p = 0.08) compared to ApoE ε4 non-carriers (B-coefficient 0.02, 95 % CI: -0.01,0.05, p = 0.18) in age and sex stratified analysis. Future research in the form of large longitudinal observational study is needed to validate our findings, particularly regarding the potential association of stearic acid with cortical Aβ.
Jia, Guang; O'Dell, Craig; Heverhagen, Johannes T; Yang, Xiangyu; Liang, Jiachao; Jacko, Richard V; Sammet, Steffen; Pellas, Theodore; Cole, Patricia; Knopp, Michael V
2008-09-01
To describe and determine the reproducibility of a simplified model to quantitatively measure heterogeneous intralesion contrast agent diffusion in colorectal liver metastases. This HIPAA-compliant retrospective study received institutional review board approval, and written informed consent was obtained from 14 patients (mean age, 61 years +/- 9 [standard deviation]; range, 41-78 years), including 10 men (mean age, 65 years +/- 8; range, 47-78 years) and four women (mean age, 54 years +/- 9; range, 41-59 years), with colorectal liver metastases. Magnetic resonance (MR) imaging was performed twice (first baseline MR image [B(1)] and second baseline MR image [B(2)]) in a single target lesion prior to therapy. Dynamic contrast material-enhanced MR imaging was performed by using a saturation-recovery fast gradient-echo sequence. A simplified contrast agent diffusion model was proposed, and a contrast agent diffusion coefficient (CDC) was calculated. The reproducibility of the CDC measurement was evaluated by using the Bland-Altman plot and a linear regression model. The mean CDC was 0.22 mm(2)/sec (range, 0.01-0.73 mm(2)/sec) on B(1) and 0.24 mm(2)/sec (range, 0.01-0.71 mm(2)/sec) on B(2), with an intraclass correlation coefficient of 0.91 (P < .0001). Bland-Altman plot showed good agreement, with a mean difference in measurement pairs of 0.017 mm(2)/sec +/- 0.096. The slope from the linear regression model was 0.89 (95% confidence interval: 0.63, 1.15) and the intercept was 0.01 (95% confidence interval: -0.08, 0.09). The CDC enables a quantitative description of contrast enhancement heterogeneity in lesions. Given the high reproducibility of the CDC metric, CDC appears promising for further qualification as an imaging biomarker of change measurement in response assessment. http://radiology.rsnajnls.org/cgi/content/full/248/3/901/DC1. RSNA, 2008
Merchantable sawlog and bole-length equations for the Northeastern United States
Daniel A. Yaussy; Martin E. Dale; Martin E. Dale
1991-01-01
A modified Richards growth model is used to develop species-specific coefficients for equations estimating the merchantable sawlog and bole lengths of trees from 25 species groups common to the Northeastern United States. These regression coefficients have been incorporated into the growth-and-yield simulation software, NE-TWIGS.
Jokovic, Aleksandra; Locker, David; Guyatt, Gordan
2006-01-01
Background The Child Perceptions Questionnaire for children aged 11 to 14 years (CPQ11–14) is a 37-item measure of oral-health-related quality of life (OHRQoL) encompassing four domains: oral symptoms, functional limitations, emotional and social well-being. To facilitate its use in clinical settings and population-based health surveys, it was shortened to 16 and 8 items. Item impact and stepwise regression methods were used to produce each version. This paper describes the developmental process, compares the discriminative properties of the resulting four short-forms and evaluates their precision relative to the original CPQ11–14. Methods The item impact method used data from the CPQ11–14 item reduction study to select the questions with the highest impact scores in each domain. The regression method, where the dependent variable was the overall CPQ11–14 score and the independent variables its individual questions, was applied to the data collected in the validity study for the CPQ11–14. The measurement properties (i.e. criterion validity, construct validity, internal consistency reliability and test-retest reliability) of all 4 short-forms were evaluated using the data from the validity and reliability studies for the CPQ11–14. Results All short forms detected substantial variability in children's OHRQoL. The mean scores on the two 16-item questionnaires were almost identical, while on the two 8-item questionnaires they differed by only one score point. The mean scores standardized to 0–100 were higher on the short forms than the original CPQ11–14 (p < 0.001). There were strong significant correlations between all short-form scores and CPQ11–14 scores (0.87–0.98; p < 0.001). Hypotheses concerning construct validity were confirmed: the short-forms' scores were highest in the oro-facial, lower in the orthodontic and lowest in the paediatric dentistry group; all short-form questionnaires were positively correlated with the ratings of oral health and overall well-being, with the correlation coefficient being higher for the latter. The relative validity coefficients were 0.85 to 1.18. Cronbach's alpha and intraclass correlation coefficients ranged 0.71–0.83 and 0.71–0.77, respectively. Conclusion All short forms demonstrated excellent criterion validity and good construct validity. The reliability coefficients exceeded standards for group-level comparisons. However, these are preliminary findings based on the convenience sampling and further testing in replicated studies involving clinical and general samples of children in various settings is necessary to establish measurement sensitivity and discriminative properties of these questionnaires. PMID:16423298
Advanced statistics: linear regression, part II: multiple linear regression.
Marill, Keith A
2004-01-01
The applications of simple linear regression in medical research are limited, because in most situations, there are multiple relevant predictor variables. Univariate statistical techniques such as simple linear regression use a single predictor variable, and they often may be mathematically correct but clinically misleading. Multiple linear regression is a mathematical technique used to model the relationship between multiple independent predictor variables and a single dependent outcome variable. It is used in medical research to model observational data, as well as in diagnostic and therapeutic studies in which the outcome is dependent on more than one factor. Although the technique generally is limited to data that can be expressed with a linear function, it benefits from a well-developed mathematical framework that yields unique solutions and exact confidence intervals for regression coefficients. Building on Part I of this series, this article acquaints the reader with some of the important concepts in multiple regression analysis. These include multicollinearity, interaction effects, and an expansion of the discussion of inference testing, leverage, and variable transformations to multivariate models. Examples from the first article in this series are expanded on using a primarily graphic, rather than mathematical, approach. The importance of the relationships among the predictor variables and the dependence of the multivariate model coefficients on the choice of these variables are stressed. Finally, concepts in regression model building are discussed.
Determination of whey adulteration in milk powder by using laser induced breakdown spectroscopy.
Bilge, Gonca; Sezer, Banu; Eseller, Kemal Efe; Berberoglu, Halil; Topcu, Ali; Boyaci, Ismail Hakki
2016-12-01
A rapid and in situ method has been developed to detect and quantify adulterated milk powder through adding whey powder by using laser induced breakdown spectroscopy (LIBS). The methodology is based on elemental composition differences between milk and whey products. Milk powder, sweet and acid whey powders were produced as standard samples, and milk powder was adulterated with whey powders. Based on LIBS spectra of standard samples and commercial products, species was identified using principle component analysis (PCA) method, and discrimination rate of milk and whey powders was found as 80.5%. Calibration curves were obtained with partial least squares regression (PLS). Correlation coefficient (R(2)) and limit of detection (LOD) values were 0.981 and 1.55% for adulteration with sweet whey powder, and 0.985 and 0.55% for adulteration with acid whey powder, respectively. The results were found to be consistent with the data from inductively coupled plasma - mass spectrometer (ICP-MS) method. Copyright © 2016 Elsevier Ltd. All rights reserved.
Projection of incidence rates to a larger population using ecologic variables.
Frey, C M; Feuer, E J; Timmel, M J
1994-09-15
There is wide acceptance of direct standardization of vital rates to adjust for differing age distributions according to the representation within age categories of some referent population. One can use a similar process to standardize, and subsequently project vital rates with respect to continuous, or ratio scale ecologic variables. We obtained from the National Cancer Institute's Surveillance, Epidemiology and End Results (SEER) programme, a 10 per cent subset of the total U.S. population, country-level breast cancer incidence during 1987-1989 for white women aged 50 and over. We applied regression coefficients that relate ecologic factors to SEER incidence to the full national complement of county-level information to produce an age and ecologic factor adjusted rate that may be more representative of the U.S. than the simple age-adjusted SEER incidence. We conducted a validation study using breast cancer mortality data available for the entire U.S. and which supports the appropriateness of this method for projecting rates.
Near Infrared Spectrometry of Clinically Significant Fatty Acids Using Multicomponent Regression
NASA Astrophysics Data System (ADS)
Kalinin, A. V.; Krasheninnikov, V. N.; Sviridov, A. P.; Titov, V. N.
2016-11-01
We have developed methods for determining the content of clinically important fatty acids (FAs), primarily saturated palmitic acid, monounsaturated oleic acid, and the sum of polyenoic fatty acids (eicosapentaenoic + docosahexaenoic), in oily media (food products and supplements, fish oils) using different types of near infrared (NIR) spectrometers: Fourier-transform, linear photodiode array, and Raman. Based on a calibration method (regression) by means of projections to latent structures, using standard samples of oil and fat mixtures, we have confirmed the feasibility of reliable and selective quantitative analysis of the above-indicated fatty acids. As a result of comparing the calibration models for Fourier-transform spectrometers in different parts of the NIR range (based on different overtones and combinations of fatty acid absorption), we have provided a basis for selection of the spectral range for a portable linear InGaAs-photodiode array spectrometer. In testing the calibrations of a linear InGaAs-photodiode array spectrometer which is a prototype for a portable instrument, for palmitic and oleic acids and also the sum of the polyenoic fatty acids we have achieved a multiple correlation coefficient of 0.89, 0.85, and 0.96 and a standard error of 0.53%, 1.43%, and 0.39% respectively. We have confirmed the feasibility of using Raman spectra to determine the content of the above-indicated fatty acids in media where water is present.
Toledo-Martín, Eva María; Font, Rafael; Obregón-Cano, Sara; De Haro-Bailón, Antonio; Villatoro-Pulido, Myriam; Del Río-Celestino, Mercedes
2017-05-20
The potential of visible-near infrared spectroscopy to predict glucosinolates and total phenolic content in rocket ( Eruca vesicaria ) leaves has been evaluated. Accessions of the E. vesicaria species were scanned by NIRS as ground leaf, and their reference values regressed against different spectral transformations by modified partial least squares (MPLS) regression. The coefficients of determination in the external validation (R²VAL) for the different quality components analyzed in rocket ranged from 0.59 to 0.84, which characterize those equations as having from good to excellent quantitative information. These results show that the total glucosinolates, glucosativin and glucoerucin equations obtained, can be used to identify those samples with low and high contents. The glucoraphanin equation obtained can be used for rough predictions of samples and in case of total phenolic content, the equation showed good correlation. The standard deviation (SD) to standard error of prediction ratio (RPD) and SD to range (RER) were variable for the different quality compounds and showed values that were characteristic of equations suitable for screening purposes or to perform accurate analyses. From the study of the MPLS loadings of the first three terms of the different equations, it can be concluded that some major cell components such as protein and cellulose, highly participated in modelling the equations for glucosinolates.
NASA Astrophysics Data System (ADS)
ul-Haq, Zia; Rana, Asim Daud; Tariq, Salman; Mahmood, Khalid; Ali, Muhammad; Bashir, Iqra
2018-03-01
We have applied regression analyses for the modeling of tropospheric NO2 (tropo-NO2) as the function of anthropogenic nitrogen oxides (NOx) emissions, aerosol optical depth (AOD), and some important meteorological parameters such as temperature (Temp), precipitation (Preci), relative humidity (RH), wind speed (WS), cloud fraction (CLF) and outgoing long-wave radiation (OLR) over different climatic zones and land use/land cover types in South Asia during October 2004-December 2015. Simple linear regression shows that, over South Asia, tropo-NO2 variability is significantly linked to AOD, WS, NOx, Preci and CLF. Also zone-5, consisting of tropical monsoon areas of eastern India and Myanmar, is the only study zone over which all the selected parameters show their influence on tropo-NO2 at statistical significance levels. In stepwise multiple linear modeling, tropo-NO2 column over landmass of South Asia, is significantly predicted by the combination of RH (standardized regression coefficient, β = - 49), AOD (β = 0.42) and NOx (β = 0.25). The leading predictors of tropo-NO2 columns over zones 1-5 are OLR, AOD, Temp, OLR, and RH respectively. Overall, as revealed by the higher correlation coefficients (r), the multiple regressions provide reasonable models for tropo-NO2 over South Asia (r = 0.82), zone-4 (r = 0.90) and zone-5 (r = 0.93). The lowest r (of 0.66) has been found for hot semi-arid region in northwestern Indus-Ganges Basin (zone-2). The highest value of β for urban area AOD (of 0.42) is observed for megacity Lahore, located in warm semi-arid zone-2 with large scale crop-residue burning, indicating strong influence of aerosols on the modeled tropo-NO2 column. A statistical significant correlation (r = 0.22) at the 0.05 level is found between tropo-NO2 and AOD over Lahore. Also NOx emissions appear as the highest contributor (β = 0.59) for modeled tropo-NO2 column over megacity Dhaka.
Poursafa, Parinaz; Baradaran-Mahdavi, Sadegh; Moradi, Bita; Haghjooy Javanmard, Shaghayegh; Tajadini, Mohammadhasan; Mehrabian, Ferdous; Kelishadi, Roya
2016-04-01
This study aims to investigate the association of exposure to ambient air pollution during pregnancy with cord blood concentrations of surrogate markers of endothelial dysfunction. This population-based cohort was conducted from March 2014 to March 2015 among 250 mother-neonate pairs in urban areas of Isfahan, the second large and air-polluted city in Iran. We analyzed the association between the ambient carbon monoxide (CO), ozone (O3), nitrogen dioxide (NO2), sulfur dioxide (SO2), particular matter 10 (PM10), and air quality index (AQI) with cord blood levels of endothelin-1, vascular adhesion molecule (VCAM), and intercellular adhesion molecule (ICAM). Multiple regression analysis was conducted after adjustment for potential confounding factors and covariates. The regression coefficient (beta), standard error of the estimate (SE), and 95% confidence intervals for each regression coefficient (95% CI) are reported. Data of 233 mother-neonate pairs were complete, and included in the analysis. Multiple regression analyses showed that AQI, CO and O3 had significant correlation with cord blood ICAM-1 [Beta (SE), 95%CI: 2.93 (0.72), 1.33,5.54; 2.28(1.44), 1.56,5.12; and 2.02(0.01), 1.03,2.04, respectively] as well as with VCAM-1 [2.78(0.91), 1.69,4.57; 2.47(1.47), 1.43,5.37; and 2.01(0.01),1.07,2.04, respectively]. AQI, PM10, and SO2 were significantly associated with Endothelin-1 concentrations [Beta (SE), 95%CI: 10.16(5.08),7.61,14.28; 9.70(3.46), 2.88,16.52; and 1.07(0.02), 1.03,2.11, respectively]. The significant associations of air pollutants with markers of endothelial dysfunction during fetal period may provide another evidence on the adverse health effects of air pollutants on early stages of atherosclerosis from fetal period. Our findings underscore the importance of considering environmental factors in primordial prevention of chronic diseases. Copyright © 2015 Elsevier Inc. All rights reserved.
Keshavarzi, Sareh; Ayatollahi, Seyyed Mohammad Taghi; Zare, Najaf; Pakfetrat, Maryam
2012-01-01
BACKGROUND. In many studies with longitudinal data, time-dependent covariates can only be measured intermittently (not at all observation times), and this presents difficulties for standard statistical analyses. This situation is common in medical studies, and methods that deal with this challenge would be useful. METHODS. In this study, we performed the seemingly unrelated regression (SUR) based models, with respect to each observation time in longitudinal data with intermittently observed time-dependent covariates and further compared these models with mixed-effect regression models (MRMs) under three classic imputation procedures. Simulation studies were performed to compare the sample size properties of the estimated coefficients for different modeling choices. RESULTS. In general, the proposed models in the presence of intermittently observed time-dependent covariates showed a good performance. However, when we considered only the observed values of the covariate without any imputations, the resulted biases were greater. The performances of the proposed SUR-based models in comparison with MRM using classic imputation methods were nearly similar with approximately equal amounts of bias and MSE. CONCLUSION. The simulation study suggests that the SUR-based models work as efficiently as MRM in the case of intermittently observed time-dependent covariates. Thus, it can be used as an alternative to MRM.
Cuffless and Continuous Blood Pressure Estimation from the Heart Sound Signals
Peng, Rong-Chao; Yan, Wen-Rong; Zhang, Ning-Ling; Lin, Wan-Hua; Zhou, Xiao-Lin; Zhang, Yuan-Ting
2015-01-01
Cardiovascular disease, like hypertension, is one of the top killers of human life and early detection of cardiovascular disease is of great importance. However, traditional medical devices are often bulky and expensive, and unsuitable for home healthcare. In this paper, we proposed an easy and inexpensive technique to estimate continuous blood pressure from the heart sound signals acquired by the microphone of a smartphone. A cold-pressor experiment was performed in 32 healthy subjects, with a smartphone to acquire heart sound signals and with a commercial device to measure continuous blood pressure. The Fourier spectrum of the second heart sound and the blood pressure were regressed using a support vector machine, and the accuracy of the regression was evaluated using 10-fold cross-validation. Statistical analysis showed that the mean correlation coefficients between the predicted values from the regression model and the measured values from the commercial device were 0.707, 0.712, and 0.748 for systolic, diastolic, and mean blood pressure, respectively, and that the mean errors were less than 5 mmHg, with standard deviations less than 8 mmHg. These results suggest that this technique is of potential use for cuffless and continuous blood pressure monitoring and it has promising application in home healthcare services. PMID:26393591
Cuffless and Continuous Blood Pressure Estimation from the Heart Sound Signals.
Peng, Rong-Chao; Yan, Wen-Rong; Zhang, Ning-Ling; Lin, Wan-Hua; Zhou, Xiao-Lin; Zhang, Yuan-Ting
2015-09-17
Cardiovascular disease, like hypertension, is one of the top killers of human life and early detection of cardiovascular disease is of great importance. However, traditional medical devices are often bulky and expensive, and unsuitable for home healthcare. In this paper, we proposed an easy and inexpensive technique to estimate continuous blood pressure from the heart sound signals acquired by the microphone of a smartphone. A cold-pressor experiment was performed in 32 healthy subjects, with a smartphone to acquire heart sound signals and with a commercial device to measure continuous blood pressure. The Fourier spectrum of the second heart sound and the blood pressure were regressed using a support vector machine, and the accuracy of the regression was evaluated using 10-fold cross-validation. Statistical analysis showed that the mean correlation coefficients between the predicted values from the regression model and the measured values from the commercial device were 0.707, 0.712, and 0.748 for systolic, diastolic, and mean blood pressure, respectively, and that the mean errors were less than 5 mmHg, with standard deviations less than 8 mmHg. These results suggest that this technique is of potential use for cuffless and continuous blood pressure monitoring and it has promising application in home healthcare services.
Gianola, Daniel; Fariello, Maria I.; Naya, Hugo; Schön, Chris-Carolin
2016-01-01
Standard genome-wide association studies (GWAS) scan for relationships between each of p molecular markers and a continuously distributed target trait. Typically, a marker-based matrix of genomic similarities among individuals (G) is constructed, to account more properly for the covariance structure in the linear regression model used. We show that the generalized least-squares estimator of the regression of phenotype on one or on m markers is invariant with respect to whether or not the marker(s) tested is(are) used for building G, provided variance components are unaffected by exclusion of such marker(s) from G. The result is arrived at by using a matrix expression such that one can find many inverses of genomic relationship, or of phenotypic covariance matrices, stemming from removing markers tested as fixed, but carrying out a single inversion. When eigenvectors of the genomic relationship matrix are used as regressors with fixed regression coefficients, e.g., to account for population stratification, their removal from G does matter. Removal of eigenvectors from G can have a noticeable effect on estimates of genomic and residual variances, so caution is needed. Concepts were illustrated using genomic data on 599 wheat inbred lines, with grain yield as target trait, and on close to 200 Arabidopsis thaliana accessions. PMID:27520956
Tukiendorf, Andrzej; Mansournia, Mohammad Ali; Wydmański, Jerzy; Wolny-Rokicka, Edyta
2017-04-01
Background: Clinical datasets for epithelial ovarian cancer brain metastatic patients are usually small in size. When adequate case numbers are lacking, resulting estimates of regression coefficients may demonstrate bias. One of the direct approaches to reduce such sparse-data bias is based on penalized estimation. Methods: A re- analysis of formerly reported hazard ratios in diagnosed patients was performed using penalized Cox regression with a popular SAS package providing additional software codes for a statistical computational procedure. Results: It was found that the penalized approach can readily diminish sparse data artefacts and radically reduce the magnitude of estimated regression coefficients. Conclusions: It was confirmed that classical statistical approaches may exaggerate regression estimates or distort study interpretations and conclusions. The results support the thesis that penalization via weak informative priors and data augmentation are the safest approaches to shrink sparse data artefacts frequently occurring in epidemiological research. Creative Commons Attribution License
NASA Astrophysics Data System (ADS)
Rock, N. M. S.; Duffy, T. R.
REGRES allows a range of regression equations to be calculated for paired sets of data values in which both variables are subject to error (i.e. neither is the "independent" variable). Nonparametric regressions, based on medians of all possible pairwise slopes and intercepts, are treated in detail. Estimated slopes and intercepts are output, along with confidence limits, Spearman and Kendall rank correlation coefficients. Outliers can be rejected with user-determined stringency. Parametric regressions can be calculated for any value of λ (the ratio of the variances of the random errors for y and x)—including: (1) major axis ( λ = 1); (2) reduced major axis ( λ = variance of y/variance of x); (3) Y on Xλ = infinity; or (4) X on Y ( λ = 0) solutions. Pearson linear correlation coefficients also are output. REGRES provides an alternative to conventional isochron assessment techniques where bivariate normal errors cannot be assumed, or weighting methods are inappropriate.
Application of Temperature Sensitivities During Iterative Strain-Gage Balance Calibration Analysis
NASA Technical Reports Server (NTRS)
Ulbrich, N.
2011-01-01
A new method is discussed that may be used to correct wind tunnel strain-gage balance load predictions for the influence of residual temperature effects at the location of the strain-gages. The method was designed for the iterative analysis technique that is used in the aerospace testing community to predict balance loads from strain-gage outputs during a wind tunnel test. The new method implicitly applies temperature corrections to the gage outputs during the load iteration process. Therefore, it can use uncorrected gage outputs directly as input for the load calculations. The new method is applied in several steps. First, balance calibration data is analyzed in the usual manner assuming that the balance temperature was kept constant during the calibration. Then, the temperature difference relative to the calibration temperature is introduced as a new independent variable for each strain--gage output. Therefore, sensors must exist near the strain--gages so that the required temperature differences can be measured during the wind tunnel test. In addition, the format of the regression coefficient matrix needs to be extended so that it can support the new independent variables. In the next step, the extended regression coefficient matrix of the original calibration data is modified by using the manufacturer specified temperature sensitivity of each strain--gage as the regression coefficient of the corresponding temperature difference variable. Finally, the modified regression coefficient matrix is converted to a data reduction matrix that the iterative analysis technique needs for the calculation of balance loads. Original calibration data and modified check load data of NASA's MC60D balance are used to illustrate the new method.
Howard, Elizabeth J; Harville, Emily; Kissinger, Patricia; Xiong, Xu
2013-07-01
There is growing interest in the application of propensity scores (PS) in epidemiologic studies, especially within the field of reproductive epidemiology. This retrospective cohort study assesses the impact of a short interpregnancy interval (IPI) on preterm birth and compares the results of the conventional logistic regression analysis with analyses utilizing a PS. The study included 96,378 singleton infants from Louisiana birth certificate data (1995-2007). Five regression models designed for methods comparison are presented. Ten percent (10.17 %) of all births were preterm; 26.83 % of births were from a short IPI. The PS-adjusted model produced a more conservative estimate of the exposure variable compared to the conventional logistic regression method (β-coefficient: 0.21 vs. 0.43), as well as a smaller standard error (0.024 vs. 0.028), odds ratio and 95 % confidence intervals [1.15 (1.09, 1.20) vs. 1.23 (1.17, 1.30)]. The inclusion of more covariate and interaction terms in the PS did not change the estimates of the exposure variable. This analysis indicates that PS-adjusted regression may be appropriate for validation of conventional methods in a large dataset with a fairly common outcome. PS's may be beneficial in producing more precise estimates, especially for models with many confounders and effect modifiers and where conventional adjustment with logistic regression is unsatisfactory. Short intervals between pregnancies are associated with preterm birth in this population, according to either technique. Birth spacing is an issue that women have some control over. Educational interventions, including birth control, should be applied during prenatal visits and following delivery.
Sullivan, Sarah; Lewis, Glyn; Mohr, Christine; Herzig, Daniela; Corcoran, Rhiannon; Drake, Richard; Evans, Jonathan
2014-01-01
There is some cross-sectional evidence that theory of mind ability is associated with social functioning in those with psychosis but the direction of this relationship is unknown. This study investigates the longitudinal association between both theory of mind and psychotic symptoms and social functioning outcome in first-episode psychosis. Fifty-four people with first-episode psychosis were followed up at 6 and 12 months. Random effects regression models were used to estimate the stability of theory of mind over time and the association between baseline theory of mind and psychotic symptoms and social functioning outcome. Neither baseline theory of mind ability (regression coefficients: Hinting test 1.07 95% CI -0.74, 2.88; Visual Cartoon test -2.91 95% CI -7.32, 1.51) nor baseline symptoms (regression coefficients: positive symptoms -0.04 95% CI -1.24, 1.16; selected negative symptoms -0.15 95% CI -2.63, 2.32) were associated with social functioning outcome. There was evidence that theory of mind ability was stable over time, (regression coefficients: Hinting test 5.92 95% CI -6.66, 8.92; Visual Cartoon test score 0.13 95% CI -0.17, 0.44). Neither baseline theory of mind ability nor psychotic symptoms are associated with social functioning outcome. Further longitudinal work is needed to understand the origin of social functioning deficits in psychosis.
NASA Astrophysics Data System (ADS)
Chaiyarit, Sakdithep; Thongboonkerd, Visith
2017-12-01
Crystal aggregation is one of the most crucial steps in kidney stone pathogenesis. However, previous studies of crystal aggregation were rarely done and quantitative analysis of aggregation degree was handicapped by a lack of the standard measurement. We thus performed an in vitro assay to generate aggregation of calcium oxalate monohydrate (COM) crystals with various concentrations (25-800 µg/ml) in saturated aggregation buffer. The crystal aggregates were analyzed by microscopic examination, UV-visible spectrophotometry, and GraphPad Prism6 software to define a total of 12 aggregation indices (including number of aggregates, aggregated mass index, optical density, aggregation coefficient, span, number of aggregates at plateau time-point, aggregated area index, aggregated diameter index, aggregated symmetry index, time constant, half-life, and rate constant). The data showed linear correlation between crystal concentration and almost all of these indices, except only for rate constant. Among these, number of aggregates provided the greatest regression coefficient (r=0.997; p<0.001), whereas the equally second rank included aggregated mass index and optical density (r=0.993; p<0.001 and r=‑0.993; p<0.001, respectively) and the equally forth were aggregation coefficient and span (r=0.991; p<0.001 for both). These five indices are thus recommended as the most appropriate indices for quantitative analysis of COM crystal aggregation in vitro.
Seyfart, Tom; Friedrich, Nele; Bülow, Robin; Wallaschofski, Henri; Völzke, Henry; Nauck, Matthias; Keevil, Brian G.; Haring, Robin
2018-01-01
Objectives The aim of this study was to evaluate the association of sex hormones with anthropometry in a large population-based cohort, with liquid chromatography-mass spectrometry (LCMS)-based sex hormone measurements and imaging markers. Study design/Main outcome measures Cross-sectional data from 957 men and women from the population-based Study of Health in Pomerania (SHIP) were used. Associations of a comprehensive panel of LCMS-measured sex hormones with anthropometric parameters, laboratory, and imaging markers were analyzed in multivariable regression models for the full sample and stratified by sex. Sex hormone measures included total testosterone (TT), free testosterone (fT), estrone and estradiol, androstenedione (ASD), dehydroepiandrosterone sulfate (DHEAS), and sex hormone-binding globulin (SHBG). Domains of anthropometry included physical measures (body-mass-index (BMI), waist circumference, waist-to-height-ratio, waist-to-hip-ratio, and hip circumference), laboratory measures of adipokines (leptin and vaspin), and magnet resonance imaging-based measures (visceral and subcutaneous adipose tissue). Results In men, inverse associations between all considered anthropometric parameters with TT were found: BMI (β-coefficient, standard error (SE): -0.159, 0.037), waist-circumference (β-coefficient, SE: -0.892, 0.292), subcutaneous adipose tissue (β-coefficient, SE: -0.156, 0.023), and leptin (β-coefficient, SE: -0.046, 0.009). In women TT (β-coefficient, SE: 1.356, 0.615) and estrone (β-coefficient, SE: 0.014, 0.005) were positively associated with BMI. In analyses of variance, BMI and leptin were inversely associated with TT, ASD, and DHEAS in men, but positively associated with estrone. In women, BMI and leptin were positively associated with all sex hormones. Conclusion The present population-based study confirmed and extended previously reported sex-specific associations between sex hormones and various anthropometric markers of overweight and obesity. PMID:29324787
Guertler, Diana; Vandelanotte, Corneel; Short, Camille; Alley, Stephanie; Schoeppe, Stephanie; Duncan, Mitch J.
2015-01-01
Objective: This study aims to examine the relationship of lifestyle behaviors (physical activity, work and non-work sitting time, sleep quality, and sleep duration) with presenteeism while controlling for sociodemographics, work- and health-related variables. Methods: Data were collected from 710 workers (aged 20 to 76 years; 47.9% women) from randomly selected Australian adults who completed an online survey. Linear regression was used to examine the relationship between lifestyle behaviors and presenteeism. Results: Poorer sleep quality (standardized regression coefficients [B] = 0.112; P < 0.05), suboptimal duration (B = 0.081; P < 0.05), and lower work sitting time (B = −0.086; P < 0.05) were significantly associated with higher presenteeism when controlling for all lifestyle behaviors. Engaging in three risky lifestyle behaviors was associated with higher presenteeism (B = 0.150; P < 0.01) compared with engaging in none or one. Conclusions: The results of this study highlight the importance of sleep behaviors for presenteeism and call for behavioral interventions that simultaneously address sleep in conjunction with other activity-related behaviors. PMID:25742538
Life-space mobility and social support in elderly adults with orthopaedic disorders.
Suzuki, Tomoko; Kitaike, Tadashi; Ikezaki, Sumie
2014-03-01
The purpose of this cross-sectional survey was to explore relationships between life-space mobility and the related factors in elderly Japanese people who attend orthopaedic clinics. The study measures included surveys of life-space mobility (Life-space Assessment (LSA) score), social support (social network diversity and social ties), physical ability (instrumental self-maintenance, intellectual activity, social role), orthopaedic factors (diseases and symptoms) and demographic information. The questionnaire was distributed to 156 subjects; 152 persons responded, yielding 140 valid responses. Mean age of the sample was 76.0 ± 6.4 (range, 65-96 years), with 57.9% women (n = 81). In a multiple regression analysis, the six factors were significantly associated with LSA. Standardized partial regression coefficients (β) were gender (0.342), instrumental self-maintenance (0.297), social network diversity (0.217), age (-0.170), difficulty of motion (-0.156) and intellectual activity (0.150), with an adjusted R(2) = 0.488. These results suggest that outpatient health-care providers need to intervene in not only addressing orthopaedic factors but also promoting social support among elderly Japanese. © 2014 Wiley Publishing Asia Pty Ltd.
Gaspardo, B; Del Zotto, S; Torelli, E; Cividino, S R; Firrao, G; Della Riccia, G; Stefanon, B
2012-12-01
Fourier transform near infrared (FT-NIR) spectroscopy is an analytical procedure generally used to detect organic compounds in food. In this work the ability to predict fumonisin B(1)+B(2) contents in corn meal using an FT-NIR spectrophotometer, equipped with an integration sphere, was assessed. A total of 143 corn meal samples were collected in Friuli Venezia Giulia Region (Italy) and used to define a 15 principal components regression model, applying partial least square regression algorithm with full cross validation as internal validation. External validation was performed to 25 unknown samples. Coefficients of correlation, root mean square error and standard error of calibration were 0.964, 0.630 and 0.632, respectively and the external validation confirmed a fair potential of the model in predicting FB(1)+FB(2) concentration. Results suggest that FT-NIR analysis is a suitable method to detect FB(1)+FB(2) in corn meal and to discriminate safe meals from those contaminated. Copyright © 2012 Elsevier Ltd. All rights reserved.
Khazaei, Salman; Rezaeian, Shahab; Khazaei, Somayeh; Mansori, Kamyar; Sanjari Moghaddam, Ali; Ayubi, Erfan
2016-01-01
Geographic disparity for colorectal cancer (CRC) incidence and mortality according to the human development index (HDI) might be expected. This study aimed at quantifying the effect measure of association HDI and its components on the CRC incidence and mortality. In this ecological study, CRC incidence and mortality was obtained from GLOBOCAN, the global cancer project for 172 countries. Data were extracted about HDI 2013 for 169 countries from the World Bank report. Linear regression was constructed to measure effects of HDI and its components on CRC incidence and mortality. A positive trend between increasing HDI of countries and age-standardized rates per 100,000 of CRC incidence and mortality was observed. Among HDI components education was the strongest effect measure of association on CRC incidence and mortality, regression coefficients (95% confidence intervals) being 2.8 (2.4, 3.2) and 0.9 (0.8, 1), respectively. HDI and its components were positively related with CRC incidence and mortality and can be considered as targets for prevention and treatment intervention or tracking geographic disparities.
Density conversion factor determined using a cone-beam computed tomography unit NewTom QR-DVT 9000.
Lagravère, M O; Fang, Y; Carey, J; Toogood, R W; Packota, G V; Major, P W
2006-11-01
The purpose of this study was to determine a conversion coefficient for Hounsfield Units (HU) to material density (g cm(-3)) obtained from cone-beam computed tomography (CBCT-NewTom QR-DVT 9000) data. Six cylindrical models of materials with different densities were made and scanned using the NewTom QR-DVT 9000 Volume Scanner. The raw data were converted into DICOM format and analysed using Merge eFilm and AMIRA to determine the HU of different areas of the models. There was no significant difference (P = 0.846) between the HU given by each piece of software. A linear regression was performed using the density, rho (g cm(-3)), as the dependent variable in terms of the HU (H). The regression equation obtained was rho = 0.002H-0.381 with an R2 value of 0.986. The standard error of the estimation is 27.104 HU in the case of the Hounsfield Units and 0.064 g cm(-3) in the case of density. CBCT provides an effective option for determination of material density expressed as Hounsfield Units.
Use of Empirical Estimates of Shrinkage in Multiple Regression: A Caution.
ERIC Educational Resources Information Center
Kromrey, Jeffrey D.; Hines, Constance V.
1995-01-01
The accuracy of four empirical techniques to estimate shrinkage in multiple regression was studied through Monte Carlo simulation. None of the techniques provided unbiased estimates of the population squared multiple correlation coefficient, but the normalized jackknife and bootstrap techniques demonstrated marginally acceptable performance with…
Enhance-Synergism and Suppression Effects in Multiple Regression
ERIC Educational Resources Information Center
Lipovetsky, Stan; Conklin, W. Michael
2004-01-01
Relations between pairwise correlations and the coefficient of multiple determination in regression analysis are considered. The conditions for the occurrence of enhance-synergism and suppression effects when multiple determination becomes bigger than the total of squared correlations of the dependent variable with the regressors are discussed. It…
Soares, M P; Gaya, L G; Lorentz, L H; Batistel, F; Rovadoscki, G A; Ticiani, E; Zabot, V; Di Domenico, Q; Madureira, A P; Pértile, S F N
2011-09-06
Artificial insemination has been used to improve production in Brazilian dairy cattle; however, this can lead to problems due to increased inbreeding. To evaluate the effect of the magnitude of inbreeding coefficients on predicted transmitting abilities (PTAs) for milk traits of Holstein and Jersey breeds, data on 392 Holstein and 92 Jersey sires used in Brazil were tabulated. The second-degree polynomial equations and points of maximum or minimal response were estimated to establish the regression equation of the variables as a function of the inbreeding coefficients. The mean inbreeding coefficient of the Holstein bulls was 5.10%; this did not significantly affect the PTA for percent milk fat, protein percentage and protein (P = 0.479, 0.058 and 0.087, respectively). However, the PTAs for milk yield and fat decreased significantly after reaching inbreeding coefficients of 6.43 (P = 0.034) and 5.75 (P = 0.007), respectively. The mean inbreeding coefficient of Jersey bulls was 6.45%; the PTAs for milk yield, fat and protein, in pounds, decreased significantly after reaching inbreeding coefficients of 15.04, 9.83 and 12.82% (P < 0.001, P = 0.002, and P = 0.001, respectively). The linear regression was only significant for fat and protein percentages in the Jersey breed (P = 0.002 and P = 0.005, respectively). The PTAs of Holstein sires were more affected by smaller magnitudes of inbreeding coefficients than those of Jersey sires. It is necessary to monitor the inbreeding coefficients of sires used for artificial insemination in breeding schemes in Brazil, since the low genetic variability of the available sires may lead to reduced production.
Analyzing degradation data with a random effects spline regression model
Fugate, Michael Lynn; Hamada, Michael Scott; Weaver, Brian Phillip
2017-03-17
This study proposes using a random effects spline regression model to analyze degradation data. Spline regression avoids having to specify a parametric function for the true degradation of an item. A distribution for the spline regression coefficients captures the variation of the true degradation curves from item to item. We illustrate the proposed methodology with a real example using a Bayesian approach. The Bayesian approach allows prediction of degradation of a population over time and estimation of reliability is easy to perform.
Analyzing degradation data with a random effects spline regression model
DOE Office of Scientific and Technical Information (OSTI.GOV)
Fugate, Michael Lynn; Hamada, Michael Scott; Weaver, Brian Phillip
This study proposes using a random effects spline regression model to analyze degradation data. Spline regression avoids having to specify a parametric function for the true degradation of an item. A distribution for the spline regression coefficients captures the variation of the true degradation curves from item to item. We illustrate the proposed methodology with a real example using a Bayesian approach. The Bayesian approach allows prediction of degradation of a population over time and estimation of reliability is easy to perform.
Fananapazir, Ghaneh; Benzl, Robert; Corwin, Michael T; Chen, Ling-Xin; Sageshima, Junichiro; Stewart, Susan L; Troppmann, Christoph
2018-07-01
Purpose To determine whether the predonation computed tomography (CT)-based volume of the future remnant kidney is predictive of postdonation renal function in living kidney donors. Materials and Methods This institutional review board-approved, retrospective, HIPAA-compliant study included 126 live kidney donors who had undergone predonation renal CT between January 2007 and December 2014 as well as 2-year postdonation measurement of estimated glomerular filtration rate (eGFR). The whole kidney volume and cortical volume of the future remnant kidney were measured and standardized for body surface area (BSA). Bivariate linear associations between the ratios of whole kidney volume to BSA and cortical volume to BSA were obtained. A linear regression model for 2-year postdonation eGFR that incorporated donor age, sex, and either whole kidney volume-to-BSA ratio or cortical volume-to-BSA ratio was created, and the coefficient of determination (R 2 ) for the model was calculated. Factors not statistically additive in assessing 2-year eGFR were removed by using backward elimination, and the coefficient of determination for this parsimonious model was calculated. Results Correlation was slightly better for cortical volume-to-BSA ratio than for whole kidney volume-to-BSA ratio (r = 0.48 vs r = 0.44, respectively). The linear regression model incorporating all donor factors had an R 2 of 0.66. The only factors that were significantly additive to the equation were cortical volume-to-BSA ratio and predonation eGFR (P = .01 and P < .01, respectively), and the final parsimonious linear regression model incorporating these two variables explained almost the same amount of variance (R 2 = 0.65) as did the full model. Conclusion The cortical volume of the future remnant kidney helped predict postdonation eGFR at 2 years. The cortical volume-to-BSA ratio should thus be considered for addition as an important variable to living kidney donor evaluation and selection guidelines. © RSNA, 2018.
Energy Setting and Visual Outcomes in SMILE: A Retrospective Cohort Study.
Li, Liuyang; Schallhorn, Julie M; Ma, Jiaonan; Cui, Tong; Wang, Yan
2018-01-01
To assess the independent effect of energy setting on postoperative uncorrected distance visual acuity (UDVA) in small incision lenticule extraction (SMILE) and further investigate an optimal energy setting for the 4.5-μm spot-track-distance, which is in wide clinical use. A total of 1,130 eyes were included in a retrospective cohort study from Tianjin Eye Hospital, Tianjin Medical University from April 2015 to July 2016. Energy settings and baseline characteristics were recorded and 3-month UDVA was tested by a nurse blinded to the energy settings used. Multiple regression analysis and generalized estimating equations were used to take into account the correlation between the measurements from two eyes. The 3-month UDVA (mean ± standard deviation) of 125 to 160 nJ (by 5-nJ increments) was 1.39 ± 0.19, 1.40 ± 0.32, 1.33 ± 0.27, 1.36 ± 0.27, 1.34 ± 0.25, 1.29 ± 0.19, 1.36 ± 0.27, and 1.19 ± 0.22, respectively. Energy was significantly associated with postoperative logMAR UDVA in different models and the regression coefficient (β) was robust (β = 0.01, 95% confidence interval = 0.00 to 0.01). The regression coefficient β (0.01, 95% confidence interval = 0.00 to 0.02, P = .0029) of energy (125 to 150 nJ, by 5-nJ increments) on 4.5-μm spot-track-distance was still associated with the logMAR UDVA when adjusted for sex, age, myopia, astigmatism, mean keratometry, central corneal thickness, preoperative logMAR CDVA, and side spot-track-distance. The lower end of the energy studied was associated with a better postoperative UDVA in this population. The spot-track-distance of 4.5 μm with 125 nJ energy was the optimal combination within this range. [J Refract Surg. 2018;34(1):11-16.]. Copyright 2018, SLACK Incorporated.
Wong, Carlos K H; Lo, Yvonne Y C; Wong, Winnie H T; Fung, Colman S C
2013-08-21
This study aimed to determine the associations of various clinical factors with generic health-related quality of life (HRQOL) scores among Hong Kong Chinese patients with type 2 diabetes mellitus (T2DM) in the outpatient primary care setting using the short-form 12 (SF-12). A cross-sectional survey of 488 Chinese adults with T2DM recruited from a primary care outpatient clinic was conducted from May to August 2008. Data on the standard Chinese (HK) SF-12 Health Survey and patients' socio-demographics were collected from face-to-face interviews. Glycaemic control, body mass index (BMI), chronic co-morbidities, diabetic complications and treatment modalities were determined for each patient through medical records. Associations of socio-demographic and clinical factors with physical component summary (PCS-12) and mental component summary scores (MCS-12) were evaluated using multiple linear regression. The socio-demographic correlates of PCS-12 and MCS-12 were age, gender and BMI. After adjustment for socio-demographic variables, the BMI was negatively associated with PCS-12 but positively associated with MCS-12. The presence of diabetic complications was associated with lower PCS-12 (regression coefficient:-3.0 points, p < 0.05) while being on insulin treatment was associated with lower MCS-12 (regression coefficient:-5.8 points, p < 0.05). In contrast, glycaemic control, duration of T2DM and treatment with oral hypoglycaemic drugs were not significantly associated with PCS-12 or MCS-12. Among T2DM subjects in the primary care setting, impairments in the physical aspect of HRQOL were evident in subjects who were obese or had diabetic complications whereas defects in the mental aspect of HRQOL were observed in patients with lower BMI or receiving insulin injections.
On Teaching about the Coefficient of Variation in Introductory Statistics Courses
ERIC Educational Resources Information Center
Trafimow, David
2014-01-01
The standard deviation is related to the mean by virtue of the coefficient of variation. Teachers of statistics courses can make use of that fact to make the standard deviation more comprehensible for statistics students.
Influence of soil pH on the sorption of ionizable chemicals: modeling advances.
Franco, Antonio; Fu, Wenjing; Trapp, Stefan
2009-03-01
The soil-water distribution coefficient of ionizable chemicals (K(d)) depends on the soil acidity, mainly because the pH governs speciation. Using pH-specific K(d) values normalized to organic carbon (K(OC)) from the literature, a method was developed to estimate the K(OC) of monovalent organic acids and bases. The regression considers pH-dependent speciation and species-specific partition coefficients, calculated from the dissociation constant (pK(a)) and the octanol-water partition coefficient of the neutral molecule (log P(n)). Probably because of the lower pH near the organic colloid-water interface, the optimal pH to model dissociation was lower than the bulk soil pH. The knowledge of the soil pH allows calculation of the fractions of neutral and ionic molecules in the system, thus improving the existing regression for acids. The same approach was not successful with bases, for which the impact of pH on the total sorption is contrasting. In fact, the shortcomings of the model assumptions affect the predictive power for acids and for bases differently. We evaluated accuracy and limitations of the regressions for their use in the environmental fate assessment of ionizable chemicals.
Predictive value of age of walking for later motor performance in children with mental retardation.
Kokubun, M; Haishi, K; Okuzumi, H; Hosobuchi, T; Koike, T
1996-12-01
The purpose of the present study was to clarify the predictive value of age of walking for later motor performance in children with mental retardation. While paying due attention to other factors, our investigation focused on the relationship between a subject's age of walking, and his or her subsequent beam-walking performance. The subjects were 85 children with mental retardation with an average age of 13 years and 3 months. Beam-walking performance was measured by a procedure developed by the authors. Five low beams (5 cm) which varied in width (12.5, 10, 7.5, 5 and 2.5 cm) were employed. The performance of subjects was scored from zero to five points according to the width of the beam that they were able to walk without falling off. From the results of multiple regression analysis, three independent variables were found to be significantly related to beam-walking performance. The age of walking was the most basic variable: partial correlation coefficient (PCC) = -45; standardized partial regression coefficient (SPRC) = -0.41. The next variable in importance was walking duration (PCC = 0.38; SPRC = 0.31). The autism variable also contributed significantly (PCC = 0.28; SPRC = 0.22). Therefore, within the age range used in the present study, the age of walking in children with mental retardation was thought to have sufficient predictive value, even when the variables which might have possibly affected their subsequent performance were taken into consideration; the earlier the age of walking, the better the beam-walking performance.
Normal reference values for bladder wall thickness on CT in a healthy population.
Fananapazir, Ghaneh; Kitich, Aleksandar; Lamba, Ramit; Stewart, Susan L; Corwin, Michael T
2018-02-01
To determine normal bladder wall thickness on CT in patients without bladder disease. Four hundred and nineteen patients presenting for trauma with normal CTs of the abdomen and pelvis were included in our retrospective study. Bladder wall thickness was assessed, and bladder volume was measured using both the ellipsoid formula and an automated technique. Patient age, gender, and body mass index were recorded. Linear regression models were created to account for bladder volume, age, gender, and body mass index, and the multiple correlation coefficient with bladder wall thickness was computed. Bladder volume and bladder wall thickness were log-transformed to achieve approximate normality and homogeneity of variance. Variables that did not contribute substantively to the model were excluded, and a parsimonious model was created and the multiple correlation coefficient was calculated. Expected bladder wall thickness was estimated for different bladder volumes, and 1.96 standard deviation above expected provided the upper limit of normal on the log scale. Age, gender, and bladder volume were associated with bladder wall thickness (p = 0.049, 0.024, and < 0.001, respectively). The linear regression model had an R 2 of 0.52. Age and gender were negligible in contribution to the model, and a parsimonious model using only volume was created for both the ellipsoid and automated volumes (R 2 = 0.52 and 0.51, respectively). Bladder wall thickness correlates with bladder wall volume. The study provides reference bladder wall thicknesses on CT utilizing both the ellipsoid formula and automated bladder volumes.
Estimation of Relative Economic Weights of Hanwoo Carcass Traits Based on Carcass Market Price
Choy, Yun Ho; Park, Byoung Ho; Choi, Tae Jung; Choi, Jae Gwan; Cho, Kwang Hyun; Lee, Seung Soo; Choi, You Lim; Koh, Kyung Chul; Kim, Hyo Sun
2012-01-01
The objective of this study was to estimate economic weights of Hanwoo carcass traits that can be used to build economic selection indexes for selection of seedstocks. Data from carcass measures for determining beef yield and quality grades were collected and provided by the Korean Institute for Animal Products Quality Evaluation (KAPE). Out of 1,556,971 records, 476,430 records collected from 13 abattoirs from 2008 to 2010 after deletion of outlying observations were used to estimate relative economic weights of bid price per kg carcass weight on cold carcass weight (CW), eye muscle area (EMA), backfat thickness (BF) and marbling score (MS) and the phenotypic relationships among component traits. Price of carcass tended to increase linearly as yield grades or quality grades, in marginal or in combination, increased. Partial regression coefficients for MS, EMA, BF, and for CW in original scales were +948.5 won/score, +27.3 won/cm2, −95.2 won/mm and +7.3 won/kg when all three sex categories were taken into account. Among four grade determining traits, relative economic weight of MS was the greatest. Variations in partial regression coefficients by sex categories were great but the trends in relative weights for each carcass measures were similar. Relative economic weights of four traits in integer values when standardized measures were fit into covariance model were +4:+1:−1:+1 for MS:EMA:BF:CW. Further research is required to account for the cost of production per unit carcass weight or per unit production under different economic situations. PMID:25049531
Aldosterone and glomerular filtration – observations in the general population
2014-01-01
Background Increasing evidence suggests that aldosterone promotes renal damage. Since data on the association between aldosterone and renal function in the general population are sparse, we chose to address this issue. We investigated the associations between the plasma aldosterone concentration (PAC) or the aldosterone-to-renin ratio (ARR) and the estimated glomerular filtration rate (eGFR) in a sample of adult men and women from Northeast Germany. Methods A study population of 1921 adult men and women who participated in the first follow-up of the Study of Health in Pomerania was selected. None of the subjects used drugs that alter PAC or ARR. The eGFR was calculated according to the four-variable Modification of Diet in Renal Disease formula. Chronic kidney disease (CKD) was defined as an eGFR <60 ml/min/1.73 m2. Results Linear regression models, adjusted for sex, age, waist circumference, diabetes mellitus, smoking status, systolic and diastolic blood pressures, serum triglyceride concentrations and time of blood sampling revealed inverse associations of PAC or ARR with eGFR (ß-coefficient for log-transformed PAC −3.12, p < 0.001; ß-coefficient for log-transformed ARR −3.36, p < 0.001). Logistic regression models revealed increased odds for CKD with increasing PAC (odds ratio for a one standard deviation increase in PAC: 1.35, 95% confidence interval: 1.06-1.71). There was no statistically significant association between ARR and CKD. Conclusion Our study demonstrates that PAC and ARR are inversely associated with the glomerular filtration rate in the general population. PMID:24612948
General Framework for Meta-analysis of Rare Variants in Sequencing Association Studies
Lee, Seunggeun; Teslovich, Tanya M.; Boehnke, Michael; Lin, Xihong
2013-01-01
We propose a general statistical framework for meta-analysis of gene- or region-based multimarker rare variant association tests in sequencing association studies. In genome-wide association studies, single-marker meta-analysis has been widely used to increase statistical power by combining results via regression coefficients and standard errors from different studies. In analysis of rare variants in sequencing studies, region-based multimarker tests are often used to increase power. We propose meta-analysis methods for commonly used gene- or region-based rare variants tests, such as burden tests and variance component tests. Because estimation of regression coefficients of individual rare variants is often unstable or not feasible, the proposed method avoids this difficulty by calculating score statistics instead that only require fitting the null model for each study and then aggregating these score statistics across studies. Our proposed meta-analysis rare variant association tests are conducted based on study-specific summary statistics, specifically score statistics for each variant and between-variant covariance-type (linkage disequilibrium) relationship statistics for each gene or region. The proposed methods are able to incorporate different levels of heterogeneity of genetic effects across studies and are applicable to meta-analysis of multiple ancestry groups. We show that the proposed methods are essentially as powerful as joint analysis by directly pooling individual level genotype data. We conduct extensive simulations to evaluate the performance of our methods by varying levels of heterogeneity across studies, and we apply the proposed methods to meta-analysis of rare variant effects in a multicohort study of the genetics of blood lipid levels. PMID:23768515
Genetic parameters for stayability to consecutive calvings in Zebu cattle.
Silva, D O; Santana, M L; Ayres, D R; Menezes, G R O; Silva, L O C; Nobre, P R C; Pereira, R J
2017-12-22
Longer-lived cows tend to be more profitable and the stayability trait is a selection criterion correlated to longevity. An alternative to the traditional approach to evaluate stayability is its definition based on consecutive calvings, whose main advantage is the more accurate evaluation of young bulls. However, no study using this alternative approach has been conducted for Zebu breeds. Therefore, the objective of this study was to compare linear random regression models to fit stayability to consecutive calvings of Guzerá, Nelore and Tabapuã cows and to estimate genetic parameters for this trait in the respective breeds. Data up to the eighth calving were used. The models included the fixed effects of age at first calving and year-season of birth of the cow and the random effects of contemporary group, additive genetic, permanent environmental and residual. Random regressions were modeled by orthogonal Legendre polynomials of order 1 to 4 (2 to 5 coefficients) for contemporary group, additive genetic and permanent environmental effects. Using Deviance Information Criterion as the selection criterion, the model with 4 regression coefficients for each effect was the most adequate for the Nelore and Tabapuã breeds and the model with 5 coefficients is recommended for the Guzerá breed. For Guzerá, heritabilities ranged from 0.05 to 0.08, showing a quadratic trend with a peak between the fourth and sixth calving. For the Nelore and Tabapuã breeds, the estimates ranged from 0.03 to 0.07 and from 0.03 to 0.08, respectively, and increased with increasing calving number. The additive genetic correlations exhibited a similar trend among breeds and were higher for stayability between closer calvings. Even between more distant calvings (second v. eighth), stayability showed a moderate to high genetic correlation, which was 0.77, 0.57 and 0.79 for the Guzerá, Nelore and Tabapuã breeds, respectively. For Guzerá, when the models with 4 or 5 regression coefficients were compared, the rank correlations between predicted breeding values for the intercept were always higher than 0.99, indicating the possibility of practical application of the least parameterized model. In conclusion, the model with 4 random regression coefficients is recommended for the genetic evaluation of stayability to consecutive calvings in Zebu cattle.
Delgado, J; Liao, J C
1992-01-01
The methodology previously developed for determining the Flux Control Coefficients [Delgado & Liao (1992) Biochem. J. 282, 919-927] is extended to the calculation of metabolite Concentration Control Coefficients. It is shown that the transient metabolite concentrations are related by a few algebraic equations, attributed to mass balance, stoichiometric constraints, quasi-equilibrium or quasi-steady states, and kinetic regulations. The coefficients in these relations can be estimated using linear regression, and can be used to calculate the Control Coefficients. The theoretical basis and two examples are discussed. Although the methodology is derived based on the linear approximation of enzyme kinetics, it yields reasonably good estimates of the Control Coefficients for systems with non-linear kinetics. PMID:1497632
Revisiting crash spatial heterogeneity: A Bayesian spatially varying coefficients approach.
Xu, Pengpeng; Huang, Helai; Dong, Ni; Wong, S C
2017-01-01
This study was performed to investigate the spatially varying relationships between crash frequency and related risk factors. A Bayesian spatially varying coefficients model was elaborately introduced as a methodological alternative to simultaneously account for the unstructured and spatially structured heterogeneity of the regression coefficients in predicting crash frequencies. The proposed method was appealing in that the parameters were modeled via a conditional autoregressive prior distribution, which involved a single set of random effects and a spatial correlation parameter with extreme values corresponding to pure unstructured or pure spatially correlated random effects. A case study using a three-year crash dataset from the Hillsborough County, Florida, was conducted to illustrate the proposed model. Empirical analysis confirmed the presence of both unstructured and spatially correlated variations in the effects of contributory factors on severe crash occurrences. The findings also suggested that ignoring spatially structured heterogeneity may result in biased parameter estimates and incorrect inferences, while assuming the regression coefficients to be spatially clustered only is probably subject to the issue of over-smoothness. Copyright © 2016 Elsevier Ltd. All rights reserved.
Estimation of Flood Discharges at Selected Recurrence Intervals for Streams in New Hampshire
Olson, Scott A.
2009-01-01
This report provides estimates of flood discharges at selected recurrence intervals for streamgages in and adjacent to New Hampshire and equations for estimating flood discharges at recurrence intervals of 2-, 5-, 10-, 25-, 50-, 100-, and 500-years for ungaged, unregulated, rural streams in New Hampshire. The equations were developed using generalized least-squares regression. Flood-frequency and drainage-basin characteristics from 117 streamgages were used in developing the equations. The drainage-basin characteristics used as explanatory variables in the regression equations include drainage area, mean April precipitation, percentage of wetland area, and main channel slope. The average standard error of prediction for estimating the 2-, 5-, 10-, 25-, 50-, 100-, and 500-year recurrence interval flood discharges with these equations are 30.0, 30.8, 32.0, 34.2, 36.0, 38.1, and 43.4 percent, respectively. Flood discharges at selected recurrence intervals for selected streamgages were computed following the guidelines in Bulletin 17B of the U.S. Interagency Advisory Committee on Water Data. To determine the flood-discharge exceedence probabilities at streamgages in New Hampshire, a new generalized skew coefficient map covering the State was developed. The standard error of the data on new map is 0.298. To improve estimates of flood discharges at selected recurrence intervals for 20 streamgages with short-term records (10 to 15 years), record extension using the two-station comparison technique was applied. The two-station comparison method uses data from a streamgage with long-term record to adjust the frequency characteristics at a streamgage with a short-term record. A technique for adjusting a flood-discharge frequency curve computed from a streamgage record with results from the regression equations is described in this report. Also, a technique is described for estimating flood discharge at a selected recurrence interval for an ungaged site upstream or downstream from a streamgage using a drainage-area adjustment. The final regression equations and the flood-discharge frequency data used in this study will be available in StreamStats. StreamStats is a World Wide Web application providing automated regression-equation solutions for user-selected sites on streams.
Existence of consistent hypo- and hyperresponders to dietary cholesterol in man.
Katan, M B; Beynen, A C; de Vries, J H; Nobels, A
1986-02-01
Hyper- and hyporesponsiveness of serum cholesterol to dietary cholesterol is an established concept in animals but not in man. The authors studied the stability of the individual response of serum cholesterol to dietary cholesterol in three controlled experiments in 1982. The subjects were volunteers from the general population living in or near Wageningen, the Netherlands. Each experiment had a low-cholesterol baseline period (121, 106, and 129 mg/day in experiments 1, 2, and 3, respectively) and a high-cholesterol test period (625, 673, and 989 mg/day). Duplicate portion analysis showed that dietary cholesterol was the only variable. The 94 healthy men and women who completed experiment 1 showed an increase (mean +/- standard deviation (SD] in serum cholesterol of 0.50 +/- 0.39 mmol/liter (19 +/- 15 mg/dl). Seventeen putative hyperresponders, defined by their response in experiment 1, were retested in experiments 2 and 3; they showed responses of 0.28 +/- 0.38 mmol/liter (11 +/- 15 mg/dl) and 0.82 +/- 0.35 mmol/liter (32 +/- 14 mg/dl), respectively. Fifteen hyporesponders, selected in experiment 1, showed responses in experiments 2 and 3 of 0.06 +/- 0.35 mmol/liter (2 +/- 14 mg/dl) and 0.47 +/- 0.26 mmol/liter (18 +/- 10 mg/dl), significantly lower than the corresponding values for hyperresponders. The standardized regression coefficient for individual responses in experiment 2 on those in experiment 1 was beta = 0.34 (p = 0.03, n = 32); the corresponding regression coefficient for experiment 3 and experiment 1 was 0.53 (p less than 0.01). After correction for intraindividual fluctuations the true responsiveness distribution was found to have a between-subject standard deviation of about 0.29 mmol/liter (11 mg/dl). This implies that if the mean response to a certain dietary cholesterol load amounts to e.g., 0.58 mmol/liter (22 mg/dl), then the 16% of subjects least susceptible to diet will experience a rise of only 0.29 mmol/liter (11 mg/dl) or less, while in the 16% of subjects most susceptible to diet, serum cholesterol will rise by 0.87 mmol/liter (34 mg/dl) or more. The authors conclude that modest differences in responsiveness of serum cholesterol to dietary cholesterol do exist in man, and that the wide scatter of responses observed in single experiments is largely due to chance fluctuations.
Hess, Glen W.
2002-01-01
Techniques for estimating monthly streamflow-duration characteristics at ungaged and partial-record sites in central Nevada have been updated. These techniques were developed using streamflow records at six continuous-record sites, basin physical and climatic characteristics, and concurrent streamflow measurements at four partial-record sites. Two methods, the basin-characteristic method and the concurrent-measurement method, were developed to provide estimating techniques for selected streamflow characteristics at ungaged and partial-record sites in central Nevada. In the first method, logarithmic-regression analyses were used to relate monthly mean streamflows (from all months and by month) from continuous-record gaging sites of various percent exceedence levels or monthly mean streamflows (by month) to selected basin physical and climatic variables at ungaged sites. Analyses indicate that the total drainage area and percent of drainage area at altitudes greater than 10,000 feet are the most significant variables. For the equations developed from all months of monthly mean streamflow, the coefficient of determination averaged 0.84 and the standard error of estimate of the relations for the ungaged sites averaged 72 percent. For the equations derived from monthly means by month, the coefficient of determination averaged 0.72 and the standard error of estimate of the relations averaged 78 percent. If standard errors are compared, the relations developed in this study appear generally to be less accurate than those developed in a previous study. However, the new relations are based on additional data and the slight increase in error may be due to the wider range of streamflow for a longer period of record, 1995-2000. In the second method, streamflow measurements at partial-record sites were correlated with concurrent streamflows at nearby gaged sites by the use of linear-regression techniques. Statistical measures of results using the second method typically indicated greater accuracy than for the first method. However, to make estimates for individual months, the concurrent-measurement method requires several years additional streamflow data at more partial-record sites. Thus, exceedence values for individual months are not yet available due to the low number of concurrent-streamflow-measurement data available. Reliability, limitations, and applications of both estimating methods are described herein.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Mele, L.M.; Prodan, P.F.
1983-04-01
Hydrologic data were collected and analyzed for three coal refuse disposal sites in southern Illinois. The disposal sites were associated with underground mines and consisted of piles of coarse waste (gob) and slurry areas where fine waste rejected from coal washing was deposited. Prereclamation data were available for the Superior washer site in Macoupin County and the New Kathleen site in Perry County. Post-reclamation data were available for the Staunton 1 site in Macoupin County and the New Kathleen site. Data analyzed from each phase (i.e., pre- or post-reclamation) were limited to one year. Storm event runoff coefficients were calculatedmore » for each site. Average runoff coefficients were compared for sites within the same reclamation phase to determine the effects of topographical parameters such as gob pile slope and percentage of drainage basin covered by the gob pile. Average runoff coefficients were then compared for pre- and post-reclamation data. Multiple regression analyses were performed on rainfall-runoff data for each site to determine the significance of independent variables other than rainfall in determining runoff. A generalized regression equation corrected data for topographical differences and included only those independent variables that were significant at all sites. Regression coefficients were compared for pre- and post-reclamation sites. The results of rainfall-runoff analysis indicate that the runoff coefficient increases because of reclamation. It is hypothesized that this effect is due to the placement of a soil cover that is less permeable than gob or slurry and occurs despite reduction in slope and the establishment of vegetation.« less
van Mil, Anke C C M; Greyling, Arno; Zock, Peter L; Geleijnse, Johanna M; Hopman, Maria T; Mensink, Ronald P; Reesink, Koen D; Green, Daniel J; Ghiadoni, Lorenzo; Thijssen, Dick H
2016-09-01
Brachial artery flow-mediated dilation (FMD) is a popular technique to examine endothelial function in humans. Identifying volunteer and methodological factors related to variation in FMD is important to improve measurement accuracy and applicability. Volunteer-related and methodology-related parameters were collected in 672 volunteers from eight affiliated centres worldwide who underwent repeated measures of FMD. All centres adopted contemporary expert-consensus guidelines for FMD assessment. After calculating the coefficient of variation (%) of the FMD for each individual, we constructed quartiles (n = 168 per quartile). Based on two regression models (volunteer-related factors and methodology-related factors), statistically significant components of these two models were added to a final regression model (calculated as β-coefficient and R). This allowed us to identify factors that independently contributed to the variation in FMD%. Median coefficient of variation was 17.5%, with healthy volunteers demonstrating a coefficient of variation 9.3%. Regression models revealed age (β = 0.248, P < 0.001), hypertension (β = 0.104, P < 0.001), dyslipidemia (β = 0.331, P < 0.001), time between measurements (β = 0.318, P < 0.001), lab experience (β = -0.133, P < 0.001) and baseline FMD% (β = 0.082, P < 0.05) as contributors to the coefficient of variation. After including all significant factors in the final model, we found that time between measurements, hypertension, baseline FMD% and lab experience with FMD independently predicted brachial artery variability (total R = 0.202). Although FMD% showed good reproducibility, larger variation was observed in conditions with longer time between measurements, hypertension, less experience and lower baseline FMD%. Accounting for these factors may improve FMD% variability.
Inter- and intra-observer reliability of clinical movement-control tests for marines
2012-01-01
Background Musculoskeletal disorders particularly in the back and lower extremities are common among marines. Here, movement-control tests are considered clinically useful for screening and follow-up evaluation. However, few studies have addressed the reliability of clinical tests, and no such published data exists for marines. The present aim was therefore to determine the inter- and intra-observer reliability of clinically convenient tests emphasizing movement control of the back and hip among marines. A secondary aim was to investigate the sensitivity and specificity of these clinical tests for discriminating musculoskeletal pain disorders in this group of military personnel. Methods This inter- and intra-observer reliability study used a test-retest approach with six standardized clinical tests focusing on movement control for back and hip. Thirty-three marines (age 28.7 yrs, SD 5.9) on active duty volunteered and were recruited. They followed an in-vivo observation test procedure that covered both low- and high-load (threshold) tasks relevant for marines on operational duty. Two independent observers simultaneously rated performance as “correct” or “incorrect” following a standardized assessment protocol. Re-testing followed 7–10 days thereafter. Reliability was analysed using kappa (κ) coefficients, while discriminative power of the best-fitting tests for back- and lower-extremity pain was assessed using a multiple-variable regression model. Results Inter-observer reliability for the six tests was moderate to almost perfect with κ-coefficients ranging between 0.56-0.95. Three tests reached almost perfect inter-observer reliability with mean κ-coefficients > 0.81. However, intra-observer reliability was fair-to-moderate with mean κ-coefficients between 0.22-0.58. Three tests achieved moderate intra-observer reliability with κ-coefficients > 0.41. Combinations of one low- and one high-threshold test best discriminated prior back pain, but results were inconsistent for lower-extremity pain. Conclusions Our results suggest that clinical tests of movement control of back and hip are reliable for use in screening protocols using several observers with marines. However, test-retest reproducibility was less accurate, which should be considered in follow-up evaluations. The results also indicate that combinations of low- and high-threshold tests have discriminative validity for prior back pain, but were inconclusive for lower-extremity pain. PMID:23273285
NASA Technical Reports Server (NTRS)
Parker, Peter A.; Geoffrey, Vining G.; Wilson, Sara R.; Szarka, John L., III; Johnson, Nels G.
2010-01-01
The calibration of measurement systems is a fundamental but under-studied problem within industrial statistics. The origins of this problem go back to basic chemical analysis based on NIST standards. In today's world these issues extend to mechanical, electrical, and materials engineering. Often, these new scenarios do not provide "gold standards" such as the standard weights provided by NIST. This paper considers the classic "forward regression followed by inverse regression" approach. In this approach the initial experiment treats the "standards" as the regressor and the observed values as the response to calibrate the instrument. The analyst then must invert the resulting regression model in order to use the instrument to make actual measurements in practice. This paper compares this classical approach to "reverse regression," which treats the standards as the response and the observed measurements as the regressor in the calibration experiment. Such an approach is intuitively appealing because it avoids the need for the inverse regression. However, it also violates some of the basic regression assumptions.
Regression Analysis of Stage Variability for West-Central Florida Lakes
Sacks, Laura A.; Ellison, Donald L.; Swancar, Amy
2008-01-01
The variability in a lake's stage depends upon many factors, including surface-water flows, meteorological conditions, and hydrogeologic characteristics near the lake. An understanding of the factors controlling lake-stage variability for a population of lakes may be helpful to water managers who set regulatory levels for lakes. The goal of this study is to determine whether lake-stage variability can be predicted using multiple linear regression and readily available lake and basin characteristics defined for each lake. Regressions were evaluated for a recent 10-year period (1996-2005) and for a historical 10-year period (1954-63). Ground-water pumping is considered to have affected stage at many of the 98 lakes included in the recent period analysis, and not to have affected stage at the 20 lakes included in the historical period analysis. For the recent period, regression models had coefficients of determination (R2) values ranging from 0.60 to 0.74, and up to five explanatory variables. Standard errors ranged from 21 to 37 percent of the average stage variability. Net leakage was the most important explanatory variable in regressions describing the full range and low range in stage variability for the recent period. The most important explanatory variable in the model predicting the high range in stage variability was the height over median lake stage at which surface-water outflow would occur. Other explanatory variables in final regression models for the recent period included the range in annual rainfall for the period and several variables related to local and regional hydrogeology: (1) ground-water pumping within 1 mile of each lake, (2) the amount of ground-water inflow (by category), (3) the head gradient between the lake and the Upper Floridan aquifer, and (4) the thickness of the intermediate confining unit. Many of the variables in final regression models are related to hydrogeologic characteristics, underscoring the importance of ground-water exchange in controlling the stage of karst lakes in Florida. Regression equations were used to predict lake-stage variability for the recent period for 12 additional lakes, and the median difference between predicted and observed values ranged from 11 to 23 percent. Coefficients of determination for the historical period were considerably lower (maximum R2 of 0.28) than for the recent period. Reasons for these low R2 values are probably related to the small number of lakes (20) with stage data for an equivalent time period that were unaffected by ground-water pumping, the similarity of many of the lake types (large surface-water drainage lakes), and the greater uncertainty in defining historical basin characteristics. The lack of lake-stage data unaffected by ground-water pumping and the poor regression results obtained for that group of lakes limit the ability to predict natural lake-stage variability using this method in west-central Florida.
Software-assisted small bowel motility analysis using free-breathing MRI: feasibility study.
Bickelhaupt, Sebastian; Froehlich, Johannes M; Cattin, Roger; Raible, Stephan; Bouquet, Hanspeter; Bill, Urs; Patak, Michael A
2014-01-01
To validate a software prototype allowing for small bowel motility analysis in free breathing by comparing it to manual measurements. In all, 25 patients (15 male, 10 female; mean age 39 years) were included in this Institutional Review Board-approved, retrospective study. Magnetic resonance imaging (MRI) was performed on a 1.5T system after standardized preparation acquiring motility sequences in free breathing over 69-84 seconds. Small bowel motility was analyzed manually and with the software. Functional parameters, measurement time, and reproducibility were compared using the coefficient of variance and paired Student's t-test. Correlation was analyzed using Pearson's correlation coefficient and linear regression. The 25 segments were analyzed twice both by hand and using the software with automatic breathing correction. All assessed parameters significantly correlated between the methods (P < 0.01), but the scattering of repeated measurements was significantly (P < 0.01) lower using the software (3.90%, standard deviation [SD] ± 5.69) than manual examinations (9.77%, SD ± 11.08). The time needed was significantly less (P < 0.001) with the software (4.52 minutes, SD ± 1.58) compared to manual measurement, lasting 17.48 minutes for manual (SD ± 1.75 minutes). The use of the software proves reliable and faster small bowel motility measurements in free-breathing MRI compared to manual analyses. The new technique allows for analyses of prolonged sequences acquired in free breathing, improving the informative value of the examinations by amplifying the evaluable data. Copyright © 2013 Wiley Periodicals, Inc.
Kahn, Henry S; El ghormli, Laure; Jago, Russell; Foster, Gary D; McMurray, Robert G; Buse, John B; Stadler, Diane D; Treviño, Roberto P; Baranowski, Tom
2014-01-01
Convention defines pediatric adiposity by the body mass index z-score (BMIz) referenced to normative growth charts. Waist-to-height ratio (WHtR) does not depend on sex-and-age references. In the HEALTHY Study enrollment sample, we compared BMIz with WHtR for ability to identify adverse cardiometabolic risk. Among 5,482 sixth-grade students from 42 middle schools, we estimated explanatory variations (R (2)) and standardized beta coefficients of BMIz or WHtR for cardiometabolic risk factors: insulin resistance (HOMA-IR), lipids, blood pressures, and glucose. For each risk outcome variable, we prepared adjusted regression models for four subpopulations stratified by sex and high versus lower fatness. For HOMA-IR, R (2) attributed to BMIz or WHtR was 19%-28% among high-fatness and 8%-13% among lower-fatness students. R (2) for lipid variables was 4%-9% among high-fatness and 2%-7% among lower-fatness students. In the lower-fatness subpopulations, the standardized coefficients for total cholesterol/HDL cholesterol and triglycerides tended to be weaker for BMIz (0.13-0.20) than for WHtR (0.17-0.28). Among high-fatness students, BMIz and WHtR correlated with blood pressures for Hispanics and whites, but not black boys (systolic) or girls (systolic and diastolic). In 11-12 year olds, assessments by WHtR can provide cardiometabolic risk estimates similar to conventional BMIz without requiring reference to a normative growth chart.
Trophic magnification of PCBs and its relationship to the octanol-water partition coefficient
Walters, D.M.; Mills, M.A.; Cade, B.S.; Burkard, L.P.
2011-01-01
We investigated polychlorinated biphenyl (PCB) bioaccumulation relative to octanol-water partition coefficient (KOW) and organism trophic position (TP) at the Lake Hartwell Superfund site (South Carolina). We measured PCBs (127 congeners) and stable isotopes (??15N) in sediment, organic matter, phytoplankton, zooplankton, macroinvertebrates, and fish. TP, as calculated from ??15N, was significantly, positively related to PCB concentrations, and food web trophic magnification factors (TMFs) ranged from 1.5-6.6 among congeners. TMFs of individual congeners increased strongly with log KOW, as did the predictive power (r2) of individual TP-PCB regression models used to calculate TMFs. We developed log KOW-TMF models for eight food webs with vastly different environments (freshwater, marine, arctic, temperate) and species composition (cold- vs warmblooded consumers). The effect of KOW on congener TMFs varied strongly across food webs (model slopes 0.0-15.0) because the range of TMFs among studies was also highly variable. We standardized TMFs within studies to mean = 0, standard deviation (SD) = 1 to normalize for scale differences and found a remarkably consistent KOW effect on TMFs (no difference in model slopes among food webs). Our findings underscore the importance of hydrophobicity (as characterized by KOW) in regulating bioaccumulation of recalcitrant compounds in aquatic systems, and demonstrate that relationships between chemical KOW and bioaccumulation from field studies are more generalized than previously recognized. ?? This article not subject to U.S. Copyright. Published 2011 by the American Chemical Society.
Trophic magnification of PCBs and Its relationship to the octanol-water partition coefficient.
Walters, David M; Mills, Marc A; Cade, Brian S; Burkard, Lawrence P
2011-05-01
We investigated polychlorinated biphenyl (PCB) bioaccumulation relative to octanol-water partition coefficient (K(OW)) and organism trophic position (TP) at the Lake Hartwell Superfund site (South Carolina). We measured PCBs (127 congeners) and stable isotopes (δ¹⁵N) in sediment, organic matter, phytoplankton, zooplankton, macroinvertebrates, and fish. TP, as calculated from δ¹⁵N, was significantly, positively related to PCB concentrations, and food web trophic magnification factors (TMFs) ranged from 1.5-6.6 among congeners. TMFs of individual congeners increased strongly with log K(OW), as did the predictive power (r²) of individual TP-PCB regression models used to calculate TMFs. We developed log K(OW)-TMF models for eight food webs with vastly different environments (freshwater, marine, arctic, temperate) and species composition (cold- vs warmblooded consumers). The effect of K(OW) on congener TMFs varied strongly across food webs (model slopes 0.0-15.0) because the range of TMFs among studies was also highly variable. We standardized TMFs within studies to mean = 0, standard deviation (SD) = 1 to normalize for scale differences and found a remarkably consistent K(OW) effect on TMFs (no difference in model slopes among food webs). Our findings underscore the importance of hydrophobicity (as characterized by K(OW)) in regulating bioaccumulation of recalcitrant compounds in aquatic systems, and demonstrate that relationships between chemical K(OW) and bioaccumulation from field studies are more generalized than previously recognized.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Muir, B. R., E-mail: Bryan.Muir@nrc-cnrc.gc.ca
2015-04-15
Purpose: To analyze absorbed dose calibration coefficients, N{sub D,w}, measured at accredited dosimetry calibration laboratories (ADCLs) for client ionization chambers to study (i) variability among N{sub D,w} coefficients for chambers of the same type calibrated at each ADCL to investigate ion chamber volume fluctuations and chamber manufacturing tolerances; (ii) equivalency of ion chamber calibration coefficients measured at different ADCLs by intercomparing N{sub D,w} coefficients for chambers of the same type; and (iii) the long-term stability of N{sub D,w} coefficients for different chamber types by investigating repeated chamber calibrations. Methods: Large samples of N{sub D,w} coefficients for several chamber types measuredmore » over the time period between 1998 and 2014 were obtained from the three ADCLs operating in the United States. These are analyzed using various graphical and numerical statistical tests for the four chamber types with the largest samples of calibration coefficients to investigate (i) and (ii) above. Ratios of calibration coefficients for the same chamber, typically obtained two years apart, are calculated to investigate (iii) above and chambers with standard deviations of old/new ratios less than 0.3% meet stability requirements for accurate reference dosimetry recommended in dosimetry protocols. Results: It is found that N{sub D,w} coefficients for a given chamber type compared among different ADCLs may arise from differing probability distributions potentially due to slight differences in calibration procedures and/or the transfer of the primary standard. However, average N{sub D,w} coefficients from different ADCLs for given chamber types are very close with percent differences generally less than 0.2% for Farmer-type chambers and are well within reported uncertainties. Conclusions: The close agreement among calibrations performed at different ADCLs reaffirms the Calibration Laboratory Accreditation Subcommittee process of ensuring ADCL conformance with National Institute of Standards and Technology standards. This study shows that N{sub D,w} coefficients measured at different ADCLs are statistically equivalent, especially considering reasonable uncertainties. This analysis of N{sub D,w} coefficients also allows identification of chamber types that can be considered stable enough for accurate reference dosimetry.« less
Reimus, Paul W; Callahan, Timothy J; Ware, S Doug; Haga, Marc J; Counce, Dale A
2007-08-15
Diffusion cell experiments were conducted to measure nonsorbing solute matrix diffusion coefficients in forty-seven different volcanic rock matrix samples from eight different locations (with multiple depth intervals represented at several locations) at the Nevada Test Site. The solutes used in the experiments included bromide, iodide, pentafluorobenzoate (PFBA), and tritiated water ((3)HHO). The porosity and saturated permeability of most of the diffusion cell samples were measured to evaluate the correlation of these two variables with tracer matrix diffusion coefficients divided by the free-water diffusion coefficient (D(m)/D*). To investigate the influence of fracture coating minerals on matrix diffusion, ten of the diffusion cells represented paired samples from the same depth interval in which one sample contained a fracture surface with mineral coatings and the other sample consisted of only pure matrix. The log of (D(m)/D*) was found to be positively correlated with both the matrix porosity and the log of matrix permeability. A multiple linear regression analysis indicated that both parameters contributed significantly to the regression at the 95% confidence level. However, the log of the matrix diffusion coefficient was more highly-correlated with the log of matrix permeability than with matrix porosity, which suggests that matrix diffusion coefficients, like matrix permeabilities, have a greater dependence on the interconnectedness of matrix porosity than on the matrix porosity itself. The regression equation for the volcanic rocks was found to provide satisfactory predictions of log(D(m)/D*) for other types of rocks with similar ranges of matrix porosity and permeability as the volcanic rocks, but it did a poorer job predicting log(D(m)/D*) for rocks with lower porosities and/or permeabilities. The presence of mineral coatings on fracture walls did not appear to have a significant effect on matrix diffusion in the ten paired diffusion cell experiments.
NASA Astrophysics Data System (ADS)
Reimus, Paul W.; Callahan, Timothy J.; Ware, S. Doug; Haga, Marc J.; Counce, Dale A.
2007-08-01
Diffusion cell experiments were conducted to measure nonsorbing solute matrix diffusion coefficients in forty-seven different volcanic rock matrix samples from eight different locations (with multiple depth intervals represented at several locations) at the Nevada Test Site. The solutes used in the experiments included bromide, iodide, pentafluorobenzoate (PFBA), and tritiated water ( 3HHO). The porosity and saturated permeability of most of the diffusion cell samples were measured to evaluate the correlation of these two variables with tracer matrix diffusion coefficients divided by the free-water diffusion coefficient ( Dm/ D*). To investigate the influence of fracture coating minerals on matrix diffusion, ten of the diffusion cells represented paired samples from the same depth interval in which one sample contained a fracture surface with mineral coatings and the other sample consisted of only pure matrix. The log of ( Dm/ D*) was found to be positively correlated with both the matrix porosity and the log of matrix permeability. A multiple linear regression analysis indicated that both parameters contributed significantly to the regression at the 95% confidence level. However, the log of the matrix diffusion coefficient was more highly-correlated with the log of matrix permeability than with matrix porosity, which suggests that matrix diffusion coefficients, like matrix permeabilities, have a greater dependence on the interconnectedness of matrix porosity than on the matrix porosity itself. The regression equation for the volcanic rocks was found to provide satisfactory predictions of log( Dm/ D*) for other types of rocks with similar ranges of matrix porosity and permeability as the volcanic rocks, but it did a poorer job predicting log( Dm/ D*) for rocks with lower porosities and/or permeabilities. The presence of mineral coatings on fracture walls did not appear to have a significant effect on matrix diffusion in the ten paired diffusion cell experiments.
Multiple imputation for cure rate quantile regression with censored data.
Wu, Yuanshan; Yin, Guosheng
2017-03-01
The main challenge in the context of cure rate analysis is that one never knows whether censored subjects are cured or uncured, or whether they are susceptible or insusceptible to the event of interest. Considering the susceptible indicator as missing data, we propose a multiple imputation approach to cure rate quantile regression for censored data with a survival fraction. We develop an iterative algorithm to estimate the conditionally uncured probability for each subject. By utilizing this estimated probability and Bernoulli sample imputation, we can classify each subject as cured or uncured, and then employ the locally weighted method to estimate the quantile regression coefficients with only the uncured subjects. Repeating the imputation procedure multiple times and taking an average over the resultant estimators, we obtain consistent estimators for the quantile regression coefficients. Our approach relaxes the usual global linearity assumption, so that we can apply quantile regression to any particular quantile of interest. We establish asymptotic properties for the proposed estimators, including both consistency and asymptotic normality. We conduct simulation studies to assess the finite-sample performance of the proposed multiple imputation method and apply it to a lung cancer study as an illustration. © 2016, The International Biometric Society.
Measuring Retention in HIV Care: The Elusive Gold Standard
Mugavero, Michael J.; Westfall, Andrew O.; Zinski, Anne; Davila, Jessica; Drainoni, Mari-Lynn; Gardner, Lytt I.; Keruly, Jeanne C.; Malitz, Faye; Marks, Gary; Metsch, Lisa; Wilson, Tracey E.; Giordano, Thomas P.
2012-01-01
Background Measuring retention in HIV primary care is complex as care includes multiple visits scheduled at varying intervals over time. We evaluated six commonly used retention measures in predicting viral load (VL) suppression and the correlation among measures. Methods Clinic-wide patient-level data from six academic HIV clinics were used for 12-months preceding implementation of the CDC/HRSA Retention in Care intervention. Six retention measures were calculated for each patient based upon scheduled primary HIV provider visits: count and dichotomous missed visits, visit adherence, 6-month gap, 4-month visit constancy, and the HRSA HAB retention measure. Spearman correlation coefficients and separate unadjusted logistic regression models compared retention measures to one another and with 12-month VL suppression, respectively. The discriminatory capacity of each measure was assessed with the c-statistic. Results Among 10,053 patients, 8,235 (82%) had 12-month VL measures, with 6,304 (77%) achieving suppression (VL<400 c/mL). All six retention measures were significantly associated (P<0.0001) with VL suppression (OR;95%CI, c-statistic): missed visit count (0.73;0.71–0.75,0.67), missed visit dichotomous (3.2;2.8–3.6,0.62), visit adherence (3.9;3.5–4.3,0.69), gap (3.0;2.6–3.3,0.61), visit constancy (2.8;2.5–3.0,0.63), HRSA HAB (3.8;3.3–4.4,0.59). Measures incorporating “no show” visits were highly correlated (Spearman coefficient=0.83–0.85), as were measures based solely upon kept visits (Spearman coefficient=0.72–0.77). Correlation coefficients were lower across these two groups of measures (Range=0.16–0.57). Conclusions Six retention measures displayed a wide range of correlation with one another, yet each measure had significant association and modest discrimination for VL suppression. These data suggest there is no clear gold standard, and that selection of a retention measure may be tailored to context. PMID:23011397
Estimation of subsurface thermal structure using sea surface height and sea surface temperature
NASA Technical Reports Server (NTRS)
Kang, Yong Q. (Inventor); Jo, Young-Heon (Inventor); Yan, Xiao-Hai (Inventor)
2012-01-01
A method of determining a subsurface temperature in a body of water is disclosed. The method includes obtaining surface temperature anomaly data and surface height anomaly data of the body of water for a region of interest, and also obtaining subsurface temperature anomaly data for the region of interest at a plurality of depths. The method further includes regressing the obtained surface temperature anomaly data and surface height anomaly data for the region of interest with the obtained subsurface temperature anomaly data for the plurality of depths to generate regression coefficients, estimating a subsurface temperature at one or more other depths for the region of interest based on the generated regression coefficients and outputting the estimated subsurface temperature at the one or more other depths. Using the estimated subsurface temperature, signal propagation times and trajectories of marine life in the body of water are determined.
NASA Technical Reports Server (NTRS)
Rogers, R. H. (Principal Investigator)
1976-01-01
The author has identified the following significant results. Computer techniques were developed for mapping water quality parameters from LANDSAT data, using surface samples collected in an ongoing survey of water quality in Saginaw Bay. Chemical and biological parameters were measured on 31 July 1975 at 16 bay stations in concert with the LANDSAT overflight. Application of stepwise linear regression bands to nine of these parameters and corresponding LANDSAT measurements for bands 4 and 5 only resulted in regression correlation coefficients that varied from 0.94 for temperature to 0.73 for Secchi depth. Regression equations expressed with the pair of bands 4 and 5, rather than the ratio band 4/band 5, provided higher correlation coefficients for all the water quality parameters studied (temperature, Secchi depth, chloride, conductivity, total kjeldahl nitrogen, total phosphorus, chlorophyll a, total solids, and suspended solids).
Prediction of anthropometric foot characteristics in children.
Morrison, Stewart C; Durward, Brian R; Watt, Gordon F; Donaldson, Malcolm D C
2009-01-01
The establishment of growth reference values is needed in pediatric practice where pathologic conditions can have a detrimental effect on the growth and development of the pediatric foot. This study aims to use multiple regression to evaluate the effects of multiple predictor variables (height, age, body mass, and gender) on anthropometric characteristics of the peripubescent foot. Two hundred children aged 9 to 12 years were recruited, and three anthropometric measurements of the pediatric foot were recorded (foot length, forefoot width, and navicular height). Multiple regression analysis was conducted, and coefficients for gender, height, and body mass all had significant relationships for the prediction of forefoot width and foot length (P < or = .05, r > or = 0.7). The coefficients for gender and body mass were not significant for the prediction of navicular height (P > or = .05), whereas height was (P < or = .05). Normative growth reference values and prognostic regression equations are presented for the peripubescent foot.
Introducing diagnosis-related groups: is the information system ready?
Jian, Weiyan; Lu, Ming; Han, Wei; Hu, Mu
2016-01-01
Diagnosis-related group (DRG) system is a classification system widely used in health managements, the foundation of which lies in the medical information system. A large effort had been made to improve the quality of discharge data before the introduction of DRGs in Beijing. We extract discharge data from 108 local hospitals spanning 4 years before and after standardization to evaluate the impact of standardization on DRG grouping performance. The data was grouped on an annual basis in accordance with Beijing's local DRG system. Proportion of ungrouped data, coefficient of variation (CV) and reduction in variance (RIV) were used to measure the performance of the DRG system. Both the descriptive and regression analysis indicate a significant reduction in terms of ungrouped data and CV for expenditure, increase of RIV for expenditure and length of stay. However, when there was no intervention, that is, between 2005 and 2006 and between 2008 and 2009, changes in these indicators were all insignificant. Therefore, the standardization of discharge data did improve data quality and consequently enhanced the performance of DRGs. Developing countries with a relatively weak information infrastructure should strengthen their medical information system before the introduction of the DRG system. Copyright © 2014 John Wiley & Sons, Ltd.
Novel non-contact retina camera for the rat and its application to dynamic retinal vessel analysis
Link, Dietmar; Strohmaier, Clemens; Seifert, Bernd U.; Riemer, Thomas; Reitsamer, Herbert A.; Haueisen, Jens; Vilser, Walthard
2011-01-01
We present a novel non-invasive and non-contact system for reflex-free retinal imaging and dynamic retinal vessel analysis in the rat. Theoretical analysis was performed prior to development of the new optical design, taking into account the optical properties of the rat eye and its specific illumination and imaging requirements. A novel optical model of the rat eye was developed for use with standard optical design software, facilitating both sequential and non-sequential modes. A retinal camera for the rat was constructed using standard optical and mechanical components. The addition of a customized illumination unit and existing standard software enabled dynamic vessel analysis. Seven-minute in-vivo vessel diameter recordings performed on 9 Brown-Norway rats showed stable readings. On average, the coefficient of variation was (1.1 ± 0.19) % for the arteries and (0.6 ± 0.08) % for the veins. The slope of the linear regression analysis was (0.56 ± 0.26) % for the arteries and (0.15 ± 0.27) % for the veins. In conclusion, the device can be used in basic studies of retinal vessel behavior. PMID:22076270
New body fat prediction equations for severely obese patients.
Horie, Lilian Mika; Barbosa-Silva, Maria Cristina Gonzalez; Torrinhas, Raquel Susana; de Mello, Marco Túlio; Cecconello, Ivan; Waitzberg, Dan Linetzky
2008-06-01
Severe obesity imposes physical limitations to body composition assessment. Our aim was to compare body fat (BF) estimations of severely obese patients obtained by bioelectrical impedance (BIA) and air displacement plethysmography (ADP) for development of new equations for BF prediction. Severely obese subjects (83 female/36 male, mean age=41.6+/-11.6 years) had BF estimated by BIA and ADP. The agreement of the data was evaluated using Bland-Altman's graphic and concordance correlation coefficient (CCC). A multivariate regression analysis was performed to develop and validate new predictive equations. BF estimations from BIA (64.8+/-15 kg) and ADP (65.6+/-16.4 kg) did not differ (p>0.05, with good accuracy, precision, and CCC), but the Bland- Altman graphic showed a wide limit of agreement (-10.4; 8.8). The standard BIA equation overestimated BF in women (-1.3 kg) and underestimated BF in men (5.6 kg; p<0.05). Two BF new predictive equations were generated after BIA measurement, which predicted BF with higher accuracy, precision, CCC, and limits of agreement than the standard BIA equation. Standard BIA equations were inadequate for estimating BF in severely obese patients. Equations developed especially for this population provide more accurate BF assessment.
Determination of total phenolic compounds in compost by infrared spectroscopy.
Cascant, M M; Sisouane, M; Tahiri, S; Krati, M El; Cervera, M L; Garrigues, S; de la Guardia, M
2016-06-01
Middle and near infrared (MIR and NIR) were applied to determine the total phenolic compounds (TPC) content in compost samples based on models built by using partial least squares (PLS) regression. The multiplicative scatter correction, standard normal variate and first derivative were employed as spectra pretreatment, and the number of latent variable were optimized by leave-one-out cross-validation. The performance of PLS-ATR-MIR and PLS-DR-NIR models was evaluated according to root mean square error of cross validation and prediction (RMSECV and RMSEP), the coefficient of determination for prediction (Rpred(2)) and residual predictive deviation (RPD) being obtained for this latter values of 5.83 and 8.26 for MIR and NIR, respectively. Copyright © 2016 Elsevier B.V. All rights reserved.
Determination of Flavonoids in Wine by High Performance Liquid Chromatography
NASA Astrophysics Data System (ADS)
da Queija, Celeste; Queirós, M. A.; Rodrigues, Ligia M.
2001-02-01
The experiment presented is an application of HPLC to the analysis of flavonoids in wines, designed for students of instrumental methods. It is done in two successive 4-hour laboratory sessions. While the hydrolysis of the wines is in progress, the students prepare the calibration curves with standard solutions of flavonoids and calculate the regression lines and correlation coefficients. During the second session they analyze the hydrolyzed wine samples and calculate the concentrations of the flavonoids using the calibration curves obtained earlier. This laboratory work is very attractive to students because they deal with a common daily product whose components are reported to have preventive and therapeutic effects. Furthermore, students can execute preparative work and apply a more elaborate technique that is nowadays an indispensable tool in instrumental analysis.
Determination of teicoplanin concentrations in serum by high-pressure liquid chromatography.
Joos, B; Lüthy, R
1987-01-01
An isocratic reversed-phase high-pressure liquid chromatographic method for the determination of six components of the teicoplanin complex in biological fluid was developed. By using fluorescence detection after precolumn derivatization with fluorescamine, the assay is specific and highly sensitive, with reproducibility studies yielding coefficients of variation ranging from 1.5 to 8.5% (at 5 to 80 micrograms/ml). Response was linear from 2.5 to 80 micrograms/ml (r = 0.999); the recovery from spiked human serum was 76%. An external quality control was performed to compare this high-pressure liquid chromatographic method (H) with a standard microbiological assay (M); no significant deviation from slope = 1 and intercept = 0 was found by regression analysis (H = 1.03M - 0.45; n = 15). PMID:2957953
Methods for trend analysis: Examples with problem/failure data
NASA Technical Reports Server (NTRS)
Church, Curtis K.
1989-01-01
Statistics are emphasized as an important role in quality control and reliability. Consequently, Trend Analysis Techniques recommended a variety of statistical methodologies that could be applied to time series data. The major goal of the working handbook, using data from the MSFC Problem Assessment System, is to illustrate some of the techniques in the NASA standard, some different techniques, and to notice patterns of data. Techniques for trend estimation used are: regression (exponential, power, reciprocal, straight line) and Kendall's rank correlation coefficient. The important details of a statistical strategy for estimating a trend component are covered in the examples. However, careful analysis and interpretation is necessary because of small samples and frequent zero problem reports in a given time period. Further investigations to deal with these issues are being conducted.
Bradshaw, Elizabeth J; Keogh, Justin W L; Hume, Patria A; Maulder, Peter S; Nortje, Jacques; Marnewick, Michel
2009-06-01
The purpose of this study was to examine the role of neuromotor noise on golf swing performance in high- and low-handicap players. Selected two-dimensional kinematic measures of 20 male golfers (n=10 per high- or low-handicap group) performing 10 golf swings with a 5-iron club was obtained through video analysis. Neuromotor noise was calculated by deducting the standard error of the measurement from the coefficient of variation obtained from intra-individual analysis. Statistical methods included linear regression analysis and one-way analysis of variance using SPSS. Absolute invariance in the key technical positions (e.g., at the top of the backswing) of the golf swing appears to be a more favorable technique for skilled performance.
Imai, Masamichi; Ambale Venkatesh, Bharath; Samiei, Sanaz; Donekal, Sirisha; Habibi, Mohammadali; Armstrong, Anderson C.; Heckbert, Susan R.; Wu, Colin O.; Bluemke, David A.
2014-01-01
Purpose To investigate the association between left atrial (LAleft atrium) function and left ventricular myocardial fibrosis using cardiac magnetic resonance (MR) imaging in a multi-ethnic population. Materials and Methods For this HIPAA-compliant study, the institutional review board at each participating center approved the study protocol, and all participants provided informed consent. Of 2839 participants who had undergone cardiac MR in 2010–2012, 143 participants with myocardial scar determined with late gadolinium enhancement and 286 age-, sex-, and ethnicity-matched control participants were identified. LAleft atrium volume, strain, and strain rate were analyzed by using multimodality tissue tracking from cine MR imaging. T1 mapping was applied to assess diffuse myocardial fibrosis. The association between LAleft atrium parameters and myocardial fibrosis was evaluated with the Student t test and multivariable regression analysis. Results The scar group had significantly higher minimum LAleft atrium volume than the control group (mean, 22.0 ± 10.5 [standard deviation] vs 19.0 ± 7.8, P = .002) and lower LAleft atrium ejection fraction (45.9 ± 10.7 vs 51.3 ± 8.7, P < .001), maximal LAleft atrium strain (Smaxmaximum LA strain) (25.4 ± 10.7 vs 30.6 ± 10.6, P < .001) and maximum LAleft atrium strain rate (SRmaxmaximum LA strain rate) (1.08 ± 0.45 vs 1.29 ± 0.51, P < .001), and lower absolute LAleft atrium strain rate at early diastolic peak (SRELA strain rate at early diastolic peak) (−0.77 ± 0.42 vs −1.01 ± 0.48, P < .001) and LAleft atrium strain rate at atrial contraction peak (SRALA strain rate at atrial contraction peak) (−1.50 ± 0.62 vs −1.78 ± 0.69, P < .001) than the control group. T1 time 12 minutes after contrast material injection was significantly associated with Smaxmaximum LA strain (β coefficient = 0.043, P = .013), SRmaxmaximum LA strain rate (β coefficient = 0.0025, P = .001), SRELA strain rate at early diastolic peak (β coefficient = −0.0016, P = .027), and SRALA strain rate at atrial contraction peakLA strain rate at atrial contraction peak (β coefficient −0.0028, P = .01) in the regression model. T1 time 25 minutes after contrast material injection was significantly associated with SRmaxmaximum LA strain rate (β coefficient = 0.0019, P = .016) and SRALA strain rate at atrial contraction peak (β coefficient = −0.0022, P = .034). Conclusion Reduced LAleft atrium regional and global function are related to both replacement and diffuse myocardial fibrosis processes. Clinical trial registration no. NCT00005487 © RSNA, 2014 Online supplemental material is available for this article. PMID:25019562
Mean centering, multicollinearity, and moderators in multiple regression: The reconciliation redux.
Iacobucci, Dawn; Schneider, Matthew J; Popovich, Deidre L; Bakamitsos, Georgios A
2017-02-01
In this article, we attempt to clarify our statements regarding the effects of mean centering. In a multiple regression with predictors A, B, and A × B (where A × B serves as an interaction term), mean centering A and B prior to computing the product term can clarify the regression coefficients (which is good) and the overall model fit R 2 will remain undisturbed (which is also good).
To, Minh-Son; Prakash, Shivesh; Poonnoose, Santosh I; Bihari, Shailesh
2018-05-01
The study uses meta-regression analysis to quantify the dose-dependent effects of statin pharmacotherapy on vasospasm, delayed ischemic neurologic deficits (DIND), and mortality in aneurysmal subarachnoid hemorrhage. Prospective, retrospective observational studies, and randomized controlled trials (RCTs) were retrieved by a systematic database search. Summary estimates were expressed as absolute risk (AR) for a given statin dose or control (placebo). Meta-regression using inverse variance weighting and robust variance estimation was performed to assess the effect of statin dose on transformed AR in a random effects model. Dose-dependence of predicted AR with 95% confidence interval (CI) was recovered by using Miller's Freeman-Tukey inverse. The database search and study selection criteria yielded 18 studies (2594 patients) for analysis. These included 12 RCTs, 4 retrospective observational studies, and 2 prospective observational studies. Twelve studies investigated simvastatin, whereas the remaining studies investigated atorvastatin, pravastatin, or pitavastatin, with simvastatin-equivalent doses ranging from 20 to 80 mg. Meta-regression revealed dose-dependent reductions in Freeman-Tukey-transformed AR of vasospasm (slope coefficient -0.00404, 95% CI -0.00720 to -0.00087; P = 0.0321), DIND (slope coefficient -0.00316, 95% CI -0.00586 to -0.00047; P = 0.0392), and mortality (slope coefficient -0.00345, 95% CI -0.00623 to -0.00067; P = 0.0352). The present meta-regression provides weak evidence for dose-dependent reductions in vasospasm, DIND and mortality associated with acute statin use after aneurysmal subarachnoid hemorrhage. However, the analysis was limited by substantial heterogeneity among individual studies. Greater dosing strategies are a potential consideration for future RCTs. Copyright © 2018 Elsevier Inc. All rights reserved.
Cui, Yang; Wang, Silong; Yan, Shaokui
2016-01-01
Phi coefficient directly depends on the frequencies of occurrence of organisms and has been widely used in vegetation ecology to analyse the associations of organisms with site groups, providing a characterization of ecological preference, but its application in soil ecology remains rare. Based on a single field experiment, this study assessed the applicability of phi coefficient in indicating the habitat preferences of soil fauna, through comparing phi coefficient-induced results with those of ordination methods in charactering soil fauna-habitat(factors) relationships. Eight different habitats of soil fauna were implemented by reciprocal transfer of defaunated soil cores between two types of subtropical forests. Canonical correlation analysis (CCorA) showed that ecological patterns of fauna-habitat relationships and inter-fauna taxa relationships expressed, respectively, by phi coefficients and predicted abundances calculated from partial redundancy analysis (RDA), were extremely similar, and a highly significant relationship between the two datasets was observed (Pillai's trace statistic = 1.998, P = 0.007). In addition, highly positive correlations between phi coefficients and predicted abundances for Acari, Collembola, Nematode and Hemiptera were observed using linear regression analysis. Quantitative relationships between habitat preferences and soil chemical variables were also obtained by linear regression, which were analogous to the results displayed in a partial RDA biplot. Our results suggest that phi coefficient could be applicable on a local scale in evaluating habitat preferences of soil fauna at coarse taxonomic levels, and that the phi coefficient-induced information, such as ecological preferences and the associated quantitative relationships with habitat factors, will be largely complementary to the results of ordination methods. The application of phi coefficient in soil ecology may extend our knowledge about habitat preferences and distribution-abundance relationships, which will benefit the understanding of biodistributions and variations in community compositions in the soil. Similar studies in other places and scales apart from our local site will be need for further evaluation of phi coefficient.
Cui, Yang; Wang, Silong; Yan, Shaokui
2016-01-01
Phi coefficient directly depends on the frequencies of occurrence of organisms and has been widely used in vegetation ecology to analyse the associations of organisms with site groups, providing a characterization of ecological preference, but its application in soil ecology remains rare. Based on a single field experiment, this study assessed the applicability of phi coefficient in indicating the habitat preferences of soil fauna, through comparing phi coefficient-induced results with those of ordination methods in charactering soil fauna-habitat(factors) relationships. Eight different habitats of soil fauna were implemented by reciprocal transfer of defaunated soil cores between two types of subtropical forests. Canonical correlation analysis (CCorA) showed that ecological patterns of fauna-habitat relationships and inter-fauna taxa relationships expressed, respectively, by phi coefficients and predicted abundances calculated from partial redundancy analysis (RDA), were extremely similar, and a highly significant relationship between the two datasets was observed (Pillai's trace statistic = 1.998, P = 0.007). In addition, highly positive correlations between phi coefficients and predicted abundances for Acari, Collembola, Nematode and Hemiptera were observed using linear regression analysis. Quantitative relationships between habitat preferences and soil chemical variables were also obtained by linear regression, which were analogous to the results displayed in a partial RDA biplot. Our results suggest that phi coefficient could be applicable on a local scale in evaluating habitat preferences of soil fauna at coarse taxonomic levels, and that the phi coefficient-induced information, such as ecological preferences and the associated quantitative relationships with habitat factors, will be largely complementary to the results of ordination methods. The application of phi coefficient in soil ecology may extend our knowledge about habitat preferences and distribution-abundance relationships, which will benefit the understanding of biodistributions and variations in community compositions in the soil. Similar studies in other places and scales apart from our local site will be need for further evaluation of phi coefficient. PMID:26930593
Suppressor Variables: The Difference between "Is" versus "Acting As"
ERIC Educational Resources Information Center
Ludlow, Larry; Klein, Kelsey
2014-01-01
Correlated predictors in regression models are a fact of life in applied social science research. The extent to which they are correlated will influence the estimates and statistics associated with the other variables they are modeled along with. These effects, for example, may include enhanced regression coefficients for the other variables--a…
Causal Models with Unmeasured Variables: An Introduction to LISREL.
ERIC Educational Resources Information Center
Wolfle, Lee M.
Whenever one uses ordinary least squares regression, one is making an implicit assumption that all of the independent variables have been measured without error. Such an assumption is obviously unrealistic for most social data. One approach for estimating such regression models is to measure implied coefficients between latent variables for which…
Using the Ridge Regression Procedures to Estimate the Multiple Linear Regression Coefficients
NASA Astrophysics Data System (ADS)
Gorgees, HazimMansoor; Mahdi, FatimahAssim
2018-05-01
This article concerns with comparing the performance of different types of ordinary ridge regression estimators that have been already proposed to estimate the regression parameters when the near exact linear relationships among the explanatory variables is presented. For this situations we employ the data obtained from tagi gas filling company during the period (2008-2010). The main result we reached is that the method based on the condition number performs better than other methods since it has smaller mean square error (MSE) than the other stated methods.
Ghosh, Adarsh; Singh, Tulika; Singla, Veenu; Bagga, Rashmi; Khandelwal, Niranjan
2017-12-01
Apparent diffusion coefficient (ADC) maps are usually generated by builtin software provided by the MRI scanner vendors; however, various open-source postprocessing software packages are available for image manipulation and parametric map generation. The purpose of this study is to establish the reproducibility of absolute ADC values obtained using different postprocessing software programs. DW images with three b values were obtained with a 1.5-T MRI scanner, and the trace images were obtained. ADC maps were automatically generated by the in-line software provided by the vendor during image generation and were also separately generated on postprocessing software. These ADC maps were compared on the basis of ROIs using paired t test, Bland-Altman plot, mountain plot, and Passing-Bablok regression plot. There was a statistically significant difference in the mean ADC values obtained from the different postprocessing software programs when the same baseline trace DW images were used for the ADC map generation. For using ADC values as a quantitative cutoff for histologic characterization of tissues, standardization of the postprocessing algorithm is essential across processing software packages, especially in view of the implementation of vendor-neutral archiving.
The design of control system of livestock feeding processing
NASA Astrophysics Data System (ADS)
Sihombing, Juna; Napitupulu, Humala L.; Hidayati, Juliza
2018-03-01
PT. XYZ is a company that produces animal feed. One type of animal feed produced is 105 ISA P. In carrying out its production process, PT. XYZ faces the problem of rejected feed amounts during 2014 to June 2015 due to the amount of animal feed that exceeds the standard feed quality of 13% of moisture content and 3% for ash content. Therefore, the researchers analyzed the relationship between factors affecting the quality and extent of damage by using regression and correlation and determine the optimum value of each processing process. Analysis results found that variables affecting product quality are mixing time, steam conditioning temperature and cooling time. The most dominant variable affecting the product moisture content is mixing time with the correlation coefficient of (0.7959) and the most dominant variable affecting the ash content of the product during the processing is mixing time with the correlation coefficient of (0.8541). The design of the proposed product processing control is to run the product processing process with mixing time 235 seconds, steam conditioning temperature 87 0C and cooling time 192 seconds. Product quality 105 ISA P obtained by using this design is with 12.16% moisture content and ash content of 2.59%.
Effects of porosity on weld-joint tensile strength of aluminum alloys
NASA Technical Reports Server (NTRS)
Lovoy, C. V.
1974-01-01
Tensile properties in defect-free weldments of aluminum alloys 2014-T6 and 2219-T87 (sheet and plate) are shown to be related to the level or concentration of induced simulated porosity. The scatter diagram shows that the ultimate tensile strength of the weldments displays the most pronounced linear relationship with the level of porosity. The relationships between yield strength or elongation and porosity are either trivial or inconsequential in the lower and intermediate levels of porosity content. In highly concentrated levels of porosity, both yield strength and elongation values decrease markedly. Correlation coefficients were obtained by simple straight line regression analysis between the variables of ultimate tensile strength and pore level. The coefficients were greater, indicating a better correlation, using a pore area accumulation concept or pore volume accumulation than the accumulation of the pore diameters. These relationships provide a useful tool for assessing the existing aerospace radiographic acceptance standards with respect to permissible porosity. In addition, these relationships, in combination with known design load requirements, will serve as an engineering guideline in determining when a weld repair is necessary based on accumulative pore level as detected by radiographic techniques.
Ferguson, Christopher J
2015-09-01
This article responds to five comments on my "Angry Birds" meta-analysis of video game influences on children (Ferguson, 2015, this issue). Given ongoing debates on video game influences, comments varied from the supportive to the self-proclaimed "angry," yet hopefully they and this response will contribute to constructive discussion as the field moves forward. In this reply, I address some misconceptions in the comments and present data that challenge the assumption that standardized regression coefficients are invariably unsuitable for meta-analysis or that bivariate correlations are invariably suitable for meta-analysis. The suitability of any data should be considered on a case-by-case basis, and data indicates that the coefficients included in the "Angry Birds" meta-analysis did not distort results. Study selection, effect size extraction, and interpretation improved upon problematic issues in other recent meta-analyses. Further evidence is also provided to support the contention that publication bias remains problematic in video game literature. Sources of acrimony among scholars are explored as are areas of agreement. Ultimately, debates will only be resolved through a commitment to newer, more rigorous methods and open science. © The Author(s) 2015.
Brain-water diffusion coefficients reflect the severity of inherited prion disease
Hyare, H.; Wroe, S.; Siddique, D.; Webb, T.; Fox, N. C.; Stevens, J.; Collinge, J.; Yousry, T.; Thornton, J. S.
2010-01-01
Objective: Inherited prion diseases are progressive neurodegenerative conditions, characterized by cerebral spongiosis, gliosis, and neuronal loss, caused by mutations within the prion protein (PRNP) gene. We wished to assess the potential of diffusion-weighted MRI as a biomarker of disease severity in inherited prion diseases. Methods: Twenty-five subjects (mean age 45.2 years) with a known PRNP mutation including 19 symptomatic patients, 6 gene-positive asymptomatic subjects, and 7 controls (mean age 54.1 years) underwent conventional and diffusion-weighted MRI. An index of normalized brain volume (NBV) and region of interest (ROI) mean apparent diffusion coefficient (ADC) for the head of caudate, putamen, and pulvinar nuclei were recorded. ADC histograms were computed for whole brain (WB) and gray matter (GM) tissue fractions. Clinical assessment utilized standardized clinical scores. Mann-Whitney U test and regression analyses were performed. Results: Symptomatic patients exhibited an increased WB mean ADC (p = 0.006) and GM mean ADC (p = 0.024) compared to controls. Decreased NBV and increased mean ADC measures significantly correlated with clinical measures of disease severity. Using a stepwise multivariate regression procedure, GM mean ADC was an independent predictor of Clinician's Dementia Rating score (p = 0.001), Barthel Index of activities of daily living (p = 0.001), and Rankin disability score (p = 0.019). Conclusions: Brain volume loss in inherited prion diseases is accompanied by increased cerebral apparent diffusion coefficient (ADC), correlating with increased disease severity. The association between gray matter ADC and clinical neurologic status suggests this measure may prove a useful biomarker of disease activity in inherited prion diseases. GLOSSARY ADAS-Cog = Alzheimer's Disease Assessment Scale–Cognitive subscale; ADC = apparent diffusion coefficient; ADL = Barthel Activities of Daily Living scale; BET = brain extraction tool; BPRS = Brief Psychiatric Rating Scale; BSE = bovine spongiform encephalopathy; CDR = Clinician's Dementia Rating Scale; CGIS = Clinician's Global Impression of Disease; CI = confidence interval; DWI = diffusion-weighted imaging; FLAIR = fluid-attenuated inversion recovery; FOV = field of view; GM = gray matter; LC = left head of caudate; LP = left putamen; LPu = left pulvinar; MMSE = Mini-Mental State Examination; NBV = normalized brain volume; PH = peak height; PL = peak location; RC = right head of caudate; RP = right putamen; RPu = right pulvinar; ROI = region of interest; sCJD = sporadic Creutzfeldt-Jakob disease; TE = echo time; TI = inversion time; TR = repetition time; vCJD = variant Creutzfeldt-Jakob disease; WB = whole brain; WM = white matter. PMID:20177119
Association of Face-lift Surgery With Social Perception, Age, Attractiveness, Health, and Success.
Nellis, Jason C; Ishii, Masaru; Papel, Ira D; Kontis, Theda C; Byrne, Patrick J; Boahene, Kofi D O; Bater, Kristin L; Ishii, Lisa E
2017-07-01
Evidence quantifying the influence of face-lift surgery on societal perceptions is lacking. To measure the association of face-lift surgery with observer-graded perceived age, attractiveness, success, and overall health. In a web-based survey, 526 casual observers naive to the purpose of the study viewed independent images of 13 unique female patient faces before or after face-lift surgery from January 1, 2016, through June 30, 2016. The Delphi method was used to select standardized patient images confirming appropriate patient candidacy and overall surgical effect. Observers estimated age and rated the attractiveness, perceived success, and perceived overall health for each patient image. Facial perception questions were answered on a visual analog scale from 0 to 100, with higher scores corresponding to more positive responses. To evaluate the accuracy of observer age estimation, the patients' preoperative estimated mean age was compared with the patients' actual mean age. A multivariate mixed-effects regression model was used to determine the effect of face-lift surgery. To further characterize the effect of face-lift surgery, estimated ordinal-rank change was calculated for each domain. Blinded casual observer ratings of patients estimated age, attractiveness, perceived success, and perceived overall health. A total of 483 observers (mean [SD] age, 29 [8.6] years; 382 women [79.4%]) successfully completed the survey. Comparing patients' preoperative estimated mean (SD) age (59.6 [9.0] years) and patients' actual mean (SD) age (58.4 [6.9] years) revealed no significant difference (t2662 = -0.47; 95% CI, -6.07 to 3.72; P = .64). On multivariate regression, patients after face-lift surgery were rated as significantly younger (coefficient, -3.69; 95% CI -4.15 to -3.23; P < .001), more attractive (coefficient, 8.21; 95% CI, 7.41-9.02; P < .001), more successful (coefficient, 5.82; 95% CI, 5.05 to 6.59; P < .001), and overall healthier (coefficient, 8.72; 95% CI, 7.88-9.56; P < .001). The ordinal rank changes for an average individual were -21 for perceived age, 21 for attractiveness, 16 for success, and 21 for overall health. In this study, observer perceptions of face-lift surgery were associated with views that patients appeared younger, more attractive, healthier, and more successful. These findings highlight observer perceptions of face-lift surgery that could positively influence social interactions. NA.
Ali, Hina; Saleem, Muhammad; Anser, Muhammad Ramzan; Khan, Saranjam; Ullah, Rahat; Bilal, Muhammad
2018-01-01
Due to high price and nutritional values of extra virgin olive oil (EVOO), it is vulnerable to adulteration internationally. Refined oil or other vegetable oils are commonly blended with EVOO and to unmask such fraud, quick, and reliable technique needs to be standardized and developed. Therefore, in this study, adulteration of edible oil (sunflower oil) is made with pure EVOO and analyzed using fluorescence spectroscopy (excitation wavelength at 350 nm) in conjunction with principal component analysis (PCA) and partial least squares (PLS) regression. Fluorescent spectra contain fingerprints of chlorophyll and carotenoids that are characteristics of EVOO and differentiated it from sunflower oil. A broad intense hump corresponding to conjugated hydroperoxides is seen in sunflower oil in the range of 441-489 nm with the maximum at 469 nm whereas pure EVOO has low intensity doublet peaks in this region at 441 nm and 469 nm. Visible changes in spectra are observed in adulterated EVOO by increasing the concentration of sunflower oil, with an increase in doublet peak and correspondingly decrease in chlorophyll peak intensity. Principal component analysis showed a distinct clustering of adulterated samples of different concentrations. Subsequently, the PLS regression model was best fitted over the complete data set on the basis of coefficient of determination (R 2 ), standard error of calibration (SEC), and standard error of prediction (SEP) of values 0.99, 0.617, and 0.623 respectively. In addition to adulterant, test samples and imported commercial brands of EVOO were also used for prediction and validation of the models. Fluorescence spectroscopy combined with chemometrics showed its robustness to identify and quantify the specified adulterant in pure EVOO.
Procedures for adjusting regional regression models of urban-runoff quality using local data
Hoos, A.B.; Sisolak, J.K.
1993-01-01
Statistical operations termed model-adjustment procedures (MAP?s) can be used to incorporate local data into existing regression models to improve the prediction of urban-runoff quality. Each MAP is a form of regression analysis in which the local data base is used as a calibration data set. Regression coefficients are determined from the local data base, and the resulting `adjusted? regression models can then be used to predict storm-runoff quality at unmonitored sites. The response variable in the regression analyses is the observed load or mean concentration of a constituent in storm runoff for a single storm. The set of explanatory variables used in the regression analyses is different for each MAP, but always includes the predicted value of load or mean concentration from a regional regression model. The four MAP?s examined in this study were: single-factor regression against the regional model prediction, P, (termed MAP-lF-P), regression against P,, (termed MAP-R-P), regression against P, and additional local variables (termed MAP-R-P+nV), and a weighted combination of P, and a local-regression prediction (termed MAP-W). The procedures were tested by means of split-sample analysis, using data from three cities included in the Nationwide Urban Runoff Program: Denver, Colorado; Bellevue, Washington; and Knoxville, Tennessee. The MAP that provided the greatest predictive accuracy for the verification data set differed among the three test data bases and among model types (MAP-W for Denver and Knoxville, MAP-lF-P and MAP-R-P for Bellevue load models, and MAP-R-P+nV for Bellevue concentration models) and, in many cases, was not clearly indicated by the values of standard error of estimate for the calibration data set. A scheme to guide MAP selection, based on exploratory data analysis of the calibration data set, is presented and tested. The MAP?s were tested for sensitivity to the size of a calibration data set. As expected, predictive accuracy of all MAP?s for the verification data set decreased as the calibration data-set size decreased, but predictive accuracy was not as sensitive for the MAP?s as it was for the local regression models.
Stature estimation from the lengths of the growing foot-a study on North Indian adolescents.
Krishan, Kewal; Kanchan, Tanuj; Passi, Neelam; DiMaggio, John A
2012-12-01
Stature estimation is considered as one of the basic parameters of the investigation process in unknown and commingled human remains in medico-legal case work. Race, age and sex are the other parameters which help in this process. Stature estimation is of the utmost importance as it completes the biological profile of a person along with the other three parameters of identification. The present research is intended to formulate standards for stature estimation from foot dimensions in adolescent males from North India and study the pattern of foot growth during the growing years. 154 male adolescents from the Northern part of India were included in the study. Besides stature, five anthropometric measurements that included the length of the foot from each toe (T1, T2, T3, T4, and T5 respectively) to pternion were measured on each foot. The data was analyzed statistically using Student's t-test, Pearson's correlation, linear and multiple regression analysis for estimation of stature and growth of foot during ages 13-18 years. Correlation coefficients between stature and all the foot measurements were found to be highly significant and positively correlated. Linear regression models and multiple regression models (with age as a co-variable) were derived for estimation of stature from the different measurements of the foot. Multiple regression models (with age as a co-variable) estimate stature with greater accuracy than the regression models for 13-18 years age group. The study shows the growth pattern of feet in North Indian adolescents and indicates that anthropometric measurements of the foot and its segments are valuable in estimation of stature in growing individuals of that population. Copyright © 2012 Elsevier Ltd. All rights reserved.
The importance of regional models in assessing canine cancer incidences in Switzerland
Leyk, Stefan; Brunsdon, Christopher; Graf, Ramona; Pospischil, Andreas; Fabrikant, Sara Irina
2018-01-01
Fitting canine cancer incidences through a conventional regression model assumes constant statistical relationships across the study area in estimating the model coefficients. However, it is often more realistic to consider that these relationships may vary over space. Such a condition, known as spatial non-stationarity, implies that the model coefficients need to be estimated locally. In these kinds of local models, the geographic scale, or spatial extent, employed for coefficient estimation may also have a pervasive influence. This is because important variations in the local model coefficients across geographic scales may impact the understanding of local relationships. In this study, we fitted canine cancer incidences across Swiss municipal units through multiple regional models. We computed diagnostic summaries across the different regional models, and contrasted them with the diagnostics of the conventional regression model, using value-by-alpha maps and scalograms. The results of this comparative assessment enabled us to identify variations in the goodness-of-fit and coefficient estimates. We detected spatially non-stationary relationships, in particular, for the variables related to biological risk factors. These variations in the model coefficients were more important at small geographic scales, making a case for the need to model canine cancer incidences locally in contrast to more conventional global approaches. However, we contend that prior to undertaking local modeling efforts, a deeper understanding of the effects of geographic scale is needed to better characterize and identify local model relationships. PMID:29652921
The importance of regional models in assessing canine cancer incidences in Switzerland.
Boo, Gianluca; Leyk, Stefan; Brunsdon, Christopher; Graf, Ramona; Pospischil, Andreas; Fabrikant, Sara Irina
2018-01-01
Fitting canine cancer incidences through a conventional regression model assumes constant statistical relationships across the study area in estimating the model coefficients. However, it is often more realistic to consider that these relationships may vary over space. Such a condition, known as spatial non-stationarity, implies that the model coefficients need to be estimated locally. In these kinds of local models, the geographic scale, or spatial extent, employed for coefficient estimation may also have a pervasive influence. This is because important variations in the local model coefficients across geographic scales may impact the understanding of local relationships. In this study, we fitted canine cancer incidences across Swiss municipal units through multiple regional models. We computed diagnostic summaries across the different regional models, and contrasted them with the diagnostics of the conventional regression model, using value-by-alpha maps and scalograms. The results of this comparative assessment enabled us to identify variations in the goodness-of-fit and coefficient estimates. We detected spatially non-stationary relationships, in particular, for the variables related to biological risk factors. These variations in the model coefficients were more important at small geographic scales, making a case for the need to model canine cancer incidences locally in contrast to more conventional global approaches. However, we contend that prior to undertaking local modeling efforts, a deeper understanding of the effects of geographic scale is needed to better characterize and identify local model relationships.
A proof for Rhiel's range estimator of the coefficient of variation for skewed distributions.
Rhiel, G Steven
2007-02-01
In this research study is proof that the coefficient of variation (CV(high-low)) calculated from the highest and lowest values in a set of data is applicable to specific skewed distributions with varying means and standard deviations. Earlier Rhiel provided values for d(n), the standardized mean range, and a(n), an adjustment for bias in the range estimator of micro. These values are used in estimating the coefficient of variation from the range for skewed distributions. The d(n) and an values were specified for specific skewed distributions with a fixed mean and standard deviation. In this proof it is shown that the d(n) and an values are applicable for the specific skewed distributions when the mean and standard deviation can take on differing values. This will give the researcher confidence in using this statistic for skewed distributions regardless of the mean and standard deviation.
Sun, Wei; Chou, Chih-Ping; Stacy, Alan W; Ma, Huiyan; Unger, Jennifer; Gallaher, Peggy
2007-02-01
Cronbach's a is widely used in social science research to estimate the internal consistency of reliability of a measurement scale. However, when items are not strictly parallel, the Cronbach's a coefficient provides a lower-bound estimate of true reliability, and this estimate may be further biased downward when items are dichotomous. The estimation of standardized Cronbach's a for a scale with dichotomous items can be improved by using the upper bound of coefficient phi. SAS and SPSS macros have been developed in this article to obtain standardized Cronbach's a via this method. The simulation analysis showed that Cronbach's a from upper-bound phi might be appropriate for estimating the real reliability when standardized Cronbach's a is problematic.
Zer, Matan; Lindner, Arie; Greenstein, Alexander; Leibovici, Dan
2011-07-01
Academic careers of individual doctors are commonly evaluated by examining the number and quality of authored publications. Similarly, the extent and quality of medical research may be assessed nationwide by measuring the number of publications originating from the country of interest over time. This in turn, may indicate on the quality of medicine practiced. To evaluate the extent and quality of IsraeLi publications we measured the rate and quality of medical publications originating from Israel for two decades in the fields of urology, cardiology and orthopedics, and compared the data to those of other countries. Leading journals in urology, cardiology, and orthopedics were selected. A Medline search (http://www.ncbi.ntm.nih.gov/sites/entrez] was conducted for all the publications originating in Israel between the years 1990-2009 in the selected journals. Data from Israel was compared to those from Italy, France, Germany, Egypt and Turkey. The change in rate of publications was tested using Linear regression. The quality of publications was calculated by multiplying the number of publications by the relevant impact factor. While the urology publications rate in Israel increased by 32.7% in the second study decade as compared with the first, the urology publication rates during the same time period from Italy, France, Germany, Egypt and Turkey were 199%, 115%, 184%, 180% and 227% respectively. The regression coefficient for the urology publication rate was 0.51 for Israel, and 0.78, 0.95, 0.78, 0.87 and 0.97 for the other countries, respectively. The regression coefficient for the change in the quality of publications from Israel was 0.31 and 0.81, 0.75, 0.92, 0.73, and 0.92 for the other countries, respectively. In cardiology, the Israeli publication rate increased by 26% during the second study decade, whereas in the other countries the increments were 46%, 35%, 76%, 80% and 309% respectively. The regression coefficient for Israeli pubLication rate was 0.45, and 0.78, 0.54, 0.62, 0.13 and 0.75 for the other countries, respectively. The regression coefficient of the quality of publications in Israel was 0.3 as opposed to 0.47, 0.36, 0.48, 0.01, and 0.78 respectively. The Israeli publications in orthopedics increased by 9.3% during the second decade compared with the first. At the same time, other countries increased the publication rate in orthopedics by 69%, 121%, 173%, 140% and 296% respectively. The regression coefficient for the publication rate in orthopedics was 0.02 for Israel, and 0.62, 0.64, 0.78, 0.34 and 0.71 for the other countries, respectively. The regression coefficient of the quality of publications in Israel was 0.05 as opposed to 0.67, 0.62, 0.75, 0.31, and 0.66 in the other countries, respectively. Israel lags behind Italy, France, Germany, Egypt and Turkey with regard to the increase of both the number and the quality of medical publications in urology and orthopedics. While the rate and quality of IsraeLi publications in cardiology surpasses those from Egypt, they lag in the number of publications in this medical field behind those of all the rest of the countries examined. In a world of rapid progress and expansion of medical research, Israel has been stagnant in publications in 3 medical specialties, rendering it inferior to other nations.
Batterham, Philip J; Bunce, David; Mackinnon, Andrew J; Christensen, Helen
2014-01-01
very few studies have examined the association between intra-individual reaction time variability and subsequent mortality. Furthermore, the ability of simple measures of variability to predict mortality has not been compared with more complex measures. a prospective cohort study of 896 community-based Australian adults aged 70+ were interviewed up to four times from 1990 to 2002, with vital status assessed until June 2007. From this cohort, 770-790 participants were included in Cox proportional hazards regression models of survival. Vital status and time in study were used to conduct survival analyses. The mean reaction time and three measures of intra-individual reaction time variability were calculated separately across 20 trials of simple and choice reaction time tasks. Models were adjusted for a range of demographic, physical health and mental health measures. greater intra-individual simple reaction time variability, as assessed by the raw standard deviation (raw SD), coefficient of variation (CV) or the intra-individual standard deviation (ISD), was strongly associated with an increased hazard of all-cause mortality in adjusted Cox regression models. The mean reaction time had no significant association with mortality. intra-individual variability in simple reaction time appears to have a robust association with mortality over 17 years. Health professionals such as neuropsychologists may benefit in their detection of neuropathology by supplementing neuropsychiatric testing with the straightforward process of testing simple reaction time and calculating raw SD or CV.
Food Polyamine and Cardiovascular Disease -An Epidemiological Study-
Soda, Kuniyasu; Kano, Yoshihiko; Chiba, Fumihiro
2012-01-01
The purpose of this study was to examine the contribution of dietary polyamines toward preventing cardiovascular disease (CVD). Age-standardized mortality rates as well as other relevant information regarding individuals with CVD were gathered from the World Health Organization and the International Monetary Fund in 48 different European and other Western countries. Food supply data were collected from the database of the United Nations, and the amount of dietary polyamines was estimated by using polyamine concentrations in foods from published sources. The association between CVD mortality and the amount of polyamines was investigated by performing a series of multiple linear regression analyses. Analyses using factors known to modulate the risk of CVD including: Gross Domestic Product (GDP) (standardized regression coefficient (r) = -0.786, p < 0.001) and the amount of fruits, vegetable, nuts, and beans (r = -0.183, p = 0.001) but not including polyamines, showed negative associations with CVD, while smoking rate (r = 0.139, p = 0.041) and whole milk amount (r = 0.131, p = 0.028) showed positive associations with CVD. When the amount of polyamines was added to the analyses as a covariate, GDP (r = -0.864, p < 0.001) and polyamines (r = -0.355, p = 0.007) showed negative associations with CVD, while smoking rate (r = 0.183, p = 0.006) and whole milk (r = 0.113, p = 0.041) showed positive associations with CVD. The inverse association between dietary polyamines and CVD mortality revealed by the present study merits further evaluation. PMID:23121753
Food polyamine and cardiovascular disease--an epidemiological study.
Soda, Kuniyasu; Kano, Yoshihiko; Chiba, Fumihiro
2012-09-28
The purpose of this study was to examine the contribution of dietary polyamines toward preventing cardiovascular disease (CVD). Age-standardized mortality rates as well as other relevant information regarding individuals with CVD were gathered from the World Health Organization and the International Monetary Fund in 48 different European and other Western countries. Food supply data were collected from the database of the United Nations, and the amount of dietary polyamines was estimated by using polyamine concentrations in foods from published sources. The association between CVD mortality and the amount of polyamines was investigated by performing a series of multiple linear regression analyses. Analyses using factors known to modulate the risk of CVD including: Gross Domestic Product (GDP) (standardized regression coefficient (r) = -0.786, p < 0.001) and the amount of fruits, vegetable, nuts, and beans (r = -0.183, p = 0.001) but not including polyamines, showed negative associations with CVD, while smoking rate (r = 0.139, p = 0.041) and whole milk amount (r = 0.131, p = 0.028) showed positive associations with CVD. When the amount of polyamines was added to the analyses as a covariate, GDP (r = -0.864, p < 0.001) and polyamines (r = -0.355, p = 0.007) showed negative associations with CVD, while smoking rate (r = 0.183, p = 0.006) and whole milk (r = 0.113, p = 0.041) showed positive associations with CVD. The inverse association between dietary polyamines and CVD mortality revealed by the present study merits further evaluation.
Methodology for the development of normative data for Spanish-speaking pediatric populations.
Rivera, D; Arango-Lasprilla, J C
2017-01-01
To describe the methodology utilized to calculate reliability and the generation of norms for 10 neuropsychological tests for children in Spanish-speaking countries. The study sample consisted of over 4,373 healthy children from nine countries in Latin America (Chile, Cuba, Ecuador, Guatemala, Honduras, Mexico, Paraguay, Peru, and Puerto Rico) and Spain. Inclusion criteria for all countries were to have between 6 to 17 years of age, an Intelligence Quotient of≥80 on the Test of Non-Verbal Intelligence (TONI-2), and score of <19 on the Children's Depression Inventory. Participants completed 10 neuropsychological tests. Reliability and norms were calculated for all tests. Test-retest analysis showed excellent or good- reliability on all tests (r's>0.55; p's<0.001) except M-WCST perseverative errors whose coefficient magnitude was fair. All scores were normed using multiple linear regressions and standard deviations of residual values. Age, age2, sex, and mean level of parental education (MLPE) were included as predictors in the models by country. The non-significant variables (p > 0.05) were removed and the analysis were run again. This is the largest Spanish-speaking children and adolescents normative study in the world. For the generation of normative data, the method based on linear regression models and the standard deviation of residual values was used. This method allows determination of the specific variables that predict test scores, helps identify and control for collinearity of predictive variables, and generates continuous and more reliable norms than those of traditional methods.
Chaitoff, Alexander; Sun, Bob; Windover, Amy; Bokar, Daniel; Featherall, Joseph; Rothberg, Michael B; Misra-Hebert, Anita D
2017-10-01
To identify correlates of physician empathy and determine whether physician empathy is related to standardized measures of patient experience. Demographic, professional, and empathy data were collected during 2013-2015 from Cleveland Clinic Health System physicians prior to participation in mandatory communication skills training. Empathy was assessed using the Jefferson Scale of Empathy. Data were also collected for seven measures (six provider communication items and overall provider rating) from the visit-specific and 12-month Consumer Assessment of Healthcare Providers and Systems Clinician and Group (CG-CAHPS) surveys. Associations between empathy and provider characteristics were assessed by linear regression, ANOVA, or a nonparametric equivalent. Significant predictors were included in a multivariable linear regression model. Correlations between empathy and CG-CAHPS scores were assessed using Spearman rank correlation coefficients. In bivariable analysis (n = 847 physicians), female sex (P < .001), specialty (P < .01), outpatient practice setting (P < .05), and DO degree (P < .05) were associated with higher empathy scores. In multivariable analysis, female sex (P < .001) and four specialties (obstetrics-gynecology, pediatrics, psychiatry, and thoracic surgery; all P < .05) were significantly associated with higher empathy scores. Of the seven CG-CAHPS measures, scores on five for the 583 physicians with visit-specific data and on three for the 277 physicians with 12-month data were positively correlated with empathy. Specialty and sex were independently associated with physician empathy. Empathy was correlated with higher scores on multiple CG-CAHPS items, suggesting improving physician empathy might play a role in improving patient experience.
Oshio, Takashi; Inoue, Akiomi; Tsutsumi, Akizumi
2018-05-25
We examined the associations among job demands and resources, work engagement, and psychological distress, adjusted for time-invariant individual attributes. We used data from a Japanese occupational cohort survey, which included 18,702 observations of 7,843 individuals. We investigated how work engagement, measured by the Utrecht Work Engagement Scale, was associated with key aspects of job demands and resources, using fixed-effects regression models. We further estimated the fixed-effects models to assess how work engagement moderated the association between each job characteristic and psychological distress as measured by Kessler 6 scores. The fixed-effects models showed that work engagement was positively associated with job resources, as did pooled cross-sectional and prospective cohort models. Specifically, the standardized regression coefficients (β) were 0.148 and 0.120 for extrinsic reward and decision latitude, respectively, compared to -0.159 and 0.020 for role ambiguity and workload and time pressure, respectively (p < 0.001 for all associations). Work engagement modestly moderated the associations of psychological distress with workload and time pressure and extrinsic reward; a one-standard deviation increase in work engagement moderated their associations by 19.2% (p < 0.001) and 11.3% (p = 0.034), respectively. Work engagement was associated with job demands and resources, which is in line with the theoretical prediction of the job demands-resources model, even after controlling for time-invariant individual attributes. Work engagement moderated the association between selected aspects of job demands and resources and psychological distress.
Dos Santos Augusto, Amanda; Barsanelli, Paulo Lopes; Pereira, Fabiola Manhas Verbi; Pereira-Filho, Edenir Rodrigues
2017-04-01
This study describes the application of laser-induced breakdown spectroscopy (LIBS) for the direct determination of Ca, K and Mg in powdered milk and solid dietary supplements. The following two calibration strategies were applied: (i) use of the samples to calculate calibration models (milk) and (ii) use of sample mixtures (supplements) to obtain a calibration curve. In both cases, reference values obtained from inductively coupled plasma optical emission spectroscopy (ICP OES) after acid digestion were used. The emission line selection from LIBS spectra was accomplished by analysing the regression coefficients of partial least squares (PLS) regression models, and wavelengths of 534.947, 766.490 and 285.213nm were chosen for Ca, K and Mg, respectively. In the case of the determination of Ca in supplements, it was necessary to perform a dilution (10-fold) of the standards and samples to minimize matrix interference. The average accuracy for powdered milk ranged from 60% to 168% for Ca, 77% to 152% for K and 76% to 131% for Mg. In the case of dietary supplements, standard error of prediction (SEP) varied from 295 (Mg) to 3782mgkg -1 (Ca). The proposed method presented an analytical frequency of around 60 samples per hour and the step of sample manipulation was drastically reduced, with no generation of toxic chemical residues. Copyright © 2017 Elsevier Ltd. All rights reserved.
Urinary Angiotensinogen and Renin Excretion are Associated with Chronic Kidney Disease.
Juretzko, Annett; Steinbach, Antje; Hannemann, Anke; Endlich, Karlhans; Endlich, Nicole; Friedrich, Nele; Lendeckel, Uwe; Stracke, Sylvia; Rettig, Rainer
2017-01-01
Several studies sought to identify new biomarkers for chronic kidney disease (CKD). As the renal renin-angiotensin system is activated in CKD, urinary angiotensinogen or renin excretion may be suitable candidates. We tested whether urinary angiotensinogen or renin excretion is elevated in CKD and whether these parameters are associated with estimated glomerular filtration rate (eGFR). We further tested whether urinary angiotensinogen or renin excretion may convey additional information beyond that provided by albuminuria. We measured urinary and plasma angiotensinogen, renin, albumin and creatinine in 177 CKD patients from the Greifswald Approach to Individualized Medicine project and in 283 healthy controls from the Study of Health in Pomerania. The urinary excretion of specific proteins is given as protein-to-creatinine ratio. Receiver operating characteristic (ROC) curves, spearman correlation coefficients and linear regression models were calculated. Urinary angiotensinogen [2,511 (196-31,909) vs. 18.6 (8.3-44.0) pmol/g, *P<0.01] and renin excretion [0.311 (0.135-1.155) vs. 0.069 (0.045-0.148) pmol/g, *P<0.01] were significantly higher in CKD patients than in healthy controls. The area under the ROC curve was significantly larger when urinary angiotensinogen, renin and albumin excretion were combined than with urinary albumin excretion alone. Urinary angiotensinogen (ß-coefficient -2.405, standard error 0.117, P<0.01) and renin excretion (ß-coefficient -0.793, standard error 0.061, P<0.01) were inversely associated with eGFR. Adjustment for albuminuria, age, sex, systolic blood pressure and body mass index did not significantly affect the results. Urinary angiotensinogen and renin excretion are elevated in CKD patients. Both parameters are negatively associated with eGFR and these associations are independent of urinary albumin excretion. In CKD patients urinary angiotensinogen and renin excretion may convey additional information beyond that provided by albuminuria. © 2017 The Author(s)Published by S. Karger AG, Basel.
Large signal-to-noise ratio quantification in MLE for ARARMAX models
NASA Astrophysics Data System (ADS)
Zou, Yiqun; Tang, Xiafei
2014-06-01
It has been shown that closed-loop linear system identification by indirect method can be generally transferred to open-loop ARARMAX (AutoRegressive AutoRegressive Moving Average with eXogenous input) estimation. For such models, the gradient-related optimisation with large enough signal-to-noise ratio (SNR) can avoid the potential local convergence in maximum likelihood estimation. To ease the application of this condition, the threshold SNR needs to be quantified. In this paper, we build the amplitude coefficient which is an equivalence to the SNR and prove the finiteness of the threshold amplitude coefficient within the stability region. The quantification of threshold is achieved by the minimisation of an elaborately designed multi-variable cost function which unifies all the restrictions on the amplitude coefficient. The corresponding algorithm based on two sets of physically realisable system input-output data details the minimisation and also points out how to use the gradient-related method to estimate ARARMAX parameters when local minimum is present as the SNR is small. Then, the algorithm is tested on a theoretical AutoRegressive Moving Average with eXogenous input model for the derivation of the threshold and a gas turbine engine real system for model identification, respectively. Finally, the graphical validation of threshold on a two-dimensional plot is discussed.
Effects of Medical Insurance on the Health Status and Life Satisfaction of the Elderly
GU, Liubao; FENG, Huihui; JIN, Jian
2017-01-01
Background: Population aging has become increasingly serious in China. The demand for medical insurance of the elderly is increasing, and their health status and life satisfaction are becoming significant issues. This study investigates the effects of medical insurance on the health status and life satisfaction of the elderly. Methods: The national baseline survey data of the China Health and Retirement Longitudinal Survey in 2013 were adopted. The Ordered Probit Model was established. The effects of the medical insurance for urban employees, medical insurance for urban residents, and new rural cooperative medical insurance on the health status and life satisfaction of the elderly were investigated. Results: Medical insurance could facilitate the improvement of the health status and life satisfaction of the elderly. Accordingly, the health status and life satisfaction of the elderly who have medical insurance for urban residents improved significantly. The regression coefficients were 0.348 and 0.307. The corresponding regression coefficients of the medical insurance for urban employees were 0.189 and 0.236. The regression coefficients of the new rural cooperative medical insurance were 0.170 and 0.188. Conclusion: Medical insurance can significantly improve the health status and life satisfaction of the elderly. This development is of immense significance for the formulation of equal medical security. PMID:29026784
Periodontal disease in children and adolescents with type 1 diabetes in Serbia.
Dakovic, Dragana; Pavlovic, Milos D
2008-06-01
The purpose of this study was to evaluate periodontal health in young patients with type 1 diabetes mellitus in Serbia. Periodontal disease was clinically assessed and compared in 187 children and adolescents (6 to 18 years of age) with type 1 diabetes mellitus and 178 control subjects without diabetes. Children and adolescents with type 1 diabetes mellitus had significantly more plaque, gingival inflammation, and periodontal destruction than control subjects. The main risk factors for periodontitis were diabetes (odds ratio [OR] = 2.78; 95% confidence interval [CI]: 1.42 to 5.44), bleeding/plaque ratio (OR = 1.25; 95% CI: 1.06 to 1.48), and age (OR = 1.10; 95% CI: 1.01 to 1.21). In case subjects, the number of teeth affected by periodontal destruction was associated with mean hemoglobin A1c (regression coefficient 0.17; P = 0.026), duration of diabetes (regression coefficient 0.19; P = 0.021), and bleeding/plaque ratio (regression coefficient 0.17; P = 0.021). Compared to children and adolescents without diabetes, periodontal disease is more prevalent and widespread in children and adolescents with type 1 diabetes mellitus and depends on the duration of disease, metabolic control, and the severity of gingival inflammation. Gingival inflammation in young patients with diabetes is more evident and more often results in periodontal destruction.
Mohr, David C; Eaton, Jennifer Lipkowitz; McPhaul, Kathleen M; Hodgson, Michael J
2015-04-22
We examined relationships between employee safety climate and patient safety culture. Because employee safety may be a precondition for the development of patient safety, we hypothesized that employee safety culture would be strongly and positively related to patient safety culture. An employee safety climate survey was administered in 2010 and assessed employees' views and experiences of safety for employees. The patient safety survey administered in 2011 assessed the safety culture for patients. We performed Pearson correlations and multiple regression analysis to examine the relationships between a composite measure of employee safety with subdimensions of patient safety culture. The regression models controlled for size, geographic characteristics, and teaching affiliation. Analyses were conducted at the group level using data from 132 medical centers. Higher employee safety climate composite scores were positively associated with all 9 patient safety culture measures examined. Standardized multivariate regression coefficients ranged from 0.44 to 0.64. Medical facilities where staff have more positive perceptions of health care workplace safety climate tended to have more positive assessments of patient safety culture. This suggests that patient safety culture and employee safety climate could be mutually reinforcing, such that investments and improvements in one domain positively impacts the other. Further research is needed to better understand the nexus between health care employee and patient safety to generalize and act upon findings.
Mandel, Micha; Gauthier, Susan A; Guttmann, Charles R G; Weiner, Howard L; Betensky, Rebecca A
2007-12-01
The expanded disability status scale (EDSS) is an ordinal score that measures progression in multiple sclerosis (MS). Progression is defined as reaching EDSS of a certain level (absolute progression) or increasing of one point of EDSS (relative progression). Survival methods for time to progression are not adequate for such data since they do not exploit the EDSS level at the end of follow-up. Instead, we suggest a Markov transitional model applicable for repeated categorical or ordinal data. This approach enables derivation of covariate-specific survival curves, obtained after estimation of the regression coefficients and manipulations of the resulting transition matrix. Large sample theory and resampling methods are employed to derive pointwise confidence intervals, which perform well in simulation. Methods for generating survival curves for time to EDSS of a certain level, time to increase of EDSS of at least one point, and time to two consecutive visits with EDSS greater than three are described explicitly. The regression models described are easily implemented using standard software packages. Survival curves are obtained from the regression results using packages that support simple matrix calculation. We present and demonstrate our method on data collected at the Partners MS center in Boston, MA. We apply our approach to progression defined by time to two consecutive visits with EDSS greater than three, and calculate crude (without covariates) and covariate-specific curves.
Gianola, Daniel; Fariello, Maria I; Naya, Hugo; Schön, Chris-Carolin
2016-10-13
Standard genome-wide association studies (GWAS) scan for relationships between each of p molecular markers and a continuously distributed target trait. Typically, a marker-based matrix of genomic similarities among individuals ( G: ) is constructed, to account more properly for the covariance structure in the linear regression model used. We show that the generalized least-squares estimator of the regression of phenotype on one or on m markers is invariant with respect to whether or not the marker(s) tested is(are) used for building G,: provided variance components are unaffected by exclusion of such marker(s) from G: The result is arrived at by using a matrix expression such that one can find many inverses of genomic relationship, or of phenotypic covariance matrices, stemming from removing markers tested as fixed, but carrying out a single inversion. When eigenvectors of the genomic relationship matrix are used as regressors with fixed regression coefficients, e.g., to account for population stratification, their removal from G: does matter. Removal of eigenvectors from G: can have a noticeable effect on estimates of genomic and residual variances, so caution is needed. Concepts were illustrated using genomic data on 599 wheat inbred lines, with grain yield as target trait, and on close to 200 Arabidopsis thaliana accessions. Copyright © 2016 Gianola et al.
Application of near-infrared spectroscopy for the rapid quality assessment of Radix Paeoniae Rubra
NASA Astrophysics Data System (ADS)
Zhan, Hao; Fang, Jing; Tang, Liying; Yang, Hongjun; Li, Hua; Wang, Zhuju; Yang, Bin; Wu, Hongwei; Fu, Meihong
2017-08-01
Near-infrared (NIR) spectroscopy with multivariate analysis was used to quantify gallic acid, catechin, albiflorin, and paeoniflorin in Radix Paeoniae Rubra, and the feasibility to classify the samples originating from different areas was investigated. A new high-performance liquid chromatography method was developed and validated to analyze gallic acid, catechin, albiflorin, and paeoniflorin in Radix Paeoniae Rubra as the reference. Partial least squares (PLS), principal component regression (PCR), and stepwise multivariate linear regression (SMLR) were performed to calibrate the regression model. Different data pretreatments such as derivatives (1st and 2nd), multiplicative scatter correction, standard normal variate, Savitzky-Golay filter, and Norris derivative filter were applied to remove the systematic errors. The performance of the model was evaluated according to the root mean square of calibration (RMSEC), root mean square error of prediction (RMSEP), root mean square error of cross-validation (RMSECV), and correlation coefficient (r). The results show that compared to PCR and SMLR, PLS had a lower RMSEC, RMSECV, and RMSEP and higher r for all the four analytes. PLS coupled with proper pretreatments showed good performance in both the fitting and predicting results. Furthermore, the original areas of Radix Paeoniae Rubra samples were partly distinguished by principal component analysis. This study shows that NIR with PLS is a reliable, inexpensive, and rapid tool for the quality assessment of Radix Paeoniae Rubra.
Functional brain networks reconstruction using group sparsity-regularized learning.
Zhao, Qinghua; Li, Will X Y; Jiang, Xi; Lv, Jinglei; Lu, Jianfeng; Liu, Tianming
2018-06-01
Investigating functional brain networks and patterns using sparse representation of fMRI data has received significant interests in the neuroimaging community. It has been reported that sparse representation is effective in reconstructing concurrent and interactive functional brain networks. To date, most of data-driven network reconstruction approaches rarely take consideration of anatomical structures, which are the substrate of brain function. Furthermore, it has been rarely explored whether structured sparse representation with anatomical guidance could facilitate functional networks reconstruction. To address this problem, in this paper, we propose to reconstruct brain networks utilizing the structure guided group sparse regression (S2GSR) in which 116 anatomical regions from the AAL template, as prior knowledge, are employed to guide the network reconstruction when performing sparse representation of whole-brain fMRI data. Specifically, we extract fMRI signals from standard space aligned with the AAL template. Then by learning a global over-complete dictionary, with the learned dictionary as a set of features (regressors), the group structured regression employs anatomical structures as group information to regress whole brain signals. Finally, the decomposition coefficients matrix is mapped back to the brain volume to represent functional brain networks and patterns. We use the publicly available Human Connectome Project (HCP) Q1 dataset as the test bed, and the experimental results indicate that the proposed anatomically guided structure sparse representation is effective in reconstructing concurrent functional brain networks.
Choline in anxiety and depression: the Hordaland Health Study.
Bjelland, Ingvar; Tell, Grethe S; Vollset, Stein E; Konstantinova, Svetlana; Ueland, Per M
2009-10-01
Despite its importance in the central nervous system as a precursor for acetylcholine and membrane phosphatidylcholine, the role of choline in mental illness has been little studied. We examined the cross-sectional association between plasma choline concentrations and scores of anxiety and depression symptoms in a general population sample. We studied a subsample (n = 5918) of the Hordaland Health Study, including both sexes and 2 age groups of 46-49 and 70-74 y who had valid information on plasma choline concentrations and symptoms of anxiety and depression measured by the Hospital Anxiety and Depression Scale--the latter 2 as continuous measures and dichotomized at a score > or =8 for both subscales. The lowest choline quintile was significantly associated with high anxiety levels (odds ratio: 1.33; 95% CI: 1.06, 1.69) in the fully adjusted (age group, sex, time since last meal, educational level, and smoking habits) logistic regression model. Also, the trend test in the anxiety model was significant (P = 0.007). In the equivalent fully adjusted linear regression model, a significant inverse association was found between choline quintiles and anxiety levels (standardized regression coefficient = -0.027, P = 0.045). We found no significant associations in the corresponding analyses of the relation between plasma choline and depression symptoms. In this large population-based study, choline concentrations were negatively associated with anxiety symptoms but not with depression symptoms.
McDonald, Jasmine A.; Terry, Mary Beth; Tehranifar, Parisa
2013-01-01
Purpose Most studies of perceived discrimination have been cross-sectional and focused primarily on mental rather than physical health conditions. We examined the associations of perceived racial and gender discrimination reported in adulthood with early life factors and self-reported physician-diagnosis of chronic physical health conditions. Methods We used data from a racially diverse birth cohort of U.S. women (N=168, average age=41 years) with prospectively collected early life data (e.g., parental socioeconomic factors) and adult reported data on perceived discrimination, physical health conditions, and relevant risk factors. We performed modified robust Poisson regression due to the high prevalence of the outcomes. Results Fifty-percent of participants reported racial and 39% reported gender discrimination. Early life factors did not have strong associations with perceived discrimination. In adjusted regression models, participants reporting at least three experiences of gender or racial discrimination had a 38% increased risk of having at least one physical health conditions (RR=1.38, 95% CI: 1.01-1.87). Using standardized regression coefficients, the magnitude of the association of having physical health conditions was larger for perceived discrimination than for being overweight or obese. Conclusion Our results suggest a substantial chronic disease burden associated with perceived discrimination, which may exceed the impact of established risk factors for poor physical health. PMID:24345610
Tu, Yu-Kang; Krämer, Nicole; Lee, Wen-Chung
2012-07-01
In the analysis of trends in health outcomes, an ongoing issue is how to separate and estimate the effects of age, period, and cohort. As these 3 variables are perfectly collinear by definition, regression coefficients in a general linear model are not unique. In this tutorial, we review why identification is a problem, and how this problem may be tackled using partial least squares and principal components regression analyses. Both methods produce regression coefficients that fulfill the same collinearity constraint as the variables age, period, and cohort. We show that, because the constraint imposed by partial least squares and principal components regression is inherent in the mathematical relation among the 3 variables, this leads to more interpretable results. We use one dataset from a Taiwanese health-screening program to illustrate how to use partial least squares regression to analyze the trends in body heights with 3 continuous variables for age, period, and cohort. We then use another dataset of hepatocellular carcinoma mortality rates for Taiwanese men to illustrate how to use partial least squares regression to analyze tables with aggregated data. We use the second dataset to show the relation between the intrinsic estimator, a recently proposed method for the age-period-cohort analysis, and partial least squares regression. We also show that the inclusion of all indicator variables provides a more consistent approach. R code for our analyses is provided in the eAppendix.
No rationale for 1 variable per 10 events criterion for binary logistic regression analysis.
van Smeden, Maarten; de Groot, Joris A H; Moons, Karel G M; Collins, Gary S; Altman, Douglas G; Eijkemans, Marinus J C; Reitsma, Johannes B
2016-11-24
Ten events per variable (EPV) is a widely advocated minimal criterion for sample size considerations in logistic regression analysis. Of three previous simulation studies that examined this minimal EPV criterion only one supports the use of a minimum of 10 EPV. In this paper, we examine the reasons for substantial differences between these extensive simulation studies. The current study uses Monte Carlo simulations to evaluate small sample bias, coverage of confidence intervals and mean square error of logit coefficients. Logistic regression models fitted by maximum likelihood and a modified estimation procedure, known as Firth's correction, are compared. The results show that besides EPV, the problems associated with low EPV depend on other factors such as the total sample size. It is also demonstrated that simulation results can be dominated by even a few simulated data sets for which the prediction of the outcome by the covariates is perfect ('separation'). We reveal that different approaches for identifying and handling separation leads to substantially different simulation results. We further show that Firth's correction can be used to improve the accuracy of regression coefficients and alleviate the problems associated with separation. The current evidence supporting EPV rules for binary logistic regression is weak. Given our findings, there is an urgent need for new research to provide guidance for supporting sample size considerations for binary logistic regression analysis.
Analysis of oscillatory motion of a light airplane at high values of lift coefficient
NASA Technical Reports Server (NTRS)
Batterson, J. G.
1983-01-01
A modified stepwise regression is applied to flight data from a light research air-plane operating at high angles at attack. The well-known phenomenon referred to as buckling or porpoising is analyzed and modeled using both power series and spline expansions of the aerodynamic force and moment coefficients associated with the longitudinal equations of motion.
ERIC Educational Resources Information Center
Longford, Nicholas T.
Operational procedures for the Graduate Record Examinations Validity Study Service are reviewed, with emphasis on the problem of frequent occurrence of negative coefficients in the fitted within-department regressions obtained by the empirical Bayes method of H. I. Braun and D. Jones (1985). Several alterations of the operational procedures are…
Naff, R.L.
1998-01-01
The late-time macrodispersion coefficients are obtained for the case of flow in the presence of a small-scale deterministic transient in a three-dimensional anisotropic, heterogeneous medium. The transient is assumed to affect only the velocity component transverse to the mean flow direction and to take the form of a periodic function. For the case of a highly stratified medium, these late-time macrodispersion coefficients behave largely as the standard coefficients used in the transport equation. Only in the event that the medium is isotropic is it probable that significant deviations from the standard coefficients would occur.
Advanced colorectal neoplasia risk stratification by penalized logistic regression.
Lin, Yunzhi; Yu, Menggang; Wang, Sijian; Chappell, Richard; Imperiale, Thomas F
2016-08-01
Colorectal cancer is the second leading cause of death from cancer in the United States. To facilitate the efficiency of colorectal cancer screening, there is a need to stratify risk for colorectal cancer among the 90% of US residents who are considered "average risk." In this article, we investigate such risk stratification rules for advanced colorectal neoplasia (colorectal cancer and advanced, precancerous polyps). We use a recently completed large cohort study of subjects who underwent a first screening colonoscopy. Logistic regression models have been used in the literature to estimate the risk of advanced colorectal neoplasia based on quantifiable risk factors. However, logistic regression may be prone to overfitting and instability in variable selection. Since most of the risk factors in our study have several categories, it was tempting to collapse these categories into fewer risk groups. We propose a penalized logistic regression method that automatically and simultaneously selects variables, groups categories, and estimates their coefficients by penalizing the [Formula: see text]-norm of both the coefficients and their differences. Hence, it encourages sparsity in the categories, i.e. grouping of the categories, and sparsity in the variables, i.e. variable selection. We apply the penalized logistic regression method to our data. The important variables are selected, with close categories simultaneously grouped, by penalized regression models with and without the interactions terms. The models are validated with 10-fold cross-validation. The receiver operating characteristic curves of the penalized regression models dominate the receiver operating characteristic curve of naive logistic regressions, indicating a superior discriminative performance. © The Author(s) 2013.
2010-01-01
Background Whilst patellofemoral pain is one of the most common musculoskeletal disorders presenting to orthopaedic clinics, sports clinics, and general practices, factors contributing to its development in the absence of a defined arthropathy, such as osteoarthritis (OA), are unclear. The aim of this cross-sectional study was to describe the relationships between parameters of patellofemoral geometry (patella inclination, sulcus angle and patella height) and knee pain and patella cartilage volume. Methods 240 community-based adults aged 25-60 years were recruited to take part in a study of obesity and musculoskeletal health. Magnetic resonance imaging (MRI) of the dominant knee was used to determine the lateral condyle-patella angle, sulcus angle, and Insall-Salvati ratio, as well as patella cartilage and bone volumes. Pain was assessed by the Western Ontario and McMaster University Osteoarthritis Index (WOMAC) VA pain subscale. Results Increased lateral condyle-patella angle (increased medial patella inclination) was associated with a reduction in WOMAC pain score (Regression coefficient -1.57, 95% CI -3.05, -0.09) and increased medial patella cartilage volume (Regression coefficient 51.38 mm3, 95% CI 1.68, 101.08 mm3). Higher riding patella as indicated by increased Insall-Salvati ratio was associated with decreased medial patella cartilage volume (Regression coefficient -3187 mm3, 95% CI -5510, -864 mm3). There was a trend for increased lateral patella cartilage volume associated with increased (shallower) sulcus angle (Regression coefficient 43.27 mm3, 95% CI -2.43, 88.98 mm3). Conclusion These results suggest both symptomatic and structural benefits associated with a more medially inclined patella while a high-riding patella may be detrimental to patella cartilage. This provides additional theoretical support for the current use of corrective strategies for patella malalignment that are aimed at medial patella translation, although longitudinal studies will be needed to further substantiate this. PMID:20459700
Tanamas, Stephanie K; Teichtahl, Andrew J; Wluka, Anita E; Wang, Yuanyuan; Davies-Tuck, Miranda; Urquhart, Donna M; Jones, Graeme; Cicuttini, Flavia M
2010-05-10
Whilst patellofemoral pain is one of the most common musculoskeletal disorders presenting to orthopaedic clinics, sports clinics, and general practices, factors contributing to its development in the absence of a defined arthropathy, such as osteoarthritis (OA), are unclear.The aim of this cross-sectional study was to describe the relationships between parameters of patellofemoral geometry (patella inclination, sulcus angle and patella height) and knee pain and patella cartilage volume. 240 community-based adults aged 25-60 years were recruited to take part in a study of obesity and musculoskeletal health. Magnetic resonance imaging (MRI) of the dominant knee was used to determine the lateral condyle-patella angle, sulcus angle, and Insall-Salvati ratio, as well as patella cartilage and bone volumes. Pain was assessed by the Western Ontario and McMaster University Osteoarthritis Index (WOMAC) VA pain subscale. Increased lateral condyle-patella angle (increased medial patella inclination) was associated with a reduction in WOMAC pain score (Regression coefficient -1.57, 95% CI -3.05, -0.09) and increased medial patella cartilage volume (Regression coefficient 51.38 mm3, 95% CI 1.68, 101.08 mm3). Higher riding patella as indicated by increased Insall-Salvati ratio was associated with decreased medial patella cartilage volume (Regression coefficient -3187 mm3, 95% CI -5510, -864 mm3). There was a trend for increased lateral patella cartilage volume associated with increased (shallower) sulcus angle (Regression coefficient 43.27 mm3, 95% CI -2.43, 88.98 mm3). These results suggest both symptomatic and structural benefits associated with a more medially inclined patella while a high-riding patella may be detrimental to patella cartilage. This provides additional theoretical support for the current use of corrective strategies for patella malalignment that are aimed at medial patella translation, although longitudinal studies will be needed to further substantiate this.
Societal Value of Surgery for Facial Reanimation.
Su, Peiyi; Ishii, Lisa E; Joseph, Andrew; Nellis, Jason; Dey, Jacob; Bater, Kristin; Byrne, Patrick J; Boahene, Kofi D O; Ishii, Masaru
2017-03-01
Patients with facial paralysis are perceived negatively by society in a number of domains. Society's perception of the health utility of varying degrees of facial paralysis and the value society places on reconstructive surgery for facial reanimation need to be quantified. To measure health state utility of varying degrees of facial paralysis, willingness to pay (WTP) for a repair, and the subsequent value of facial reanimation surgery as perceived by society. This prospective observational study conducted in an academic tertiary referral center evaluated a group of 348 casual observers who viewed images of faces with unilateral facial paralysis of 3 severity levels (low, medium, and high) categorized by House-Brackmann grade. Structural equation modeling was performed to understand associations among health utility metrics, WTP, and facial perception domains. Data were collected from July 16 to September 26, 2015. Observer-rated (1) quality of life (QOL) using established health utility metrics (standard gamble, time trade-off, and a visual analog scale) and (2) their WTP for surgical repair. Among the 348 observers (248 women [71.3%]; 100 men [28.7%]; mean [SD] age, 29.3 [11.6] years), mixed-effects linear regression showed that WTP increased nonlinearly with increasing severity of paralysis. Participants were willing to pay $3487 (95% CI, $2362-$4961) to repair low-grade paralysis, $8571 (95% CI, $6401-$11 234) for medium-grade paralysis, and $20 431 (95% CI, $16 273-$25 317) for high-grade paralysis. The dominant factor affecting the participants' WTP was perceived QOL. Modeling showed that perceived QOL decreased with paralysis severity (regression coefficient, -0.004; 95% CI, -0.005 to -0.004; P < .001) and increased with attractiveness (regression coefficient, 0.002; 95% CI, 0.002 to 0.003; P < .001). Mean (SD) health utility scores calculated by the standard gamble metric for low- and high-grade paralysis were 0.98 (0.09) and 0.77 (0.25), respectively. Time trade-off and visual analog scale measures were highly correlated. We calculated mean (SD) WTP per quality-adjusted life-year, which ranged from $10 167 ($14 565) to $17 008 ($38 288) for low- to high-grade paralysis, respectively. Society perceives the repair of facial paralysis to be a high-value intervention. Societal WTP increases and perceived health state utility decreases with increasing House-Brackmann grade. This study demonstrates the usefulness of WTP as an objective measure to inform dimensions of disease severity and signal the value society places on proper facial function. NA.
Yousuf, Naveed; Violato, Claudio; Zuberi, Rukhsana W
2015-01-01
CONSTRUCT: Authentic standard setting methods will demonstrate high convergent validity evidence of their outcomes, that is, cutoff scores and pass/fail decisions, with most other methods when compared with each other. The objective structured clinical examination (OSCE) was established for valid, reliable, and objective assessment of clinical skills in health professions education. Various standard setting methods have been proposed to identify objective, reliable, and valid cutoff scores on OSCEs. These methods may identify different cutoff scores for the same examinations. Identification of valid and reliable cutoff scores for OSCEs remains an important issue and a challenge. Thirty OSCE stations administered at least twice in the years 2010-2012 to 393 medical students in Years 2 and 3 at Aga Khan University are included. Psychometric properties of the scores are determined. Cutoff scores and pass/fail decisions of Wijnen, Cohen, Mean-1.5SD, Mean-1SD, Angoff, borderline group and borderline regression (BL-R) methods are compared with each other and with three variants of cluster analysis using repeated measures analysis of variance and Cohen's kappa. The mean psychometric indices on the 30 OSCE stations are reliability coefficient = 0.76 (SD = 0.12); standard error of measurement = 5.66 (SD = 1.38); coefficient of determination = 0.47 (SD = 0.19), and intergrade discrimination = 7.19 (SD = 1.89). BL-R and Wijnen methods show the highest convergent validity evidence among other methods on the defined criteria. Angoff and Mean-1.5SD demonstrated least convergent validity evidence. The three cluster variants showed substantial convergent validity with borderline methods. Although there was a high level of convergent validity of Wijnen method, it lacks the theoretical strength to be used for competency-based assessments. The BL-R method is found to show the highest convergent validity evidences for OSCEs with other standard setting methods used in the present study. We also found that cluster analysis using mean method can be used for quality assurance of borderline methods. These findings should be further confirmed by studies in other settings.
Mortality Measures to Profile Hospital Performance for Patients With Septic Shock.
Walkey, Allan J; Shieh, Meng-Shiou; Liu, Vincent X; Lindenauer, Peter K
2018-04-30
Sepsis care is becoming a more common target for hospital performance measurement, but few studies have evaluated the acceptability of sepsis or septic shock mortality as a potential performance measure. In the absence of a gold standard to identify septic shock in claims data, we assessed agreement and stability of hospital mortality performance under different case definitions. Retrospective cohort study. U.S. acute care hospitals. Hospitalized with septic shock at admission, identified by either implicit diagnosis criteria (charges for antibiotics, cultures, and vasopressors) or by explicit International Classification of Diseases, 9th revision, codes. None. We used hierarchical logistic regression models to determine hospital risk-standardized mortality rates and hospital performance outliers. We assessed agreement in hospital mortality rankings when septic shock cases were identified by either explicit International Classification of Diseases, 9th revision, codes or implicit diagnosis criteria. Kappa statistics and intraclass correlation coefficients were used to assess agreement in hospital risk-standardized mortality and hospital outlier status, respectively. Fifty-six thousand six-hundred seventy-three patients in 308 hospitals fulfilled at least one case definition for septic shock, whereas 19,136 (33.8%) met both the explicit International Classification of Diseases, 9th revision, and implicit septic shock definition. Hospitals varied widely in risk-standardized septic shock mortality (interquartile range of implicit diagnosis mortality: 25.4-33.5%; International Classification of Diseases, 9th revision, diagnosis: 30.2-38.0%). The median absolute difference in hospital ranking between septic shock cohorts defined by International Classification of Diseases, 9th revision, versus implicit criteria was 37 places (interquartile range, 16-70), with an intraclass correlation coefficient of 0.72, p value of less than 0.001; agreement between case definitions for identification of outlier hospitals was moderate (kappa, 0.44 [95% CI, 0.30-0.58]). Risk-standardized septic shock mortality rates varied considerably between hospitals, suggesting that septic shock is an important performance target. However, efforts to profile hospital performance were sensitive to septic shock case definitions, suggesting that septic shock mortality is not currently ready for widespread use as a hospital quality measure.
Barnes, Allan J.; Scheidweiler, Karl B.; Huestis, Marilyn A.
2015-01-01
A sensitive and specific method for the quantification of 11-nor-9-carboxy-Δ9-tetrahydrocannabinol (THCCOOH) in oral fluid collected with the Quantisal and Oral-Eze devices was developed and fully validated. Extracted analytes were derivatized with hexafluoroisopropanol and trifluoroacetic anhydride and quantified by gas chromatography–tandem mass spectrometry with negative chemical ionization. Standard curves, using linear least-squares regression with 1/x2 weighting were linear from 10 to 1000 ng/L with coefficients of determination >0.998 for both collection devices. Bias was 89.2%–112.6%, total imprecision 4.0%–5.1% coefficient of variation, and extraction efficiency >79.8% across the linear range for Quantisal-collected specimens. Bias was 84.6%–109.3%, total imprecision 3.6%–7.3% coefficient of variation, and extraction efficiency >92.6% for specimens collected with the Oral-Eze device at all 3 quality control concentrations (10, 120, and 750 ng/L). This effective high-throughput method reduces analysis time by 9 minutes per sample compared with our current 2-dimensional gas chromatography–mass spectrometry method and extends the capability of quantifying this important oral fluid analyte to gas chromatography–tandem mass spectrometry. This method was applied to the analysis of oral fluid specimens collected from individuals participating in controlled cannabis studies and will be effective for distinguishing passive environmental contamination from active cannabis smoking. PMID:24622724
Maximum Entropy Principle for Transportation
NASA Astrophysics Data System (ADS)
Bilich, F.; DaSilva, R.
2008-11-01
In this work we deal with modeling of the transportation phenomenon for use in the transportation planning process and policy-impact studies. The model developed is based on the dependence concept, i.e., the notion that the probability of a trip starting at origin i is dependent on the probability of a trip ending at destination j given that the factors (such as travel time, cost, etc.) which affect travel between origin i and destination j assume some specific values. The derivation of the solution of the model employs the maximum entropy principle combining a priori multinomial distribution with a trip utility concept. This model is utilized to forecast trip distributions under a variety of policy changes and scenarios. The dependence coefficients are obtained from a regression equation where the functional form is derived based on conditional probability and perception of factors from experimental psychology. The dependence coefficients encode all the information that was previously encoded in the form of constraints. In addition, the dependence coefficients encode information that cannot be expressed in the form of constraints for practical reasons, namely, computational tractability. The equivalence between the standard formulation (i.e., objective function with constraints) and the dependence formulation (i.e., without constraints) is demonstrated. The parameters of the dependence-based trip-distribution model are estimated, and the model is also validated using commercial air travel data in the U.S. In addition, policy impact analyses (such as allowance of supersonic flights inside the U.S. and user surcharge at noise-impacted airports) on air travel are performed.
Burgers, Phillip; Alexander, David E
2012-01-01
For a century, researchers have used the standard lift coefficient C(L) to evaluate the lift, L, generated by fixed wings over an area S against dynamic pressure, ½ρv(2), where v is the effective velocity of the wing. Because the lift coefficient was developed initially for fixed wings in steady flow, its application to other lifting systems requires either simplifying assumptions or complex adjustments as is the case for flapping wings and rotating cylinders.This paper interprets the standard lift coefficient of a fixed wing slightly differently, as the work exerted by the wing on the surrounding flow field (L/ρ·S), compared against the total kinetic energy required for generating said lift, ½v(2). This reinterpreted coefficient, the normalized lift, is derived from the work-energy theorem and compares the lifting capabilities of dissimilar lift systems on a similar energy footing. The normalized lift is the same as the standard lift coefficient for fixed wings, but differs for wings with more complex motions; it also accounts for such complex motions explicitly and without complex modifications or adjustments. We compare the normalized lift with the previously-reported values of lift coefficient for a rotating cylinder in Magnus effect, a bat during hovering and forward flight, and a hovering dipteran.The maximum standard lift coefficient for a fixed wing without flaps in steady flow is around 1.5, yet for a rotating cylinder it may exceed 9.0, a value that implies that a rotating cylinder generates nearly 6 times the maximum lift of a wing. The maximum normalized lift for a rotating cylinder is 1.5. We suggest that the normalized lift can be used to evaluate propellers, rotors, flapping wings of animals and micro air vehicles, and underwater thrust-generating fins in the same way the lift coefficient is currently used to evaluate fixed wings.
Burgers, Phillip; Alexander, David E.
2012-01-01
For a century, researchers have used the standard lift coefficient CL to evaluate the lift, L, generated by fixed wings over an area S against dynamic pressure, ½ρv 2, where v is the effective velocity of the wing. Because the lift coefficient was developed initially for fixed wings in steady flow, its application to other lifting systems requires either simplifying assumptions or complex adjustments as is the case for flapping wings and rotating cylinders. This paper interprets the standard lift coefficient of a fixed wing slightly differently, as the work exerted by the wing on the surrounding flow field (L/ρ·S), compared against the total kinetic energy required for generating said lift, ½v2. This reinterpreted coefficient, the normalized lift, is derived from the work-energy theorem and compares the lifting capabilities of dissimilar lift systems on a similar energy footing. The normalized lift is the same as the standard lift coefficient for fixed wings, but differs for wings with more complex motions; it also accounts for such complex motions explicitly and without complex modifications or adjustments. We compare the normalized lift with the previously-reported values of lift coefficient for a rotating cylinder in Magnus effect, a bat during hovering and forward flight, and a hovering dipteran. The maximum standard lift coefficient for a fixed wing without flaps in steady flow is around 1.5, yet for a rotating cylinder it may exceed 9.0, a value that implies that a rotating cylinder generates nearly 6 times the maximum lift of a wing. The maximum normalized lift for a rotating cylinder is 1.5. We suggest that the normalized lift can be used to evaluate propellers, rotors, flapping wings of animals and micro air vehicles, and underwater thrust-generating fins in the same way the lift coefficient is currently used to evaluate fixed wings. PMID:22629326
ERIC Educational Resources Information Center
Vasu, Ellen Storey
1978-01-01
The effects of the violation of the assumption of normality in the conditional distributions of the dependent variable, coupled with the condition of multicollinearity upon the outcome of testing the hypothesis that the regression coefficient equals zero, are investigated via a Monte Carlo study. (Author/JKS)
ERIC Educational Resources Information Center
Marland, Eric; Bossé, Michael J.; Rhoads, Gregory
2018-01-01
Rounding is a necessary step in many mathematical processes. We are taught early in our education about significant figures and how to properly round a number. So when we are given a data set and asked to find a regression line, we are inclined to offer the line with rounded coefficients to reflect our model. However, the effects are not as…
Modeling maximum daily temperature using a varying coefficient regression model
Han Li; Xinwei Deng; Dong-Yum Kim; Eric P. Smith
2014-01-01
Relationships between stream water and air temperatures are often modeled using linear or nonlinear regression methods. Despite a strong relationship between water and air temperatures and a variety of models that are effective for data summarized on a weekly basis, such models did not yield consistently good predictions for summaries such as daily maximum temperature...
Expanded uncertainty estimation methodology in determining the sandy soils filtration coefficient
NASA Astrophysics Data System (ADS)
Rusanova, A. D.; Malaja, L. D.; Ivanov, R. N.; Gruzin, A. V.; Shalaj, V. V.
2018-04-01
The combined standard uncertainty estimation methodology in determining the sandy soils filtration coefficient has been developed. The laboratory researches were carried out which resulted in filtration coefficient determination and combined uncertainty estimation obtaining.
Hooper, Claudie; De Souto Barreto, Philipe; Payoux, Pierre; Salabert, Anne Sophie; Guyonnet, Sophie; Andrieu, Sandrine; Vellas, Bruno
2017-08-01
Omega-3 (n-3) and 6 (n-6) polyunsaturated fatty acids (PUFAs) have been associated with reduced cognitive decline in observational studies. Hence, we examined the cross-sectional associations between cortical β-amyloid (Aβ) and erythrocyte membrane PUFAs in 61 non-demented elderly individuals reporting subjective memory complaints from the Multidomain Alzheimer Preventive Trial placebo arm. Cortical-to-cerebellar standard uptake value ratios were obtained using [ 18 F] florbetapir positron emission tomography. Fatty acids were measured in erythrocyte membranes by gas chromatography. Associations were explored using adjusted multiple linear regression models and were considered significant at p ≤ 0.005 after correction for multiple testing (10 comparisons). We found no significant associations between cortical Aβ and erythrocyte membrane PUFAs. The associations closest to significance after adjustment were those between Aβ and erythrocyte membrane arachidonic acid (without apolipoprotein E status adjustment: B-coefficient, 0.03; CI, 0.01, 0.05; p = 0.02. Including Apolipoprotein E adjustment: B-coefficient, 0.03; CI, 0.00, 0.06; p = 0.04) and Aβ and erythrocyte membrane linoleic acid (without apolipoprotein E status adjustment: B-coefficient, -0.02; CI, -0.04, 0.00; p = 0.02. Including Apolipoprotein E adjustment: B-coefficient, -0.02; CI, -0.04, 0.00; p = 0.09). Furthermore, the association between Aβ and erythrocyte membrane arachidonic acid seemed to be specific to Apolipoprotein E ε4 non-carriers (B-coefficient 0.03, CI: 0.00, 0.06, p = 0.03, n = 36). In contrast, no association was found between Aβ and erythrocyte membrane linoleic acid in Apolipoprotein E ε4 stratified analysis. Investigating the relationships between Aβ and PUFAs longitudinally would provide further evidence as to whether fatty acids, particularly arachidonic acid and linoleic acid, might modulate cognition through Aβ-dependent mechanisms. © 2017 International Society for Neurochemistry.
NASA Astrophysics Data System (ADS)
Oliphant, Andrew J.; Stoy, Paul C.
2018-03-01
Photosynthesis is more efficient under diffuse than direct beam photosynthetically active radiation (PAR) per unit PAR, but diffuse PAR is infrequently measured at research sites. We examine four commonly used semiempirical models (Erbs et al., 1982, https://doi.org/10.1016/0038-092X(82)90302-4; Gu et al., 1999, https://doi.org/10.1029/1999JD901068; Roderick, 1999, https://doi.org/10.1016/S0168-1923(99)00028-3; Weiss & Norman, 1985, https://doi.org/10.1016/0168-1923(85)90020-6) that partition PAR into diffuse and direct beam components based on the negative relationship between atmospheric transparency and scattering of PAR. Radiation observations at 58 sites (140 site years) from the La Thuille FLUXNET data set were used for model validation and coefficient testing. All four models did a reasonable job of predicting the diffuse fraction of PAR (ϕ) at the 30 min timescale, with site median r2 values ranging between 0.85 and 0.87, model efficiency coefficients (MECs) between 0.62 and 0.69, and regression slopes within 10% of unity. Model residuals were not strongly correlated with astronomical or standard meteorological variables. We conclude that the Roderick (1999, https://doi.org/10.1016/S0168-1923(99)00028-3) and Gu et al. (1999, https://doi.org/10.1029/1999JD901068) models performed better overall than the two older models. Using the basic form of these models, the data set was used to find both individual site and universal model coefficients that optimized predictive accuracy. A new universal form of the model is presented in section 5 that increased site median MEC to 0.73. Site-specific model coefficients increased median MEC further to 0.78, indicating usefulness of local/regional training of coefficients to capture the local distributions of aerosols and cloud types.
Yadlapati, Ajay; Grogan, Tristan; Elashoff, David; Kelly, Robert B.
2013-01-01
Abstract: Using a novel noninvasive, visible-light optical diffusion oximeter (T-Stat VLS Tissue Oximeter; Spectros Corporation, Portola Valley, CA) to measure the tissue oxygen saturation (StO2) of the buccal mucosa, the correlation between StO2 and central venous oxygen saturation (ScvO2) was examined in children with congenital cyanotic heart disease undergoing a cardiac surgical procedure. Paired StO2 and serum ScvO2 measurements were obtained postoperatively and statistically analyzed for agreement and association. Thirteen children (nine male) participated in the study (age range, 4 days to 18 months). Surgeries included Glenn shunt procedures, Norwood procedures, unifocalization procedures with Blalock-Taussig shunt placement, a Kawashima/Glenn shunt procedure, a Blalock-Taussig shunt placement, and a modified Norwood procedure. A total of 45 paired StO2-ScvO2 measurements was obtained. Linear regression demonstrated a Pearson’s correlation of .58 (95% confidence interval [CI], .35–.75; p < .0001). The regression slope coefficient estimate was .95 (95% CI, .54–1.36) with an interclass correlation coefficient of .48 (95% CI, .22–.68). Below a clinically relevant average ScvO2 value, a receiver operator characteristic analysis yielded an area under the curve of .78. Statistical methods to control for repeatedly measuring the same subjects produced similar results. This study shows a moderate relationship and agreement between StO2 and ScvO2 measurements in pediatric patients with a history of congenital cyanotic heart disease undergoing a cardiac surgical procedure. This real-time monitoring device can act as a valuable adjunct to standard noninvasive monitoring in which serum ScvO2 sampling currently assists in the diagnosis of low cardiac output after pediatric cardiac surgery. PMID:23691783
Sun, Yi; Arning, Martin; Bochmann, Frank; Börger, Jutta; Heitmann, Thomas
2018-06-01
The Occupational Safety and Health Monitoring and Assessment Tool (OSH-MAT) is a practical instrument that is currently used in the German woodworking and metalworking industries to monitor safety conditions at workplaces. The 12-item scoring system has three subscales rating technical, organizational, and personnel-related conditions in a company. Each item has a rating value ranging from 1 to 9, with higher values indicating higher standard of safety conditions. The reliability of this instrument was evaluated in a cross-sectional survey among 128 companies and its validity among 30,514 companies. The inter-rater reliability of the instrument was examined independently and simultaneously by two well-trained safety engineers. Agreement between the double ratings was quantified by the intraclass correlation coefficient and absolute agreement of the rating values. The content validity of the OSH-MAT was evaluated by quantifying the association between OSH-MAT values and 5-year average injury rates by Poisson regression analysis adjusted for the size of the companies and industrial sectors. The construct validity of OSH-MAT was examined by principle component factor analysis. Our analysis indicated good to very good inter-rater reliability (intraclass correlation coefficient = 0.64-0.74) of OSH-MAT values with an absolute agreement of between 72% and 81%. Factor analysis identified three component subscales that met exactly the structure theory of this instrument. The Poisson regression analysis demonstrated a statistically significant exposure-response relationship between OSH-MAT values and the 5-year average injury rates. These analyses indicate that OSH-MAT is a valid and reliable instrument that can be used effectively to monitor safety conditions at workplaces.
Yin, Ji Yong; Huo, Jun Sheng; Ma, Xin Xin; Sun, Jing; Huang, Jian
2017-12-01
To research a protein chip method which can simultaneously quantitative detect β-Lactoglobulin (β-L) and Lactoferrin (Lf) at one time. Protein chip printer was used to print both anti-β-L antibodies and anti-Lf antibodies on each block of protein chip. And then an improved sandwich detection method was applied while the other two detecting antibodies for the two antigens were added in the block after they were mixed. The detection conditions of the quantitative detection for simultaneous measurement of β-L and Lf with protein chip were optimized and evaluated. Based on these detected conditions, two standard curves of the two proteins were simultaneously established on one protein chip. Finally, the new detection method was evaluated by using the analysis of precision and accuracy. By comparison experiment, mouse monoclonal antibodies of the two antigens were chosen as the printing probe. The concentrations of β-L and Lf probes were 0.5 mg/mL and 0.5 mg/mL, respectively, while the titers of detection antibodies both of β-L and Lf were 1:2,000. Intra- and inter-assay variability was between 4.88% and 38.33% for all tests. The regression coefficients of protein chip comparing with ELISA for β-L and Lf were better than 0.734, and both of the two regression coefficients were statistically significant (r = 0.734, t = 2.644, P = 0.038; and r = 0.774, t = 2.998, P = 0.024). A protein chip method of simultaneously quantitative detection for β-L and Lf has been established and this method is worthy in further application. Copyright © 2017 The Editorial Board of Biomedical and Environmental Sciences. Published by China CDC. All rights reserved.