variables multiple regression: Topics by Science.gov

Sample records for variables multiple regression

The Detection and Interpretation of Interaction Effects between Continuous Variables in Multiple Regression.

ERIC Educational Resources Information Center

Jaccard, James; And Others

1990-01-01

Issues in the detection and interpretation of interaction effects between quantitative variables in multiple regression analysis are discussed. Recent discussions associated with problems of multicollinearity are reviewed in the context of the conditional nature of multiple regression with product terms. (TJH)
Advanced statistics: linear regression, part II: multiple linear regression.

PubMed

Marill, Keith A

2004-01-01

The applications of simple linear regression in medical research are limited, because in most situations, there are multiple relevant predictor variables. Univariate statistical techniques such as simple linear regression use a single predictor variable, and they often may be mathematically correct but clinically misleading. Multiple linear regression is a mathematical technique used to model the relationship between multiple independent predictor variables and a single dependent outcome variable. It is used in medical research to model observational data, as well as in diagnostic and therapeutic studies in which the outcome is dependent on more than one factor. Although the technique generally is limited to data that can be expressed with a linear function, it benefits from a well-developed mathematical framework that yields unique solutions and exact confidence intervals for regression coefficients. Building on Part I of this series, this article acquaints the reader with some of the important concepts in multiple regression analysis. These include multicollinearity, interaction effects, and an expansion of the discussion of inference testing, leverage, and variable transformations to multivariate models. Examples from the first article in this series are expanded on using a primarily graphic, rather than mathematical, approach. The importance of the relationships among the predictor variables and the dependence of the multivariate model coefficients on the choice of these variables are stressed. Finally, concepts in regression model building are discussed.
Advanced statistics: linear regression, part I: simple linear regression.

PubMed

Marill, Keith A

2004-01-01

Simple linear regression is a mathematical technique used to model the relationship between a single independent predictor variable and a single dependent outcome variable. In this, the first of a two-part series exploring concepts in linear regression analysis, the four fundamental assumptions and the mechanics of simple linear regression are reviewed. The most common technique used to derive the regression line, the method of least squares, is described. The reader will be acquainted with other important concepts in simple linear regression, including: variable transformations, dummy variables, relationship to inference testing, and leverage. Simplified clinical examples with small datasets and graphic models are used to illustrate the points. This will provide a foundation for the second article in this series: a discussion of multiple linear regression, in which there are multiple predictor variables.
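
The method of least squares mentioned in this abstract can be sketched in a few lines. This is only an illustration under stated assumptions: the function name `fit_line` and the dose/response numbers are made up here and do not come from the article.

```python
def fit_line(x, y):
    """Return (intercept, slope) of the least-squares line through (x, y)."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sum of cross-products and sum of squares about the means.
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept, slope


# Hypothetical illustrative data (not from the article).
dose = [1.0, 2.0, 3.0, 4.0, 5.0]
response = [2.1, 3.9, 6.2, 7.8, 10.1]
intercept, slope = fit_line(dose, response)
print(f"response = {intercept:.2f} + {slope:.2f} * dose")
```

The slope is the ratio of the covariance of the two variables to the variance of the predictor, and the intercept forces the line through the point of means.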
How Variables Uncorrelated with the Dependent Variable Can Actually Make Excellent Predictors: The Important Suppressor Variable Case.

ERIC Educational Resources Information Center

Woolley, Kristin K.

Many researchers are unfamiliar with suppressor variables and how they operate in multiple regression analyses. This paper describes the role suppressor variables play in a multiple regression model and provides practical examples that explain how they can change research results. A variable that when added as another predictor increases the total…
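
A small constructed example may make the suppressor idea concrete. The data below are hypothetical and chosen so that the second predictor has exactly zero correlation with the outcome yet raises R-squared from 0.5 to 1; the helper names are assumptions for illustration, and the two-predictor R-squared formula is the standard identity in terms of pairwise correlations.

```python
from math import sqrt


def pearson(a, b):
    """Pearson correlation between two equal-length sequences."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    return cov / sqrt(var_a * var_b)


def r_squared_two_predictors(r_y1, r_y2, r_12):
    """R-squared of y on two predictors, from the three pairwise correlations."""
    return (r_y1 ** 2 + r_y2 ** 2 - 2 * r_y1 * r_y2 * r_12) / (1 - r_12 ** 2)


# Hypothetical data: x1 is the outcome contaminated by an irrelevant
# component; x2 measures only that irrelevant component.
signal = [1, -1, 1, -1]
contamination = [1, 1, -1, -1]
y = signal
x1 = [s + c for s, c in zip(signal, contamination)]
x2 = contamination

r_y1 = pearson(y, x1)
r_y2 = pearson(y, x2)   # zero: x2 alone predicts nothing
r_12 = pearson(x1, x2)

r2_x1_only = r_y1 ** 2
r2_both = r_squared_two_predictors(r_y1, r_y2, r_12)
print(f"r(y, x2) = {r_y2:.3f}")
print(f"R^2 with x1 only = {r2_x1_only:.3f}, with x1 and x2 = {r2_both:.3f}")
```

Here x2 improves prediction only because it removes the irrelevant variance in x1, which is the usual description of a classical suppressor.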
An improved multiple linear regression and data analysis computer program package

NASA Technical Reports Server (NTRS)

Sidik, S. M.

1972-01-01

NEWRAP, an improved version of a previous multiple linear regression program called RAPIER, CREDUC, and CRSPLT, allows for a complete regression analysis including cross plots of the independent and dependent variables, correlation coefficients, regression coefficients, analysis of variance tables, t-statistics and their probability levels, rejection of independent variables, plots of residuals against the independent and dependent variables, and a canonical reduction of quadratic response functions useful in optimum seeking experimentation. A major improvement over RAPIER is that all regression calculations are done in double precision arithmetic.
An Effect Size for Regression Predictors in Meta-Analysis

ERIC Educational Resources Information Center

Aloe, Ariel M.; Becker, Betsy Jane

2012-01-01

A new effect size representing the predictive power of an independent variable from a multiple regression model is presented. The index, denoted as r_sp, is the semipartial correlation of the predictor with the outcome of interest. This effect size can be computed when multiple predictor variables are included in the regression model…
RAWS II: A MULTIPLE REGRESSION ANALYSIS PROGRAM
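
As a hedged sketch of how a semipartial correlation can be recovered from commonly reported regression output, the two helpers below use standard identities: the squared semipartial equals the gain in R-squared when the predictor is added, and it can equivalently be obtained from the predictor's t statistic. The function names and the numbers are illustrative assumptions, not taken from the article.

```python
from math import sqrt


def semipartial_from_t(t, r2_full, n, p):
    """Semipartial correlation from a slope's t statistic.

    t       : t statistic of the predictor in the full model
    r2_full : R-squared of the full model
    n       : sample size
    p       : number of predictors in the full model
    """
    df_resid = n - p - 1
    return t * sqrt((1.0 - r2_full) / df_resid)


def semipartial_from_r2(r2_full, r2_reduced):
    """Magnitude of the semipartial correlation from the R-squared change.

    The sign is lost here; it matches the sign of the regression slope.
    """
    return sqrt(max(r2_full - r2_reduced, 0.0))


# Hypothetical reported values: t = 2.0, R^2 = 0.50, n = 53, p = 2.
r_sp = semipartial_from_t(2.0, 0.50, 53, 2)
print(f"r_sp from t statistic: {r_sp:.3f}")
print(f"r_sp from R^2 change:  {semipartial_from_r2(0.50, 0.46):.3f}")
```

Both routes agree because the squared t statistic equals the R-squared change divided by the residual variance proportion per degree of freedom.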

DTIC Science & Technology

This memorandum gives instructions for the use and operation of a revised version of RAWS, a multiple regression analysis program. The program...of preprocessed data, the directed retention of variable, listing of the matrix of the normal equations and its inverse, and the bypassing of the regression analysis to provide the input variable statistics only. (Author)
Variables Associated with Communicative Participation in People with Multiple Sclerosis: A Regression Analysis

ERIC Educational Resources Information Center

Baylor, Carolyn; Yorkston, Kathryn; Bamer, Alyssa; Britton, Deanna; Amtmann, Dagmar

2010-01-01

Purpose: To explore variables associated with self-reported communicative participation in a sample (n = 498) of community-dwelling adults with multiple sclerosis (MS). Method: A battery of questionnaires was administered online or on paper per participant preference. Data were analyzed using multiple linear backward stepwise regression. The…
The M Word: Multicollinearity in Multiple Regression.

ERIC Educational Resources Information Center

Morrow-Howell, Nancy

1994-01-01

Notes that existence of substantial correlation between two or more independent variables creates problems of multicollinearity in multiple regression. Discusses multicollinearity problem in social work research in which independent variables are usually intercorrelated. Clarifies problems created by multicollinearity, explains detection of…
Advanced Statistics for Exotic Animal Practitioners.

PubMed

Hodsoll, John; Hellier, Jennifer M; Ryan, Elizabeth G

2017-09-01

Correlation and regression assess the association between 2 or more variables. This article reviews the core knowledge needed to understand these analyses, moving from visual analysis in scatter plots through correlation, simple and multiple linear regression, and logistic regression. Correlation estimates the strength and direction of a relationship between 2 variables. Regression can be considered more general and quantifies the numerical relationships between an outcome and 1 or multiple variables in terms of a best-fit line, allowing predictions to be made. Each technique is discussed with examples and the statistical assumptions underlying their correct application. Copyright © 2017 Elsevier Inc. All rights reserved.
Categorical Variables in Multiple Regression: Some Cautions.

ERIC Educational Resources Information Center

O'Grady, Kevin E.; Medoff, Deborah R.

1988-01-01

Limitations of dummy coding and nonsense coding as methods of coding categorical variables for use as predictors in multiple regression analysis are discussed. The combination of these approaches often yields estimates and tests of significance that are not intended by researchers for inclusion in their models. (SLD)
Multiple regression for physiological data analysis: the problem of multicollinearity.

PubMed

Slinker, B K; Glantz, S A

1985-07-01

Multiple linear regression, in which several predictor variables are related to a response variable, is a powerful statistical tool for gaining quantitative insight into complex in vivo physiological systems. For these insights to be correct, all predictor variables must be uncorrelated. However, in many physiological experiments the predictor variables cannot be precisely controlled and thus change in parallel (i.e., they are highly correlated). There is a redundancy of information about the response, a situation called multicollinearity, that leads to numerical problems in estimating the parameters in regression equations; the parameters are often of incorrect magnitude or sign or have large standard errors. Although multicollinearity can be avoided with good experimental design, not all interesting physiological questions can be studied without encountering multicollinearity. In these cases various ad hoc procedures have been proposed to mitigate multicollinearity. Although many of these procedures are controversial, they can be helpful in applying multiple linear regression to some physiological problems.
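
One common diagnostic for the redundancy described here is the variance inflation factor (VIF). The sketch below covers only the two-predictor case, where VIF reduces to 1 / (1 - r^2); the data are invented for illustration and the helper names are assumptions, not something the article prescribes.

```python
from math import sqrt


def pearson(a, b):
    """Pearson correlation between two equal-length sequences."""
    n = len(a)
    mean_a, mean_b = sum(a) / n, sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    return cov / sqrt(var_a * var_b)


def vif_two_predictors(x1, x2):
    """VIF shared by both predictors in a two-predictor model."""
    r = pearson(x1, x2)
    tolerance = 1.0 - r * r
    if tolerance <= 1e-12:
        return float("inf")  # perfectly collinear predictors
    return 1.0 / tolerance


# Hypothetical predictors that change almost in parallel.
x1 = [1.0, 2.0, 3.0, 4.0, 5.0]
x2 = [1.1, 1.9, 3.2, 3.9, 5.1]
print(f"VIF for nearly parallel predictors: {vif_two_predictors(x1, x2):.1f}")

# Hypothetical predictors from an orthogonal (well-designed) experiment.
print(f"VIF for orthogonal predictors: {vif_two_predictors([1, -1, 1, -1], [1, 1, -1, -1]):.1f}")
```

A VIF of 1 means the sampling variance of a coefficient is not inflated at all; large values correspond to the large standard errors the abstract mentions.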
Use of principal-component, correlation, and stepwise multiple-regression analyses to investigate selected physical and hydraulic properties of carbonate-rock aquifers

USGS Publications Warehouse

Brown, C. Erwin

1993-01-01

Correlation analysis in conjunction with principal-component and multiple-regression analyses was applied to laboratory chemical and petrographic data to assess the usefulness of these techniques in evaluating selected physical and hydraulic properties of carbonate-rock aquifers in central Pennsylvania. Correlation and principal-component analyses were used to establish relations and associations among variables, to determine dimensions of property variation of samples, and to filter the variables containing similar information. Principal-component and correlation analyses showed that porosity is related to other measured variables and that permeability is most related to porosity and grain size. Four principal components are found to be significant in explaining the variance of data. Stepwise multiple-regression analysis was used to see how well the measured variables could predict porosity and (or) permeability for this suite of rocks. The variation in permeability and porosity is not totally predicted by the other variables, but the regression is significant at the 5% significance level. © 1993.
Beyond Multiple Regression: Using Commonality Analysis to Better Understand R² Results

ERIC Educational Resources Information Center

Warne, Russell T.

2011-01-01

Multiple regression is one of the most common statistical methods used in quantitative educational research. Despite the versatility and easy interpretability of multiple regression, it has some shortcomings in the detection of suppressor variables and for somewhat arbitrarily assigning values to the structure coefficients of correlated…
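
For two predictors, commonality analysis partitions R-squared into two unique components and one common component using only the R-squared values of the three possible models. The sketch below assumes those values are already available; the function name and the example numbers are hypothetical.

```python
def commonality_two_predictors(r2_x1, r2_x2, r2_both):
    """Partition R-squared for a two-predictor model.

    r2_x1   : R-squared of the model with x1 only
    r2_x2   : R-squared of the model with x2 only
    r2_both : R-squared of the model with both predictors
    """
    unique_x1 = r2_both - r2_x2
    unique_x2 = r2_both - r2_x1
    common = r2_x1 + r2_x2 - r2_both
    return {"unique_x1": unique_x1, "unique_x2": unique_x2, "common": common}


# Hypothetical case of two overlapping predictors.
print(commonality_two_predictors(0.30, 0.20, 0.40))

# Hypothetical suppressor case: x2 alone explains nothing, yet adding it
# raises R-squared from 0.5 to 1.0, so the common component is negative.
print(commonality_two_predictors(0.50, 0.00, 1.00))
```

The three components always sum to the full-model R-squared, and a negative common component is one signal that suppression may be present.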
Factor analysis and multiple regression between topography and precipitation on Jeju Island, Korea

NASA Astrophysics Data System (ADS)

Um, Myoung-Jin; Yun, Hyeseon; Jeong, Chang-Sam; Heo, Jun-Haeng

2011-11-01

In this study, new factors that influence precipitation were extracted from geographic variables using factor analysis, which allow for an accurate estimation of orographic precipitation. Correlation analysis was also used to examine the relationship between nine topographic variables from digital elevation models (DEMs) and the precipitation in Jeju Island. In addition, a spatial analysis was performed in order to verify the validity of the regression model. From the results of the correlation analysis, it was found that all of the topographic variables had a positive correlation with the precipitation. The relations between the variables also changed in accordance with a change in the precipitation duration. However, upon examining the correlation matrix, no significant relationship between the latitude and the aspect was found. According to the factor analysis, eight topographic variables (latitude being the exception) were found to have a direct influence on the precipitation. Three factors were then extracted from the eight topographic variables. By directly comparing the multiple regression model with the factors (model 1) to the multiple regression model with the topographic variables (model 3), it was found that model 1 did not violate the limits of statistical significance and multicollinearity. As such, model 1 was considered to be appropriate for estimating the precipitation when taking into account the topography. In the study of model 1, the multiple regression model using factor analysis was found to be the best method for estimating the orographic precipitation on Jeju Island.
Conjoint Analysis: A Study of the Effects of Using Person Variables.

ERIC Educational Resources Information Center

Fraas, John W.; Newman, Isadore

Three statistical techniques--conjoint analysis, a multiple linear regression model, and a multiple linear regression model with a surrogate person variable--were used to estimate the relative importance of five university attributes for students in the process of selecting a college. The five attributes include: availability and variety of…
Adjusted variable plots for Cox's proportional hazards regression model.

PubMed

Hall, C B; Zeger, S L; Bandeen-Roche, K J

1996-01-01

Adjusted variable plots are useful in linear regression for outlier detection and for qualitative evaluation of the fit of a model. In this paper, we extend adjusted variable plots to Cox's proportional hazards model for possibly censored survival data. We propose three different plots: a risk level adjusted variable (RLAV) plot in which each observation in each risk set appears, a subject level adjusted variable (SLAV) plot in which each subject is represented by one point, and an event level adjusted variable (ELAV) plot in which the entire risk set at each failure event is represented by a single point. The latter two plots are derived from the RLAV by combining multiple points. In each point, the regression coefficient and standard error from a Cox proportional hazards regression is obtained by a simple linear regression through the origin fit to the coordinates of the pictured points. The plots are illustrated with a reanalysis of a dataset of 65 patients with multiple myeloma.
Regression Analysis with Dummy Variables: Use and Interpretation.

ERIC Educational Resources Information Center

Hinkle, Dennis E.; Oliver, J. Dale

1986-01-01

Multiple regression analysis (MRA) may be used when both continuous and categorical variables are included as independent research variables. The use of MRA with categorical variables involves dummy coding, that is, assigning zeros and ones to levels of categorical variables. Caution is urged in results interpretation. (Author/CH)
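
A minimal sketch of dummy coding, assuming a single categorical predictor with a chosen reference level. The group labels and scores are invented. The second helper relies on a standard property: when the only predictors are the dummy variables for one categorical variable, the least-squares intercept is the reference-group mean and each coefficient is that group's mean minus the reference mean.

```python
def dummy_code(values, reference):
    """Return (levels, rows): one 0/1 column per non-reference level."""
    levels = sorted(set(values) - {reference})
    rows = [[1 if v == level else 0 for level in levels] for v in values]
    return levels, rows


def dummy_regression_coefficients(values, y, reference):
    """Intercept and slopes of y regressed on the dummy columns alone."""
    def group_mean(level):
        members = [yi for v, yi in zip(values, y) if v == level]
        return sum(members) / len(members)

    intercept = group_mean(reference)
    levels, _ = dummy_code(values, reference)
    slopes = {level: group_mean(level) - intercept for level in levels}
    return intercept, slopes


# Hypothetical three-group data.
group = ["control", "drugA", "drugB", "control", "drugA", "drugB"]
score = [10, 14, 19, 12, 16, 21]

levels, rows = dummy_code(group, reference="control")
intercept, slopes = dummy_regression_coefficients(group, score, "control")
print(levels)
print(rows)
print(intercept, slopes)
```

This is why interpretation needs care: each coefficient is a contrast with the reference group, so changing the reference level changes every coefficient without changing the fit.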
Latent Variable Regression 4-Level Hierarchical Model Using Multisite Multiple-Cohorts Longitudinal Data. CRESST Report 801

ERIC Educational Resources Information Center

Choi, Kilchan

2011-01-01

This report explores a new latent variable regression 4-level hierarchical model for monitoring school performance over time using multisite multiple-cohorts longitudinal data. This kind of data set has a 4-level hierarchical structure: time-series observation nested within students who are nested within different cohorts of students. These…
False Positives in Multiple Regression: Unanticipated Consequences of Measurement Error in the Predictor Variables

ERIC Educational Resources Information Center

Shear, Benjamin R.; Zumbo, Bruno D.

2013-01-01

Type I error rates in multiple regression, and hence the chance for false positive research findings, can be drastically inflated when multiple regression models are used to analyze data that contain random measurement error. This article shows the potential for inflated Type I error rates in commonly encountered scenarios and provides new…

Soil Cd, Cr, Cu, Ni, Pb and Zn sorption and retention models using SVM: Variable selection and competitive model.

PubMed

González Costa, J J; Reigosa, M J; Matías, J M; Covelo, E F

2017-09-01

The aim of this study was to model the sorption and retention of Cd, Cu, Ni, Pb and Zn in soils. To that end, the sorption and retention of these metals were studied and the soil characterization was performed separately. Multiple stepwise regression was used to produce multivariate models with linear techniques and with support vector machines, all of which included 15 explanatory variables characterizing soils. When the R-squared values are represented, two different groups are noticed. Cr, Cu and Pb sorption and retention show a higher R-squared; the most explanatory variables being humified organic matter, Al oxides and, in some cases, cation-exchange capacity (CEC). The other group of metals (Cd, Ni and Zn) shows a lower R-squared, and clays are the most explanatory variables, including a percentage of vermiculite and slime. In some cases, quartz, plagioclase or hematite percentages also show some explanatory capacity. Support Vector Machine (SVM) regression shows that the different models are not as regular as in multiple regression in terms of number of variables, the regression for nickel adsorption being the one with the highest number of variables in its optimal model. On the other hand, there are cases where the most explanatory variables are the same for two metals, as it happens with Cd and Cr adsorption. A similar adsorption mechanism is thus postulated. These patterns of the introduction of variables in the model allow us to create explainability sequences. Those which are the most similar to the selectivity sequences obtained by Covelo (2005) are Mn oxides in multiple regression and change capacity in SVM. Among all the variables, the only one that is explanatory for all the metals after applying the maximum parsimony principle is the percentage of sand in the retention process. In the competitive model arising from the aforementioned sequences, the most intense competitiveness for the adsorption and retention of different metals appears between Cr and Cd, Cu and Zn in multiple regression; and between Cr and Cd in SVM regression. Copyright © 2017 Elsevier B.V. All rights reserved.
General Nature of Multicollinearity in Multiple Regression Analysis.

ERIC Educational Resources Information Center

Liu, Richard

1981-01-01

Discusses multiple regression, a very popular statistical technique in the field of education. One of the basic assumptions in regression analysis requires that independent variables in the equation should not be highly correlated. The problem of multicollinearity and some of the solutions to it are discussed. (Author)
Modeling Polytomous Item Responses Using Simultaneously Estimated Multinomial Logistic Regression Models

ERIC Educational Resources Information Center

Anderson, Carolyn J.; Verkuilen, Jay; Peyton, Buddy L.

2010-01-01

Survey items with multiple response categories and multiple-choice test questions are ubiquitous in psychological and educational research. We illustrate the use of log-multiplicative association (LMA) models that are extensions of the well-known multinomial logistic regression model for multiple dependent outcome variables to reanalyze a set of…
Statistical methods and regression analysis of stratospheric ozone and meteorological variables in Isfahan

NASA Astrophysics Data System (ADS)

Hassanzadeh, S.; Hosseinibalam, F.; Omidvari, M.

2008-04-01

Data of seven meteorological variables (relative humidity, wet temperature, dry temperature, maximum temperature, minimum temperature, ground temperature and sun radiation time) and ozone values have been used for statistical analysis. Meteorological variables and ozone values were analyzed using both multiple linear regression and principal component methods. Data for the period 1999-2004 are analyzed jointly using both methods. For all periods, temperature dependent variables were highly correlated, but were all negatively correlated with relative humidity. Multiple regression analysis was used to fit the meteorological variables using the meteorological variables as predictors. A variable selection method based on high loading of varimax rotated principal components was used to obtain subsets of the predictor variables to be included in the linear regression model of the meteorological variables. In 1999, 2001 and 2002 one of the meteorological variables was weakly influenced predominantly by the ozone concentrations. However, the model did not predict that the meteorological variables for the year 2000 were not influenced predominantly by the ozone concentrations that point to variation in sun radiation. This could be due to other factors that were not explicitly considered in this study.
Enhance-Synergism and Suppression Effects in Multiple Regression

ERIC Educational Resources Information Center

Lipovetsky, Stan; Conklin, W. Michael

2004-01-01

Relations between pairwise correlations and the coefficient of multiple determination in regression analysis are considered. The conditions for the occurrence of enhance-synergism and suppression effects when multiple determination becomes bigger than the total of squared correlations of the dependent variable with the regressors are discussed. It…
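
For the two-regressor case, the comparison described here can be written out explicitly. The following is a standard identity rather than the authors' own derivation; the symbols $r_{y1}$, $r_{y2}$ (correlations of the dependent variable with each regressor) and $r_{12}$ (correlation between the regressors) are notation assumed for this sketch.

```latex
R^2 = \frac{r_{y1}^2 + r_{y2}^2 - 2\, r_{y1} r_{y2} r_{12}}{1 - r_{12}^2},
\qquad
R^2 - \left(r_{y1}^2 + r_{y2}^2\right)
  = \frac{r_{12}\left[\, r_{12}\left(r_{y1}^2 + r_{y2}^2\right) - 2\, r_{y1} r_{y2} \right]}{1 - r_{12}^2}.
```

Under these assumptions, the coefficient of multiple determination exceeds the total of squared correlations exactly when $r_{12}\left[\, r_{12}(r_{y1}^2 + r_{y2}^2) - 2\, r_{y1} r_{y2} \right] > 0$. For instance, with $r_{y1} = r_{12} = 1/\sqrt{2}$ and $r_{y2} = 0$, the excess is $0.5$ and $R^2 = 1$.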
Statistical experiments using the multiple regression research for prediction of proper hardness in areas of phosphorus cast-iron brake shoes manufacturing

NASA Astrophysics Data System (ADS)

Kiss, I.; Cioată, V. G.; Ratiu, S. A.; Rackov, M.; Penčić, M.

2018-01-01

Multivariate research is important in cast-iron brake shoe manufacturing because many variables interact with each other simultaneously. This article focuses on expressing a multiple linear regression model that relates hardness to the chemical composition of the phosphorous cast irons used for brake shoes, bearing in mind that the regression coefficients illustrate the separate contribution of each independent variable towards predicting the dependent variable. To establish the multiple correlations between the hardness of the cast-iron brake shoes and their chemical compositions, several regression equations have been proposed. A mathematical solution is sought that can determine the optimum chemical composition for the desired hardness values. Starting from these premises, two new statistical experiments are carried out on the contents of phosphorus [P], manganese [Mn] and silicon [Si]. The regression equations that describe the mathematical dependency between these elements and the hardness are then determined. As a result, several correlation charts are presented.
Interpret with caution: multicollinearity in multiple regression of cognitive data.

PubMed

Morrison, Catriona M

2003-08-01

Shibihara and Kondo in 2002 reported a reanalysis of the 1997 Kanji picture-naming data of Yamazaki, Ellis, Morrison, and Lambon-Ralph in which independent variables were highly correlated. Their addition of the variable visual familiarity altered the previously reported pattern of results, indicating that visual familiarity, but not age of acquisition, was important in predicting Kanji naming speed. The present paper argues that caution should be taken when drawing conclusions from multiple regression analyses in which the independent variables are so highly correlated, as such multicollinearity can lead to unreliable output.
The prediction of intelligence in preschool children using alternative models to regression.

PubMed

Finch, W Holmes; Chang, Mei; Davis, Andrew S; Holden, Jocelyn E; Rothlisberg, Barbara A; McIntosh, David E

2011-12-01

Statistical prediction of an outcome variable using multiple independent variables is a common practice in the social and behavioral sciences. For example, neuropsychologists are sometimes called upon to provide predictions of preinjury cognitive functioning for individuals who have suffered a traumatic brain injury. Typically, these predictions are made using standard multiple linear regression models with several demographic variables (e.g., gender, ethnicity, education level) as predictors. Prior research has shown conflicting evidence regarding the ability of such models to provide accurate predictions of outcome variables such as full-scale intelligence (FSIQ) test scores. The present study had two goals: (1) to demonstrate the utility of a set of alternative prediction methods that have been applied extensively in the natural sciences and business but have not been frequently explored in the social sciences and (2) to develop models that can be used to predict premorbid cognitive functioning in preschool children. Predictions of Stanford-Binet 5 FSIQ scores for preschool-aged children are used to compare the performance of a multiple regression model with several of these alternative methods. Results demonstrate that classification and regression trees provided more accurate predictions of FSIQ scores than did the more traditional regression approach. Implications of these results are discussed.
Noninvasive spectral imaging of skin chromophores based on multiple regression analysis aided by Monte Carlo simulation

NASA Astrophysics Data System (ADS)

Nishidate, Izumi; Wiswadarma, Aditya; Hase, Yota; Tanaka, Noriyuki; Maeda, Takaaki; Niizeki, Kyuichi; Aizu, Yoshihisa

2011-08-01

In order to visualize melanin and blood concentrations and oxygen saturation in human skin tissue, a simple imaging technique based on multispectral diffuse reflectance images acquired at six wavelengths (500, 520, 540, 560, 580 and 600 nm) was developed. The technique utilizes multiple regression analysis aided by Monte Carlo simulation for diffuse reflectance spectra. Using the absorbance spectrum as a response variable and the extinction coefficients of melanin, oxygenated hemoglobin, and deoxygenated hemoglobin as predictor variables, multiple regression analysis provides regression coefficients. Concentrations of melanin and total blood are then determined from the regression coefficients using conversion vectors that are deduced numerically in advance, while oxygen saturation is obtained directly from the regression coefficients. Experiments with a tissue-like agar gel phantom validated the method. In vivo experiments with skin of the human hand during upper limb occlusion and of the inner forearm exposed to UV irradiation demonstrated the ability of the method to evaluate physiological reactions of human skin tissue.
Understanding logistic regression analysis.

PubMed

Sperandei, Sandro

2014-01-01

Logistic regression is used to obtain odds ratio in the presence of more than one explanatory variable. The procedure is quite similar to multiple linear regression, with the exception that the response variable is binomial. The result is the impact of each variable on the odds ratio of the observed event of interest. The main advantage is to avoid confounding effects by analyzing the association of all variables together. In this article, we explain the logistic regression procedure using examples to make it as simple as possible. After definition of the technique, the basic interpretation of the results is highlighted and then some special issues are discussed.
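
The link between a logistic regression coefficient and an odds ratio can be illustrated with a single binary predictor, where the maximum-likelihood slope equals the log of the odds ratio from the 2x2 table. The counts and helper names below are hypothetical and serve only as a sketch.

```python
from math import exp, log


def odds_ratio_from_table(exposed_events, exposed_nonevents,
                          unexposed_events, unexposed_nonevents):
    """Odds ratio of the event for exposed versus unexposed subjects."""
    return (exposed_events * unexposed_nonevents) / (exposed_nonevents * unexposed_events)


def predicted_probability(intercept, slope, x):
    """Logistic model: P(event) = 1 / (1 + exp(-(intercept + slope * x)))."""
    return 1.0 / (1.0 + exp(-(intercept + slope * x)))


# Hypothetical counts: 20 of 100 exposed and 10 of 100 unexposed have the event.
odds_ratio = odds_ratio_from_table(20, 80, 10, 90)

# For one binary predictor, the fitted coefficients are the log odds in the
# unexposed group (intercept) and the log odds ratio (slope).
intercept = log(10 / 90)
slope = log(odds_ratio)

print(f"odds ratio = {odds_ratio:.2f}, exp(slope) = {exp(slope):.2f}")
print(f"P(event | unexposed) = {predicted_probability(intercept, slope, 0):.2f}")
print(f"P(event | exposed)   = {predicted_probability(intercept, slope, 1):.2f}")
```

With several explanatory variables the same reading applies to each coefficient, except that each odds ratio is then adjusted for the other variables in the model.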
Multicollinearity is a red herring in the search for moderator variables: A guide to interpreting moderated multiple regression models and a critique of Iacobucci, Schneider, Popovich, and Bakamitsos (2016).

PubMed

McClelland, Gary H; Irwin, Julie R; Disatnik, David; Sivan, Liron

2017-02-01

Multicollinearity is irrelevant to the search for moderator variables, contrary to the implications of Iacobucci, Schneider, Popovich, and Bakamitsos (Behavior Research Methods, 2016, this issue). Multicollinearity is like the red herring in a mystery novel that distracts the statistical detective from the pursuit of a true moderator relationship. We show multicollinearity is completely irrelevant for tests of moderator variables. Furthermore, readers of Iacobucci et al. might be confused by a number of their errors. We note those errors, but more positively, we describe a variety of methods researchers might use to test and interpret their moderated multiple regression models, including two-stage testing, mean-centering, spotlighting, orthogonalizing, and floodlighting without regard to putative issues of multicollinearity. We cite a number of recent studies in the psychological literature in which the researchers used these methods appropriately to test, to interpret, and to report their moderated multiple regression models. We conclude with a set of recommendations for the analysis and reporting of moderated multiple regression that should help researchers better understand their models and facilitate generalizations across studies.
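
One way to see why mean-centering does not affect the test of a moderator is that the product-term coefficient is identical whether or not the predictors are centered, even though the lower-order coefficients change. The sketch below uses invented data and a small hand-written least-squares solver (an assumption made to keep the example dependency-free); it is not the authors' code.

```python
def ols(columns, y):
    """Least-squares coefficients by solving the normal equations (X'X) b = X'y."""
    k = len(columns)
    # Augmented matrix [X'X | X'y].
    a = [
        [sum(ci * cj for ci, cj in zip(columns[i], columns[j])) for j in range(k)]
        + [sum(ci * yi for ci, yi in zip(columns[i], y))]
        for i in range(k)
    ]
    # Gauss-Jordan elimination with partial pivoting.
    for col in range(k):
        pivot = max(range(col, k), key=lambda r: abs(a[r][col]))
        a[col], a[pivot] = a[pivot], a[col]
        for r in range(k):
            if r != col:
                factor = a[r][col] / a[col][col]
                a[r] = [v - factor * w for v, w in zip(a[r], a[col])]
    return [a[i][k] / a[i][i] for i in range(k)]


def moderated_fit(x, z, y):
    """Coefficients [intercept, x, z, x*z] of a moderated regression."""
    ones = [1.0] * len(x)
    product = [xi * zi for xi, zi in zip(x, z)]
    return ols([ones, x, z, product], y)


def center(values):
    mean = sum(values) / len(values)
    return [v - mean for v in values]


# Hypothetical data with a true interaction plus small fixed disturbances.
x = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0]
z = [3.0, 1.0, 4.0, 1.0, 5.0, 9.0, 2.0, 6.0]
disturbance = [0.1, -0.2, 0.05, 0.0, -0.1, 0.15, -0.05, 0.05]
y = [1 + 0.5 * xi + 0.3 * zi + 0.2 * xi * zi + e
     for xi, zi, e in zip(x, z, disturbance)]

raw = moderated_fit(x, z, y)
centered = moderated_fit(center(x), center(z), y)
print("raw coefficients:     ", [round(b, 4) for b in raw])
print("centered coefficients:", [round(b, 4) for b in centered])
```

The last entry of each list (the product term) matches, because the centered and uncentered design matrices span the same column space; only the meaning of the lower-order terms shifts.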
FIRE: an SPSS program for variable selection in multiple linear regression analysis via the relative importance of predictors.

PubMed

Lorenzo-Seva, Urbano; Ferrando, Pere J

2011-03-01

We provide an SPSS program that implements currently recommended techniques and recent developments for selecting variables in multiple linear regression analysis via the relative importance of predictors. The approach consists of: (1) optimally splitting the data for cross-validation, (2) selecting the final set of predictors to be retained in the regression equation, and (3) assessing the behavior of the chosen model using standard indices and procedures. The SPSS syntax, a short manual, and data files related to this article are available as supplemental materials from brm.psychonomic-journals.org/content/supplemental.
The extraction of simple relationships in growth factor-specific multiple-input and multiple-output systems in cell-fate decisions by backward elimination PLS regression.

PubMed

Akimoto, Yuki; Yugi, Katsuyuki; Uda, Shinsuke; Kudo, Takamasa; Komori, Yasunori; Kubota, Hiroyuki; Kuroda, Shinya

2013-01-01

Cells use common signaling molecules for the selective control of downstream gene expression and cell-fate decisions. The relationship between signaling molecules and downstream gene expression and cellular phenotypes is a multiple-input and multiple-output (MIMO) system and is difficult to understand due to its complexity. For example, it has been reported that, in PC12 cells, different types of growth factors activate MAP kinases (MAPKs) including ERK, JNK, and p38, and CREB, for selective protein expression of immediate early genes (IEGs) such as c-FOS, c-JUN, EGR1, JUNB, and FOSB, leading to cell differentiation, proliferation and cell death; however, how multiple-inputs such as MAPKs and CREB regulate multiple-outputs such as expression of the IEGs and cellular phenotypes remains unclear. To address this issue, we employed a statistical method called partial least squares (PLS) regression, which involves a reduction of the dimensionality of the inputs and outputs into latent variables and a linear regression between these latent variables. We measured 1,200 data points for MAPKs and CREB as the inputs and 1,900 data points for IEGs and cellular phenotypes as the outputs, and we constructed the PLS model from these data. The PLS model highlighted the complexity of the MIMO system and growth factor-specific input-output relationships of cell-fate decisions in PC12 cells. Furthermore, to reduce the complexity, we applied a backward elimination method to the PLS regression, in which 60 input variables were reduced to 5 variables, including the phosphorylation of ERK at 10 min, CREB at 5 min and 60 min, AKT at 5 min and JNK at 30 min. The simple PLS model with only 5 input variables demonstrated a predictive ability comparable to that of the full PLS model. The 5 input variables effectively extracted the growth factor-specific simple relationships within the MIMO system in cell-fate decisions in PC12 cells.
Cross Validation of Selection of Variables in Multiple Regression.

DTIC Science & Technology

1979-12-01

Cross Validation of Selection of Variables in Multiple Regression. I. Introduction. Background: Long term DoD planning goals... had adequate predictive capabilities; the other two models (the...
MULGRES: a computer program for stepwise multiple regression analysis

Treesearch

A. Jeff Martin

1971-01-01

MULGRES is a computer program source deck that is designed for multiple regression analysis employing the technique of stepwise deletion in the search for most significant variables. The features of the program, along with inputs and outputs, are briefly described, with a note on machine compatibility.
Selection of a Geostatistical Method to Interpolate Soil Properties of the State Crop Testing Fields using Attributes of a Digital Terrain Model

NASA Astrophysics Data System (ADS)

Sahabiev, I. A.; Ryazanov, S. S.; Kolcova, T. G.; Grigoryan, B. R.

2018-03-01

The three most common techniques to interpolate soil properties at a field scale, ordinary kriging (OK), regression kriging with multiple linear regression drift model (RK + MLR), and regression kriging with principal component regression drift model (RK + PCR), were examined. The results of the study were compiled into an algorithm for choosing the most appropriate soil mapping technique. Relief attributes were used as the auxiliary variables. When spatial dependence of a target variable was strong, the OK method showed more accurate interpolation results, and the inclusion of the auxiliary data resulted in an insignificant improvement in prediction accuracy. According to the algorithm, the RK + PCR method effectively eliminates multicollinearity of explanatory variables. However, if the number of predictors is less than ten, the probability of multicollinearity is reduced, and application of PCR is no longer justified. In that case, multiple linear regression should be used instead.
Determining the Spatial and Seasonal Variability in OM/OC Ratios across the U.S. Using Multiple Regression

EPA Science Inventory

Data from the Interagency Monitoring of Protected Visual Environments (IMPROVE) network are used to estimate organic mass to organic carbon (OM/OC) ratios across the United States by extending previously published multiple regression techniques. Our new methodology addresses com...
Multiple Regression: A Leisurely Primer.

ERIC Educational Resources Information Center

Daniel, Larry G.; Onwuegbuzie, Anthony J.

Multiple regression is a useful statistical technique when the researcher is considering situations in which variables of interest are theorized to be multiply caused. It may also be useful in those situations in which the researcher is interested in studies of predictability of phenomena of interest. This paper provides an introduction to…
A Technique of Fuzzy C-Mean in Multiple Linear Regression Model toward Paddy Yield

NASA Astrophysics Data System (ADS)

Syazwan Wahab, Nur; Saifullah Rusiman, Mohd; Mohamad, Mahathir; Amira Azmi, Nur; Che Him, Norziha; Ghazali Kamardan, M.; Ali, Maselan

2018-04-01

In this paper, we propose a hybrid model that combines a multiple linear regression model with the fuzzy c-means method. This research involved the relationship between 20 variates of the topsoil, analyzed prior to planting, and paddy yields at standard fertilizer rates. Data used were from the multi-location trials for rice carried out by MARDI at major paddy granaries in Peninsular Malaysia during the period from 2009 to 2012. Missing observations were estimated using mean estimation techniques. The data were analyzed using a multiple linear regression model and a combination of the multiple linear regression model and the fuzzy c-means method. Analysis of normality and multicollinearity indicates that the data are normally distributed without multicollinearity among the independent variables. Fuzzy c-means analysis clusters the paddy yield into two clusters before the multiple linear regression model is applied. The comparison between the two methods indicates that the hybrid of the multiple linear regression model and the fuzzy c-means method outperforms the multiple linear regression model alone, with a lower mean square error.
[Prediction model of health workforce and beds in county hospitals of Hunan by multiple linear regression].

PubMed

Ling, Ru; Liu, Jiawang

2011-12-01

To construct a prediction model for health workforce and hospital beds in county hospitals of Hunan by multiple linear regression. We surveyed 16 counties in Hunan with stratified random sampling using uniform questionnaires, and multiple linear regression analysis was done with 20 indicators selected by literature review. Independent variables in the multiple linear regression model on medical personnel in county hospitals included the counties' urban residents' income, crude death rate, medical beds, business occupancy, professional equipment value, the number of devices valued above 10 000 yuan, fixed assets, long-term debt, medical income, medical expenses, outpatient and emergency visits, hospital visits, actual available bed days, and utilization rate of hospital beds. Independent variables in the multiple linear regression model on county hospital beds included the population aged 65 and above in the counties, disposable income of urban residents, medical personnel of medical institutions in the county area, business occupancy, the total value of professional equipment, fixed assets, long-term debt, medical income, medical expenses, outpatient and emergency visits, hospital visits, actual available bed days, utilization rate of hospital beds, and length of hospitalization. The prediction model shows good explanatory power and fit, and may be used for short- and mid-term forecasting.

Estimation of streamflow, base flow, and nitrate-nitrogen loads in Iowa using multiple linear regression models

USGS Publications Warehouse

Schilling, K.E.; Wolter, C.F.

2005-01-01

Nineteen variables, including precipitation, soils and geology, land use, and basin morphologic characteristics, were evaluated to develop Iowa regression models to predict total streamflow (Q), base flow (Qb), storm flow (Qs) and base flow percentage (%Qb) in gauged and ungauged watersheds in the state. Discharge records from a set of 33 watersheds across the state for the 1980 to 2000 period were separated into Qb and Qs. Multiple linear regression found that 75.5 percent of long term average Q was explained by rainfall, sand content, and row crop percentage variables, whereas 88.5 percent of Qb was explained by these three variables plus permeability and floodplain area variables. Qs was explained by average rainfall and %Qb was a function of row crop percentage, permeability, and basin slope variables. Regional regression models developed for long term average Q and Qb were adapted to annual rainfall and showed good correlation between measured and predicted values. Combining the regression model for Q with an estimate of mean annual nitrate concentration, a map of potential nitrate loads in the state was produced. Results from this study have important implications for understanding geomorphic and land use controls on streamflow and base flow in Iowa watersheds and similar agriculture dominated watersheds in the glaciated Midwest. (JAWRA) (Copyright © 2005).
Estimation of 1RM for knee extension based on the maximal isometric muscle strength and body composition.

PubMed

Kanada, Yoshikiyo; Sakurai, Hiroaki; Sugiura, Yoshito; Arai, Tomoaki; Koyama, Soichiro; Tanabe, Shigeo

2017-11-01

[Purpose] To create a regression formula in order to estimate 1RM for knee extensors, based on the maximal isometric muscle strength measured using a hand-held dynamometer and data regarding the body composition. [Subjects and Methods] Measurement was performed in 21 healthy males in their twenties to thirties. Single regression analysis was performed, with measurement values representing 1RM and the maximal isometric muscle strength as dependent and independent variables, respectively. Furthermore, multiple regression analysis was performed, with data regarding the body composition incorporated as another independent variable, in addition to the maximal isometric muscle strength. [Results] Through single regression analysis with the maximal isometric muscle strength as an independent variable, the following regression formula was created: 1RM (kg)=0.714 + 0.783 × maximal isometric muscle strength (kgf). On multiple regression analysis, only the total muscle mass was extracted. [Conclusion] A highly accurate regression formula to estimate 1RM was created based on both the maximal isometric muscle strength and body composition. Using a hand-held dynamometer and body composition analyzer, it was possible to measure these items in a short time, and obtain clinically useful results.
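The single regression formula reported in this abstract can be applied directly. The sketch below simply encodes the published intercept and slope; the function name and the sample strength values are illustrative assumptions, not part of the study.

```python
def estimate_1rm_kg(isometric_strength_kgf: float) -> float:
    """Estimate knee-extension 1RM (kg) from maximal isometric strength (kgf).

    Coefficients are those quoted in the abstract:
    1RM (kg) = 0.714 + 0.783 x maximal isometric muscle strength (kgf).
    """
    intercept = 0.714
    slope = 0.783
    return intercept + slope * isometric_strength_kgf


if __name__ == "__main__":
    # Hypothetical dynamometer readings, chosen only for illustration.
    for strength in (30.0, 40.0, 50.0):
        print(f"{strength:.1f} kgf -> {estimate_1rm_kg(strength):.3f} kg")
```

Because the formula was fitted on 21 healthy males in their twenties to thirties, estimates for other populations should be treated with caution.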
Ridge: a computer program for calculating ridge regression estimates

Treesearch

Donald E. Hilt; Donald W. Seegrist

1977-01-01

Least-squares coefficients for multiple-regression models may be unstable when the independent variables are highly correlated. Ridge regression is a biased estimation procedure that produces stable estimates of the coefficients. Ridge regression is discussed, and a computer program for calculating the ridge coefficients is presented.
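To illustrate the idea in this abstract, the following is a minimal sketch of the textbook ridge estimator, (X'X + kI)^-1 X'y, in plain Python. It is not the program described in the report; the function names, the toy data, and the choice k = 1 are all assumptions made for illustration, and the data are assumed to be centered so that no intercept is needed.

```python
def solve_linear_system(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):  # back substitution
        s = sum(m[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (m[r][n] - s) / m[r][r]
    return x


def ridge_coefficients(x_rows, y, k):
    """Ridge estimates for centered data; k = 0 gives ordinary least squares."""
    p = len(x_rows[0])
    xtx = [[sum(row[i] * row[j] for row in x_rows) for j in range(p)]
           for i in range(p)]
    xty = [sum(row[i] * yi for row, yi in zip(x_rows, y)) for i in range(p)]
    for i in range(p):
        xtx[i][i] += k  # the ridge penalty is added to the diagonal
    return solve_linear_system(xtx, xty)


# Toy data with two highly correlated predictors (illustrative only).
X = [[-2.0, -2.1], [-1.0, -0.9], [0.0, 0.1], [1.0, 0.9], [2.0, 2.0]]
Y = [-4.0, -2.1, 0.2, 1.9, 4.0]

if __name__ == "__main__":
    print("least squares:", ridge_coefficients(X, Y, 0.0))
    print("ridge (k=1):  ", ridge_coefficients(X, Y, 1.0))
```

The ridge estimates are biased toward zero, which is the price paid for the reduced sensitivity to correlation among the predictors.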
Introduction to the use of regression models in epidemiology.

PubMed

Bender, Ralf

2009-01-01

Regression modeling is one of the most important statistical techniques used in analytical epidemiology. By means of regression models the effect of one or several explanatory variables (e.g., exposures, subject characteristics, risk factors) on a response variable such as mortality or cancer can be investigated. From multiple regression models, adjusted effect estimates can be obtained that take the effect of potential confounders into account. Regression methods can be applied in all epidemiologic study designs so that they represent a universal tool for data analysis in epidemiology. Different kinds of regression models have been developed in dependence on the measurement scale of the response variable and the study design. The most important methods are linear regression for continuous outcomes, logistic regression for binary outcomes, Cox regression for time-to-event data, and Poisson regression for frequencies and rates. This chapter provides a nontechnical introduction to these regression models with illustrating examples from cancer research.
Regression: The Apple Does Not Fall Far From the Tree.

PubMed

Vetter, Thomas R; Schober, Patrick

2018-05-15

Researchers and clinicians are frequently interested in either: (1) assessing whether there is a relationship or association between 2 or more variables and quantifying this association; or (2) determining whether 1 or more variables can predict another variable. The strength of such an association is mainly described by the correlation. However, regression analysis and regression models can be used not only to identify whether there is a significant relationship or association between variables but also to generate estimations of such a predictive relationship between variables. This basic statistical tutorial discusses the fundamental concepts and techniques related to the most common types of regression analysis and modeling, including simple linear regression, multiple regression, logistic regression, ordinal regression, and Poisson regression, as well as the common yet often underrecognized phenomenon of regression toward the mean. The various types of regression analysis are powerful statistical techniques, which when appropriately applied, can allow for the valid interpretation of complex, multifactorial data. Regression analysis and models can assess whether there is a relationship or association between 2 or more observed variables and estimate the strength of this association, as well as determine whether 1 or more variables can predict another variable. Regression is thus being applied more commonly in anesthesia, perioperative, critical care, and pain research. However, it is crucial to note that regression can identify plausible risk factors; it does not prove causation (a definitive cause and effect relationship). The results of a regression analysis instead identify independent (predictor) variable(s) associated with the dependent (outcome) variable. As with other statistical methods, applying regression requires that certain assumptions be met, which can be tested with specific diagnostics.
Early Parallel Activation of Semantics and Phonology in Picture Naming: Evidence from a Multiple Linear Regression MEG Study

PubMed Central

Miozzo, Michele; Pulvermüller, Friedemann; Hauk, Olaf

2015-01-01

The time course of brain activation during word production has become an area of increasingly intense investigation in cognitive neuroscience. The predominant view has been that semantic and phonological processes are activated sequentially, at about 150 and 200–400 ms after picture onset. Although evidence from prior studies has been interpreted as supporting this view, these studies were arguably not ideally suited to detect early brain activation of semantic and phonological processes. We here used a multiple linear regression approach to magnetoencephalography (MEG) analysis of picture naming in order to investigate early effects of variables specifically related to visual, semantic, and phonological processing. This was combined with distributed minimum-norm source estimation and region-of-interest analysis. Brain activation associated with visual image complexity appeared in occipital cortex at about 100 ms after picture presentation onset. At about 150 ms, semantic variables became physiologically manifest in left frontotemporal regions. In the same latency range, we found an effect of phonological variables in the left middle temporal gyrus. Our results demonstrate that multiple linear regression analysis is sensitive to early effects of multiple psycholinguistic variables in picture naming. Crucially, our results suggest that access to phonological information might begin in parallel with semantic processing around 150 ms after picture onset. PMID:25005037
The Use of Multiple Regression and Trend Analysis to Understand Enrollment Fluctuations. AIR Forum 1979 Paper.

ERIC Educational Resources Information Center

Campbell, S. Duke; Greenberg, Barry

The development of a predictive equation capable of explaining a significant percentage of enrollment variability at Florida International University is described. A model utilizing trend analysis and a multiple regression approach to enrollment forecasting was adapted to investigate enrollment dynamics at the university. Four independent…
Modelling fourier regression for time series data- a case study: modelling inflation in foods sector in Indonesia

NASA Astrophysics Data System (ADS)

Prahutama, Alan; Suparti; Wahyu Utami, Tiani

2018-03-01

Regression analysis models the relationship between response variables and predictor variables. The parametric approach to regression is strict about its assumptions, whereas a nonparametric regression model does not require an assumed model form. Time series data are observations of a variable recorded over time, so to model a time series by regression, the response and predictor variables must be determined first. The response variable is the observation at time t (yt), while the predictor variables are the significant lags. In nonparametric regression modeling, one developing approach is the Fourier series approach. One advantage of nonparametric regression using Fourier series is that it can handle data with a trigonometric pattern. Modeling with Fourier series requires a parameter K, which can be chosen using the Generalized Cross Validation method. In inflation modeling for the transportation, communication, and financial services sector, the Fourier series approach yields an optimal K of 120 parameters with an R-square of 99%, whereas multiple linear regression yields an R-square of 90%.
Order Selection for General Expression of Nonlinear Autoregressive Model Based on Multivariate Stepwise Regression

NASA Astrophysics Data System (ADS)

Shi, Jinfei; Zhu, Songqing; Chen, Ruwen

2017-12-01

An order selection method based on multiple stepwise regression is proposed for the General Expression of Nonlinear Autoregressive (GNAR) model, which converts the model order problem into variable selection for a multiple linear regression equation. The partial autocorrelation function is adopted to define the linear terms in the GNAR model. The result is set as the initial model, and then the nonlinear terms are introduced gradually. Statistics are chosen to assess how both the newly introduced and the originally existing variables improve the model characteristics, and these are used to determine which model variables to retain or eliminate. The optimal model is then obtained through a measure of data fitting or a significance test. Simulation and experiments on classic time-series data show that the proposed method is simple, reliable, and applicable to practical engineering.
Hierarchical multiple regression modelling on predictors of behavior and sexual practices at Takoradi Polytechnic, Ghana.

PubMed

Turkson, Anthony Joe; Otchey, James Eric

2015-01-14

Various psychosocial studies on health-related lifestyles emphasize that perceiving oneself as being at risk of HIV/AIDS infection is a necessary condition for adopting preventive behaviors. Hierarchical multiple regression models were used to examine the relationship between eight independent variables and one dependent variable to isolate predictors which have significant influence on behavior and sexual practices. A cross-sectional design was used for the study. A structured, close-ended, interviewer-administered questionnaire was used to collect primary data. A multistage stratified technique was used to sample views from 380 students from Takoradi Polytechnic, Ghana. A hierarchical multiple regression model was used to ascertain the significance of certain predictors of sexual behavior and practices. The variables extracted from the multiple regression were: the constant (Beta=14.202, t=2.279, p=0.023, significant); marital status (Beta=0.092, t=1.996, p<0.05, significant); knowledge on AIDS (Beta=0.090, t=1.996, p<0.05, significant); and attitude towards HIV/AIDS (Beta=0.486, t=10.575, p<0.001, highly significant). Thus, the best fitting model for predicting behavior and sexual practices was a linear combination of the constant, one's marital status, knowledge on HIV/AIDS and attitude towards HIV/AIDS: Y(Behavior and sexual practices) = Beta0 + Beta1(Marital status) + Beta2(Knowledge on HIV/AIDS issues) + Beta3(Attitude towards HIV/AIDS issues), where Beta0, Beta1, Beta2 and Beta3 are respectively 14.201, 2.038, 0.148 and 0.486; the higher the better. Attitude and behavior change education on HIV/AIDS should be intensified in the institution so that students could adopt better lifestyles.
Epidemiologic programs for computers and calculators. A microcomputer program for multiple logistic regression by unconditional and conditional maximum likelihood methods.

PubMed

Campos-Filho, N; Franco, E L

1989-02-01

A frequent procedure in matched case-control studies is to report results from the multivariate unmatched analyses if they do not differ substantially from the ones obtained after conditioning on the matching variables. Although conceptually simple, this rule requires that an extensive series of logistic regression models be evaluated by both the conditional and unconditional maximum likelihood methods. Most computer programs for logistic regression employ only one maximum likelihood method, which requires that the analyses be performed in separate steps. This paper describes a Pascal microcomputer (IBM PC) program that performs multiple logistic regression by both maximum likelihood estimation methods, which obviates the need for switching between programs to obtain relative risk estimates from both matched and unmatched analyses. The program calculates most standard statistics and allows factoring of categorical or continuous variables by two distinct methods of contrast. A built-in, descriptive statistics option allows the user to inspect the distribution of cases and controls across categories of any given variable.
Multiple linear regression and regression with time series error models in forecasting PM10 concentrations in Peninsular Malaysia.

PubMed

Ng, Kar Yong; Awang, Norhashidah

2018-01-06

Frequent haze occurrences in Malaysia have made the management of PM10 (particulate matter with aerodynamic diameter less than 10 μm) pollution a critical task. This requires knowledge of the factors associated with PM10 variation and good forecasts of PM10 concentrations. Hence, this paper demonstrates the prediction of 1-day-ahead daily average PM10 concentrations based on predictor variables including meteorological parameters and gaseous pollutants. Three different models were built: a multiple linear regression (MLR) model with lagged predictor variables (MLR1), an MLR model with lagged predictor variables and PM10 concentrations (MLR2), and a regression with time series error (RTSE) model. The findings revealed that humidity, temperature, wind speed, wind direction, carbon monoxide and ozone were the main factors explaining the PM10 variation in Peninsular Malaysia. Comparison among the three models showed that the MLR2 model was on the same level as the RTSE model in terms of forecasting accuracy, while the MLR1 model was the worst.
Development of Multiple Regression Equations To Predict Fourth Graders' Achievement in Reading and Selected Content Areas.

ERIC Educational Resources Information Center

Hafner, Lawrence E.

A study developed a multiple regression prediction equation for each of six selected achievement variables in a popular standardized test of achievement. Subjects, 42 fourth-grade pupils randomly selected across several classes in a large elementary school in a north Florida city, were administered several standardized tests to determine predictor…
Regression Analysis of Optical Coherence Tomography Disc Variables for Glaucoma Diagnosis.

PubMed

Richter, Grace M; Zhang, Xinbo; Tan, Ou; Francis, Brian A; Chopra, Vikas; Greenfield, David S; Varma, Rohit; Schuman, Joel S; Huang, David

2016-08-01

To report diagnostic accuracy of optical coherence tomography (OCT) disc variables using both time-domain (TD) and Fourier-domain (FD) OCT, and to improve the use of OCT disc variable measurements for glaucoma diagnosis through regression analyses that adjust for optic disc size and axial length-based magnification error. Observational, cross-sectional. In total, 180 normal eyes of 112 participants and 180 eyes of 138 participants with perimetric glaucoma from the Advanced Imaging for Glaucoma Study. Diagnostic variables evaluated from TD-OCT and FD-OCT were: disc area, rim area, rim volume, optic nerve head volume, vertical cup-to-disc ratio (CDR), and horizontal CDR. These were compared with overall retinal nerve fiber layer thickness and ganglion cell complex. Regression analyses were performed that corrected for optic disc size and axial length. Area-under-receiver-operating curves (AUROC) were used to assess diagnostic accuracy before and after the adjustments. An index based on multiple logistic regression that combined optic disc variables with axial length was also explored with the aim of improving diagnostic accuracy of disc variables. Comparison of diagnostic accuracy of disc variables, as measured by AUROC. The unadjusted disc variables with the highest diagnostic accuracies were: rim volume for TD-OCT (AUROC=0.864) and vertical CDR (AUROC=0.874) for FD-OCT. Magnification correction significantly worsened diagnostic accuracy for rim variables, and while optic disc size adjustments partially restored diagnostic accuracy, the adjusted AUROCs were still lower. Axial length adjustments to disc variables in the form of multiple logistic regression indices led to a slight but insignificant improvement in diagnostic accuracy. Our various regression approaches were not able to significantly improve disc-based OCT glaucoma diagnosis. However, disc rim area and vertical CDR had very high diagnostic accuracy, and these disc variables can serve to complement additional OCT measurements for diagnosis of glaucoma.
A Ten Year Study of Salary Differential by Sex through a Regression Methodology.

ERIC Educational Resources Information Center

Williams, John Delane; And Others

A 10-year study of salary differential by sex was undertaken at the University of North Dakota using a multiple regression methodology, with rank, discipline, degree, years in department, years in current rank, and sex as predictors. The sex variable evidenced lower salaries for women when controlling for the other variables throughout the study…
Most Likely to Succeed: Exploring Predictor Variables for the Counselor Preparation Comprehensive Examination

ERIC Educational Resources Information Center

Hartwig, Elizabeth Kjellstrand; Van Overschelde, James P.

2016-01-01

The authors investigated predictor variables for the Counselor Preparation Comprehensive Examination (CPCE) to examine whether academic variables, demographic variables, and test version were associated with graduate counseling students' CPCE scores. Multiple regression analyses revealed all 3 variables were statistically significant predictors of…
Causal relationship model between variables using linear regression to improve professional commitment of lecturer

NASA Astrophysics Data System (ADS)

Setyaningsih, S.

2017-01-01

Building a leading university requires professional commitment from its lecturers. Commitment is measured through willpower, loyalty, pride, and integrity as a professional lecturer. A total of 135 of 337 university lecturers were sampled to collect data. Data were analyzed using validity and reliability tests and multiple linear regression. Many studies have found a link to the commitment of lecturers, but the underlying cause of the causal relationship is generally neglected. These results indicate that the professional commitment of lecturers is affected by the variables empowerment, academic culture, and trust. The relationship model between variables is composed of three substructures. The first substructure consists of the endogenous variable professional commitment and three exogenous variables, namely academic culture, empowerment and trust, as well as the residual variable ε_y. The second substructure consists of one endogenous variable, trust, and two exogenous variables, namely empowerment and academic culture, as well as the residual variable ε_3. The third substructure consists of one endogenous variable, academic culture, and one exogenous variable, empowerment, as well as the residual variable ε_2. Multiple linear regression was used in the path model for each substructure. The results supported the hypothesis, and these findings provide empirical evidence that increasing these variables will increase the professional commitment of the lecturers.
Simultaneous multiple non-crossing quantile regression estimation using kernel constraints

PubMed Central

Liu, Yufeng; Wu, Yichao

2011-01-01

Quantile regression (QR) is a very useful statistical tool for learning the relationship between the response variable and covariates. For many applications, one often needs to estimate multiple conditional quantile functions of the response variable given covariates. Although one can estimate multiple quantiles separately, it is of great interest to estimate them simultaneously. One advantage of simultaneous estimation is that multiple quantiles can share strength among them to gain better estimation accuracy than individually estimated quantile functions. Another important advantage of joint estimation is the feasibility of incorporating simultaneous non-crossing constraints of QR functions. In this paper, we propose a new kernel-based multiple QR estimation technique, namely simultaneous non-crossing quantile regression (SNQR). We use kernel representations for QR functions and apply constraints on the kernel coefficients to avoid crossing. Both unregularised and regularised SNQR techniques are considered. Asymptotic properties such as asymptotic normality of linear SNQR and oracle properties of the sparse linear SNQR are developed. Our numerical results demonstrate the competitive performance of our SNQR over the original individual QR estimation. PMID:22190842
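Two ingredients mentioned in this abstract can be sketched in a few lines: the check (pinball) loss that quantile regression minimizes, and a test for whether two fitted quantile curves cross. This is not the kernel-based SNQR estimator itself; the function names and inputs are assumptions for illustration only.

```python
def pinball_loss(y_true, y_pred, tau):
    """Average check loss for quantile level tau (0 < tau < 1)."""
    total = 0.0
    for y, q in zip(y_true, y_pred):
        diff = y - q
        # Under-prediction is weighted by tau, over-prediction by (1 - tau).
        total += tau * diff if diff >= 0 else (tau - 1.0) * diff
    return total / len(y_true)


def quantiles_cross(lower_preds, upper_preds):
    """Return True if a lower-quantile fit exceeds the upper-quantile fit anywhere."""
    return any(lo > hi for lo, hi in zip(lower_preds, upper_preds))
```

When quantile functions are estimated separately, nothing prevents `quantiles_cross` from returning True on some inputs; joint estimation with non-crossing constraints is designed to rule that out.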
Determination of osteoporosis risk factors using a multiple logistic regression model in postmenopausal Turkish women.

PubMed

Akkus, Zeki; Camdeviren, Handan; Celik, Fatma; Gur, Ali; Nas, Kemal

2005-09-01

To determine the risk factors of osteoporosis using a multiple binary logistic regression method and to assess the risk variables for osteoporosis, which is a major and growing health problem in many countries. We present a case-control study consisting of 126 postmenopausal healthy women as the control group and 225 postmenopausal osteoporotic women as the case group. The study was carried out in the Department of Physical Medicine and Rehabilitation, Dicle University, Diyarbakir, Turkey between 1999 and 2002. The data from the 351 participants were collected using a standard questionnaire that contains 43 variables. A multiple logistic regression model was then used to evaluate the data and to find the best regression model. We correctly classified 80.1% (281/351) of the participants using the regression model. Furthermore, the specificity of the model was 67% (84/126) in the control group, while the sensitivity was 88% (197/225) in the case group. We found the distribution of the standardized residual values for the final model to be exponential using the Kolmogorov-Smirnov test (p=0.193). The receiver operating characteristic curve indicated that the model was successful in predicting patients at risk for osteoporosis. This study suggests that low levels of dietary calcium intake, physical activity, education, and longer duration of menopause are independent predictors of the risk of low bone density in our population. Adequate dietary calcium intake in combination with maintaining a daily physical activity, increasing educational level, decreasing birth rate, and duration of breast-feeding may contribute to healthy bones and play a role in practical prevention of osteoporosis in Southeast Anatolia. In addition, the findings of the present study indicate that the use of a multivariate statistical method such as multiple logistic regression in osteoporosis, which may be influenced by many variables, is better than univariate statistical evaluation.
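The classification figures quoted in this abstract follow from the counts it reports. The sketch below recomputes them; the function name and dictionary keys are illustrative assumptions, and the false negative and false positive counts are derived by subtraction from the group sizes.

```python
def classification_summary(true_pos, false_neg, true_neg, false_pos):
    """Return accuracy, sensitivity and specificity as fractions."""
    total = true_pos + false_neg + true_neg + false_pos
    return {
        "accuracy": (true_pos + true_neg) / total,
        "sensitivity": true_pos / (true_pos + false_neg),
        "specificity": true_neg / (true_neg + false_pos),
    }


# Counts from the abstract: 197 of 225 cases and 84 of 126 controls
# were classified correctly.
summary = classification_summary(
    true_pos=197, false_neg=225 - 197, true_neg=84, false_pos=126 - 84
)

if __name__ == "__main__":
    for name, value in summary.items():
        print(f"{name}: {value:.1%}")
```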
A general equation to obtain multiple cut-off scores on a test from multinomial logistic regression.

PubMed

Bersabé, Rosa; Rivas, Teresa

2010-05-01

The authors derive a general equation to compute multiple cut-offs on a total test score in order to classify individuals into more than two ordinal categories. The equation is derived from the multinomial logistic regression (MLR) model, which is an extension of the binary logistic regression (BLR) model to accommodate polytomous outcome variables. From this analytical procedure, cut-off scores are established at the test score (the predictor variable) at which an individual is as likely to be in category j as in category j+1 of an ordinal outcome variable. The application of the complete procedure is illustrated by an example with data from an actual study on eating disorders. In this example, two cut-off scores on the Eating Attitudes Test (EAT-26) scores are obtained in order to classify individuals into three ordinal categories: asymptomatic, symptomatic and eating disorder. Diagnoses were made from the responses to a self-report (Q-EDD) that operationalises DSM-IV criteria for eating disorders. Alternatives to the MLR model to set multiple cut-off scores are discussed.
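The abstract does not reproduce the equation itself, so the following is only a sketch of how such a cut-off can be derived under a standard baseline-category multinomial logit with a single predictor. The symbols $\alpha_j$, $\beta_j$ and $x_j^{*}$ are assumptions introduced here, not the authors' notation.

```latex
% Assumed model: P(Y = j \mid x) \propto \exp(\alpha_j + \beta_j x).
% An individual is as likely to be in category j as in category j+1 when
\begin{align}
  \alpha_j + \beta_j x &= \alpha_{j+1} + \beta_{j+1} x \\
  \Longrightarrow\quad x_j^{*} &= \frac{\alpha_j - \alpha_{j+1}}{\beta_{j+1} - \beta_j},
  \qquad \beta_{j+1} \neq \beta_j .
\end{align}
```

With three ordinal categories, as in the EAT-26 example, this gives two cut-off scores, one for each pair of adjacent categories.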

Sample size determination for logistic regression on a logit-normal distribution.

PubMed

Kim, Seongho; Heath, Elisabeth; Heilbrun, Lance

2017-06-01

Although the sample size for simple logistic regression can be readily determined using currently available methods, the sample size calculation for multiple logistic regression requires some additional information, such as the coefficient of determination (R²) of a covariate of interest with other covariates, which is often unavailable in practice. The response variable of logistic regression follows a logit-normal distribution which can be generated from a logistic transformation of a normal distribution. Using this property of logistic regression, we propose new methods of determining the sample size for simple and multiple logistic regressions using a normal transformation of outcome measures. Simulation studies and a motivating example show several advantages of the proposed methods over the existing methods: (i) no need for R² for multiple logistic regression, (ii) available interim or group-sequential designs, and (iii) much smaller required sample size.
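
The logit-normal property mentioned in this abstract can be sketched in a few lines of Python: draw from a normal distribution, apply the logistic transformation, and recover the normal scale with the logit. The function name and the parameter values are hypothetical, and this only illustrates the transformation, not the authors' sample size method.

```python
import math
import random

def logit_normal_sample(mu, sigma, n, seed=0):
    """Draw n values whose logit is normally distributed with mean mu and sd sigma."""
    rng = random.Random(seed)
    return [1.0 / (1.0 + math.exp(-rng.gauss(mu, sigma))) for _ in range(n)]

sample = logit_normal_sample(mu=0.0, sigma=1.0, n=1000)

# Applying the logit maps the sample back to the normal scale.
back_on_normal_scale = [math.log(p / (1.0 - p)) for p in sample]
mean_logit = sum(back_on_normal_scale) / len(back_on_normal_scale)
print(min(sample), max(sample), mean_logit)
```
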
Analysis of potential factors affecting microbiological cultures in tissue donors during procurement.

PubMed

Lannau, B; Van Geyt, C; Van Maele, G; Beele, H

2015-03-01

During the procurement of musculoskeletal grafts contamination may occur. As this might be detrimental for the acceptor, it is important to know which variables influence this occurrence and to alter procurement protocols accordingly. From 2004 to 2012 we gathered information on 6,428 allografts obtained from 291 donors. Using a multiple regression model we attempted to determine the factors that influence the contamination risk during procurement. We used the following variables: cause of death, type of hospital (i.e. university hospital vs. general hospital), previous blood vessel donation, previous organ donation, donor age, time between death and the start of the procurement, duration of the procurement, number of people attending the procurement and the number of procured grafts. The multiple regression model was only able to explain 5% of the variability of the used outcome variable. None of the variables examined appear to have an important influence on the contamination risk.
Simple linear and multivariate regression models.

PubMed

Rodríguez del Águila, M M; Benítez-Parejo, N

2011-01-01

In biomedical research it is common to find problems in which we wish to relate a response variable to one or more variables capable of describing the behaviour of the former variable by means of mathematical models. Regression techniques are used to this effect, in which an equation is determined relating the two variables. While such equations can have different forms, linear equations are the most widely used form and are easy to interpret. The present article describes simple and multiple linear regression models, how they are calculated, and how their applicability assumptions are checked. Illustrative examples are provided, based on the use of the freely accessible R program. Copyright © 2011 SEICAP. Published by Elsevier Espana. All rights reserved.
Overcoming multicollinearity in multiple regression using correlation coefficient

NASA Astrophysics Data System (ADS)

Zainodin, H. J.; Yap, S. J.

2013-09-01

Multicollinearity occurs when independent variables are highly correlated with one another. In this case, it is difficult to distinguish the contributions of these independent variables to the dependent variable, because they compete to explain much of the same variance. Multicollinearity also violates an assumption of multiple regression: that there is no collinearity among the candidate independent variables. An alternative approach is therefore introduced to overcome the multicollinearity problem and arrive at a well-represented model. The approach removes the variables that are the source of multicollinearity on the basis of the correlation coefficients in the full correlation matrix. Using the full correlation matrix makes it straightforward to implement the removal with Excel functions. The procedure is found to be easier and time-saving, especially when dealing with a large number of independent variables in a model and a large number of possible models. This paper presents, compares, and implements the procedure in detail.
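
A correlation-based screen of this kind can be sketched as follows. This is a simple greedy variant written for illustration, not the authors' Excel procedure: a variable is kept only if its absolute Pearson correlation with every variable already kept stays below a threshold. The threshold of 0.95, the function names, and the data are all made up.

```python
import math

def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

def drop_collinear(columns, threshold=0.95):
    """Keep each variable only if |r| with all previously kept variables is below threshold."""
    kept = []
    for name in columns:
        if all(abs(pearson(columns[name], columns[k])) < threshold for k in kept):
            kept.append(name)
    return kept

# Made-up data: x2 is almost exactly twice x1, x3 is unrelated to both.
data = {
    "x1": [1.0, 2.0, 3.0, 4.0, 5.0, 6.0],
    "x2": [2.1, 3.9, 6.2, 7.8, 10.1, 12.0],
    "x3": [5.0, 1.0, 4.0, 2.0, 6.0, 3.0],
}
print(drop_collinear(data))
```

Because the screen is greedy, the result depends on the order of the variables; here the later of two highly correlated variables is the one removed.
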
Nonparametric Bayesian Multiple Imputation for Incomplete Categorical Variables in Large-Scale Assessment Surveys

ERIC Educational Resources Information Center

Si, Yajuan; Reiter, Jerome P.

2013-01-01

In many surveys, the data comprise a large number of categorical variables that suffer from item nonresponse. Standard methods for multiple imputation, like log-linear models or sequential regression imputation, can fail to capture complex dependencies and can be difficult to implement effectively in high dimensions. We present a fully Bayesian,…
Multiplication factor versus regression analysis in stature estimation from hand and foot dimensions.

PubMed

Krishan, Kewal; Kanchan, Tanuj; Sharma, Abhilasha

2012-05-01

Estimation of stature is an important parameter in the identification of human remains in forensic examinations. The present study aims to compare the reliability and accuracy of stature estimation and to demonstrate the variability between estimated and actual stature using the multiplication factor and regression analysis methods. The study is based on a sample of 246 subjects (123 males and 123 females) from North India aged between 17 and 20 years. Four anthropometric measurements (hand length, hand breadth, foot length and foot breadth), taken on the left side of each subject, were included in the study. Stature was measured using standard anthropometric techniques. Multiplication factors were calculated and linear regression models were derived for estimation of stature from hand and foot dimensions. The derived multiplication factors and regression formulae were applied to the hand and foot measurements in the study sample. The estimated stature from the multiplication factors and regression analysis was compared with the actual stature to find the error in estimated stature. The results indicate that the range of error in estimation of stature from the regression analysis method is less than that of the multiplication factor method, thus confirming that the regression analysis method is better than multiplication factor analysis in stature estimation. Copyright © 2012 Elsevier Ltd and Faculty of Forensic and Legal Medicine. All rights reserved.
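
The two estimation methods compared in this abstract can be contrasted with a small Python sketch. The measurements below are invented for illustration and are not the study's data. The multiplication factor is taken here as the mean ratio of stature to foot length, which is one common definition and an assumption on our part.

```python
def fit_line(x, y):
    """Ordinary least squares for y = intercept + slope * x."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    slope = (sum((a - mx) * (b - my) for a, b in zip(x, y))
             / sum((a - mx) ** 2 for a in x))
    return my - slope * mx, slope

def sse(errors):
    """Sum of squared errors."""
    return sum(e * e for e in errors)

# Made-up foot lengths and statures in centimetres.
foot = [22.5, 23.0, 24.1, 25.0, 25.8, 26.4]
stature = [155.0, 158.5, 163.0, 168.0, 171.5, 176.0]

# Multiplication factor method: stature is estimated as factor * foot length.
factor = sum(s / f for f, s in zip(foot, stature)) / len(foot)
mf_errors = [s - factor * f for f, s in zip(foot, stature)]

# Regression method: stature is estimated as intercept + slope * foot length.
intercept, slope = fit_line(foot, stature)
reg_errors = [s - (intercept + slope * f) for f, s in zip(foot, stature)]

print("MF error range:        ", min(mf_errors), max(mf_errors))
print("Regression error range:", min(reg_errors), max(reg_errors))
print("SSE (MF, regression):  ", sse(mf_errors), sse(reg_errors))
```

On the fitting sample, least squares with an intercept cannot have a larger sum of squared errors than any single multiplier, because a line through the origin is a special case of the fitted line. The range of error, which is what the study reports, carries no such guarantee and has to be checked empirically.
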
Accounting for estimated IQ in neuropsychological test performance with regression-based techniques.

PubMed

Testa, S Marc; Winicki, Jessica M; Pearlson, Godfrey D; Gordon, Barry; Schretlen, David J

2009-11-01

Regression-based normative techniques account for variability in test performance associated with multiple predictor variables and generate expected scores based on algebraic equations. Using this approach, we show that estimated IQ, based on oral word reading, accounts for 1-9% of the variability beyond that explained by individual differences in age, sex, race, and years of education for most cognitive measures. These results confirm that adding estimated "premorbid" IQ to demographic predictors in multiple regression models can incrementally improve the accuracy with which regression-based norms (RBNs) benchmark expected neuropsychological test performance in healthy adults. It remains to be seen whether the incremental variance in test performance explained by estimated "premorbid" IQ translates to improved diagnostic accuracy in patient samples. We describe these methods, and illustrate the step-by-step application of RBNs with two cases. We also discuss the rationale, assumptions, and caveats of this approach. More broadly, we note that adjusting test scores for age and other characteristics might actually decrease the accuracy with which test performance predicts absolute criteria, such as the ability to drive or live independently.
Relative efficiency of joint-model and full-conditional-specification multiple imputation when conditional models are compatible: The general location model.

PubMed

Seaman, Shaun R; Hughes, Rachael A

2018-06-01

Estimating the parameters of a regression model of interest is complicated by missing data on the variables in that model. Multiple imputation is commonly used to handle these missing data. Joint model multiple imputation and full-conditional specification multiple imputation are known to yield imputed data with the same asymptotic distribution when the conditional models of full-conditional specification are compatible with that joint model. We show that this asymptotic equivalence of imputation distributions does not imply that joint model multiple imputation and full-conditional specification multiple imputation will also yield asymptotically equally efficient inference about the parameters of the model of interest, nor that they will be equally robust to misspecification of the joint model. When the conditional models used by full-conditional specification multiple imputation are linear, logistic and multinomial regressions, these are compatible with a restricted general location joint model. We show that multiple imputation using the restricted general location joint model can be substantially more asymptotically efficient than full-conditional specification multiple imputation, but this typically requires very strong associations between variables. When associations are weaker, the efficiency gain is small. Moreover, full-conditional specification multiple imputation is shown to be potentially much more robust than joint model multiple imputation using the restricted general location model to misspecification of that model when there is substantial missingness in the outcome variable.
Future Performance Trend Indicators: A Current Value Approach to Human Resources Accounting. Report III. Multivariate Predictions of Organizational Performance Across Time.

ERIC Educational Resources Information Center

Pecorella, Patricia A.; Bowers, David G.

Multiple regression in a double cross-validated design was used to predict two performance measures (total variable expense and absence rate) by multi-month period in five industrial firms. The regressions do cross-validate, and produce multiple coefficients which display both concurrent and predictive effects, peaking 18 months to two years…
Covariate Selection for Multilevel Models with Missing Data

PubMed Central

Marino, Miguel; Buxton, Orfeu M.; Li, Yi

2017-01-01

Missing covariate data hampers variable selection in multilevel regression settings. Current variable selection techniques for multiply-imputed data commonly address missingness in the predictors through list-wise deletion and stepwise-selection methods which are problematic. Moreover, most variable selection methods are developed for independent linear regression models and do not accommodate multilevel mixed effects regression models with incomplete covariate data. We develop a novel methodology that is able to perform covariate selection across multiply-imputed data for multilevel random effects models when missing data is present. Specifically, we propose to stack the multiply-imputed data sets from a multiple imputation procedure and to apply a group variable selection procedure through group lasso regularization to assess the overall impact of each predictor on the outcome across the imputed data sets. Simulations confirm the advantageous performance of the proposed method compared with the competing methods. We applied the method to reanalyze the Healthy Directions-Small Business cancer prevention study, which evaluated a behavioral intervention program targeting multiple risk-related behaviors in a working-class, multi-ethnic population. PMID:28239457
Vocational Teacher Stress and the Educational System.

ERIC Educational Resources Information Center

Adams, Elaine; Heath-Camp, Betty; Camp, William G.

1999-01-01

A multiple regression analysis of data from 235 secondary vocational teachers in Virginia found that educational system-related variables explained most teacher stress. The most important explanatory variables were task stress and role overload. (SK)
Characterizing Individual Differences in Functional Connectivity Using Dual-Regression and Seed-Based Approaches

PubMed Central

Smith, David V.; Utevsky, Amanda V.; Bland, Amy R.; Clement, Nathan; Clithero, John A.; Harsch, Anne E. W.; Carter, R. McKell; Huettel, Scott A.

2014-01-01

A central challenge for neuroscience lies in relating inter-individual variability to the functional properties of specific brain regions. Yet, considerable variability exists in the connectivity patterns between different brain areas, potentially producing reliable group differences. Using sex differences as a motivating example, we examined two separate resting-state datasets comprising a total of 188 human participants. Both datasets were decomposed into resting-state networks (RSNs) using a probabilistic spatial independent components analysis (ICA). We estimated voxelwise functional connectivity with these networks using a dual-regression analysis, which characterizes the participant-level spatiotemporal dynamics of each network while controlling for (via multiple regression) the influence of other networks and sources of variability. We found that males and females exhibit distinct patterns of connectivity with multiple RSNs, including both visual and auditory networks and the right frontal-parietal network. These results replicated across both datasets and were not explained by differences in head motion, data quality, brain volume, cortisol levels, or testosterone levels. Importantly, we also demonstrate that dual-regression functional connectivity is better at detecting inter-individual variability than traditional seed-based functional connectivity approaches. Our findings characterize robust—yet frequently ignored—neural differences between males and females, pointing to the necessity of controlling for sex in neuroscience studies of individual differences. Moreover, our results highlight the importance of employing network-based models to study variability in functional connectivity. PMID:24662574
A psycholinguistic database for traditional Chinese character naming.

PubMed

Chang, Ya-Ning; Hsu, Chun-Hsien; Tsai, Jie-Li; Chen, Chien-Liang; Lee, Chia-Ying

2016-03-01

In this study, we aimed to provide a large-scale set of psycholinguistic norms for 3,314 traditional Chinese characters, along with their naming reaction times (RTs), collected from 140 Chinese speakers. The lexical and semantic variables in the database include frequency, regularity, familiarity, consistency, number of strokes, homophone density, semantic ambiguity rating, phonetic combinability, semantic combinability, and the number of disyllabic compound words formed by a character. Multiple regression analyses were conducted to examine the predictive powers of these variables for the naming RTs. The results demonstrated that these variables could account for a significant portion of variance (55.8%) in the naming RTs. An additional multiple regression analysis was conducted to demonstrate the effects of consistency and character frequency. Overall, the regression results were consistent with the findings of previous studies on Chinese character naming. This database should be useful for research into Chinese language processing, Chinese education, or cross-linguistic comparisons. The database can be accessed via an online inquiry system (http://ball.ling.sinica.edu.tw/namingdatabase/index.html).
A Statistical Multimodel Ensemble Approach to Improving Long-Range Forecasting in Pakistan

DTIC Science & Technology

2012-03-01

Impact of global warming on monsoon variability in Pakistan. J. Anim. Pl. Sci., 21, no. 1, 107–110. Gillies, S., T. Murphree, and D. Meyer, 2012...are generated by multiple regression models that relate globally distributed oceanic and atmospheric predictors to local predictands. The predictands are
Reduction of shading-derived artifacts in skin chromophore imaging without measurements or assumptions about the shape of the subject

NASA Astrophysics Data System (ADS)

Yoshida, Kenichiro; Nishidate, Izumi; Ojima, Nobutoshi; Iwata, Kayoko

2014-01-01

To quantitatively evaluate skin chromophores over a wide region of curved skin surface, we propose an approach that suppresses the effect of the shading-derived error in the reflectance on the estimation of chromophore concentrations, without sacrificing the accuracy of that estimation. In our method, we use multiple regression analysis, assuming the absorbance spectrum as the response variable and the extinction coefficients of melanin, oxygenated hemoglobin, and deoxygenated hemoglobin as the predictor variables. The concentrations of melanin and total hemoglobin are determined from the multiple regression coefficients using compensation formulae (CF) based on the diffuse reflectance spectra derived from a Monte Carlo simulation. To suppress the shading-derived error, we investigated three different combinations of multiple regression coefficients for the CF. In vivo measurements with the forearm skin demonstrated that the proposed approach can reduce the estimation errors that are due to shading-derived errors in the reflectance. With the best combination of multiple regression coefficients, we estimated that the ratio of the error to the chromophore concentrations is about 10%. The proposed method does not require any measurements or assumptions about the shape of the subjects; this is an advantage over other studies related to the reduction of shading-derived errors.
Quantitative assessment of cervical vertebral maturation using cone beam computed tomography in Korean girls.

PubMed

Byun, Bo-Ram; Kim, Yong-Il; Yamaguchi, Tetsutaro; Maki, Koutaro; Son, Woo-Sung

2015-01-01

This study aimed to examine the correlation between skeletal maturation status and parameters from the odontoid process/body of the second vertebra and the bodies of the third and fourth cervical vertebrae, and simultaneously to build multiple regression models able to estimate skeletal maturation status in Korean girls. Hand-wrist radiographs and cone beam computed tomography (CBCT) images were obtained from 74 Korean girls (6-18 years of age). CBCT-generated cervical vertebral maturation (CVM) was used to demarcate the odontoid process and the body of the second cervical vertebra, based on the dentocentral synchondrosis. Correlation coefficient analysis and multiple linear regression analysis were used for each parameter of the cervical vertebrae (P < 0.05). Forty-seven of 64 parameters from CBCT-generated CVM (independent variables) exhibited statistically significant correlations (P < 0.05). The multiple regression model with the greatest R² had six parameters (PH2/W2, UW2/W2, (OH+AH2)/LW2, UW3/LW3, D3, and H4/W4) as independent variables with a variance inflation factor (VIF) of <2. CBCT-generated CVM was able to include parameters from the second cervical vertebral body and odontoid process, respectively, for the multiple regression models. This suggests that quantitative analysis might be used to estimate skeletal maturation status.
Prediction of hearing outcomes by multiple regression analysis in patients with idiopathic sudden sensorineural hearing loss.

PubMed

Suzuki, Hideaki; Tabata, Takahisa; Koizumi, Hiroki; Hohchi, Nobusuke; Takeuchi, Shoko; Kitamura, Takuro; Fujino, Yoshihisa; Ohbuchi, Toyoaki

2014-12-01

This study aimed to create a multiple regression model for predicting hearing outcomes of idiopathic sudden sensorineural hearing loss (ISSNHL). The participants were 205 consecutive patients (205 ears) with ISSNHL (hearing level ≥ 40 dB, interval between onset and treatment ≤ 30 days). They received systemic steroid administration combined with intratympanic steroid injection. Data were examined by simple and multiple regression analyses. Three hearing indices (percentage hearing improvement, hearing gain, and posttreatment hearing level [HLpost]) and 7 prognostic factors (age, days from onset to treatment, initial hearing level, initial hearing level at low frequencies, initial hearing level at high frequencies, presence of vertigo, and contralateral hearing level) were included in the multiple regression analysis as dependent and explanatory variables, respectively. In the simple regression analysis, the percentage hearing improvement, hearing gain, and HLpost showed significant correlation with 2, 5, and 6 of the 7 prognostic factors, respectively. The multiple correlation coefficients were 0.396, 0.503, and 0.714 for the percentage hearing improvement, hearing gain, and HLpost, respectively. Predicted values of HLpost calculated by the multiple regression equation were reliable with 70% probability with a 40-dB-width prediction interval. Prediction of HLpost by the multiple regression model may be useful to estimate the hearing prognosis of ISSNHL. © The Author(s) 2014.
Using Logistic Regression To Predict the Probability of Debris Flows Occurring in Areas Recently Burned By Wildland Fires

USGS Publications Warehouse

Rupert, Michael G.; Cannon, Susan H.; Gartner, Joseph E.

2003-01-01

Logistic regression was used to predict the probability of debris flows occurring in areas recently burned by wildland fires. Multiple logistic regression is conceptually similar to multiple linear regression because statistical relations between one dependent variable and several independent variables are evaluated. In logistic regression, however, the dependent variable is transformed to a binary variable (debris flow did or did not occur), and the actual probability of the debris flow occurring is statistically modeled. Data from 399 basins located within 15 wildland fires that burned during 2000-2002 in Colorado, Idaho, Montana, and New Mexico were evaluated. More than 35 independent variables describing the burn severity, geology, land surface gradient, rainfall, and soil properties were evaluated. The models were developed as follows: (1) Basins that did and did not produce debris flows were delineated from National Elevation Data using a Geographic Information System (GIS). (2) Data describing the burn severity, geology, land surface gradient, rainfall, and soil properties were determined for each basin. These data were then downloaded to a statistics software package for analysis using logistic regression. (3) Relations between the occurrence/non-occurrence of debris flows and burn severity, geology, land surface gradient, rainfall, and soil properties were evaluated and several preliminary multivariate logistic regression models were constructed. All possible combinations of independent variables were evaluated to determine which combination produced the most effective model. The multivariate model that best predicted the occurrence of debris flows was selected. (4) The multivariate logistic regression model was entered into a GIS, and a map showing the probability of debris flows was constructed. 
The most effective model incorporates the percentage of each basin with slope greater than 30 percent, percentage of land burned at medium and high burn severity in each basin, particle size sorting, average storm intensity (millimeters per hour), soil organic matter content, soil permeability, and soil drainage. The results of this study demonstrate that logistic regression is a valuable tool for predicting the probability of debris flows occurring in recently-burned landscapes.
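
The way a fitted logistic regression model turns basin characteristics into a probability can be sketched in Python as below. The predictor names loosely follow the variables listed in the abstract, but the names, the intercept, and every coefficient are hypothetical placeholders rather than the published model.

```python
import math

def debris_flow_probability(intercept, coefficients, predictors):
    """Logistic model: p = 1 / (1 + exp(-(b0 + sum of b_i * x_i)))."""
    z = intercept + sum(coefficients[name] * predictors[name] for name in coefficients)
    return 1.0 / (1.0 + math.exp(-z))

# Hypothetical coefficients, for illustration only.
intercept = -6.0
coefficients = {
    "pct_slope_gt_30": 0.03,        # percent of basin with slope greater than 30 percent
    "pct_burned_med_high": 0.02,    # percent burned at medium and high severity
    "storm_intensity_mm_h": 0.10,   # average storm intensity, millimeters per hour
}

# Hypothetical basin.
basin = {"pct_slope_gt_30": 60.0, "pct_burned_med_high": 80.0, "storm_intensity_mm_h": 20.0}

p = debris_flow_probability(intercept, coefficients, basin)
print(f"estimated probability of a debris flow: {p:.2f}")
```

Evaluating such a function for every basin, or every map cell, is what produces a probability map of the kind described in step (4).
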
Correlation and simple linear regression.

PubMed

Eberly, Lynn E

2007-01-01

This chapter highlights important steps in using correlation and simple linear regression to address scientific questions about the association of two continuous variables with each other. These steps include estimation and inference, assessing model fit, the connection between regression and ANOVA, and study design. Examples in microbiology are used throughout. This chapter provides a framework that is helpful in understanding more complex statistical techniques, such as multiple linear regression, linear mixed effects models, logistic regression, and proportional hazards regression.
Do climate variables and human density affect Achatina fulica (Bowditch) (Gastropoda: Pulmonata) shell length, total weight and condition factor?

PubMed

Albuquerque, F S; Peso-Aguiar, M C; Assunção-Albuquerque, M J T; Gálvez, L

2009-08-01

The length-weight relationship and condition factor have been broadly investigated in snails to obtain the index of physical condition of populations and evaluate habitat quality. Herein, our goal was to describe the best predictors that explain Achatina fulica biometrical parameters and well-being in a recently introduced population. From November 2001 to November 2002, monthly snail samples were collected in Lauro de Freitas City, Bahia, Brazil. Shell length and total weight were measured in the laboratory and the potential curve and condition factor were calculated. Five environmental variables were considered: temperature range, mean temperature, humidity, precipitation and human density. Multiple regressions were used to generate models including multiple predictors, via model selection approach, and then ranked with AIC criteria. Partial regressions were used to obtain the separated coefficients of determination of climate and human density models. A total of 1,460 individuals were collected, with shell lengths ranging from 4.8 to 102.5 mm (mean: 42.18 mm). The relationship between total length and total weight revealed that Achatina fulica presented a negative allometric growth. Simple regression indicated that humidity has a significant influence on A. fulica total length and weight. Temperature range was the main variable that influenced the condition factor. Multiple regressions showed that climatic and human variables explain a small proportion of the variance in shell length and total weight, but may explain up to 55.7% of the condition factor variance. Consequently, we believe that the well-being and biometric parameters of A. fulica can be influenced by climatic and human density factors.

A Computer Program for Preliminary Data Analysis

Treesearch

Dennis L. Schweitzer

1967-01-01

ABSTRACT. -- A computer program written in FORTRAN has been designed to summarize data. Class frequencies, means, and standard deviations are printed for as many as 100 independent variables. Cross-classifications of an observed dependent variable and of a dependent variable predicted by a multiple regression equation can also be generated.
Determining Sample Size for Accurate Estimation of the Squared Multiple Correlation Coefficient.

ERIC Educational Resources Information Center

Algina, James; Olejnik, Stephen

2000-01-01

Discusses determining sample size for estimation of the squared multiple correlation coefficient and presents regression equations that permit determination of the sample size for estimating this parameter for up to 20 predictor variables. (SLD)
Cephalometric landmark detection in dental x-ray images using convolutional neural networks

NASA Astrophysics Data System (ADS)

Lee, Hansang; Park, Minseok; Kim, Junmo

2017-03-01

In dental X-ray images, an accurate detection of cephalometric landmarks plays an important role in clinical diagnosis, treatment and surgical decisions for dental problems. In this work, we propose an end-to-end deep learning system for cephalometric landmark detection in dental X-ray images, using convolutional neural networks (CNN). For detecting 19 cephalometric landmarks in dental X-ray images, we develop a detection system using CNN-based coordinate-wise regression systems. By viewing x- and y-coordinates of all landmarks as 38 independent variables, multiple CNN-based regression systems are constructed to predict the coordinate variables from input X-ray images. First, each coordinate variable is normalized by the length of either height or width of an image. For each normalized coordinate variable, a CNN-based regression system is trained on training images and corresponding coordinate variable, which is a variable to be regressed. We train 38 regression systems with the same CNN structure on coordinate variables, respectively. Finally, we compute 38 coordinate variables with these trained systems from unseen images and extract 19 landmarks by pairing the regressed coordinates. In experiments, the public database from the Grand Challenges in Dental X-ray Image Analysis in ISBI 2015 was used and the proposed system showed promising performance by successfully locating the cephalometric landmarks within considerable margins from the ground truths.
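
The coordinate handling described in this abstract (normalizing each coordinate by image width or height, then pairing 38 regressed values into 19 landmarks) can be sketched in Python. The interleaved x, y ordering of the 38 values, the image size, and the landmark positions are assumptions for illustration; the CNN regressors themselves are not reproduced here.

```python
def normalize(landmarks, width, height):
    """Flatten (x, y) pixel pairs into one list of values scaled to [0, 1]."""
    flat = []
    for x, y in landmarks:
        flat.extend([x / width, y / height])
    return flat

def to_landmarks(coords, width, height):
    """Pair consecutive normalized values back into (x, y) pixel positions."""
    return [(coords[i] * width, coords[i + 1] * height)
            for i in range(0, len(coords), 2)]

width, height = 1935, 2400   # hypothetical image size in pixels
truth = [(100.0 + 50 * k, 200.0 + 75 * k) for k in range(19)]  # made-up landmarks

targets = normalize(truth, width, height)           # 38 regression targets
recovered = to_landmarks(targets, width, height)    # back to 19 landmarks
print(len(targets), len(recovered))
```

In the system described, each of the 38 values would come from its own trained regressor instead of from `normalize`.
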
Aspects of porosity prediction using multivariate linear regression

DOE Office of Scientific and Technical Information (OSTI.GOV)

Byrnes, A.P.; Wilson, M.D.

1991-03-01

Highly accurate multiple linear regression models have been developed for sandstones of diverse compositions. Porosity reduction or enhancement processes are controlled by the fundamental variables Pressure (P), Temperature (T), Time (t), and Composition (X), where composition includes mineralogy, size, sorting, fluid composition, etc. The multiple linear regression equation, of which all linear porosity prediction models are subsets, takes the generalized form: $\text{Porosity} = C_0 + C_1(P) + C_2(T) + C_3(X) + C_4(t) + C_5(PT) + C_6(PX) + C_7(Pt) + C_8(TX) + C_9(Tt) + C_{10}(Xt) + C_{11}(PTX) + C_{12}(PXt) + C_{13}(PTt) + C_{14}(TXt) + C_{15}(PTXt)$. The first four primary variables are often interactive, thus requiring terms involving two or more primary variables (the form shown implies interaction and not necessarily multiplication). The final terms used may also involve simple mathematical transforms such as $\log X$, $e^{T}$, $X^{2}$, or more complex transformations such as the Time-Temperature Index (TTI). The X term in the equation above represents a suite of compositional variables and, therefore, a fully expanded equation may include a series of terms incorporating these variables. Numerous published bivariate porosity prediction models involving P (or depth) or Tt (TTI) are effective to a degree, largely because of the high degree of collinearity between P and TTI. However, all such bivariate models ignore the unique contributions of P and Tt, as well as various X terms. These simpler models become poor predictors in regions where collinear relations change, where important variables have been ignored, or where the database does not include a sufficient range or weight distribution for the critical variables.
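
The generalized form has one coefficient for every non-empty combination of the four primary variables, plus an intercept. The Python sketch below enumerates those combinations and builds one row of a design matrix. Using plain products for the interaction terms is an assumption (the abstract notes that interaction does not necessarily mean multiplication), and the input values are made up.

```python
from itertools import combinations
from math import prod

PRIMARY = ["P", "T", "X", "t"]   # pressure, temperature, composition, time

def interaction_terms(names):
    """Every non-empty combination of the names, ordered by size."""
    return [combo
            for size in range(1, len(names) + 1)
            for combo in combinations(names, size)]

def design_row(values):
    """Intercept followed by one product per combination; values maps name -> number."""
    return [1.0] + [prod(values[n] for n in combo)
                    for combo in interaction_terms(PRIMARY)]

terms = interaction_terms(PRIMARY)
print(len(terms))
print(["".join(combo) for combo in terms])
print(design_row({"P": 2.0, "T": 3.0, "X": 1.0, "t": 0.5}))
```

The enumeration gives 4 single-variable, 6 two-variable, 4 three-variable, and 1 four-variable term, which matches the 15 non-intercept coefficients of the generalized form. The order of the three-variable terms differs slightly from the order in which the abstract lists them.
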
Using multiple logistic regression and GIS technology to predict landslide hazard in northeast Kansas, USA

USGS Publications Warehouse

Ohlmacher, G.C.; Davis, J.C.

2003-01-01

Landslides in the hilly terrain along the Kansas and Missouri rivers in northeastern Kansas have caused millions of dollars in property damage during the last decade. To address this problem, a statistical method called multiple logistic regression has been used to create a landslide-hazard map for Atchison, Kansas, and surrounding areas. Data included digitized geology, slopes, and landslides, manipulated using ArcView GIS. Logistic regression relates predictor variables to the occurrence or nonoccurrence of landslides within geographic cells and uses the relationship to produce a map showing the probability of future landslides, given local slopes and geologic units. Results indicated that slope is the most important variable for estimating landslide hazard in the study area. Geologic units consisting mostly of shale, siltstone, and sandstone were most susceptible to landslides. Soil type and aspect ratio were considered but excluded from the final analysis because these variables did not significantly add to the predictive power of the logistic regression. Soil types were highly correlated with the geologic units, and no significant relationships existed between landslides and slope aspect. © 2003 Elsevier Science B.V. All rights reserved.
Multiple regression technique for Pth degree polynominals with and without linear cross products

NASA Technical Reports Server (NTRS)

Davis, J. W.

1973-01-01

A multiple regression technique was developed by which the nonlinear behavior of specified independent variables can be related to a given dependent variable. The polynomial expression can be of Pth degree and can incorporate N independent variables. Two cases are treated such that mathematical models can be studied both with and without linear cross products. The resulting surface fits can be used to summarize trends for a given phenomenon and provide a mathematical relationship for subsequent analysis. To implement this technique, separate computer programs were developed for the case without linear cross products and for the case incorporating such cross products which evaluate the various constants in the model regression equation. In addition, the significance of the estimated regression equation is considered and the standard deviation, the F statistic, the maximum absolute percent error, and the average of the absolute values of the percent of error evaluated. The computer programs and their manner of utilization are described. Sample problems are included to illustrate the use and capability of the technique which show the output formats and typical plots comparing computer results to each set of input data.
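The two cases (with and without cross products) can be sketched as two design matrices fitted by least squares. This is a modern illustration, not the report's programs: it assumes "linear cross products" means pairwise products of first-power variables, and the helper names `polynomial_design` and `fit_summary` are hypothetical.

```python
from itertools import combinations

import numpy as np


def polynomial_design(X, degree, cross_products=False):
    """Columns: intercept, x_j**p for p = 1..degree, optional x_i * x_j pairs."""
    n, k = X.shape
    cols = [np.ones(n)]
    for p in range(1, degree + 1):
        cols.extend(X[:, j] ** p for j in range(k))
    if cross_products:
        cols.extend(X[:, i] * X[:, j] for i, j in combinations(range(k), 2))
    return np.column_stack(cols)


def fit_summary(A, y):
    """Return coefficients, residual standard deviation, max and mean |% error|."""
    coef, *_ = np.linalg.lstsq(A, y, rcond=None)
    resid = y - A @ coef
    std_dev = np.sqrt(resid @ resid / (len(y) - A.shape[1]))
    pct_err = 100.0 * np.abs(resid / y)
    return coef, std_dev, pct_err.max(), pct_err.mean()


# Synthetic response that contains one cross product (hypothetical surface).
rng = np.random.default_rng(0)
X = rng.uniform(0.0, 1.0, size=(60, 3))
y = 10.0 + 2.0 * X[:, 0] - X[:, 1] ** 2 + 0.5 * X[:, 0] * X[:, 2]

A_plain = polynomial_design(X, degree=2)
A_cross = polynomial_design(X, degree=2, cross_products=True)
_, std_plain, max_plain, mean_plain = fit_summary(A_plain, y)
_, std_cross, max_cross, mean_cross = fit_summary(A_cross, y)
```

Comparing the two summaries shows how much of the fit depends on the cross-product terms for a given data set.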
Efficacy of Social Media Adoption on Client Growth for Independent Management Consultants

DTIC Science & Technology

2017-02-01

design, a linear multiple regression with three predictor variables and one dependent variable per testing was used. Under those circumstances...regression test was used to compare the social media adoption of two groups on a single measure to determine if there was a statistical difference...number and types of social media platforms used and their influence on client growth was examined in this research design that used a descriptive
Temporal Synchronization Analysis for Improving Regression Modeling of Fecal Indicator Bacteria Levels

EPA Science Inventory

Multiple linear regression models are often used to predict levels of fecal indicator bacteria (FIB) in recreational swimming waters based on independent variables (IVs) such as meteorologic, hydrodynamic, and water-quality measures. The IVs used for these analyses are traditiona...
Combining multiple regression and principal component analysis for accurate predictions for column ozone in Peninsular Malaysia

NASA Astrophysics Data System (ADS)

Rajab, Jasim M.; MatJafri, M. Z.; Lim, H. S.

2013-06-01

This study models columnar ozone over peninsular Malaysia. A data set of eight atmospheric parameters [air surface temperature (AST), carbon monoxide (CO), methane (CH4), water vapour (H2Ovapour), skin surface temperature (SSKT), atmosphere temperature (AT), relative humidity (RH), and mean surface pressure (MSP)], retrieved from NASA's Atmospheric Infrared Sounder (AIRS) for the entire period (2003-2008), was employed to develop models to predict the value of columnar ozone (O3) in the study area. A combined method, using multiple regression together with principal component analysis (PCA) modelling, was used to predict columnar ozone and to improve prediction accuracy. Separate analyses were carried out for the north east monsoon (NEM) and south west monsoon (SWM) seasons. O3 was negatively correlated with CH4, H2Ovapour, RH, and MSP, whereas it was positively correlated with CO, AST, SSKT, and AT during both the NEM and SWM seasons. Multiple regression analysis was used to fit the columnar ozone data using the atmospheric parameters as predictors. A variable selection method based on high loadings of varimax-rotated principal components was used to obtain subsets of the predictor variables to be included in the linear regression model. It was found that an increase in columnar O3 is associated with an increase in the values of AST, SSKT, AT, and CO and with a drop in the levels of CH4, H2Ovapour, RH, and MSP. Fitting the best models for the columnar O3 value using eight of the independent variables gave about the same values of R (≈0.93) and R2 (≈0.86) for both the NEM and SWM seasons. The common variables that appeared in both regression equations were SSKT, CH4 and RH, and the principal precursor of the columnar O3 value in both the NEM and SWM seasons was SSKT.
Forecasting defoliation by the gypsy moth in oak stands

Treesearch

Robert W. Campbell; Joseph P. Standaert

1974-01-01

A multiple-regression model is presented that reflects statistically significant correlations between defoliation by the gypsy moth, the dependent variable, and a series of biotic and physical independent variables. Both possible uses and shortcomings of this model are discussed.
Introduction to uses and interpretation of principal component analyses in forest biology.

Treesearch

J. G. Isebrands; Thomas R. Crow

1975-01-01

The application of principal component analysis for interpretation of multivariate data sets is reviewed with emphasis on (1) reduction of the number of variables, (2) ordination of variables, and (3) applications in conjunction with multiple regression.
Black Male Labor Force Participation.

ERIC Educational Resources Information Center

Baer, Roger K.

This study attempts to test (via multiple regression analysis) hypothesized relationships between designated independent variables and age specific incidences of labor force participation for black male subpopulations in 54 Standard Metropolitan Statistical Areas. Leading independent variables tested include net migration, earnings, unemployment,…
Regression Models for the Analysis of Longitudinal Gaussian Data from Multiple Sources

PubMed Central

O’Brien, Liam M.; Fitzmaurice, Garrett M.

2006-01-01

We present a regression model for the joint analysis of longitudinal multiple source Gaussian data. Longitudinal multiple source data arise when repeated measurements are taken from two or more sources, and each source provides a measure of the same underlying variable and on the same scale. This type of data generally produces a relatively large number of observations per subject; thus estimation of an unstructured covariance matrix often may not be possible. We consider two methods by which parsimonious models for the covariance can be obtained for longitudinal multiple source data. The methods are illustrated with an example of multiple informant data arising from a longitudinal interventional trial in psychiatry. PMID:15726666
Financial Management and Control for Decision Making in Urban Local Bodies in India Using Statistical Techniques

NASA Astrophysics Data System (ADS)

Bhattacharyya, Sidhakam; Bandyopadhyay, Gautam

2010-10-01

The councils of most Urban Local Bodies (ULBs) have limited scope for decision making in the absence of an appropriate financial control mechanism. Information about the expected amount of own fund during a particular period is of great importance for decision making. Therefore, in this paper, an effort is made to present a set of findings and to establish a model for estimating receipts from own sources and payments thereof using multiple regression analysis. Data for sixty months from a reputed ULB in West Bengal have been considered for ascertaining the regression models. This can be used as a part of the financial management and control procedure by the council to estimate the effect on own fund. In our study we have considered two models using multiple regression analysis. "Model I" comprises total adjusted receipt as the dependent variable and selected individual receipts as the independent variables. Similarly, "Model II" consists of total adjusted payments as the dependent variable and selected individual payments as independent variables. The resultant of Model I and Model II is the surplus or deficit affecting own fund. This may be applied for decision-making purposes by the council.
Characterizing individual differences in functional connectivity using dual-regression and seed-based approaches.

PubMed

Smith, David V; Utevsky, Amanda V; Bland, Amy R; Clement, Nathan; Clithero, John A; Harsch, Anne E W; McKell Carter, R; Huettel, Scott A

2014-07-15

A central challenge for neuroscience lies in relating inter-individual variability to the functional properties of specific brain regions. Yet, considerable variability exists in the connectivity patterns between different brain areas, potentially producing reliable group differences. Using sex differences as a motivating example, we examined two separate resting-state datasets comprising a total of 188 human participants. Both datasets were decomposed into resting-state networks (RSNs) using a probabilistic spatial independent component analysis (ICA). We estimated voxel-wise functional connectivity with these networks using a dual-regression analysis, which characterizes the participant-level spatiotemporal dynamics of each network while controlling for (via multiple regression) the influence of other networks and sources of variability. We found that males and females exhibit distinct patterns of connectivity with multiple RSNs, including both visual and auditory networks and the right frontal-parietal network. These results replicated across both datasets and were not explained by differences in head motion, data quality, brain volume, cortisol levels, or testosterone levels. Importantly, we also demonstrate that dual-regression functional connectivity is better at detecting inter-individual variability than traditional seed-based functional connectivity approaches. Our findings characterize robust, yet frequently ignored, neural differences between males and females, pointing to the necessity of controlling for sex in neuroscience studies of individual differences. Moreover, our results highlight the importance of employing network-based models to study variability in functional connectivity. Copyright © 2014 Elsevier Inc. All rights reserved.
The Utilization of Community Mental Health Services by the Hispanic Elderly.

ERIC Educational Resources Information Center

Starrett, Richard A.; And Others

Multiple regression and path analyses of 29 demographic, social, and psychological variables were carried out to determine those variables that influenced the use of community-based mental health services by the Hispanic elderly. The variables were classified using the Andersen and Newman framework which conceptualizes the individual's demand for…
Quantitative Assessment of Cervical Vertebral Maturation Using Cone Beam Computed Tomography in Korean Girls

PubMed Central

Byun, Bo-Ram; Kim, Yong-Il; Maki, Koutaro; Son, Woo-Sung

2015-01-01

This study aimed to examine the correlation between skeletal maturation status and parameters from the odontoid process/body of the second vertebra and the bodies of the third and fourth cervical vertebrae, and simultaneously to build multiple regression models able to estimate skeletal maturation status in Korean girls. Hand-wrist radiographs and cone beam computed tomography (CBCT) images were obtained from 74 Korean girls (6–18 years of age). CBCT-generated cervical vertebral maturation (CVM) was used to demarcate the odontoid process and the body of the second cervical vertebra, based on the dentocentral synchondrosis. Correlation coefficient analysis and multiple linear regression analysis were used for each parameter of the cervical vertebrae (P < 0.05). Forty-seven of 64 parameters from CBCT-generated CVM (independent variables) exhibited statistically significant correlations (P < 0.05). The multiple regression model with the greatest R2 had six parameters (PH2/W2, UW2/W2, (OH+AH2)/LW2, UW3/LW3, D3, and H4/W4) as independent variables with a variance inflation factor (VIF) of <2. CBCT-generated CVM was able to include parameters from the second cervical vertebral body and odontoid process, respectively, for the multiple regression models. This suggests that quantitative analysis might be used to estimate skeletal maturation status. PMID:25878721
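The variance inflation factor used as a screening threshold above can be computed directly from its definition, VIF_j = 1 / (1 - R_j^2), where R_j^2 comes from regressing predictor j on the remaining predictors. The sketch below uses synthetic predictors; the function name and the data are illustrative assumptions, not the study's parameters.

```python
import numpy as np


def variance_inflation_factors(X):
    """VIF for each column of X (predictors only, no intercept column)."""
    n, k = X.shape
    vifs = []
    for j in range(k):
        target = X[:, j]
        others = np.column_stack([np.ones(n), np.delete(X, j, axis=1)])
        coef, *_ = np.linalg.lstsq(others, target, rcond=None)
        resid = target - others @ coef
        r2 = 1.0 - resid @ resid / np.sum((target - target.mean()) ** 2)
        vifs.append(1.0 / (1.0 - r2))
    return np.array(vifs)


# Synthetic predictors: x3 nearly duplicates x1, x2 is independent.
rng = np.random.default_rng(0)
x1 = rng.normal(size=500)
x2 = rng.normal(size=500)
x3 = x1 + 0.1 * rng.normal(size=500)
vifs = variance_inflation_factors(np.column_stack([x1, x2, x3]))
```

Under a VIF < 2 rule like the one reported, only `x2` would be retained alongside one of the two near-duplicate predictors.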
REGRESSION MODELS THAT RELATE STREAMS TO WATERSHEDS: COPING WITH NUMEROUS, COLLINEAR PREDICTORS

EPA Science Inventory

GIS efforts can produce a very large number of watershed variables (climate, land use/land cover and topography, all defined for multiple areas of influence) that could serve as candidate predictors in a regression model of reach-scale stream features. Invariably, many of these ...
Identifying the Factors That Influence Change in SEBD Using Logistic Regression Analysis

ERIC Educational Resources Information Center

Camilleri, Liberato; Cefai, Carmel

2013-01-01

Multiple linear regression and ANOVA models are widely used in applications since they provide effective statistical tools for assessing the relationship between a continuous dependent variable and several predictors. However, these models rely heavily on linearity and normality assumptions and they do not accommodate categorical dependent…
Relationship between rice yield and climate variables in southwest Nigeria using multiple linear regression and support vector machine analysis

NASA Astrophysics Data System (ADS)

Oguntunde, Philip G.; Lischeid, Gunnar; Dietrich, Ottfried

2018-03-01

This study examines the variations of climate variables and rice yield and quantifies the relationships among them using multiple linear regression, principal component analysis, and support vector machine (SVM) analysis in southwest Nigeria. The climate and yield data covered a period of 36 years between 1980 and 2015. Similar to the observed decrease (P < 0.001) in rice yield, pan evaporation, solar radiation, and wind speed declined significantly. Eight principal components exhibited an eigenvalue > 1 and explained 83.1% of the total variance of predictor variables. The SVM regression function using the scores of the first principal component explained about 75% of the variance in rice yield data and linear regression about 64%. SVM regression between annual solar radiation values and yield explained 67% of the variance. Only the first component of the principal component analysis (PCA) exhibited a clear long-term trend and sometimes short-term variance similar to that of rice yield. Short-term fluctuations of the scores of PC1 are closely coupled to those of rice yield during the 1986-1993 and the 2006-2013 periods, thereby revealing the inter-annual sensitivity of rice production to climate variability. Solar radiation stands out as the climate variable of highest influence on rice yield, and the influence was especially strong during monsoon and post-monsoon periods, which correspond to the vegetative, booting, flowering, and grain filling stages in the study area. The outcome is expected to provide more in-depth region-specific climate-rice linkage for screening of better cultivars that can positively respond to future climate fluctuations, as well as providing information that may help optimize planting dates for improved radiation use efficiency in the study area.

Multivariate research in areas of phosphorus cast-iron brake shoes manufacturing using the statistical analysis and the multiple regression equations

NASA Astrophysics Data System (ADS)

Kiss, I.; Cioată, V. G.; Alexa, V.; Raţiu, S. A.

2017-05-01

The braking system is one of the most important and complex subsystems of railway vehicles, especially when it comes to safety. Therefore, installing efficient and safe brakes on modern railway vehicles is essential. Attention is currently devoted to solving problems connected with the use of high-performance brake materials and their impact on the thermal and mechanical loading of railway wheels. The main factor that influences the selection of a friction material for railway applications is the performance criterion, because the interaction between the brake block and the wheel produces complex thermo-mechanical phenomena. In this work, the investigated subjects are the cast-iron brake shoes, which are still widely used on freight wagons. Therefore, the cast-iron brake shoes, with lamellar graphite and a high content of phosphorus (0.8-1.1%), need a special investigation. In order to establish the optimal condition for the cast-iron brake shoes, we propose a mathematical modelling study using statistical analysis and multiple regression equations. Multivariate research is important in areas of cast-iron brake shoes manufacturing, because many variables interact with each other simultaneously. Multivariate visualization comes to the fore when researchers have difficulties in comprehending many dimensions at one time. Technological data (hardness and chemical composition) obtained from cast-iron brake shoes were used for this purpose. In order to establish the multiple correlation between the hardness of the cast-iron brake shoes and the elements of their chemical composition, several types of regression equation models have been proposed.
Because a three-dimensional surface with variables on three axes is a common way to illustrate multivariate data, in which the maximum and minimum values are easily highlighted, we plotted graphical representations of the regression equations in order to explain the interaction of the variables and locate the optimal level of each variable for maximal response. The software Matlab was used to calculate the regression coefficients, dispersion and correlation coefficients.
Novel Index (Hepatic Receptor: IHR) to Evaluate Hepatic Functional Reserve Using (99m)Tc-GSA Scintigraphy.

PubMed

Hasegawa, Daisuke; Onishi, Hideo; Matsutomo, Norikazu

2016-02-01

This study aimed to evaluate a novel index of hepatic receptor (IHR), derived by regression analysis from the time activity curve of the liver, as a measure of hepatic functional reserve. Sixty patients who had undergone (99m)Tc-galactosyl serum albumin ((99m)Tc-GSA) scintigraphy were included in this retrospective clinical study. Time activity curves for the liver were obtained from a region of interest (ROI) on the whole liver. A novel hepatic functional predictor was calculated with multiple regression analysis of time activity curves. In the multiple regression function, the objective variable was the indocyanine green (ICG) retention rate at 15 min, and the explanatory variables were the liver counts at 3-min intervals from beginning to end. The result was defined as the IHR, and we analyzed the correlation between IHR and ICG, uptake ratio of the heart at 15 minutes to that at 3 minutes (HH15), uptake ratio of the liver to the liver plus heart at 15 minutes (LHL15), and index of convexity (IOC). The regression function of IHR was derived as follows: IHR = 0.025×L(6) - 0.052×L(12) + 0.027×L(27). The multiple regression analysis indicated that liver counts at 6 min, 12 min, and 27 min were significantly related to the objective variable. The correlation coefficient between IHR and ICG was 0.774, and the correlation coefficients between ICG and the conventional indices (HH15, LHL15, and IOC) were 0.837, 0.773, and 0.793, respectively. IHR had good correlation with HH15, LHL15, and IOC. The results suggest that IHR would provide clinical benefit for hepatic functional assessment in (99m)Tc-GSA scintigraphy.
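The reported regression function can be evaluated with a few lines of arithmetic. The sketch below only transcribes the published coefficients. The function name, the dictionary layout, and the example counts are hypothetical, and the abstract does not state the units or normalization of the liver counts L(t).

```python
def index_of_hepatic_receptor(liver_counts):
    """Evaluate IHR = 0.025*L(6) - 0.052*L(12) + 0.027*L(27).

    `liver_counts` maps acquisition time in minutes to the liver count L(t).
    Units and scaling of L(t) are not given in the abstract (assumption).
    """
    return (0.025 * liver_counts[6]
            - 0.052 * liver_counts[12]
            + 0.027 * liver_counts[27])


# Hypothetical counts, for illustration only.
example_counts = {6: 1000.0, 12: 1200.0, 27: 1300.0}
ihr = index_of_hepatic_receptor(example_counts)

# Adding the same constant to all three counts leaves the index unchanged,
# because the published coefficients sum to zero.
shifted = index_of_hepatic_receptor({t: c + 100.0 for t, c in example_counts.items()})
```

Because 0.025 - 0.052 + 0.027 = 0, the index responds to the shape of the time activity curve rather than to a uniform offset in the counts.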
Assessment of Communications-related Admissions Criteria in a Three-year Pharmacy Program

PubMed Central

Tejada, Frederick R.; Lang, Lynn A.; Purnell, Miriam; Acedera, Lisa; Ngonga, Ferdinand

2015-01-01

Objective. To determine if there is a correlation between TOEFL and other admissions criteria that assess communications skills (ie, PCAT variables: verbal, reading, essay, and composite), interview, and observational scores and to evaluate TOEFL and these admissions criteria as predictors of academic performance. Methods. Statistical analyses included two sample t tests, multiple regression and Pearson’s correlations for parametric variables, and Mann-Whitney U for nonparametric variables, which were conducted on the retrospective data of 162 students, 57 of whom were foreign-born. Results. The multiple regression model of the other admissions criteria on TOEFL was significant. There was no significant correlation between TOEFL scores and academic performance. However, significant correlations were found between the other admissions criteria and academic performance. Conclusion. Since TOEFL is not a significant predictor of either communication skills or academic success of foreign-born PharmD students in the program, it may be eliminated as an admissions criterion. PMID:26430273
Assessment of Communications-related Admissions Criteria in a Three-year Pharmacy Program.

PubMed

Parmar, Jayesh R; Tejada, Frederick R; Lang, Lynn A; Purnell, Miriam; Acedera, Lisa; Ngonga, Ferdinand

2015-08-25

To determine if there is a correlation between TOEFL and other admissions criteria that assess communications skills (ie, PCAT variables: verbal, reading, essay, and composite), interview, and observational scores and to evaluate TOEFL and these admissions criteria as predictors of academic performance. Statistical analyses included two sample t tests, multiple regression and Pearson's correlations for parametric variables, and Mann-Whitney U for nonparametric variables, which were conducted on the retrospective data of 162 students, 57 of whom were foreign-born. The multiple regression model of the other admissions criteria on TOEFL was significant. There was no significant correlation between TOEFL scores and academic performance. However, significant correlations were found between the other admissions criteria and academic performance. Since TOEFL is not a significant predictor of either communication skills or academic success of foreign-born PharmD students in the program, it may be eliminated as an admissions criterion.
Food insecurity and CD4% Among HIV+ children in Gaborone, Botswana.

PubMed

Mendoza, Jason A; Matshaba, Mogomotsi; Makhanda, Jeremiah; Liu, Yan; Boitshwarelo, Matshwenyego; Anabwani, Gabriel M

2014-08-01

We investigated the association between household food insecurity (HFI) and CD4% among 2-6-year old HIV+ outpatients (n = 78) at the Botswana-Baylor Children's Clinical Center of Excellence in Gaborone, Botswana. HFI was assessed by a validated survey. CD4% data were abstracted from the medical record. We used multiple linear regression with CD4% (dependent variable), HFI (independent variable), and controlled for sociodemographic and clinical covariates. Multiple linear regression showed a significant main effect for HFI [beta = -0.6, 95% confidence interval (CI): -1.0 to -0.1] and child gender (beta = 5.6, 95% CI: 1.3 to 9.8). Alleviating food insecurity may improve pediatric HIV outcomes in Botswana and similar Sub-Saharan settings.
Which Variables Associated with Data-Driven Instruction Are Believed to Best Predict Urban Student Achievement?

ERIC Educational Resources Information Center

Greer, Wil

2013-01-01

This study identified the variables associated with data-driven instruction (DDI) that are perceived to best predict student achievement. Of the DDI variables discussed in the literature, 51 had a sufficient research base to warrant statistical analysis. Of them, 26 were statistically significant. Multiple regression and an…
The Effects of Home-School Dissonance on African American Male High School Students

ERIC Educational Resources Information Center

Brown-Wright, Lynda; Tyler, Kenneth Maurice

2010-01-01

The current study examined associations between home-school dissonance and several academic and psychological variables among 80 African American male high school students. Regression analyses revealed that home-school dissonance significantly predicted multiple academic and psychological variables, including amotivation, academic cheating,…
Moderation analysis using a two-level regression model.

PubMed

Yuan, Ke-Hai; Cheng, Ying; Maxwell, Scott

2014-10-01

Moderation analysis is widely used in social and behavioral research. The most commonly used model for moderation analysis is moderated multiple regression (MMR) in which the explanatory variables of the regression model include product terms, and the model is typically estimated by least squares (LS). This paper argues for a two-level regression model in which the regression coefficients of a criterion variable on predictors are further regressed on moderator variables. An algorithm for estimating the parameters of the two-level model by normal-distribution-based maximum likelihood (NML) is developed. Formulas for the standard errors (SEs) of the parameter estimates are provided and studied. Results indicate that, when heteroscedasticity exists, NML with the two-level model gives more efficient and more accurate parameter estimates than the LS analysis of the MMR model. When error variances are homoscedastic, NML with the two-level model leads to essentially the same results as LS with the MMR model. Most importantly, the two-level regression model permits estimating the percentage of variance of each regression coefficient that is due to moderator variables. When applied to data from General Social Surveys 1991, NML with the two-level model identified a significant moderation effect of race on the regression of job prestige on years of education while LS with the MMR model did not. An R package is also developed and documented to facilitate the application of the two-level model.
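The baseline MMR model that the paper compares against can be sketched as an ordinary least-squares fit with a product term. This illustrates only the LS/MMR side, not the two-level NML estimator or the authors' R package. The variable names and the synthetic data are assumptions loosely modelled on the job prestige example.

```python
import numpy as np

# Synthetic data (hypothetical): prestige depends on education, with a slope
# that differs between two groups coded 0/1 (the moderator).
rng = np.random.default_rng(1)
n = 1000
education = rng.normal(13.0, 3.0, n)
group = rng.integers(0, 2, n).astype(float)
prestige = (10.0 + 2.0 * education + 5.0 * group
            - 1.0 * education * group + rng.normal(0.0, 1.0, n))

# MMR design matrix: intercept, predictor, moderator, and their product.
A = np.column_stack([np.ones(n), education, group, education * group])
b0, b1, b2, b3 = np.linalg.lstsq(A, prestige, rcond=None)[0]

# Simple slopes of education at each level of the moderator.
slope_group0 = b1
slope_group1 = b1 + b3
```

A nonzero `b3` is what MMR reports as moderation. The two-level model described above instead regresses the coefficient of the predictor on the moderator and allows that coefficient its own error variance.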
Relationship between body composition and postural control in prepubertal overweight/obese children: A cross-sectional study.

PubMed

Villarrasa-Sapiña, Israel; Álvarez-Pitti, Julio; Cabeza-Ruiz, Ruth; Redón, Pau; Lurbe, Empar; García-Massó, Xavier

2018-02-01

Excess body weight during childhood causes reduced motor functionality and problems in postural control, a negative influence which has been reported in the literature. Nevertheless, no information regarding the effect of body composition on the postural control of overweight and obese children is available. The objective of this study was therefore to establish these relationships. A cross-sectional design was used to establish relationships between body composition and postural control variables obtained in bipedal eyes-open and eyes-closed conditions in twenty-two children. Centre of pressure signals were analysed in the temporal and frequency domains. Pearson correlations were applied to establish relationships between variables. Principal component analysis was applied to the body composition variables to avoid potential multicollinearity in the regression models. These principal components were used to perform a multiple linear regression analysis, from which regression models were obtained to predict postural control. Height and leg mass were the body composition variables that showed the highest correlation with postural control. Multiple regression models were also obtained and several of these models showed a higher correlation coefficient in predicting postural control than simple correlations. These models revealed that leg and trunk mass were good predictors of postural control. More equations were found in the eyes-open than eyes-closed condition. Body weight and height are negatively correlated with postural control. However, leg and trunk mass are better postural control predictors than arm or body mass. Finally, body composition variables are more useful in predicting postural control when the eyes are open. Copyright © 2017 Elsevier Ltd. All rights reserved.
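The step of replacing collinear body composition variables with principal components before regression can be sketched as principal component regression. The function name, the synthetic masses, and the "sway" outcome below are hypothetical, and standardizing the predictors before the decomposition is an assumption rather than a detail given in the abstract.

```python
import numpy as np


def principal_component_regression(X, y, n_components):
    """Regress y on the leading principal component scores of standardized X."""
    Z = (X - X.mean(axis=0)) / X.std(axis=0, ddof=1)
    _, s, Vt = np.linalg.svd(Z, full_matrices=False)
    scores = Z @ Vt[:n_components].T            # uncorrelated by construction
    explained = (s ** 2 / np.sum(s ** 2))[:n_components]
    A = np.column_stack([np.ones(len(y)), scores])
    coef, *_ = np.linalg.lstsq(A, y, rcond=None)
    return scores, explained, coef


# Synthetic, strongly collinear body composition variables (hypothetical).
rng = np.random.default_rng(3)
n = 120
body_mass = rng.normal(40.0, 8.0, n)
leg_mass = 0.3 * body_mass + rng.normal(0.0, 0.5, n)
trunk_mass = 0.5 * body_mass + rng.normal(0.0, 0.8, n)
X = np.column_stack([body_mass, leg_mass, trunk_mass])
sway = 5.0 + 0.05 * leg_mass + rng.normal(0.0, 0.2, n)

scores, explained, coef = principal_component_regression(X, sway, n_components=2)
```

Because the component scores are mutually uncorrelated, the regression on them does not suffer from the multicollinearity present among the original variables.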
Forecasting models for sugi (Cryptomeria japonica D. Don) pollen count showing an alternate dispersal rhythm.

PubMed

Ito, Yukiko; Hattori, Reiko; Mase, Hiroki; Watanabe, Masako; Shiotani, Itaru

2008-12-01

Pollen information is indispensable for allergic individuals and clinicians. This study aimed to develop forecasting models for the total annual count of airborne pollen grains based on data monitored over the last 20 years at the Mie Chuo Medical Center, Tsu, Mie, Japan. Airborne pollen grains were collected using a Durham sampler. Total annual pollen count and pollen count from October to December (OD pollen count) of the previous year were transformed to logarithms. Regression analysis of the total pollen count was performed using variables such as the OD pollen count and the maximum temperature for mid-July of the previous year. Time series analysis revealed an alternate rhythm of the series of total pollen count. The alternate rhythm consisted of a cyclic alternation of an "on" year (high pollen count) and an "off" year (low pollen count). This rhythm was used as a dummy variable in regression equations. Of the three models involving the OD pollen count, a multiple regression equation that included the alternate rhythm variable and the interaction of this rhythm with OD pollen count showed a high coefficient of determination (0.844). Of the three models involving the maximum temperature for mid-July, those including the alternate rhythm variable and the interaction of this rhythm with maximum temperature had the highest coefficient of determination (0.925). An alternate pollen dispersal rhythm represented by a dummy variable in the multiple regression analysis plays a key role in improving forecasting models for the total annual sugi pollen count.
Confidence Intervals for Squared Semipartial Correlation Coefficients: The Effect of Nonnormality

ERIC Educational Resources Information Center

Algina, James; Keselman, H. J.; Penfield, Randall D.

2010-01-01

The increase in the squared multiple correlation coefficient (ΔR²) associated with a variable in a regression equation is a commonly used measure of importance in regression analysis. Algina, Keselman, and Penfield found that intervals based on asymptotic principles were typically very inaccurate, even though the sample size…
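The quantity under study, the increase in R² when a variable is added (equal to its squared semipartial correlation), can be computed as the difference between two nested least-squares fits. The sketch shows the point estimate only, not the confidence intervals the article examines. The function names and the synthetic data are illustrative assumptions.

```python
import numpy as np


def r_squared(X, y):
    """R^2 of an OLS fit of y on the columns of X plus an intercept."""
    A = np.column_stack([np.ones(len(y)), X])
    coef, *_ = np.linalg.lstsq(A, y, rcond=None)
    resid = y - A @ coef
    return 1.0 - resid @ resid / np.sum((y - y.mean()) ** 2)


def delta_r_squared(X, y, j):
    """Increase in R^2 from adding column j to a model with all other columns."""
    return r_squared(X, y) - r_squared(np.delete(X, j, axis=1), y)


# Synthetic data: x0 matters a lot, x1 a little, x2 not at all.
rng = np.random.default_rng(0)
X = rng.normal(size=(500, 3))
y = 2.0 * X[:, 0] + 0.5 * X[:, 1] + rng.normal(size=500)
deltas = [delta_r_squared(X, y, j) for j in range(3)]
```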
Multiple Logistic Regression Analysis of Cigarette Use among High School Students

ERIC Educational Resources Information Center

Adwere-Boamah, Joseph

2011-01-01

A binary logistic regression analysis was performed to predict high school students' cigarette smoking behavior from selected predictors from 2009 CDC Youth Risk Behavior Surveillance Survey. The specific target student behavior of interest was frequent cigarette use. Five predictor variables included in the model were: a) race, b) frequency of…
Evaluating the performance of different predictor strategies in regression-based downscaling with a focus on glacierized mountain environments

NASA Astrophysics Data System (ADS)

Hofer, Marlis; Nemec, Johanna

2016-04-01

This study presents first steps towards verifying the hypothesis that uncertainty in global and regional glacier mass simulations can be reduced considerably by reducing the uncertainty in the high-resolution atmospheric input data. To this end, we systematically explore the potential of different predictor strategies for improving the performance of regression-based downscaling approaches. The investigated local-scale target variables are precipitation, air temperature, wind speed, relative humidity and global radiation, all at a daily time scale. Observations of these target variables are assessed from three sites in geo-environmentally and climatologically very distinct settings, all within highly complex topography and in close proximity to mountain glaciers: (1) the Vernagtbach station in the Northern European Alps (VERNAGT), (2) the Artesonraju measuring site in the tropical South American Andes (ARTESON), and (3) the Brewster measuring site in the Southern Alps of New Zealand (BREWSTER). As the large-scale predictors, ERA-Interim reanalysis data are used. In the applied downscaling model training and evaluation procedures, particular emphasis is put on appropriately accounting for the pitfalls of limited and/or patchy observation records that are usually the only (if at all) available data from the glacierized mountain sites. Generalized linear models and beta regression are investigated as alternatives to ordinary least squares regression for the non-Gaussian target variables.
By analyzing results for the three different sites, five predictands and for different times of the year, we look for systematic improvements in the downscaling models' skill specifically obtained by (i) using predictor data at the optimum scale rather than the minimum scale of the reanalysis data, (ii) identifying the optimum predictor allocation in the vertical, and (iii) considering multiple (variable, level and/or grid point) predictor options combined with state-of-art empirical feature selection tools. First results show that in particular for air temperature, those downscaling models based on direct predictor selection show comparative skill like those models based on multiple predictors. For all other target variables, however, multiple predictor approaches can considerably outperform those models based on single predictors. Including multiple variable types emerges as the most promising predictor option (in particular for wind speed at all sites), even if the same predictor set is used across the different cases.
Application of Multiregressive Linear Models, Dynamic Kriging Models and Neural Network Models to Predictive Maintenance of Hydroelectric Power Systems

NASA Astrophysics Data System (ADS)

Lucifredi, A.; Mazzieri, C.; Rossi, M.

2000-05-01

Since the operational conditions of a hydroelectric unit can vary within a wide range, the monitoring system must be able to distinguish between variations of the monitored variable caused by changes in operating conditions and those due to the onset and progression of failures and misoperations. The paper aims to identify the best technique to be adopted for the monitoring system. Three different methods have been implemented and compared. Two of them use statistical techniques: the first, linear multiple regression, expresses the monitored variable as a linear function of the process parameters (independent variables), while the second, the dynamic kriging technique, is a modified form of multiple linear regression that represents the monitored variable as a linear combination of the process variables in such a way as to minimize the variance of the estimation error. The third is based on neural networks. Tests have shown that the monitoring system based on the kriging technique is not affected by some problems common to the other two models, e.g. the requirement of a large amount of data for tuning (both for training the neural network and for defining the optimum plane for the multiple regression), not only in the system start-up phase but also after a trivial maintenance operation involving the substitution of machinery components that have a direct impact on the observed variable, or the need for different models to describe satisfactorily the different operating ranges of the plant. The monitoring system based on the kriging statistical technique overcomes these difficulties: it does not require a large amount of data to be tuned and is immediately operational (given two points, the third can be immediately estimated); in addition, the model follows the system without adapting itself to it. 
The results of the experimentation performed seem to indicate that a model based on a neural network or on a linear multiple regression is not optimal, and that a different approach is necessary to reduce the amount of work during the learning phase using, when available, all the information stored during the initial phase of the plant to build the reference baseline, elaborating, if it is the case, the raw information available. A mixed approach using the kriging statistical technique and neural network techniques could optimise the result.
Simple and multiple linear regression: sample size considerations.

PubMed

Hanley, James A

2016-11-01

The suggested "two subjects per variable" (2SPV) rule of thumb in the Austin and Steyerberg article is a chance to bring out some long-established and quite intuitive sample size considerations for both simple and multiple linear regression. This article distinguishes two of the major uses of regression models that imply very different sample size considerations, neither served well by the 2SPV rule. The first is etiological research, which contrasts mean Y levels at differing "exposure" (X) values and thus tends to focus on a single regression coefficient, possibly adjusted for confounders. The second research genre guides clinical practice. It addresses Y levels for individuals with different covariate patterns or "profiles." It focuses on the profile-specific (mean) Y levels themselves, estimating them via linear compounds of regression coefficients and covariates. By drawing on long-established closed-form variance formulae that lie beneath the standard errors in multiple regression, and by rearranging them for heuristic purposes, one arrives at quite intuitive sample size considerations for both research genres. Copyright © 2016 Elsevier Inc. All rights reserved.
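For orientation, one widely used closed-form expression of the kind the abstract alludes to is the sampling variance of a single coefficient in multiple linear regression. The abstract does not say which formulae the article rearranges, so the form below is an illustrative assumption; the symbols (residual variance \sigma^2, sample variance s_{X_j}^2 of predictor X_j, and R_j^2 from regressing X_j on the other predictors) are introduced here and are not taken from the article.

```latex
\operatorname{Var}\!\left(\hat{\beta}_j\right)
  = \frac{\sigma^{2}}{(n-1)\, s_{X_j}^{2}\, \left(1 - R_j^{2}\right)}
```

Read this way, precision for a single coefficient improves with a larger n, a wider spread of X_j, and less overlap between X_j and the other predictors, rather than with the ratio of subjects to variables alone.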
Finding structure in data using multivariate tree boosting

PubMed Central

Miller, Patrick J.; Lubke, Gitta H.; McArtor, Daniel B.; Bergeman, C. S.

2016-01-01

Technology and collaboration enable dramatic increases in the size of psychological and psychiatric data collections, but finding structure in these large data sets with many collected variables is challenging. Decision tree ensembles such as random forests (Strobl, Malley, & Tutz, 2009) are a useful tool for finding structure, but are difficult to interpret with multiple outcome variables which are often of interest in psychology. To find and interpret structure in data sets with multiple outcomes and many predictors (possibly exceeding the sample size), we introduce a multivariate extension to a decision tree ensemble method called gradient boosted regression trees (Friedman, 2001). Our extension, multivariate tree boosting, is a method for nonparametric regression that is useful for identifying important predictors, detecting predictors with nonlinear effects and interactions without specification of such effects, and for identifying predictors that cause two or more outcome variables to covary. We provide the R package ‘mvtboost’ to estimate, tune, and interpret the resulting model, which extends the implementation of univariate boosting in the R package ‘gbm’ (Ridgeway et al., 2015) to continuous, multivariate outcomes. To illustrate the approach, we analyze predictors of psychological well-being (Ryff & Keyes, 1995). Simulations verify that our approach identifies predictors with nonlinear effects and achieves high prediction accuracy, exceeding or matching the performance of (penalized) multivariate multiple regression and multivariate decision trees over a wide range of conditions. PMID:27918183
The Relationship between Mental Ability and Eight Background Variables

ERIC Educational Resources Information Center

Gill, Peter Edward

1976-01-01

Multiple regression is used to discover interconnections between IQ and vocabulary test scores as one variable, and socioeconomic factors as the other. Results show total variance as explained by predictors is never more than eight per cent, indicating differences in IQ scores are not attributable to environmental factors. (RW)
Predicting Adaptive Functioning of Mentally Retarded Persons in Community Settings.

ERIC Educational Resources Information Center

Hull, John T.; Thompson, Joy C.

1980-01-01

The impact of a variety of individual, residential, and community variables on adaptive functioning of 369 retarded persons (18 to 73 years old) was examined using a multiple regression analysis. Individual characteristics (especially IQ) accounted for 21 percent of the variance, while environmental variables, primarily those related to…
Fusing Data Mining, Machine Learning and Traditional Statistics to Detect Biomarkers Associated with Depression

PubMed Central

Dipnall, Joanna F.

2016-01-01

Background Atheoretical large-scale data mining techniques using machine learning algorithms have promise in the analysis of large epidemiological datasets. This study illustrates the use of a hybrid methodology for variable selection that took account of missing data and complex survey design to identify key biomarkers associated with depression from a large epidemiological study. Methods The study used a three-step methodology amalgamating multiple imputation, a machine learning boosted regression algorithm and logistic regression, to identify key biomarkers associated with depression in the National Health and Nutrition Examination Study (2009–2010). Depression was measured using the Patient Health Questionnaire-9 and 67 biomarkers were analysed. Covariates in this study included gender, age, race, smoking, food security, Poverty Income Ratio, Body Mass Index, physical activity, alcohol use, medical conditions and medications. The final imputed weighted multiple logistic regression model included possible confounders and moderators. Results After the creation of 20 imputation data sets from multiple chained regression sequences, machine learning boosted regression initially identified 21 biomarkers associated with depression. Using traditional logistic regression methods, including controlling for possible confounders and moderators, a final set of three biomarkers were selected. The final three biomarkers from the novel hybrid variable selection methodology were red cell distribution width (OR 1.15; 95% CI 1.01, 1.30), serum glucose (OR 1.01; 95% CI 1.00, 1.01) and total bilirubin (OR 0.12; 95% CI 0.05, 0.28). Significant interactions were found between total bilirubin with Mexican American/Hispanic group (p = 0.016), and current smokers (p<0.001). 
Conclusion The systematic use of a hybrid methodology for variable selection, fusing data mining techniques using a machine learning algorithm with traditional statistical modelling, accounted for missing data and complex survey sampling methodology and was demonstrated to be a useful tool for detecting three biomarkers associated with depression for future hypothesis generation: red cell distribution width, serum glucose and total bilirubin. PMID:26848571
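The odds ratios quoted above are exponentiated logistic regression coefficients. The sketch below converts between the two scales, assuming a Wald-type interval that is symmetric on the log-odds scale; the abstract does not state how its intervals were computed, and the function names are hypothetical.

```python
import math

Z95 = 1.959963984540054  # standard normal quantile for a two-sided 95% interval


def log_odds_from_or(or_point, ci_low, ci_high):
    """Return (beta, se) on the log-odds scale, assuming a Wald-type interval."""
    beta = math.log(or_point)
    se = (math.log(ci_high) - math.log(ci_low)) / (2 * Z95)
    return beta, se


def or_from_log_odds(beta, se):
    """Return (OR, lower, upper) for a coefficient and its standard error."""
    return math.exp(beta), math.exp(beta - Z95 * se), math.exp(beta + Z95 * se)


# Red cell distribution width, as reported in the abstract: OR 1.15 (95% CI 1.01, 1.30).
beta, se = log_odds_from_or(1.15, 1.01, 1.30)
print(beta, se)
print(or_from_log_odds(beta, se))
```

Because the reported bounds are rounded, the reconstructed interval need not match them exactly; only the point estimate and the width on the log scale are preserved.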
Fusing Data Mining, Machine Learning and Traditional Statistics to Detect Biomarkers Associated with Depression.

PubMed

Dipnall, Joanna F; Pasco, Julie A; Berk, Michael; Williams, Lana J; Dodd, Seetal; Jacka, Felice N; Meyer, Denny

2016-01-01

Atheoretical large-scale data mining techniques using machine learning algorithms have promise in the analysis of large epidemiological datasets. This study illustrates the use of a hybrid methodology for variable selection that took account of missing data and complex survey design to identify key biomarkers associated with depression from a large epidemiological study. The study used a three-step methodology amalgamating multiple imputation, a machine learning boosted regression algorithm and logistic regression, to identify key biomarkers associated with depression in the National Health and Nutrition Examination Study (2009-2010). Depression was measured using the Patient Health Questionnaire-9 and 67 biomarkers were analysed. Covariates in this study included gender, age, race, smoking, food security, Poverty Income Ratio, Body Mass Index, physical activity, alcohol use, medical conditions and medications. The final imputed weighted multiple logistic regression model included possible confounders and moderators. After the creation of 20 imputation data sets from multiple chained regression sequences, machine learning boosted regression initially identified 21 biomarkers associated with depression. Using traditional logistic regression methods, including controlling for possible confounders and moderators, a final set of three biomarkers were selected. The final three biomarkers from the novel hybrid variable selection methodology were red cell distribution width (OR 1.15; 95% CI 1.01, 1.30), serum glucose (OR 1.01; 95% CI 1.00, 1.01) and total bilirubin (OR 0.12; 95% CI 0.05, 0.28). Significant interactions were found between total bilirubin with Mexican American/Hispanic group (p = 0.016), and current smokers (p<0.001). 
The systematic use of a hybrid methodology for variable selection, fusing data mining techniques using a machine learning algorithm with traditional statistical modelling, accounted for missing data and complex survey sampling methodology and was demonstrated to be a useful tool for detecting three biomarkers associated with depression for future hypothesis generation: red cell distribution width, serum glucose and total bilirubin.

Multiplicative Forests for Continuous-Time Processes

PubMed Central

Weiss, Jeremy C.; Natarajan, Sriraam; Page, David

2013-01-01

Learning temporal dependencies between variables over continuous time is an important and challenging task. Continuous-time Bayesian networks effectively model such processes but are limited by the number of conditional intensity matrices, which grows exponentially in the number of parents per variable. We develop a partition-based representation using regression trees and forests whose parameter spaces grow linearly in the number of node splits. Using a multiplicative assumption we show how to update the forest likelihood in closed form, producing efficient model updates. Our results show multiplicative forests can be learned from few temporal trajectories with large gains in performance and scalability. PMID:25284967
Multiplicative Forests for Continuous-Time Processes.

PubMed

Weiss, Jeremy C; Natarajan, Sriraam; Page, David

2012-01-01

Learning temporal dependencies between variables over continuous time is an important and challenging task. Continuous-time Bayesian networks effectively model such processes but are limited by the number of conditional intensity matrices, which grows exponentially in the number of parents per variable. We develop a partition-based representation using regression trees and forests whose parameter spaces grow linearly in the number of node splits. Using a multiplicative assumption we show how to update the forest likelihood in closed form, producing efficient model updates. Our results show multiplicative forests can be learned from few temporal trajectories with large gains in performance and scalability.
Models for predicting the mass of lime fruits by some engineering properties.

PubMed

Miraei Ashtiani, Seyed-Hassan; Baradaran Motie, Jalal; Emadi, Bagher; Aghkhani, Mohammad-Hosein

2014-11-01

Grading fruits by mass is important in packaging: it reduces waste and increases the marketing value of agricultural produce. The aim of this study was to model the mass of two major cultivars of Iranian limes based on engineering attributes. Models were classified into three groups: (1) single and multiple variable regressions of lime mass on dimensional characteristics; (2) single and multiple variable regressions of lime mass on projected areas; (3) single regression of lime mass on its actual volume and on volumes calculated assuming ellipsoid and prolate spheroid shapes. All properties considered in the current study were found to be statistically significant (p < 0.01). The results indicated that the models based on minor diameter and on first projected area were the most appropriate in the first and second classifications, respectively. In the third classification, the best model was obtained on the basis of the prolate spheroid volume. It was finally concluded that a suitable grading system for lime mass is one based on prolate spheroid volume.
Waste generated in high-rise buildings construction: a quantification model based on statistical multiple regression.

PubMed

Parisi Kern, Andrea; Ferreira Dias, Michele; Piva Kulakowski, Marlova; Paulo Gomes, Luciana

2015-05-01

Reducing construction waste is becoming a key environmental issue in the construction industry. The quantification of waste generation rates in the construction sector is an invaluable management tool in supporting mitigation actions. However, the quantification of waste can be a difficult process because of the specific characteristics and the wide range of materials used in different construction projects. Large variations are observed in the methods used to predict the amount of waste generated because of the range of variables involved in construction processes and the different contexts in which these methods are employed. This paper proposes a statistical model to determine the amount of waste generated in the construction of high-rise buildings by assessing the influence of design process and production system, often mentioned as the major culprits behind the generation of waste in construction. Multiple regression was used to conduct a case study based on multiple sources of data of eighteen residential buildings. The resulting statistical model produced dependent (i.e. amount of waste generated) and independent variables associated with the design and the production system used. The best regression model obtained from the sample data resulted in an adjusted R² value of 0.694, which means that it predicts approximately 69% of the factors involved in the generation of waste in similar constructions. Most independent variables showed a low determination coefficient when assessed in isolation, which emphasizes the importance of assessing their joint influence on the response (dependent) variable. Copyright © 2015 Elsevier Ltd. All rights reserved.
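As a rough illustration of the adjusted coefficient of determination reported above, the sketch below applies the standard adjustment for the number of predictors. The sample size of 18 buildings comes from the abstract; the number of predictors (4) and the unadjusted value (0.766) are hypothetical, since the abstract does not state them.

```python
def adjusted_r2(r2: float, n: int, p: int) -> float:
    """Adjusted R^2 for a model with p predictors (plus intercept) and n observations."""
    if n - p - 1 <= 0:
        raise ValueError("need n > p + 1 observations")
    return 1.0 - (1.0 - r2) * (n - 1) / (n - p - 1)


# n = 18 is from the abstract; p = 4 and r2 = 0.766 are assumed for illustration only.
print(round(adjusted_r2(0.766, n=18, p=4), 3))
```

With so few observations the penalty is noticeable: each extra predictor lowers the adjusted value unless it raises the unadjusted fit enough to compensate.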
Modification of the USLE K factor for soil erodibility assessment on calcareous soils in Iran

NASA Astrophysics Data System (ADS)

Ostovari, Yaser; Ghorbani-Dashtaki, Shoja; Bahrami, Hossein-Ali; Naderi, Mehdi; Dematte, Jose Alexandre M.; Kerry, Ruth

2016-11-01

The measurement of soil erodibility (K) in the field is tedious, time-consuming and expensive; therefore, its prediction through pedotransfer functions (PTFs) could be far less costly and time-consuming. The aim of this study was to develop new PTFs to estimate the K factor using multiple linear regression, Mamdani fuzzy inference systems, and artificial neural networks. For this purpose, K was measured in 40 erosion plots with natural rainfall. Various soil properties including the soil particle size distribution, calcium carbonate equivalent, organic matter, permeability, and wet-aggregate stability were measured. The results showed that the mean measured K was 0.014 t h MJ⁻¹ mm⁻¹ and 2.08 times less than the estimated mean K (0.030 t h MJ⁻¹ mm⁻¹) using the USLE model. Permeability, wet-aggregate stability, very fine sand, and calcium carbonate were selected as independent variables by forward stepwise regression in order to assess the ability of multiple linear regression, Mamdani fuzzy inference systems and artificial neural networks to predict K. The calcium carbonate equivalent, which is not accounted for in the USLE model, had a significant impact on K in multiple linear regression due to its strong influence on the stability of aggregates and soil permeability. Statistical indices in validation and calibration datasets determined that the artificial neural networks method with the highest R², lowest RMSE, and lowest ME was the best model for estimating the K factor. A strong correlation (R² = 0.81, n = 40, p < 0.05) between the estimated K from multiple linear regression and measured K indicates that the use of calcium carbonate equivalent as a predictor variable gives a better estimation of K in areas with calcareous soils.
Evaluation and prediction of shrub cover in coastal Oregon forests (USA)

Treesearch

Becky K. Kerns; Janet L. Ohmann

2004-01-01

We used data from regional forest inventories and research programs, coupled with mapped climatic and topographic information, to explore relationships and develop multiple linear regression (MLR) and regression tree models for total and deciduous shrub cover in the Oregon coastal province. Results from both types of models indicate that forest structure variables were...
Multiple linear regression analysis

NASA Technical Reports Server (NTRS)

Edwards, T. R.

1980-01-01

Program rapidly selects best-suited set of coefficients. User supplies only vectors of independent and dependent data and specifies confidence level required. Program uses stepwise statistical procedure for relating minimal set of variables to set of observations; final regression contains only most statistically significant coefficients. Program is written in FORTRAN IV for batch execution and has been implemented on NOVA 1200.
Multiple Regression with Varying Levels of Correlation among Predictors: Monte Carlo Sampling from Normal and Non-Normal Populations.

ERIC Educational Resources Information Center

Vasu, Ellen Storey

1978-01-01

The effects of the violation of the assumption of normality in the conditional distributions of the dependent variable, coupled with the condition of multicollinearity upon the outcome of testing the hypothesis that the regression coefficient equals zero, are investigated via a Monte Carlo study. (Author/JKS)
Crop status evaluations and yield predictions

NASA Technical Reports Server (NTRS)

Haun, J. R.

1975-01-01

A model was developed for predicting the day 50 percent of the wheat crop is planted in North Dakota. This model incorporates location as an independent variable. The Julian date when 50 percent of the crop was planted for the nine divisions of North Dakota for seven years was regressed on the 49 variables through the step-down multiple regression procedure. This procedure begins with all of the independent variables and sequentially removes variables that are below a predetermined level of significance after each step. The prediction equation was tested on daily data. The accuracy of the model is considered satisfactory for finding the historic dates on which to initiate yield prediction model. Growth prediction models were also developed for spring wheat.
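The step-down procedure described above can be sketched as follows. This is a minimal illustration, not the report's implementation: it uses a cutoff on the absolute t statistic as a stand-in for the "predetermined level of significance", and the function names, variable names, and data are all hypothetical.

```python
def solve_ols(X, y):
    """Least squares via the normal equations; returns (coefficients, t statistics)."""
    n, k = len(X), len(X[0])
    xtx = [[sum(X[r][i] * X[r][j] for r in range(n)) for j in range(k)] for i in range(k)]
    xty = [sum(X[r][i] * y[r] for r in range(n)) for i in range(k)]
    # Invert X'X by Gauss-Jordan elimination on the augmented matrix [X'X | I].
    aug = [xtx[i] + [1.0 if i == j else 0.0 for j in range(k)] for i in range(k)]
    for c in range(k):
        piv = max(range(c, k), key=lambda r: abs(aug[r][c]))
        aug[c], aug[piv] = aug[piv], aug[c]
        d = aug[c][c]
        aug[c] = [v / d for v in aug[c]]
        for r in range(k):
            if r != c:
                f = aug[r][c]
                aug[r] = [a - f * b for a, b in zip(aug[r], aug[c])]
    inv = [row[k:] for row in aug]
    beta = [sum(inv[i][j] * xty[j] for j in range(k)) for i in range(k)]
    resid = [y[r] - sum(X[r][j] * beta[j] for j in range(k)) for r in range(n)]
    sigma2 = sum(e * e for e in resid) / (n - k)  # residual variance estimate
    t = [beta[i] / (sigma2 * inv[i][i]) ** 0.5 for i in range(k)]
    return beta, t


def backward_eliminate(columns, y, t_min=2.0):
    """Start with all predictors; repeatedly drop the weakest one while |t| < t_min."""
    kept = list(columns)
    while kept:
        X = [[1.0] + [columns[name][r] for name in kept] for r in range(len(y))]
        _, t = solve_ols(X, y)
        weakest = min(range(len(kept)), key=lambda i: abs(t[i + 1]))  # skip intercept
        if abs(t[weakest + 1]) >= t_min:
            break
        kept.pop(weakest)
    return kept


# Hypothetical data: the response depends on "temp" only; "noise" is irrelevant.
temp = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0]
noise = [1.0, -1.0, 1.0, -1.0, 1.0, -1.0, 1.0, -1.0]
err = [0.1, -0.2, 0.15, 0.05, -0.1, 0.2, -0.15, -0.05]
y = [3.0 + 2.0 * x + e for x, e in zip(temp, err)]
print(backward_eliminate({"temp": temp, "noise": noise}, y))
```

A production version would convert each t statistic to a p-value and compare it with the chosen significance level; the fixed cutoff here only keeps the sketch within the standard library.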
Cineradiographic Examination of Articulatory Movement of Pseudo-Tongue, Hyoid, and Mandible in Congenital Aglossia

ERIC Educational Resources Information Center

McMicken, Betty; Vento-Wilson, Margaret; Von Berg, Shelley; Rogers, Kelly

2014-01-01

This research examined cineradiographic films (CRF) of articulatory movements in a person with congenital aglossia (PWCA) during speech production of four phrases. Pearson correlations and a multiple regression model investigated co-variation of independent variables, positions of mandible and hyoid; and pseudo-tongue-dependent variables,…
Military Enlistments: What Can We Learn from Geographic Variation? Technical Report 620.

ERIC Educational Resources Information Center

Brown, Charles

Some economic variables were examined that affect enlistment decisions and therefore affect the continued success of the All-Volunteer Force. The study used a multiple regression, pooled cross-section/time-series model over the 1975-1982 period, including pay, unemployment, educational benefits, and recruiting resources as independent variables.…
Job Satisfaction in Mexican Faculty: An Analysis of its Predictor Variables. ASHE Annual Meeting Paper.

ERIC Educational Resources Information Center

Galaz-Fontes, Jesus Francisco; Gil-Anton, Manuel

This study examined overall job satisfaction among college faculty in Mexico. The study used data from a 1992-93 Carnegie International Faculty Survey. Secondary multiple regression analysis identified predictor variables for several faculty subgroups. Results were interpreted by differentiating between work-related and intrinsic factors, as well…
On the Misconception of Multicollinearity in Detection of Moderating Effects: Multicollinearity Is Not Always Detrimental

ERIC Educational Resources Information Center

Shieh, Gwowen

2010-01-01

Due to its extensive applicability and computational ease, moderated multiple regression (MMR) has been widely employed to analyze interaction effects between 2 continuous predictor variables. Accordingly, considerable attention has been drawn toward the supposed multicollinearity problem between predictor variables and their cross-product term.…
A Comparison between Multiple Regression Models and CUN-BAE Equation to Predict Body Fat in Adults

PubMed Central

Fuster-Parra, Pilar; Bennasar-Veny, Miquel; Tauler, Pedro; Yañez, Aina; López-González, Angel A.; Aguiló, Antoni

2015-01-01

Background Because the accurate measure of body fat (BF) is difficult, several prediction equations have been proposed. The aim of this study was to compare different multiple regression models to predict BF, including the recently reported CUN-BAE equation. Methods Multiple regression models using body mass index (BMI) and body adiposity index (BAI) as predictors of BF will be compared. These models will be also compared with the CUN-BAE equation. For all the analyses a sample including all the participants and another one including only the overweight and obese subjects will be considered. The BF reference measure was made using Bioelectrical Impedance Analysis. Results The simplest models including only BMI or BAI as independent variables showed that BAI is a better predictor of BF. However, adding the variable sex to both models made BMI a better predictor than the BAI. For both the whole group of participants and the group of overweight and obese participants, using simple models (BMI, age and sex as variables) allowed obtaining similar correlations with BF as when the more complex CUN-BAE was used (ρ = 0.87 vs. ρ = 0.86 for the whole sample and ρ = 0.88 vs. ρ = 0.89 for overweight and obese subjects, the second value being the one for CUN-BAE). Conclusions There are simpler models than the CUN-BAE equation that fit BF as well as CUN-BAE does. Therefore, it could be considered that CUN-BAE overfits. Using a simple linear regression model, the BAI, as the only variable, predicts BF better than BMI. However, when the sex variable is introduced, BMI becomes the indicator of choice to predict BF. PMID:25821960
A comparison between multiple regression models and CUN-BAE equation to predict body fat in adults.

PubMed

Fuster-Parra, Pilar; Bennasar-Veny, Miquel; Tauler, Pedro; Yañez, Aina; López-González, Angel A; Aguiló, Antoni

2015-01-01

Because the accurate measure of body fat (BF) is difficult, several prediction equations have been proposed. The aim of this study was to compare different multiple regression models to predict BF, including the recently reported CUN-BAE equation. Multiple regression models using body mass index (BMI) and body adiposity index (BAI) as predictors of BF will be compared. These models will be also compared with the CUN-BAE equation. For all the analyses a sample including all the participants and another one including only the overweight and obese subjects will be considered. The BF reference measure was made using Bioelectrical Impedance Analysis. The simplest models including only BMI or BAI as independent variables showed that BAI is a better predictor of BF. However, adding the variable sex to both models made BMI a better predictor than the BAI. For both the whole group of participants and the group of overweight and obese participants, using simple models (BMI, age and sex as variables) allowed obtaining similar correlations with BF as when the more complex CUN-BAE was used (ρ = 0.87 vs. ρ = 0.86 for the whole sample and ρ = 0.88 vs. ρ = 0.89 for overweight and obese subjects, the second value being the one for CUN-BAE). There are simpler models than the CUN-BAE equation that fit BF as well as CUN-BAE does. Therefore, it could be considered that CUN-BAE overfits. Using a simple linear regression model, the BAI, as the only variable, predicts BF better than BMI. However, when the sex variable is introduced, BMI becomes the indicator of choice to predict BF.
Parental education predicts change in intelligence quotient after childhood epilepsy surgery.

PubMed

Meekes, Joost; van Schooneveld, Monique M J; Braams, Olga B; Jennekens-Schinkel, Aag; van Rijen, Peter C; Hendriks, Marc P H; Braun, Kees P J; van Nieuwenhuizen, Onno

2015-04-01

To know whether change in the intelligence quotient (IQ) of children who undergo epilepsy surgery is associated with the educational level of their parents. Retrospective analysis of data obtained from a cohort of children who underwent epilepsy surgery between January 1996 and September 2010. We performed simple and multiple regression analyses to identify predictors associated with IQ change after surgery. In addition to parental education, six variables previously demonstrated to be associated with IQ change after surgery were included as predictors: age at surgery, duration of epilepsy, etiology, presurgical IQ, reduction of antiepileptic drugs, and seizure freedom. We used delta IQ (IQ 2 years after surgery minus IQ shortly before surgery) as the primary outcome variable, but also performed analyses with pre- and postsurgical IQ as outcome variables to support our findings. To validate the results we performed simple regression analysis with parental education as the predictor in specific subgroups. The sample for regression analysis included 118 children (60 male; median age at surgery 9.73 years). Parental education was significantly associated with delta IQ in simple regression analysis (p = 0.004), and also contributed significantly to postsurgical IQ in multiple regression analysis (p = 0.008). Additional analyses demonstrated that parental education made a unique contribution to prediction of delta IQ, that is, it could not be replaced by the illness-related variables. Subgroup analyses confirmed the association of parental education with IQ change after surgery for most groups. Children whose parents had higher education demonstrate on average a greater increase in IQ after surgery and a higher postsurgical--but not presurgical--IQ than children whose parents completed at most lower secondary education. 
Parental education--and perhaps other environmental variables--should be considered in the prognosis of cognitive function after childhood epilepsy surgery. Wiley Periodicals, Inc. © 2015 International League Against Epilepsy.
Determination of biodiesel content in biodiesel/diesel blends using NIR and visible spectroscopy with variable selection.

PubMed

Fernandes, David Douglas Sousa; Gomes, Adriano A; Costa, Gean Bezerra da; Silva, Gildo William B da; Véras, Germano

2011-12-15

This work evaluates the use of the visible and near-infrared (NIR) ranges, separately and combined, to determine the biodiesel content in biodiesel/diesel blends using Multiple Linear Regression (MLR) with variable selection by the Successive Projections Algorithm (SPA). Full-spectrum models employing Partial Least Squares (PLS), variable selection by Stepwise (SW) regression coupled with MLR, and PLS models with variable selection by Jack-Knife (Jk) were compared with the proposed methodology. Several preprocessing methods were evaluated; the one chosen was the Savitzky-Golay derivative with a second-order polynomial and a 17-point window for the NIR and visible-NIR ranges, with offset correction. A total of 100 blends with biodiesel content between 5 and 50% (v/v) were prepared from ten biodiesel samples. In the NIR and visible regions the best model was SPA-MLR, using only two and eight wavelengths with RMSEP of 0.6439% (v/v) and 0.5741, respectively, while in the visible-NIR region the best model was SW-MLR, using five wavelengths, with RMSEP of 0.9533% (v/v). Results indicate that both spectral ranges evaluated showed potential for developing a rapid and nondestructive method to quantify biodiesel in blends with mineral diesel. Finally, the improvement in prediction error obtained with the variable selection procedure was significant. Copyright © 2011 Elsevier B.V. All rights reserved.
Panel regressions to estimate low-flow response to rainfall variability in ungaged basins

USGS Publications Warehouse

Bassiouni, Maoya; Vogel, Richard M.; Archfield, Stacey A.

2016-01-01

Multicollinearity and omitted-variable bias are major limitations to developing multiple linear regression models to estimate streamflow characteristics in ungaged areas and varying rainfall conditions. Panel regression is used to overcome limitations of traditional regression methods, and obtain reliable model coefficients, in particular to understand the elasticity of streamflow to rainfall. Using annual rainfall and selected basin characteristics at 86 gaged streams in the Hawaiian Islands, regional regression models for three stream classes were developed to estimate the annual low-flow duration discharges. Three panel-regression structures (random effects, fixed effects, and pooled) were compared to traditional regression methods, in which space is substituted for time. Results indicated that panel regression generally was able to reproduce the temporal behavior of streamflow and reduce the standard errors of model coefficients compared to traditional regression, even for models in which the unobserved heterogeneity between streams is significant and the variance inflation factor for rainfall is much greater than 10. This is because both spatial and temporal variability were better characterized in panel regression. In a case study, regional rainfall elasticities estimated from panel regressions were applied to ungaged basins on Maui, using available rainfall projections to estimate plausible changes in surface-water availability and usable stream habitat for native species. The presented panel-regression framework is shown to offer benefits over existing traditional hydrologic regression methods for developing robust regional relations to investigate streamflow response in a changing climate.
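To illustrate why a panel structure can recover a rainfall elasticity that a pooled cross-section misses, the sketch below compares a pooled slope with a fixed-effects ("within") slope on synthetic log-log data. It assumes a single regressor and noise-free data; the stream names, intercepts, and the elasticity of 0.8 are invented for illustration and are not taken from the study, whose models also include basin characteristics and random-effects structures.

```python
def pooled_slope(x, y):
    """Ordinary least squares slope ignoring which stream each point is from."""
    mx = sum(x) / len(x)
    my = sum(y) / len(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    return sxy / sxx

def within_slope(panel):
    """Fixed-effects slope: demean x and y within each stream, then regress."""
    xd, yd = [], []
    for xs, ys in panel.values():
        mx = sum(xs) / len(xs)
        my = sum(ys) / len(ys)
        xd += [a - mx for a in xs]
        yd += [b - my for b in ys]
    return sum(a * b for a, b in zip(xd, yd)) / sum(a * a for a in xd)

# Synthetic panel: log(flow) = intercept_i + 0.8 * log(rain), where the
# stream-specific intercept is an unobserved basin effect (hypothetical values).
TRUE_ELASTICITY = 0.8
intercepts = {"stream_A": 1.0, "stream_B": -0.5, "stream_C": -2.0}
log_rain = {
    "stream_A": [1.0, 1.2, 1.4],
    "stream_B": [2.0, 2.2, 2.4],
    "stream_C": [3.0, 3.2, 3.4],
}
panel = {
    name: (xs, [intercepts[name] + TRUE_ELASTICITY * x for x in xs])
    for name, xs in log_rain.items()
}

all_x = [x for xs, _ in panel.values() for x in xs]
all_y = [y for _, ys in panel.values() for y in ys]

pooled = pooled_slope(all_x, all_y)
fixed_effects = within_slope(panel)
print("pooled slope:", pooled)
print("fixed-effects slope:", fixed_effects)
```

In this toy setup the unobserved basin effect is correlated with rainfall, so the pooled slope is badly biased, while demeaning within each stream removes the basin effect and recovers the elasticity used to generate the data.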
Panel regressions to estimate low-flow response to rainfall variability in ungaged basins

NASA Astrophysics Data System (ADS)

Bassiouni, Maoya; Vogel, Richard M.; Archfield, Stacey A.

2016-12-01

Multicollinearity and omitted-variable bias are major limitations to developing multiple linear regression models to estimate streamflow characteristics in ungaged areas and varying rainfall conditions. Panel regression is used to overcome limitations of traditional regression methods, and obtain reliable model coefficients, in particular to understand the elasticity of streamflow to rainfall. Using annual rainfall and selected basin characteristics at 86 gaged streams in the Hawaiian Islands, regional regression models for three stream classes were developed to estimate the annual low-flow duration discharges. Three panel-regression structures (random effects, fixed effects, and pooled) were compared to traditional regression methods, in which space is substituted for time. Results indicated that panel regression generally was able to reproduce the temporal behavior of streamflow and reduce the standard errors of model coefficients compared to traditional regression, even for models in which the unobserved heterogeneity between streams is significant and the variance inflation factor for rainfall is much greater than 10. This is because both spatial and temporal variability were better characterized in panel regression. In a case study, regional rainfall elasticities estimated from panel regressions were applied to ungaged basins on Maui, using available rainfall projections to estimate plausible changes in surface-water availability and usable stream habitat for native species. The presented panel-regression framework is shown to offer benefits over existing traditional hydrologic regression methods for developing robust regional relations to investigate streamflow response in a changing climate.
Using the Ridge Regression Procedures to Estimate the Multiple Linear Regression Coefficients

NASA Astrophysics Data System (ADS)

Gorgees, HazimMansoor; Mahdi, FatimahAssim

2018-05-01

This article compares the performance of different types of ordinary ridge regression estimators that have already been proposed for estimating the regression parameters when near-exact linear relationships exist among the explanatory variables. For this situation we employ data obtained from the tagi gas filling company during the period 2008-2010. The main result is that the method based on the condition number performs better than the other methods, since it has a smaller mean square error (MSE).
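As a reminder of what an ordinary ridge estimator does, the sketch below computes (X'X + kI)^-1 X'y for two nearly collinear, centered predictors and reports the condition number of X'X. The data and the value of k are hypothetical, and the abstract does not say how k was chosen in the compared methods, so a fixed k is used purely for illustration.

```python
import math

def gram_terms(x1, x2, y):
    """Entries of X'X and X'y for two centered predictors (no intercept)."""
    a = sum(v * v for v in x1)
    b = sum(u * v for u, v in zip(x1, x2))
    d = sum(v * v for v in x2)
    g1 = sum(u * v for u, v in zip(x1, y))
    g2 = sum(u * v for u, v in zip(x2, y))
    return a, b, d, g1, g2

def ridge(x1, x2, y, k):
    """Solve (X'X + k I) beta = X'y for the 2x2 case; k = 0 gives OLS."""
    a, b, d, g1, g2 = gram_terms(x1, x2, y)
    a += k
    d += k
    det = a * d - b * b
    return ((d * g1 - b * g2) / det, (a * g2 - b * g1) / det)

def condition_number(x1, x2):
    """Ratio of largest to smallest eigenvalue of the 2x2 matrix X'X."""
    a, b, d, _, _ = gram_terms(x1, x2, x1)
    mean = (a + d) / 2
    spread = math.sqrt(((a - d) / 2) ** 2 + b * b)
    return (mean + spread) / (mean - spread)

# Hypothetical centered data with two almost collinear predictors.
x1 = [-1.5, -0.5, 0.5, 1.5]
x2 = [-1.4, -0.6, 0.6, 1.4]
y = [-3.0, -1.0, 1.2, 2.8]

beta_ols = ridge(x1, x2, y, k=0.0)
beta_ridge = ridge(x1, x2, y, k=0.1)  # k chosen arbitrarily for the sketch
print("condition number:", condition_number(x1, x2))
print("OLS:", beta_ols)
print("ridge:", beta_ridge)
```

A large condition number signals near-collinearity; adding k to the diagonal shrinks the coefficient vector and trades a little bias for lower variance.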

Job Satisfaction of Female and Male Superintendents: The Influence of Job Facets and Contextual Variables as Potential Predictors

ERIC Educational Resources Information Center

Young, I. Phillip; Kowalski, Theodore J.; McCord, Robert S.; Petersen, George J.

2012-01-01

A descriptive multiple regression approach was used to assess the job satisfaction of female and male public school superintendents taking part in a decennial survey conducted by AASA. Self-reported job satisfaction of public school superintendents was regressed on their affective reactions to specific job facets (supervision, co-workers, and…
CAHOST: An Excel Workbook for Facilitating the Johnson-Neyman Technique for Two-Way Interactions in Multiple Regression.

PubMed

Carden, Stephen W; Holtzman, Nicholas S; Strube, Michael J

2017-01-01

When using multiple regression, researchers frequently wish to explore how the relationship between two variables is moderated by another variable; this is termed an interaction. Historically, two approaches have been used to probe interactions: the pick-a-point approach and the Johnson-Neyman (JN) technique. The pick-a-point approach has limitations that can be avoided using the JN technique. Currently, the software available for implementing the JN technique and creating corresponding figures lacks several desirable features, most notably ease of use and figure quality. To fill this gap in the literature, we offer a free Microsoft Excel 2013 workbook, CAHOST (a concatenation of the first two letters of the authors' last names), that allows the user to seamlessly create publication-ready figures of the results of the JN technique.
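The JN technique can be sketched as solving a quadratic for the moderator values at which the simple slope of the focal predictor sits exactly on the significance boundary. The example below is not the CAHOST workbook; it assumes a model y = b0 + b1*x + b2*z + b3*x*z, and the coefficient estimates, their variances and covariance, and the critical t value are all hypothetical inputs.

```python
import math

def johnson_neyman(b1, b3, v11, v33, v13, t_crit):
    """Moderator values z where |b1 + b3*z| / SE(b1 + b3*z) equals t_crit.

    v11 and v33 are the sampling variances of b1 and b3, and v13 is their
    covariance. Returns a sorted list of 0 or 2 boundary values.
    """
    a = b3 ** 2 - t_crit ** 2 * v33
    b = 2 * (b1 * b3 - t_crit ** 2 * v13)
    c = b1 ** 2 - t_crit ** 2 * v11
    if abs(a) < 1e-15:
        raise ValueError("degenerate case: the boundary equation is not quadratic")
    disc = b * b - 4 * a * c
    if disc < 0:
        return []  # no real boundary: significance does not change with z
    root = math.sqrt(disc)
    return sorted([(-b - root) / (2 * a), (-b + root) / (2 * a)])

# Hypothetical regression output (not from any study in this listing).
b1, b3 = 0.50, 0.30                  # slope of x, and x-by-z interaction
v11, v33, v13 = 0.04, 0.01, -0.005   # Var(b1), Var(b3), Cov(b1, b3)
t_crit = 1.98                        # approx. two-sided 5% value, roughly 120 df

bounds = johnson_neyman(b1, b3, v11, v33, v13, t_crit)
print("JN boundaries on the moderator:", bounds)
```

Whether the simple slope is significant inside or outside the two boundaries depends on the sign of the leading quadratic coefficient, so in practice the result is checked by evaluating the t ratio at a moderator value on each side.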
A multiple linear regression analysis of hot corrosion attack on a series of nickel base turbine alloys

NASA Technical Reports Server (NTRS)

Barrett, C. A.

1985-01-01

Multiple linear regression analysis was used to determine an equation for estimating hot corrosion attack for a series of Ni base cast turbine alloys. The U transform (i.e., sin^-1 of (% A/100)^(1/2)) was shown to give the best estimate of the dependent variable, y. A complete second degree equation is described for the "centered" weight chemistries for the elements Cr, Al, Ti, Mo, W, Cb, Ta, and Co. In addition, linear terms for the minor elements C, B, and Zr were added for a basic 47 term equation. The best reduced equation was determined by the stepwise selection method with essentially 13 terms. The Cr term was found to be the most important, accounting for 60 percent of the explained variability in hot corrosion attack.
DOE Office of Scientific and Technical Information (OSTI.GOV)

Penna, M.L.; Duchiade, M.P.

The authors report the results of an investigation into the possible association between air pollution and infant mortality from pneumonia in the Rio de Janeiro Metropolitan Area. This investigation employed multiple linear regression analysis (stepwise method) for infant mortality from pneumonia in 1980, including the study population's areas of residence, incomes, and pollution exposure as independent variables. With the income variable included in the regression, a statistically significant association was observed between the average annual level of particulates and infant mortality from pneumonia. While this finding should be accepted with caution, it does suggest a biological association between these variables. The authors' conclusion is that air quality indicators should be included in studies of acute respiratory infections in developing countries.
Prediction of anthropometric foot characteristics in children.

PubMed

Morrison, Stewart C; Durward, Brian R; Watt, Gordon F; Donaldson, Malcolm D C

2009-01-01

The establishment of growth reference values is needed in pediatric practice where pathologic conditions can have a detrimental effect on the growth and development of the pediatric foot. This study aims to use multiple regression to evaluate the effects of multiple predictor variables (height, age, body mass, and gender) on anthropometric characteristics of the peripubescent foot. Two hundred children aged 9 to 12 years were recruited, and three anthropometric measurements of the pediatric foot were recorded (foot length, forefoot width, and navicular height). Multiple regression analysis was conducted, and coefficients for gender, height, and body mass all had significant relationships for the prediction of forefoot width and foot length (P ≤ .05, r ≥ 0.7). The coefficients for gender and body mass were not significant for the prediction of navicular height (P ≥ .05), whereas height was (P ≤ .05). Normative growth reference values and prognostic regression equations are presented for the peripubescent foot.
The Impact of Selected Academic and Demographic Variables on Mathematics College Readiness Predicted by ACT

ERIC Educational Resources Information Center

Smith, Marcia

2013-01-01

The purpose of the study was to determine the degree to which academic and demographic variables affected the ACT results used in determining college readiness. This quantitative research study followed a non-experimental correlational design. A multiple regression was used to analyze archival data to determine the impact the combined Arkansas…
Predicting Middle School Students' Use of Web 2.0 Technologies out of School Using Home and School Technological Variables

ERIC Educational Resources Information Center

Hughes, Joan E.; Read, Michelle F.; Jones, Sara; Mahometa, Michael

2015-01-01

This study used multiple regression to identify predictors of middle school students' Web 2.0 activities out of school, a construct composed of 15 technology activities. Three middle schools participated, where sixth- and seventh-grade students completed a questionnaire. Independent predictor variables included three demographic and five computer…
Dysglycemia, Glycemic Variability, and Outcome After Cardiac Arrest and Temperature Management at 33°C and 36°C.

PubMed

Borgquist, Ola; Wise, Matt P; Nielsen, Niklas; Al-Subaie, Nawaf; Cranshaw, Julius; Cronberg, Tobias; Glover, Guy; Hassager, Christian; Kjaergaard, Jesper; Kuiper, Michael; Smid, Ondrej; Walden, Andrew; Friberg, Hans

2017-08-01

Dysglycemia and glycemic variability are associated with poor outcomes in critically ill patients. Targeted temperature management alters blood glucose homeostasis. We investigated the association between blood glucose concentrations and glycemic variability and the neurologic outcomes of patients randomized to targeted temperature management at 33°C or 36°C after cardiac arrest. Post hoc analysis of the multicenter TTM-trial. Primary outcome of this analysis was neurologic outcome after 6 months, referred to as "Cerebral Performance Category." Thirty-six sites in Europe and Australia. All 939 patients with out-of-hospital cardiac arrest of presumed cardiac cause that had been included in the TTM-trial. Targeted temperature management at 33°C or 36°C. Nonparametric tests as well as multiple logistic regression and mixed effects logistic regression models were used. Median glucose concentrations on hospital admission differed significantly between Cerebral Performance Category outcomes (p < 0.0001). Hyper- and hypoglycemia were associated with poor neurologic outcome (p = 0.001 and p = 0.054). In the multiple logistic regression models, the median glycemic level was an independent predictor of poor Cerebral Performance Category (Cerebral Performance Category, 3-5) with an odds ratio (OR) of 1.13 in the adjusted model (p = 0.008; 95% CI, 1.03-1.24). It was also a predictor in the mixed model, which served as a sensitivity analysis to adjust for the multiple time points. The proportion of hyperglycemia was higher in the 33°C group compared with the 36°C group. Higher blood glucose levels at admission and during the first 36 hours, and higher glycemic variability, were associated with poor neurologic outcome and death. More patients in the 33°C treatment arm had hyperglycemia.
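The odds ratio and interval reported above can be related to a logistic regression coefficient and its standard error. The sketch below assumes a Wald-type 95% interval, exp(beta ± 1.96·SE), which the abstract does not state explicitly; it backs out the implied coefficient and standard error from the published OR of 1.13 (95% CI, 1.03-1.24) and then rebuilds the interval as a consistency check.

```python
import math

Z_95 = 1.96  # normal quantile for a two-sided 95% interval (assumed Wald CI)

def odds_ratio_ci(beta, se, z=Z_95):
    """Odds ratio and Wald confidence limits from a logit coefficient."""
    return math.exp(beta), math.exp(beta - z * se), math.exp(beta + z * se)

def implied_coefficient(odds_ratio, lower, upper, z=Z_95):
    """Back out the logit coefficient and its SE from a reported OR and CI."""
    beta = math.log(odds_ratio)
    se = (math.log(upper) - math.log(lower)) / (2 * z)
    return beta, se

# Values as reported in the abstract for median glycemic level.
beta, se = implied_coefficient(1.13, 1.03, 1.24)
or_hat, ci_low, ci_high = odds_ratio_ci(beta, se)

print("implied beta:", beta, "implied SE:", se)
print("rebuilt OR and CI:", or_hat, ci_low, ci_high)
```

The interval is symmetric on the log-odds scale rather than on the odds-ratio scale, which is why the published limits are not equidistant from 1.13.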
Penalized regression procedures for variable selection in the potential outcomes framework

PubMed Central

Ghosh, Debashis; Zhu, Yeying; Coffman, Donna L.

2015-01-01

A recent topic of much interest in causal inference is model selection. In this article, we describe a framework in which to consider penalized regression approaches to variable selection for causal effects. The framework leads to a simple ‘impute, then select’ class of procedures that is agnostic to the type of imputation algorithm as well as penalized regression used. It also clarifies how model selection involves a multivariate regression model for causal inference problems, and that these methods can be applied for identifying subgroups in which treatment effects are homogeneous. Analogies and links with the literature on machine learning methods, missing data and imputation are drawn. A difference LASSO algorithm is defined, along with its multiple imputation analogues. The procedures are illustrated using a well-known right heart catheterization dataset. PMID:25628185
Multiple Linear Regression Analysis of Factors Affecting Real Property Price Index From Case Study Research In Istanbul/Turkey

NASA Astrophysics Data System (ADS)

Denli, H. H.; Koc, Z.

2015-12-01

Estimating real property values according to fixed standards is difficult to apply consistently across time and location. Regression analysis constructs mathematical models that describe or explain relationships that may exist between variables. The problem of identifying price differences between properties to obtain a price index can be converted into a regression problem, and standard techniques of regression analysis can be used to estimate the index. Applied to real estate valuation, with properties taken as presented in the actual market with their current characteristics and quantifiers, the method helps identify the factors or variables that are effective in forming the value. In this study, prices of housing for sale in Zeytinburnu, a district in Istanbul, are associated with housing characteristics to find a price index, based on information received from a real estate web page. The variables used for the analysis are age, size in m², number of floors in the building, floor number of the property, and number of rooms. The price of the property is the dependent variable, whereas the rest are independent variables. Prices of 60 properties were used for the analysis. Locations with the same price value were found and plotted on the map, and equivalence curves were drawn identifying the same-valued zones as lines.
Biostatistics Series Module 10: Brief Overview of Multivariate Methods.

PubMed

Hazra, Avijit; Gogtay, Nithya

2017-01-01

Multivariate analysis refers to statistical techniques that simultaneously look at three or more variables in relation to the subjects under investigation with the aim of identifying or clarifying the relationships between them. These techniques have been broadly classified as dependence techniques, which explore the relationship between one or more dependent variables and their independent predictors, and interdependence techniques, that make no such distinction but treat all variables equally in a search for underlying relationships. Multiple linear regression models a situation where a single numerical dependent variable is to be predicted from multiple numerical independent variables. Logistic regression is used when the outcome variable is dichotomous in nature. The log-linear technique models count type of data and can be used to analyze cross-tabulations where more than two variables are included. Analysis of covariance is an extension of analysis of variance (ANOVA), in which an additional independent variable of interest, the covariate, is brought into the analysis. It tries to examine whether a difference persists after "controlling" for the effect of the covariate that can impact the numerical dependent variable of interest. Multivariate analysis of variance (MANOVA) is a multivariate extension of ANOVA used when multiple numerical dependent variables have to be incorporated in the analysis. Interdependence techniques are more commonly applied to psychometrics, social sciences and market research. Exploratory factor analysis and principal component analysis are related techniques that seek to extract from a larger number of metric variables, a smaller number of composite factors or components, which are linearly related to the original variables. Cluster analysis aims to identify, in a large number of cases, relatively homogeneous groups called clusters, without prior information about the groups. 
The calculation intensive nature of multivariate analysis has so far precluded most researchers from using these techniques routinely. The situation is now changing with wider availability, and increasing sophistication of statistical software and researchers should no longer shy away from exploring the applications of multivariate methods to real-life data sets.
Anthropometric Survey of US Army Personnel (1988): Correlation Coefficients and Regression Equations. Part 5. Stepwise and Standard Multiple Regression Tables

DTIC Science & Technology

1990-05-01

0.759 0.744 0.768 0.753 106 (THUMBBR) THUMB BREADTH -0.652 -0.673 -0.539 -0.663 217 (LIPLGTHH) LIP LENGTH HEADBOARD 0.017 0.019 0.020 51 (FTBRHOR) FOOT... DEPENDENT VARIABLE: (106) THUMB BREADTH (THUMBBR) MODEL INDEPENDENT VARIABLE 1 2 3 4 5 INTERCEPT 6.621 5.016 6.267 5.697 4.528 59 (HANDCIRC) HAND... 95 (SLLSPEL) SLEEVE LENGTH: SPINE-ELBOW -0.020 -0.019 -0.018 9 (BLFTCIRC) BALL OF FOOT CIRCUMFERENCE -0.032 -0.039 106 (THUMBBR) THUMB BREADTH 0.228
The effects of multiple interpersonal traumas on psychological maladjustment of sexually abused children in Korea.

PubMed

Choi, Ji Young; Oh, Kyung Ja

2013-02-01

The purpose of the present study was to explore the effects of multiple interpersonal traumas on psychiatric diagnosis and behavior problems of sexually abused children in Korea. With 495 children (ages 4-13 years) referred to a public counseling center for sexual abuse in Korea, we found significant differences in the rate of psychiatric diagnoses (r = .23) and severity of behavioral problems (internalizing d = 0.49, externalizing d = 0.40, total d = 0.52) between children who were victims of sexual abuse only (n = 362) and youth who were victims of interpersonal trauma experiences in addition to sexual abuse (n = 133). The effects of multiple interpersonal trauma experiences on single versus multiple diagnoses remained significant in the logistic regression analysis where demographic variables, family environmental factors, sexual abuse characteristics, and postincident factors were considered together, odds ratio (OR) = 0.44, 95% confidence interval (CI) = [0.25, 0.77], p < .01. Similarly, multiple regression analyses revealed a significant effect of multiple interpersonal trauma experiences on severity of behavioral problems above and beyond all aforementioned variables (internalizing β =.12, p = .019, externalizing β = .11, p = .036, total β = .14, p =.008). The results suggested that children with multiple interpersonal traumas are clearly at a greater risk for negative consequences following sexual abuse. Copyright © 2013 International Society for Traumatic Stress Studies.
Apical root resorption in orthodontically treated adults.

PubMed

Baumrind, S; Korn, E L; Boyd, R L

1996-09-01

This study analyzed the relationship in orthodontically treated adults between upper central incisor displacement measured on lateral cephalograms and apical root resorption measured on anterior periapical x-ray films. A multiple linear regression examined incisor displacements in four directions (retraction, advancement, intrusion, and extrusion) as independent variables, attempting to account for observed differences in the dependent variable, resorption. Mean apical resorption was 1.36 mm (sd ± 1.46, n = 73). Mean horizontal displacement of the apex was -0.83 mm (sd ± 1.74, n = 67); mean vertical displacement was 0.19 mm (sd ± 1.48, n = 67). The regression coefficients for the intercept and for retraction were highly significant; those for extrusion, intrusion, and advancement were not. At the 95% confidence level, an average of 0.99 mm (se = ± 0.34) of resorption was implied in the absence of root displacement and an average of 0.49 mm (se = ± 0.14) of resorption was implied per millimeter of retraction. R² for all four directional displacement variables (DDVs) taken together was only 0.20, which implied that only a relatively small portion of the observed apical resorption could be accounted for by tooth displacement alone. In a secondary set of univariate analyses, the associations between apical resorption and each of 14 additional treatment-related variables were examined. Only Gender, Elapsed Time, and Total Apical Displacement displayed statistically significant associations with apical resorption. Additional multiple regressions were then performed in which the data for each of these three statistically significant variables were considered separately, with the data for the four directional displacement variables. The addition of information on Elapsed Time or Total Apical Displacement did not explain a significant additional portion of the variability in apical resorption. 
On the other hand, the addition of information on Gender to the information on the four directional displacement variables yielded an R² value of 0.35, which indicated that these variables taken together could account for approximately a third of the observed variability in apical resorption in this sample.
Modeling Joint Exposures and Health Outcomes for Cumulative Risk Assessment: The Case of Radon and Smoking

PubMed Central

Chahine, Teresa; Schultz, Bradley D.; Zartarian, Valerie G.; Xue, Jianping; Subramanian, SV; Levy, Jonathan I.

2011-01-01

Community-based cumulative risk assessment requires characterization of exposures to multiple chemical and non-chemical stressors, with consideration of how the non-chemical stressors may influence risks from chemical stressors. Residential radon provides an interesting case example, given its large attributable risk, effect modification due to smoking, and significant variability in radon concentrations and smoking patterns. In spite of this fact, no study to date has estimated geographic and sociodemographic patterns of both radon and smoking in a manner that would allow for inclusion of radon in community-based cumulative risk assessment. In this study, we apply multi-level regression models to explain variability in radon based on housing characteristics and geological variables, and construct a regression model predicting housing characteristics using U.S. Census data. Multi-level regression models of smoking based on predictors common to the housing model allow us to link the exposures. We estimate county-average lifetime lung cancer risks from radon ranging from 0.15 to 1.8 in 100, with high-risk clusters in areas and for subpopulations with high predicted radon and smoking rates. Our findings demonstrate the viability of screening-level assessment to characterize patterns of lung cancer risk from radon, with an approach that can be generalized to multiple chemical and non-chemical stressors. PMID:22016710
Identifying maternal and infant factors associated with newborn size in rural Bangladesh by partial least squares (PLS) regression analysis

PubMed Central

Rahman, Md. Jahanur; Shamim, Abu Ahmed; Klemm, Rolf D. W.; Labrique, Alain B.; Rashid, Mahbubur; Christian, Parul; West, Keith P.

2017-01-01

Birth weight, length and circumferences of the head, chest and arm are key measures of newborn size and health in developing countries. We assessed maternal socio-demographic factors associated with multiple measures of newborn size in a large rural population in Bangladesh using partial least squares (PLS) regression method. PLS regression, combining features from principal component analysis and multiple linear regression, is a multivariate technique with an ability to handle multicollinearity while simultaneously handling multiple dependent variables. We analyzed maternal and infant data from singletons (n = 14,506) born during a double-masked, cluster-randomized, placebo-controlled maternal vitamin A or β-carotene supplementation trial in rural northwest Bangladesh. PLS regression results identified numerous maternal factors (parity, age, early pregnancy MUAC, living standard index, years of education, number of antenatal care visits, preterm delivery and infant sex) significantly (p<0.001) associated with newborn size. Among them, preterm delivery had the largest negative influence on newborn size (Standardized β = -0.29 − -0.19; p<0.001). Scatter plots of the scores of first two PLS components also revealed an interaction between newborn sex and preterm delivery on birth size. PLS regression was found to be more parsimonious than both ordinary least squares regression and principal component regression. It also provided more stable estimates than the ordinary least squares regression and provided the effect measure of the covariates with greater accuracy as it accounts for the correlation among the covariates and outcomes. Therefore, PLS regression is recommended when either there are multiple outcome measurements in the same study, or the covariates are correlated, or both situations exist in a dataset. PMID:29261760
Identifying maternal and infant factors associated with newborn size in rural Bangladesh by partial least squares (PLS) regression analysis.

PubMed

Kabir, Alamgir; Rahman, Md Jahanur; Shamim, Abu Ahmed; Klemm, Rolf D W; Labrique, Alain B; Rashid, Mahbubur; Christian, Parul; West, Keith P

2017-01-01

Birth weight, length and circumferences of the head, chest and arm are key measures of newborn size and health in developing countries. We assessed maternal socio-demographic factors associated with multiple measures of newborn size in a large rural population in Bangladesh using partial least squares (PLS) regression method. PLS regression, combining features from principal component analysis and multiple linear regression, is a multivariate technique with an ability to handle multicollinearity while simultaneously handling multiple dependent variables. We analyzed maternal and infant data from singletons (n = 14,506) born during a double-masked, cluster-randomized, placebo-controlled maternal vitamin A or β-carotene supplementation trial in rural northwest Bangladesh. PLS regression results identified numerous maternal factors (parity, age, early pregnancy MUAC, living standard index, years of education, number of antenatal care visits, preterm delivery and infant sex) significantly (p<0.001) associated with newborn size. Among them, preterm delivery had the largest negative influence on newborn size (Standardized β = -0.29 - -0.19; p<0.001). Scatter plots of the scores of first two PLS components also revealed an interaction between newborn sex and preterm delivery on birth size. PLS regression was found to be more parsimonious than both ordinary least squares regression and principal component regression. It also provided more stable estimates than the ordinary least squares regression and provided the effect measure of the covariates with greater accuracy as it accounts for the correlation among the covariates and outcomes. Therefore, PLS regression is recommended when either there are multiple outcome measurements in the same study, or the covariates are correlated, or both situations exist in a dataset.
Regression modeling and mapping of coniferous forest basal area and tree density from discrete-return lidar and multispectral data

Treesearch

Andrew T. Hudak; Nicholas L. Crookston; Jeffrey S. Evans; Michael K. Falkowski; Alistair M. S. Smith; Paul E. Gessler; Penelope Morgan

2006-01-01

We compared the utility of discrete-return light detection and ranging (lidar) data and multispectral satellite imagery, and their integration, for modeling and mapping basal area and tree density across two diverse coniferous forest landscapes in north-central Idaho. We applied multiple linear regression models subset from a suite of 26 predictor variables derived...
Interactions among Variables Affecting Hospital Utilization

PubMed Central

Ro, Kong-kyun

1973-01-01

For purposes of developing a more refined basis for prediction of hospital utilization using readily available demographic variables, data for some 9000 patients admitted to 22 short-term general hospitals in the Pittsburgh area are analyzed to determine the relationship of age, sex, and race to hospital use. Significant differences in length of stay and number of services used are found for various combinations of these variables when a form of multiple regression is used that allows for interaction effects among the variables. PMID:4783753
Mean annual runoff and peak flow estimates based on channel geometry of streams in northeastern and western Montana

USGS Publications Warehouse

Parrett, Charles; Omang, R.J.; Hull, J.A.

1983-01-01

Equations for estimating mean annual runoff and peak discharge from measurements of channel geometry were developed for western and northeastern Montana. The study area was divided into two regions for the mean annual runoff analysis, and separate multiple-regression equations were developed for each region. The active-channel width was determined to be the most important independent variable in each region. The standard error of estimate for the estimating equation using active-channel width was 61 percent in the Northeast Region and 38 percent in the West region. The study area was divided into six regions for the peak discharge analysis, and multiple regression equations relating channel geometry and basin characteristics to peak discharges having recurrence intervals of 2, 5, 10, 25, 50 and 100 years were developed for each region. The standard errors of estimate for the regression equations using only channel width as an independent variable ranged from 35 to 105 percent. The standard errors improved in four regions as basin characteristics were added to the estimating equations. (USGS)

Modeling the energy content of combustible ship-scrapping waste at Alang-Sosiya, India, using multiple regression analysis.

PubMed

Reddy, M Srinivasa; Basha, Shaik; Joshi, H V; Sravan Kumar, V G; Jha, B; Ghosh, P K

2005-01-01

Alang-Sosiya, established in 1982, is the largest ship-scrapping yard in the world. Every year an average of 171 ships having a mean weight of 2.10 × 10^6 (± 7.82 × 10^5) light dead weight tonnage (LDT) are scrapped. Apart from scrapped metals, this yard generates a massive amount of combustible solid waste in the form of waste wood, plastic, insulation material, paper, glass wool, thermocol pieces (polyurethane foam material), sponge, oiled rope, cotton waste, rubber, etc. In this study multiple regression analysis was used to develop predictive models for the energy content of combustible ship-scrapping solid wastes. The scope of work comprised qualitative and quantitative estimation of solid waste samples and a sequential selection procedure for isolating variables. Three regression models were developed to correlate the energy content (net calorific values (LHV)) with variables derived from material composition, proximate and ultimate analyses. The performance of these models for this particular waste agrees well with the equations developed by other researchers (Dulong, Steuer, Scheurer-Kestner and Bento's) for estimating the energy content of municipal solid waste.
Sociological and economic theories of suicide: a comparison of the U.S.A. and Taiwan.

PubMed

Yang, B; Lester, D; Yang, C H

1992-02-01

Time-series analyses were carried out to explore the importance of sociological and economic variables in accounting for the suicide rate in the U.S.A. and in Taiwan for 1952-1984. Sociological variables (divorce and female labor force participation) played similar roles in the multiple regressions for both nations while economic variables (GNP per capita/growth and unemployment) played a role only in the U.S.A.
Evaluation of drainage-area ratio method used to estimate streamflow for the Red River of the North Basin, North Dakota and Minnesota

USGS Publications Warehouse

Emerson, Douglas G.; Vecchia, Aldo V.; Dahl, Ann L.

2005-01-01

The drainage-area ratio method commonly is used to estimate streamflow for sites where no streamflow data were collected. To evaluate the validity of the drainage-area ratio method and to determine if an improved method could be developed to estimate streamflow, a multiple-regression technique was used to determine if drainage area, main channel slope, and precipitation were significant variables for estimating streamflow in the Red River of the North Basin. A separate regression analysis was performed for streamflow for each of three seasons-- winter, spring, and summer. Drainage area and summer precipitation were the most significant variables. However, the regression equations generally overestimated streamflows for North Dakota stations and underestimated streamflows for Minnesota stations. To correct the bias in the residuals for the two groups of stations, indicator variables were included to allow both the intercept and the coefficient for the logarithm of drainage area to depend on the group. Drainage area was the only significant variable in the revised regression equations. The exponents for the drainage-area ratio were 0.85 for the winter season, 0.91 for the spring season, and 1.02 for the summer season.
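A minimal sketch of how the seasonal exponents reported above might be applied. The abstract does not print the estimating equation, so the form used here (flow at the ungaged site equals flow at the gaged site times the drainage-area ratio raised to an exponent) is an assumption, and the function name and example numbers are hypothetical.

```python
# Seasonal exponents reported in the abstract above.
SEASONAL_EXPONENTS = {"winter": 0.85, "spring": 0.91, "summer": 1.02}


def drainage_area_ratio_estimate(q_gaged, area_gaged, area_ungaged, exponent=1.0):
    """Transfer streamflow from a gaged site to an ungaged site.

    Assumed form: Q_ungaged = Q_gaged * (A_ungaged / A_gaged) ** exponent.
    An exponent of 1.0 gives the unadjusted drainage-area ratio method.
    """
    if area_gaged <= 0 or area_ungaged <= 0:
        raise ValueError("drainage areas must be positive")
    return q_gaged * (area_ungaged / area_gaged) ** exponent


# Hypothetical example: gaged flow of 100 (any consistent flow unit),
# gaged area 50 and ungaged area 100 (same area unit).
for season, exponent in SEASONAL_EXPONENTS.items():
    estimate = drainage_area_ratio_estimate(100.0, 50.0, 100.0, exponent)
    print(f"{season}: {estimate:.1f}")
```

With an exponent below 1, doubling the drainage area less than doubles the estimated flow, which is the direction of the winter and spring adjustments.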
Meteorological adjustment of yearly mean values for air pollutant concentration comparison

NASA Technical Reports Server (NTRS)

Sidik, S. M.; Neustadter, H. E.

1976-01-01

Using multiple linear regression analysis, models which estimate mean concentrations of Total Suspended Particulate (TSP), sulfur dioxide, and nitrogen dioxide as a function of several meteorologic variables, two rough economic indicators, and a simple trend in time are studied. Meteorologic data were obtained and do not include inversion heights. The goodness of fit of the estimated models is partially reflected by the squared coefficient of multiple correlation which indicates that, at the various sampling stations, the models accounted for about 23 to 47 percent of the total variance of the observed TSP concentrations. If the resulting model equations are used in place of simple overall means of the observed concentrations, there is about a 20 percent improvement in either: (1) predicting mean concentrations for specified meteorological conditions; or (2) adjusting successive yearly averages to allow for comparisons devoid of meteorological effects. An application to source identification is presented using regression coefficients of wind velocity predictor variables.
Effect of partition board color on mood and autonomic nervous function.

PubMed

Sakuragi, Sokichi; Sugiyama, Yoshiki

2011-12-01

The purpose of this study was to evaluate the effects of the presence or absence (control) of a partition board and its color (red, yellow, blue) on subjective mood ratings and changes in autonomic nervous system indicators induced by a video game task. The increase in the mean Profile of Mood States (POMS) Fatigue score and mean Oppressive feeling rating after the task was lowest with the blue partition board. Multiple-regression analysis identified oppressive feeling and error scores on the second half of the task as statistically significant contributors to Fatigue. When the explanatory variables were limited to the physiological indices, multiple-regression analysis identified a significant contribution of autonomic reactivity (assessed by heart rate variability) to Fatigue. These results suggest that a blue partition board would reduce task-induced subjective fatigue, in part by lowering the oppressive feeling of being enclosed during the task, possibly by increasing autonomic reactivity.
High-level language ability in healthy individuals and its relationship with verbal working memory.

PubMed

Antonsson, Malin; Longoni, Francesca; Einald, Christina; Hallberg, Lina; Kurt, Gabriella; Larsson, Kajsa; Nilsson, Tina; Hartelius, Lena

2016-01-01

The aims of the study were to investigate healthy subjects' performance on a clinical test of high-level language (HLL) and how it is related to demographic characteristics and verbal working memory (VWM). One hundred healthy subjects (20-79 years old) were assessed with the Swedish BeSS test (Laakso, Brunnegård, Hartelius, & Ahlsén, 2000) and two digit span tasks. Relationships between the demographic variables, VWM and BeSS were investigated both with bivariate correlations and multiple regression analysis. The results present the norms for BeSS. The correlations and multiple regression analysis show that demographic variables had limited influence on test performance. Measures of VWM were moderately related to total BeSS score and weakly to moderately correlated with five of the seven subtests. To conclude, education has an influence on the test as a whole but measures of VWM stood out as the most robust predictor of HLL.
Gender interactions and success.

PubMed

Wiggins, Carla; Peterson, Teri

2004-01-01

Does gender by itself, or does gender's interaction with career variables, better explain the difference between women's and men's careers in healthcare management? US healthcare managers were surveyed regarding career and personal experiences. Gender was statistically interacted with explanatory variables. Multiple regression with backwards selection systematically removed non-significant variables. All gender interaction variables were non-significant. Much of the literature proposes that work and career factors impact working women differently than working men. We find that while gender alone is a significant predictor of income, it does not significantly interact with other career variables.
Estimating basin lagtime and hydrograph-timing indexes used to characterize stormflows for runoff-quality analysis

USGS Publications Warehouse

Granato, Gregory E.

2012-01-01

A nationwide study to better define triangular-hydrograph statistics for use with runoff-quality and flood-flow studies was done by the U.S. Geological Survey (USGS) in cooperation with the Federal Highway Administration. Although the triangular hydrograph is a simple linear approximation, the cumulative distribution of stormflow with a triangular hydrograph is a curvilinear S-curve that closely approximates the cumulative distribution of stormflows from measured data. The temporal distribution of flow within a runoff event can be estimated using the basin lagtime (which is the time from the centroid of rainfall excess to the centroid of the corresponding runoff hydrograph) and the hydrograph recession ratio (which is the ratio of the duration of the falling limb to the rising limb of the hydrograph). This report documents results of the study, methods used to estimate the variables, and electronic files that facilitate calculation of variables. Ten viable multiple-linear regression equations were developed to estimate basin lagtimes from readily determined drainage basin properties using data published in 37 stormflow studies. Regression equations using the basin lag factor (BLF, which is a variable calculated as the main-channel length, in miles, divided by the square root of the main-channel slope, in feet per mile) and two variables describing development in the drainage basin were selected as the best candidates, because each equation explains about 70 percent of the variability in the data. The variables describing development are the USGS basin development factor (BDF, which is a function of the amount of channel modifications, storm sewers, and curb-and-gutter streets in a basin) and the total impervious area variable (IMPERV) in the basin. Two datasets were used to develop regression equations. The primary dataset included data from 493 sites that have values for the BLF, BDF, and IMPERV variables. This dataset was used to develop the best-fit regression equation using the BLF and BDF variables. The secondary dataset included data from 896 sites that have values for the BLF and IMPERV variables. This dataset was used to develop the best-fit regression equation using the BLF and IMPERV variables. Analysis of hydrograph recession ratios and basin characteristics for 41 sites indicated that recession ratios are random variables. Thus, recession ratios cannot be estimated quantitatively using multiple linear regression equations developed using the data available for these sites. The minimums of recession ratios for different streamgages are well characterized by a value of one. The most probable values and maximum values of recession ratios for different streamgages are, however, more variable than the minimums. The most probable values of recession ratios for the 41 streamgages analyzed ranged from 1.0 to 3.52 and had a median of 1.85. The maximum values ranged from 2.66 to 11.3 and had a median of 4.36.
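The two quantities defined in this abstract can be sketched in a few lines. The basin lag factor follows the definition given above; the cumulative-volume function is plain triangle geometry and is offered only as an illustration of why a triangular hydrograph yields an S-shaped cumulative curve. Function names and the example inputs are hypothetical, and the regression equations themselves are not reproduced because the abstract does not give their coefficients.

```python
import math


def basin_lag_factor(channel_length_mi, channel_slope_ft_per_mi):
    """BLF = main-channel length (mi) / sqrt(main-channel slope (ft/mi))."""
    if channel_slope_ft_per_mi <= 0:
        raise ValueError("slope must be positive")
    return channel_length_mi / math.sqrt(channel_slope_ft_per_mi)


def triangular_cumulative_fraction(t, rise_time, recession_ratio):
    """Fraction of total stormflow volume that has passed by time t.

    The hydrograph rises linearly to its peak at rise_time, then falls
    linearly to zero over recession_ratio * rise_time.
    """
    fall_time = recession_ratio * rise_time
    total_time = rise_time + fall_time
    if t <= 0:
        return 0.0
    if t >= total_time:
        return 1.0
    if t <= rise_time:
        # Area of the small triangle under the rising limb, over total area.
        return t * t / (rise_time * total_time)
    # One minus the area still remaining under the falling limb.
    remaining = total_time - t
    return 1.0 - remaining * remaining / (fall_time * total_time)


# Hypothetical basin: 4-mile main channel with a slope of 16 ft/mi.
print("BLF:", basin_lag_factor(4.0, 16.0))

# S-curve for a unit rise time and the median recession ratio (1.85)
# reported for the most probable values in the abstract.
for step in range(0, 7):
    t = 0.5 * step
    print(t, round(triangular_cumulative_fraction(t, 1.0, 1.85), 3))
```

The fraction grows quadratically on the rising limb and flattens quadratically on the falling limb, so the curve is convex and then concave.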
Prediction of performance on the RCMP physical ability requirement evaluation.

PubMed

Stanish, H I; Wood, T M; Campagna, P

1999-08-01

The Royal Canadian Mounted Police use the Physical Ability Requirement Evaluation (PARE) for screening applicants. The purposes of this investigation were to identify those field tests of physical fitness that were associated with PARE performance and determine which most accurately classified successful and unsuccessful PARE performers. The participants were 27 female and 21 male volunteers. Testing included measures of aerobic power, anaerobic power, agility, muscular strength, muscular endurance, and body composition. Multiple regression analysis revealed a three-variable model for males (70-lb bench press, standing long jump, and agility) explaining 79% of the variability in PARE time, whereas a one-variable model (agility) explained 43% of the variability for females. Analysis of the classification accuracy of the males' data was prohibited because 91% of the males passed the PARE. Classification accuracy of the females' data, using logistic regression, produced a two-variable model (agility, 1.5-mile endurance run) with 93% overall classification accuracy.
Cognitive and Behavioural Correlates of Non-Adherence to HIV Anti-Retroviral Therapy: Theoretical and Practical Insight for Clinical Psychology and Health Psychology

ERIC Educational Resources Information Center

Begley, Kim; McLaws, Mary-Louise; Ross, Michael W.; Gold, Julian

2008-01-01

This cross-sectional study identified variables associated with protease inhibitor (PI) non-adherence in 179 patients taking anti-retroviral therapy. Univariate analyses identified 11 variables associated with PI non-adherence. Multiple logistic regression modelling identified three predictors of PI non-adherence: low adherence self-efficacy and…
Using a Market Ratio Factor in Faculty Salary Equity Studies. Professional File Number 103, Spring 2007

ERIC Educational Resources Information Center

Luna, Andrew L.

2007-01-01

This study used two multiple regression analyses to develop an explanatory model to determine which model might best explain faculty salaries. The central purpose of the study was to determine if using a single market ratio variable was a stronger predictor for faculty salaries than the use of dummy variables representing various disciplines.…
Predictors of Dropout by Female Obese Patients Treated with a Group Cognitive Behavioral Therapy to Promote Weight Loss.

PubMed

Sawamoto, Ryoko; Nozaki, Takehiro; Furukawa, Tomokazu; Tanahashi, Tokusei; Morita, Chihiro; Hata, Tomokazu; Komaki, Gen; Sudo, Nobuyuki

2016-01-01

To investigate predictors of dropout from a group cognitive behavioral therapy (CBT) intervention for overweight or obese women. 119 overweight and obese Japanese women aged 25-65 years who attended an outpatient weight loss intervention were followed throughout the 7-month weight loss phase. Somatic characteristics, socioeconomic status, obesity-related diseases, diet and exercise habits, and psychological variables (depression, anxiety, self-esteem, alexithymia, parenting style, perfectionism, and eating attitude) were assessed at baseline. Significant variables, extracted by univariate statistical analysis, were then used as independent variables in a stepwise multiple logistic regression analysis with dropout as the dependent variable. 90 participants completed the weight loss phase, giving a dropout rate of 24.4%. The multiple logistic regression analysis demonstrated that compared to completers the dropouts had significantly stronger body shape concern, tended to not have jobs, perceived their mothers to be less caring, and were more disorganized in temperament. Of all these factors, the best predictor of dropout was shape concern. Shape concern, job condition, parenting care, and organization predicted dropout from the group CBT weight loss intervention for overweight or obese Japanese women. © 2016 S. Karger GmbH, Freiburg.
Predictors of Dropout by Female Obese Patients Treated with a Group Cognitive Behavioral Therapy to Promote Weight Loss

PubMed Central

Sawamoto, Ryoko; Nozaki, Takehiro; Furukawa, Tomokazu; Tanahashi, Tokusei; Morita, Chihiro; Hata, Tomokazu; Komaki, Gen; Sudo, Nobuyuki

2016-01-01

Objective: To investigate predictors of dropout from a group cognitive behavioral therapy (CBT) intervention for overweight or obese women. Methods: 119 overweight and obese Japanese women aged 25-65 years who attended an outpatient weight loss intervention were followed throughout the 7-month weight loss phase. Somatic characteristics, socioeconomic status, obesity-related diseases, diet and exercise habits, and psychological variables (depression, anxiety, self-esteem, alexithymia, parenting style, perfectionism, and eating attitude) were assessed at baseline. Significant variables, extracted by univariate statistical analysis, were then used as independent variables in a stepwise multiple logistic regression analysis with dropout as the dependent variable. Results: 90 participants completed the weight loss phase, giving a dropout rate of 24.4%. The multiple logistic regression analysis demonstrated that compared to completers the dropouts had significantly stronger body shape concern, tended to not have jobs, perceived their mothers to be less caring, and were more disorganized in temperament. Of all these factors, the best predictor of dropout was shape concern. Conclusion: Shape concern, job condition, parenting care, and organization predicted dropout from the group CBT weight loss intervention for overweight or obese Japanese women. PMID:26745715
Bayesian LASSO, scale space and decision making in association genetics.

PubMed

Pasanen, Leena; Holmström, Lasse; Sillanpää, Mikko J

2015-01-01

LASSO is a penalized regression method that facilitates model fitting in situations where there are as many, or even more explanatory variables than observations, and only a few variables are relevant in explaining the data. We focus on the Bayesian version of LASSO and consider four problems that need special attention: (i) controlling false positives, (ii) multiple comparisons, (iii) collinearity among explanatory variables, and (iv) the choice of the tuning parameter that controls the amount of shrinkage and the sparsity of the estimates. The particular application considered is association genetics, where LASSO regression can be used to find links between chromosome locations and phenotypic traits in a biological organism. However, the proposed techniques are relevant also in other contexts where LASSO is used for variable selection. We separate the true associations from false positives using the posterior distribution of the effects (regression coefficients) provided by Bayesian LASSO. We propose to solve the multiple comparisons problem by using simultaneous inference based on the joint posterior distribution of the effects. Bayesian LASSO also tends to distribute an effect among collinear variables, making detection of an association difficult. We propose to solve this problem by considering not only individual effects but also their functionals (i.e. sums and differences). Finally, whereas in Bayesian LASSO the tuning parameter is often regarded as a random variable, we adopt a scale space view and consider a whole range of fixed tuning parameters, instead. The effect estimates and the associated inference are considered for all tuning parameters in the selected range and the results are visualized with color maps that provide useful insights into data and the association problem considered. The methods are illustrated using two sets of artificial data and one real data set, all representing typical settings in association genetics.
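To illustrate how a tuning parameter controls both shrinkage and sparsity, the sketch below uses the classical (non-Bayesian) LASSO in the special case of an orthonormal design, where minimizing (1/2)||y - Xb||^2 + penalty * ||b||_1 reduces to soft-thresholding each least-squares coefficient. This is not the Bayesian procedure of the paper; it only mirrors the idea of inspecting a whole range of fixed tuning parameters. The coefficient values and function names are hypothetical.

```python
def soft_threshold(coefficient, penalty):
    """Shrink a coefficient toward zero by `penalty`, clipping at zero."""
    if coefficient > penalty:
        return coefficient - penalty
    if coefficient < -penalty:
        return coefficient + penalty
    return 0.0


def lasso_path(ols_coefficients, penalties):
    """LASSO estimates for each penalty, assuming an orthonormal design."""
    return {
        penalty: [soft_threshold(b, penalty) for b in ols_coefficients]
        for penalty in penalties
    }


# Hypothetical least-squares effects for three explanatory variables.
ols = [3.0, -0.5, 1.2]
for penalty, estimates in lasso_path(ols, [0.0, 0.5, 1.0, 2.0]).items():
    nonzero = sum(1 for b in estimates if b != 0.0)
    print(f"penalty={penalty}: estimates={estimates}, nonzero={nonzero}")
```

Larger penalties set more coefficients exactly to zero, which is why the choice of tuning parameter doubles as a variable selection decision.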
Assessing the impact of local meteorological variables on surface ozone in Hong Kong during 2000-2015 using quantile and multiple line regression models

NASA Astrophysics Data System (ADS)

Zhao, Wei; Fan, Shaojia; Guo, Hai; Gao, Bo; Sun, Jiaren; Chen, Laiguo

2016-11-01

The quantile regression (QR) method has been increasingly introduced to atmospheric environmental studies to explore the non-linear relationship between local meteorological conditions and ozone mixing ratios. In this study, we applied QR for the first time, together with multiple linear regression (MLR), to analyze the dominant meteorological parameters influencing the mean, 10th percentile, 90th percentile and 99th percentile of maximum daily 8-h average (MDA8) ozone concentrations in 2000-2015 in Hong Kong. The dominance analysis (DA) was used to assess the relative importance of meteorological variables in the regression models. Results showed that the MLR models worked better at suburban and rural sites than at urban sites, and worked better in winter than in summer. QR models performed better in summer for the 99th and 90th percentiles and performed better in autumn and winter for the 10th percentile. QR models also performed better in suburban and rural areas for the 10th percentile. The top 3 dominant variables associated with MDA8 ozone concentrations, changing with seasons and regions, were frequently associated with the six meteorological parameters: boundary layer height, humidity, wind direction, surface solar radiation, total cloud cover and sea level pressure. Temperature rarely became a significant variable in any season, which could partly explain the peak of monthly average ozone concentrations in October in Hong Kong. We also found that the effect of solar radiation was enhanced during extreme ozone pollution episodes (i.e., the 99th percentile). Finally, meteorological effects on MDA8 ozone had no significant changes before and after the 2010 Asian Games.
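Quantile regression differs from MLR in the loss it minimizes: instead of squared error it uses the asymmetric "pinball" (check) loss, whose minimizer is a conditional quantile rather than a conditional mean. The sketch below shows this for the simplest possible model, a single constant, using made-up values rather than the Hong Kong ozone data. Function names are hypothetical.

```python
def pinball_loss(observed, predicted, tau):
    """Total pinball loss of a constant prediction at quantile level tau."""
    total = 0.0
    for y in observed:
        residual = y - predicted
        # Under-predictions cost tau per unit, over-predictions cost 1 - tau.
        total += tau * residual if residual >= 0 else (tau - 1.0) * residual
    return total


def best_constant(observed, tau):
    """Return the observed value that minimizes the pinball loss.

    For a constant model, a minimizer can always be found among the data
    points, and it is a sample quantile at level tau.
    """
    return min(observed, key=lambda c: pinball_loss(observed, c, tau))


# Made-up sample with one extreme value.
sample = [1, 2, 3, 4, 100]
print("tau=0.5:", best_constant(sample, 0.5))
print("tau=0.9:", best_constant(sample, 0.9))
```

Raising tau moves the fitted value toward the upper tail, which is how a QR model can target the 90th or 99th percentile of ozone rather than its mean.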
Multiple regression analysis of anthropometric measurements influencing the cephalic index of male Japanese university students.

PubMed

Hossain, Md Golam; Saw, Aik; Alam, Rashidul; Ohtsuki, Fumio; Kamarul, Tunku

2013-09-01

Cephalic index (CI), the ratio of head breadth to head length, is widely used to categorise human populations. The aim of this study was to assess the impact of anthropometric measurements on the CI of male Japanese university students. This study included 1,215 male university students from Tokyo and Kyoto, selected using convenience sampling. Multiple regression analysis was used to determine the effect of anthropometric measurements on CI. The variance inflation factor (VIF) showed no evidence of a multicollinearity problem among independent variables. The coefficients of the regression line demonstrated a significant positive relationship between CI and minimum frontal breadth (p < 0.01), bizygomatic breadth (p < 0.01) and head height (p < 0.05), and a negative relationship between CI and morphological facial height (p < 0.01) and head circumference (p < 0.01). Moreover, the coefficient and odds ratio of logistic regression analysis showed a greater likelihood for minimum frontal breadth (p < 0.01) and bizygomatic breadth (p < 0.01) to predict round-headedness, and morphological facial height (p < 0.05) and head circumference (p < 0.01) to predict long-headedness. Stepwise regression analysis revealed bizygomatic breadth, head circumference, minimum frontal breadth, head height and morphological facial height to be the best predictor craniofacial measurements with respect to CI. The results suggest that most of the variables considered in this study appear to influence the CI of adult male Japanese students.
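The variance inflation factor for predictor j is 1 / (1 - R_j^2), where R_j^2 comes from regressing predictor j on the remaining predictors. With only two predictors, R_j^2 is simply their squared Pearson correlation, which keeps the sketch below short. The measurements are invented for illustration and are not the study's data; the function names are hypothetical.

```python
def pearson_r(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / (sxx * syy) ** 0.5


def vif_two_predictors(x1, x2):
    """VIF shared by both predictors in a two-predictor model.

    Raises ZeroDivisionError if the predictors are perfectly collinear.
    """
    r_squared = pearson_r(x1, x2) ** 2
    return 1.0 / (1.0 - r_squared)


# Invented data: the second pair is nearly collinear, the first is not.
print(vif_two_predictors([1, 2, 3, 4], [1, -1, -1, 1]))
print(vif_two_predictors([1, 2, 3, 4], [2, 4, 6, 9]))
```

A VIF near 1 indicates little shared variance between predictors; analysts often treat values above roughly 5 to 10 as a warning sign, though that cutoff is a convention rather than a rule.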
Modeling brook trout presence and absence from landscape variables using four different analytical methods

USGS Publications Warehouse

Steen, Paul J.; Passino-Reader, Dora R.; Wiley, Michael J.

2006-01-01

As a part of the Great Lakes Regional Aquatic Gap Analysis Project, we evaluated methodologies for modeling associations between fish species and habitat characteristics at a landscape scale. To do this, we created brook trout Salvelinus fontinalis presence and absence models based on four different techniques: multiple linear regression, logistic regression, neural networks, and classification trees. The models were tested in two ways: by application to an independent validation database and cross-validation using the training data, and by visual comparison of statewide distribution maps with historically recorded occurrences from the Michigan Fish Atlas. Although differences in the accuracy of our models were slight, the logistic regression model predicted with the least error, followed by multiple regression, then classification trees, then the neural networks. These models will provide natural resource managers a way to identify habitats requiring protection for the conservation of fish species.
Flood-frequency characteristics of Wisconsin streams

USGS Publications Warehouse

Walker, John F.; Peppler, Marie C.; Danz, Mari E.; Hubbard, Laura E.

2017-05-22

Flood-frequency characteristics for 360 gaged sites on unregulated rural streams in Wisconsin are presented for percent annual exceedance probabilities ranging from 0.2 to 50 using a statewide skewness map developed for this report. Equations of the relations between flood-frequency and drainage-basin characteristics were developed by multiple-regression analyses. Flood-frequency characteristics for ungaged sites on unregulated, rural streams can be estimated by use of the equations presented in this report. The State was divided into eight areas of similar physiographic characteristics. The most significant basin characteristics are drainage area, soil saturated hydraulic conductivity, main-channel slope, and several land-use variables. The standard error of prediction for the equation for the 1-percent annual exceedance probability flood ranges from 56 to 70 percent for Wisconsin streams; these values are larger than results presented in previous reports. The increase in the standard error of prediction is likely due to increased variability of the annual-peak discharges, resulting in increased variability in the magnitude of flood peaks at higher frequencies. For each of the unregulated rural streamflow-gaging stations, a weighted estimate based on the at-site log Pearson type III analysis and the multiple regression results was determined. The weighted estimate generally has a lower uncertainty than either the log Pearson type III or multiple regression estimates. For regulated streams, a graphical method for estimating flood-frequency characteristics was developed from the relations of discharge and drainage area for selected annual exceedance probabilities. Graphs for the major regulated streams in Wisconsin are presented in the report.
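The abstract does not say how the at-site and regression estimates were weighted. One common choice, assumed here purely for illustration, is inverse-variance weighting of two independent estimates (typically applied to log-transformed discharges), which always yields a combined variance no larger than either input. The function name and numbers below are hypothetical.

```python
def weighted_estimate(estimate_a, variance_a, estimate_b, variance_b):
    """Inverse-variance weighted mean of two independent estimates.

    Returns (combined_estimate, combined_variance). Assumes the two
    estimates are independent and both variances are positive.
    """
    weight_a = 1.0 / variance_a
    weight_b = 1.0 / variance_b
    combined = (weight_a * estimate_a + weight_b * estimate_b) / (weight_a + weight_b)
    combined_variance = 1.0 / (weight_a + weight_b)
    return combined, combined_variance


# Hypothetical log10 discharges: at-site estimate 3.00 (variance 0.04)
# and regression estimate 3.20 (variance 0.09).
estimate, variance = weighted_estimate(3.00, 0.04, 3.20, 0.09)
print(round(estimate, 3), round(variance, 4))
```

The combined value sits closer to the more precise input, and its variance is below both input variances, which is consistent with the abstract's remark that the weighted estimate generally has lower uncertainty.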
Genetic Programming Transforms in Linear Regression Situations

NASA Astrophysics Data System (ADS)

Castillo, Flor; Kordon, Arthur; Villa, Carlos

The chapter summarizes the use of Genetic Programming (GP) in Multiple Linear Regression (MLR) to address multicollinearity and Lack of Fit (LOF). The basis of the proposed method is applying appropriate input transforms (model respecification) that deal with these issues while preserving the information content of the original variables. The transforms are selected from symbolic regression models with optimal trade-off between accuracy of prediction and expressional complexity, generated by multiobjective Pareto-front GP. The chapter includes a comparative study of the GP-generated transforms with Ridge Regression, a variant of ordinary Multiple Linear Regression, which has been a useful and commonly employed approach for reducing multicollinearity. The advantages of GP-generated model respecification are clearly defined and demonstrated. Some recommendations for transforms selection are given as well. The application benefits of the proposed approach are illustrated with a real industrial application in one of the broadest empirical modeling areas in manufacturing - robust inferential sensors. The chapter contributes to increasing the awareness of the potential of GP in statistical model building by MLR.
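For readers unfamiliar with the ridge comparator mentioned above, here is a minimal sketch of ridge regression for two centered predictors, solved in closed form from the penalized normal equations. It does not reproduce the chapter's GP transforms or its data; the function names and example values are hypothetical, and predictors are centered but not standardized, which is a simplification.

```python
def center(values):
    """Subtract the mean from every value."""
    mean = sum(values) / len(values)
    return [v - mean for v in values]


def ridge_two_predictors(x1, x2, y, penalty):
    """Ridge coefficients for two centered predictors (intercept omitted).

    Solves (X'X + penalty * I) b = X'y for a 2-column X. A penalty of
    0 gives ordinary least squares.
    """
    x1c, x2c, yc = center(x1), center(x2), center(y)
    s11 = sum(a * a for a in x1c) + penalty
    s22 = sum(b * b for b in x2c) + penalty
    s12 = sum(a * b for a, b in zip(x1c, x2c))
    s1y = sum(a * c for a, c in zip(x1c, yc))
    s2y = sum(b * c for b, c in zip(x2c, yc))
    det = s11 * s22 - s12 * s12
    b1 = (s22 * s1y - s12 * s2y) / det
    b2 = (s11 * s2y - s12 * s1y) / det
    return b1, b2


# Invented, nearly collinear predictors.
x1 = [1, 2, 3, 4, 5]
x2 = [1.1, 1.9, 3.2, 3.9, 5.1]
y = [2.0, 4.1, 6.1, 7.9, 10.2]
for penalty in (0.0, 1.0, 10.0):
    print(penalty, ridge_two_predictors(x1, x2, y, penalty))
```

Adding the penalty to the diagonal keeps the system well conditioned when the predictors are nearly collinear, at the cost of shrinking the coefficients toward zero.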
Stature estimation from the lengths of the growing foot-a study on North Indian adolescents.

PubMed

Krishan, Kewal; Kanchan, Tanuj; Passi, Neelam; DiMaggio, John A

2012-12-01

Stature estimation is considered one of the basic parameters of the investigation process in unknown and commingled human remains in medico-legal case work. Race, age and sex are the other parameters which help in this process. Stature estimation is of the utmost importance as it completes the biological profile of a person along with the other three parameters of identification. The present research is intended to formulate standards for stature estimation from foot dimensions in adolescent males from North India and study the pattern of foot growth during the growing years. 154 male adolescents from the Northern part of India were included in the study. Besides stature, five anthropometric measurements that included the length of the foot from each toe (T1, T2, T3, T4, and T5 respectively) to pternion were measured on each foot. The data were analyzed statistically using Student's t-test, Pearson's correlation, and linear and multiple regression analysis for estimation of stature and growth of foot during ages 13-18 years. Correlations between stature and all the foot measurements were found to be positive and highly significant. Linear regression models and multiple regression models (with age as a co-variable) were derived for estimation of stature from the different measurements of the foot. Multiple regression models (with age as a co-variable) estimate stature with greater accuracy than the linear regression models for the 13-18 years age group. The study shows the growth pattern of feet in North Indian adolescents and indicates that anthropometric measurements of the foot and its segments are valuable in estimation of stature in growing individuals of that population. Copyright © 2012 Elsevier Ltd. All rights reserved.

Confounding Problems in Multifactor AOV When Using Several Organismic Variables of Limited Reliability

ERIC Educational Resources Information Center

Games, Paul A.

1975-01-01

A brief introduction is presented on how multiple regression and linear model techniques can handle data analysis situations that most educators and psychologists think of as appropriate for analysis of variance. (Author/BJG)
Variable selection in near-infrared spectroscopy: benchmarking of feature selection methods on biodiesel data.

PubMed

Balabin, Roman M; Smirnov, Sergey V

2011-04-29

During the past several years, near-infrared (near-IR/NIR) spectroscopy has increasingly been adopted as an analytical tool in various fields from petroleum to biomedical sectors. The NIR spectrum (above 4000 cm⁻¹) of a sample is typically measured by modern instruments at a few hundred wavelengths. Recently, considerable effort has been directed towards developing procedures to identify variables (wavelengths) that contribute useful information. Variable selection (VS) or feature selection, also called frequency selection or wavelength selection, is a critical step in data analysis for vibrational spectroscopy (infrared, Raman, or NIRS). In this paper, we compare the performance of 16 different feature selection methods for the prediction of properties of biodiesel fuel, including density, viscosity, methanol content, and water concentration. The feature selection algorithms tested include stepwise multiple linear regression (MLR-step), interval partial least squares regression (iPLS), backward iPLS (BiPLS), forward iPLS (FiPLS), moving window partial least squares regression (MWPLS), (modified) changeable size moving window partial least squares (CSMWPLS/MCSMWPLSR), searching combination moving window partial least squares (SCMWPLS), successive projections algorithm (SPA), uninformative variable elimination (UVE, including UVE-SPA), simulated annealing (SA), back-propagation artificial neural networks (BP-ANN), Kohonen artificial neural network (K-ANN), and genetic algorithms (GAs, including GA-iPLS). Two linear techniques for calibration model building, namely multiple linear regression (MLR) and partial least squares regression/projection to latent structures (PLS/PLSR), are used for the evaluation of biofuel properties. A comparison with a non-linear calibration model, artificial neural networks (ANN-MLP), is also provided. Discussion of gasoline, ethanol-gasoline (bioethanol), and diesel fuel data is presented. The results obtained with other spectroscopic techniques, such as Raman, ultraviolet-visible (UV-vis), or nuclear magnetic resonance (NMR) spectroscopy, can be greatly improved by an appropriate choice of feature selection method. Copyright © 2011 Elsevier B.V. All rights reserved.
Normality of Residuals Is a Continuous Variable, and Does Seem to Influence the Trustworthiness of Confidence Intervals: A Response to, and Appreciation of, Williams, Grajales, and Kurkiewicz (2013)

ERIC Educational Resources Information Center

Osborne, Jason W.

2013-01-01

Osborne and Waters (2002) focused on checking some of the assumptions of multiple linear regression. In a critique of that paper, Williams, Grajales, and Kurkiewicz correctly clarify that regression models estimated using ordinary least squares require the assumption of normally distributed errors, but not the assumption of normally distributed…
Influence of the Separation of Prescription and Dispensation of Medicine on Its Cost in Japanese Prefectures

PubMed Central

Yokoi, Masayuki; Tashiro, Takao

2014-01-01

We studied how the separation of dispensing and prescribing of medicines between pharmacies and clinics (the “separation system”) can reduce internal medicine costs. To do so, we obtained publicly available data by searching electronic databases and official web pages of the Japanese government and non-profit public service corporations on the Internet. For Japanese medical institutions, participation in the separation system is optional. Consequently, the expansion rate of the separation system for each of the administrative districts is highly variable. The data were subjected to multiple regression analysis; daily internal medicines were the objective variable and expansion rate of the separation system was the explanatory variable. A multiple regression analysis revealed that the expansion rate of the separation system and the rate of replacing brand name medicine with generic medicine showed a significant negative partial correlation with daily internal medicine costs. Thus, the separation system was as effective in reducing medicine costs as the use of generic medicines. Because of its medical economic efficiency, the separation system should be expanded, especially in Asian countries in which the system is underdeveloped. PMID:24999122
Influence of the separation of prescription and dispensation of medicine on its cost in Japanese prefectures.

PubMed

Yokoi, Masayuki; Tashiro, Takao

2014-04-07

We studied how the separation of dispensing and prescribing of medicines between pharmacies and clinics (the "separation system") can reduce internal medicine costs. To do so, we obtained publicly available data by searching electronic databases and official web pages of the Japanese government and non-profit public service corporations on the Internet. For Japanese medical institutions, participation in the separation system is optional. Consequently, the expansion rate of the separation system for each of the administrative districts is highly variable. The data were subjected to multiple regression analysis; daily internal medicines were the objective variable and expansion rate of the separation system was the explanatory variable. A multiple regression analysis revealed that the expansion rate of the separation system and the rate of replacing brand name medicine with generic medicine showed a significant negative partial correlation with daily internal medicine costs. Thus, the separation system was as effective in reducing medicine costs as the use of generic medicines. Because of its medical economic efficiency, the separation system should be expanded, especially in Asian countries in which the system is underdeveloped.
A method for fitting regression splines with varying polynomial order in the linear mixed model.

PubMed

Edwards, Lloyd J; Stewart, Paul W; MacDougall, James E; Helms, Ronald W

2006-02-15

The linear mixed model has become a widely used tool for longitudinal analysis of continuous variables. The use of regression splines in these models offers the analyst additional flexibility in the formulation of descriptive analyses, exploratory analyses and hypothesis-driven confirmatory analyses. We propose a method for fitting piecewise polynomial regression splines with varying polynomial order in the fixed effects and/or random effects of the linear mixed model. The polynomial segments are explicitly constrained by side conditions for continuity and some smoothness at the points where they join. By using a reparameterization of this explicitly constrained linear mixed model, an implicitly constrained linear mixed model is constructed that simplifies implementation of fixed-knot regression splines. The proposed approach is relatively simple, handles splines in one variable or multiple variables, and can be easily programmed using existing commercial software such as SAS or S-plus. The method is illustrated using two examples: an analysis of longitudinal viral load data from a study of subjects with acute HIV-1 infection and an analysis of 24-hour ambulatory blood pressure profiles.
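To make the continuity side condition concrete, here is a minimal sketch of a generic truncated power basis for a fixed-knot regression spline. It is not the authors' reparameterization and does not vary the polynomial order between segments; the function names, knot location, and coefficients are assumptions chosen for illustration.

```python
def spline_basis(x, knots, degree=1):
    """Truncated power basis: 1, x, ..., x^degree, then (x - k)_+^degree per knot.

    Assumes degree >= 1, so each knot term is zero at its knot and the
    fitted curve is continuous there by construction.
    """
    row = [x ** p for p in range(degree + 1)]
    row += [max(0.0, x - k) ** degree for k in knots]
    return row

def evaluate(coefs, x, knots, degree=1):
    """Evaluate the spline with the given coefficients at x."""
    return sum(c * b for c, b in zip(coefs, spline_basis(x, knots, degree)))

# Hypothetical coefficients: slope 2 before the knot at x = 5, slope -1 after it.
coefs = [1.0, 2.0, -3.0]
for x in (4.999, 5.0, 5.001, 6.0):
    print(x, evaluate(coefs, x, knots=[5.0]))
```

Because the knot term enters as an ordinary column of the design matrix, a basis like this can be supplied to any linear or mixed-model fitting routine for the fixed or random effects.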
Modeling relationships between catchment attributes and river water quality in southern catchments of the Caspian Sea.

PubMed

Hasani Sangani, Mohammad; Jabbarian Amiri, Bahman; Alizadeh Shabani, Afshin; Sakieh, Yousef; Ashrafi, Sohrab

2015-04-01

Increasing land utilization through diverse forms of human activities, such as agriculture, forestry, urban growth, and industrial development, has led to negative impacts on the water quality of rivers. To find out how catchment attributes, such as land use, hydrologic soil groups, and lithology, can affect water quality variables (Ca(2+), Mg(2+), Na(+), Cl(-), HCO3(-), pH, TDS, EC, SAR), a spatio-statistical approach was applied to 23 catchments in southern basins of the Caspian Sea. All input data layers (digital maps of land use, soil, and lithology) were prepared using geographic information system (GIS) and spatial analysis. Relationships between water quality variables and catchment attributes were then examined by Spearman rank correlation tests and multiple linear regression. Stepwise approach-based multiple linear regressions were developed to examine the relationship between catchment attributes and water quality variables. The areas (%) of marl, tuff, or diorite, as well as those of good-quality rangeland and bare land, had negative effects on all water quality variables, while those of basalt and forest land cover were found to contribute to improved river water quality. Moreover, lithological variables showed the greatest potential for predicting the mean concentration values of water quality variables, and measures of EC and TDS were inversely associated with the area (%) of urban land use.
Biomechanical, anthropometric, and psychological determinants of barbell back squat strength.

PubMed

Vigotsky, Andrew D; Bryanton, Megan A; Nuckols, Greg; Beardsley, Chris; Contreras, Bret; Evans, Jessica; Schoenfeld, Brad J

2018-02-27

Previous investigations of strength have only focused on biomechanical or psychological determinants, while ignoring the potential interplay and relative contributions of these variables. The purpose of this study was to investigate the relative contributions of biomechanical, anthropometric, and psychological variables to the prediction of maximum parallel barbell back squat strength. Twenty-one college-aged participants (male = 14; female = 7; age = 23 ± 3 years) reported to the laboratory for two visits. The first visit consisted of anthropometric, psychometric, and parallel barbell back squat one-repetition maximum (1RM) testing. On the second visit, participants performed isometric dynamometry testing for the knee, hip, and spinal extensors in a sticking point position-specific manner. Multiple linear regression and correlations were used to investigate the combined and individual relationships between biomechanical, anthropometric, and psychological variables and squat 1RM. Multiple regression revealed only one statistically predictive determinant: fat free mass normalized to height (standardized estimate ± SE = 0.6 ± 0.3; t(16) = 2.28; p = 0.037). Correlation coefficients for individual variables and squat 1RM ranged from r = -0.79 to r = 0.83, with biomechanical, anthropometric, experiential, and sex predictors showing the strongest relationships, and psychological variables displaying the weakest relationships. These data suggest that back squat strength in a heterogeneous population is multifactorial and more related to physical than to psychological variables.
Statistically extracted fundamental watershed variables for estimating the loads of total nitrogen in small streams

USGS Publications Warehouse

Kronholm, Scott C.; Capel, Paul D.; Terziotti, Silvia

2016-01-01

Accurate estimation of total nitrogen loads is essential for evaluating conditions in the aquatic environment. Extrapolation of estimates beyond measured streams will greatly expand our understanding of total nitrogen loading to streams. Recursive partitioning and random forest regression were used to assess 85 geospatial, environmental, and watershed variables across 636 small (<585 km2) watersheds to determine which variables are fundamentally important to the estimation of annual loads of total nitrogen. Initial analysis led to the splitting of watersheds into three groups based on predominant land use (agricultural, developed, and undeveloped). Nitrogen application, agricultural and developed land area, and impervious or developed land in the 100-m stream buffer were commonly extracted variables by both recursive partitioning and random forest regression. A series of multiple linear regression equations utilizing the extracted variables were created and applied to the watersheds. As few as three variables explained as much as 76 % of the variability in total nitrogen loads for watersheds with predominantly agricultural land use. Catchment-scale national maps were generated to visualize the total nitrogen loads and yields across the USA. The estimates provided by these models can inform water managers and help identify areas where more in-depth monitoring may be beneficial.
Predicting MHC-II binding affinity using multiple instance regression

PubMed Central

EL-Manzalawy, Yasser; Dobbs, Drena; Honavar, Vasant

2011-01-01

Reliably predicting the ability of antigen peptides to bind to major histocompatibility complex class II (MHC-II) molecules is an essential step in developing new vaccines. Uncovering the amino acid sequence correlates of the binding affinity of MHC-II binding peptides is important for understanding pathogenesis and immune response. The task of predicting MHC-II binding peptides is complicated by the significant variability in their length. Most existing computational methods for predicting MHC-II binding peptides focus on identifying a nine amino acids core region in each binding peptide. We formulate the problems of qualitatively and quantitatively predicting flexible length MHC-II peptides as multiple instance learning and multiple instance regression problems, respectively. Based on this formulation, we introduce MHCMIR, a novel method for predicting MHC-II binding affinity using multiple instance regression. We present results of experiments using several benchmark datasets that show that MHCMIR is competitive with the state-of-the-art methods for predicting MHC-II binding peptides. An online web server that implements the MHCMIR method for MHC-II binding affinity prediction is freely accessible at http://ailab.cs.iastate.edu/mhcmir. PMID:20855923
Multiple-Shrinkage Multinomial Probit Models with Applications to Simulating Geographies in Public Use Data.

PubMed

Burgette, Lane F; Reiter, Jerome P

2013-06-01

Multinomial outcomes with many levels can be challenging to model. Information typically accrues slowly with increasing sample size, yet the parameter space expands rapidly with additional covariates. Shrinking all regression parameters towards zero, as often done in models of continuous or binary response variables, is unsatisfactory, since setting parameters equal to zero in multinomial models does not necessarily imply "no effect." We propose an approach to modeling multinomial outcomes with many levels based on a Bayesian multinomial probit (MNP) model and a multiple shrinkage prior distribution for the regression parameters. The prior distribution encourages the MNP regression parameters to shrink toward a number of learned locations, thereby substantially reducing the dimension of the parameter space. Using simulated data, we compare the predictive performance of this model against two other recently-proposed methods for big multinomial models. The results suggest that the fully Bayesian, multiple shrinkage approach can outperform these other methods. We apply the multiple shrinkage MNP to simulating replacement values for areal identifiers, e.g., census tract indicators, in order to protect data confidentiality in public use datasets.
High school science enrollment of black students

NASA Astrophysics Data System (ADS)

Goggins, Ellen O.; Lindbeck, Joy S.

How can the high school science enrollment of black students be increased? School and home counseling and classroom procedures could benefit from variables identified as predictors of science enrollment. The problem in this study was to identify a set of variables which characterize science course enrollment by black secondary students. The population consisted of a subsample of 3963 black high school seniors from The High School and Beyond 1980 Base-Year Survey. Using multiple linear regression, backward regression, and correlation analyses, the US Census regions and grades mostly As and Bs in English were found to be significant predictors of the number of science courses scheduled by black seniors.
ERP correlates of word production predictors in picture naming: a trial by trial multiple regression analysis from stimulus onset to response.

PubMed

Valente, Andrea; Bürki, Audrey; Laganaro, Marina

2014-01-01

A major effort in cognitive neuroscience of language is to define the temporal and spatial characteristics of the core cognitive processes involved in word production. One approach consists in studying the effects of linguistic and pre-linguistic variables in picture naming tasks. So far, studies have analyzed event-related potentials (ERPs) during word production by examining one or two variables with factorial designs. Here we extended this approach by investigating simultaneously the effects of multiple theoretically relevant predictors in a picture naming task. High density EEG was recorded from 31 participants during overt naming of 100 pictures. ERPs were extracted on a trial by trial basis from picture onset to 100 ms before the onset of articulation. Mixed-effects regression models were conducted to examine which variables affected production latencies and the duration of periods of stable electrophysiological patterns (topographic maps). Results revealed an effect of a pre-linguistic variable, visual complexity, on an early period of stable electric field at scalp, from 140 to 180 ms after picture presentation, a result consistent with the proposal that this time period is associated with visual object recognition processes. Three other variables, word Age of Acquisition, Name Agreement, and Image Agreement, influenced response latencies and modulated ERPs from ~380 ms to the end of the analyzed period. These results demonstrate that a topographic analysis fitted into the single trial ERPs and covering the entire processing period allows one to associate the cost generated by psycholinguistic variables to the duration of specific stable electrophysiological processes and to pinpoint the precise time-course of multiple word production predictors at once.
The Influence of the Student Mobility Rate on the Graduation Rate in the State of New Jersey

ERIC Educational Resources Information Center

Ross, Lavetta S.

2016-01-01

This study examined the influence of the student mobility rate on the high school graduation rate of schools in the state of New Jersey. Variables found to have an influence on the graduation rate in the extant literature were evaluated and reported. The analysis included multiple and hierarchical regression models for school variables (i.e.,…
Study of process variables associated with manufacturing hermetically-sealed nickel-cadmium cells

NASA Technical Reports Server (NTRS)

Miller, L.; Doan, D. J.; Carr, E. S.

1971-01-01

A program to determine and study the critical process variables associated with the manufacture of aerospace, hermetically-sealed, nickel-cadmium cells is described. The determination and study of the process variables associated with the positive and negative plaque impregnation/polarization process are emphasized. The experimental data resulting from the implementation of fractional factorial design experiments are analyzed by means of a linear multiple regression analysis technique. This analysis permits the selection of preferred levels for certain process variables to achieve desirable impregnated plaque characteristics.
Independence of heritable influences on the food intake of free-living humans.

PubMed

de Castro, John M

2002-01-01

The time of day of meal ingestion, the number of people present at the meal, the subjective state of hunger, and the estimated before-meal contents in the stomach have been established as influences on the amount eaten in a meal and these influences have been shown to be heritable. Because these factors intercorrelate, the calculated heritabilities for some of these variables might result indirectly from their covariation with one of the other heritable variables. The independence of the heritability of the influence of these four factors was investigated with 110 identical and 102 fraternal same-sex and 53 fraternal mixed-sex adult twin pairs who were paid to maintain 7-d food-intake diaries. From the diary reports, the meal sizes were calculated and subjected to multiple regression analysis using the estimated before-meal stomach contents, the reported number of other people present, the subjective hunger ratings, and the time of day of the meal as predictors. Linear structural modeling was applied to the beta-coefficients from the multiple regression to investigate whether the heritability of the influences of these four variables was independent. Significant genetic effects were found for the beta-coefficients for all four variables, indicating that the heritability of their relationship with intake is to some extent independent and heritable. This suggests that influences of multiple factors on intake are influenced by the genes and become part of the total package of genetically determined physiologic, sociocultural, and psychological processes that regulate energy balance.
Heritability of diurnal changes in food intake in free-living humans.

PubMed

de Castro, J M

2001-09-01

The time of day of meal ingestion, the number of people present at the meal, the subjective state of hunger, and the estimated before-meal contents in the stomach have been established as influences on the amount eaten in a meal, and this influence has been shown to be heritable. Because these factors intercorrelate, the possibility that the calculated heritabilities for some of these variables could result indirectly from their covariation with one of the other heritable variables was assessed. The independence of the heritability of the influence of these four factors was investigated with 110 identical and 102 fraternal same-sex and 53 fraternal mixed-sex adult twin pairs who were paid to maintain 7-d food intake diaries. From the diary reports, the meal sizes were calculated and subjected to multiple regression analysis using the estimated before-meal stomach contents, the reported number of other people present, the subjective hunger ratings, and the time of day of the meal as predictors. Linear structural modeling was applied to the beta coefficients from the multiple regression to investigate whether the heritability of the influences of these four variables was independent. Significant genetic effects were found for the beta coefficients for all four variables, indicating that the heritability of their relationship with intake is to some extent heritable. These results suggest that the influences of multiple factors on intake are influenced by the genes and become part of the total package of genetically determined physiologic, sociocultural, and psychological processes that regulate energy balance.
Predicting location of recurrence using FDG, FLT, and Cu-ATSM PET in canine sinonasal tumors treated with radiotherapy

NASA Astrophysics Data System (ADS)

Bradshaw, Tyler; Fu, Rau; Bowen, Stephen; Zhu, Jun; Forrest, Lisa; Jeraj, Robert

2015-07-01

Dose painting relies on the ability of functional imaging to identify resistant tumor subvolumes to be targeted for additional boosting. This work assessed the ability of FDG, FLT, and Cu-ATSM PET imaging to predict the locations of residual FDG PET in canine tumors following radiotherapy. Nineteen canines with spontaneous sinonasal tumors underwent PET/CT imaging with radiotracers FDG, FLT, and Cu-ATSM prior to hypofractionated radiotherapy. Therapy consisted of 10 fractions of 4.2 Gy to the sinonasal cavity with or without an integrated boost of 0.8 Gy to the GTV. Patients had an additional FLT PET/CT scan after fraction 2, a Cu-ATSM PET/CT scan after fraction 3, and follow-up FDG PET/CT scans after radiotherapy. Following image registration, simple and multiple linear and logistic voxel regressions were performed to assess how well pre- and mid-treatment PET imaging predicted post-treatment FDG uptake. R2 and pseudo R2 were used to assess the goodness of fits. For simple linear regression models, regression coefficients for all pre- and mid-treatment PET images were significantly positive across the population (P < 0.05). However, there was large variability among patients in goodness of fits: R2 ranged from 0.00 to 0.85, with a median of 0.12. Results for logistic regression models were similar. Multiple linear regression models resulted in better fits (median R2 = 0.31), but there was still large variability between patients in R2. The R2 from regression models for different predictor variables were highly correlated across patients (R ≈ 0.8), indicating tumors that were poorly predicted with one tracer were also poorly predicted by other tracers. In conclusion, the high inter-patient variability in goodness of fits indicates that PET was able to predict locations of residual tumor in some patients, but not others. This suggests not all patients would be good candidates for dose painting based on a single biological target.
Predicting location of recurrence using FDG, FLT, and Cu-ATSM PET in canine sinonasal tumors treated with radiotherapy.

PubMed

Bradshaw, Tyler; Fu, Rau; Bowen, Stephen; Zhu, Jun; Forrest, Lisa; Jeraj, Robert

2015-07-07

Dose painting relies on the ability of functional imaging to identify resistant tumor subvolumes to be targeted for additional boosting. This work assessed the ability of FDG, FLT, and Cu-ATSM PET imaging to predict the locations of residual FDG PET in canine tumors following radiotherapy. Nineteen canines with spontaneous sinonasal tumors underwent PET/CT imaging with radiotracers FDG, FLT, and Cu-ATSM prior to hypofractionated radiotherapy. Therapy consisted of 10 fractions of 4.2 Gy to the sinonasal cavity with or without an integrated boost of 0.8 Gy to the GTV. Patients had an additional FLT PET/CT scan after fraction 2, a Cu-ATSM PET/CT scan after fraction 3, and follow-up FDG PET/CT scans after radiotherapy. Following image registration, simple and multiple linear and logistic voxel regressions were performed to assess how well pre- and mid-treatment PET imaging predicted post-treatment FDG uptake. R(2) and pseudo R(2) were used to assess the goodness of fits. For simple linear regression models, regression coefficients for all pre- and mid-treatment PET images were significantly positive across the population (P < 0.05). However, there was large variability among patients in goodness of fits: R(2) ranged from 0.00 to 0.85, with a median of 0.12. Results for logistic regression models were similar. Multiple linear regression models resulted in better fits (median R(2) = 0.31), but there was still large variability between patients in R(2). The R(2) from regression models for different predictor variables were highly correlated across patients (R ≈ 0.8), indicating tumors that were poorly predicted with one tracer were also poorly predicted by other tracers. In conclusion, the high inter-patient variability in goodness of fits indicates that PET was able to predict locations of residual tumor in some patients, but not others. This suggests not all patients would be good candidates for dose painting based on a single biological target.
Climatological Modeling of Monthly Air Temperature and Precipitation in Egypt through GIS Techniques

NASA Astrophysics Data System (ADS)

El Kenawy, A.

2009-09-01

This paper describes a method for modeling and mapping four climatic variables (maximum temperature, minimum temperature, mean temperature and total precipitation) in Egypt using a multiple regression approach implemented in a GIS environment. In this model, a set of variables including latitude, longitude, elevation within a distance of 5, 10 and 15 km, slope, aspect, distance to the Mediterranean Sea, distance to the Red Sea, distance to the Nile, ratio between land and water masses within a radius of 5, 10 and 15 km, the Normalized Difference Vegetation Index (NDVI), the Normalized Difference Water Index (NDWI), the Normalized Difference Temperature Index (NDTI) and reflectance are included as independent variables. These variables were integrated as raster layers in MiraMon software at a spatial resolution of 1 km. Climatic variables were considered as dependent variables and averaged from 39 quality-controlled and homogenized series distributed across the entire country during the period 1957-2006. For each climatic variable, digital and objective maps were finally obtained using the multiple regression coefficients at monthly, seasonal and annual timescales. The accuracy of these maps was assessed through cross-validation between predicted and observed values using a set of statistics including the coefficient of determination (R2), root mean square error (RMSE), mean absolute error (MAE), mean bias error (MBE) and Willmott's D statistic. These maps are valuable in terms of their spatial resolution as well as the number of observatories involved in the current analysis.
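The validation statistics named in this abstract can be computed from paired observed and predicted values as sketched below. Two things are assumptions here: R2 is taken as the squared Pearson correlation, and D is taken as Willmott's original index of agreement, because the abstract does not state which variants were used. The sample values are invented.

```python
import math

def validation_stats(observed, predicted):
    """Return R2, RMSE, MAE, MBE and Willmott's D for paired values."""
    n = len(observed)
    errors = [p - o for o, p in zip(observed, predicted)]
    o_mean = sum(observed) / n
    p_mean = sum(predicted) / n
    mbe = sum(errors) / n                       # mean bias error (sign kept)
    mae = sum(abs(e) for e in errors) / n       # mean absolute error
    sse = sum(e * e for e in errors)
    rmse = math.sqrt(sse / n)                   # root mean square error
    cov = sum((o - o_mean) * (p - p_mean) for o, p in zip(observed, predicted))
    var_o = sum((o - o_mean) ** 2 for o in observed)
    var_p = sum((p - p_mean) ** 2 for p in predicted)
    r2 = cov * cov / (var_o * var_p)            # squared Pearson correlation
    denom = sum((abs(p - o_mean) + abs(o - o_mean)) ** 2
                for o, p in zip(observed, predicted))
    d = 1.0 - sse / denom                       # Willmott's index of agreement
    return {"R2": r2, "RMSE": rmse, "MAE": mae, "MBE": mbe, "D": d}

# Invented station values (illustration only).
print(validation_stats([10.0, 12.0, 14.0, 16.0], [11.0, 12.0, 15.0, 16.0]))
```

MBE keeps the sign of the error, so it reveals systematic over- or under-prediction that RMSE and MAE hide.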

Estimating Interaction Effects With Incomplete Predictor Variables

PubMed Central

Enders, Craig K.; Baraldi, Amanda N.; Cham, Heining

2014-01-01

The existing missing data literature does not provide a clear prescription for estimating interaction effects with missing data, particularly when the interaction involves a pair of continuous variables. In this article, we describe maximum likelihood and multiple imputation procedures for this common analysis problem. We outline 3 latent variable model specifications for interaction analyses with missing data. These models apply procedures from the latent variable interaction literature to analyses with a single indicator per construct (e.g., a regression analysis with scale scores). We also discuss multiple imputation for interaction effects, emphasizing an approach that applies standard imputation procedures to the product of 2 raw score predictors. We thoroughly describe the process of probing interaction effects with maximum likelihood and multiple imputation. For both missing data handling techniques, we outline centering and transformation strategies that researchers can implement in popular software packages, and we use a series of real data analyses to illustrate these methods. Finally, we use computer simulations to evaluate the performance of the proposed techniques. PMID:24707955
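The centering and probing steps mentioned above can be sketched for the simplest case. This sketch assumes complete data, so it does not implement the maximum likelihood or multiple imputation procedures the article describes. It only shows how centered predictors and their product enter the design matrix, and how a simple slope is read off the coefficients. All names and numbers are hypothetical.

```python
def center(values):
    """Subtract the sample mean from every value."""
    mean = sum(values) / len(values)
    return [v - mean for v in values]

def interaction_design(x, z):
    """Design rows (intercept, x_c, z_c, x_c * z_c) from centered predictors."""
    xc, zc = center(x), center(z)
    return [(1.0, a, b, a * b) for a, b in zip(xc, zc)]

def simple_slope(b_x, b_xz, z_value):
    """Slope of y on x at a chosen (centered) value of the moderator z."""
    return b_x + b_xz * z_value

# Hypothetical raw scores and regression coefficients.
rows = interaction_design([1.0, 2.0, 3.0], [2.0, 4.0, 6.0])
print(rows)
print(simple_slope(b_x=0.5, b_xz=0.2, z_value=2.0))
```

After centering, the coefficient on x is the slope at the mean of z, which is why the simple slope at other moderator values adds the product-term coefficient times the centered z value.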
Simultaneous estimation of transcutaneous bilirubin, hemoglobin, and melanin based on diffuse reflectance spectroscopy

NASA Astrophysics Data System (ADS)

Nishidate, Izumi; Abdul, Wares MD.; Ohtsu, Mizuki; Nakano, Kazuya; Haneishi, Hideaki

2018-02-01

We propose a method to estimate transcutaneous bilirubin, hemoglobin, and melanin based on diffuse reflectance spectroscopy. In the proposed method, Monte Carlo simulation-based multiple regression analysis for an absorbance spectrum in the visible wavelength region (460-590 nm) is used to specify the concentrations of bilirubin (Cbil), oxygenated hemoglobin (Coh), deoxygenated hemoglobin (Cdh), and melanin (Cm). Using the absorbance spectrum calculated from the measured diffuse reflectance spectrum as a response variable and the extinction coefficients of bilirubin, oxygenated hemoglobin, deoxygenated hemoglobin, and melanin as predictor variables, multiple regression analysis provides regression coefficients. Concentrations of bilirubin, oxygenated hemoglobin, deoxygenated hemoglobin, and melanin are then determined from the regression coefficients using conversion vectors that are numerically deduced in advance by Monte Carlo simulations of light transport in skin. Total hemoglobin concentration (Cth) and tissue oxygen saturation (StO2) are simply calculated from the oxygenated hemoglobin and deoxygenated hemoglobin. In vivo animal experiments with bile duct ligation in rats demonstrated that the estimated Cbil increased after ligation of the bile duct and reached around 20 mg/dl at 72 h after the onset of the ligation, which corresponds to the reference value of Cbil measured by a commercially available transcutaneous bilirubin meter. We also performed in vivo experiments with rats while varying the fraction of inspired oxygen (FiO2). Coh and Cdh decreased and increased, respectively, as FiO2 decreased. Consequently, StO2 decreased dramatically. The results of this study indicate the potential of the method for simultaneous evaluation of multiple chromophores in skin tissue.
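The final step described above, deriving Cth and StO2 from the two hemoglobin concentrations, can be sketched with the conventional definitions: total hemoglobin is the sum, and oxygen saturation is the oxygenated fraction expressed as a percentage. The abstract does not spell the formulas out, so these definitions are an assumption, and the input values are invented.

```python
def total_hb_and_sto2(c_oh, c_dh):
    """Return (Cth, StO2 in percent) from oxygenated and deoxygenated hemoglobin."""
    c_th = c_oh + c_dh
    if c_th <= 0:
        raise ValueError("total hemoglobin must be positive")
    return c_th, 100.0 * c_oh / c_th

# Invented concentrations in arbitrary units (illustration only).
print(total_hb_and_sto2(3.0, 1.0))
```

Under these definitions, the reported fall in Coh and rise in Cdh as FiO2 decreases lowers StO2 even when Cth stays roughly constant.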
Quantile regression models of animal habitat relationships

USGS Publications Warehouse

Cade, Brian S.

2003-01-01

Typically, all factors that limit an organism are not measured and included in statistical models used to investigate relationships with their environment. If important unmeasured variables interact multiplicatively with the measured variables, the statistical models often will have heterogeneous response distributions with unequal variances. Quantile regression is an approach for estimating the conditional quantiles of a response variable distribution in the linear model, providing a more complete view of possible causal relationships between variables in ecological processes. Chapter 1 introduces quantile regression and discusses the ordering characteristics, interval nature, sampling variation, weighting, and interpretation of estimates for homogeneous and heterogeneous regression models. Chapter 2 evaluates performance of quantile rankscore tests used for hypothesis testing and constructing confidence intervals for linear quantile regression estimates (0 ≤ τ ≤ 1). A permutation F test maintained better Type I errors than the Chi-square T test for models with smaller n, greater number of parameters p, and more extreme quantiles τ. Both versions of the test required weighting to maintain correct Type I errors when there was heterogeneity under the alternative model. An example application related trout densities to stream channel width:depth. Chapter 3 evaluates a drop in dispersion, F-ratio like permutation test for hypothesis testing and constructing confidence intervals for linear quantile regression estimates (0 ≤ τ ≤ 1). Chapter 4 simulates from a large (N = 10,000) finite population representing grid areas on a landscape to demonstrate various forms of hidden bias that might occur when the effect of a measured habitat variable on some animal was confounded with the effect of another unmeasured variable (spatially and not spatially structured). Depending on whether interactions of the measured habitat and unmeasured variable were negative (interference interactions) or positive (facilitation interactions), either upper (τ > 0.5) or lower (τ < 0.5) quantile regression parameters were less biased than mean rate parameters. Sampling (n = 20 - 300) simulations demonstrated that confidence intervals constructed by inverting rankscore tests provided valid coverage of these biased parameters. Quantile regression was used to estimate effects of physical habitat resources on a bivalve mussel (Macomona liliana) in a New Zealand harbor by modeling the spatial trend surface as a cubic polynomial of location coordinates.
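The idea behind quantile regression can be illustrated with its loss function. The sketch below uses an intercept-only model, which is a deliberate simplification and is not the rankscore or permutation machinery evaluated in the chapters above. Minimizing the asymmetric check loss over a single constant recovers a sample quantile. Function names and data are hypothetical.

```python
def check_loss(residual, tau):
    """Check (pinball) loss: weight tau above the fit, 1 - tau below it."""
    return residual * (tau - (1.0 if residual < 0 else 0.0))

def total_loss(y, constant, tau):
    """Summed check loss when every observation is predicted by one constant."""
    return sum(check_loss(v - constant, tau) for v in y)

def best_constant(y, tau):
    """Observed value that minimizes the total check loss (a tau-th quantile)."""
    return min(y, key=lambda c: total_loss(y, c, tau))

# Invented responses with one extreme value.
y = [1.0, 2.0, 3.0, 4.0, 100.0]
print(best_constant(y, 0.5))
print(best_constant(y, 0.9))
```

Replacing the constant with a linear predictor and minimizing the same loss gives the linear quantile regression estimates for a chosen τ, which is why upper and lower quantiles can respond differently to a habitat variable when the response distribution is heterogeneous.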
Spirometry results (FEV1 and FVC) in young Bantu men from Tanzania vs environmental and family characteristics.

PubMed

Rębacz-Maron, Ewa; Parafiniuk, Mirosław

2014-01-01

The aim of this paper was to examine the extent to which socioeconomic factors, anthropological data and somatic indices influenced the results of spirometric measurements (FEV1 and FVC) in Tanzanian youth. The population studied were young black Bantu men aged 12.8-24.0 years. Analysis was performed for the whole data set (n = 255), as well as separately for two age groups: under 17.5 years (n = 168) and 17.5 + (n = 87). A backward stepwise multiple regression analysis was performed for FEV1 and FVC as dependent variables on socioeconomic and anthropometric data. Multiple regression analysis for the whole group revealed that the socioeconomic and anthropometric data under analysis accounted for 38% of the variation in FEV1. In addition the analysis demonstrated that 34% of the variation in FVC could be accounted for by the variables used in the regression. A significant impact in explaining the variability of FVC was exhibited by the thorax mobility, financial situation of the participants and Pignet-Verwaecka Index. Analysis of the data indicates the significant role of selected socio-economic factors on the development of the biological specimens investigated. There were no perceptible pathologies, and the results can be treated as a credible interpretation of the influence exerted by the environment in which the teenagers under study grew up.
Quantification and regionalization of groundwater recharge in South-Central Kansas: Integrating field characterization, statistical analysis, and GIS

USGS Publications Warehouse

Sophocleous, M.

2000-01-01

A practical methodology for recharge characterization was developed based on several years of field-oriented research at 10 sites in the Great Bend Prairie of south-central Kansas. This methodology combines the soil-water budget on a storm-by-storm year-round basis with the resulting watertable rises. The estimated 1985-1992 average annual recharge was less than 50 mm/year with a range from 15 mm/year (during the 1998 drought) to 178 mm/year (during the 1993 flood year). Most of this recharge occurs during the spring months. To regionalize these site-specific estimates, an additional methodology based on multiple (forward) regression analysis combined with classification and GIS overlay analyses was developed and implemented. The multiple regression analysis showed that the most influential variables were, in order of decreasing importance, total annual precipitation, average maximum springtime soil-profile water storage, average shallowest springtime depth to watertable, and average springtime precipitation rate. Therefore, four GIS (ARC/INFO) data "layers" or coverages were constructed for the study region based on these four variables, and each such coverage was classified into the same number of data classes to avoid biasing the results. The normalized regression coefficients were employed to weigh the class rankings of each recharge-affecting variable. This approach resulted in recharge zonations that agreed well with the site recharge estimates. During the "Great Flood of 1993," when rainfall totals exceeded normal levels by ~200% in the northern portion of the study region, the developed regionalization methodology was tested against such extreme conditions, and proved to be both practical, based on readily available or easily measurable data, and robust. It was concluded that the combination of multiple regression and GIS overlay analyses is a powerful and practical approach to regionalizing small samples of recharge estimates.
Effect of Ankle Range of Motion (ROM) and Lower-Extremity Muscle Strength on Static Balance Control Ability in Young Adults: A Regression Analysis

PubMed Central

Kim, Seong-Gil

2018-01-01

Background The purpose of this study was to investigate the effect of ankle ROM and lower-extremity muscle strength on static balance control ability in young adults. Material/Methods This study was conducted with 65 young adults, but 10 young adults dropped out during the measurement, so 55 young adults (male: 19, female: 36) completed the study. Postural sway (length and velocity) was measured with eyes open and closed, and ankle ROM (AROM and PROM of dorsiflexion and plantarflexion) and lower-extremity muscle strength (flexor and extensor of hip, knee, and ankle joint) were measured. Pearson correlation coefficient was used to examine the correlation between variables and static balance ability. Simple linear regression analysis and multiple linear regression analysis were used to examine the effect of variables on static balance ability. Results In correlation analysis, plantarflexion ROM (AROM and PROM) and lower-extremity muscle strength (except hip extensor) were significantly correlated with postural sway (p<0.05). In simple correlation analysis, all variables that passed the correlation analysis procedure had significant influence (p<0.05). In multiple linear regression analysis, plantar flexion PROM with eyes open significantly influenced sway length (B=0.681) and sway velocity (B=0.011). Conclusions Lower-extremity muscle strength and ankle plantarflexion ROM influenced static balance control ability, with ankle plantarflexion PROM showing the greatest influence. Therefore, both contractile structures and non-contractile structures should be of interest when considering static balance control ability improvement. PMID:29760375
Effect of Ankle Range of Motion (ROM) and Lower-Extremity Muscle Strength on Static Balance Control Ability in Young Adults: A Regression Analysis.

PubMed

Kim, Seong-Gil; Kim, Wan-Soo

2018-05-15

BACKGROUND The purpose of this study was to investigate the effect of ankle ROM and lower-extremity muscle strength on static balance control ability in young adults. MATERIAL AND METHODS This study was conducted with 65 young adults, but 10 young adults dropped out during the measurement, so 55 young adults (male: 19, female: 36) completed the study. Postural sway (length and velocity) was measured with eyes open and closed, and ankle ROM (AROM and PROM of dorsiflexion and plantarflexion) and lower-extremity muscle strength (flexor and extensor of hip, knee, and ankle joint) were measured. Pearson correlation coefficient was used to examine the correlation between variables and static balance ability. Simple linear regression analysis and multiple linear regression analysis were used to examine the effect of variables on static balance ability. RESULTS In correlation analysis, plantarflexion ROM (AROM and PROM) and lower-extremity muscle strength (except hip extensor) were significantly correlated with postural sway (p<0.05). In simple correlation analysis, all variables that passed the correlation analysis procedure had significant influence (p<0.05). In multiple linear regression analysis, plantar flexion PROM with eyes open significantly influenced sway length (B=0.681) and sway velocity (B=0.011). CONCLUSIONS Lower-extremity muscle strength and ankle plantarflexion ROM influenced static balance control ability, with ankle plantarflexion PROM showing the greatest influence. Therefore, both contractile structures and non-contractile structures should be of interest when considering static balance control ability improvement.
Multicollinearity may lead to artificial interaction: an example from a cross sectional study of biomarkers.

PubMed

Sithisarankul, P; Weaver, V M; Diener-West, M; Strickland, P T

1997-06-01

Collinearity is the situation that arises in multiple regression when some or all of the explanatory variables are so highly correlated with one another that it becomes very difficult, if not impossible, to disentangle their influences and obtain a reasonably precise estimate of their effects. A suppressor variable is one of the extreme situations of collinearity, in which one variable can substantially increase the multiple correlation when combined with a variable that is only modestly correlated with the response variable. In this study, we describe the process by which we discovered and disentangled multicollinearity and its consequences, namely artificial interaction, using data from a cross-sectional quantification of several biomarkers. We showed how the collinearity between one biomarker (blood lead level) and another (urinary trans, trans-muconic acid) and their interaction (blood lead level * urinary trans, trans-muconic acid) can lead to the observed artificial interaction on the third biomarker (urinary 5-aminolevulinic acid).
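The suppressor effect described above can be illustrated with the standard two-predictor formula for the squared multiple correlation, R² = (r_y1² + r_y2² − 2·r_y1·r_y2·r_12) / (1 − r_12²). The sketch below is not taken from the study; the function names and the correlation values are hypothetical and chosen only to show the effect.

```python
def r_squared_two_predictors(r_y1, r_y2, r_12):
    """Squared multiple correlation of y on two predictors, from pairwise correlations."""
    if abs(r_12) >= 1.0:
        raise ValueError("predictors are perfectly collinear; R^2 is undefined")
    return (r_y1 ** 2 + r_y2 ** 2 - 2.0 * r_y1 * r_y2 * r_12) / (1.0 - r_12 ** 2)


def variance_inflation_factor(r_12):
    """Variance inflation factor for either predictor in a two-predictor model."""
    return 1.0 / (1.0 - r_12 ** 2)


# Hypothetical correlations: predictor 2 is uncorrelated with the response
# but correlated with predictor 1, so it acts as a suppressor.
r2_alone = 0.5 ** 2                                   # predictor 1 by itself
r2_both = r_squared_two_predictors(0.5, 0.0, 0.5)     # both predictors
vif = variance_inflation_factor(0.5)
```

Here predictor 1 alone explains 25% of the variance, while adding the suppressor raises the figure to about 33% even though the suppressor has no correlation with the response; the same inter-predictor correlation inflates coefficient variances by a factor of about 1.33.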
Relationship among several measurements of slipperiness obtained in a laboratory environment.

PubMed

Chang, Wen-Ruey; Chang, Chien-Chi

2018-04-01

Multiple sensing mechanisms could be used in forming responses to avoid slips, but previous studies, correlating only two parameters, revealed a limited picture of this complex system. In this study, the participants walked as fast as possible without a slip under 15 conditions of different degrees of slipperiness. The relationships among various response parameters, including perceived slipperiness rating, utilized coefficient of friction (UCOF), slipmeter measurement and kinematic parameters, were evaluated. The results showed that the UCOF, perceived rating and heel angle had higher adjusted R² values as dependent variables in the multiple linear regressions with the remaining variables in the final pool as independent variables. Although each variable in the final data pool could reflect some measurement of slipperiness, these three variables are more inclusive than others in representing the other variables and were bigger predictors of other variables, so they could be better candidates for measurements of slipperiness. Copyright © 2017 Elsevier Ltd. All rights reserved.
Spatial interpolation schemes of daily precipitation for hydrologic modeling

USGS Publications Warehouse

Hwang, Y.; Clark, M.R.; Rajagopalan, B.; Leavesley, G.

2012-01-01

Distributed hydrologic models typically require spatial estimates of precipitation interpolated from sparsely located observational points to the specific grid points. We compare and contrast the performance of regression-based statistical methods for the spatial estimation of precipitation in two hydrologically different basins and confirm that widely used regression-based estimation schemes fail to describe the realistic spatial variability of the daily precipitation field. The methods assessed are: (1) inverse distance weighted average; (2) multiple linear regression (MLR); (3) climatological MLR; and (4) locally weighted polynomial regression (LWP). In order to improve the performance of the interpolations, the authors propose a two-step regression technique for effective daily precipitation estimation. In this simple two-step estimation process, precipitation occurrence is first generated via a logistic regression model before estimating the amount of precipitation separately on wet days. This process generated the precipitation occurrence, amount, and spatial correlation effectively. A distributed hydrologic model (PRMS) was used for the impact analysis in daily time step simulation. Multiple simulations suggested noticeable differences between the input alternatives generated by three different interpolation schemes. Differences are shown in overall simulation error against the observations, degree of explained variability, and seasonal volumes. Simulated streamflows also showed different characteristics in mean, maximum, minimum, and peak flows. Given the same parameter optimization technique, LWP input showed least streamflow error in Alapaha basin and CMLR input showed least error (still very close to LWP) in Animas basin. All of the two-step interpolation inputs resulted in lower streamflow error compared to the directly interpolated inputs. © 2011 Springer-Verlag.
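The two-step idea in this abstract (occurrence first, amount second) can be sketched as follows. This is only an illustration under stated assumptions: the coefficients are hypothetical, the 0.5 occurrence threshold is an assumption (the abstract does not say how occurrence is decided), and the function names are invented for the example.

```python
import math


def occurrence_probability(intercept, slopes, predictors):
    """Step 1: logistic model for the probability that the day is wet."""
    z = intercept + sum(b * x for b, x in zip(slopes, predictors))
    return 1.0 / (1.0 + math.exp(-z))


def two_step_estimate(occurrence_model, amount_model, predictors, threshold=0.5):
    """Step 2: estimate an amount only where step 1 classifies the day as wet."""
    p_wet = occurrence_probability(*occurrence_model, predictors)
    if p_wet < threshold:
        return 0.0  # classified dry, so no amount is estimated
    intercept, slopes = amount_model
    amount = intercept + sum(b * x for b, x in zip(slopes, predictors))
    return max(amount, 0.0)  # a precipitation amount cannot be negative


# Hypothetical fitted coefficients: (intercept, [slopes]).
occurrence_model = (-2.0, [1.0])
amount_model = (1.0, [0.5])

dry_case = two_step_estimate(occurrence_model, amount_model, [0.0])
wet_case = two_step_estimate(occurrence_model, amount_model, [4.0])
```

Separating the two steps lets the amount regression be fitted on wet days only, so the many zero values on dry days do not pull its estimates toward zero.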
Modelling space of spread Dengue Hemorrhagic Fever (DHF) in Central Java use spatial durbin model

NASA Astrophysics Data System (ADS)

Ispriyanti, Dwi; Prahutama, Alan; Taryono, Arkadina PN

2018-05-01

Dengue Hemorrhagic Fever (DHF) is one of the major public health problems in Indonesia. From year to year, DHF causes Extraordinary Events in most parts of Indonesia, especially Central Java. Central Java consists of 35 districts or cities, each located close to the others. Spatial regression is an analysis that assesses the influence of independent variables on the dependent variable while accounting for the influence of neighboring regions. Spatial regression models include the spatial autoregressive model (SAR), the spatial error model (SEM) and the spatial autoregressive moving average model (SARMA). The spatial Durbin model (SDM) is an extension of SAR in which both the dependent and the independent variables have spatial influence. In this research, the dependent variable is the number of DHF sufferers. The independent variables observed are population density, number of hospitals, residents and health centers, and mean years of schooling. In the multiple regression model test, the variables that significantly affect the spread of DHF are the population and mean years of schooling. Using queen contiguity and rook contiguity, the best model produced is the SDM with queen contiguity because it has the smallest AIC value, 494.12. The factors that generally affect the spread of DHF in Central Java Province are the size of the population and the mean years of schooling.
Epidemiologic Evaluation of Measurement Data in the Presence of Detection Limits

PubMed Central

Lubin, Jay H.; Colt, Joanne S.; Camann, David; Davis, Scott; Cerhan, James R.; Severson, Richard K.; Bernstein, Leslie; Hartge, Patricia

2004-01-01

Quantitative measurements of environmental factors greatly improve the quality of epidemiologic studies but can pose challenges because of the presence of upper or lower detection limits or interfering compounds, which do not allow for precise measured values. We consider the regression of an environmental measurement (dependent variable) on several covariates (independent variables). Various strategies are commonly employed to impute values for interval-measured data, including assignment of one-half the detection limit to nondetected values or of “fill-in” values randomly selected from an appropriate distribution. On the basis of a limited simulation study, we found that the former approach can be biased unless the percentage of measurements below detection limits is small (5–10%). The fill-in approach generally produces unbiased parameter estimates but may produce biased variance estimates and thereby distort inference when 30% or more of the data are below detection limits. Truncated data methods (e.g., Tobit regression) and multiple imputation offer two unbiased approaches for analyzing measurement data with detection limits. If interest resides solely on regression parameters, then Tobit regression can be used. If individualized values for measurements below detection limits are needed for additional analysis, such as relative risk regression or graphical display, then multiple imputation produces unbiased estimates and nominal confidence intervals unless the proportion of missing data is extreme. We illustrate various approaches using measurements of pesticide residues in carpet dust in control subjects from a case–control study of non-Hodgkin lymphoma. PMID:15579415
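For readers unfamiliar with the Tobit approach mentioned above, the sketch below writes out the censored-normal log-likelihood that such a model maximizes for a single covariate and a lower detection limit. It is not the authors' code; the function names, the use of `None` to mark nondetects, and the parameter values are assumptions for illustration.

```python
import math


def normal_logpdf(x, mu, sigma):
    """Log density of a normal distribution."""
    z = (x - mu) / sigma
    return -0.5 * z * z - math.log(sigma) - 0.5 * math.log(2.0 * math.pi)


def normal_cdf(x, mu, sigma):
    """Cumulative distribution function of a normal distribution."""
    return 0.5 * (1.0 + math.erf((x - mu) / (sigma * math.sqrt(2.0))))


def tobit_loglik(y, x, detection_limit, b0, b1, sigma):
    """Log-likelihood of y = b0 + b1*x + error with left-censoring at the limit.

    Entries of y equal to None are treated as nondetects (below the limit).
    """
    total = 0.0
    for yi, xi in zip(y, x):
        mu = b0 + b1 * xi
        if yi is None:
            # A nondetect contributes the probability of falling below the limit.
            total += math.log(normal_cdf(detection_limit, mu, sigma))
        else:
            # A detected value contributes its ordinary normal density.
            total += normal_logpdf(yi, mu, sigma)
    return total


# Hypothetical measurements: two detected values and one nondetect.
loglik = tobit_loglik([2.0, None, 3.5], [0.0, 1.0, 2.0], 1.0, 1.5, 0.8, 1.0)
```

Unlike substituting one-half the detection limit, this likelihood never assigns a made-up value to a nondetect; it uses only the information that the measurement fell below the limit.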
Prevalence of consistent condom use with various types of sex partners and associated factors among money boys in Changsha, China.

PubMed

Wang, Lian-Hong; Yan, Jin; Yang, Guo-Li; Long, Shuo; Yu, Yong; Wu, Xi-Lin

2015-04-01

Money boys with inconsistent condom use (less than 100% of the time) are at high risk of infection by human immunodeficiency virus (HIV) or sexually transmitted infection (STI), but relatively little research has examined their risk behaviors. We investigated the prevalence of consistent condom use (100% of the time) and associated factors among money boys. A cross-sectional study using a structured questionnaire was conducted among money boys in Changsha, China, between July 2012 and January 2013. Independent variables included socio-demographic data, substance abuse history, work characteristics, and self-reported HIV and STI history. Dependent variables included consistent condom use with different types of sex partners. Among the participants, 82.4% used condoms consistently with male clients, 80.2% with male sex partners, and 77.1% with female sex partners in the past 3 months. A multiple stepwise logistic regression model identified four statistically significant factors associated with lower likelihoods of consistent condom use with male clients: age group, substance abuse, lack of an "employment" arrangement, and having no HIV test within the prior 6 months. In a similar model for male sex partners, only one factor was significantly associated with lower likelihoods of consistent condom use: having no HIV test within the prior 6 months. For female sex partners, two variables were statistically significant in the multiple stepwise logistic regression analysis: having no HIV test within the prior 6 months and having an STI history. Interventions that are linked with more realistic and acceptable HIV prevention methods are greatly warranted and should increase risk awareness and consistent condom use in both commercial and personal relationships. © 2015 International Society for Sexual Medicine.
Predictive ability of a comprehensive incremental test in mountain bike marathon.

PubMed

Ahrend, Marc-Daniel; Schneeweiss, Patrick; Martus, Peter; Niess, Andreas M; Krauss, Inga

2018-01-01

Traditional performance tests in mountain bike marathon (XCM) primarily quantify aerobic metabolism and may not describe the relevant capacities in XCM. We aimed to validate a comprehensive test protocol quantifying its intermittent demands. Forty-nine athletes (38.8±9.1 years; 38 male; 11 female) performed a laboratory performance test, including an incremental test, to determine individual anaerobic threshold (IAT), peak power output (PPO) and three maximal efforts (10 s all-out sprint, 1 min maximal effort and 5 min maximal effort). Within 2 weeks, the athletes participated in one of three XCM races (n=15, n=9 and n=25). Correlations between test variables and race times were calculated separately. In addition, multiple regression models of the predictive value of laboratory outcomes were calculated for race 3 and across all races (z-transformed data). All variables were correlated with race times 1, 2 and 3: 10 s all-out sprint (r=-0.72; r=-0.59; r=-0.61), 1 min maximal effort (r=-0.85; r=-0.84; r=-0.82), 5 min maximal effort (r=-0.57; r=-0.85; r=-0.76), PPO (r=-0.77; r=-0.73; r=-0.76) and IAT (r=-0.71; r=-0.67; r=-0.68). The best-fitting multiple regression models for race 3 (r²=0.868) and across all races (r²=0.757) comprised 1 min maximal effort, IAT and body weight. Aerobic and intermittent variables correlated least strongly with race times. Their use in a multiple regression model confirmed additional explanatory power to predict XCM performance. These findings underline the usefulness of the comprehensive incremental test to predict performance in that sport more precisely.
Meteorological Modes of Variability for Fine Particulate Matter (PM2.5) Air Quality in the United States: Implications for PM2.5 Sensitivity to Climate Change

EPA Science Inventory

We applied a multiple linear regression model to understand the relationships of PM2.5 with meteorological variables in the contiguous US and from there to infer the sensitivity of PM2.5 to climate change. We used 2004-2008 PM2.5 observations fro...
The effect of playing tactics and situational variables on achieving score-box possessions in a professional soccer team.

PubMed

Lago-Ballesteros, Joaquin; Lago-Peñas, Carlos; Rey, Ezequiel

2012-01-01

The aim of this study was to analyse the influence of playing tactics, opponent interaction and situational variables on achieving score-box possessions in professional soccer. The sample was constituted by 908 possessions obtained by a team from the Spanish soccer league in 12 matches played during the 2009-2010 season. Multidimensional qualitative data obtained from 12 ordered categorical variables were used. Sampled matches were registered by the AMISCO PRO system. Data were analysed using chi-square analysis and multiple logistic regression analysis. Of 908 possessions, 303 (33.4%) produced score-box possessions, 477 (52.5%) achieved progression and 128 (14.1%) failed to reach any sort of progression. Multiple logistic regression showed that, for the main variable "team possession type", direct attacks and counterattacks were three times more effective than elaborate attacks for producing a score-box possession (P < 0.05). Team possession originating from the middle zones and playing against less than six defending players (P < 0.001) registered a higher success than those started in the defensive zone with a balanced defence. When the team was drawing or winning, the probability of reaching the score-box decreased by 43 and 53 percent, respectively, compared with the losing situation (P < 0.05). Accounting for opponent interactions and situational variables is critical to evaluate the effectiveness of offensive playing tactics on producing score-box possessions.
Variables associated with health-related quality of life in a Brazilian sample of patients from a tertiary outpatient clinic for depression and anxiety disorders.

PubMed

Schwab, Bianca; Daniel, Heloisa Silveira; Lutkemeyer, Carine; Neves, João Arthur Lange Lins; Zilli, Louise Nassif; Guarnieri, Ricardo; Diaz, Alexandre Paim; Michels, Ana Maria Maykot Prates

2015-01-01

Health-related quality of life (HRQOL) assessment tools have been broadly used in the medical context. These tools are used to measure the subjective impact of the disease on patients. The objective of this study was to evaluate the variables associated with HRQOL in a Brazilian sample of patients followed up in a tertiary outpatient clinic for depression and anxiety disorders. Cross-sectional study. Independent variables were those included in a sociodemographic questionnaire and the Hospital Anxiety and Depression Scale (HADS) scores. Dependent variables were those included in the short version of the World Health Organization Quality of Life (WHOQOL-BREF) and the scores for its subdomains (overall quality of life and general health, physical health, psychological health, social relationships, and environment). A multiple linear regression analysis was used to find the variables independently associated with each outcome. Seventy-five adult patients were evaluated. After multiple linear regression analysis, the HADS scores were associated with all outcomes, except social relationships (p = 0.08). Female gender was associated with poor total scores, as well as psychological health and environment. Unemployment was associated with poor physical health. Identifying the factors associated with HRQOL and recognizing that depression and anxiety are major factors are essential to improve the care of patients.
Nursing home cost and ownership type: evidence of interaction effects.

PubMed

Arling, G; Nordquist, R H; Capitman, J A

1987-06-01

Due to steadily increasing public expenditures for nursing home care, much research has focused on factors that influence nursing home costs, especially for Medicaid patients. Nursing home cost function studies have typically used a number of predictor variables in a multiple regression analysis to determine the effect of these variables on operating cost. Although several authors have suggested that nursing home ownership types have different goal orientations, not necessarily based on economic factors, little attention has been paid to this issue in empirical research. In this study, data from 150 Virginia nursing homes were used in multiple regression analysis to examine factors accounting for nursing home operating costs. The context of the study was the Virginia Medicaid reimbursement system, which has intermediate care and skilled nursing facility (ICF and SNF) facility-specific per diem rates, set according to facility cost histories. The analysis revealed interaction effects between ownership and other predictor variables (e.g., percentage Medicaid residents, case mix, and region), with predictor variables having different effects on cost depending on ownership type. Conclusions are drawn about the goal orientations and behavior of chain-operated, individual for-profit, and public and nonprofit facilities. The implications of these findings for long-term care reimbursement policies are discussed.
Nursing home cost and ownership type: evidence of interaction effects.

PubMed Central

Arling, G; Nordquist, R H; Capitman, J A

1987-01-01

Due to steadily increasing public expenditures for nursing home care, much research has focused on factors that influence nursing home costs, especially for Medicaid patients. Nursing home cost function studies have typically used a number of predictor variables in a multiple regression analysis to determine the effect of these variables on operating cost. Although several authors have suggested that nursing home ownership types have different goal orientations, not necessarily based on economic factors, little attention has been paid to this issue in empirical research. In this study, data from 150 Virginia nursing homes were used in multiple regression analysis to examine factors accounting for nursing home operating costs. The context of the study was the Virginia Medicaid reimbursement system, which has intermediate care and skilled nursing facility (ICF and SNF) facility-specific per diem rates, set according to facility cost histories. The analysis revealed interaction effects between ownership and other predictor variables (e.g., percentage Medicaid residents, case mix, and region), with predictor variables having different effects on cost depending on ownership type. Conclusions are drawn about the goal orientations and behavior of chain-operated, individual for-profit, and public and nonprofit facilities. The implications of these findings for long-term care reimbursement policies are discussed. PMID:3301746
Occlusal factors are not related to self-reported bruxism.

PubMed

Manfredini, Daniele; Visscher, Corine M; Guarda-Nardini, Luca; Lobbezoo, Frank

2012-01-01

To estimate the contribution of various occlusal features of the natural dentition that may identify self-reported bruxers compared to nonbruxers. Two age- and sex-matched groups of self-reported bruxers (n = 67) and self-reported nonbruxers (n = 75) took part in the study. For each patient, the following occlusal features were clinically assessed: retruded contact position (RCP) to intercuspal contact position (ICP) slide length (< 2 mm was considered normal), vertical overlap (< 0 mm was considered an anterior open bite; > 4 mm, a deep bite), horizontal overlap (> 4 mm was considered a large horizontal overlap), incisor dental midline discrepancy (< 2 mm was considered normal), and the presence of a unilateral posterior crossbite, mediotrusive interferences, and laterotrusive interferences. A multiple logistic regression model was used to identify the significant associations between the assessed occlusal features (independent variables) and self-reported bruxism (dependent variable). Accuracy values to predict self-reported bruxism were unacceptable for all occlusal variables. The only variable remaining in the final regression model was laterotrusive interferences (P = .030). The percentage of explained variance for bruxism by the final multiple regression model was 4.6%. This model including only one occlusal factor showed low positive (58.1%) and negative predictive values (59.7%), thus showing a poor accuracy to predict the presence of self-reported bruxism (59.2%). This investigation suggested that the contribution of occlusion to the differentiation between bruxers and nonbruxers is negligible. This finding supports theories that advocate a much diminished role for peripheral anatomical-structural factors in the pathogenesis of bruxism.
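The positive and negative predictive values and the accuracy quoted above are simple ratios from a 2×2 classification table. The sketch below shows the arithmetic; the counts are hypothetical, because the abstract does not report the underlying table, and the function name is invented for the example.

```python
def predictive_values(tp, fp, tn, fn):
    """Return (PPV, NPV, accuracy) from the four cells of a 2x2 table."""
    ppv = tp / (tp + fp)            # share of predicted positives that are true
    npv = tn / (tn + fn)            # share of predicted negatives that are true
    accuracy = (tp + tn) / (tp + fp + tn + fn)
    return ppv, npv, accuracy


# Hypothetical counts, not the study's data.
ppv, npv, accuracy = predictive_values(tp=30, fp=20, tn=40, fn=10)
```

Values close to 0.5 for all three quantities, as reported in the abstract, indicate a model that classifies only slightly better than chance when the two groups are of similar size.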

Empirical predictive models of daily relativistic electron flux at geostationary orbit: Multiple regression analysis

DOE PAGES

Simms, Laura E.; Engebretson, Mark J.; Pilipenko, Viacheslav; ...

2016-04-07

The daily maximum relativistic electron flux at geostationary orbit can be predicted well with a set of daily averaged predictor variables including previous day's flux, seed electron flux, solar wind velocity and number density, AE index, IMF Bz, Dst, and ULF and VLF wave power. As predictor variables are intercorrelated, we used multiple regression analyses to determine which are the most predictive of flux when other variables are controlled. Empirical models produced from regressions of flux on measured predictors from 1 day previous were reasonably effective at predicting novel observations. Adding previous flux to the parameter set improves the prediction of the peak of the increases but delays its anticipation of an event. Previous day's solar wind number density and velocity, AE index, and ULF wave activity are the most significant explanatory variables; however, the AE index, measuring substorm processes, shows a negative correlation with flux when other parameters are controlled. This may be due to the triggering of electromagnetic ion cyclotron waves by substorms that cause electron precipitation. VLF waves show lower, but significant, influence. The combined effect of ULF and VLF waves shows a synergistic interaction, where each increases the influence of the other on flux enhancement. Correlations between observations and predictions for this 1 day lag model ranged from 0.71 to 0.89 (average: 0.78). Furthermore, a path analysis of correlations between predictors suggests that solar wind and IMF parameters affect flux through intermediate processes such as ring current (Dst), AE, and wave activity.
Empirical predictive models of daily relativistic electron flux at geostationary orbit: Multiple regression analysis

DOE Office of Scientific and Technical Information (OSTI.GOV)

Simms, Laura E.; Engebretson, Mark J.; Pilipenko, Viacheslav

The daily maximum relativistic electron flux at geostationary orbit can be predicted well with a set of daily averaged predictor variables including previous day's flux, seed electron flux, solar wind velocity and number density, AE index, IMF Bz, Dst, and ULF and VLF wave power. As predictor variables are intercorrelated, we used multiple regression analyses to determine which are the most predictive of flux when other variables are controlled. Empirical models produced from regressions of flux on measured predictors from 1 day previous were reasonably effective at predicting novel observations. Adding previous flux to the parameter set improves the prediction of the peak of the increases but delays its anticipation of an event. Previous day's solar wind number density and velocity, AE index, and ULF wave activity are the most significant explanatory variables; however, the AE index, measuring substorm processes, shows a negative correlation with flux when other parameters are controlled. This may be due to the triggering of electromagnetic ion cyclotron waves by substorms that cause electron precipitation. VLF waves show lower, but significant, influence. The combined effect of ULF and VLF waves shows a synergistic interaction, where each increases the influence of the other on flux enhancement. Correlations between observations and predictions for this 1 day lag model ranged from 0.71 to 0.89 (average: 0.78). Furthermore, a path analysis of correlations between predictors suggests that solar wind and IMF parameters affect flux through intermediate processes such as ring current (Dst), AE, and wave activity.
Modulation of brain activity by multiple lexical and word form variables in visual word recognition: A parametric fMRI study.

PubMed

Hauk, Olaf; Davis, Matthew H; Pulvermüller, Friedemann

2008-09-01

Psycholinguistic research has documented a range of variables that influence visual word recognition performance. Many of these variables are highly intercorrelated. Most previous studies have used factorial designs, which do not exploit the full range of values available for continuous variables, and are prone to skewed stimulus selection as well as to effects of the baseline (e.g. when contrasting words with pseudowords). In our study, we used a parametric approach to study the effects of several psycholinguistic variables on brain activation. We focussed on the variable word frequency, which has been used in numerous previous behavioural, electrophysiological and neuroimaging studies, in order to investigate the neuronal network underlying visual word processing. Furthermore, we investigated the variable orthographic typicality as well as a combined variable for word length and orthographic neighbourhood size (N), for which neuroimaging results are still either scarce or inconsistent. Data were analysed using multiple linear regression analysis of event-related fMRI data acquired from 21 subjects in a silent reading paradigm. The frequency variable correlated negatively with activation in left fusiform gyrus, bilateral inferior frontal gyri and bilateral insulae, indicating that word frequency can affect multiple aspects of word processing. N correlated positively with brain activity in left and right middle temporal gyri as well as right inferior frontal gyrus. Thus, our analysis revealed multiple distinct brain areas involved in visual word processing within one data set.
Examining Preservice Science Teacher Understanding of Nature of Science: Discriminating Variables on the Aspects of Nature of Science

NASA Astrophysics Data System (ADS)

Jones, William I.

This study examined the understanding of nature of science among participants in their final year of a 4-year undergraduate teacher education program at a Midwest liberal arts university. The Logic Model Process was used as an integrative framework to focus the collection, organization, analysis, and interpretation of the data for the purpose of (1) describing participant understanding of NOS and (2) identifying participant characteristics and teacher education program features related to those understandings. The Views of Nature of Science Questionnaire form C (VNOS-C) was used to survey participant understanding of 7 target aspects of Nature of Science (NOS). A rubric was developed from a review of the literature to categorize and score participant understanding of the target aspects of NOS. Participants' high school and college transcripts, planning guides for their respective teacher education program majors, and science content and science teaching methods course syllabi were examined to identify and categorize participant characteristics and teacher education program features. The R software (R Project for Statistical Computing, 2010) was used to conduct an exploratory analysis to determine correlations of the antecedent and transaction predictor variables with participants' scores on the 7 target aspects of NOS. Fourteen participant characteristics and teacher education program features were moderately and significantly (p < .01) correlated with participant scores on the target aspects of NOS. The 6 antecedent predictor variables were entered into multiple regression analyses to determine the best-fit model of antecedent predictor variables for each target NOS aspect. The transaction predictor variables were entered into separate multiple regression analyses to determine the best-fit model of transaction predictor variables for each target NOS aspect.
Variables from the best-fit antecedent and best-fit transaction models for each target aspect of NOS were then combined. A regression analysis for each of the combined models was conducted to determine the relative effect of these variables on the target aspects of NOS. Findings from the multiple regression analyses revealed that each of the fourteen predictor variables was present in the best-fit model for at least 1 of the 7 target aspects of NOS. However, not all of the predictor variables were statistically significant (p < .007) in the models and their effect (beta) varied. Participants in the teacher education program who had higher ACT Math scores, completed more high school science credits, and were enrolled either in the Middle Childhood with a science concentration program major or in the Adolescent/Young Adult Science Education program major were more likely to have an informed understanding on each of the 7 target aspects of NOS. Analyses of the planning guides and the course syllabi in each teacher education program major revealed differences between the program majors that may account for the results.
Predicting ecological flow regime at ungaged sites: A comparison of methods

USGS Publications Warehouse

Murphy, Jennifer C.; Knight, Rodney R.; Wolfe, William J.; Gain, W. Scott

2012-01-01

Nineteen ecologically relevant streamflow characteristics were estimated using published rainfall–runoff and regional regression models for six sites with observed daily streamflow records in Kentucky. The regional regression model produced median estimates closer to the observed median for all but two characteristics. The variability of predictions from both models was generally less than the observed variability. The variability of the predictions from the rainfall–runoff model was greater than that from the regional regression model for all but three characteristics. Eight characteristics predicted by the rainfall–runoff model display positive or negative bias across all six sites; biases are not as pronounced for the regional regression model. Results suggest that a rainfall–runoff model calibrated on a single characteristic is less likely to perform well as a predictor of a range of other characteristics (flow regime) when compared with a regional regression model calibrated individually on multiple characteristics used to represent the flow regime. Poor model performance may misrepresent hydrologic conditions, potentially distorting the perceived risk of ecological degradation. Without prior selection of streamflow characteristics, targeted calibration, and error quantification, the widespread application of general hydrologic models to ecological flow studies is problematic. Published 2012. This article is a U.S. Government work and is in the public domain in the USA.
Exhaustive Search for Sparse Variable Selection in Linear Regression

NASA Astrophysics Data System (ADS)

Igarashi, Yasuhiko; Takenaka, Hikaru; Nakanishi-Ohno, Yoshinori; Uemura, Makoto; Ikeda, Shiro; Okada, Masato

2018-04-01

We propose a K-sparse exhaustive search (ES-K) method and a K-sparse approximate exhaustive search method (AES-K) for selecting variables in linear regression. With these methods, K-sparse combinations of variables are tested exhaustively assuming that the optimal combination of explanatory variables is K-sparse. By collecting the results of exhaustively computing ES-K, various approximate methods for selecting sparse variables can be summarized as density of states. With this density of states, we can compare different methods for selecting sparse variables such as relaxation and sampling. For large problems where the combinatorial explosion of explanatory variables is crucial, the AES-K method enables density of states to be effectively reconstructed by using the replica-exchange Monte Carlo method and the multiple histogram method. Applying the ES-K and AES-K methods to type Ia supernova data, we confirmed the conventional understanding in astronomy when an appropriate K is given beforehand. However, we found it difficult to determine K from the data. Using virtual measurement and analysis, we argue that this is caused by data shortage.
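The exhaustive K-sparse idea can be sketched in a few lines: fit a regression on every subset of exactly K predictors and rank the subsets by a score. The sketch below is an illustration under stated assumptions, not the authors' implementation: the function name `exhaustive_k_sparse`, the residual sum of squares as the score, and the synthetic data are all hypothetical choices.

```python
from itertools import combinations

import numpy as np


def exhaustive_k_sparse(X, y, k):
    """Fit OLS on every k-column subset of X; return (rss, subset) pairs sorted by rss."""
    results = []
    for subset in combinations(range(X.shape[1]), k):
        # Intercept plus the k selected predictor columns.
        A = np.column_stack([np.ones(len(y)), X[:, list(subset)]])
        coef, *_ = np.linalg.lstsq(A, y, rcond=None)
        rss = float(np.sum((y - A @ coef) ** 2))
        results.append((rss, subset))
    return sorted(results)


# Synthetic data: only columns 1 and 4 carry signal (an assumption for the demo).
rng = np.random.default_rng(1)
X = rng.normal(size=(60, 6))
y = 2.0 * X[:, 1] - 1.5 * X[:, 4] + 0.1 * rng.normal(size=60)

ranked = exhaustive_k_sparse(X, y, k=2)
best_rss, best_subset = ranked[0]
```

The full list `ranked` holds one score per subset, so a histogram of those scores gives a rough picture of how many subsets fit nearly as well as the best one. The number of subsets grows combinatorially with the number of predictors, which is why an approximate search is needed for large problems.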
Multivariate Linear Regression and CART Regression Analysis of TBM Performance at Abu Hamour Phase-I Tunnel

NASA Astrophysics Data System (ADS)

Jakubowski, J.; Stypulkowski, J. B.; Bernardeau, F. G.

2017-12-01

The first phase of the Abu Hamour drainage and storm tunnel was completed in early 2017. The 9.5 km long, 3.7 m diameter tunnel was excavated with two Earth Pressure Balance (EPB) Tunnel Boring Machines from Herrenknecht. TBM operation processes were monitored and recorded by a Data Acquisition and Evaluation System. The authors coupled the collected TBM drive data with available information on rock mass properties; the data were then cleansed, completed with secondary variables, and aggregated by weeks and shifts. Correlations and descriptive statistics charts were examined. Multivariate Linear Regression and CART regression tree models linking TBM penetration rate (PR), penetration per revolution (PPR) and field penetration index (FPI) with TBM operational and geotechnical characteristics were built for the conditions of the weak/soft rock of Doha. Both regression methods are interpretable, and screening the data with different computational approaches allowed enriched insight. The primary goal of the analysis was to investigate empirical relations between multiple explanatory and response variables, to search for best subsets of explanatory variables and to evaluate the strength of linear and non-linear relations. For each of the penetration indices, a predictive model coupling both regression methods was built and validated. The resultant models appeared to be stronger than the constituent ones and indicated an opportunity for more accurate and robust TBM performance predictions.
Regression Analysis of Stage Variability for West-Central Florida Lakes

USGS Publications Warehouse

Sacks, Laura A.; Ellison, Donald L.; Swancar, Amy

2008-01-01

The variability in a lake's stage depends upon many factors, including surface-water flows, meteorological conditions, and hydrogeologic characteristics near the lake. An understanding of the factors controlling lake-stage variability for a population of lakes may be helpful to water managers who set regulatory levels for lakes. The goal of this study is to determine whether lake-stage variability can be predicted using multiple linear regression and readily available lake and basin characteristics defined for each lake. Regressions were evaluated for a recent 10-year period (1996-2005) and for a historical 10-year period (1954-63). Ground-water pumping is considered to have affected stage at many of the 98 lakes included in the recent period analysis, and not to have affected stage at the 20 lakes included in the historical period analysis. For the recent period, regression models had coefficients of determination (R2) values ranging from 0.60 to 0.74, and up to five explanatory variables. Standard errors ranged from 21 to 37 percent of the average stage variability. Net leakage was the most important explanatory variable in regressions describing the full range and low range in stage variability for the recent period. The most important explanatory variable in the model predicting the high range in stage variability was the height over median lake stage at which surface-water outflow would occur. Other explanatory variables in final regression models for the recent period included the range in annual rainfall for the period and several variables related to local and regional hydrogeology: (1) ground-water pumping within 1 mile of each lake, (2) the amount of ground-water inflow (by category), (3) the head gradient between the lake and the Upper Floridan aquifer, and (4) the thickness of the intermediate confining unit. 
Many of the variables in final regression models are related to hydrogeologic characteristics, underscoring the importance of ground-water exchange in controlling the stage of karst lakes in Florida. Regression equations were used to predict lake-stage variability for the recent period for 12 additional lakes, and the median difference between predicted and observed values ranged from 11 to 23 percent. Coefficients of determination for the historical period were considerably lower (maximum R2 of 0.28) than for the recent period. Reasons for these low R2 values are probably related to the small number of lakes (20) with stage data for an equivalent time period that were unaffected by ground-water pumping, the similarity of many of the lake types (large surface-water drainage lakes), and the greater uncertainty in defining historical basin characteristics. The lack of lake-stage data unaffected by ground-water pumping and the poor regression results obtained for that group of lakes limit the ability to predict natural lake-stage variability using this method in west-central Florida.
Genetic instrumental variable regression: Explaining socioeconomic and health outcomes in nonexperimental data

PubMed Central

DiPrete, Thomas A.; Burik, Casper A. P.; Koellinger, Philipp D.

2018-01-01

Identifying causal effects in nonexperimental data is an enduring challenge. One proposed solution that recently gained popularity is the idea to use genes as instrumental variables [i.e., Mendelian randomization (MR)]. However, this approach is problematic because many variables of interest are genetically correlated, which implies the possibility that many genes could affect both the exposure and the outcome directly or via unobserved confounding factors. Thus, pleiotropic effects of genes are themselves a source of bias in nonexperimental data that would also undermine the ability of MR to correct for endogeneity bias from nongenetic sources. Here, we propose an alternative approach, genetic instrumental variable (GIV) regression, that provides estimates for the effect of an exposure on an outcome in the presence of pleiotropy. As a valuable byproduct, GIV regression also provides accurate estimates of the chip heritability of the outcome variable. GIV regression uses polygenic scores (PGSs) for the outcome of interest which can be constructed from genome-wide association study (GWAS) results. By splitting the GWAS sample for the outcome into nonoverlapping subsamples, we obtain multiple indicators of the outcome PGSs that can be used as instruments for each other and, in combination with other methods such as sibling fixed effects, can address endogeneity bias from both pleiotropy and the environment. In two empirical applications, we demonstrate that our approach produces reasonable estimates of the chip heritability of educational attainment (EA) and show that standard regression and MR provide upwardly biased estimates of the effect of body height on EA. PMID:29686100
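For readers unfamiliar with instrumental variables, the sketch below shows plain two-stage least squares on simulated data. It is not the GIV estimator proposed in this record: it assumes a single valid instrument with no direct effect on the outcome, which is exactly the assumption that pleiotropy violates. All variable names and effect sizes are made up for the simulation.

```python
import numpy as np

rng = np.random.default_rng(2)
n = 20000
z = rng.normal(size=n)                      # instrument: affects x, not y directly
u = rng.normal(size=n)                      # unobserved confounder
x = z + u + rng.normal(size=n)              # exposure
y = 1.0 * x + 2.0 * u + rng.normal(size=n)  # true effect of x on y is 1.0


def slope(a, b):
    """OLS slope of b on a, with an intercept."""
    A = np.column_stack([np.ones(len(a)), a])
    return np.linalg.lstsq(A, b, rcond=None)[0][1]


# Naive regression is biased upward because u drives both x and y.
ols_estimate = slope(x, y)

# Stage 1: project the exposure on the instrument.
stage1 = np.column_stack([np.ones(n), z])
x_hat = stage1 @ np.linalg.lstsq(stage1, x, rcond=None)[0]

# Stage 2: regress the outcome on the fitted exposure.
iv_estimate = slope(x_hat, y)
```

In this simulation the naive slope lands well above 1.0 while the two-stage estimate stays close to it. If `z` also affected `y` directly, the two-stage estimate would be biased as well, which is the problem the abstract attributes to standard Mendelian randomization.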
Genetic instrumental variable regression: Explaining socioeconomic and health outcomes in nonexperimental data.

PubMed

DiPrete, Thomas A; Burik, Casper A P; Koellinger, Philipp D

2018-05-29

Identifying causal effects in nonexperimental data is an enduring challenge. One proposed solution that recently gained popularity is the idea to use genes as instrumental variables [i.e., Mendelian randomization (MR)]. However, this approach is problematic because many variables of interest are genetically correlated, which implies the possibility that many genes could affect both the exposure and the outcome directly or via unobserved confounding factors. Thus, pleiotropic effects of genes are themselves a source of bias in nonexperimental data that would also undermine the ability of MR to correct for endogeneity bias from nongenetic sources. Here, we propose an alternative approach, genetic instrumental variable (GIV) regression, that provides estimates for the effect of an exposure on an outcome in the presence of pleiotropy. As a valuable byproduct, GIV regression also provides accurate estimates of the chip heritability of the outcome variable. GIV regression uses polygenic scores (PGSs) for the outcome of interest which can be constructed from genome-wide association study (GWAS) results. By splitting the GWAS sample for the outcome into nonoverlapping subsamples, we obtain multiple indicators of the outcome PGSs that can be used as instruments for each other and, in combination with other methods such as sibling fixed effects, can address endogeneity bias from both pleiotropy and the environment. In two empirical applications, we demonstrate that our approach produces reasonable estimates of the chip heritability of educational attainment (EA) and show that standard regression and MR provide upwardly biased estimates of the effect of body height on EA. Copyright © 2018 the Author(s). Published by PNAS.
Analytical framework for reconstructing heterogeneous environmental variables from mammal community structure.

PubMed

Louys, Julien; Meloro, Carlo; Elton, Sarah; Ditchfield, Peter; Bishop, Laura C

2015-01-01

We test the performance of two models that use mammalian communities to reconstruct multivariate palaeoenvironments. While both models exploit the correlation between mammal communities (defined in terms of functional groups) and arboreal heterogeneity, the first uses a multiple multivariate regression of community structure and arboreal heterogeneity, while the second uses a linear regression of the principal components of each ecospace. The success of these methods means the palaeoenvironment of a particular locality can be reconstructed in terms of the proportions of heavy, moderate, light, and absent tree canopy cover. The linear regression is less biased, and more precisely and accurately reconstructs heavy tree canopy cover than the multiple multivariate model. However, the multiple multivariate model performs better than the linear regression for all other canopy cover categories. Both models consistently perform better than randomly generated reconstructions. We apply both models to the palaeocommunity of the Upper Laetolil Beds, Tanzania. Our reconstructions indicate that there was very little heavy tree cover at this site (likely less than 10%), with the palaeo-landscape instead comprising a mixture of light and absent tree cover. These reconstructions help resolve the previous conflicting palaeoecological reconstructions made for this site. Copyright © 2014 Elsevier Ltd. All rights reserved.
Building a new predictor for multiple linear regression technique-based corrective maintenance turnaround time.

PubMed

Cruz, Antonio M; Barr, Cameron; Puñales-Pozo, Elsa

2008-01-01

This research's main goals were to build a predictor for estimating the values of a turnaround time (TAT) indicator and to use a numerical clustering technique for finding possible causes of undesirable TAT values. The following stages were used: domain understanding, data characterisation and sample reduction, and insight characterisation. A multiple linear regression predictor for the TAT indicator and clustering techniques were used to improve corrective maintenance task efficiency in a clinical engineering department (CED). Multiple linear regression was used for building a predictive TAT value model. The variables contributing to this model were clinical engineering department response time (CE(rt), 0.415 positive coefficient), stock service response time (Stock(rt), 0.734 positive coefficient), priority level (0.21 positive coefficient) and service time (0.06 positive coefficient). The regression process showed heavy reliance on Stock(rt), CE(rt) and priority, in that order. Clustering techniques revealed the main causes of high TAT values. This examination has provided a means for analysing current technical service quality and effectiveness. In doing so, it has demonstrated a process for identifying areas and methods of improvement and a model against which to analyse these methods' effectiveness.
Statistical Prediction in Proprietary Rehabilitation.

ERIC Educational Resources Information Center

Johnson, Kurt L.; And Others

1987-01-01

Applied statistical methods to predict case expenditures for low back pain rehabilitation cases in proprietary rehabilitation. Extracted predictor variables from case records of 175 workers compensation claimants with some degree of permanent disability due to back injury. Performed several multiple regression analyses resulting in a formula that…
Death Anxiety as a Function of Aging Anxiety

ERIC Educational Resources Information Center

Benton, Jeremy P.; Christopher, Andrew N.; Walter, Mark I.

2007-01-01

To assess how different facets of aging anxiety contributed to the prediction of tangible and existential death anxiety, 167 Americans of various Christian denominations completed a battery of questionnaires. Multiple regression analyses, controlling for demographic variables and previously demonstrated predictors of death anxiety, revealed that…
Nursing Scholars, Writing Dimensions, and Productivity.

ERIC Educational Resources Information Center

Megel, Mary Erickson

1987-01-01

A study to describe cognitive, affective, and behavioral dimensions associated with writing among doctorally prepared nurses and to determine relationships between writing dimensions and journal article publication is discussed. Multiple regression analysis showed that five variables accounted for 18 percent of the variance in research article…
Artificial neural networks and multiple linear regression model using principal components to estimate rainfall over South America

NASA Astrophysics Data System (ADS)

Soares dos Santos, T.; Mendes, D.; Rodrigues Torres, R.

2016-01-01

Several studies have been devoted to dynamic and statistical downscaling for analysis of both climate variability and climate change. This paper introduces an application of artificial neural networks (ANNs) and multiple linear regression (MLR) by principal components to estimate rainfall in South America. This method is proposed for downscaling monthly precipitation time series over South America for three regions: the Amazon; northeastern Brazil; and the La Plata Basin, which is one of the regions of the planet that will be most affected by the climate change projected for the end of the 21st century. The downscaling models were developed and validated using CMIP5 model output and observed monthly precipitation. We used general circulation model (GCM) experiments for the 20th century (RCP historical; 1970-1999) and two scenarios (RCP 2.6 and 8.5; 2070-2100). The model test results indicate that the ANNs significantly outperform the MLR downscaling of monthly precipitation variability.
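Multiple linear regression "by principal components" is commonly implemented as principal component regression: project the centered predictors onto their leading principal directions, then run ordinary least squares on the resulting scores. The sketch below illustrates that generic technique on synthetic data. The function names `pcr_fit` and `pcr_predict` are hypothetical, and nothing here reproduces the authors' downscaling setup.

```python
import numpy as np


def pcr_fit(X, y, n_components):
    """Regress y on the leading principal components of X."""
    x_mean = X.mean(axis=0)
    Xc = X - x_mean
    # Rows of Vt are principal directions, ordered by decreasing singular value.
    _, _, Vt = np.linalg.svd(Xc, full_matrices=False)
    V = Vt[:n_components].T
    scores = Xc @ V
    A = np.column_stack([np.ones(len(y)), scores])
    gamma, *_ = np.linalg.lstsq(A, y, rcond=None)
    return x_mean, V, gamma


def pcr_predict(model, X_new):
    """Predict from a fitted (x_mean, V, gamma) model."""
    x_mean, V, gamma = model
    return gamma[0] + (X_new - x_mean) @ V @ gamma[1:]


# Synthetic demo data (coefficients are made up).
rng = np.random.default_rng(4)
X = rng.normal(size=(50, 4))
y = X @ np.array([1.0, -2.0, 0.5, 0.0]) + 0.1 * rng.normal(size=50)

model_full = pcr_fit(X, y, n_components=4)  # same fitted values as ordinary least squares
model_two = pcr_fit(X, y, n_components=2)   # reduced-dimension fit
```

Keeping all components reproduces the ordinary least-squares fit. Keeping fewer trades some fit for stability when predictors are strongly intercorrelated.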
Artificial neural networks and multiple linear regression model using principal components to estimate rainfall over South America

NASA Astrophysics Data System (ADS)

dos Santos, T. S.; Mendes, D.; Torres, R. R.

2015-08-01

Several studies have been devoted to dynamic and statistical downscaling for analysis of both climate variability and climate change. This paper introduces an application of artificial neural networks (ANN) and multiple linear regression (MLR) by principal components to estimate rainfall in South America. This method is proposed for downscaling monthly precipitation time series over South America for three regions: the Amazon, Northeastern Brazil and the La Plata Basin, which is one of the regions of the planet that will be most affected by the climate change projected for the end of the 21st century. The downscaling models were developed and validated using CMIP5 model output and observed monthly precipitation. We used GCM experiments for the 20th century (RCP Historical; 1970-1999) and two scenarios (RCP 2.6 and 8.5; 2070-2100). The model test results indicate that the ANN significantly outperforms the MLR downscaling of monthly precipitation variability.
Modeling Pan Evaporation for Kuwait by Multiple Linear Regression

PubMed Central

Almedeij, Jaber

2012-01-01

Evaporation is an important parameter for many projects related to hydrology and water resources systems. This paper constitutes the first study conducted in Kuwait to obtain empirical relations for the estimation of daily and monthly pan evaporation as functions of available meteorological data of temperature, relative humidity, and wind speed. The data used here for the modeling are daily measurements of substantial continuity coverage, within a period of 17 years between January 1993 and December 2009, which can be considered representative of the desert climate of the urban zone of the country. Multiple linear regression technique is used with a procedure of variable selection for fitting the best model forms. The correlations of evaporation with temperature and relative humidity are also transformed in order to linearize the existing curvilinear patterns of the data by using power and exponential functions, respectively. The evaporation models suggested with the best variable combinations were shown to produce results that are in a reasonable agreement with observation values. PMID:23226984
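Linearizing a curvilinear relationship before applying multiple linear regression can be illustrated with a log transform. The sketch below assumes, purely for illustration, a multiplicative model with a power law in temperature and an exponential in relative humidity; the abstract does not give the actual model form, and the parameter values and data here are synthetic.

```python
import numpy as np

rng = np.random.default_rng(3)
temp = rng.uniform(10.0, 45.0, size=100)  # hypothetical air temperature, deg C
rh = rng.uniform(5.0, 80.0, size=100)     # hypothetical relative humidity, percent

a, b, c = 0.05, 1.6, -0.01                # made-up parameters for the demo
evap = a * temp**b * np.exp(c * rh)       # synthetic, noise-free pan evaporation

# Taking logs turns the multiplicative model into a linear one:
#   ln(evap) = ln(a) + b * ln(temp) + c * rh
X = np.column_stack([np.ones_like(temp), np.log(temp), rh])
coef, *_ = np.linalg.lstsq(X, np.log(evap), rcond=None)
a_hat, b_hat, c_hat = np.exp(coef[0]), coef[1], coef[2]
```

On noise-free data the fit recovers the parameters used to generate it. With real measurements, fitting in log space also changes how errors are weighted, so predictions on the original scale should be checked separately.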
Improving Prediction Accuracy for WSN Data Reduction by Applying Multivariate Spatio-Temporal Correlation

PubMed Central

Carvalho, Carlos; Gomes, Danielo G.; Agoulmine, Nazim; de Souza, José Neuman

2011-01-01

This paper proposes a method based on multivariate spatial and temporal correlation to improve prediction accuracy in data reduction for Wireless Sensor Networks (WSN). Prediction of data not sent to the sink node is a technique used to save energy in WSNs by reducing the amount of data traffic. However, it may not be very accurate. Simulations were made involving simple linear regression and multiple linear regression functions to assess the performance of the proposed method. The results show a higher correlation between gathered inputs when compared to time, which is an independent variable widely used for prediction and forecasting. Prediction accuracy is lower when simple linear regression is used, whereas multiple linear regression is the most accurate one. In addition, our proposal outperforms some current solutions by about 50% in humidity prediction and 21% in light prediction. To the best of our knowledge, we are probably the first to address prediction based on multivariate correlation for WSN data reduction. PMID:22346626
Health-related quality of life in multiple sclerosis: role of cognitive appraisals of self, illness and treatment.

PubMed

Wilski, Maciej; Tasiemski, Tomasz

2016-07-01

Health-related quality of life (HRQoL) is considered an important measure of treatment and rehabilitation outcomes in multiple sclerosis (MS) patients. In this study, we used multivariate regression analysis to examine the role of cognitive appraisals, adjusted for clinical, socioeconomic and demographic variables, as correlates of HRQoL in MS. The cross-sectional study included 257 MS patients, who completed Multiple Sclerosis Impact Scale, Generalized Self-Efficacy Scale, Rosenberg Self-Esteem Scale, Brief Illness Perception Questionnaire, Treatment Beliefs Scale, Actually Received Support Scale (a part of Berlin Social Support Scale) and Socioeconomic Resources Scale. Demographic and clinical characteristics of the participants were collected with a self-report survey. Correlation and regression analyses were conducted to determine associations between the variables. Five variables, illness identity (β = 0.29, p ≤ 0.001), self-esteem (β = -0.22, p ≤ 0.001), general self-efficacy (β = -0.21, p ≤ 0.001), disability subgroup "EDSS" (β = 0.14, p = 0.006) and age (β = 0.12, p = 0.012), were significant correlates of HRQoL in MS. These variables explained 46 % of variance in the dependent variable. Moreover, we identified correlates of physical and psychological dimensions of HRQoL. Cognitive appraisals, such as general self-efficacy, self-esteem and illness perception, are more salient correlates of HRQoL than social support, socioeconomic resources and clinical characteristics, such as type and duration of MS. Therefore, interventions aimed at cognitive appraisals may also improve HRQoL of MS patients.

Exact and Approximate Statistical Inference for Nonlinear Regression and the Estimating Equation Approach.

PubMed

Demidenko, Eugene

2017-09-01

The exact density distribution of the nonlinear least squares estimator in the one-parameter regression model is derived in closed form and expressed through the cumulative distribution function of the standard normal variable. Several proposals to generalize this result are discussed. The exact density is extended to the estimating equation (EE) approach and the nonlinear regression with an arbitrary number of linear parameters and one intrinsically nonlinear parameter. For a very special nonlinear regression model, the derived density coincides with the distribution of the ratio of two normally distributed random variables previously obtained by Fieller (1932), unlike other approximations previously suggested by other authors. Approximations to the density of the EE estimators are discussed in the multivariate case. Numerical complications associated with the nonlinear least squares are illustrated, such as nonexistence and/or multiple solutions, as major factors contributing to poor density approximation. The nonlinear Markov-Gauss theorem is formulated based on the near exact EE density approximation.
Statistical summary of selected physical, chemical, and toxicity characteristics and estimates of annual constituent loads in urban stormwater, Maricopa County, Arizona

USGS Publications Warehouse

Fossum, Kenneth D.; O'Day, Christie M.; Wilson, Barbara J.; Monical, Jim E.

2001-01-01

Stormwater and streamflow in Maricopa County were monitored to (1) describe the physical, chemical, and toxicity characteristics of stormwater from areas having different land uses, (2) describe the physical, chemical, and toxicity characteristics of streamflow from areas that receive urban stormwater, and (3) estimate constituent loads in stormwater. Urban stormwater and streamflow had similar ranges in most constituent concentrations. The mean concentration of dissolved solids in urban stormwater was lower than in streamflow from the Salt River and Indian Bend Wash. Urban stormwater, however, had a greater chemical oxygen demand and higher concentrations of most nutrients. Mean seasonal loads and mean annual loads of 11 constituents and volumes of runoff were estimated for municipalities in the metropolitan Phoenix area, Arizona, by adjusting regional regression equations of loads. This adjustment procedure uses the original regional regression equation and additional explanatory variables that were not included in the original equation. The adjusted equations had standard errors that ranged from 161 to 196 percent. The large standard errors of the prediction result from the large variability of the constituent concentration data used in the regression analysis. Adjustment procedures produced unsatisfactory results for nine of the regressions: suspended solids, dissolved solids, total phosphorus, dissolved phosphorus, total recoverable cadmium, total recoverable copper, total recoverable lead, total recoverable zinc, and storm runoff. These equations had no consistent direction of bias and no other additional explanatory variables correlated with the observed loads. A stepwise-multiple regression or a three-variable regression (total storm rainfall, drainage area, and impervious area) and local data were used to develop local regression equations for these nine constituents. These equations had standard errors from 15 to 183 percent.
Assessing risk factors for periodontitis using regression

NASA Astrophysics Data System (ADS)

Lobo Pereira, J. A.; Ferreira, Maria Cristina; Oliveira, Teresa

2013-10-01

Multivariate statistical analysis is indispensable to assess the associations and interactions between different factors and the risk of periodontitis. Among others, regression analysis is a statistical technique widely used in healthcare to investigate and model the relationship between variables. In our work we study the impact of socio-demographic, medical and behavioral factors on periodontal health. Using linear and logistic regression models, we can assess the relevance, as risk factors for periodontitis, of the following independent variables (IVs): Age, Gender, Diabetic Status, Education, Smoking status and Plaque Index. The multiple linear regression analysis model was built to evaluate the influence of IVs on mean Attachment Loss (AL). Thus, the regression coefficients will be obtained along with the respective p-values from the significance tests. The classification of a case (individual) adopted in the logistic model was the extent of the destruction of periodontal tissues defined by an Attachment Loss greater than or equal to 4 mm in 25% (AL≥4mm/≥25%) of sites surveyed. The association measures include the Odds Ratios together with the corresponding 95% confidence intervals.
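As a reminder of how an odds ratio and its 95% confidence interval are formed, the sketch below computes an unadjusted odds ratio from a 2x2 table using the standard Wald interval on the log scale. This is a simplification: a logistic model yields adjusted odds ratios as the exponentiated coefficients, with intervals built the same way from each coefficient's standard error. The function name and the counts are hypothetical.

```python
import math


def odds_ratio_ci(a, b, c, d, z=1.96):
    """Odds ratio and Wald confidence interval from a 2x2 table.

    a: exposed cases, b: exposed non-cases,
    c: unexposed cases, d: unexposed non-cases.
    """
    odds_ratio = (a * d) / (b * c)
    # Standard error of log(OR) for independent cell counts.
    se_log_or = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper


# Made-up counts: smokers (30 cases, 20 non-cases), non-smokers (20 cases, 60 non-cases).
odds_ratio, lower, upper = odds_ratio_ci(30, 20, 20, 60)
```

An interval that excludes 1 corresponds to a statistically significant association at the matching level.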
Female homicide in Rio Grande do Sul, Brazil.

PubMed

Leites, Gabriela Tomedi; Meneghel, Stela Nazareth; Hirakata, Vania Noemi

2014-01-01

This study aimed to assess the female homicide rate due to aggression in Rio Grande do Sul, Brazil, using this as a "proxy" of femicide. This was an ecological study which correlated the female homicide rate due to aggression in Rio Grande do Sul, according to the 35 microregions defined by the Brazilian Institute of Geography and Statistics (IBGE), with socioeconomic, demographic, and access variables and health indicators. Pearson's correlation test was performed with the selected variables. After this, multiple linear regressions were performed with variables with p < 0.20. The standardized average female homicide rate due to aggression in the period from 2003 to 2007 was 3.1 deaths per 100 thousand. After multiple regression analysis, the final model included male mortality due to aggression (p = 0.016), the percentage of hospital admissions for alcohol (p = 0.005) and the proportion of ill-defined deaths (p = 0.015). The model has an explanatory power of 39% (adjusted r2 = 0.391). The results are consistent with other studies and indicate a strong relationship between structural violence in society and violence against women, in addition to a higher incidence of female deaths in places with high rates of hospital admission for alcohol.
Forecasting on the total volumes of Malaysia's imports and exports by multiple linear regression

NASA Astrophysics Data System (ADS)

Beh, W. L.; Yong, M. K. Au

2017-04-01

This study examines the importance of the macroeconomic variables that affect the total volumes of Malaysia's imports and exports, using multiple linear regression (MLR) analysis. The study uses quarterly data on the total volumes of Malaysia's imports and exports covering the period 2000-2015. The macroeconomic variables are limited to eleven: the exchange rate of US Dollar with Malaysia Ringgit (USD-MYR), exchange rate of China Yuan with Malaysia Ringgit (RMB-MYR), exchange rate of European Euro with Malaysia Ringgit (EUR-MYR), exchange rate of Singapore Dollar with Malaysia Ringgit (SGD-MYR), crude oil prices, gold prices, producer price index (PPI), interest rate, consumer price index (CPI), industrial production index (IPI) and gross domestic product (GDP). The study applies the Johansen co-integration test to investigate the relationship among the total volumes of Malaysia's imports and exports. The results show that crude oil prices, RMB-MYR, EUR-MYR and IPI play important roles in the total volumes of Malaysia's imports. Meanwhile, crude oil prices, USD-MYR and GDP play important roles in the total volumes of Malaysia's exports.
Spatio-temporal variations of nitric acid total columns from 9 years of IASI measurements - a driver study

NASA Astrophysics Data System (ADS)

Ronsmans, Gaétane; Wespes, Catherine; Hurtmans, Daniel; Clerbaux, Cathy; Coheur, Pierre-François

2018-04-01

This study aims to understand the spatial and temporal variability of HNO3 total columns in terms of explanatory variables. To achieve this, multiple linear regressions are used to fit satellite-derived time series of HNO3 daily averaged total columns. First, an analysis of the IASI 9-year time series (2008-2016) is conducted based on various equivalent latitude bands. The strong and systematic denitrification of the southern polar stratosphere is observed very clearly. It is also possible to distinguish, within the polar vortex, three regions which are differently affected by the denitrification. Three exceptional denitrification episodes in 2011, 2014 and 2016 are also observed in the Northern Hemisphere, due to unusually low arctic temperatures. The time series are then fitted by multivariate regressions to identify what variables are responsible for HNO3 variability in global distributions and time series, and to quantify their respective influence. Out of an ensemble of proxies (annual cycle, solar flux, quasi-biennial oscillation, multivariate ENSO index, Arctic and Antarctic oscillations and volume of polar stratospheric clouds), only those defined as significant (p value < 0.05) by a selection algorithm are retained for each equivalent latitude band. Overall, the regression gives a good representation of HNO3 variability, with especially good results at high latitudes (60-80 % of the observed variability explained by the model). The regressions show the dominance of annual variability in all latitudinal bands, which is related to specific chemistry and dynamics depending on the latitudes. We find that the polar stratospheric clouds (PSCs) also have a major influence in the polar regions, and that their inclusion in the model improves the correlation coefficients and the residuals.
However, there is still a relatively large portion of HNO3 variability that remains unexplained by the model, especially in the intertropical regions, where factors not included in the regression model (such as vegetation fires or lightning) may be at play.
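The general idea of fitting a time series with an annual cycle plus explanatory proxies can be sketched as below. This is not the authors' model or selection algorithm: the data are synthetic, a single random series stands in for a proxy, and NumPy's least-squares solver is used purely for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)
t = np.arange(9 * 365) / 365.25        # time in years, daily sampling (synthetic)
proxy = rng.standard_normal(t.size)    # stand-in for one explanatory proxy

# Synthetic series: constant + annual cycle + proxy effect + noise.
y = (2.0 + 0.8 * np.cos(2 * np.pi * t) + 0.3 * np.sin(2 * np.pi * t)
     + 0.5 * proxy + 0.1 * rng.standard_normal(t.size))

# Design matrix: intercept, annual harmonic pair, proxy.
X = np.column_stack([np.ones_like(t),
                     np.cos(2 * np.pi * t),
                     np.sin(2 * np.pi * t),
                     proxy])
coef, *_ = np.linalg.lstsq(X, y, rcond=None)

# Fraction of variance explained by the fitted model.
resid = y - X @ coef
r2 = 1.0 - resid.var() / y.var()
print(coef.round(2), round(r2, 3))
```

A cosine and sine pair at the same frequency lets the fit recover both the amplitude and the phase of the annual cycle with a model that stays linear in its coefficients.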
Neural correlates of gait variability in people with multiple sclerosis with fall history.

PubMed

Kalron, Alon; Allali, Gilles; Achiron, Anat

2018-05-28

Investigate the association between step time variability and related brain structures in accordance with fall status in people with multiple sclerosis (PwMS). The study included 225 PwMS. A whole-brain MRI was performed by a high-resolution 3.0-Tesla MR scanner in addition to volumetric analysis based on 3D T1-weighted images using the FreeSurfer image analysis suite. Step time variability was measured by an electronic walkway. Participants were defined as "fallers" (at least two falls during the previous year) and "non-fallers". One hundred and five PwMS were defined as fallers and had a greater step time variability compared to non-fallers (5.6% (S.D.=3.4) vs. 3.4% (S.D.=1.5); p=0.001). MS fallers exhibited a reduced volume in the left caudate and both cerebellum hemispheres compared to non-fallers. By using a linear regression analysis no association was found between gait variability and related brain structures in the total cohort and non-fallers group. However, the analysis found an association between the left hippocampus and left putamen volumes with step time variability in the faller group; p=0.031, 0.048, respectively, controlling for total cranial volume, walking speed, disability, age and gender. Nevertheless, according to the hierarchical regression model, the contribution of these brain measures to predict gait variability was relatively small compared to walking speed. An association between low left hippocampal, putamen volumes and step time variability was found in PwMS with a history of falls, suggesting brain structural characteristics may be related to falls and increased gait variability in PwMS. This article is protected by copyright. All rights reserved.
Addressing Gender Equity in Nonfaculty Salaries.

ERIC Educational Resources Information Center

Toukoushian, Robert K.

2000-01-01

Discusses methodology of gender equity studies on noninstructional employees of colleges and universities, including variable selection in the multiple regression model and alternative approaches for measuring wage gaps. Analysis of staff data at one institution finds that experience and market differences account for 80 percent of gender pay…
Correlates of Successful Aging: Are They Universal?

ERIC Educational Resources Information Center

Litwin, Howard

2005-01-01

The analysis compared differing correlates of life satisfaction among three diverse population groups in Israel, examining background and health status variables, social environment factors, and activity indicators. Multiple regression analysis revealed that veteran Jewish-Israelis (n = 2,043) had the largest set of predictors, the strongest of…
Using Multilevel Modeling in Language Assessment Research: A Conceptual Introduction

ERIC Educational Resources Information Center

Barkaoui, Khaled

2013-01-01

This article critiques traditional single-level statistical approaches (e.g., multiple regression analysis) to examining relationships between language test scores and variables in the assessment setting. It highlights the conceptual, methodological, and statistical problems associated with these techniques in dealing with multilevel or nested…
Paranormal belief, experience, and the Keirsey Temperament Sorter.

PubMed

Fox, J; Williams, C

2000-06-01

121 college students completed the Anomalous Experience Inventory and the Keirsey Temperament Sorter. Multiple regression analyses provided significant models predicting both Paranormal Experience and Belief; the main predictors were the other subscales of the Anomalous Experience Inventory with the Keirsey variables playing only a minor role.
Impact of Collegiate Recreation on Academic Success

ERIC Educational Resources Information Center

Sanderson, Heather; DeRousie, Jason; Guistwite, Nicole

2018-01-01

This study examined the impact of collegiate recreation participation on academic success as measured by grade point average, course credit completion, and persistence or graduation. Logistic and multiple regressions were run to explore the relationship between total recreation contact hours and outcome variables. Results indicated a positive and…
Partial Least Square Analyses of Landscape and Surface Water Biota Associations in the Savannah River Basin

EPA Science Inventory

Ecologists are often faced with the problem of small sample sizes, large numbers of correlated predictors, and high noise-to-signal relationships. This necessitates excluding important variables from the model when applying standard multiple or multivariate regression analyses. In ...
Predicting daily use of urban forest recreation sites

Treesearch

John F. Dwyer

1988-01-01

A multiple linear regression model explains 90% of the variance in daily use of an urban recreation site. Explanatory variables include season, day of the week, and weather. The results offer guides for recreation site planning and management as well as suggestions for improving the model.
Argentina soybean yield model

NASA Technical Reports Server (NTRS)

Callis, S. L.; Sakamoto, C.

1984-01-01

A model based on multiple regression was developed to estimate soybean yields for the country of Argentina. A meteorological data set was obtained for the country by averaging data for stations within the soybean growing area. Predictor variables for the model were derived from monthly total precipitation and monthly average temperature. A trend variable was included for the years 1969 to 1978 since an increasing trend in yields due to technology was observed between these years.
Age, Body Mass Index, and Frequency of Sexual Activity are Independent Predictors of Testosterone Deficiency in Men With Erectile Dysfunction.

PubMed

Pagano, Matthew J; De Fazio, Adam; Levy, Alison; RoyChoudhury, Arindam; Stahl, Peter J

2016-04-01

To identify clinical predictors of testosterone deficiency (TD) in men with erectile dysfunction (ED), thereby identifying subgroups that are most likely to benefit from targeted testosterone screening. Retrospective review was conducted on 498 men evaluated for ED between January 2013 and July 2014. Testing for TD by early morning serum measurement was offered to all eligible men. Patients with history of prostate cancer or testosterone replacement were excluded. Univariable linear regression was conducted to analyze 19 clinical variables for associations with serum total testosterone (TT), calculated free testosterone (cFT), and TD (T <300 ng/dL or cFT <6.5 ng/dL). Variables significant on univariable analysis were included in multiple regression models. A total of 225 men met inclusion criteria. Lower TT levels were associated with greater body mass index (BMI), less frequent sexual activity, and absence of clinical depression on multiple regression analysis. TT decreased by 49.5 ng/dL for each 5-point increase in BMI. BMI and age were the only independent predictors of cFT levels on multivariable analysis. Overall, 62 subjects (27.6%) met criteria for TD. Older age, greater BMI, and less frequent sexual activity were the only independent predictors of TD on multiple regression. We observed a 2.2-fold increase in the odds of TD for every 5-point increase in BMI, and a 1.8-fold increase for every 10 year increase in age. Men with ED and elevated BMI, advanced age, or infrequent sexual activity appear to be at high risk of TD, and such patients represent excellent potential candidates for targeted testosterone screening. Copyright © 2016 Elsevier Inc. All rights reserved.
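The reported effect sizes are quoted per 5 BMI points and per 10 years of age. A minimal sketch of how such figures can be re-expressed per single unit follows, assuming the underlying logistic model is linear in the log-odds and the linear model is linear in BMI. The function name is hypothetical and the per-unit values are derived here, not reported by the authors.

```python
import math

def rescale_odds_ratio(odds_ratio, from_units, to_units):
    """Re-express an odds ratio quoted per `from_units` as one per `to_units`.

    Assumes the log-odds change linearly with the predictor.
    """
    return math.exp(math.log(odds_ratio) * to_units / from_units)

# Odds ratios from the abstract: 2.2 per 5 BMI points, 1.8 per 10 years of age.
or_per_bmi_point = rescale_odds_ratio(2.2, from_units=5, to_units=1)
or_per_year = rescale_odds_ratio(1.8, from_units=10, to_units=1)

# Linear-model slope from the abstract: TT falls 49.5 ng/dL per 5 BMI points.
tt_drop_per_bmi_point = 49.5 / 5

print(round(or_per_bmi_point, 3), round(or_per_year, 3), tt_drop_per_bmi_point)
```

Odds ratios rescale multiplicatively (by a power), whereas linear regression slopes rescale by simple division.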
Two SPSS programs for interpreting multiple regression results.

PubMed

Lorenzo-Seva, Urbano; Ferrando, Pere J; Chico, Eliseo

2010-02-01

When multiple regression is used in explanation-oriented designs, it is very important to determine both the usefulness of the predictor variables and their relative importance. Standardized regression coefficients are routinely provided by commercial programs. However, they generally function rather poorly as indicators of relative importance, especially in the presence of substantially correlated predictors. We provide two user-friendly SPSS programs that implement currently recommended techniques and recent developments for assessing the relevance of the predictors. The programs also allow the user to take into account the effects of measurement error. The first program, MIMR-Corr.sps, uses a correlation matrix as input, whereas the second program, MIMR-Raw.sps, uses the raw data and computes bootstrap confidence intervals of different statistics. The SPSS syntax, a short manual, and data files related to this article are available as supplemental materials from http://brm.psychonomic-journals.org/content/supplemental.
Remote-sensing data processing with the multivariate regression analysis method for iron mineral resource potential mapping: a case study in the Sarvian area, central Iran

NASA Astrophysics Data System (ADS)

Mansouri, Edris; Feizi, Faranak; Jafari Rad, Alireza; Arian, Mehran

2018-03-01

This paper uses multivariate regression to build a mathematical model for mineral prospectivity mapping (MPM) in iron skarn exploration in the Sarvian area, central Iran. The main aim is to apply multivariate regression analysis (as an MPM method) to the mapped iron outcrops in the northeastern part of the study area in order to discover new iron deposits in other parts of the study area. Two types of multivariate regression models using two linear equations were employed to discover new mineral deposits. This method is one of the reliable methods for processing satellite images. ASTER satellite images (14 bands) were used as unique independent variables (UIVs), and iron outcrops were mapped as dependent variables for MPM. According to the probability value (p value), the coefficient of determination (R²) and the adjusted coefficient of determination (adjusted R²), the second regression model (which consisted of multiple UIVs) fitted better than the other models. The accuracy of the model was confirmed by the iron outcrop map and geological observation. Based on field observation, iron mineralization occurs at the contact of limestone and intrusive rocks (skarn type).
Calibration of multivariate scatter plots for exploratory analysis of relations within and between sets of variables in genomic research.

PubMed

Graffelman, Jan; van Eeuwijk, Fred

2005-12-01

The scatter plot is a well known and easily applicable graphical tool to explore relationships between two quantitative variables. For the exploration of relations between multiple variables, generalisations of the scatter plot are useful. We present an overview of multivariate scatter plots focussing on the following situations. Firstly, we look at a scatter plot for portraying relations between quantitative variables within one data matrix. Secondly, we discuss a similar plot for the case of qualitative variables. Thirdly, we describe scatter plots for the relationships between two sets of variables where we focus on correlations. Finally, we treat plots of the relationships between multiple response and predictor variables, focussing on the matrix of regression coefficients. We will present both known and new results, where an important original contribution concerns a procedure for the inclusion of scales for the variables in multivariate scatter plots. We provide software for drawing such scales. We illustrate the construction and interpretation of the plots by means of examples on data collected in a genomic research program on taste in tomato.
Body Fat Percentage Prediction Using Intelligent Hybrid Approaches

PubMed Central

Shao, Yuehjen E.

2014-01-01

Excess body fat often leads to obesity. Obesity is typically associated with serious medical diseases, such as cancer, heart disease, and diabetes. Accordingly, knowing the body fat is an extremely important issue since it affects everyone's health. Although there are several ways to measure the body fat percentage (BFP), the accurate methods are often associated with hassle and/or high costs. Traditional single-stage approaches may use certain body measurements or explanatory variables to predict the BFP. Diverging from existing approaches, this study proposes new intelligent hybrid approaches to obtain fewer explanatory variables, and the proposed forecasting models are able to effectively predict the BFP. The proposed hybrid models consist of multiple regression (MR), artificial neural network (ANN), multivariate adaptive regression splines (MARS), and support vector regression (SVR) techniques. The first stage of the modeling includes the use of MR and MARS to obtain fewer but more important sets of explanatory variables. In the second stage, the remaining important variables serve as inputs for the other forecasting methods. A real dataset was used to demonstrate the development of the proposed hybrid models. The prediction results revealed that the proposed hybrid schemes outperformed the typical, single-stage forecasting models. PMID:24723804

Modeling Laterality of the Globus Pallidus Internus in Patients With Parkinson's Disease.

PubMed

Sharim, Justin; Yazdi, Daniel; Baohan, Amy; Behnke, Eric; Pouratian, Nader

2017-04-01

Neurosurgical interventions such as deep brain stimulation surgery of the globus pallidus internus (GPi) play an important role in the treatment of medically refractory Parkinson's disease (PD), and require high targeting accuracy. Variability in the laterality of the GPi across patients with PD has not been well characterized. The aim of this report is to identify factors that may contribute to differences in position of the motor region of GPi. The charts and operative reports of 101 PD patients following deep brain stimulation surgery (70 males, aged 11-78 years) representing 201 GPi were retrospectively reviewed. Data extracted for each subject include age, gender, anterior and posterior commissures (AC-PC) distance, and third ventricular width. Multiple linear regression, stepwise regression, and relative importance of regressors analysis were performed to assess the predictive ability of these variables on GPi laterality. Multiple linear regression for target vs. third ventricular width, gender, AC-PC distance, and age were significant for normalized linear regression coefficients of 0.333 (p < 0.0001), 0.206 (p = 0.00219), 0.168 (p = 0.0119), and 0.159 (p = 0.0136), respectively. Third ventricular width, gender, AC-PC distance, and age each account for 44.06% (21.38-65.69%, 95% CI), 20.82% (10.51-35.88%), 21.46% (8.28-37.05%), and 13.66% (2.62-28.64%) of the R² value, respectively. Effect size calculation was significant for a change in the GPi laterality of 0.19 mm per mm of ventricular width, 0.11 mm per mm of AC-PC distance, 0.017 mm per year in age, and 0.54 mm increase for male gender. This variability highlights the limitations of indirect targeting alone, and argues for the continued use of MRI as well as intraoperative physiological testing to account for such factors that contribute to patient-specific variability in GPi localization. © 2016 International Neuromodulation Society.
Using an innovative multiple regression procedure in a cancer population (Part 1): detecting and probing relationships of common interacting symptoms (pain, fatigue/weakness, sleep problems) as a strategy to discover influential symptom pairs and clusters

PubMed Central

Francoeur, Richard B

2015-01-01

Background: The majority of patients with advanced cancer experience symptom pairs or clusters among pain, fatigue, and insomnia. Improved methods are needed to detect and interpret interactions among symptoms or disease markers to reveal influential pairs or clusters. In prior work, I developed and validated sequential residual centering (SRC), a method that improves the sensitivity of multiple regression to detect interactions among predictors, by conditioning for multicollinearity (shared variation) among interactions and component predictors. Materials and methods: Using a hypothetical three-way interaction among pain, fatigue, and sleep to predict depressive affect, I derive and explain SRC multiple regression. Subsequently, I estimate raw and SRC multiple regressions using real data for these symptoms from 268 palliative radiation outpatients. Results: Unlike raw regression, SRC reveals that the three-way interaction (pain × fatigue/weakness × sleep problems) is statistically significant. In follow-up analyses, the relationship between pain and depressive affect is aggravated (magnified) within two partial ranges: 1) complete-to-some control over fatigue/weakness when there is complete control over sleep problems (ie, a subset of the pain–fatigue/weakness symptom pair), and 2) no control over fatigue/weakness when there is some-to-no control over sleep problems (ie, a subset of the pain–fatigue/weakness–sleep problems symptom cluster). Otherwise, the relationship weakens (buffering) as control over fatigue/weakness or sleep problems diminishes. Conclusion: By reducing the standard error, SRC unmasks a three-way interaction comprising a symptom pair and cluster. Low-to-moderate levels of the moderator variable for fatigue/weakness magnify the relationship between pain and depressive affect. However, when the comoderator variable for sleep problems accompanies fatigue/weakness, only frequent or unrelenting levels of both symptoms magnify the relationship.
These findings suggest that a countervailing mechanism involving depressive affect could account for the effectiveness of a cognitive behavioral intervention to reduce the severity of a pain, fatigue, and sleep disturbance cluster in a previous randomized trial. PMID:25565865
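The core idea of residual centering can be sketched for a single two-way product term. This is plain residual centering on synthetic data, not the author's sequential (SRC) procedure for three-way interactions. The variable names are hypothetical: the product term is regressed on its component predictors and only the residual is kept as the interaction regressor.

```python
import numpy as np

rng = np.random.default_rng(1)
n = 500
x1 = rng.normal(5.0, 1.0, n)   # synthetic predictors with nonzero means,
x2 = rng.normal(3.0, 1.0, n)   # so the raw product is collinear with them
raw_product = x1 * x2

# Regress the product on [1, x1, x2] and keep the residual.
A = np.column_stack([np.ones(n), x1, x2])
beta, *_ = np.linalg.lstsq(A, raw_product, rcond=None)
centered_product = raw_product - A @ beta

# Correlation of x1 with the raw and the residual-centered interaction term.
corr_raw = np.corrcoef(x1, raw_product)[0, 1]
corr_centered = np.corrcoef(x1, centered_product)[0, 1]
print(round(corr_raw, 3), round(corr_centered, 12))
```

By construction the residual is orthogonal to the component predictors in the sample, which removes the shared variation that inflates standard errors of the interaction coefficient.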
Using an innovative multiple regression procedure in a cancer population (Part 1): detecting and probing relationships of common interacting symptoms (pain, fatigue/weakness, sleep problems) as a strategy to discover influential symptom pairs and clusters.

PubMed

Francoeur, Richard B

2015-01-01

The majority of patients with advanced cancer experience symptom pairs or clusters among pain, fatigue, and insomnia. Improved methods are needed to detect and interpret interactions among symptoms or disease markers to reveal influential pairs or clusters. In prior work, I developed and validated sequential residual centering (SRC), a method that improves the sensitivity of multiple regression to detect interactions among predictors, by conditioning for multicollinearity (shared variation) among interactions and component predictors. Using a hypothetical three-way interaction among pain, fatigue, and sleep to predict depressive affect, I derive and explain SRC multiple regression. Subsequently, I estimate raw and SRC multiple regressions using real data for these symptoms from 268 palliative radiation outpatients. Unlike raw regression, SRC reveals that the three-way interaction (pain × fatigue/weakness × sleep problems) is statistically significant. In follow-up analyses, the relationship between pain and depressive affect is aggravated (magnified) within two partial ranges: 1) complete-to-some control over fatigue/weakness when there is complete control over sleep problems (ie, a subset of the pain-fatigue/weakness symptom pair), and 2) no control over fatigue/weakness when there is some-to-no control over sleep problems (ie, a subset of the pain-fatigue/weakness-sleep problems symptom cluster). Otherwise, the relationship weakens (buffering) as control over fatigue/weakness or sleep problems diminishes. By reducing the standard error, SRC unmasks a three-way interaction comprising a symptom pair and cluster. Low-to-moderate levels of the moderator variable for fatigue/weakness magnify the relationship between pain and depressive affect. However, when the comoderator variable for sleep problems accompanies fatigue/weakness, only frequent or unrelenting levels of both symptoms magnify the relationship.
These findings suggest that a countervailing mechanism involving depressive affect could account for the effectiveness of a cognitive behavioral intervention to reduce the severity of a pain, fatigue, and sleep disturbance cluster in a previous randomized trial.
Validating the absolute reliability of a fat free mass estimate equation in hemodialysis patients using near-infrared spectroscopy.

PubMed

Kono, Kenichi; Nishida, Yusuke; Moriyama, Yoshihumi; Taoka, Masahiro; Sato, Takashi

2015-06-01

The assessment of nutritional states using fat free mass (FFM) measured with near-infrared spectroscopy (NIRS) is clinically useful. This measurement should incorporate the patient's post-dialysis weight ("dry weight"), in order to exclude the effects of any change in water mass. We therefore used NIRS to investigate the regression, independent variables, and absolute reliability of FFM in dry weight. The study included 47 outpatients from the hemodialysis unit. Body weight was measured before dialysis, and FFM was measured using NIRS before and after dialysis treatment. Multiple regression analysis was used to estimate the FFM in dry weight as the dependent variable. The measured FFM before dialysis treatment (Mw-FFM) and the difference between measured and dry weight (Mw-Dw) were the independent variables. We performed Bland-Altman analysis to detect errors between the statistically estimated FFM and the measured FFM after dialysis treatment. The multiple regression equation to estimate the FFM in dry weight was: Dw-FFM = 0.038 + (0.984 × Mw-FFM) + (-0.571 × [Mw-Dw]) (R² = 0.99). There was no systematic bias between the estimated and the measured values of FFM in dry weight. Using NIRS, FFM in dry weight can be calculated by an equation including FFM in measured weight and the difference between the measured weight and the dry weight. © 2015 The Authors. Therapeutic Apheresis and Dialysis © 2015 International Society for Apheresis.
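The reported regression equation can be applied directly. The sketch below wraps it in a small function; the function name, the argument names, and the assumption that all quantities are in kilograms are mine, since the abstract does not state units, and the example inputs are hypothetical rather than patient data.

```python
def estimate_dry_weight_ffm(mw_ffm, measured_weight, dry_weight):
    """Estimate fat free mass at dry weight from the equation in the abstract.

    Dw-FFM = 0.038 + 0.984 * Mw-FFM - 0.571 * (Mw - Dw)
    All inputs are assumed to share one mass unit (kilograms here).
    """
    return 0.038 + 0.984 * mw_ffm - 0.571 * (measured_weight - dry_weight)

# Hypothetical patient: 50 kg FFM before dialysis, 2 kg above dry weight.
dw_ffm = estimate_dry_weight_ffm(mw_ffm=50.0, measured_weight=62.0, dry_weight=60.0)
print(round(dw_ffm, 3))
```

The negative coefficient on the weight difference means that a larger pre-dialysis fluid excess lowers the estimated FFM at dry weight.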
Variables influencing allocation of capital expenditure in Indonesia

NASA Astrophysics Data System (ADS)

Muda, Iskandar; Naibaho, Revmianson

2018-03-01

The purpose of this study is to examine the factors affecting capital expenditure in Indonesia. The independent variables are Financing Surplus, Total Population and Regional Sizes, and the dependent variable is capital expenditure allocation. This is causal associative research. The data are secondary data from several provinces in Indonesia, analyzed with multiple regression. The results show that capital expenditure allocation in Indonesia is significantly affected by Financing Surplus, Total Population and Regional Sizes.
The relationship between attendance at birth and maternal mortality rates: an exploration of United Nations' data sets including the ratios of physicians and nurses to population, GNP per capita and female literacy.

PubMed

Robinson, J J; Wharrad, H

2001-05-01

This is the third and final paper drawing on data taken from United Nations (UN) data sets. The first paper examined the global distribution of health professionals (as measured by ratios of physicians and nurses to population), and its relationship to gross national product per capita (GNP) (Wharrad & Robinson 1999). The second paper explored the relationships between the global distribution of physicians and nurses, GNP, female literacy and the health outcome indicators of infant and under five mortality rates (IMR and u5MR) (Robinson & Wharrad 2000). In the present paper, the global distribution of health professionals is explored in relation to maternal mortality rates (MMRs). The proportion of births attended by medical and nonmedical staff, defined as "attendance at birth by trained personnel" (physicians, nurses, midwives or primary health care workers trained in midwifery skills), is included as an additional independent variable in the regression analyses, together with the ratio of physicians and nurses to population, female literacy and GNP. The aim is to extend our earlier analyses by considering the relationships between the global distribution of health professionals (ratios of physicians and nurses to population, and the proportion of births attended by trained health personnel), GNP, female literacy and MMR.
Flood characteristics of Alaskan streams

USGS Publications Warehouse

Lamke, R.D.

1979-01-01

Peak discharge data for Alaskan streams are summarized and analyzed. Multiple-regression equations relating peak discharge magnitude and frequency to climatic and physical characteristics of 260 gaged basins were determined in order to estimate average recurrence interval of floods at ungaged sites. These equations are for 1.25-, 2-, 5-, 10-, 25-, and 50-year average recurrence intervals. In this report, Alaska was divided into two regions, one having a maritime climate with fall and winter rains and floods, the other having spring and summer floods of a variety or combinations of causes. Average standard errors of the six multiple-regression equations for these two regions were 48 and 74 percent, respectively. Maximum recorded floods at more than 400 sites throughout Alaska are tabulated. Maps showing lines of equal intensity of the principal climatic variables found to be significant (mean annual precipitation and mean minimum January temperature), and location of the 260 sites used in the multiple-regression analyses are included. Little flood data have been collected in western and arctic Alaska, and the predictive equations are therefore less reliable for those areas. (Woodard-USGS)
Theory of mind and executive function: working-memory capacity and inhibitory control as predictors of false-belief task performance.

PubMed

Mutter, Brigitte; Alcorn, Mark B; Welsh, Marilyn

2006-06-01

This study of the relationship between theory of mind and executive function examined whether age differences on the false-belief task between 3 and 5 years of age are related to development of working-memory capacity and inhibitory processes. 72 children completed tasks measuring false belief, working memory, and inhibition. Significant age effects were observed for false-belief and working-memory performance, as well as for the false-alarm and perseveration measures of inhibition. A simultaneous multiple linear regression specified the contribution of age, inhibition, and working memory to the prediction of false-belief performance. This model was significant, explaining a total of 36% of the variance. To examine the independent contributions of the working-memory and inhibition variables, after controlling for age, two hierarchical multiple linear regressions were conducted. These multiple regression analyses indicate that working memory and inhibition make small, overlapping contributions to false-belief performance after accounting for age, but that working memory, as measured in this study, is a somewhat better predictor of false-belief understanding than is inhibition.
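In a hierarchical regression, the contribution of a block of predictors entered after a control variable is usually judged by the change in R² and its F statistic. A minimal sketch of that calculation follows. The sample size (72) and the full-model R² (0.36) come from the abstract; the age-only R² of 0.30 and the predictor counts are hypothetical placeholders, not reported values.

```python
def f_change(r2_reduced, r2_full, n, p_full, k_added):
    """F statistic for the R^2 increase when k_added predictors join a model.

    n is the sample size and p_full the number of predictors in the full model.
    The statistic has (k_added, n - p_full - 1) degrees of freedom.
    """
    numerator = (r2_full - r2_reduced) / k_added
    denominator = (1.0 - r2_full) / (n - p_full - 1)
    return numerator / denominator

# Hypothetical step: age alone explains 30%; adding two predictors reaches 36%.
f_stat = f_change(r2_reduced=0.30, r2_full=0.36, n=72, p_full=3, k_added=2)
print(round(f_stat, 4))
```

Entering the same block in a different order generally gives a different R² change when predictors overlap, which is why the abstract describes the contributions as overlapping.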
Clifford support vector machines for classification, regression, and recurrence.

PubMed

Bayro-Corrochano, Eduardo Jose; Arana-Daniel, Nancy

2010-11-01

This paper introduces the Clifford support vector machines (CSVM) as a generalization of the real and complex-valued support vector machines using the Clifford geometric algebra. In this framework, we handle the design of kernels involving the Clifford or geometric product. In this approach, one redefines the optimization variables as multivectors. This allows us to have a multivector as output. Therefore, we can represent multiple classes according to the dimension of the geometric algebra in which we work. We show that one can apply CSVM for classification and regression and also to build a recurrent CSVM. The CSVM is an attractive approach for the multiple input multiple output processing of high-dimensional geometric entities. We carried out comparisons between CSVM and the current approaches to solve multiclass classification and regression. We also study the performance of the recurrent CSVM with experiments involving time series. The authors believe that this paper can be of great use for researchers and practitioners interested in multiclass hypercomplex computing, particularly for applications in complex and quaternion signal and image processing, satellite control, neurocomputation, pattern recognition, computer vision, augmented virtual reality, robotics, and humanoids.
Automatic identification of variables in epidemiological datasets using logic regression.

PubMed

Lorenz, Matthias W; Abdi, Negin Ashtiani; Scheckenbach, Frank; Pflug, Anja; Bülbül, Alpaslan; Catapano, Alberico L; Agewall, Stefan; Ezhov, Marat; Bots, Michiel L; Kiechl, Stefan; Orth, Andreas

2017-04-13

For an individual participant data (IPD) meta-analysis, multiple datasets must be transformed into a consistent format, e.g. using uniform variable names. When large numbers of datasets have to be processed, this can be a time-consuming and error-prone task. Automated or semi-automated identification of variables can help to reduce the workload and improve the data quality. For semi-automation, high sensitivity in the recognition of matching variables is particularly important, because it allows creating software which, for a target variable, presents a choice of source variables from which a user can choose the matching one, with only a low risk of having missed a correct source variable. For each variable in a set of target variables, a number of simple rules were manually created. With logic regression, an optimal Boolean combination of these rules was searched for every target variable, using a random subset of a large database of epidemiological and clinical cohort data (construction subset). In a second subset of this database (validation subset), these optimal combinations of rules were validated. In the construction sample, 41 target variables were allocated on average with a positive predictive value (PPV) of 34%, and a negative predictive value (NPV) of 95%. In the validation sample, PPV was 33%, whereas NPV remained at 94%. In the construction sample, PPV was 50% or less in 63% of all variables, in the validation sample in 71% of all variables. We demonstrated that the application of logic regression in a complex data management task in large epidemiological IPD meta-analyses is feasible. However, the performance of the algorithm is poor, which may require backup strategies.
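The predictive values quoted in this abstract follow from a two-by-two table of matches. A minimal sketch is given below; the counts are hypothetical, chosen only so that the resulting PPV and NPV resemble the reported construction-sample figures, and the function name is an assumption.

```python
def predictive_values(tp, fp, tn, fn):
    """Return (PPV, NPV, sensitivity) from confusion-matrix counts."""
    ppv = tp / (tp + fp)          # share of proposed matches that are correct
    npv = tn / (tn + fn)          # share of rejected candidates that are truly non-matches
    sensitivity = tp / (tp + fn)  # share of true matches that were proposed
    return ppv, npv, sensitivity

# Hypothetical counts for one target variable (not taken from the study).
ppv, npv, sensitivity = predictive_values(tp=34, fp=66, tn=95, fn=5)
print(f"PPV={ppv:.2f}  NPV={npv:.2f}  sensitivity={sensitivity:.2f}")
```

A low PPV combined with a high sensitivity is tolerable in a semi-automated workflow, because a user still picks the correct source variable from the proposed candidates.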
A Comparison of Various MRA Methods Applied to Longitudinal Evaluation Studies in Vocational Education.

ERIC Educational Resources Information Center

Kapes, Jerome T.; And Others

Three models of multiple regression analysis (MRA): single equation, commonality analysis, and path analysis, were applied to longitudinal data from the Pennsylvania Vocational Development Study. Variables influencing weekly income of vocational education students one year after high school graduation were examined: grade point averages (grades…
Combining data visualization and statistical approaches for interpreting measurements and meta-data: Integrating heatmaps, variable clustering, and mixed regression models

EPA Science Inventory

The advent of new higher throughput analytical instrumentation has put a strain on interpreting and explaining the results from complex studies. Contemporary human, environmental, and biomonitoring data sets are comprised of tens or hundreds of analytes, multiple repeat measures...
The Effects of Market Structure on Television News Pricing.

ERIC Educational Resources Information Center

Wirth, Michael O.; Wollert, James A.

Multiple regression techniques were used to examine the business side of local television news operations for November 1978. Research questions examined the effect of several variables on local television news prices (advertising rates), including type of ownership, network affiliation/signal type, market size, cable network penetration, market…
MULTIVARIATE STATISTICAL MODELS FOR EFFECTS OF PM AND COPOLLUTANTS IN A DAILY TIME SERIES EPIDEMIOLOGY STUDY

EPA Science Inventory

Most analyses of daily time series epidemiology data relate mortality or morbidity counts to PM and other air pollutants by means of single-outcome regression models using multiple predictors, without taking into account the complex statistical structure of the predictor variable...
Psychosocial and demographic predictors of fruit, juice and vegetable consumption among 11-14-year-old Boy Scouts

USDA-ARS's Scientific Manuscript database

Psychosocial and demographic correlates of fruit, juice, and vegetable (FJV) consumption were investigated to guide how to increase FJV intake. Experimental design consisted of hierarchical multiple regression analysis of FJV consumption on demographics and psychosocial variables. Subjects were boys...
Rural Economic Development: What Makes Rural Communities Grow?

ERIC Educational Resources Information Center

Aldrich, Lorna; Kusmin, Lorin

This report identifies local factors that foster rural economic growth. A review of the literature revealed potential indicators of county economic growth, and those indicators were then tested against data for nonmetro counties during the 1980s using multiple regression analysis. The principal variables examined included demographic and labor…
New Zealand Management Students' Perceptions of Communication Technologies in Correspondence Education.

ERIC Educational Resources Information Center

Ostman, Ronald E.; Wagner, Graham A.

1987-01-01

Describes a survey of 724 management students in New Zealand's Technical Correspondence Institute which was conducted to determine whether the introduction of educational technologies could decrease the dropout rate. The multiple linear regression model that was used to analyze the questionnaire responses is presented, and predictor variables are…
Introduction

Treesearch

J. Michael Scott; C. John Ralph

1981-01-01

Counting birds has a long tradition. Since early in human history, man has noted and recorded the presence, absence, and abundance of birds. This long, and presumably honorable, pursuit that we all engage in, to a greater or lesser extent, is the common currency of many ornithological studies. These studies range from multiple regression analyses of habitat variables...
Environmental factors affecting understory diversity in second-growth deciduous forests

Treesearch

Cynthia D. Huebner; J.C. Randolph; G.R. Parker

1995-01-01

The purpose of this study was to determine the most important nonanthropogenic factors affecting understory (herbs, shrubs and low-growing vines) diversity in forested landscapes of southern Indiana. Fourteen environmental variables were measured for 46 sites. Multiple regression analysis showed significant positive correlation between understory diversity and tree...
Evidencing the association between swimming capacities and performance indicators in water polo: a multiple regression study.

PubMed

Kontic, Dean; Zenic, Natasa; Uljevic, Ognjen; Sekulic, Damir; Lesnik, Blaz

2017-06-01

Swimming capacities are hypothesized to be important determinants of water polo performance but there is an evident lack of studies examining different swimming capacities in relation to specific offensive and defensive performance variables in this sport. The aim of this study was to determine the relationship between five swimming capacities and six performance determinants in water polo. The sample comprised 79 high-level youth water polo players (all males, 17-18 years of age). The variables included six performance-related variables (agility in offence and defense, efficacy in offence and defense, polyvalence in offence and defense), and five swimming-capacity tests (water polo sprint test [15 m], swimming sprint test [25 m], short-distance [100 m], aerobic endurance [400 m] and an anaerobic lactate endurance test [4× 50 m]). First, multiple regressions were calculated for one-half of the sample of subjects which were then validated with the remaining half of the sample. The 25-m swim was not included in the regression analyses due to the multicollinearity with other predictors. The originally calculated regression models were validated for defensive agility (R=0.67 and R=0.55 for the original regression calculation and validation subsample, respectively) offensive agility (R=0.59 and R=0.61), and offensive efficacy (R=0.64 and R=0.58). Anaerobic lactate endurance is a significant predictor of offensive and defensive agility, while 15 m sprint significantly contributes to offensive efficacy. Swimming capacities are not found to be related to the polyvalence of the players. The most superior offensive performance can be expected from those players with a high level of anaerobic lactate endurance and advanced sprinting capacity, while anaerobic lactate endurance is recognized as most important quality in defensive duties. Future studies should observe players' polyvalence in relation to (theoretical) knowledge of technical and tactical tasks. 
Results reinforce the need for the cross-validation of the prediction-models in sport and exercise sciences.
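The split-sample validation described in this abstract (fit on one half, check the prediction on the other half) can be sketched as follows. The data are simulated, and the number of predictors, coefficients, and helper names are illustrative assumptions rather than the authors' procedure.

```python
import numpy as np

rng = np.random.default_rng(1)
n = 79                                   # reported sample size; values are simulated
X = rng.normal(size=(n, 4))              # four swimming tests (25-m sprint excluded)
y = X @ np.array([0.5, 0.0, 0.2, 0.6]) + rng.normal(0, 1.0, n)

# Randomly split the sample into a calculation half and a validation half.
idx = rng.permutation(n)
train, valid = idx[: n // 2], idx[n // 2:]

def design(X):
    """Add an intercept column to the predictor matrix."""
    return np.column_stack([np.ones(len(X)), X])

# Fit the regression on the calculation half only.
coef, *_ = np.linalg.lstsq(design(X[train]), y[train], rcond=None)

def multiple_r(X, y, coef):
    """Correlation between predicted and observed values."""
    return np.corrcoef(design(X) @ coef, y)[0, 1]

r_train = multiple_r(X[train], y[train], coef)
r_valid = multiple_r(X[valid], y[valid], coef)   # coefficients are NOT refitted here
print(f"R (calculation half) = {r_train:.2f}, R (validation half) = {r_valid:.2f}")
```

A validation R that stays close to the original R, as in the values reported above, suggests the model is not merely fitted to noise in the calculation subsample.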

Sociodemographic and social contextual predictors of multiple health behavior change: data from the Healthy Directions-Small Business study.

PubMed

Harley, Amy E; Sapp, Amy L; Li, Yi; Marino, Miguel; Quintiliani, Lisa M; Sorensen, Glorian

2013-03-01

Multiple modifiable health behaviors contribute to the chronic diseases that are the leading causes of death in the USA. Disparities for meeting recommended health behavior guidelines exist across occupational classes and socioeconomic levels. The purpose of this paper was to investigate sociodemographic and social contextual predictors of multiple health behavior change in a worksite intervention. We analyzed data on four diet and exercise variables from an intervention trial with worksite-level randomization. Eight hundred forty-one employees had complete data from baseline (response rate = 84 %) and follow-up surveys (response rate = 77 %). Multilevel logistic regression estimated associations between least absolute shrinkage and selection operator-selected sociodemographic and social contextual predictor variables and the multiple health behavior change outcome (changing 2+ versus 0 behaviors). Gender, being married/partnered, and perceived discrimination were significantly associated with multiple health behavior change. Sociodemographic and social contextual factors predict multiple health behavior change and could inform the design and delivery of worksite interventions targeting multiple health behaviors.
Deriving the Intrahepatic Arteriovenous Shunt Rate from CT Images and Biochemical Data Instead of from Arterial Perfusion Scintigraphy in Hepatic Arterial Infusion Chemotherapy

DOE Office of Scientific and Technical Information (OSTI.GOV)

Ozaki, Toshiro, E-mail: ganronbun@amail.plala.or.jp; Seki, Hiroshi; Shiina, Makoto

2009-09-15

The purpose of the present study was to elucidate a method for predicting the intrahepatic arteriovenous shunt rate from computed tomography (CT) images and biochemical data, instead of from arterial perfusion scintigraphy, because adverse exacerbated systemic effects may be induced in cases where a high shunt rate exists. CT and arterial perfusion scintigraphy were performed in patients with liver metastases from gastric or colorectal cancer. Biochemical data and tumor marker levels of 33 enrolled patients were measured. The results were statistically verified by multiple regression analysis. The total metastatic hepatic tumor volume (V_metastasized), residual hepatic parenchyma volume (V_residual; calculated from CT images), and biochemical data were treated as independent variables; the intrahepatic arteriovenous (IHAV) shunt rate (calculated from scintigraphy) was treated as a dependent variable. The IHAV shunt rate was 15.1 ± 11.9%. Based on the correlation matrixes, the best correlation coefficient of 0.84 was established between the IHAV shunt rate and V_metastasized (p < 0.01). In the multiple regression analysis with the IHAV shunt rate as the dependent variable, the coefficient of determination (R²) was 0.75, which was significant at the 0.1% level with two significant independent variables (V_metastasized and V_residual). The standardized regression coefficients (β) of V_metastasized and V_residual were significant at the 0.1 and 5% levels, respectively. Based on this result, we can obtain a predicted value of IHAV shunt rate (p < 0.001) using CT images. When a high shunt rate was predicted, beneficial and consistent clinical monitoring can be initiated in, for example, hepatic arterial infusion chemotherapy.
Spatial analysis and land use regression of VOCs and NO(2) from school-based urban air monitoring in Detroit/Dearborn, USA.

PubMed

Mukerjee, Shaibal; Smith, Luther A; Johnson, Mary M; Neas, Lucas M; Stallings, Casson A

2009-08-01

Passive ambient air sampling for nitrogen dioxide (NO(2)) and volatile organic compounds (VOCs) was conducted at 25 school and two compliance sites in Detroit and Dearborn, Michigan, USA during the summer of 2005. Geographic Information System (GIS) data were calculated at each of 116 schools. The 25 selected schools were monitored to assess and model intra-urban gradients of air pollutants to evaluate impact of traffic and urban emissions on pollutant levels. Schools were chosen to be statistically representative of urban land use variables such as distance to major roadways, traffic intensity around the schools, distance to nearest point sources, population density, and distance to nearest border crossing. Two approaches were used to investigate spatial variability. First, Kruskal-Wallis analyses and pairwise comparisons on data from the schools examined coarse spatial differences based on city section and distance from heavily trafficked roads. Secondly, spatial variation on a finer scale and as a response to multiple factors was evaluated through land use regression (LUR) models via multiple linear regression. For weeklong exposures, VOCs did not exhibit spatial variability by city section or distance from major roads; NO(2) was significantly elevated in a section dominated by traffic and industrial influence versus a residential section. Somewhat in contrast to coarse spatial analyses, LUR results revealed spatial gradients in NO(2) and selected VOCs across the area. The process used to select spatially representative sites for air sampling and the results of coarse and fine spatial variability of air pollutants provide insights that may guide future air quality studies in assessing intra-urban gradients.
The Moderating Role of Power Distance on the Relationship between Employee Participation and Outcome Variables.

PubMed

Rafiei, Sima; Pourreza, Abolghasem

2013-06-01

Many organisations have realised the importance of human resources for their competitive advantage. Empowering employees is therefore essential for organisational effectiveness. This study aimed to investigate the relationship between employee participation and outcome variables such as organisational commitment, job satisfaction, perception of justice in an organisation and readiness to accept job responsibilities. It further examined the impact of power distance on the relationship between participation and the four outcome variables. This was a cross-sectional study with a descriptive research design conducted among employees and managers of hospitals affiliated with Tehran University of Medical Sciences, Tehran, Iran. A questionnaire was developed, distributed and collected as the main procedure to gather data. Descriptive statistics, Pearson correlation coefficient and moderated multiple regression were used to analyse the study data. Findings of the study showed that the level of power distance perceived by employees had a significant relationship with employee participation, organisational commitment, job satisfaction, perception of justice and readiness to accept job responsibilities. There was also a significant relationship between employee participation and the four outcome variables. The moderated multiple regression results supported the hypothesis that power distance had a significant effect on the relationship between employee participation and the four outcome variables. Organisations in which employee empowerment is practised through diverse means, such as involving employees in decision making related to their field of work, appear to have more committed and satisfied employees with a positive perception of justice in organisational interactions and readiness to accept job responsibilities.
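Moderated multiple regression, as named in this abstract, is commonly carried out by adding the product of the (mean-centred) predictor and moderator to the model. The sketch below assumes that approach and uses simulated data; the variable names and effect sizes are hypothetical and are not taken from the study.

```python
import numpy as np

rng = np.random.default_rng(2)
n = 200
participation = rng.normal(size=n)
power_distance = rng.normal(size=n)
# Simulated outcome with a built-in negative interaction (an assumption).
commitment = (0.5 * participation - 0.3 * power_distance
              - 0.4 * participation * power_distance
              + rng.normal(0, 0.5, n))

# Centre predictor and moderator before forming the product term.
p_c = participation - participation.mean()
d_c = power_distance - power_distance.mean()
A = np.column_stack([np.ones(n), p_c, d_c, p_c * d_c])
coef, *_ = np.linalg.lstsq(A, commitment, rcond=None)

def simple_slope(moderator_level):
    """Slope of commitment on participation at a given (centred) moderator level."""
    return coef[1] + coef[3] * moderator_level

sd = d_c.std()
slope_low, slope_high = simple_slope(-sd), simple_slope(sd)
print(f"interaction = {coef[3]:.2f}, slope at -1 SD = {slope_low:.2f}, "
      f"slope at +1 SD = {slope_high:.2f}")
```

A non-zero coefficient on the product term means the participation slope differs across levels of power distance, which is what "moderation" refers to here.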
Maternal risk factors predicting child physical characteristics and dysmorphology in fetal alcohol syndrome and partial fetal alcohol syndrome.

PubMed

May, Philip A; Tabachnick, Barbara G; Gossage, J Phillip; Kalberg, Wendy O; Marais, Anna-Susan; Robinson, Luther K; Manning, Melanie; Buckley, David; Hoyme, H Eugene

2011-12-01

Previous research in South Africa revealed very high rates of fetal alcohol syndrome (FAS), of 46-89 per 1000 among young children. Maternal and child data from studies in this community summarize the multiple predictors of FAS and partial fetal alcohol syndrome (PFAS). Sequential regression was employed to examine influences on child physical characteristics and dysmorphology from four categories of maternal traits: physical, demographic, childbearing, and drinking. Then, a structural equation model (SEM) was constructed to predict influences on child physical characteristics. Individual sequential regressions revealed that maternal drinking measures were the most powerful predictors of a child's physical anomalies (R² = .30, p < .001), followed by maternal demographics (R² = .24, p < .001), maternal physical characteristics (R²=.15, p < .001), and childbearing variables (R² = .06, p < .001). The SEM utilized both individual variables and the four composite categories of maternal traits to predict a set of child physical characteristics, including a total dysmorphology score. As predicted, drinking behavior is a relatively strong predictor of child physical characteristics (β = 0.61, p < .001), even when all other maternal risk variables are included; higher levels of drinking predict child physical anomalies. Overall, the SEM model explains 62% of the variance in child physical anomalies. As expected, drinking variables explain the most variance. But this highly controlled estimation of multiple effects also reveals a significant contribution played by maternal demographics and, to a lesser degree, maternal physical and childbearing variables. Copyright © 2011 Elsevier Ireland Ltd. All rights reserved.
Female Literacy Rate is a Better Predictor of Birth Rate and Infant Mortality Rate in India.

PubMed

Saurabh, Suman; Sarkar, Sonali; Pandey, Dhruv K

2013-01-01

Educated women are known to take informed reproductive and healthcare decisions. These result in population stabilization and better infant care reflected by lower birth rates and infant mortality rates (IMRs), respectively. Our objective was to study the relationship of male and female literacy rates with crude birth rates (CBRs) and IMRs of the states and union territories (UTs) of India. The data were analyzed using linear regression. CBR and IMR were taken as the dependent variables; while the overall literacy rates, male, and female literacy rates were the independent variables. CBRs were inversely related to literacy rates (slope parameter = -0.402, P < 0.001). On multiple linear regression with male and female literacy rates, a significant inverse relationship emerged between female literacy rate and CBR (slope = -0.363, P < 0.001), while male literacy rate was not significantly related to CBR (P = 0.674). IMR of the states were also inversely related to their literacy rates (slope = -1.254, P < 0.001). Multiple linear regression revealed a significant inverse relationship between IMR and female literacy (slope = -0.816, P = 0.031), whereas male literacy rate was not significantly related (P = 0.630). Female literacy is relatively highly important for both population stabilization and better infant health.
Birthweight Related Factors in Northwestern Iran: Using Quantile Regression Method.

PubMed

Fallah, Ramazan; Kazemnejad, Anoshirvan; Zayeri, Farid; Shoghli, Alireza

2015-11-18

Birthweight is one of the most important predicting indicators of the health status in adulthood. Having a balanced birthweight is one of the priorities of the health system in most of the industrial and developed countries. This indicator is used to assess the growth and health status of the infants. The aim of this study was to assess the birthweight of the neonates by using quantile regression in Zanjan province. This analytical descriptive study was carried out using pre-registered (March 2010 - March 2012) data of neonates in urban/rural health centers of Zanjan province using multiple-stage cluster sampling. Data were analyzed using multiple linear regression and the quantile regression method with SAS 9.2 statistical software. Of 8456 newborn babies, 4146 (49%) were female. The mean age of the mothers was 27.1±5.4 years. The mean birthweight of the neonates was 3104 ± 431 grams. Five hundred and seventy-three (6.8%) of the neonates weighed less than 2500 grams. In all quantiles, gestational age of neonates (p<0.05), weight and educational level of the mothers (p<0.05) showed a linear significant relationship with the birthweight of the neonates. However, sex and birth rank of the neonates, mother's age, place of residence (urban/rural) and career were not significant in all quantiles (p>0.05). This study revealed the results of multiple linear regression and quantile regression were not identical. We strictly recommend the use of quantile regression when an asymmetric response variable or data with outliers is available.
Birthweight Related Factors in Northwestern Iran: Using Quantile Regression Method

PubMed Central

Fallah, Ramazan; Kazemnejad, Anoshirvan; Zayeri, Farid; Shoghli, Alireza

2016-01-01

Introduction: Birthweight is one of the most important predicting indicators of the health status in adulthood. Having a balanced birthweight is one of the priorities of the health system in most of the industrial and developed countries. This indicator is used to assess the growth and health status of the infants. The aim of this study was to assess the birthweight of the neonates by using quantile regression in Zanjan province. Methods: This analytical descriptive study was carried out using pre-registered (March 2010 - March 2012) data of neonates in urban/rural health centers of Zanjan province using multiple-stage cluster sampling. Data were analyzed using multiple linear regression and the quantile regression method with SAS 9.2 statistical software. Results: Of 8456 newborn babies, 4146 (49%) were female. The mean age of the mothers was 27.1±5.4 years. The mean birthweight of the neonates was 3104 ± 431 grams. Five hundred and seventy-three (6.8%) of the neonates weighed less than 2500 grams. In all quantiles, gestational age of neonates (p<0.05), weight and educational level of the mothers (p<0.05) showed a linear significant relationship with the birthweight of the neonates. However, sex and birth rank of the neonates, mother's age, place of residence (urban/rural) and career were not significant in all quantiles (p>0.05). Conclusion: This study revealed the results of multiple linear regression and quantile regression were not identical. We strictly recommend the use of quantile regression when an asymmetric response variable or data with outliers is available. PMID:26925889
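The recommendation above rests on the loss function that quantile regression minimizes. As a minimal sketch, the intercept-only case below shows that minimizing the check (pinball) loss recovers a sample quantile rather than the mean. The birthweights are simulated, the outlier shift is hypothetical, and the study itself used SAS 9.2 rather than this code.

```python
import numpy as np

def pinball_loss(y, c, tau):
    """Mean check (pinball) loss of predicting the constant c at quantile level tau."""
    diff = y - c
    return np.mean(np.maximum(tau * diff, (tau - 1.0) * diff))

rng = np.random.default_rng(3)
weights = rng.normal(3100, 430, 101)   # simulated birthweights in grams
weights[:5] -= 2000.0                  # hypothetical very low birthweights (outliers)

# Search the observed values for the constant that minimizes the loss.
best_median = min(weights, key=lambda c: pinball_loss(weights, c, 0.5))
best_q10 = min(weights, key=lambda c: pinball_loss(weights, c, 0.1))

print(f"mean = {weights.mean():.0f} g, tau=0.5 minimizer = {best_median:.0f} g, "
      f"tau=0.1 minimizer = {best_q10:.0f} g")
```

Full quantile regression replaces the constant with a linear function of the predictors and minimizes the same loss, so each quantile level gets its own set of coefficients.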
Argentina wheat yield model

NASA Technical Reports Server (NTRS)

Callis, S. L.; Sakamoto, C.

1984-01-01

Five models based on multiple regression were developed to estimate wheat yields for the five wheat growing provinces of Argentina. Meteorological data sets were obtained for each province by averaging data for stations within each province. Predictor variables for the models were derived from monthly total precipitation, average monthly mean temperature, and average monthly maximum temperature. Buenos Aires was the only province for which a trend variable was included because of increasing trend in yield due to technology from 1950 to 1963.
Argentina corn yield model

NASA Technical Reports Server (NTRS)

Callis, S. L.; Sakamoto, C.

1984-01-01

A model based on multiple regression was developed to estimate corn yields for the country of Argentina. A meteorological data set was obtained for the country by averaging data for stations within the corn-growing area. Predictor variables for the model were derived from monthly total precipitation, average monthly mean temperature, and average monthly maximum temperature. A trend variable was included for the years 1965 to 1980 since an increasing trend in yields due to technology was observed between these years.
Influence of age on the correlations of hematological and biochemical variables with the stability of erythrocyte membrane in relation to sodium dodecyl sulfate.

PubMed

de Freitas, Mariana V; Marquez-Bernardes, Liandra F; de Arvelos, Letícia R; Paraíso, Lara F; Gonçalves E Oliveira, Ana Flávia M; Mascarenhas Netto, Rita de C; Neto, Morun Bernardino; Garrote-Filho, Mario S; de Souza, Paulo César A; Penha-Silva, Nilson

2014-10-01

The aim of this study was to evaluate the influence of age on the relationships between biochemical and hematological variables and the stability of the erythrocyte membrane in relation to sodium dodecyl sulfate (SDS) in a population of 105 female volunteers between 20 and 90 years of age. The stability of the RBC membrane was determined by non-linear regression of the dependency of the absorbance of hemoglobin released as a function of SDS concentration, represented by the half-transition point of the curve (D50) and the variation in the concentration of the detergent to promote lysis (dD). There was an age-dependent increase in the membrane stability in relation to SDS. Analyses by multiple linear regression showed that this stability increase is significantly related to the hematological variable red cell distribution width (RDW) and the biochemical variables blood albumin and cholesterol. The positive association between erythrocyte stability and RDW may reflect one possible mechanism involved in the clinical meaning of this hematological index.
Relationship of negative self-schemas and attachment styles with appearance schemas.

PubMed

Ledoux, Tracey; Winterowd, Carrie; Richardson, Tamara; Clark, Julie Dorton

2010-06-01

The purpose was to test, among women, the relationship between negative self-schemas and styles of attachment with men and women and two types of appearance investment (Self-evaluative and Motivational Salience). Predominantly Caucasian undergraduate women (N=194) completed a modified version of the Relationship Questionnaire, the Young Schema Questionnaire-Short Form, and the Appearance Schemas Inventory-Revised. Linear multiple regression analyses were conducted with Motivational Salience and Self-evaluative Salience of appearance serving as dependent variables and relevant demographic variables, negative self-schemas, and styles of attachment to men serving as independent variables. Styles of attachment to women were not entered into these regression models because Pearson correlations indicated they were not related to either dependent variable. Self-evaluative Salience of appearance was related to impaired autonomy and performance negative self-schema and the preoccupation style of attachment with men, while Motivational Salience of appearance was related only to the preoccupation style of attachment with men. 2010 Elsevier Ltd. All rights reserved.
The interaction between stratospheric monthly mean regional winds and sporadic-E

NASA Astrophysics Data System (ADS)

Çetin, Kenan; Özcan, Osman; Korlaelçi, Serhat

2017-03-01

In the present study, a statistical investigation is carried out to explore whether there is a relationship between the critical frequency (foEs) of the sporadic-E layer that is occasionally seen in the E region of the ionosphere and the quasi-biennial oscillation (QBO) that flows in the east-west direction in the equatorial stratosphere. A multiple regression model was used as a statistical tool to determine the relationship between variables. In this model, the stationarity of the variables (foEs and QBO) was first analyzed for each station (Cocos Island, Gibilmanna, Niue Island, and Tahiti). Then, a co-integration test was made to determine the existence of a long-term relationship between QBO and foEs. After verifying the presence of a long-term relationship between the variables, the magnitude of the relationship between variables was further determined using the multiple regression model. As a result, it is concluded that the variations in foEs were explainable with QBO measured at 10 hPa altitude at the rate of 69%, 94%, 79%, and 58% for Cocos Island, Gibilmanna, Niue Island, and Tahiti stations, respectively. It is observed that the variations in foEs were explainable with QBO measured at 70 hPa altitude at the rate of 66%, 69%, 53%, and 47% for Cocos Island, Gibilmanna, Niue Island, and Tahiti stations, respectively.
Personal growth, symptoms, and uncertainty in community-residing adults with heart failure.

PubMed

Overbaugh, Kristen J; Parshall, Mark B

Personal growth has not been studied extensively in heart failure (HF). To characterize personal growth in HF and its relationships with symptom burden, uncertainty, and demographic and clinical factors. Associations among personal growth, uncertainty, symptom burden, and clinical and demographic variables were examined in adult outpatients with HF using bivariate correlations and multiple regressions. Participants (N = 103; 76% male, mean age = 74 years, 97% New York Heart Association classes II and III) reported moderate levels of personal growth, uncertainty, and symptom burden. Personal growth was weakly correlated with age and symptom burden but not with other study variables. In a regression model, age, sex, ethnicity, disease severity, time since diagnosis, symptom burden, and uncertainty were not significant independent correlates of personal growth. Community-residing patients with HF report moderate personal growth that is not explained by uncertainty, symptom burden, or demographic and clinical variables. Copyright © 2016 Elsevier Inc. All rights reserved.
Relationships between Lifestyle, Living Environments, and Incidence of Hypertension in Japan (in Men): Based on Participant’s Data from the Nationwide Medical Check-Up

PubMed Central

Oka, Mayumi; Yamamoto, Mio; Mure, Kanae; Takeshita, Tatsuya; Arita, Mikio

2016-01-01

This study aims to investigate factors that contribute to the differences in incidence of hypertension between different regions in Japan, by accounting for not only individual lifestyles, but also their living environments. The target participants of this survey were individuals who received medical treatment for hypertension, as well as hypertension patients who have not received any treatment. The objective variable for analysis was the incidence of hypertension as data aggregated per prefecture. We used data (in men) including obesity, salt intake, vegetable intake, habitual alcohol consumption, habitual smoking, and number of steps walked per day. The variables within living environment included number of rail stations, standard/light vehicle usage, and slope of habitable land. In addition, we analyzed data for the variables related to medical environment, including participation rate in medical check-ups and number of hospitals. We performed multiple stepwise regression analyses to elucidate the correlation of these variables by using hypertension incidence as the objective variable. Hypertension incidence showed a significant negative correlation with walking and medical check-ups, and a significant positive correlation with light-vehicle usage and slope. Between the number of steps and variables related to the living environment, number of rail stations showed a significant positive correlation, while standard- and light-vehicle usage showed a significant negative correlation. Moreover, with stepwise multiple regression analysis, walking showed the strongest effect. The differences in daily walking based on living environment were associated with the disparities in the hypertension incidence in Japan. PMID:27788198
Predicting punching acceleration from selected strength and power variables in elite karate athletes: a multiple regression analysis.

PubMed

Loturco, Irineu; Artioli, Guilherme Giannini; Kobal, Ronaldo; Gil, Saulo; Franchini, Emerson

2014-07-01

This study investigated the relationship between punching acceleration and selected strength and power variables in 19 professional karate athletes from the Brazilian National Team (9 men and 10 women; age, 23 ± 3 years; height, 1.71 ± 0.09 m; and body mass [BM], 67.34 ± 13.44 kg). Punching acceleration was assessed under 4 different conditions in a randomized order: (a) fixed distance aiming to attain maximum speed (FS), (b) fixed distance aiming to attain maximum impact (FI), (c) self-selected distance aiming to attain maximum speed, and (d) self-selected distance aiming to attain maximum impact. The selected strength and power variables were as follows: maximal dynamic strength in bench press and squat-machine, squat and countermovement jump height, mean propulsive power in bench throw and jump squat, and mean propulsive velocity in jump squat with 40% of BM. Upper- and lower-body power and maximal dynamic strength variables were positively correlated to punch acceleration in all conditions. Multiple regression analysis also revealed predictive variables: relative mean propulsive power in squat jump (W·kg-1), and maximal dynamic strength 1 repetition maximum in both bench press and squat-machine exercises. An impact-oriented instruction and a self-selected distance to start the movement seem to be crucial to reach the highest acceleration during punching execution. This investigation, while demonstrating strong correlations between punching acceleration and strength-power variables, also provides important information for coaches, especially for designing better training strategies to improve punching speed.
Multiple emotions: a person-centered approach to the relationship between intergroup emotion and action orientation.

PubMed

Fernando, Julian W; Kashima, Yoshihisa; Laham, Simon M

2014-08-01

Although a great deal of research has investigated the relationship between emotions and action orientations, most studies to date have used variable-centered techniques to identify the best emotion predictor(s) of a particular action. Given that people frequently report multiple or blended emotions, a profitable area of research may be to adopt person-centered approaches to examine the action orientations elicited by a particular combination of emotions or "emotion profile." In two studies, across instances of intergroup inequality in Australia and Canada, we examined participants' experiences of six intergroup emotions: sympathy, anger directed at three targets, shame, and pride. In both studies, five groups of participants with similar emotion profiles were identified by cluster analysis and their action orientations were compared; clusters indicated that the majority of participants experienced multiple emotions. Each action orientation was also regressed on the six emotions. There were a number of differences in the results obtained from the person-centered and variable-centered approaches. This was most apparent for sympathy: the group of participants experiencing only sympathy showed little inclination to perform prosocial actions, yet sympathy was a significant predictor of numerous action orientations in regression analyses. These results imply that sympathy may only prompt a desire for action when experienced in combination with other emotions. We suggest that the use of person-centered and variable-centered approaches as complementary analytic strategies may enrich research into not only the affective predictors of action, but emotion research in general.
Estimation of premorbid general fluid intelligence using traditional Chinese reading performance in Taiwanese samples.

PubMed

Chen, Ying-Jen; Ho, Meng-Yang; Chen, Kwan-Ju; Hsu, Chia-Fen; Ryu, Shan-Jin

2009-08-01

The aims of the present study were to (i) investigate if traditional Chinese word reading ability can be used for estimating premorbid general intelligence; and (ii) to provide multiple regression equations for estimating premorbid performance on Raven's Standard Progressive Matrices (RSPM), using age, years of education and Chinese Graded Word Reading Test (CGWRT) scores as predictor variables. Four hundred and twenty-six healthy volunteers (201 male, 225 female), aged 16-93 years (mean +/- SD, 41.92 +/- 18.19 years) undertook the tests individually under supervised conditions. Seventy percent of subjects were randomly allocated to the derivation group (n = 296), and the rest to the validation group (n = 130). RSPM score was positively correlated with CGWRT score and years of education. RSPM and CGWRT scores and years of education were also inversely correlated with age, but the declining trend for RSPM performance against age was steeper than that for CGWRT performance. Separate multiple regression equations were derived for estimating RSPM scores using different combinations of age, years of education, and CGWRT score for both groups. The multiple regression coefficient of each equation ranged from 0.71 to 0.80 with the standard error of estimate between 7 and 8 RSPM points. When fitting the data of one group to the equations derived from its counterpart group, the cross-validation multiple regression coefficients ranged from 0.71 to 0.79. There were no significant differences in the 'predicted-obtained' RSPM discrepancies between any equations. The regression equations derived in the present study may provide a basis for estimating premorbid RSPM performance.
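The derivation/validation design described in the abstract above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names, the fixed seed, and the rounding rule are hypothetical and are not taken from the study, and a plain 70% cut of 426 subjects gives 298/128 rather than the 296/130 split the authors report. The cross-validation coefficient is taken here to be the Pearson correlation between predicted and obtained scores.

```python
import random


def split_derivation_validation(ids, derivation_fraction=0.7, seed=0):
    """Randomly allocate subject ids to a derivation and a validation group."""
    rng = random.Random(seed)  # fixed seed so the split is reproducible
    shuffled = list(ids)
    rng.shuffle(shuffled)
    cut = round(len(shuffled) * derivation_fraction)
    return shuffled[:cut], shuffled[cut:]


def pearson_r(xs, ys):
    """Pearson correlation between two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    return sxy / (sxx * syy) ** 0.5


# 426 hypothetical subject ids, matching only the sample size in the abstract.
derivation, validation = split_derivation_validation(range(426))
print(len(derivation), len(validation))
```

To cross-validate, an equation fitted on the derivation group would be applied to the validation group, and `pearson_r(predicted, obtained)` would then give the cross-validation coefficient.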
Overall Preference of Running Shoes Can Be Predicted by Suitable Perception Factors Using a Multiple Regression Model.

PubMed

Tay, Cheryl Sihui; Sterzing, Thorsten; Lim, Chen Yen; Ding, Rui; Kong, Pui Wah

2017-05-01

This study examined (a) the strength of four individual footwear perception factors to influence the overall preference of running shoes and (b) whether these perception factors satisfied the nonmulticollinear assumption in a regression model. Running footwear must fulfill multiple functional criteria to satisfy its potential users. Footwear perception factors, such as fit and cushioning, are commonly used to guide shoe design and development, but it is unclear whether running-footwear users are able to differentiate one factor from another. One hundred casual runners assessed four running shoes on a 15-cm visual analogue scale for four footwear perception factors (fit, cushioning, arch support, and stability) as well as for overall preference during a treadmill running protocol. Diagnostic tests showed an absence of multicollinearity between factors, where values for tolerance ranged from .36 to .72, corresponding to variance inflation factors of 2.8 to 1.4. The multiple regression model of these four footwear perception variables accounted for 77.7% to 81.6% of variance in overall preference, with each factor explaining a unique part of the total variance. Casual runners were able to rate each footwear perception factor separately, thus assigning each factor a true potential to improve overall preference for the users. The results also support the use of a multiple regression model of footwear perception factors to predict overall running shoe preference. Regression modeling is a useful tool for running-shoe manufacturers to more precisely evaluate how individual factors contribute to the subjective assessment of running footwear.
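The tolerance and variance inflation factor (VIF) figures quoted in the abstract above are reciprocals of one another: tolerance is 1 − R² from regressing one predictor on the remaining predictors, and VIF = 1 / tolerance. A minimal Python sketch of that arithmetic follows; the function name is illustrative and does not come from the study.

```python
def vif_from_tolerance(tolerance: float) -> float:
    """Return the variance inflation factor for a given tolerance.

    Tolerance is 1 - R^2 from regressing one predictor on the others;
    the VIF is its reciprocal.
    """
    if not 0.0 < tolerance <= 1.0:
        raise ValueError("tolerance must lie in (0, 1]")
    return 1.0 / tolerance


# Tolerance range reported in the abstract: .36 to .72
for tol in (0.36, 0.72):
    print(f"tolerance={tol:.2f} -> VIF={vif_from_tolerance(tol):.1f}")
```

Run as written, this reproduces the reported VIF range of 2.8 to 1.4.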
Guidelines and Procedures for Computing Time-Series Suspended-Sediment Concentrations and Loads from In-Stream Turbidity-Sensor and Streamflow Data

USGS Publications Warehouse

Rasmussen, Patrick P.; Gray, John R.; Glysson, G. Douglas; Ziegler, Andrew C.

2009-01-01

In-stream continuous turbidity and streamflow data, calibrated with measured suspended-sediment concentration data, can be used to compute a time series of suspended-sediment concentration and load at a stream site. Development of a simple linear (ordinary least squares) regression model for computing suspended-sediment concentrations from instantaneous turbidity data is the first step in the computation process. If the model standard percentage error (MSPE) of the simple linear regression model meets a minimum criterion, this model should be used to compute a time series of suspended-sediment concentrations. Otherwise, a multiple linear regression model using paired instantaneous turbidity and streamflow data is developed and compared to the simple regression model. If the inclusion of the streamflow variable proves to be statistically significant and the uncertainty associated with the multiple regression model results in an improvement over that for the simple linear model, the turbidity-streamflow multiple linear regression model should be used to compute a suspended-sediment concentration time series. The computed concentration time series is subsequently used with its paired streamflow time series to compute suspended-sediment loads by standard U.S. Geological Survey techniques. Once an acceptable regression model is developed, it can be used to compute suspended-sediment concentration beyond the period of record used in model development with proper ongoing collection and analysis of calibration samples. Regression models to compute suspended-sediment concentrations are generally site specific and should never be considered static, but they represent a set period in a continually dynamic system in which additional data will help verify any change in sediment load, type, and source.
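The model-selection steps in the abstract above can be written as a small decision function. The sketch below is only an illustration: the function and parameter names are hypothetical, the abstract does not state the numeric MSPE criterion or the significance level (both are left as inputs), "meets the criterion" is assumed to mean an MSPE at or below the threshold, and "an improvement in uncertainty" is assumed to mean a lower MSPE for the multiple regression model. Falling back to the simple model when the multiple model does not qualify is likewise an assumption.

```python
def choose_ssc_model(simple_mspe, mspe_criterion, streamflow_p_value,
                     multiple_mspe, alpha=0.05):
    """Pick 'simple' (turbidity only) or 'multiple' (turbidity + streamflow).

    All thresholds are caller-supplied assumptions, not published values.
    """
    # Step 1: keep the turbidity-only model if its MSPE meets the criterion.
    if simple_mspe <= mspe_criterion:
        return "simple"
    # Step 2: otherwise consider the turbidity-streamflow model, which must
    # have a statistically significant streamflow term and lower uncertainty.
    streamflow_significant = streamflow_p_value < alpha
    uncertainty_improved = multiple_mspe < simple_mspe
    if streamflow_significant and uncertainty_improved:
        return "multiple"
    # Assumed fallback: retain the simple model.
    return "simple"


# Hypothetical numbers for illustration only.
print(choose_ssc_model(simple_mspe=35.0, mspe_criterion=20.0,
                       streamflow_p_value=0.01, multiple_mspe=25.0))
```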

Prevalence of vitamin D deficiency and associated factors in women and newborns in the immediate postpartum period

PubMed Central

do Prado, Mara Rúbia Maciel Cardoso; Oliveira, Fabiana de Cássia Carvalho; Assis, Karine Franklin; Ribeiro, Sarah Aparecida Vieira; do Prado, Pedro Paulo; Sant'Ana, Luciana Ferreira da Rocha; Priore, Silvia Eloiza; Franceschini, Sylvia do Carmo Castro

2015-01-01

Objective: To assess the prevalence of vitamin D deficiency and its associated factors in women and their newborns in the postpartum period. Methods: This cross-sectional study evaluated vitamin D deficiency/insufficiency in 226 women and their newborns in Viçosa (Minas Gerais, BR) between December 2011 and November 2012. Cord blood and venous maternal blood were collected to evaluate the following biochemical parameters: vitamin D, alkaline phosphatase, calcium, phosphorus and parathyroid hormone. Poisson regression analysis, with a confidence interval of 95%, was applied to assess vitamin D deficiency and its associated factors. Multiple linear regression analysis was performed to identify factors associated with 25(OH)D deficiency in the newborns and women from the study. The criterion for variable inclusion in the multiple linear regression model was the association with the dependent variable in the simple linear regression analysis, considering p<0.20. The significance level was α <5%. Results: Of the 226 women included, 200 (88.5%) were 20-44 years old; the median age was 28 years. Deficient/insufficient levels of vitamin D were found in 192 (85%) women and in 182 (80.5%) neonates. The maternal 25(OH)D and alkaline phosphatase levels were independently associated with vitamin D deficiency in infants. Conclusions: This study identified a high prevalence of vitamin D deficiency and insufficiency in women and newborns and the association between maternal nutritional status of vitamin D and their infants' vitamin D status. PMID:26100593
The 11-year solar cycle in current reanalyses: a (non)linear attribution study of the middle atmosphere

NASA Astrophysics Data System (ADS)

Kuchar, A.; Sacha, P.; Miksovsky, J.; Pisoft, P.

2015-06-01

This study focusses on the variability of temperature, ozone and circulation characteristics in the stratosphere and lower mesosphere with regard to the influence of the 11-year solar cycle. It is based on attribution analysis using multiple nonlinear techniques (support vector regression, neural networks) besides the multiple linear regression approach. The analysis was applied to several current reanalysis data sets for the 1979-2013 period, including MERRA, ERA-Interim and JRA-55, with the aim to compare how these types of data resolve especially the double-peaked solar response in temperature and ozone variables and the consequent changes induced by these anomalies. Equatorial temperature signals in the tropical stratosphere were found to be in qualitative agreement with previous attribution studies, although the agreement with observational results was incomplete, especially for JRA-55. The analysis also pointed to the solar signal in the ozone data sets (i.e. MERRA and ERA-Interim) not being consistent with the observed double-peaked ozone anomaly extracted from satellite measurements. The results obtained by linear regression were confirmed by the nonlinear approach through all data sets, suggesting that linear regression is a relevant tool to sufficiently resolve the solar signal in the middle atmosphere. The seasonal evolution of the solar response was also discussed in terms of dynamical causalities in the winter hemispheres. The hypothetical mechanism of a weaker Brewer-Dobson circulation at solar maxima was reviewed together with a discussion of polar vortex behaviour.
Hydraulic geometry and streamflow of channels in the Piceance Basin, Rio Blanco and Garfield counties, Colorado

USGS Publications Warehouse

Elliott, J.G.; Cartier, K.D.

1986-01-01

The influence of streamflow and basin characteristics on channel geometry was investigated at 18 perennial and ephemeral stream reaches in the Piceance basin of northwestern Colorado. Results of stepwise multiple regression analyses indicated that the variabilities of mean bankfull depth (D) and bankfull cross-sectional flow area (Af) were predominantly a function of bankfull discharge (QB), and that most of the variability in channel slopes (S) could be explained by drainage area (DA). None of the independent variables selected for the study could account for a large part of the variability in bankfull channel width (W). (USGS)
Advanced glycation end products and antioxidant status in type 2 diabetic patients with and without peripheral artery disease.

PubMed

Lapolla, Annunziata; Piarulli, Francesco; Sartore, Giovanni; Ceriello, Antonio; Ragazzi, Eugenio; Reitano, Rachele; Baccarin, Lorenzo; Laverda, Barbara; Fedele, Domenico

2007-03-01

Advanced glycation end products (AGEs), pentosidine and malondialdehyde (MDA), are elevated in type 2 diabetic subjects with coronary and carotid angiopathy. We investigated the relationship of AGEs, MDA, total reactive antioxidant potentials (TRAPs), and vitamin E in type 2 diabetic patients with and without peripheral artery disease (PAD). AGEs, pentosidine, MDA, TRAP, vitamin E, and ankle-brachial index (ABI) were measured in 99 consecutive type 2 diabetic subjects and 20 control subjects. AGEs, pentosidine, and MDA were higher and vitamin E and TRAP were lower in patients with PAD (ABI <0.9) than in patients without PAD (ABI >0.9) (P < 0.001). After multiple regression analysis, a correlation between AGEs and pentosidine, as independent variables, and ABI, as the dependent variable, was found in both patients with and without PAD (r = 0.9198, P < 0.001 and r = 0.5764, P < 0.001, respectively) but not in control subjects. When individual regression coefficients were evaluated, only that due to pentosidine was confirmed as significant. For patients with PAD, considering TRAP, vitamin E, and MDA as independent variables and ABI as the dependent variable produced an overall significant regression (r = 0.6913, P < 0.001). The regression coefficients for TRAP and vitamin E were not significant, indicating that the model is best explained by a single linear regression between MDA and ABI. These findings were also confirmed by principal component analysis. Results show that pentosidine and MDA are strongly associated with PAD in type 2 diabetic patients.
Women's perceptions of their male batterers' characteristics and level of violence.

PubMed

Torres, Sara; Han, Hae-Ra

2003-01-01

This article describes the characteristics of male perpetrators of domestic violence and their relationship to the level of violence. The data about the male partners obtained from 151 battered women were used for this analysis. Using multiple regression, demographic variables and three behavioral indicators, including use of alcohol before a violent episode, history of arrests, and the generality of violence, were examined together for their relationship with the violence scores. With the level of violence as measured by the Conflict Tactics Scale (CTS) as the dependent variable, demographic variables explained 19.1% of the variability, with the behavioral indicators accounting for an additional 4.6% of the variability. Several research and clinical implications are addressed.
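The two percentages in the abstract above describe a blockwise (hierarchical) entry of predictors: the demographic block is entered first, and the behavioral indicators are credited with the additional variance they explain. Assuming the usual definition of the increment as the difference between the two models' R² values, the total variance explained by the full model follows directly. The subscript labels are illustrative, not the authors' notation.

```latex
\begin{align*}
R^2_{\text{demographic}} &= 0.191, \\
\Delta R^2_{\text{behavioral}} &= R^2_{\text{full}} - R^2_{\text{demographic}} = 0.046, \\
R^2_{\text{full}} &= 0.191 + 0.046 = 0.237 .
\end{align*}
```

Under that reading, the combined model accounts for about 23.7% of the variability in the violence scores.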
Estimating peak discharges, flood volumes, and hydrograph shapes of small ungaged urban streams in Ohio

USGS Publications Warehouse

Sherwood, J.M.

1986-01-01

Methods are presented for estimating peak discharges, flood volumes and hydrograph shapes of small (less than 5 sq mi) urban streams in Ohio. Examples of how to use the various regression equations and estimating techniques also are presented. Multiple-regression equations were developed for estimating peak discharges having recurrence intervals of 2, 5, 10, 25, 50, and 100 years. The significant independent variables affecting peak discharge are drainage area, main-channel slope, average basin-elevation index, and basin-development factor. Standard errors of regression and prediction for the peak discharge equations range from +/-37% to +/-41%. An equation also was developed to estimate the flood volume of a given peak discharge. Peak discharge, drainage area, main-channel slope, and basin-development factor were found to be the significant independent variables affecting flood volumes for given peak discharges. The standard error of regression for the volume equation is +/-52%. A technique is described for estimating the shape of a runoff hydrograph by applying a specific peak discharge and the estimated lagtime to a dimensionless hydrograph. An equation for estimating the lagtime of a basin was developed. Two variables--main-channel length divided by the square root of the main-channel slope and basin-development factor--have a significant effect on basin lagtime. The standard error of regression for the lagtime equation is +/-48%. The data base for the study was established by collecting rainfall-runoff data at 30 basins distributed throughout several metropolitan areas of Ohio. Five to eight years of data were collected at a 5-min record interval. The USGS rainfall-runoff model A634 was calibrated for each site. The calibrated models were used in conjunction with long-term rainfall records to generate a long-term streamflow record for each site. Each annual peak-discharge record was fitted to a Log-Pearson Type III frequency curve. 
Multiple-regression techniques were then used to analyze the peak discharge data as a function of the basin characteristics of the 30 sites. (Author's abstract)
Estimating and Modelling Bias of the Hierarchical Partitioning Public-Domain Software: Implications in Environmental Management and Conservation

PubMed Central

Olea, Pedro P.; Mateo-Tomás, Patricia; de Frutos, Ángel

2010-01-01

Background: Hierarchical partitioning (HP) is an analytical method of multiple regression that identifies the most likely causal factors while alleviating multicollinearity problems. Its use is increasing in ecology and conservation because of its usefulness for complementing multiple regression analysis. A public-domain software “hier.part package” has been developed for running HP in R software. Its authors highlight a “minor rounding error” for hierarchies constructed from >9 variables; however, potential bias from using this module has not yet been examined. Knowing this bias is pivotal because, for example, the ranking obtained in HP is being used as a criterion for establishing priorities of conservation. Methodology/Principal Findings: Using numerical simulations and two real examples, we assessed the robustness of this HP module in relation to the order the variables have in the analysis. Results indicated a considerable effect of the variable order on the amount of independent variance explained by predictors for models with >9 explanatory variables. For these models the nominal ranking of importance of the predictors changed with variable order, i.e. predictors declared important by their contribution in explaining the response variable frequently changed to be either more or less important with other variable orders. The probability of changing position of a variable was best explained by the difference in independent explanatory power between that variable and the previous one in the nominal ranking of importance. The smaller this difference, the more likely the change of position. Conclusions/Significance: HP should be applied with caution when more than 9 explanatory variables are used to establish the ranking of covariate importance. The explained variance is not a useful parameter to use in models with more than 9 independent variables. The inconsistency in the results obtained by HP should be considered in future studies as well as in those already published. Some recommendations to improve the analysis with this HP module are given. PMID:20657734
Suicidal Ideation and Schizophrenia: Contribution of Appraisal, Stigmatization, and Cognition.

PubMed

Stip, Emmanuel; Caron, Jean; Tousignant, Michel; Lecomte, Yves

2017-10-01

To predict suicidal ideation in people with schizophrenia, certain studies have measured its relationship with the variables of defeat and entrapment. The relationships are positive, but their interactions remain undefined. To further their understanding, this research sought to measure the relationship between suicidal ideation with the variables of loss, entrapment, and humiliation. The convenience sample included 30 patients with schizophrenia spectrum disorders. The study was prospective (3 measurement times) during a 6-month period. Results were analyzed by stepwise multiple regression. The contribution of the 3 variables to the variance of suicidal ideation was not significant at any of the 3 times (T1: 16.2%, P = 0.056; T2: 19.9%, P = 0.117; T3: 11.2%, P = 0.109). Further analyses measured the relationship between the variables of stigmatization, perceived cognitive dysfunction, symptoms, depression, self-esteem, reason to live, spirituality, social provision, and suicidal ideation. Stepwise multiple regression demonstrated that the contribution of the variables of stigmatization and perceived cognitive dysfunction to the variance of suicidal ideation was significant at all 3 times (T1: 41.7.5%, P = 0.000; T2: 35.2%, P = 0.001; T3: 21.5%, P = 0.012). Yet, over time, the individual contribution of the variables changed: T1, stigmatization (β = 0.518; P = 0.002); T2, stigmatization (β = 0.394; P = 0.025) and perceived cognitive dysfunction (β = 0.349; P = 0.046). Then, at T3, only perceived cognitive dysfunction contributed significantly to suicidal ideation (β = 0.438; P = 0.016). The results highlight the importance of the contribution of the variables of perceived cognitive dysfunction and stigmatization in the onset of suicidal ideation in people with schizophrenia spectrum disorders.
Suicidal Ideation and Schizophrenia: Contribution of Appraisal, Stigmatization, and Cognition

PubMed Central

Stip, Emmanuel; Caron, Jean; Tousignant, Michel

2017-01-01

Objective: To predict suicidal ideation in people with schizophrenia, certain studies have measured its relationship with the variables of defeat and entrapment. The relationships are positive, but their interactions remain undefined. To further their understanding, this research sought to measure the relationship between suicidal ideation with the variables of loss, entrapment, and humiliation. Method: The convenience sample included 30 patients with schizophrenia spectrum disorders. The study was prospective (3 measurement times) during a 6-month period. Results were analyzed by stepwise multiple regression. Results: The contribution of the 3 variables to the variance of suicidal ideation was not significant at any of the 3 times (T1: 16.2%, P = 0.056; T2: 19.9%, P = 0.117; T3: 11.2%, P = 0.109). Further analyses measured the relationship between the variables of stigmatization, perceived cognitive dysfunction, symptoms, depression, self-esteem, reason to live, spirituality, social provision, and suicidal ideation. Stepwise multiple regression demonstrated that the contribution of the variables of stigmatization and perceived cognitive dysfunction to the variance of suicidal ideation was significant at all 3 times (T1: 41.7.5%, P = 0.000; T2: 35.2%, P = 0.001; T3: 21.5%, P = 0.012). Yet, over time, the individual contribution of the variables changed: T1, stigmatization (β = 0.518; P = 0.002); T2, stigmatization (β = 0.394; P = 0.025) and perceived cognitive dysfunction (β = 0.349; P = 0.046). Then, at T3, only perceived cognitive dysfunction contributed significantly to suicidal ideation (β = 0.438; P = 0.016). Conclusion: The results highlight the importance of the contribution of the variables of perceived cognitive dysfunction and stigmatization in the onset of suicidal ideation in people with schizophrenia spectrum disorders. PMID:28673099
Exploring correlates of diabetes-related stress among adults with Type 1 diabetes in the T1D exchange clinic registry.

PubMed

Boden, Matthew Tyler; Gala, Sasha

2018-04-01

To explore relations between diabetes-related stress and multiple sociodemographic, diabetes health, other health, and treatment-related variables among a large sample of adults with Type 1 Diabetes (T1D). The sample consisted of 10,821 adults (over 18 years old) enrolled in the T1D Exchange Clinic Registry. The T1D Exchange clinic network consists of 67 diabetes clinical centers throughout the United States selected to broadly represent pediatric and adult patients with T1D. Variables were assessed through participant self-report and extraction of clinic chart data. Univariate and multiple linear regression (with simultaneous entry of all predictors) analyses were conducted. Robustly associated with increased diabetes-related stress across analyses were multiple sociodemographic (female [vs. male], native Hawaiian/other Pacific islander [vs. white/Caucasian], decreased age and diabetes duration), diabetes health (higher HbA1c), other health (lower general health, presence of major life stress and depression, less physical activity), and treatment-related variables (use of injections/pen or combination injection/pen/pump [vs. pump], use of CGM, increased frequency of missing insulin doses and BG checking, decreased frequency of BG checking prior to bolus, receipt of mental health treatment). We replicated and extended research demonstrating that diabetes-related stress among people with T1D occurs at higher levels among those with particular sociodemographic characteristics and is associated with a range of poorer diabetes health and other health variables, and multiple treatment-related variables. The strong incremental prediction of diabetes-related stress by multiple variables in our study suggests that a multi-variable, personalized approach may increase the effectiveness of treatments for diabetes-related stress.
Additive effects prevail: The response of biota to multiple stressors in an intensively monitored watershed.

PubMed

Gieswein, Alexander; Hering, Daniel; Feld, Christian K

2017-09-01

Freshwater ecosystems are impacted by a range of stressors arising from diverse human-caused land and water uses. Identifying the relative importance of single stressors and understanding how multiple stressors interact and jointly affect biology is crucial for River Basin Management. This study addressed multiple human-induced stressors and their effects on the aquatic flora and fauna based on data from standard WFD monitoring schemes. For altogether 1095 sites within a mountainous catchment, we used 12 stressor variables covering three different stressor groups: riparian land use, physical habitat quality and nutrient enrichment. Twenty-one biological metrics calculated from taxa lists of three organism groups (fish, benthic invertebrates and aquatic macrophytes) served as response variables. Stressor and response variables were subjected to Boosted Regression Tree (BRT) analysis to identify stressor hierarchy and stressor interactions and subsequently to Generalised Linear Regression Modelling (GLM) to quantify the stressors' standardised effect size. Our results show that riverine habitat degradation was the dominant stressor group for the river fauna, notably the bed physical habitat structure. Overall, the explained variation in benthic invertebrate metrics was higher than it was in fish and macrophyte metrics. In particular, general integrative (aggregate) metrics such as % Ephemeroptera, Plecoptera and Trichoptera (EPT) taxa performed better than ecological traits (e.g. % feeding types). Overall, additive stressor effects dominated, while significant and meaningful stressor interactions were generally rare and weak. We concluded that given the type of stressor and ecological response variables addressed in this study, river basin managers do not need to bother much about complex stressor interactions, but can focus on the prevailing stressors according to the hierarchy identified.
VoxelStats: A MATLAB Package for Multi-Modal Voxel-Wise Brain Image Analysis.

PubMed

Mathotaarachchi, Sulantha; Wang, Seqian; Shin, Monica; Pascoal, Tharick A; Benedet, Andrea L; Kang, Min Su; Beaudry, Thomas; Fonov, Vladimir S; Gauthier, Serge; Labbe, Aurélie; Rosa-Neto, Pedro

2016-01-01

In healthy individuals, behavioral outcomes are highly associated with the variability on brain regional structure or neurochemical phenotypes. Similarly, in the context of neurodegenerative conditions, neuroimaging reveals that cognitive decline is linked to the magnitude of atrophy, neurochemical declines, or concentrations of abnormal protein aggregates across brain regions. However, modeling the effects of multiple regional abnormalities as determinants of cognitive decline at the voxel level remains largely unexplored by multimodal imaging research, given the high computational cost of estimating regression models for every single voxel from various imaging modalities. VoxelStats is a voxel-wise computational framework to overcome these computational limitations and to perform statistical operations on multiple scalar variables and imaging modalities at the voxel level. VoxelStats package has been developed in Matlab(®) and supports imaging formats such as Nifti-1, ANALYZE, and MINC v2. Prebuilt functions in VoxelStats enable the user to perform voxel-wise general and generalized linear models and mixed effect models with multiple volumetric covariates. Importantly, VoxelStats can recognize scalar values or image volumes as response variables and can accommodate volumetric statistical covariates as well as their interaction effects with other variables. Furthermore, this package includes built-in functionality to perform voxel-wise receiver operating characteristic analysis and paired and unpaired group contrast analysis. Validation of VoxelStats was conducted by comparing the linear regression functionality with existing toolboxes such as glim_image and RMINC. The validation results were identical to existing methods and the additional functionality was demonstrated by generating feature case assessments (t-statistics, odds ratio, and true positive rate maps). 
In summary, VoxelStats expands the current methods for multimodal imaging analysis by allowing the estimation of advanced regional association metrics at the voxel level.
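The idea of a voxel-wise regression with a volumetric covariate can be illustrated with a minimal NumPy sketch. This is not the VoxelStats API (which is a Matlab package); the function name `voxelwise_ols`, the variable names, and the synthetic data are all hypothetical and chosen only for illustration. Each voxel gets its own design matrix because one covariate is itself an image.

```python
import numpy as np


def voxelwise_ols(response, scalar_cov, voxel_cov):
    """Fit y = b0 + b1*scalar_cov + b2*voxel_cov separately at each voxel.

    response:   (n_subjects, n_voxels) response images, flattened
    scalar_cov: (n_subjects,) one scalar covariate per subject
    voxel_cov:  (n_subjects, n_voxels) volumetric covariate, flattened
    Returns an array of shape (3, n_voxels) with the fitted coefficients.
    """
    n_subjects, n_voxels = response.shape
    betas = np.empty((3, n_voxels))
    intercept = np.ones(n_subjects)
    for v in range(n_voxels):
        # The design matrix changes per voxel because voxel_cov is an image.
        design = np.column_stack([intercept, scalar_cov, voxel_cov[:, v]])
        betas[:, v] = np.linalg.lstsq(design, response[:, v], rcond=None)[0]
    return betas


# Synthetic, noise-free data (hypothetical, for illustration only).
rng = np.random.default_rng(0)
age = rng.normal(size=30)
tracer_uptake = rng.normal(size=(30, 5))
cognition = 1.0 + 2.0 * age[:, None] + 3.0 * tracer_uptake
betas = voxelwise_ols(cognition, age, tracer_uptake)
```

The explicit loop keeps the sketch readable; the per-voxel fits are independent, which is why this kind of computation is costly at full image resolution and why it parallelizes well.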
Communicative participation restrictions in multiple sclerosis: associated variables and correlation with social functioning.

PubMed

Yorkston, Kathryn M; Baylor, Carolyn; Amtmann, Dagmar

2014-01-01

Individuals with multiple sclerosis (MS) are at risk for communication problems that may restrict their ability to participate in important life roles such as maintenance of relationships, work, or household management. The aim of this project is to examine selected demographic and symptom-related variables that may contribute to participation restrictions. This examination is intended to aid clinicians in predicting who might be at risk for such restrictions and what variables may be targeted in interventions. Community-dwelling adults with MS (n=216) completed a survey either online or using paper forms. The survey included the 46-item version of the Communicative Participation Item Bank, demographics (age, sex, living situation, employment status, education, and time since onset of diagnosis of MS), and self-reported symptom-related variables (physical activity, emotional problems, fatigue, pain, speech severity, and cognitive/communication skills). In order to identify predictors of restrictions in communicative participation, these variables were entered into a backwards stepwise multiple linear regression analysis. Five variables (cognitive/communication skills, speech severity, speech usage, physical activity, and education) were statistically significant predictors of communication participation. In order to examine the relationship of communicative participation and social role variables, bivariate Spearman correlations were conducted. Results suggest only a fair to moderate relationship between communicative participation and measures of social roles. Communicative participation is a complex construct associated with a number of self-reported variables. Clinicians should be alert to risk factors for reduced communicative participation including reduced cognitive and speech skills, lower levels of speech usage, limitations in physical activities and higher levels of education.
The reader will be able to: (a) describe the factors that may restrict participation in individuals with multiple sclerosis; (b) list measures of social functioning that may be pertinent in adults with multiple sclerosis; (c) discuss factors that can be used to predict communicative participation in multiple sclerosis. Copyright © 2014 Elsevier Inc. All rights reserved.
Impact of Depression, Fatigue, and Global Measure of Cortical Volume on Cognitive Impairment in Multiple Sclerosis

PubMed Central

De Cola, Maria Cristina; D'Aleo, Giangaetano; Sessa, Edoardo; Marino, Silvia

2015-01-01

Objective. To investigate the influence of demographic and clinical variables, such as depression, fatigue, and a quantitative MRI marker, on cognitive performance in a sample of patients affected by multiple sclerosis (MS). Methods. 60 MS patients (52 relapsing remitting and 8 primary progressive) underwent neuropsychological assessments using Rao's Brief Repeatable Battery of Neuropsychological Tests (BRB-N), the Beck Depression Inventory-second edition (BDI-II), and the Fatigue Severity Scale (FSS). We performed magnetic resonance imaging on all subjects using a 3 T scanner and obtained tissue-specific volumes (normalized brain volume and cortical brain volume). We used Student's t-test to compare depressed and nondepressed MS patients. Finally, we performed a multivariate regression analysis in order to assess possible predictors of patients' cognitive outcome among demographic and clinical variables. Results. 27.12% of the sample (16/59) was cognitively impaired, especially in tasks requiring attention and information processing speed. In the between-group comparison, we found that depressed patients had worse performance on the BRB-N score, greater disability and disease duration, and decreased brain volume. According to multiple regression analysis, the BDI-II score was a significant predictor for most of the neuropsychological tests. Conclusions. Our findings suggest that the presence of depressive symptoms is an important determinant of cognitive performance in MS patients. PMID:25861633
Depoliticizing Minority Admissions through Predicted Graduation Equations. AIR Forum 1982 Paper.

ERIC Educational Resources Information Center

Sanford, Timothy R.

The way that the University of North Carolina, Chapel Hill, has tried to depoliticize minority admissions through the use of predicted graduation equations that are race specific is examined. Multiple regression and discriminant analyses were used with nine independent variables (primarily academic) to predict graduation status of 1974 entering…
Determination of Habitat Requirements For Birds in Suburban Areas

Treesearch

Jack Ward Thomas; Richard M. DeGraaf; Joseph C. Mawson

1977-01-01

Songbird populations can be related to habitat components by a method that allows the simultaneous determination of habitat requirements for a variety of species. Through correlation and multiple-regression analyses, 10 bird species were studied in a suburban habitat, which was stratified according to human density. Variables used to account for bird distribution...
Predictors of College Readiness: An Analysis of the Student Readiness Inventory

ERIC Educational Resources Information Center

Wilson, James K., III

2012-01-01

The purpose of this study was to better predict how a first semester college freshman becomes prepared for college. The theoretical framework guiding this study is Vroom's expectancy theory, in which motivation plays a key role in success. This study used a hierarchical multiple regression model. The independent variables of interest included high school…
Relationships between Parental Attachment, Work and Family Roles, and Life Satisfaction

ERIC Educational Resources Information Center

Perrone, Kristin M.; Webb, L. Kay; Jackson, Z. Vance

2007-01-01

The purpose of this study was to examine the relationship between parental attachment and satisfaction with work and family roles, as well as the relationship of these variables to life satisfaction. Results from a multiple regression analysis indicated that satisfaction with work and marriage, but not parenting satisfaction or parental…
Pressures, Stresses, Anxieties, and On-Job Safety of the School Superintendent.

ERIC Educational Resources Information Center

Chand, Krishan

Identification of the causes of job stress for public school superintendents, with a focus on personal-experiential and task variables, is the purpose of this study. Methodology involved a mail survey of 1,531 randomly selected superintendents. Canonical correlation analysis (CCA) and multiple regression correlation (MCR) analysis were used to…
Exact Interval Estimation, Power Calculation, and Sample Size Determination in Normal Correlation Analysis

ERIC Educational Resources Information Center

Shieh, Gwowen

2006-01-01

This paper considers the problem of analysis of correlation coefficients from a multivariate normal population. A unified theorem is derived for the regression model with normally distributed explanatory variables and the general results are employed to provide useful expressions for the distributions of simple, multiple, and partial-multiple…

The Prediction of Achievement and Time Spent in Instruction in a Self-Paced Individualized Course.

ERIC Educational Resources Information Center

Franklin, Thomas E.

Multiple linear regressions were employed to determine the relative contributions of cognitive and affective variables accounting for variance in college students' achievement and amount of time taken to complete a self-paced, individualized course. Study habits and attitudes (SSHA) made greater relative contributions to explaining total course…
Novice Teachers' Perceptions of Support, Teacher Preparation Quality, and Student Teaching Experience Related to Teacher Efficacy

ERIC Educational Resources Information Center

Knobloch, Neil A.; Whittington, M. Susie

2002-01-01

This multiple regression study analyzed the percent of variance in teacher efficacy of 106 student teachers and novice teachers in agricultural education in Ohio explained by selected variables related to perceived support (utilizing a mentor, supportive principal behaviors, collective efficacy), teacher preparation quality, and student teaching…
Electronic Resource Expenditure and the Decline in Reference Transaction Statistics in Academic Libraries

ERIC Educational Resources Information Center

Dubnjakovic, Ana

2012-01-01

The current study investigates factors influencing increase in reference transactions in a typical week in academic libraries across the United States of America. Employing multiple regression analysis and general linear modeling, variables of interest from the "Academic Library Survey (ALS) 2006" survey (sample size 3960 academic libraries) were…
The Roles of Attitudinal and Personality Variables in the Prediction of Environmental Behavior and Knowledge

ERIC Educational Resources Information Center

Arbuthnot, Jack

1977-01-01

This study explored the relationships among selected attitudinal and personality characteristics, attitudes toward environmental problems, and environmental knowledge and behavioral commitment of two diverse samples: 85 users of a recycling center and 60 conservative church members. Multiple regression analysis was utilized to determine the best…
Hispanic Community College Students: Acculturation, Family Support, Perceived Educational Barriers, and Vocational Planning

ERIC Educational Resources Information Center

Fiebig, Jennifer Nepper; Braid, Barbara L.; Ross, Patricia A.; Tom, Matthew A.; Prinzo, Cara

2010-01-01

A multiple logistic regression model was used to determine the associations between the role of acculturation, perception of educational barriers, need for family kin support, vocational planning, and expectations for attaining future vocational goals against the demographic variables (gender, age, being the oldest child, the first to attend…
Vitamin D levels and their associations with survival and major disease outcomes in a large cohort of patients with chronic graft-vs-host disease

PubMed Central

Katić, Mašenjka; Pirsl, Filip; Steinberg, Seth M.; Dobbin, Marnie; Curtis, Lauren M.; Pulanić, Dražen; Desnica, Lana; Titarenko, Irina; Pavletic, Steven Z.

2016-01-01

Aim To identify the factors associated with vitamin D status in patients with chronic graft-vs-host disease (cGVHD) and evaluate the association between serum vitamin D (25(OH)D) levels and cGVHD characteristics and clinical outcomes defined by the National Institutes of Health (NIH) criteria. Methods 310 cGVHD patients enrolled in the NIH cGVHD natural history study (clinicaltrials.gov: NCT00092235) were analyzed. Univariate analysis and multiple logistic regression were used to determine the associations between various parameters and 25(OH)D levels, dichotomized into categorical variables: ≤20 and >20 ng/mL, and as a continuous parameter. Multiple logistic regression was used to develop a predictive model for low vitamin D. Survival analysis and association between cGVHD outcomes and 25(OH)D as a continuous as well as categorical variable: ≤20 and >20 ng/mL; <50 and ≥50 ng/mL, and among three ordered categories: ≤20, 20-50, and ≥50 ng/mL, was performed. PMID:27374829
Modeling the dynamics of urban growth using multinomial logistic regression: a case study of Jiayu County, Hubei Province, China

NASA Astrophysics Data System (ADS)

Nong, Yu; Du, Qingyun; Wang, Kun; Miao, Lei; Zhang, Weiwei

2008-10-01

Urban growth modeling, one of the most important aspects of land use and land cover change study, has attracted substantial attention because it helps to comprehend the mechanisms of land use change and thus helps in making relevant policies. This study applied multinomial logistic regression to model urban growth in Jiayu County of Hubei Province, China, to discover the relationship between urban growth and its driving forces, for which biophysical and social-economic factors are selected as independent variables. This type of regression is similar to binary logistic regression, but it is more general because the dependent variable is not restricted to two categories, as it was in previous studies. The multinomial model can simulate the process of multiple land use competition between urban land, bare land, cultivated land and orchard land. Taking the land use type Urban as the reference category, parameters could be estimated with odds ratios. A probability map is generated from the model to predict where urban growth will occur as a result of the computation.
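How a multinomial logit with a reference category turns coefficients into class probabilities and odds ratios can be sketched as follows. The abstract reports no coefficients, so every number, the predictor names, and the function `multinomial_probs` are hypothetical. The reference category (urban) has its linear predictor fixed at zero.

```python
import math


def multinomial_probs(x, coefs, reference="urban"):
    """Class probabilities for a multinomial logit with a reference category.

    x:     list of predictor values for one grid cell
    coefs: dict mapping each non-reference category to (intercept, slopes)
    """
    # Linear predictor of each non-reference category versus the reference.
    etas = {
        category: intercept + sum(b * xi for b, xi in zip(slopes, x))
        for category, (intercept, slopes) in coefs.items()
    }
    denom = 1.0 + sum(math.exp(eta) for eta in etas.values())
    probs = {category: math.exp(eta) / denom for category, eta in etas.items()}
    probs[reference] = 1.0 / denom  # the reference has eta fixed at 0
    return probs


# Hypothetical coefficients: (intercept, [distance_to_road_km, slope_deg]).
coefs = {
    "bare": (0.2, [0.30, 0.05]),
    "cultivated": (1.0, [0.50, -0.10]),
    "orchard": (-0.5, [0.40, 0.08]),
}
probs = multinomial_probs([2.0, 5.0], coefs)

# Odds ratio of "cultivated" versus "urban" per extra km from a road.
odds_ratio = math.exp(coefs["cultivated"][1][0])
```

Evaluating `multinomial_probs` for every grid cell and keeping the probability of the urban class is one way a probability map of the kind described could be produced.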
Hydrology and trout populations of cold-water rivers of Michigan and Wisconsin

USGS Publications Warehouse

Hendrickson, G.E.; Knutilla, R.L.

1974-01-01

Statistical multiple-regression analyses showed significant relationships between trout populations and hydrologic parameters. Parameters showing the higher levels of significance were temperature, hardness of water, percentage of gravel bottom, percentage of bottom vegetation, variability of streamflow, and discharge per unit drainage area. Trout populations increase with lower levels of annual maximum water temperatures, with increase in water hardness, and with increase in percentage of gravel and bottom vegetation. Trout populations also increase with decrease in variability of streamflow, and with increase in discharge per unit drainage area. Most hydrologic parameters were significant when evaluated collectively, but no parameter, by itself, showed a high degree of correlation with trout populations in regression analyses that included all the streams sampled. Regression analyses of stream segments that were restricted to certain limits of hardness, temperature, or percentage of gravel bottom showed improvements in correlation. Analyses of trout populations, in pounds per acre and pounds per mile and hydrologic parameters resulted in regression equations from which trout populations could be estimated with standard errors of 89 and 84 per cent, respectively.
Prediction system of hydroponic plant growth and development using algorithm Fuzzy Mamdani method

NASA Astrophysics Data System (ADS)

Sudana, I. Made; Purnawirawan, Okta; Arief, Ulfa Mediaty

2017-03-01

Hydroponics is a method of farming without soil. One hydroponic plant is watercress (Nasturtium officinale). The development and growth of hydroponic watercress are influenced by nutrient levels, acidity, and temperature. These independent variables can be used as system input variables to predict the level of plant growth and development. The prediction system uses the Mamdani fuzzy method. The system was built to implement a Fuzzy Inference System (FIS) as part of the Fuzzy Logic Toolbox (FLT) using MATLAB R2007b. FIS is a computing system that works on the principle of fuzzy reasoning, which is similar to human reasoning. Basically, FIS consists of four units: a fuzzification unit, a fuzzy logic reasoning unit, a knowledge base unit, and a defuzzification unit. In addition, the effect of the independent variables on plant growth and development can be visualized with the three-dimensional FIS output surface diagram and assessed with statistical tests based on data from the prediction system using the multiple linear regression method, which includes multiple linear regression analysis, the t test, the F test, the coefficient of determination, and predictor contributions, calculated using the SPSS (Statistical Product and Service Solutions) software application.
A rotor optimization using regression analysis

NASA Technical Reports Server (NTRS)

Giansante, N.

1984-01-01

The design and development of helicopter rotors is subject to the many design variables and their interactions that affect rotor operation. Until recently, selection of rotor design variables to achieve specified rotor operational qualities has been a costly, time consuming, repetitive task. For the past several years, Kaman Aerospace Corporation has successfully applied multiple linear regression analysis, coupled with optimization and sensitivity procedures, in the analytical design of rotor systems. It is concluded that approximating equations can be developed rapidly for a multiplicity of objective and constraint functions and optimizations can be performed in a rapid and cost effective manner; the number and/or range of design variables can be increased by expanding the data base and developing approximating functions to reflect the expanded design space; the order of the approximating equations can be expanded easily to improve correlation between analyzer results and the approximating equations; gradients of the approximating equations can be calculated easily and these gradients are smooth functions reducing the risk of numerical problems in the optimization; the use of approximating functions allows the problem to be started easily and rapidly from various initial designs to enhance the probability of finding a global optimum; and the approximating equations are independent of the analysis or optimization codes used.
Application of neural networks to prediction of fish diversity and salmonid production in the Lake Ontario basin

USGS Publications Warehouse

McKenna, James E.

2005-01-01

Diversity and fish productivity are important measures of the health and status of aquatic systems. Being able to predict the values of these indices as a function of environmental variables would be valuable to management. Diversity and productivity have been related to environmental conditions by multiple linear regression and discriminant analysis, but such methods have several shortcomings. In an effort to predict fish species diversity and estimate salmonid production for streams in the eastern basin of Lake Ontario, I constructed neural networks and trained them on a data set containing abiotic information and either fish diversity or juvenile salmonid abundance. Twenty percent of the original data were retained as a test data set and used in the training. The ability to extend these neural networks to conditions throughout the streams was tested with data not involved in the network training. The resulting neural networks were able to predict the number of salmonids with more than 84% accuracy and diversity with more than 73% accuracy, which was far superior to the performance of multiple regression. The networks also identified the environmental variables with the greatest predictive power, namely, those describing water movement, stream size, and water chemistry. Thirteen input variables were used to predict diversity and 17 to predict salmonid abundance.
A New Metric for Land-Atmosphere Coupling Strength: Applications on Observations and Modeling

NASA Astrophysics Data System (ADS)

Tang, Q.; Xie, S.; Zhang, Y.; Phillips, T. J.; Santanello, J. A., Jr.; Cook, D. R.; Riihimaki, L.; Gaustad, K.

2017-12-01

A new metric is proposed to quantify the land-atmosphere (LA) coupling strength and is elaborated by correlating the surface evaporative fraction and impacting land and atmosphere variables (e.g., soil moisture, vegetation, and radiation). Based upon multiple linear regression, this approach simultaneously considers multiple factors and thus represents complex LA coupling mechanisms better than existing single variable metrics. The standardized regression coefficients quantify the relative contributions from individual drivers in a consistent manner, avoiding the potential inconsistency in relative influence of conventional metrics. Moreover, the unique expendable feature of the new method allows us to verify and explore potentially important coupling mechanisms. Our observation-based application of the new metric shows moderate coupling with large spatial variations at the U.S. Southern Great Plains. The relative importance of soil moisture vs. vegetation varies by location. We also show that LA coupling strength is generally underestimated by single variable methods due to their incompleteness. We also apply this new metric to evaluate the representation of LA coupling in the Accelerated Climate Modeling for Energy (ACME) V1 Contiguous United States (CONUS) regionally refined model (RRM). This work is performed under the auspices of the U.S. Department of Energy by Lawrence Livermore National Laboratory under Contract DE-AC52-07NA27344. LLNL-ABS-734201
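Standardized regression coefficients make drivers measured in different units comparable. A minimal sketch, assuming ordinary least squares on z-scored variables: the function name `standardized_coefficients` and the synthetic drivers below are hypothetical and are not the authors' implementation or data.

```python
import numpy as np


def standardized_coefficients(X, y):
    """OLS coefficients after z-scoring every predictor and the response.

    X: (n_samples, n_predictors), y: (n_samples,)
    No intercept is needed because all z-scored variables have mean zero.
    """
    Xz = (X - X.mean(axis=0)) / X.std(axis=0, ddof=1)
    yz = (y - y.mean()) / y.std(ddof=1)
    beta, *_ = np.linalg.lstsq(Xz, yz, rcond=None)
    return beta


# Hypothetical drivers on very different scales (synthetic data).
rng = np.random.default_rng(1)
soil_moisture = rng.uniform(0.1, 0.4, size=200)      # volumetric fraction
vegetation = rng.uniform(0.5, 5.0, size=200)         # leaf area index
radiation = rng.uniform(100.0, 300.0, size=200)      # W/m^2
drivers = np.column_stack([soil_moisture, vegetation, radiation])
evaporative_fraction = (
    1.2 * soil_moisture + 0.05 * vegetation + 0.0005 * radiation
    + rng.normal(scale=0.02, size=200)
)
beta_std = standardized_coefficients(drivers, evaporative_fraction)
```

Each entry of `beta_std` is the change in the response, in standard deviations, per one standard deviation change in that driver, which is what allows the relative contributions to be compared on a common scale.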
Societal integration and age-standardized suicide rates in 21 developed countries, 1955-1989.

PubMed

Fernquist, R M; Cutright, P

1998-01-01

Gender-specific age-standardized suicide rates for 21 developed countries over seven 5-year periods (1955-59...1985-89) form the two dependent variables. Durkheim's theory of societal integration is the framework used to generate the independent variables, although several recent theories are also examined. The results from a MGLS multiple regression analysis of both male and female rates provide overwhelming support for a multidimensional theory of societal integration and suicide, as first suggested by Durkheim.
Associations of blood lead, cadmium, and mercury with estimated glomerular filtration rate in the Korean general population: Analysis of 2008-2010 Korean National Health and Nutrition Examination Survey data

DOE Office of Scientific and Technical Information (OSTI.GOV)

Kim, Yangho; Lee, Byung-Kook, E-mail: bklee@sch.ac.kr

Introduction: The objective of this study was to evaluate associations between blood lead, cadmium, and mercury levels with estimated glomerular filtration rate in a general population of South Korean adults. Methods: This was a cross-sectional study based on data obtained in the Korean National Health and Nutrition Examination Survey (KNHANES) (2008-2010). The final analytical sample consisted of 5924 participants. Estimated glomerular filtration rate (eGFR) was calculated using the MDRD Study equation as an indicator of glomerular function. Results: In multiple linear regression analysis of log2-transformed blood lead as a continuous variable on eGFR, after adjusting for covariates including cadmium and mercury, the difference in eGFR levels associated with doubling of blood lead was -2.624 mL/min per 1.73 m² (95% CI: -3.803 to -1.445). In multiple linear regression analysis using quartiles of blood lead as the independent variable, the difference in eGFR levels comparing participants in the highest versus the lowest quartiles of blood lead was -3.835 mL/min per 1.73 m² (95% CI: -5.730 to -1.939). In a multiple linear regression analysis using blood cadmium and mercury, as continuous or categorical variables, as independent variables, neither metal was a significant predictor of eGFR. Odds ratios (ORs) and 95% CI values for reduced eGFR calculated for log2-transformed blood metals and quartiles of the three metals showed similar trends after adjustment for covariates. Discussion: In this large, representative sample of South Korean adults, elevated blood lead level was consistently associated with lower eGFR levels and with the prevalence of reduced eGFR even in blood lead levels below 10 µg/dL. In conclusion, elevated blood lead level was associated with lower eGFR in a Korean general population, supporting the role of lead as a risk factor for chronic kidney disease.
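Why a coefficient on log2-transformed lead reads directly as "the change per doubling" can be checked with a few lines of arithmetic. In this sketch the slope is the value reported in the abstract, while the intercept is a hypothetical placeholder and the other covariates of the adjusted model are omitted, so only differences between predictions are meaningful.

```python
import math

BETA_LOG2_LEAD = -2.624  # mL/min per 1.73 m^2 per doubling (from the abstract)
INTERCEPT = 90.0         # hypothetical; cancels out of any difference


def predicted_egfr(blood_lead):
    """Linear predictor with lead entered as log2(lead); covariates omitted."""
    return INTERCEPT + BETA_LOG2_LEAD * math.log2(blood_lead)


def egfr_change(lead_from, lead_to):
    """Predicted difference in eGFR between two blood lead levels."""
    return predicted_egfr(lead_to) - predicted_egfr(lead_from)


# Any doubling gives the same change, whatever the starting level,
# because log2(2 * x) - log2(x) = 1.
change_2_to_4 = egfr_change(2.0, 4.0)
change_1p5_to_3 = egfr_change(1.5, 3.0)
```

Under the same assumptions, a fourfold increase corresponds to twice the coefficient, since log2(4) = 2.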
Learning style and concept acquisition of community college students in introductory biology

NASA Astrophysics Data System (ADS)

Bobick, Sandra Burin

This study investigated the influence of learning style on concept acquisition within a sample of community college students in a general biology course. There are two subproblems within the larger problem: (1) the influence of demographic variables (age, gender, number of college credits, prior exposure to scientific information) on learning style, and (2) the correlations between prior scientific knowledge, learning style and student understanding of the concept of the gene. The sample included all students enrolled in an introductory general biology course during two consecutive semesters at an urban community college. Initial data were gathered during the first week of the semester, at which time students filled in a short questionnaire (age, gender, number of college credits, prior exposure to science information either through reading/visual sources or a prior biology course). Subjects were then given the Inventory of Learning Processes-Revised (ILP-R), which measures general preferences in five learning styles: Deep Learning, Elaborative Learning, Agentic Learning, Methodical Learning and Literal Memorization. Subjects were then given the Gene Conceptual Knowledge pretest: a 15 question objective section and an essay section. Subjects were exposed to specific concepts during lecture and laboratory exercises. At the last lab, students were given the Genetics Conceptual Knowledge Posttest. Pretest/posttest gains were correlated with demographic variables and learning styles were analyzed for significant correlations. Learning styles, as the independent variable in a simultaneous multiple regression, were significant predictors of results on the gene assessment tests, including pretest, posttest and gain. Of the learning styles, Deep Learning accounted for the greatest positive predictive value of pretest essay and pretest objective results. Literal Memorization was a significant negative predictor for posttest essay, essay gain and objective gain.
Simultaneous multiple regression indicated that demographic variables were significant positive predictors for Methodical, Deep and Elaborative Learning Styles. Stepwise multiple regression resulted in number of credits, Read Science and gender (female) as significant predictors of learning styles. The findings of this study emphasize the importance of learning styles in conceptual understanding of the gene and the correlation of nonformal exposure to science information with learning style and conceptual understanding.
4D-LQTA-QSAR and docking study on potent Gram-negative specific LpxC inhibitors: a comparison to CoMFA modeling.

PubMed

Ghasemi, Jahan B; Safavi-Sohi, Reihaneh; Barbosa, Euzébio G

2012-02-01

A quasi 4D-QSAR has been carried out on a series of potent Gram-negative LpxC inhibitors. This approach makes use of the molecular dynamics (MD) trajectories and topology information retrieved from the GROMACS package. This new methodology is based on the generation of a conformational ensemble profile, CEP, for each compound instead of only one conformation, followed by the calculation of intermolecular interaction energies at each grid point considering probes and all aligned conformations resulting from MD simulations. These interaction energies are independent variables employed in a QSAR analysis. The comparison of the proposed methodology to comparative molecular field analysis (CoMFA) formalism was performed. This methodology explores jointly the main features of CoMFA and 4D-QSAR models. Step-wise multiple linear regression was used for the selection of the most informative variables. After variable selection, multiple linear regression (MLR) and partial least squares (PLS) methods were used for building the regression models. Leave-N-out cross-validation (LNO) and Y-randomization were performed in order to confirm the robustness of the model in addition to analysis of the independent test set. Best models provided the following statistics: [Formula in text] (PLS) and [Formula in text] (MLR). A docking study was applied to investigate the major interactions in the protein-ligand complex with the CDOCKER algorithm. Visualization of the descriptors of the best model helps us to interpret the model from the chemical point of view, supporting the applicability of this new approach in rational drug design.
Stepwise multiple regression method of greenhouse gas emission modeling in the energy sector in Poland.

PubMed

Kolasa-Wiecek, Alicja

2015-04-01

The energy sector in Poland is the source of 81% of greenhouse gas (GHG) emissions. Poland, among other European Union countries, occupies a leading position with regard to coal consumption. The Polish energy sector actively participates in efforts to reduce GHG emissions to the atmosphere, through a gradual decrease of the share of coal in the fuel mix and development of renewable energy sources. All evidence which completes the knowledge about issues related to GHG emissions is a valuable source of information. The article presents the results of modeling of GHG emissions which are generated by the energy sector in Poland. For a better understanding of the quantitative relationship between total consumption of primary energy and greenhouse gas emission, a multiple stepwise regression model was applied. The modeling results of CO2 emissions demonstrate a high relationship (0.97) with the hard coal consumption variable. The adjustment coefficient of the model to actual data is high and equal to 95%. The backward step regression model, in the case of CH4 emission, indicated the presence of hard coal (0.66), peat and fuel wood (0.34), solid waste fuels, as well as other sources (-0.64) as the most important variables. The adjusted coefficient is suitable and equals R2=0.90. For N2O emission modeling the obtained coefficient of determination is low and equal to 43%. A significant variable influencing the amount of N2O emission is the peat and wood fuel consumption. Copyright © 2015. Published by Elsevier B.V.
Multiple Use One-Sided Hypotheses Testing in Univariate Linear Calibration

NASA Technical Reports Server (NTRS)

Krishnamoorthy, K.; Kulkarni, Pandurang M.; Mathew, Thomas

1996-01-01

Consider a normally distributed response variable, related to an explanatory variable through the simple linear regression model. Data obtained on the response variable, corresponding to known values of the explanatory variable (i.e., calibration data), are to be used for testing hypotheses concerning unknown values of the explanatory variable. We consider the problem of testing an unlimited sequence of one sided hypotheses concerning the explanatory variable, using the corresponding sequence of values of the response variable and the same set of calibration data. This is the situation of multiple use of the calibration data. The tests derived in this context are characterized by two types of uncertainties: one uncertainty associated with the sequence of values of the response variable, and a second uncertainty associated with the calibration data. We derive tests based on a condition that incorporates both of these uncertainties. The solution has practical applications in the decision limit problem. We illustrate our results using an example dealing with the estimation of blood alcohol concentration based on breath estimates of the alcohol concentration. In the example, the problem is to test if the unknown blood alcohol concentration of an individual exceeds a threshold that is safe for driving.
Predictive ability of a comprehensive incremental test in mountain bike marathon

PubMed Central

Schneeweiss, Patrick; Martus, Peter; Niess, Andreas M; Krauss, Inga

2018-01-01

Objectives Traditional performance tests in mountain bike marathon (XCM) primarily quantify aerobic metabolism and may not describe the relevant capacities in XCM. We aimed to validate a comprehensive test protocol quantifying its intermittent demands. Methods Forty-nine athletes (38.8±9.1 years; 38 male; 11 female) performed a laboratory performance test, including an incremental test, to determine individual anaerobic threshold (IAT), peak power output (PPO) and three maximal efforts (10 s all-out sprint, 1 min maximal effort and 5 min maximal effort). Within 2 weeks, the athletes participated in one of three XCM races (n=15, n=9 and n=25). Correlations between test variables and race times were calculated separately. In addition, multiple regression models of the predictive value of laboratory outcomes were calculated for race 3 and across all races (z-transformed data). Results All variables were correlated with race times 1, 2 and 3: 10 s all-out sprint (r=−0.72; r=−0.59; r=−0.61), 1 min maximal effort (r=−0.85; r=−0.84; r=−0.82), 5 min maximal effort (r=−0.57; r=−0.85; r=−0.76), PPO (r=−0.77; r=−0.73; r=−0.76) and IAT (r=−0.71; r=−0.67; r=−0.68). The best-fitting multiple regression models for race 3 (r2=0.868) and across all races (r2=0.757) comprised 1 min maximal effort, IAT and body weight. Conclusion Aerobic and intermittent variables correlated least strongly with race times. Their use in a multiple regression model confirmed additional explanatory power to predict XCM performance. These findings underline the usefulness of the comprehensive incremental test to predict performance in that sport more precisely. PMID:29387445
The comparison of robust partial least squares regression with robust principal component regression on a real

NASA Astrophysics Data System (ADS)

Polat, Esra; Gunay, Suleyman

2013-10-01

One of the problems encountered in Multiple Linear Regression (MLR) is multicollinearity, which causes overestimation of the regression parameters and increases their variance. Hence, when multicollinearity is present, biased estimation procedures such as classical Principal Component Regression (CPCR) and Partial Least Squares Regression (PLSR) are performed. The SIMPLS algorithm is the leading PLSR algorithm because of its speed and efficiency, and because its results are easier to interpret. However, both CPCR and SIMPLS yield very unreliable results when the data set contains outlying observations. Therefore, Hubert and Vanden Branden (2003) presented a robust PCR (RPCR) method and a robust PLSR (RPLSR) method called RSIMPLS. In RPCR, a robust Principal Component Analysis (PCA) method for high-dimensional data is first applied to the independent variables; the dependent variables are then regressed on the scores using a robust regression method. RSIMPLS is constructed from a robust covariance matrix for high-dimensional data and robust linear regression. The purpose of this study is to show the use of the RPCR and RSIMPLS methods on an econometric data set, comparing the two methods on an inflation model of Turkey. The methods are compared in terms of predictive ability and goodness of fit using a robust Root Mean Squared Error of Cross-validation (R-RMSECV), a robust R2 value and the Robust Component Selection (RCS) statistic.
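The variance inflation caused by multicollinearity can be illustrated with the variance inflation factor (VIF). This is a sketch under stated assumptions, not part of the study: it covers only the two-predictor case, where the VIF reduces to 1 / (1 - r^2) with r the Pearson correlation between the predictors, and all data values are invented.

```python
from math import sqrt

def pearson_r(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)

def vif_two_predictors(x1, x2):
    """VIF for a model with exactly two predictors: 1 / (1 - r^2)."""
    r = pearson_r(x1, x2)
    return 1.0 / (1.0 - r * r)

# Hypothetical predictors: x2_collinear nearly duplicates x1, x2_unrelated does not.
x1 = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
x2_collinear = [1.1, 1.9, 3.2, 3.9, 5.1, 6.0]
x2_unrelated = [3.0, 1.0, 4.0, 1.0, 5.0, 2.0]

vif_high = vif_two_predictors(x1, x2_collinear)
vif_low = vif_two_predictors(x1, x2_unrelated)
```

A VIF close to 1 indicates little inflation, while a large VIF means the coefficient variance is many times larger than it would be with uncorrelated predictors, which is the situation PCR and PLSR are meant to address.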

A principal component regression model to forecast airborne concentration of Cupressaceae pollen in the city of Granada (SE Spain), during 1995-2006.

PubMed

Ocaña-Peinado, Francisco M; Valderrama, Mariano J; Bouzas, Paula R

2013-05-01

The problem of developing a 2-week-ahead forecast of atmospheric cypress pollen levels is tackled in this paper by developing a principal component multiple regression model involving several climatic variables. The efficacy of the proposed model is validated by means of an application to real data of Cupressaceae pollen concentration in the city of Granada (southeast of Spain). The model was applied to data from 11 consecutive years (1995-2005), with 2006 being used to validate the forecasts. Based on the work of different authors, factors such as temperature, humidity, hours of sun and wind speed were incorporated in the model. This methodology explains approximately 75-80% of the variability in the airborne Cupressaceae pollen concentration.
Forecasting daily patient volumes in the emergency department.

PubMed

Jones, Spencer S; Thomas, Alun; Evans, R Scott; Welch, Shari J; Haug, Peter J; Snow, Gregory L

2008-02-01

Shifts in the supply of and demand for emergency department (ED) resources make the efficient allocation of ED resources increasingly important. Forecasting is a vital activity that guides decision-making in many areas of economic, industrial, and scientific planning, but has gained little traction in the health care industry. There are few studies that explore the use of forecasting methods to predict patient volumes in the ED. The goals of this study are to explore and evaluate the use of several statistical forecasting methods to predict daily ED patient volumes at three diverse hospital EDs and to compare the accuracy of these methods to the accuracy of a previously proposed forecasting method. Daily patient arrivals at three hospital EDs were collected for the period January 1, 2005, through March 31, 2007. The authors evaluated the use of seasonal autoregressive integrated moving average, time series regression, exponential smoothing, and artificial neural network models to forecast daily patient volumes at each facility. Forecasts were made for horizons ranging from 1 to 30 days in advance. The forecast accuracy achieved by the various forecasting methods was compared to the forecast accuracy achieved when using a benchmark forecasting method already available in the emergency medicine literature. All time series methods considered in this analysis provided improved in-sample model goodness of fit. However, post-sample analysis revealed that time series regression models that augment linear regression models by accounting for serial autocorrelation offered only small improvements in terms of post-sample forecast accuracy, relative to multiple linear regression models, while seasonal autoregressive integrated moving average, exponential smoothing, and artificial neural network forecasting models did not provide consistently accurate forecasts of daily ED volumes. 
This study confirms the widely held belief that daily demand for ED services is characterized by seasonal and weekly patterns. The authors compared several time series forecasting methods to a benchmark multiple linear regression model. The results suggest that the existing methodology proposed in the literature, multiple linear regression based on calendar variables, is a reasonable approach to forecasting daily patient volumes in the ED. However, the authors conclude that regression-based models that incorporate calendar variables, account for site-specific special-day effects, and allow for residual autocorrelation provide a more appropriate, informative, and consistently accurate approach to forecasting daily ED patient volumes.
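A regression on calendar variables can be sketched in its simplest form. When the only predictors are a full set of day-of-week indicators, the least-squares fitted value for each weekday equals that weekday's mean, so no matrix algebra is needed. This is an illustrative simplification: the models in the study also use other calendar variables, special-day effects and residual autocorrelation, all omitted here, and the arrival counts are invented.

```python
from collections import defaultdict
from datetime import date, timedelta

def fit_weekday_means(history):
    """history: list of (date, arrivals). Returns {weekday index: mean arrivals}."""
    totals = defaultdict(float)
    counts = defaultdict(int)
    for day, volume in history:
        totals[day.weekday()] += volume
        counts[day.weekday()] += 1
    return {wd: totals[wd] / counts[wd] for wd in totals}

def forecast(weekday_means, day):
    """Forecast arrivals for a future date from its day of the week."""
    return weekday_means[day.weekday()]

# Hypothetical history: four weeks starting Monday 2007-01-01, Mondays busier.
start = date(2007, 1, 1)
history = []
for offset in range(28):
    day = start + timedelta(days=offset)
    base = 120 if day.weekday() == 0 else 100
    history.append((day, base + offset % 3))

weekday_means = fit_weekday_means(history)
monday_forecast = forecast(weekday_means, date(2007, 4, 2))
```

Because the forecast depends only on the weekday, the horizon (1 to 30 days ahead) does not change it, which is one reason such calendar-based models make a convenient benchmark.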
A regression approach to the mapping of bio-physical characteristics of surface sediment using in situ and airborne hyperspectral acquisitions

NASA Astrophysics Data System (ADS)

Ibrahim, Elsy; Kim, Wonkook; Crawford, Melba; Monbaliu, Jaak

2017-02-01

Remote sensing has been successfully utilized to distinguish and quantify sediment properties in the intertidal environment. Classification approaches of imagery are popular and powerful yet can lead to site- and case-specific results. Such specificity creates challenges for temporal studies. Thus, this paper investigates the use of regression models to quantify sediment properties instead of classifying them. Two regression approaches, namely multiple regression (MR) and support vector regression (SVR), are used in this study for the retrieval of bio-physical variables of intertidal surface sediment of the IJzermonding, a Belgian nature reserve. In the regression analysis, mud content, chlorophyll a concentration, organic matter content, and soil moisture are estimated using radiometric variables of two airborne sensors, namely airborne hyperspectral sensor (AHS) and airborne prism experiment (APEX), and using field hyperspectral acquisitions by analytical spectral device (ASD). The performance of the two regression approaches is best for the estimation of moisture content. SVR attains the highest accuracy without feature reduction while MR achieves good results when feature reduction is carried out. Sediment property maps are successfully obtained using the models and hyperspectral imagery where SVR used with all bands achieves the best performance. The study also involves the extraction of weights identifying the contribution of each band of the images in the quantification of each sediment property when MR and principal component analysis are used.
Energy expenditure estimation during daily military routine with body-fixed sensors.

PubMed

Wyss, Thomas; Mäder, Urs

2011-05-01

The purpose of this study was to develop and validate an algorithm for estimating energy expenditure during the daily military routine on the basis of data collected using body-fixed sensors. First, 8 volunteers completed isolated physical activities according to an established protocol, and the resulting data were used to develop activity-class-specific multiple linear regressions for physical activity energy expenditure on the basis of hip acceleration, heart rate, and body mass as independent variables. Second, the validity of these linear regressions was tested during the daily military routine using indirect calorimetry (n = 12). Volunteers' mean estimated energy expenditure did not significantly differ from the energy expenditure measured with indirect calorimetry (p = 0.898, 95% confidence interval = -1.97 to 1.75 kJ/min). We conclude that the developed activity-class-specific multiple linear regressions applied to the acceleration and heart rate data allow estimation of energy expenditure in 1-minute intervals during daily military routine, with accuracy equal to indirect calorimetry.
[Quantitative structure-gas chromatographic retention relationship of polycyclic aromatic sulfur heterocycles using molecular electronegativity-distance vector].

PubMed

Li, Zhenghua; Cheng, Fansheng; Xia, Zhining

2011-01-01

The chemical structures of 114 polycyclic aromatic sulfur heterocycles (PASHs) have been studied using the molecular electronegativity-distance vector (MEDV). The linear relationships between gas chromatographic retention index and the MEDV have been established by a multiple linear regression (MLR) model. The results of variable selection by stepwise multiple regression (SMR) and the predictive abilities of the optimized model, appraised by leave-one-out cross-validation, showed that the optimized model, with a correlation coefficient (R) of 0.9947 and a cross-validated correlation coefficient (Rcv) of 0.9940, possessed the best statistical quality. Furthermore, when the 114 PASH compounds were divided into calibration and test sets in the ratio of 2:1, the statistical analysis showed that the models possess almost equal statistical quality, very similar regression coefficients and good robustness. The quantitative structure-retention relationship (QSRR) model established may provide a convenient and powerful method for predicting the gas chromatographic retention of PASHs.
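Leave-one-out cross-validation refits the model once per compound, each time predicting the compound that was held out. The sketch below uses a single hypothetical descriptor and invented retention indices rather than the MEDV descriptors of the paper, and it assumes that Rcv is the correlation between observed values and leave-one-out predictions.

```python
from math import sqrt

def fit_line(x, y):
    """Ordinary least squares for y = intercept + slope * x."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    slope = sum((a - mx) * (b - my) for a, b in zip(x, y)) / sum((a - mx) ** 2 for a in x)
    return my - slope * mx, slope

def loo_predictions(x, y):
    """Predict each observation from a model fitted to all the others."""
    preds = []
    for i in range(len(x)):
        intercept, slope = fit_line(x[:i] + x[i + 1:], y[:i] + y[i + 1:])
        preds.append(intercept + slope * x[i])
    return preds

def pearson_r(x, y):
    """Pearson correlation coefficient."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    return sxy / sqrt(sum((a - mx) ** 2 for a in x) * sum((b - my) ** 2 for b in y))

# Hypothetical descriptor values and retention indices (not from the paper).
descriptor = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0]
retention = [126.0, 149.5, 176.0, 199.0, 226.5, 249.0, 275.5, 300.5]

r_cv = pearson_r(retention, loo_predictions(descriptor, retention))
```

An Rcv that stays close to the fitted R, as reported in the abstract (0.9940 versus 0.9947), suggests the model is not relying on any single compound.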
Modelling long-term fire occurrence factors in Spain by accounting for local variations with geographically weighted regression

NASA Astrophysics Data System (ADS)

Martínez-Fernández, J.; Chuvieco, E.; Koutsias, N.

2013-02-01

Humans are responsible for most forest fires in Europe, but anthropogenic factors behind these events are still poorly understood. We tried to identify the driving factors of human-caused fire occurrence in Spain by applying two different statistical approaches. Firstly, assuming stationary processes for the whole country, we created models based on multiple linear regression and binary logistic regression to find factors associated with fire density and fire presence, respectively. Secondly, we used geographically weighted regression (GWR) to better understand and explore the local and regional variations of those factors behind human-caused fire occurrence. The number of human-caused fires occurring within a 25-yr period (1983-2007) was computed for each of the 7638 Spanish mainland municipalities, creating a binary variable (fire/no fire) to develop logistic models, and a continuous variable (fire density) to build standard linear regression models. A total of 383 657 fires were registered in the study dataset. The binary logistic model, which estimates the probability of having/not having a fire, successfully classified 76.4% of the total observations, while the ordinary least squares (OLS) regression model explained 53% of the variation of the fire density patterns (adjusted R2 = 0.53). Both approaches confirmed, in addition to forest and climatic variables, the importance of variables related with agrarian activities, land abandonment, rural population exodus and developmental processes as underlying factors of fire occurrence. For the GWR approach, the explanatory power of the GW linear model for fire density using an adaptive bandwidth increased from 53% to 67%, while for the GW logistic model the correctly classified observations improved only slightly, from 76.4% to 78.4%, but significantly according to the corrected Akaike Information Criterion (AICc), from 3451.19 to 3321.19. 
The results from GWR indicated a significant spatial variation in the local parameter estimates for all the variables and an important reduction of the autocorrelation in the residuals of the GW linear model. Despite the fitting improvement of local models, GW regression, more than an alternative to "global" or traditional regression modelling, seems to be a valuable complement to explore the non-stationary relationships between the response variable and the explanatory variables. The synergy of global and local modelling provides insights into fire management and policy and helps further our understanding of the fire problem over large areas while at the same time recognizing its local character.
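The core idea of geographically weighted regression, fitting a separate weighted regression around each location, can be sketched as follows. The assumptions are loud ones: a single predictor, a one-dimensional coordinate, a Gaussian kernel and a fixed bandwidth (the authors report an adaptive bandwidth), with invented data in which the true slope changes from 1 to 3 across the study area.

```python
from math import exp

def gaussian_weights(coords, focal, bandwidth):
    """Kernel weights that decay with distance from the focal location."""
    return [exp(-0.5 * ((c - focal) / bandwidth) ** 2) for c in coords]

def weighted_line(x, y, w):
    """Weighted least squares for y = intercept + slope * x."""
    sw = sum(w)
    mx = sum(wi * xi for wi, xi in zip(w, x)) / sw
    my = sum(wi * yi for wi, yi in zip(w, y)) / sw
    sxy = sum(wi * (xi - mx) * (yi - my) for wi, xi, yi in zip(w, x, y))
    sxx = sum(wi * (xi - mx) ** 2 for wi, xi in zip(w, x))
    slope = sxy / sxx
    return my - slope * mx, slope

def local_slope(coords, x, y, focal, bandwidth):
    """Slope of the regression fitted locally around one focal location."""
    weights = gaussian_weights(coords, focal, bandwidth)
    return weighted_line(x, y, weights)[1]

# Hypothetical data: 20 locations on a line; the x-y relationship is
# non-stationary, with slope 1 in the west (coord < 10) and 3 in the east.
coords = [float(i) for i in range(20)]
x = [float((7 * i) % 5 + 1) for i in range(20)]
y = [(1.0 if c < 10 else 3.0) * xi for c, xi in zip(coords, x)]

slope_west = local_slope(coords, x, y, focal=2.0, bandwidth=2.0)
slope_east = local_slope(coords, x, y, focal=17.0, bandwidth=2.0)
```

A single global regression on these data would report one compromise slope, whereas the local fits recover the two regimes, which is the kind of spatial variation in parameter estimates the abstract describes.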
Multiple Imputation For Combined-Survey Estimation With Incomplete Regressors In One But Not Both Surveys

PubMed Central

Rendall, Michael S.; Ghosh-Dastidar, Bonnie; Weden, Margaret M.; Baker, Elizabeth H.; Nazarov, Zafar

2013-01-01

Within-survey multiple imputation (MI) methods are adapted to pooled-survey regression estimation where one survey has more regressors, but typically fewer observations, than the other. This adaptation is achieved through: (1) larger numbers of imputations to compensate for the higher fraction of missing values; (2) model-fit statistics to check the assumption that the two surveys sample from a common universe; and (3) specifying the analysis model completely from variables present in the survey with the larger set of regressors, thereby excluding variables never jointly observed. In contrast to the typical within-survey MI context, cross-survey missingness is monotonic and easily satisfies the Missing At Random (MAR) assumption needed for unbiased MI. Large efficiency gains and substantial reduction in omitted variable bias are demonstrated in an application to sociodemographic differences in the risk of child obesity estimated from two nationally-representative cohort surveys. PMID:24223447
A comparison of two microscale laboratory reporting methods in a secondary chemistry classroom

NASA Astrophysics Data System (ADS)

Martinez, Lance Michael

This study attempted to determine if there was a difference between the laboratory achievement of students who used a modified reporting method and those who used traditional laboratory reporting. The study also determined the relationships between laboratory performance scores and the independent variables score on the Group Assessment of Logical Thinking (GALT) test, chronological age in months, gender, and ethnicity for each of the treatment groups. The study was conducted using 113 high school students who were enrolled in first-year general chemistry classes at Pueblo South High School in Colorado. The research design used was the quasi-experimental Nonequivalent Control Group Design. The statistical treatment consisted of the Multiple Regression Analysis and the Analysis of Covariance. Based on the GALT, students in the two groups were generally in the concrete and transitional stages of the Piagetian cognitive levels. The findings of the study revealed that the traditional and the modified methods of laboratory reporting did not have any effect on the laboratory performance outcome of the subjects. However, the students who used the traditional method of reporting showed a higher laboratory performance score when evaluation was conducted using the New Standards rubric recommended by the state. Multiple Regression Analysis revealed that there was a significant relationship between the criterion variable student laboratory performance outcome of individuals who employed traditional laboratory reporting methods and the composite set of predictor variables. On the contrary, there was no significant relationship between the criterion variable student laboratory performance outcome of individuals who employed modified laboratory reporting methods and the composite set of predictor variables.
The Moderating Role of Power Distance on the Relationship between Employee Participation and Outcome Variables

PubMed Central

Rafiei, Sima; Pourreza, Abolghasem

2013-01-01

Background: Many organisations have realised the importance of human resources for their competitive advantage. Empowering employees is therefore essential for organisational effectiveness. This study aimed to investigate the relationship between employee participation and outcome variables such as organisational commitment, job satisfaction, perception of justice in an organisation and readiness to accept job responsibilities. It further examined the impact of power distance on the relationship between participation and the four outcome variables. Methods: This was a cross sectional study with a descriptive research design conducted among employees and managers of hospitals affiliated with Tehran University of Medical Sciences, Tehran, Iran. A questionnaire, the main data-gathering instrument, was developed, distributed and collected. Descriptive statistics, Pearson correlation coefficient and moderated multiple regression were used to analyse the study data. Results: Findings of the study showed that the level of power distance perceived by employees had a significant relationship with employee participation, organisational commitment, job satisfaction, perception of justice and readiness to accept job responsibilities. There was also a significant relationship between employee participation and the four outcome variables. The moderated multiple regression results supported the hypothesis that power distance had a significant effect on the relationship between employee participation and the four outcome variables. Conclusion: Organisations in which employee empowerment is practiced through diverse means, such as involving employees in decision making related to their field of work, appear to have more committed and satisfied employees with a positive perception of justice in organisational interactions and readiness to accept job responsibilities. PMID:24596840
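Moderated multiple regression adds a product term to the model, so that the effect of one predictor depends on the level of the moderator: y = b0 + b1*x + b2*m + b3*x*m. The sketch below fits that model by solving the normal equations with a small hand-written solver. Variable names, coefficients and data are hypothetical, and the noise-free outcome is chosen only so the recovered coefficients are easy to check.

```python
def solve_linear_system(a, b):
    """Solve a @ x = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(b)
    aug = [list(row) + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("design matrix is singular")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(n):
            if r != col:
                factor = aug[r][col] / aug[col][col]
                for c in range(col, n + 1):
                    aug[r][c] -= factor * aug[col][c]
    return [aug[i][n] / aug[i][i] for i in range(n)]

def fit_moderated_regression(x, m, y):
    """Least-squares fit of y = b0 + b1*x + b2*m + b3*(x*m)."""
    design = [[1.0, xi, mi, xi * mi] for xi, mi in zip(x, m)]
    k = len(design[0])
    xtx = [[sum(row[i] * row[j] for row in design) for j in range(k)] for i in range(k)]
    xty = [sum(row[i] * yi for row, yi in zip(design, y)) for i in range(k)]
    return solve_linear_system(xtx, xty)

# Hypothetical scores on a 4 x 3 grid of participation and power distance.
participation = [float(p) for p in range(4) for _ in range(3)]
power_distance = [float(d) for _ in range(4) for d in range(3)]
commitment = [1.0 + 2.0 * p + 3.0 * d - 0.5 * p * d
              for p, d in zip(participation, power_distance)]

b0, b1, b2, b3 = fit_moderated_regression(participation, power_distance, commitment)
```

A non-zero b3 is what signals moderation: here the slope of commitment on participation is b1 + b3 * power_distance, so it weakens as power distance rises. In practice predictors are often mean-centred before forming the product term, a step skipped in this sketch.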
[Factors Influencing Quality of Life of Alcoholics Anonymous Members in Korea].

PubMed

Yoo, Jae Soon; Lee, Jongeun; Park, Woo Young

2016-04-01

The purpose of this study was to determine quality of life (QOL) related factors in Alcoholics Anonymous (AA) members based on the PRECEDE model. A cross sectional survey was conducted with participants (N = 203) from AA meetings in 11 alcohol counseling centers across South Korea. Data were collected using a specially designed questionnaire based on the PRECEDE model and including QOL, epidemiological factors (including depression and perceived health status), behavioral factors (continuous abstinence and physical health status and practice), predisposing factors (abstinence self-efficacy and self-esteem), reinforcing factors (social capital and family functioning), and enabling factors. Data were analyzed using t-test, one way ANOVA, Tukey HSD test and hierarchical multiple regression analysis with SPSS (ver. 21.0). Of the educational diagnostic variables, self-esteem (β=.23), family functioning (β=.12), abstinence self-efficacy (β=.12) and social capital (β=.11) were strong influential factors in AA members' QOL. In addition, epidemiological diagnostic variables such as depression (β=-.44) and perceived health status (β=.35) were the main factors in QOL. Also, physical health status and practice (β=.106), one of the behavioral diagnostic variables, was a beneficial factor in QOL. Hierarchical multiple regression analysis showed the determinant variables accounted for 44.0% of the variation in QOL (F=25.76, p<.001). The findings of the study can be used as a framework for planning interventions in order to promote the quality of life of AA members. It is necessary to develop nursing intervention strategies for strengthening educational and epidemiological diagnostic variables in order to improve AA members' QOL.
Handgrip fatiguing exercise can provide objective assessment of cancer-related fatigue: a pilot study.

PubMed

Veni, T; Boyas, S; Beaune, B; Bourgeois, H; Rahmani, A; Landry, S; Bochereau, A; Durand, S; Morel, B

2018-06-24

As a subjective symptom, cancer-related fatigue is assessed via patient-reported outcomes. Due to the inherent bias of such evaluation, screening and treatment for cancer-related fatigue remain suboptimal. The purpose of this study was to evaluate whether objective hand muscle mechanical parameters of cancer patients (maximal force, critical force, force variability), extracted from a fatiguing handgrip exercise, may be correlated with the different dimensions (physical, emotional, and cognitive) of cancer-related fatigue. Fourteen women with advanced breast cancer, still under or having previously received chemotherapy within the preceding 3 months, and 11 healthy women participated in the present study. Cancer-related fatigue was first assessed through the EORTC QLQ-30 and its fatigue module. Fatigability was then measured during 60 maximal repeated handgrip contractions. The maximum force, critical force (asymptote of the force-time evolution), and force variability (root mean square of the successive differences) were extracted. Multiple regression models were performed to investigate the influence of the force parameters on cancer-related fatigue's dimensions. The multiple linear regression analysis evidenced that physical fatigue was best explained by maximum force and critical force (r = 0.81; p = 0.029). The emotional fatigue was best explained by maximum force, critical force, and force variability (r = 0.83; p = 0.008). The cognitive fatigue was best explained by critical force and force variability (r = 0.62; p = 0.035). The handgrip maximal force, critical force, and force variability may offer objective measures of the different dimensions of cancer-related fatigue and could provide a complementary approach to the patient reported outcomes.
Climate change but not unemployment explains the changing suicidality in Thessaloniki Greece (2000-2012).

PubMed

Fountoulakis, Konstantinos N; Savopoulos, Christos; Zannis, Prodromos; Apostolopoulou, Martha; Fountoukidis, Ilias; Kakaletsis, Nikolaos; Kanellos, Ilias; Dimellis, Dimos; Hyphantis, Thomas; Tsikerdekis, Athanasios; Pompili, Maurizio; Hatzitolios, Apostolos I

2016-03-15

Recently there was a debate concerning the etiology behind attempts and completed suicides. The aim of the current study was to search for possible correlations between the rates of attempted and completed suicide and climate variables and regional unemployment per year in the county of Thessaloniki, Macedonia, northern Greece, for the years 2000-12. The regional rates of suicide and attempted suicide as well as regional unemployment were available from previous publications of the authors. The climate variables were calculated from the daily E-OBS gridded dataset, which is based on observational data. Only the male suicide rates correlate significantly with high mean annual temperature but not with unemployment. The multiple linear regression analysis results suggest that temperature is the only variable that determines male suicides and explains 51% of their variance. Unemployment fails to contribute significantly to the model. There seems to be a seasonal distribution for attempts, with mean rates being higher for the period from May to October, and the rates clearly correlate with temperature. The highest mean rates were observed during May and August and the lowest during December and February. Multiple linear regression analysis suggests that temperature also determines the female attempts rate, although the explained variance is significant but very low (3-5%). Climate variables and specifically high temperature correlate both with suicide and attempted suicide rates but in different ways for males and females. The climate effect was stronger than the effect of unemployment. Copyright © 2016 Elsevier B.V. All rights reserved.
Estimating magnitude and frequency of peak discharges for rural, unregulated, streams in West Virginia

USGS Publications Warehouse

Wiley, J.B.; Atkins, John T.; Tasker, Gary D.

2000-01-01

Multiple and simple least-squares regression models for the log10-transformed 100-year discharge with independent variables describing the basin characteristics (log10-transformed and untransformed) for 267 streamflow-gaging stations were evaluated, and the regression residuals were plotted as areal distributions that defined three regions of the State, designated East, North, and South. Exploratory data analysis procedures identified 31 gaging stations at which discharges are different than would be expected for West Virginia. Regional equations for the 2-, 5-, 10-, 25-, 50-, 100-, 200-, and 500-year peak discharges were determined by generalized least-squares regression using data from 236 gaging stations. Log10-transformed drainage area was the most significant independent variable for all regions. Equations developed in this study are applicable only to rural, unregulated, streams within the boundaries of West Virginia. The accuracy of estimating equations is quantified by measuring the average prediction error (from 27.7 to 44.7 percent) and equivalent years of record (from 1.6 to 20.0 years).
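Regressing log10-transformed discharge on log10-transformed drainage area amounts to fitting a power law, Q = 10^a * A^b. The sketch below shows that mechanic with ordinary least squares, whereas the report's final equations used generalized least squares. The drainage areas, the coefficient 100 and the exponent 0.7 are invented for illustration and are not the published West Virginia equations.

```python
from math import log10

def fit_line(x, y):
    """Ordinary least squares for y = intercept + slope * x."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    slope = sum((a - mx) * (b - my) for a, b in zip(x, y)) / sum((a - mx) ** 2 for a in x)
    return my - slope * mx, slope

# Hypothetical drainage areas and noise-free 100-year discharges following
# Q = 100 * A**0.7 (units left unspecified on purpose).
drainage_area = [10.0, 50.0, 100.0, 500.0, 1000.0]
q100 = [100.0 * area ** 0.7 for area in drainage_area]

# Fit in log space: log10(Q) = intercept + exponent * log10(A).
intercept, exponent = fit_line([log10(a) for a in drainage_area],
                               [log10(q) for q in q100])

def predict_q100(area):
    """Back-transform the log-space prediction to discharge units."""
    return 10 ** (intercept + exponent * log10(area))
```

With real, noisy data the back-transformed prediction estimates a median rather than a mean discharge, which is one reason reports of this kind quote prediction error in percent.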
Above-ground biomass of mangrove species. I. Analysis of models

NASA Astrophysics Data System (ADS)

Soares, Mário Luiz Gomes; Schaeffer-Novelli, Yara

2005-10-01

This study analyzes the above-ground biomass of Rhizophora mangle and Laguncularia racemosa located in the mangroves of Bertioga (SP) and Guaratiba (RJ), Southeast Brazil. Its purpose is to determine the best regression model to estimate the total above-ground biomass and compartment (leaves, reproductive parts, twigs, branches, trunk and prop roots) biomass, indirectly. To do this, we used structural measurements such as height, diameter at breast-height (DBH), and crown area. A combination of regression types with several compositions of independent variables generated 2.272 models that were later tested. Subsequent analysis of the models indicated that the biomass of reproductive parts, branches, and prop roots yielded great variability, probably because of environmental factors and seasonality (in the case of reproductive parts). It also indicated the superiority of multiple regression to estimate above-ground biomass as it allows researchers to consider several aspects that affect above-ground biomass, especially the influence of environmental factors. This fact has been attested to the models that estimated the biomass of crown compartments.
Total energy expenditure in adults with cerebral palsy as assessed by doubly labeled water.

PubMed

Johnson, R K; Hildreth, H G; Contompasis, S H; Goran, M I

1997-09-01

To characterize total energy expenditure (TEE) in free-living adults with cerebral palsy (CP) using the doubly labeled water technique, and to determine those physiologic variables and characteristics of CP that were markers of TEE in adults with CP. TEE was measured using the doubly labeled water technique in 30 free-living adults with CP (12 women, 18 men). To determine the best markers of TEE, the following factors were examined: CP status, resting metabolic rate (RMR), anthropometric characteristics and body composition by means of dual-energy x-ray absorptiometry (DXA) and skinfold thickness measurements, energy cost of leisure-time activities, and oral-motor impairment. Means ± standard deviations, t tests, Pearson product-moment correlation coefficients, Spearman rank correlation coefficients, chi-square, stepwise multiple-correlation regression analysis, and analysis of covariance were used to examine the relationships among variables of interest. TEE was highly variable in the sample (mean = 2,455 ± 622 kcal/day for men and 1,986 ± 363 kcal/day for women). Stepwise regression analysis showed that TEE was best predicted in the sample by RMR, percentage body fat determined by DXA, ambulation status, and sex (multiple R = .68, P = .003). When practical, easily measured variables were used, TEE was best predicted by height, ambulation status, percentage body fat by skinfold thickness measurements, and sex (multiple R = .61, P = .018). The contribution of energy expended in physical activity to TEE was significantly higher in the ambulatory subjects than the nonambulatory subjects (25% vs 16%, respectively; P = .009). The high degree of variability in TEE, largely attributable to high interindividual variation in energy expended in physical activity, makes it difficult to provide general guidelines for energy requirements for adults with CP. 
Because ambulation status was an important predictor of TEE, it must be accounted for in estimating energy requirements in this population.
A multiscaled model of southwestern willow flycatcher breeding habitat

USGS Publications Warehouse

Hatten, J.R.; Paradzick, C.E.

2003-01-01

The southwestern willow flycatcher (SWFL; Empidonax traillii extimus) is an endangered songbird whose habitat has declined dramatically over the last century. Understanding habitat selection patterns and the ability to identify potential breeding areas for the SWFL is crucial to the management and conservation of this species. We developed a multiscaled model of SWFL breeding habitat with a Geographic Information System (GIS), survey data, GIS variables, and multiple logistic regressions. We obtained presence and absence survey data from a riverine ecosystem and a reservoir delta in south-central Arizona, USA, in 1999. We extracted the GIS variables from satellite imagery and digital elevation models to characterize vegetation and floodplain within the project area. We used multiple logistic regressions within a cell-based (30 × 30 m) modeling environment to (1) determine associations between GIS variables and breeding-site occurrence at different spatial scales (0.09-72 ha), and (2) construct a predictive model. Our best model explained 54% of the variability in breeding-site occurrence with the following variables: vegetation density at the site (0.09 ha), proportion of dense vegetation and variability in vegetation density within a 4.5-ha neighborhood, and amount of floodplain or flat terrain within a 41-ha neighborhood. The density of breeding sites was highest in areas that the model predicted to be most suitable within the project area and at an external test site 200 km away. Conservation efforts must focus on protecting not only occupied patches, but also surrounding riparian forests and floodplain to ensure long-term viability of SWFL. We will use the multiscaled model to map SWFL breeding habitat in Arizona, prioritize future survey effort, and examine changes in habitat abundance and quality over time.
What is the relationship between renal function and visit-to-visit blood pressure variability in primary care? Retrospective cohort study from routinely collected healthcare data.

PubMed

Lasserson, Daniel S; Scherpbier de Haan, Nynke; de Grauw, Wim; van der Wel, Mark; Wetzels, Jack F; O'Callaghan, Christopher A

2016-06-10

To determine the relationship between renal function and visit-to-visit blood pressure (BP) variability in a cohort of primary care patients. Retrospective cohort study from routinely collected healthcare data. Primary care in Nijmegen, the Netherlands, from 2007 to 2012. 19 175 patients who had a measure of renal function, and 7 separate visits with BP readings in the primary care record. Visit-to-visit variability in systolic BP, calculated from the first 7 office measurements, including SD, successive variation, absolute real variation and metrics of variability shown to be independent of mean. Multiple linear regression was used to analyse the influence of estimated glomerular filtration rate (eGFR) on BP variability measures with adjustment for age, sex, diabetes, mean BP, proteinuria, cardiovascular disease, time interval between measures and antihypertensive use. In the patient cohort, 57% were women, mean (SD) age was 65.5 (12.3) years, mean (SD) eGFR was 75.6 (18.0) mL/min/1.73 m² and SD systolic BP 148.3 (21.4) mm Hg. All BP variability measures were negatively correlated with eGFR and positively correlated with age. However, multiple linear regressions demonstrated consistent, small magnitude negative relationships between eGFR and all measures of BP variability adjusting for confounding variables. Worsening renal function is associated with small increases in measures of visit-to-visit BP variability after adjustment for confounding factors. This is seen across the spectrum of renal function in the population, and provides a mechanism whereby chronic kidney disease may raise the risk of cardiovascular events. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://www.bmj.com/company/products-services/rights-and-licensing/
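Three of the visit-to-visit variability measures named above can be computed from a short series of readings. The sketch assumes the commonly used definitions, which the abstract does not spell out: SD as the sample standard deviation, successive variation as the root mean square of consecutive differences, and absolute real variation as the mean absolute consecutive difference. The seven systolic readings are invented.

```python
from math import sqrt
from statistics import stdev

def successive_variation(readings):
    """Root mean square of the differences between consecutive readings."""
    diffs = [b - a for a, b in zip(readings, readings[1:])]
    return sqrt(sum(d * d for d in diffs) / len(diffs))

def absolute_real_variation(readings):
    """Mean absolute difference between consecutive readings."""
    diffs = [abs(b - a) for a, b in zip(readings, readings[1:])]
    return sum(diffs) / len(diffs)

# Hypothetical systolic BP (mm Hg) from seven consecutive office visits.
systolic = [150.0, 142.0, 158.0, 147.0, 151.0, 139.0, 155.0]

bp_sd = stdev(systolic)
bp_sv = successive_variation(systolic)
bp_arv = absolute_real_variation(systolic)
```

Unlike SD, the two successive-difference measures depend on the order of the visits, so a steady drift in BP inflates SD more than it inflates them.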
[Prevalence and factors associated with peripheral artery disease in patients with type 2 diabetes mellitus in Primary Care].

PubMed

Montero-Monterroso, J L; Gascón-Jiménez, J A; Vargas-Rubio, M D; Quero-Salado, C; Villalba-Marín, P; Pérula-de Torres, L A

2015-01-01

Peripheral artery disease in the lower limbs (PAD) is a prevalent condition that entails high morbidity in diabetic patients; this study assesses PAD in these patients and its associated socio-demographic and clinical variables. Descriptive study in a systematic sample of diabetic patients (DM2) aged 50-80 years, in Primary Care settings. The dependent variable was the presence of PAD diagnosed by ankle-brachial index (ABI) ≤ 0.9; independent variables: socio-demographic, clinical and laboratory. Bivariate and multiple logistic regression analyses were performed to determine the variables associated with low ABI. A sample of 251 patients, 52.6% women; mean age: 68.5 ± 8.5. A low ABI was detected in 18.3% (95% Confidence Interval (95% CI): 13.3-23.3%), with 6 subjects (2.4%) previously diagnosed as suffering PAD. Age (OR=1.07; 95% CI: 1.02-1.12) and retinopathy (OR=2.69; 95% CI: 1.06-6.81) were associated (multiple logistic regression analysis) with low ABI. The percentage of patients diagnosed with PAD is very low, although PAD prevalence is high among DM2 patients attending Primary Care clinics, especially in older patients and those with retinopathy. We emphasize the recommendation of performing the ABI test in this population at risk. Copyright © 2014 Sociedad Española de Médicos de Atención Primaria (SEMERGEN). Published by Elsevier España, S.L.U. All rights reserved.
Stoichiometry of hydrological C, N, and P losses across climate and geology: An environmental matrix approach across New Zealand primary forests

NASA Astrophysics Data System (ADS)

McGroddy, M. E.; Baisden, W. T.; Hedin, L. O.

2008-03-01

Hydrologic losses can play a key role in regulating ecosystem nutrient balances, particularly in regions where baseline nutrient cycles are not augmented by industrial deposition. We used first-order streams to integrate hydrologic losses at the watershed scale across unpolluted old-growth forests in New Zealand. We employed a matrix approach to resolve how stream water concentrations of dissolved organic carbon (DOC), organic and inorganic nitrogen (DON and DIN), and organic and inorganic phosphorus (DOP and DIP) varied as a function of landscape differences in climate and geology. We found stream water total dissolved nitrogen (TDN) to be dominated by organic forms (medians for DON, 81.3%, nitrate-N, 12.6%, and ammonium-N, 3.9%). The median stream water DOC:TDN:TDP molar ratio of 1050:21:1 favored C slightly over N and P when compared to typical temperate forest foliage ratios. Using the full set of variables in a multiple regression approach explained approximately half of the variability in DON, DOC, and TDP concentrations. Building on this approach we combined a simplified set of variables with a simple water balance model in a regression designed to predict DON export at larger spatial scales. Incorporating the effects of climate and geologic variables on nutrient exports will greatly aid the development of integrated Earth-climate biogeochemical models which are able to take into account multiple element dynamics and complex natural landscapes.
Psychosocial and cognitive factors associated with adherence to dietary and fluid restriction regimens by people on chronic haemodialysis.

PubMed

Sensky, T; Leger, C; Gilmour, S

1996-01-01

Failure by people on chronic haemodialysis to adhere adequately to dietary and fluid restrictions can have serious medical consequences. Numerous psychosocial factors possibly associated with adherence have been investigated in previous research. However, most previous studies have examined one or a few variables in isolation, and have tended to focus on sociodemographic variables not easily amenable to intervention. Much previous work has tended to ignore potential differences in adherence between male and female dialysands. Sociodemographic and psychosocial factors associated with adherence to dietary and fluid restrictions were investigated in 45 people on haemodialysis attending one renal unit, excluding those with a residual urine volume > 500 ml/day. Multiple regression analyses were used to estimate the contribution to adherence of a range of variables, including gender, age, duration of dialysis, affective disturbance, past psychiatric history, health locus of control, social adjustment and social supports. Adherence to diet (measured by predialysis serum potassium) and to fluid restriction (interdialysis weight gain) were not linked, and had different psychosocial correlates. Regression models of four different aspects of adherence revealed very distinct psychosocial correlates, with contributions to adherence from complex interactions between psychosocial and cognitive variables, notably gender, age, social adjustment, health locus of control, and depression. The findings cast doubt on the results of many previous studies which have used simple models of adherence. Adherence is likely to be influenced in a complex manner by multiple factors including age, gender, locus of control, social adjustment, and past psychiatric history.

Female Literacy Rate is a Better Predictor of Birth Rate and Infant Mortality Rate in India

PubMed Central

Saurabh, Suman; Sarkar, Sonali; Pandey, Dhruv K.

2013-01-01

Background: Educated women are known to take informed reproductive and healthcare decisions. These result in population stabilization and better infant care reflected by lower birth rates and infant mortality rates (IMRs), respectively. Materials and Methods: Our objective was to study the relationship of male and female literacy rates with crude birth rates (CBRs) and IMRs of the states and union territories (UTs) of India. The data were analyzed using linear regression. CBR and IMR were taken as the dependent variables; while the overall literacy rates, male, and female literacy rates were the independent variables. Results: CBRs were inversely related to literacy rates (slope parameter = −0.402, P < 0.001). On multiple linear regression with male and female literacy rates, a significant inverse relationship emerged between female literacy rate and CBR (slope = −0.363, P < 0.001), while male literacy rate was not significantly related to CBR (P = 0.674). IMR of the states were also inversely related to their literacy rates (slope = −1.254, P < 0.001). Multiple linear regression revealed a significant inverse relationship between IMR and female literacy (slope = −0.816, P = 0.031), whereas male literacy rate was not significantly related (P = 0.630). Conclusion: Female literacy is relatively highly important for both population stabilization and better infant health. PMID:26664840
Prediction of erodibility in Oxisols using iron oxides, soil color and diffuse reflectance spectroscopy

NASA Astrophysics Data System (ADS)

Arantes Camargo, Livia; Marques, José, Jr.

2015-04-01

The prediction of erodibility using indirect methods such as diffuse reflectance spectroscopy could facilitate the characterization of spatial variability over large areas and optimize the implementation of conservation practices. The aim of this study was to evaluate the prediction of interrill erodibility (Ki) and rill erodibility (Kr) from iron oxide content and soil color using multiple linear regression, and from diffuse reflectance spectroscopy (DRS) using partial least squares regression (PLSR). The soils were collected from three geomorphic surfaces, analyzed for chemical, physical and mineralogical properties, and scanned in the visible and infrared spectral range. Maps of the spatial distribution of Ki and Kr were built using geostatistics from the values calculated by the calibrated models that obtained the best accuracy. Interrill and rill erodibility were negatively correlated with iron extracted by dithionite-citrate-bicarbonate, hematite, and chroma, confirming the influence of iron oxides on soil structural stability. Hematite and hue were the attributes that contributed most to the multiple linear regression calibration models for the prediction of Ki (R² = 0.55) and Kr (R² = 0.53). Diffuse reflectance spectroscopy via PLSR allowed interrill and rill erodibility to be predicted with high accuracy (R²adj = 0.76 and 0.81, respectively, and RPD > 2.0) in the visible range (380-800 nm), and allowed the spatial variability of these attributes to be characterized by geostatistics.
Multiple regression based imputation for individualizing template human model from a small number of measured dimensions.

PubMed

Nohara, Ryuki; Endo, Yui; Murai, Akihiko; Takemura, Hiroshi; Kouchi, Makiko; Tada, Mitsunori

2016-08-01

Individual human models are usually created by direct 3D scanning or by deforming a template model according to the measured dimensions. In this paper, we propose a method to estimate all the necessary dimensions (full set) for human model individualization from a small number of measured dimensions (subset) and a human dimension database. For this purpose, we solved a multiple regression equation from the dimension database, with the full set dimensions as the objective variables and the subset dimensions as the explanatory variables. Thus, the full set dimensions are obtained by simply multiplying the subset dimensions by the coefficient matrix of the regression equation. We verified the accuracy of our method by imputing hand, foot, and whole body dimensions from their dimension databases. Leave-one-out cross validation was employed in this evaluation. The mean absolute errors (MAE) between the measured and the estimated dimensions were computed from 4 dimensions (hand length, breadth, middle finger breadth at proximal, and middle finger depth at proximal) in the hand, 3 dimensions (foot length, breadth, and lateral malleolus height) in the foot, and 1 dimension (height) and weight in the whole body. The average MAE of non-measured dimensions was 4.58% in the hand, 4.42% in the foot, and 3.54% in the whole body, while that of measured dimensions was 0.00%.
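The regression-based imputation and its leave-one-out evaluation described in this abstract can be sketched as below. This is a hedged illustration under stated assumptions, not the authors' implementation: all function names (`solve`, `fit_coefficients`, `impute`, `loocv_mae_percent`) are hypothetical, an intercept term is assumed (the abstract does not say whether one was used), and the demo data are synthetic. In practice a numerical library's least-squares routine would usually replace the hand-written solver.

```python
# Illustrative sketch only: synthetic numbers, hypothetical function names.

def solve(a, b):
    """Solve a @ x = b (a square, b a matrix) by Gauss-Jordan elimination."""
    n = len(a)
    aug = [list(a[i]) + list(b[i]) for i in range(n)]
    for col in range(n):
        # Partial pivoting: move the largest remaining entry onto the diagonal.
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(n):
            if r != col:
                factor = aug[r][col] / aug[col][col]
                aug[r] = [x - factor * y for x, y in zip(aug[r], aug[col])]
    width = len(b[0])
    return [[aug[i][n + j] / aug[i][i] for j in range(width)] for i in range(n)]


def fit_coefficients(subset, full):
    """Least-squares coefficient matrix mapping [1, subset dims] to full-set dims."""
    design = [[1.0] + list(row) for row in subset]  # assumed intercept column
    p, q = len(design[0]), len(full[0])
    xtx = [[sum(d[i] * d[j] for d in design) for j in range(p)] for i in range(p)]
    xty = [[sum(d[i] * y[j] for d, y in zip(design, full)) for j in range(q)]
           for i in range(p)]
    return solve(xtx, xty)  # normal equations: (X'X) B = X'Y


def impute(measured, coef):
    """Estimate the full set by multiplying measured dims by the coefficient matrix."""
    x = [1.0] + list(measured)
    return [sum(x[k] * coef[k][j] for k in range(len(x))) for j in range(len(coef[0]))]


def loocv_mae_percent(subset, full):
    """Leave-one-out mean absolute error per full-set dimension, in percent."""
    n, q = len(subset), len(full[0])
    totals = [0.0] * q
    for i in range(n):
        # Refit on every subject except i, then predict subject i.
        coef = fit_coefficients(subset[:i] + subset[i + 1:], full[:i] + full[i + 1:])
        pred = impute(subset[i], coef)
        for j in range(q):
            totals[j] += abs(pred[j] - full[i][j]) / abs(full[i][j]) * 100.0
    return [t / n for t in totals]


if __name__ == "__main__":
    # Synthetic "database": two measured dims, two dims to be imputed.
    subset = [[160.0, 55.0], [172.0, 80.0], [181.0, 68.0],
              [190.0, 95.0], [165.0, 72.0], [177.0, 60.0]]
    full = [[0.5 * h + 0.1 * w + 3.0 + 0.3 * (-1) ** i, 0.2 * h - 0.05 * w + 10.0]
            for i, (h, w) in enumerate(subset)]
    print(loocv_mae_percent(subset, full))
```

Fitting all full-set dimensions at once yields a single coefficient matrix, which is why imputation reduces to one matrix multiplication per subject.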
Comparing the index-flood and multiple-regression methods using L-moments

NASA Astrophysics Data System (ADS)

Malekinezhad, H.; Nachtnebel, H. P.; Klik, A.

In arid and semi-arid regions, the length of records is usually too short to ensure reliable quantile estimates. Comparing index-flood and multiple-regression analyses based on L-moments was the main objective of this study. Factor analysis was applied to determine the main variables influencing flood magnitude. Ward’s cluster and L-moments approaches were applied to several sites in the Namak-Lake basin in central Iran to delineate homogeneous regions based on site characteristics. A homogeneity test was performed using L-moments-based measures. Several distributions were fitted to the regional flood data, and the index-flood and multiple-regression methods were compared as two regional flood frequency methods. The results of factor analysis showed that length of main waterway, compactness coefficient, mean annual precipitation, and mean annual temperature were the main variables affecting flood magnitude. The study area was divided into three regions using Ward’s clustering method. The homogeneity test based on L-moments showed that all three regions were acceptably homogeneous. Five distributions were fitted to the annual peak flood data of the three homogeneous regions. Using the L-moment ratios and the Z-statistic criteria, the generalised extreme value (GEV) distribution was identified as the most robust of the five candidate distributions and the best fit for all three regions. The relative root mean square error (RRMSE) measure was applied to evaluate the performance of the index-flood and multiple-regression methods in comparison with the curve fitting (plotting position) method. In general, the index-flood method gives more reliable estimates for flood magnitudes of different recurrence intervals. Therefore, this method should be adopted as the regional flood frequency method for the study area and the Namak-Lake basin in central Iran. To estimate floods of various return periods for gauged catchments in the study area, the mean annual peak flood of each catchment may be multiplied by the corresponding growth factors computed using the GEV distribution.
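The final step of this abstract, scaling a catchment's mean annual peak flood by a GEV growth factor, can be sketched as follows. This is a minimal illustration under assumptions: it uses the GEV quantile function in the parameterization common in L-moment work (location ξ, scale α, shape k, with the Gumbel form when k = 0), the function names are hypothetical, and the parameter values in the demo are invented rather than taken from the study.

```python
import math


def gev_growth_factor(return_period, location, scale, shape):
    """Dimensionless regional growth factor for a T-year event.

    Assumed parameterization: x(F) = location + scale / shape * (1 - (-ln F) ** shape),
    falling back to the Gumbel form when shape is (numerically) zero.
    """
    non_exceedance = 1.0 - 1.0 / return_period  # F = 1 - 1/T
    y = -math.log(non_exceedance)
    if abs(shape) < 1e-12:
        return location - scale * math.log(y)
    return location + scale / shape * (1.0 - y ** shape)


def index_flood_quantile(mean_annual_peak, return_period, location, scale, shape):
    """T-year flood = index flood (mean annual peak) x regional growth factor."""
    return mean_annual_peak * gev_growth_factor(return_period, location, scale, shape)


if __name__ == "__main__":
    # Invented regional parameters and index flood (m^3/s), for illustration only.
    for t in (2, 10, 50, 100):
        print(t, index_flood_quantile(120.0, t, location=0.8, scale=0.4, shape=-0.1))
```

The regional parameters are fitted once per homogeneous region to the rescaled data, so only the index flood changes from catchment to catchment.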
Functional Capacity Evaluation in Different Societal Contexts: Results of a Multicountry Study.

PubMed

Ansuategui Echeita, Jone; Bethge, Matthias; van Holland, Berry J; Gross, Douglas P; Kool, Jan; Oesch, Peter; Trippolini, Maurizio A; Chapman, Elizabeth; Cheng, Andy S K; Sellars, Robert; Spavins, Megan; Streibelt, Marco; van der Wurff, Peter; Reneman, Michiel F

2018-05-25

Purpose: To examine factors associated with Functional Capacity Evaluation (FCE) results in patients with painful musculoskeletal conditions, with focus on social factors across multiple countries. Methods: An international cross-sectional study was performed within care as usual. Simple and multiple multilevel linear regression analyses, which considered the dependency of measurements within clinicians and country, were conducted with FCE characteristics and biopsychosocial variables from patients and clinicians as independent variables, and FCE results (floor-to-waist lift, six-minute walk, and handgrip strength) as dependent variables. Results: Data were collected for 372 patients, 54 clinicians, 18 facilities and 8 countries. Patients' height and reported pain intensity were consistently associated with every FCE result. Patients' sex, height, reported pain intensity, effort during FCE, social isolation, and disability, clinician's observed physical effort, and whether FCE test was prematurely ended were associated with lift. Patient's height, Body Mass Index, post-test heart-rate, reported pain intensity and effort during FCE, days off work, and whether FCE test was prematurely ended were associated with walk. Patient's age, sex, height, affected body area, reported pain intensity and catastrophizing, and physical work demands were associated with handgrip. Final regression models explained 38‒65% of total variance. Clinician and country random effects composed 1-39% of total residual variance in these models. Conclusion: Biopsychosocial factors were associated with every FCE result across multiple countries; specifically, patients' height, reported pain intensity, clinician, and measurement country. Social factors, which had been under-researched, were consistently associated with FCE performances. Patients' FCE results should be considered from a biopsychosocial perspective, including different social contexts.
Pre-Adult Background Variables and Divorce: A Note of Caution about Overreliance on Explained Variance.

ERIC Educational Resources Information Center

Glenn, Norval D.; Shelton, Beth Ann

1983-01-01

Examined multiple regression results in adjusted percentages for data from a project to assess the impact of formative experiences on adult well being. Differences from some of the adjusted percentages reveal a divorce rate higher for female children of divorce and for individuals from a three-region "divorce belt." (Author/JAC)
A Prediction Model for Community Colleges Using Graduation Rate as the Performance Indicator

ERIC Educational Resources Information Center

Moosai, Susan

2010-01-01

In this thesis a prediction model using graduation rate as the performance indicator is obtained for community colleges for three cohort years, 2003, 2004, and 2005 in the states of California, Florida, and Michigan. Multiple Regression analysis, using an aggregate of seven predictor variables, was employed in determining this prediction model.…
An investigation of the effect of seasonal activity levels on avian censusing

Treesearch

C. John Ralph

1981-01-01

Intensive variable distance circular-plot censuses and timed activity budget data were used to compare the effects of conspicuousness upon census results. In six of ten species no correlation was found, suggesting that all birds within the "Effective Detection Distance" (EDD) were seen. In four species there were significant correlations. Multiple regression...
Using the Graded Response Model to Control Spurious Interactions in Moderated Multiple Regression

ERIC Educational Resources Information Center

Morse, Brendan J.; Johanson, George A.; Griffeth, Rodger W.

2012-01-01

Recent simulation research has demonstrated that using simple raw scores to operationalize a latent construct can result in inflated Type I error rates for the interaction term of a moderated statistical model when the interaction (or lack thereof) is proposed at the latent variable level. Rescaling the scores using an appropriate item response…
Diameter and height growth of suppressed grand fir saplings after overstory removal.

Treesearch

K.W. Seidel

1980-01-01

The 2- and 5-year diameter and height growth of suppressed grand fir (Abies grandis (Dougl. ex D. Don) Lindl.) advance reproduction was measured in central Oregon after the overstory was removed. Multiple regression analyses were used to predict growth response as a function of individual tree variables. The resulting equations, although highly...
A multiple regression model for parasitization of gypsy moths by the introduced larval parasite Cotesia melanoscelus (Hymenoptera: Braconidae)

Treesearch

Roger W. Fuester

1991-01-01

Cotesia melanoscelus (Ratzeburg) is a bivoltine, solitary, endoparasite of larvae of the gypsy moth, Lymantria dispar (L.). Imported from Europe after the turn of the century, it readily became established and now occurs throughout the generally infested area. Rates of parasitization are highly variable, particularly during the...
Four Dimensions of Student Leadership: What Predicts Students' Attitudes toward Leadership Development?

ERIC Educational Resources Information Center

Shertzer, John; Wall, Vernon; Frandsen, Alisa; Guo, Yan; Whalen, Donald F.; Shelley, Mack C., II

2005-01-01

Multiple regression was performed on four dependent variables derived from the results of a student survey measuring attitudes about student leadership: (a) leadership is important to the student, (b) the student considers himself or herself to be a leader, (c) leadership will be important to the student after college, and (d) leaders need to be…
Site conditions related to erosion on logging roads

Treesearch

R. M. Rice; J. D. McCashion

1985-01-01

Synopsis - Data collected from 299 road segments in northwestern California were used to develop and test a procedure for estimating and managing road-related erosion. Site conditions and the design of each segment were described by 30 variables. Equations developed using 149 of the road segments were tested on the other 150. The best multiple regression equation...
A Mixed-Methods Study Investigating the Relationship between Media Multitasking Orientation and Grade Point Average

ERIC Educational Resources Information Center

Lee, Jennifer

2012-01-01

The intent of this study was to examine the relationship between media multitasking orientation and grade point average. The study utilized a mixed-methods approach to investigate the research questions. In the quantitative section of the study, the primary method of statistical analyses was multiple regression. The independent variables for the…
School-Related Variables in the Dimensions of Anger in High School Students in Turkey

ERIC Educational Resources Information Center

Siyez, Digdem M.

2018-01-01

The study aimed to examine the effects of perceived social support from teachers, expectation of academic achievement, school control, and gender on anger dimensions in high school students in Izmir, Turkey. In total, 446 high school students (234 girls, 212 boys) participated in the study. Pearson's correlation and multiple regression analyses…
Methods for Improving Information from "Undesigned" Human Factors Experiments. Technical Report No. p75-287.

ERIC Educational Resources Information Center

Simon, Charles W.

An "undesigned" experiment is one in which the predictor variables are correlated, either due to a failure to complete a design or because the investigator was unable to select or control relevant experimental conditions. The traditional method of analyzing this class of experiment--multiple regression analysis based on a least squares…
Analyzing the Gender Gap in Math Achievement: Evidence from a Large-Scale US Sample

ERIC Educational Resources Information Center

Cheema, Jehanzeb R.; Galluzzo, Gary

2013-01-01

The US portion of the Program for International Student Assessment (PISA) 2003 student questionnaire comprising 4,733 observations was used in a multiple regression framework to predict math achievement from demographic variables, such as gender, race, and socioeconomic status, and two student-specific measures of perception, math anxiety and…
Multi scale habitat relationships of Martes americana in northern Idaho, U.S.A.

Treesearch

Tzeidle N. Wasserman; Samuel A. Cushman; David O. Wallin; Jim Hayden

2012-01-01

We used bivariate scaling and logistic regression to investigate multiple-scale habitat selection by American marten (Martes americana). Bivariate scaling reveals dramatic differences in the apparent nature and strength of relationships between marten occupancy and a number of habitat variables across a range of spatial scales. These differences include reversals in...
Computation of Effect Size for Moderating Effects of Categorical Variables in Multiple Regression

ERIC Educational Resources Information Center

Aguinis, Herman; Pierce, Charles A.

2006-01-01

The computation and reporting of effect size estimates is becoming the norm in many journals in psychology and related disciplines. Despite the increased importance of effect sizes, researchers may not report them or may report inaccurate values because of a lack of appropriate computational tools. For instance, Pierce, Block, and Aguinis (2004)…
Ecological and Topographic Features of Volcanic Ash-Influenced Forest Soils

Treesearch

Mark Kimsey; Brian Gardner; Alan Busacca

2007-01-01

Volcanic ash distribution and thickness were determined for a forested region of north-central Idaho. Mean ash thickness and multiple linear regression analyses were used to model the effect of environmental variables on ash thickness. Slope and slope curvature relationships with volcanic ash thickness varied on a local spatial scale across the study area. Ash...

The Impact of Managerial Coaching on Learning Outcomes within the Team Context: An Analysis

ERIC Educational Resources Information Center

Hagen, Marcia; Aguilar, Mariya Gavrilova

2012-01-01

This study investigates the relationship between coaching expertise, project difficulty, and team empowerment on team learning outcomes within the context of a high-performance work team. Variables were tested using multiple regression analysis. The data were analyzed for two groups--team leaders and team members--using t-tests, factor analysis,…
Advances in Testing the Statistical Significance of Mediation Effects

ERIC Educational Resources Information Center

Mallinckrodt, Brent; Abraham, W. Todd; Wei, Meifen; Russell, Daniel W.

2006-01-01

P. A. Frazier, A. P. Tix, and K. E. Barron (2004) highlighted a normal theory method popularized by R. M. Baron and D. A. Kenny (1986) for testing the statistical significance of indirect effects (i.e., mediator variables) in multiple regression contexts. However, simulation studies suggest that this method lacks statistical power relative to some…
Novel associations between contaminant body burdens and biomarkers of reproductive condition in male Common Carp along multiple gradients of contaminant exposure in Lake Mead National Recreation Area, USA

USGS Publications Warehouse

Patino, Reynaldo; VanLandeghem, Matthew M.; Goodbred, Steven L.; Orsak, Erik; Jenkins, Jill A.; Echols, Kathy R.; Rosen, Michael R.; Torres, Leticia

2015-01-01

Adult male Common Carp were sampled in 2007/08 over a full reproductive cycle at Lake Mead National Recreation Area. Sites sampled included a stream dominated by treated wastewater effluent, a lake basin receiving the streamflow, an upstream lake basin (reference), and a site below Hoover Dam. Individual body burdens for 252 contaminants were measured, and biological variables assessed included physiological [plasma vitellogenin (VTG), estradiol-17β (E2), 11-ketotestosterone (11KT)] and organ [gonadosomatic index (GSI)] endpoints. Patterns in contaminant composition and biological condition were determined by Principal Component Analysis, and their associations modeled by Principal Component Regression. Three spatially distinct but temporally stable gradients of contaminant distribution were recognized: a contaminant mixture typical of wastewaters (PBDEs, methyl triclosan, galaxolide), PCBs, and DDTs. Two spatiotemporally variable patterns of biological condition were recognized: a primary pattern consisting of reproductive condition variables (11KT, E2, GSI), and a secondary pattern including general condition traits (condition factor, hematocrit, fork length). VTG was low in all fish, indicating low estrogenic activity of water at all sites. Wastewater contaminants associated negatively with GSI, 11KT and E2; PCBs associated negatively with GSI and 11KT; and DDTs associated positively with GSI and 11KT. Regression of GSI on sex steroids revealed a novel, nonlinear association between these variables. Inclusion of sex steroids in the GSI regression on contaminants rendered wastewater contaminants nonsignificant in the model and reduced the influence of PCBs and DDTs. Thus, the influence of contaminants on GSI may have been partially driven by organismal modes-of-action that include changes in sex steroid production. 
The positive association of DDTs with 11KT and GSI suggests that lifetime, sub-lethal exposures to DDTs have effects on male carp opposite of those reported by studies where exposure concentrations were relatively high. Lastly, this study highlighted advantages of multivariate/multiple regression approaches for exploring associations between complex contaminant mixtures and gradients and reproductive condition in wild fishes.
Novel associations between contaminant body burdens and biomarkers of reproductive condition in male Common Carp along multiple gradients of contaminant exposure in Lake Mead National Recreation Area, USA.

PubMed

Patiño, Reynaldo; VanLandeghem, Matthew M; Goodbred, Steven L; Orsak, Erik; Jenkins, Jill A; Echols, Kathy; Rosen, Michael R; Torres, Leticia

2015-08-01

Adult male Common Carp were sampled in 2007/08 over a full reproductive cycle at Lake Mead National Recreation Area. Sites sampled included a stream dominated by treated wastewater effluent, a lake basin receiving the streamflow, an upstream lake basin (reference), and a site below Hoover Dam. Individual body burdens for 252 contaminants were measured, and biological variables assessed included physiological [plasma vitellogenin (VTG), estradiol-17β (E2), 11-ketotestosterone (11KT)] and organ [gonadosomatic index (GSI)] endpoints. Patterns in contaminant composition and biological condition were determined by Principal Component Analysis, and their associations modeled by Principal Component Regression. Three spatially distinct but temporally stable gradients of contaminant distribution were recognized: a contaminant mixture typical of wastewaters (PBDEs, methyl triclosan, galaxolide), PCBs, and DDTs. Two spatiotemporally variable patterns of biological condition were recognized: a primary pattern consisting of reproductive condition variables (11KT, E2, GSI), and a secondary pattern including general condition traits (condition factor, hematocrit, fork length). VTG was low in all fish, indicating low estrogenic activity of water at all sites. Wastewater contaminants associated negatively with GSI, 11KT and E2; PCBs associated negatively with GSI and 11KT; and DDTs associated positively with GSI and 11KT. Regression of GSI on sex steroids revealed a novel, nonlinear association between these variables. Inclusion of sex steroids in the GSI regression on contaminants rendered wastewater contaminants nonsignificant in the model and reduced the influence of PCBs and DDTs. Thus, the influence of contaminants on GSI may have been partially driven by organismal modes-of-action that include changes in sex steroid production. 
The positive association of DDTs with 11KT and GSI suggests that lifetime, sub-lethal exposures to DDTs have effects on male carp opposite of those reported by studies where exposure concentrations were relatively high. Lastly, this study highlighted advantages of multivariate/multiple regression approaches for exploring associations between complex contaminant mixtures and gradients and reproductive condition in wild fishes. Published by Elsevier Inc.
[Prevalence of vitamin D deficiency and associated factors in women and newborns in the immediate postpartum period].

PubMed

do Prado, Mara Rúbia Maciel Cardoso; Oliveira, Fabiana de Cássia Carvalho; Assis, Karine Franklin; Ribeiro, Sarah Aparecida Vieira; do Prado Junior, Pedro Paulo; Sant'Ana, Luciana Ferreira da Rocha; Priore, Silvia Eloiza; Franceschini, Sylvia do Carmo Castro

2015-01-01

To assess the prevalence of vitamin D deficiency and its associated factors in women and their newborns in the postpartum period. This cross-sectional study evaluated vitamin D deficiency/insufficiency in 226 women and their newborns in Viçosa (Minas Gerais, Brazil) between December 2011 and November 2012. Cord blood and venous maternal blood were collected to evaluate the following biochemical parameters: vitamin D, alkaline phosphatase, calcium, phosphorus and parathyroid hormone. Poisson regression analysis, with a confidence interval of 95%, was applied to assess vitamin D deficiency and its associated factors. Multiple linear regression analysis was performed to identify factors associated with 25(OH)D deficiency in the newborns and women from the study. The criterion for variable inclusion in the multiple linear regression model was association with the dependent variable in the simple linear regression analysis at p<0.20. The significance level was set at 5%. Of the 226 women included, 200 (88.5%) were 20 to 44 years old; the median age was 28 years. Deficient/insufficient levels of vitamin D were found in 192 (85%) women and in 182 (80.5%) neonates. The maternal 25(OH)D and alkaline phosphatase levels were independently associated with vitamin D deficiency in infants. This study identified a high prevalence of vitamin D deficiency and insufficiency in women and newborns and the association between maternal nutritional status of vitamin D and their infants' vitamin D status. Copyright © 2015 Sociedade de Pediatria de São Paulo. Published by Elsevier Editora Ltda. All rights reserved.
Alterations of papilla dimensions after orthodontic closure of the maxillary midline diastema: a retrospective longitudinal study.

PubMed

Jeong, Jin-Seok; Lee, Seung-Youp; Chang, Moontaek

2016-06-01

The aim of this study was to evaluate alterations of papilla dimensions after orthodontic closure of the diastema between maxillary central incisors. Sixty patients who had a visible diastema between maxillary central incisors that had been closed by orthodontic approximation were selected for this study. Various papilla dimensions were assessed on clinical photographs and study models before the orthodontic treatment and at the follow-up examination after closure of the diastema. Influences of the variables assessed before orthodontic treatment on the alterations of papilla height (PH) and papilla base thickness (PBT) were evaluated by univariate regression analysis. To analyze potential influences of the 3-dimensional papilla dimensions before orthodontic treatment on the alterations of PH and PBT, a multiple regression model was formulated including the 3-dimensional papilla dimensions as predictor variables. On average, PH decreased by 0.80 mm and PBT increased after orthodontic closure of the diastema (P<0.01). Univariate regression analysis revealed that the PH (P=0.002) and PBT (P=0.047) before orthodontic treatment influenced the alteration of PH. With respect to the alteration of PBT, the diastema width (P=0.045) and PBT (P<0.001) were found to be influential factors. PBT before the orthodontic treatment significantly influenced the alteration of PBT in the multiple regression model. PH decreased but PBT increased after orthodontic closure of the diastema. The papilla dimensions before orthodontic treatment influenced the alterations of PH and PBT after closure of the diastema. The PBT increased more when the diastema width before the orthodontic treatment was larger.
Estimation of aboveground biomass in Mediterranean forests by statistical modelling of ASTER fraction images

NASA Astrophysics Data System (ADS)

Fernández-Manso, O.; Fernández-Manso, A.; Quintano, C.

2014-09-01

Aboveground biomass (AGB) estimation from optical satellite data is usually based on regression models of original or synthetic bands. To overcome the poor relation between AGB and spectral bands due to mixed pixels when a medium spatial resolution sensor is considered, we propose to base the AGB estimation on fraction images from Linear Spectral Mixture Analysis (LSMA). Our study area is a managed Mediterranean pine woodland (Pinus pinaster Ait.) in central Spain. A total of 1033 circular field plots were used to estimate AGB from Advanced Spaceborne Thermal Emission and Reflection Radiometer (ASTER) optical data. We applied Pearson correlation statistics and stepwise multiple regression to identify suitable predictors from the set of variables of original bands, fraction imagery, Normalized Difference Vegetation Index and Tasselled Cap components. Four linear models and one nonlinear model were tested. A linear combination of ASTER band 2 (red, 0.630-0.690 μm), band 8 (short wave infrared 5, 2.295-2.365 μm) and green vegetation fraction (from LSMA) was the best AGB predictor (adjusted R² = 0.632; root-mean-squared error of estimated AGB from cross-validation 13.3 Mg ha⁻¹, or 37.7%), outperforming other combinations of the above-cited independent variables. Results indicated that using ASTER fraction images in regression models improves the AGB estimation in Mediterranean pine forests. The spatial distribution of the estimated AGB, based on a multiple linear regression model, may be used as baseline information for forest managers in future studies, such as quantifying the regional carbon budget, fuel accumulation or monitoring of management practices.
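As a hedged illustration of the fit statistics quoted in the abstract above, the sketch below shows how an adjusted R² and a relative RMSE are conventionally related to their raw counterparts. It is not the authors' code. The function names are hypothetical, the unadjusted R² of 0.633 is an invented input (the abstract does not report it), and the sketch assumes that all 1033 plots entered the three-predictor model and that the percentage RMSE is expressed relative to the mean observed AGB.

```python
def adjusted_r2(r2: float, n_obs: int, n_predictors: int) -> float:
    """Standard adjusted R^2: penalises R^2 for the number of predictors."""
    if n_obs - n_predictors - 1 <= 0:
        raise ValueError("need more observations than predictors + 1")
    return 1.0 - (1.0 - r2) * (n_obs - 1) / (n_obs - n_predictors - 1)


def implied_mean(rmse: float, relative_rmse_pct: float) -> float:
    """Mean value implied by an absolute RMSE and its percentage form,
    assuming the percentage is RMSE divided by the mean."""
    return rmse / (relative_rmse_pct / 100.0)


# 1033 plots and 3 predictors come from the abstract; r2=0.633 is hypothetical.
adj = adjusted_r2(0.633, n_obs=1033, n_predictors=3)

# 13.3 Mg/ha and 37.7% come from the abstract; the mean is only implied.
mean_agb = implied_mean(13.3, 37.7)
```

With a sample this large and only three predictors, the adjustment barely moves R², so the reported adjusted value is close to what an unadjusted R² would be under these assumptions.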
Multivariate classification of small order watersheds in the Quabbin Reservoir Basin, Massachusetts

USGS Publications Warehouse

Lent, R.M.; Waldron, M.C.; Rader, J.C.

1998-01-01

A multivariate approach was used to analyze hydrologic, geologic, geographic, and water-chemistry data from small order watersheds in the Quabbin Reservoir Basin in central Massachusetts. Eighty three small order watersheds were delineated and landscape attributes defining hydrologic, geologic, and geographic features of the watersheds were compiled from geographic information system data layers. Principal components analysis was used to evaluate 11 chemical constituents collected bi-weekly for 1 year at 15 surface-water stations in order to subdivide the basin into subbasins comprised of watersheds with similar water quality characteristics. Three principal components accounted for about 90 percent of the variance in water chemistry data. The principal components were defined as a biogeochemical variable related to wetland density, an acid-neutralization variable, and a road-salt variable related to density of primary roads. Three subbasins were identified. Analysis of variance and multiple comparisons of means were used to identify significant differences in stream water chemistry and landscape attributes among subbasins. All stream water constituents were significantly different among subbasins. Multiple regression techniques were used to relate stream water chemistry to landscape attributes. Important differences in landscape attributes were related to wetlands, slope, and soil type.
A quantitative study of factors influencing quality of life in rural Mexican women diagnosed with HIV.

PubMed

Holtz, Carol; Sowell, Richard; VanBrackle, Lewis; Velasquez, Gabriela; Hernandez-Alonso, Virginia

2014-01-01

This quantitative study explored the level of Quality of Life (QoL) in indigenous Mexican women and identified psychosocial factors that significantly influenced their QoL, using face-to-face interviews with 101 women accessing care in an HIV clinic in Oaxaca, Mexico. Variables included demographic characteristics, levels of depression, coping style, family functioning, HIV-related beliefs, and QoL. Descriptive statistics were used to analyze participant characteristics, and women's scores on data collection instruments. Pearson's R correlational statistics were used to determine the level of significance between study variables. Multiple regression analysis examined all variables that were significantly related to QoL. Pearson's correlational analysis of relationships between Spirituality, Educating Self about HIV, Family Functioning, Emotional Support, Physical Care, and Staying Positive demonstrated positive correlation to QoL. Stigma, depression, and avoidance coping were significantly and negatively associated with QoL. The final regression model indicated that depression and avoidance coping were the best predictor variables for QoL. Copyright © 2014 Association of Nurses in AIDS Care. Published by Elsevier Inc. All rights reserved.
Predicting recreational water quality advisories: A comparison of statistical methods

USGS Publications Warehouse

Brooks, Wesley R.; Corsi, Steven R.; Fienen, Michael N.; Carvin, Rebecca B.

2016-01-01

Epidemiological studies indicate that fecal indicator bacteria (FIB) in beach water are associated with illnesses among people having contact with the water. In order to mitigate public health impacts, many beaches are posted with an advisory when the concentration of FIB exceeds a beach action value. The most commonly used method of measuring FIB concentration takes 18–24 h before returning a result. In order to avoid the 24 h lag, it has become common to "nowcast" the FIB concentration using statistical regressions on environmental surrogate variables. Most commonly, nowcast models are estimated using ordinary least squares regression, but other regression methods from the statistical and machine learning literature are sometimes used. This study compares 14 regression methods across 7 Wisconsin beaches to identify which consistently produces the most accurate predictions. A random forest model is identified as the most accurate, followed by multiple regression fit using the adaptive LASSO.
Prediction of monthly rainfall in Victoria, Australia: Clusterwise linear regression approach

NASA Astrophysics Data System (ADS)

Bagirov, Adil M.; Mahmood, Arshad; Barton, Andrew

2017-05-01

This paper develops the Clusterwise Linear Regression (CLR) technique for prediction of monthly rainfall. The CLR is a combination of clustering and regression techniques. It is formulated as an optimization problem and an incremental algorithm is designed to solve it. The algorithm is applied to predict monthly rainfall in Victoria, Australia using rainfall data with five input meteorological variables over the period of 1889-2014 from eight geographically diverse weather stations. The prediction performance of the CLR method is evaluated by comparing observed and predicted rainfall values using four measures of forecast accuracy. The proposed method is also compared with the CLR using the maximum likelihood framework by the expectation-maximization algorithm, multiple linear regression, artificial neural networks and the support vector machines for regression models using computational results. The results demonstrate that the proposed algorithm outperforms other methods in most locations.
Accounting for measurement error in log regression models with applications to accelerated testing.

PubMed

Richardson, Robert; Tolley, H Dennis; Evenson, William E; Lunt, Barry M

2018-01-01

In regression settings, parameter estimates will be biased when the explanatory variables are measured with error. This bias can significantly affect modeling goals. In particular, accelerated lifetime testing involves an extrapolation of the fitted model, and a small amount of bias in parameter estimates may result in a significant increase in the bias of the extrapolated predictions. Additionally, bias may arise when the stochastic component of a log regression model is assumed to be multiplicative when the actual underlying stochastic component is additive. To account for these possible sources of bias, a log regression model with measurement error and additive error is approximated by a weighted regression model which can be estimated using Iteratively Re-weighted Least Squares. Using the reduced Eyring equation in an accelerated testing setting, the model is compared to previously accepted approaches to modeling accelerated testing data with both simulations and real data.
Estimating verbal fluency and naming ability from the test of premorbid functioning and demographic variables: Regression equations derived from a regional UK sample.

PubMed

Jenkinson, Toni-Marie; Muncer, Steven; Wheeler, Miranda; Brechin, Don; Evans, Stephen

2018-06-01

Neuropsychological assessment requires accurate estimation of an individual's premorbid cognitive abilities. Oral word reading tests, such as the test of premorbid functioning (TOPF), and demographic variables, such as age, sex, and level of education, provide a reasonable indication of premorbid intelligence, but their ability to predict other related cognitive abilities is less well understood. This study aimed to develop regression equations, based on the TOPF and demographic variables, to predict scores on tests of verbal fluency and naming ability. A sample of 119 healthy adults provided demographic information and were tested using the TOPF, FAS, animal naming test (ANT), and graded naming test (GNT). Multiple regression analyses, using the TOPF and demographics as predictor variables, were used to estimate verbal fluency and naming ability test scores. Change scores and cases of significant impairment were calculated for two clinical samples with diagnosed neurological conditions (TBI and meningioma) using the method in Knight, McMahon, Green, and Skeaff (). Demographic variables provided a significant contribution to the prediction of all verbal fluency and naming ability test scores; however, adding TOPF score to the equation considerably improved prediction beyond that afforded by demographic variables alone. The percentage of variance accounted for by demographic variables and/or TOPF score ranged from 19 per cent (FAS) to 28 per cent (ANT) and 41 per cent (GNT). Change scores revealed significant differences in performance in the clinical groups, particularly the TBI group. Demographic variables, particularly education level, and scores on the TOPF should be taken into consideration when interpreting performance on tests of verbal fluency and naming ability. © 2017 The British Psychological Society.
Techniques for estimating flood-peak discharges of rural, unregulated streams in Ohio

USGS Publications Warehouse

Koltun, G.F.; Roberts, J.W.

1990-01-01

Multiple-regression equations are presented for estimating flood-peak discharges having recurrence intervals of 2, 5, 10, 25, 50, and 100 years at ungaged sites on rural, unregulated streams in Ohio. The average standard errors of prediction for the equations range from 33.4% to 41.4%. Peak discharge estimates determined by log-Pearson Type III analysis using data collected through the 1987 water year are reported for 275 streamflow-gaging stations. Ordinary least-squares multiple-regression techniques were used to divide the State into three regions and to identify a set of basin characteristics that help explain station-to-station variation in the log-Pearson estimates. Contributing drainage area, main-channel slope, and storage area were identified as suitable explanatory variables. Generalized least-squares procedures, which include historical flow data and account for differences in the variance of flows at different gaging stations, spatial correlation among gaging station records, and variable lengths of station record, were used to estimate the regression parameters. Weighted peak-discharge estimates computed as a function of the log-Pearson Type III and regression estimates are reported for each station. A method is provided to adjust regression estimates for ungaged sites by use of weighted and regression estimates for a gaged site located on the same stream. Limitations and shortcomings cited in an earlier report on the magnitude and frequency of floods in Ohio are addressed in this study. Geographic bias is no longer evident for the Maumee River basin of northwestern Ohio. No bias is found to be associated with the forested-area characteristic for the range used in the regression analysis (0.0 to 99.0%), nor is this characteristic significant in explaining peak discharges. Surface-mined area likewise is not significant in explaining peak discharges, and the regression equations are not biased when applied to basins having approximately 30% or less surface-mined area. Analyses of residuals indicate that the equations tend to overestimate flood-peak discharges for basins having approximately 30% or more surface-mined area. (USGS)
Carotid artery intima-media complex thickening in patients with relatively long-surviving type 1 diabetes mellitus.

PubMed

Distiller, Larry A; Joffe, Barry I; Melville, Vanessa; Welman, Tania; Distiller, Greg B

2006-01-01

The factors responsible for premature coronary atherosclerosis in patients with type 1 diabetes are ill defined. We therefore assessed carotid intima-media complex thickness (IMT) in relatively long-surviving patients with type 1 diabetes as a marker of atherosclerosis and correlated this with traditional risk factors. Cross-sectional study of 148 patients with relatively long-surviving (>18 years) type 1 diabetes (76 men and 72 women) attending the Centre for Diabetes and Endocrinology, Johannesburg. The mean common carotid artery IMT and presence or absence of plaque was evaluated by high-resolution B-mode ultrasound. Their median age was 48 years and duration of diabetes 26 years (range 18-59 years). Traditional risk factors (age, duration of diabetes, glycemic control, hypertension, smoking and lipoprotein concentrations) were recorded. Three response variables were defined and modeled. Standard multiple regression was used for a continuous IMT variable, logistic regression for the presence/absence of plaque and ordinal logistic regression to model three categories of "risk." The median common carotid IMT was 0.62 mm (range 0.44-1.23 mm) with plaque detected in 28 cases. The multiple regression model found significant associations between IMT and current age (P=.001), duration of diabetes (P=.033), BMI (P=.008) and diagnosed hypertension (P=.046) with HDL showing a protective effect (P=.022). Current age (P=.001) and diagnosed hypertension (P=.004), smoking (P=.008) and retinopathy (P=.033) were significant in the logistic regression model. Current age was also significant in the ordinal logistic regression model (P<.001), as was total cholesterol/HDL ratio (P<.001) and mean HbA(1c) concentration (P=.073). 
The major factors influencing common carotid IMT in patients with relatively long-surviving type 1 diabetes are age, duration of diabetes, existing hypertension and HDL (protective) with a relatively minor role ascribed to relatively long-standing glycemic control.
The association between insured male expatriates' knowledge of health insurance benefits and lack of access to health care in Saudi Arabia.

PubMed

Alkhamis, Abdulwahab A

2018-03-15

Insufficient knowledge of health insurance benefits could be associated with lack of access to health care, particularly for minority populations. This study aims to assess the association between expatriates' knowledge of health insurance benefits and lack of access to health care. A cross-sectional study design was conducted from March 2015 to February 2016 among 3398 insured male expatriates in Riyadh, Saudi Arabia. The dependent variable was binary and expresses access or lack of access to health care. Independent variables included perceived and validated knowledge of health insurance benefits and other variables. Data were summarized by computing frequencies and percentage of all quantities of variables. To evaluate variations in knowledge, personal and job characteristics with lack of access to health care, the Chi square test was used. Odds ratio (OR) and 95% confidence interval (CI) were recorded for each independent variable. Multiple logistic regression and stepwise logistic regression were performed and adjusted ORs were extracted. Descriptive analysis showed that 15% of participants lacked access to health care. The majority of these were unskilled laborers, usually with no education (17.5%), who had been working for less than 3 years (28.1%) in Saudi Arabia. A total of 23.3% worked for companies with less than 50 employees and 16.5% earned less than 4500 Saudi Riyals monthly ($1200). Many (20.3%) were young (< 30 years old) or older (17.9% ≥ 56 years old) and had no formal education (24.7%). Nearly half had fair or poor health status (49.5%), were uncomfortable conversing in Arabic (29.7%) or English (16.7%) and lacked previous knowledge of health insurance (18%). For perceived knowledge of health insurance, 55.2% scored 1 or 0 from total of 3. For validated knowledge, 16.9% scored 1 or 0 from total score of 4. 
Multiple logistic regression analysis showed that only perceived knowledge of health insurance was significantly associated with lack of access to health care (OR = 0.393, CI = 0.335-0.461); the association with validated knowledge was not significant. Stepwise logistic regression gave similar findings. Our results confirmed that low perceived knowledge of health insurance among expatriates was associated with less access to health care.
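For readers less familiar with how an adjusted odds ratio relates to a logistic regression coefficient, the following minimal sketch converts between the two. It is an illustration only, not the study's analysis: the function names are hypothetical, and it assumes the reported interval is a 95% Wald interval (the abstract states that 95% confidence intervals were recorded, but not how they were computed).

```python
import math


def odds_ratio_ci(beta: float, se: float, z: float = 1.96):
    """Convert a logistic-regression coefficient and its standard error
    into an odds ratio with a Wald confidence interval."""
    return (math.exp(beta), math.exp(beta - z * se), math.exp(beta + z * se))


def coef_from_or(or_value: float, lower: float, upper: float, z: float = 1.96):
    """Back out the log-odds coefficient and standard error implied by a
    reported odds ratio and its Wald interval."""
    beta = math.log(or_value)
    se = (math.log(upper) - math.log(lower)) / (2.0 * z)
    return beta, se


# Reported values for perceived knowledge: OR = 0.393, CI = 0.335-0.461.
beta, se = coef_from_or(0.393, 0.335, 0.461)

# Round trip: the recovered interval should match the reported one closely
# if the Wald assumption holds.
or_value, lower, upper = odds_ratio_ci(beta, se)
```

An odds ratio below 1 corresponds to a negative coefficient on the log-odds scale, which is why higher perceived knowledge is read as lower odds of lacking access.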
Hierarchical Bayesian spatial models for predicting multiple forest variables using waveform LiDAR, hyperspectral imagery, and large inventory datasets

USGS Publications Warehouse

Finley, Andrew O.; Banerjee, Sudipto; Cook, Bruce D.; Bradford, John B.

2013-01-01

In this paper we detail a multivariate spatial regression model that couples LiDAR, hyperspectral and forest inventory data to predict forest outcome variables at a high spatial resolution. The proposed model is used to analyze forest inventory data collected on the US Forest Service Penobscot Experimental Forest (PEF), ME, USA. In addition to helping meet the regression model's assumptions, results from the PEF analysis suggest that the addition of multivariate spatial random effects improves model fit and predictive ability, compared with two commonly applied modeling approaches. This improvement results from explicitly modeling the covariation among forest outcome variables and spatial dependence among observations through the random effects. Direct application of such multivariate models to even moderately large datasets is often computationally infeasible because of cubic order matrix algorithms involved in estimation. We apply a spatial dimension reduction technique to help overcome this computational hurdle without sacrificing richness in modeling.
Sources of variability in satellite-derived estimates of phytoplankton production in the eastern tropical Pacific

NASA Technical Reports Server (NTRS)

Banse, Karl; Yong, Marina

1990-01-01

As a proxy for satellite CZCS observations and concurrent measurements of primary production rates, data from 138 stations occupied seasonally during 1967-1968 in the offshore eastern tropical Pacific were analyzed in terms of six temporal groups and four current regimes. Multiple linear regressions on column production Pt show that simulated satellite pigment is generally weakly correlated, but sometimes not correlated, with Pt, and that incident irradiance, sea surface temperature, nitrate, transparency, and depths of mixed layer or nitracline assume little or no importance. After a proxy for the light-saturated chlorophyll-specific photosynthetic rate P(max) is added, the coefficient of determination ranges from 0.55 to 0.91 (median of 0.85) for the 10 cases. In stepwise multiple linear regressions the P(max) proxy is the best predictor for Pt.
The relationship between quality of work life and turnover intention of primary health care nurses in Saudi Arabia.

PubMed

Almalki, Mohammed J; FitzGerald, Gerry; Clark, Michele

2012-09-12

Quality of work life (QWL) has been found to influence the commitment of health professionals, including nurses. However, reliable information on QWL and turnover intention of primary health care (PHC) nurses is limited. The aim of this study was to examine the relationship between QWL and turnover intention of PHC nurses in Saudi Arabia. A cross-sectional survey was used in this study. Data were collected using Brooks' survey of Quality of Nursing Work Life, the Anticipated Turnover Scale and demographic data questions. A total of 508 PHC nurses in the Jazan Region, Saudi Arabia, completed the questionnaire (RR = 87%). Descriptive statistics, t-test, ANOVA, General Linear Model (GLM) univariate analysis, standard multiple regression, and hierarchical multiple regression were applied for analysis using SPSS v17 for Windows. Findings suggested that the respondents were dissatisfied with their work life, with almost 40% indicating a turnover intention from their current PHC centres. Turnover intention was significantly related to QWL. Using standard multiple regression, 26% of the variance in turnover intention was explained by QWL, p < 0.001, with R2 = .263. Further analysis using hierarchical multiple regression found that the total variance explained by the model as a whole (demographics and QWL) was 32.1%, p < 0.001. QWL explained an additional 19% of the variance in turnover intention, after controlling for demographic variables. Creating and maintaining a healthy work life for PHC nurses is very important to improve their work satisfaction, reduce turnover, enhance productivity and improve nursing care outcomes.
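The variance figures in the abstract above follow the usual logic of hierarchical (blockwise) regression: a first block of control variables is entered, a second block is added, and the increase in R² is attributed to the second block. The sketch below works through that arithmetic. It is an assumption-laden illustration, not the authors' SPSS output: the helper name is hypothetical, and the predictor counts (`k_full=9`, `k_added=1`) are invented because the abstract does not say how many demographic variables or QWL terms were entered.

```python
def r2_change_f(r2_reduced: float, r2_full: float,
                n_obs: int, k_full: int, k_added: int):
    """R^2 change and its F statistic when k_added predictors are added
    to a reduced model, giving a full model with k_full predictors."""
    delta = r2_full - r2_reduced
    df_resid = n_obs - k_full - 1
    f_stat = (delta / k_added) / ((1.0 - r2_full) / df_resid)
    return delta, f_stat, (k_added, df_resid)


# From the abstract: full model R^2 = 0.321, QWL adds 0.19, n = 508.
r2_full = 0.321
delta_r2 = 0.19
r2_demographics = r2_full - delta_r2  # share implied for demographics alone

# k_full and k_added are hypothetical placeholders, not reported values.
delta, f_stat, dfs = r2_change_f(r2_demographics, r2_full,
                                 n_obs=508, k_full=9, k_added=1)
```

Under these figures the demographic block alone would account for roughly 13% of the variance, which is why QWL explains less in the hierarchical model (19%) than when entered on its own in the standard model (26%).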
The relationship between quality of work life and turnover intention of primary health care nurses in Saudi Arabia

PubMed Central

2012-01-01

Background: Quality of work life (QWL) has been found to influence the commitment of health professionals, including nurses. However, reliable information on QWL and turnover intention of primary health care (PHC) nurses is limited. The aim of this study was to examine the relationship between QWL and turnover intention of PHC nurses in Saudi Arabia. Methods: A cross-sectional survey was used in this study. Data were collected using Brooks’ survey of Quality of Nursing Work Life, the Anticipated Turnover Scale and demographic data questions. A total of 508 PHC nurses in the Jazan Region, Saudi Arabia, completed the questionnaire (RR = 87%). Descriptive statistics, t-test, ANOVA, General Linear Model (GLM) univariate analysis, standard multiple regression, and hierarchical multiple regression were applied for analysis using SPSS v17 for Windows. Results: Findings suggested that the respondents were dissatisfied with their work life, with almost 40% indicating a turnover intention from their current PHC centres. Turnover intention was significantly related to QWL. Using standard multiple regression, 26% of the variance in turnover intention was explained by QWL, p < 0.001, with R2 = .263. Further analysis using hierarchical multiple regression found that the total variance explained by the model as a whole (demographics and QWL) was 32.1%, p < 0.001. QWL explained an additional 19% of the variance in turnover intention, after controlling for demographic variables. Conclusions: Creating and maintaining a healthy work life for PHC nurses is very important to improve their work satisfaction, reduce turnover, enhance productivity and improve nursing care outcomes. PMID:22970764

[Association between hours of television watched, physical activity, sleep and excess weight among young adults].

PubMed

Martínez-Moyá, María; Navarrete-Muñoz, Eva M; García de la Hera, Manuela; Giménez-Monzo, Daniel; González-Palacios, Sandra; Valera-Gran, Desirée; Sempere-Orts, María; Vioque, Jesús

2014-01-01

To explore the association between excess weight or body mass index (BMI) and the time spent watching television, self-reported physical activity and sleep duration in a young adult population. We analyzed cross-sectional baseline data of 1,135 participants (17-35 years old) from the project Dieta, salud y antropometría en población universitaria (Diet, Health and Anthropometric Variables in University Students). Information about time spent watching television, sleep duration, self-reported physical activity and self-reported height and weight was provided by a baseline questionnaire. BMI was calculated as kg/m² and excess weight was defined as BMI ≥25. We used multiple logistic regression to explore the association between excess weight (no/yes) and independent variables, and multiple linear regression for BMI. The prevalence of excess weight was 13.7% (11.2% were overweight and 2.5% were obese). A significant positive association was found between excess weight and a greater amount of time spent watching television. Participants who reported watching television >2h a day had a higher risk of excess weight than those who watched television ≤1h a day (OR=2.13; 95%CI: 1.37-3.36; p-trend: 0.002). A lower level of physical activity was associated with an increased risk of excess weight, although the association was statistically significant only in multiple linear regression (p=0.037). No association was observed with sleep duration. A greater number of hours spent watching television and lower physical activity were significantly associated with a higher BMI in young adults. Both factors are potentially modifiable with preventive strategies. Copyright © 2013 SESPAS. Published by Elsevier Espana. All rights reserved.
Examination of Factors that Influence the Operation Income and Expenditure Balance Difference Rate of 20 Educational Foundation Universities.

PubMed

Nakajima, Hisato; Yano, Kouya; Nagasawa, Kaoko; Katou, Satoka; Yokota, Kuninobu

2017-01-01

The objective of this study is to examine the factors that influence the operation income and expenditure balance ratio of school corporations running university hospitals by multiple regression analysis. Methods: 1. We conducted cluster analysis of the financial ratios and classified the school corporations into those running colleges and those running universities. 2. We conducted multiple regression analysis using the operation income and expenditure balance ratio of the colleges as the dependent variable and the Diagnosis Procedure Combination data as the explanatory variables. 3. A predictive expression was derived from the multiple regression analysis. Results: 1. The school corporations were divided into those running universities (7), colleges (20) and others. The medical income ratio and the debt ratio were high and the student payment ratio was low in the colleges. 2. The numbers of emergency care hospitalizations, operations, radiation therapies, and ambulance conveyances, and the complexity index had a positive influence on the operation income and expenditure balance ratio. On the other hand, the number of general anesthesia procedures, the cover rate index, and the emergency care index had a negative influence. 3. The predictive expression was as follows. Operation income and expenditure balance ratio = 0.027 × number of emergency care hospitalizations + 0.005 × number of operations + 0.019 × number of radiation therapies + 0.007 × number of ambulance conveyances - 0.003 × number of general anesthesia procedures + 648.344 × complexity index - 5877.210 × cover rate index - 2746.415 × emergency care index - 38.647. Conclusion: In colleges, the number of emergency care hospitalizations, the number of operations, the number of radiation therapies, the number of ambulance conveyances, and the complexity index were factors for gaining ordinary profit.
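The predictive expression reported in the abstract above can be evaluated mechanically, as in the hedged sketch below. The coefficients and intercept are copied from the abstract; the dictionary keys, the function name, and the input handling are hypothetical conveniences, and the abstract does not state the units or counting period of each predictor, so any inputs would need to match the original study's definitions.

```python
# Coefficients as reported in the abstract; key names are illustrative only.
COEFFICIENTS = {
    "emergency_hospitalizations": 0.027,
    "operations": 0.005,
    "radiation_therapies": 0.019,
    "ambulance_conveyances": 0.007,
    "general_anesthesia_procedures": -0.003,
    "complexity_index": 648.344,
    "cover_rate_index": -5877.210,
    "emergency_care_index": -2746.415,
}
INTERCEPT = -38.647


def predict_balance_ratio(values: dict) -> float:
    """Evaluate the linear predictive expression for one institution.

    `values` must supply every predictor named in COEFFICIENTS.
    """
    missing = set(COEFFICIENTS) - set(values)
    if missing:
        raise KeyError(f"missing predictors: {sorted(missing)}")
    return INTERCEPT + sum(
        COEFFICIENTS[name] * values[name] for name in COEFFICIENTS
    )


# With every predictor at zero the expression reduces to the intercept.
baseline = predict_balance_ratio({name: 0.0 for name in COEFFICIENTS})
```

Because the three index terms carry coefficients in the hundreds or thousands while the count terms are of order 0.01, small changes in an index can outweigh large changes in procedure counts, so the signs alone should not be read as a ranking of importance.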
Caries risk assessment in schoolchildren - a form based on Cariogram® software

PubMed Central

CABRAL, Renata Nunes; HILGERT, Leandro Augusto; FABER, Jorge; LEAL, Soraya Coelho

2014-01-01

Identifying caries risk factors is an important measure that contributes to a better understanding of the cariogenic profile of the patient. The Cariogram® software provides this analysis, and protocols simplifying the method have been suggested. Objectives: The aim of this study was to determine whether a newly developed Caries Risk Assessment (CRA) form based on the Cariogram® software could classify schoolchildren according to their caries risk and to evaluate relationships between caries risk and the variables in the form. Material and Methods: 150 schoolchildren aged 5 to 7 years old were included in this survey. Caries prevalence was obtained according to International Caries Detection and Assessment System (ICDAS) II. Information for filling in the form based on Cariogram® was collected clinically and from questionnaires sent to parents. Linear regression and a forward stepwise multiple regression model were applied to correlate the variables included in the form with the caries risk. Results: Caries prevalence in primary dentition, including enamel and dentine carious lesions, was 98.6%, and 77.3% when only dentine lesions were considered. Eighty-six percent of the children were classified as at moderate caries risk. The forward stepwise multiple regression model result was significant (R2=0.904; p<0.00001), showing that the most significant factors influencing caries risk were caries experience, oral hygiene, frequency of food consumption, sugar consumption and fluoride sources. Conclusion: The use of the form based on the Cariogram® software enabled classification of the schoolchildren at low, moderate and high caries risk. Caries experience, oral hygiene, frequency of food consumption, sugar consumption and fluoride sources are the variables that were shown to be highly correlated with caries risk. PMID:25466473
Impact of wearing fixed orthodontic appliances on quality of life among adolescents: Case-control study.

PubMed

Costa, Andréa A; Serra-Negra, Júnia M; Bendo, Cristiane B; Pordeus, Isabela A; Paiva, Saul M

2016-01-01

To investigate the impact of wearing a fixed orthodontic appliance on oral health-related quality of life (OHRQoL) among adolescents. A case-control study (1:2) was carried out with a population-based randomized sample of 327 adolescents aged 11 to 14 years enrolled at public and private schools in the City of Brumadinho, in southeastern Brazil. The case group (n = 109) was made up of adolescents with a high negative impact on OHRQoL, and the control group (n = 218) was made up of adolescents with a low negative impact. The outcome variable was the impact on OHRQoL measured by the Brazilian version of the Child Perceptions Questionnaire (CPQ 11-14) - Impact Short Form (ISF:16). The main independent variable was wearing fixed orthodontic appliances. Malocclusion and the type of school were identified as possible confounding variables. Bivariate and multiple conditional logistic regressions were employed in the statistical analysis. A multiple conditional logistic regression model demonstrated that adolescents wearing fixed orthodontic appliances had a 4.88-fold greater chance of presenting high negative impact on OHRQoL (95% CI: 2.93-8.13; P < .001) than those who did not wear fixed orthodontic appliances. A bivariate conditional logistic regression demonstrated that malocclusion was significantly associated with OHRQoL (P = .017), whereas no statistically significant association was found between the type of school and OHRQoL (P = .108). Adolescents who wore fixed orthodontic appliances had a greater chance of reporting a negative impact on OHRQoL than those who did not wear such appliances.
Use and misuse of motor-vehicle crash death rates in assessing highway-safety performance.

PubMed

O'Neill, Brian; Kyrychenko, Sergey Y

2006-12-01

The objectives of the article are to assess the extent to which comparisons of motor-vehicle crash death rates can be used to determine the effectiveness of highway-safety policies over time in a country or to compare policy effectiveness across countries. Motor-vehicle crash death rates per mile traveled in the 50 U.S. states from 1980 to 2003 are used to show the influence on these rates of factors independent of highway-safety interventions. Multiple regression models relating state death rates to various measures related to urbanization and demographics are used. The analyses demonstrate strong relationships between state death rates and urbanization and demographics. Almost 60% of the variability among the state death rates can be explained by the independent variables in the multiple regression models. When the death rates for passenger vehicle occupants (i.e., excluding motorcycle, pedestrian, and other deaths) are used in the regression models, almost 70% of the variability in the rates can be explained by urbanization and demographics. The analyses presented in the article demonstrate that motor-vehicle crash death rates are strongly influenced by factors unrelated to highway-safety countermeasures. Overall death rates should not be used as a basis for judging the effectiveness (or ineffectiveness) of specific highway-safety countermeasures or to assess overall highway-safety policies, especially across jurisdictions. There can be no substitute for the use of carefully designed scientific evaluations of highway-safety interventions that use outcome measures directly related to the intervention; e.g., motorcyclist deaths should be used to assess the effectiveness of motorcycle helmet laws. While this may seem obvious, there are numerous examples in the literature of death rates from all crashes being used to assess the effectiveness of interventions aimed at specific subsets of crashes.
Factors contributing to practice variation in post-stroke rehabilitation.

PubMed Central

Lee, A J; Huber, J H; Stason, W B

1997-01-01

OBJECTIVE: To analyze geographic variability in the utilization and cost of post-stroke medical care using multiple linear regression. DATA SOURCES/STUDY SETTING: A 20 percent random sample of Medicare beneficiaries with an admission to an acute care hospital for stroke during the first six months of 1991, supplemented by data from their Medicare claims and beneficiary records, the Medicare Cost Reports for hospitals and nursing homes, and the Area Resource File. STUDY DESIGN: Weighted least squares regression is used to analyze variations in post-stroke practice patterns across 151 MSAs (Metropolitan Statistical Areas). Average post-stroke costs, utilization rates, and facility lengths of stay are regressed on patient and market characteristics. DATA COLLECTION/EXTRACTION METHODS: For a six-month post-stroke interval, beneficiary-level post-stroke costs and service utilization are averaged by MSA. Variables describing market conditions are then added to these MSA-level records. PRINCIPAL FINDINGS: Patient variables rarely explain more than a third of practice variation, and often they explain substantially less than that. Market variables (with some exception) tend to be relatively less important. Finally, one-half to two-thirds of the practice variation across MSAs is unexplained by the patient and market factors measured in our data. CONCLUSIONS: A substantial portion of inter-MSA variability in utilization and intensity of post-stroke rehabilitation services cannot be explained by differences in patient characteristics. Given the large practice differences observed across MSAs, it seems unlikely that unmeasured patient differences can account for much more of the practice differences. PMID:9180616
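To make the weighted least squares step in this abstract concrete, the sketch below solves the weighted normal equations for area-level averages. The data are synthetic, and weighting each area by its number of patients is an assumption; the abstract does not say which weights the authors used.

```python
import numpy as np

def weighted_least_squares(X, y, w):
    """Solve (X' W X) beta = X' W y for a diagonal weight matrix W = diag(w)."""
    A = np.column_stack([np.ones(len(y)), X])   # add an intercept column
    Aw = A * w[:, None]                          # rows of A scaled by their weights
    return np.linalg.solve(A.T @ Aw, Aw.T @ y)

# Hypothetical area-level data: each row is an area average based on n_i patients,
# so its noise variance shrinks as 1 / n_i and the natural weight is n_i.
rng = np.random.default_rng(1)
n_patients = rng.integers(20, 500, size=151)
X = rng.normal(size=(151, 2))
noise = rng.normal(size=151) / np.sqrt(n_patients)
y = 3.0 + 1.0 * X[:, 0] - 2.0 * X[:, 1] + noise
beta = weighted_least_squares(X, y, n_patients.astype(float))
```

Weights proportional to the inverse of each observation's error variance give the most precise linear unbiased estimates, which is the usual reason for weighting averages by the number of cases behind them.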
Incorporating wind availability into land use regression modelling of air quality in mountainous high-density urban environment.

PubMed

Shi, Yuan; Lau, Kevin Ka-Lun; Ng, Edward

2017-08-01

Urban air quality serves as an important function of the quality of urban life. Land use regression (LUR) modelling of air quality is essential for conducting health impacts assessment but more challenging in mountainous high-density urban scenario due to the complexities of the urban environment. In this study, a total of 21 LUR models are developed for seven kinds of air pollutants (gaseous air pollutants CO, NO2, NOx, O3, SO2 and particulate air pollutants PM2.5, PM10) with reference to three different time periods (summertime, wintertime and annual average of 5-year long-term hourly monitoring data from local air quality monitoring network) in Hong Kong. Under the mountainous high-density urban scenario, we improved the traditional LUR modelling method by incorporating wind availability information into LUR modelling based on surface geomorphometrical analysis. As a result, 269 independent variables were examined to develop the LUR models by using the "ADDRESS" independent variable selection method and stepwise multiple linear regression (MLR). Cross validation has been performed for each resultant model. The results show that wind-related variables are included in most of the resultant models as statistically significant independent variables. Compared with the traditional method, a maximum increase of 20% was achieved in the prediction performance of annual averaged NO2 concentration level by incorporating wind-related variables into LUR model development. Copyright © 2017 Elsevier Inc. All rights reserved.
TV watching, soap opera and happiness.

PubMed

Lu, L; Argyle, M

1993-09-01

One hundred and fourteen subjects reported the amount of time they spent watching television in general, and soap opera in particular. They also completed scales measuring happiness and other personality variables, such as extraversion and cooperativeness. In the multiple regression analysis, having controlled for the demographic variables, watching TV was related to unhappiness, whereas watching soap opera was related to happiness. Discriminant analysis showed that females, higher happiness and extraversion distinguished regular soap watchers (who nevertheless watched little TV in general) from irregular soap watchers (who nevertheless watched a lot of TV in general).
A Study of the Effect of the Front-End Styling of Sport Utility Vehicles on Pedestrian Head Injuries

PubMed Central

Qin, Qin; Chen, Zheng; Bai, Zhonghao; Cao, Libo

2018-01-01

Background: The number of sport utility vehicles (SUVs) on the Chinese market is continuously increasing. It is necessary to investigate the relationships between the front-end styling features of SUVs and head injuries at the styling design stage to improve pedestrian protection performance and product development efficiency. Methods: Styling feature parameters were extracted from the SUV side contour line, and simplified finite element models were established based on the 78 SUV side contour lines. Pedestrian headform impact simulations were performed and validated. The head injury criterion of 15 ms (HIC15) at four wrap-around distances was obtained. A multiple linear regression analysis method was employed to describe the relationships between the styling feature parameters and the HIC15 at each impact point. Results: The relationship between the selected styling features and the HIC15 showed reasonable correlations, and the regression models and the selected independent variables showed statistical significance. Conclusions: The regression equations obtained by multiple linear regression can be used to assess the performance of SUV styling in protecting pedestrians' heads and provide styling designers with technical guidance regarding their artistic creations.
Air Pollutants, Climate, and the Prevalence of Pediatric Asthma in Urban Areas of China

PubMed Central

Zhang, Juanjuan; Yan, Li; Fu, Wenlong; Yi, Jing; Chen, Yuzhi; Liu, Chuanhe; Xu, Dongqun; Wang, Qiang

2016-01-01

Background. Prevalence of childhood asthma varies significantly among regions, but the reasons are not yet clear, and only a few studies have reported causes for this variation. Objective. To investigate the potential role of city-average levels of air pollutants and climatic factors in order to distinguish differences in asthma prevalence in China and explain their reasons. Methods. Data pertaining to 10,777 asthmatic patients were obtained from the third nationwide survey of childhood asthma in China's urban areas. Annual mean concentrations of air pollutants and other climatic factors were obtained for the same period from several government departments. Data analysis was implemented with descriptive statistics, Pearson correlation coefficient, and multiple regression analysis. Results. Pearson correlation analysis showed that the situation of childhood asthma was strongly linked with SO2, relative humidity, and hours of sunshine (p < 0.05). Multiple regression analysis indicated that, among the predictor variables in the final step, SO2 was the most powerful predictor variable (β = −19.572, p < 0.05). Furthermore, the results showed that hours of sunshine (β = −0.014, p < 0.05) was a significant component summary predictor variable. Conclusion. The findings of this study do not suggest that air pollutants or climate, at least in terms of children, play a major role in explaining regional differences in asthma prevalence in China. PMID:27556031
Case-related factors affecting cutting errors of the proximal tibia in total knee arthroplasty assessed by computer navigation.

PubMed

Tsukeoka, Tadashi; Tsuneizumi, Yoshikazu; Yoshino, Kensuke; Suzuki, Mashiko

2018-05-01

The aim of this study was to determine factors that contribute to bone cutting errors of conventional instrumentation for tibial resection in total knee arthroplasty (TKA) as assessed by an image-free navigation system. The hypothesis is that preoperative varus alignment is a significant contributory factor to tibial bone cutting errors. This was a prospective study of a consecutive series of 72 TKAs. The amount of the tibial first-cut errors with reference to the planned cutting plane in both coronal and sagittal planes was measured by an image-free computer navigation system. Multiple regression models were developed with the amount of tibial cutting error in the coronal and sagittal planes as dependent variables and sex, age, disease, height, body mass index, preoperative alignment, patellar height (Insall-Salvati ratio) and preoperative flexion angle as independent variables. Multiple regression analysis showed that sex (male gender) (R = 0.25, p = 0.047) and preoperative varus alignment (R = 0.42, p = 0.001) were positively associated with varus tibial cutting errors in the coronal plane. In the sagittal plane, none of the independent variables was significant. When performing TKA in varus deformity, careful confirmation of the bone cutting surface should be performed to avoid varus alignment. The results of this study suggest technical considerations that can help a surgeon achieve more accurate component placement. IV.
An Application of Robust Method in Multiple Linear Regression Model toward Credit Card Debt

NASA Astrophysics Data System (ADS)

Amira Azmi, Nur; Saifullah Rusiman, Mohd; Khalid, Kamil; Roslan, Rozaini; Sufahani, Suliadi; Mohamad, Mahathir; Salleh, Rohayu Mohd; Hamzah, Nur Shamsidah Amir

2018-04-01

The credit card is a convenient alternative to cash or cheque, and it is an essential component of electronic and internet commerce. In this study, the researchers attempt to determine the relationship between credit card debt and demographic variables such as age, household income, education level, years with current employer, years at current address, debt-to-income ratio and other debt, and to identify which of these variables are significant. The data cover 850 customers. Three methods are applied to the credit card debt data: multiple linear regression (MLR) models, MLR models with the least quartile difference (LQD) method, and MLR models with the mean absolute deviation method. A comparison of the three methods shows that the MLR model with the LQD method is the best model, with the lowest mean square error (MSE). In the final model, years with current employer, years at current address, household income in thousands and debt-to-income ratio are positively associated with the amount of credit card debt, whereas age, level of education and other debt are negatively associated with it. This study may serve as a reference for banks on the use of robust methods, helping them understand their options and choose the approach best aligned with their goals for inference about credit card debt.
Fasting insulin levels and metabolic risk factors in type 2 diabetic patients at the first visit in Japan: a 10-year, nationwide, observational study (JDDM 28).

PubMed

Matsuba, Ikuro; Saito, Kazumi; Takai, Masahiko; Hirao, Koichi; Sone, Hirohito

2012-09-01

To investigate the relationship between fasting insulin levels and metabolic risk factors (MRFs) in type 2 diabetic patients at the first clinic/hospital visit in Japan over the years 2000 to 2009. In total, 4,798 drug-naive Japanese patients with type 2 diabetes were registered on their first clinic/hospital visits. Conventional clinical factors and fasting insulin levels were observed at baseline within the Japan Diabetes Clinical Data Management (JDDM) study between consecutive 2-year groups. Multiple linear regression analysis was performed using a model in which the dependent variable was fasting insulin values using various clinical explanatory variables. Fasting insulin levels were found to be decreasing from 2000 to 2009. Multiple linear regression analysis with the fasting insulin levels as the dependent variable showed that waist circumference (WC), BMI, mean blood pressure, triglycerides, and HDL cholesterol were significant, with WC and BMI as the main factors. ANCOVA after adjustment for age and fasting plasma glucose clearly shows the decreasing trend in fasting insulin levels and the increasing trend in BMI. During the 10-year observation period, the decreasing trend in fasting insulin was related to the slight increase in WC/BMI in type 2 diabetes. Low pancreatic β-cell reserve on top of a lifestyle background might be dependent on an increase in MRFs.
Fasting Insulin Levels and Metabolic Risk Factors in Type 2 Diabetic Patients at the First Visit in Japan

PubMed Central

Matsuba, Ikuro; Saito, Kazumi; Takai, Masahiko; Hirao, Koichi; Sone, Hirohito

2012-01-01

OBJECTIVE To investigate the relationship between fasting insulin levels and metabolic risk factors (MRFs) in type 2 diabetic patients at the first clinic/hospital visit in Japan over the years 2000 to 2009. RESEARCH DESIGN AND METHODS In total, 4,798 drug-naive Japanese patients with type 2 diabetes were registered on their first clinic/hospital visits. Conventional clinical factors and fasting insulin levels were observed at baseline within the Japan Diabetes Clinical Data Management (JDDM) study between consecutive 2-year groups. Multiple linear regression analysis was performed using a model in which the dependent variable was fasting insulin values using various clinical explanatory variables. RESULTS Fasting insulin levels were found to be decreasing from 2000 to 2009. Multiple linear regression analysis with the fasting insulin levels as the dependent variable showed that waist circumference (WC), BMI, mean blood pressure, triglycerides, and HDL cholesterol were significant, with WC and BMI as the main factors. ANCOVA after adjustment for age and fasting plasma glucose clearly shows the decreasing trend in fasting insulin levels and the increasing trend in BMI. CONCLUSIONS During the 10-year observation period, the decreasing trend in fasting insulin was related to the slight increase in WC/BMI in type 2 diabetes. Low pancreatic β-cell reserve on top of a lifestyle background might be dependent on an increase in MRFs. PMID:22665215
Childhood trauma is not a confounder of the overlap between autistic and schizotypal traits: A study in a non-clinical adult sample.

PubMed

Gong, Jing-Bo; Wang, Ya; Lui, Simon S Y; Cheung, Eric F C; Chan, Raymond C K

2017-11-01

Childhood trauma has been shown to be a robust risk factor for mental disorders, and may exacerbate schizotypal traits or contribute to autistic trait severity. However, little is known about whether childhood trauma confounds the overlap between schizotypal traits and autistic traits. This study examined whether childhood trauma acts as a confounding variable in the overlap between autistic and schizotypal traits in a large non-clinical adult sample. A total of 2469 participants completed the Autism Spectrum Quotient (AQ), the Schizotypal Personality Questionnaire (SPQ), and the Childhood Trauma Questionnaire-Short Form. Correlation analysis showed that the majority of associations between AQ variables and SPQ variables were significant (p < 0.05). In the multiple regression models predicting scores on the AQ total, scores on the three SPQ subscales were significant predictors (Ps < 0.05). Scores on the Positive schizotypy and Negative schizotypy subscales were significant predictors in the multiple regression model predicting scores on the AQ Social Skill, AQ Attention Switching, AQ Attention to Detail, AQ Communication, and AQ Imagination subscales. The association between autistic and schizotypal traits could not be explained by shared variance in terms of exposure to childhood trauma. The findings point to important overlaps in the conceptualization of ASD and SSD, independent of childhood trauma. Copyright © 2017 Elsevier B.V. All rights reserved.
Male Saudi Arabian freshman science majors at Jazan University: Their perceptions of parental educational practices on their science achievements

NASA Astrophysics Data System (ADS)

Alrehaly, Essa D.

Examination of Saudi Arabian educational practices is scarce, but increasingly important, especially in light of the country's pace in worldwide mathematics and science rankings. The purpose of the study is to understand and evaluate parental influence on male children's science education achievements in Saudi Arabia. Parental level of education and participant's choice of science major were used to identify groups for the purpose of data analysis. Data were gathered using five independent variables concerning parental educational practices (attitude, involvement, autonomy support, structure and control) and the dependent variable of science scores in high school. The sample consisted of 338 participants and was arbitrarily drawn from the science-based colleges (medical, engineering, and natural science) at Jazan University in Saudi Arabia. The data were tested using Pearson's analysis, backward multiple regression, one way ANOVA and independent t-test. The findings of the study reveal significant correlations for all five of the variables. Multiple regressions revealed that all five of the parents' educational practices indicators combined together could explain 19% of the variance in science scores and parental attitude toward science and educational involvement combined accounted for more than 18% of the variance. Analysis indicates that no significant difference is attributable to parental involvement and educational level. This finding is important because it indicates that, in Saudi Arabia, results are not consistent with research in Western or other Asian contexts.
The association between subgingival periodontal pathogens and systemic inflammation.

PubMed

Winning, Lewis; Patterson, Christopher C; Cullen, Kathy M; Stevenson, Kathryn A; Lundy, Fionnuala T; Kee, Frank; Linden, Gerard J

2015-09-01

To investigate associations between periodontal disease pathogens and levels of systemic inflammation measured by C-reactive protein (CRP). A representative sample of dentate 60-70-year-old men in Northern Ireland had a comprehensive periodontal examination. Men taking statins were excluded. Subgingival plaque samples were analysed by quantitative real time PCR to identify the presence of Aggregatibacter actinomycetemcomitans, Porphyromonas gingivalis, Treponema denticola and Tannerella forsythia. High-sensitivity CRP (mg/l) was measured from fasting blood samples. Multiple linear regression analysis was performed using log-transformed CRP concentration as the dependent variable, with the presence of each periodontal pathogen as predictor variables, with adjustment for various potential confounders. A total of 518 men (mean age 63.6 SD 3.0 years) were included in the analysis. Multiple regression analysis showed that body mass index (p < 0.001), current smoking (p < 0.01), the detectable presence of P. gingivalis (p < 0.01) and hypertension (p = 0.01), were independently associated with an increased CRP. The detectable presence of P. gingivalis was associated with a 20% (95% confidence interval 4-35%) increase in CRP (mg/l) after adjustment for all other predictor variables. In these 60-70-year-old dentate men, the presence of P. gingivalis in subgingival plaque was significantly associated with a raised level of C-reactive protein. © 2015 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
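Because the outcome in this model is log-transformed CRP, a coefficient is read as a multiplicative change rather than an additive one. The sketch below converts between a percent change and the coefficient that would produce it. It assumes a natural-log transform, which the abstract does not specify, and the function names are illustrative.

```python
import math

def percent_change(beta):
    """Percent change in the outcome for a one-unit predictor increase
    when the outcome was modelled on the natural-log scale."""
    return 100.0 * (math.exp(beta) - 1.0)

def implied_beta(pct):
    """Inverse of percent_change: the log-scale coefficient behind a percent change."""
    return math.log(1.0 + pct / 100.0)

# Reported in the abstract: 20% increase (95% confidence interval 4-35%).
for pct in (4.0, 20.0, 35.0):
    b = implied_beta(pct)
    print(f"{pct:.0f}% increase  <->  coefficient {b:.3f} on the log scale")
```

Under that assumption, a 20% increase corresponds to a coefficient of about 0.18 for the presence of P. gingivalis.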
Meteorological Contribution to Variability in Particulate Matter Concentrations

NASA Astrophysics Data System (ADS)

Woods, H. L.; Spak, S. N.; Holloway, T.

2006-12-01

Local concentrations of fine particulate matter (PM) are driven by a number of processes, including emissions of aerosols and gaseous precursors, atmospheric chemistry, and meteorology at local, regional, and global scales. We apply statistical downscaling methods, typically used for regional climate analysis, to estimate the contribution of regional scale meteorology to PM mass concentration variability at a range of sites in the Upper Midwestern U.S. Multiple years of daily PM10 and PM2.5 data, reported by the U.S. Environmental Protection Agency (EPA), are correlated with large-scale meteorology over the region from the National Centers for Environmental Prediction (NCEP) reanalysis data. We use two statistical downscaling methods (multiple linear regression, MLR, and analog) to identify which processes have the greatest impact on aerosol concentration variability. Empirical Orthogonal Functions of the NCEP meteorological data are correlated with PM timeseries at measurement sites. We examine which meteorological variables exert the greatest influence on PM variability, and which sites exhibit the greatest response to regional meteorology. To evaluate model performance, measurement data are withheld for limited periods, and compared with model results. Preliminary results suggest that regional meteorological processes account for over 50% of aerosol concentration variability at study sites.
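The workflow described in this abstract (Empirical Orthogonal Functions of a gridded field, a multiple linear regression on their amplitudes, and evaluation on withheld data) can be sketched as follows. Everything here is synthetic and assumed for illustration: the grid size, the number of retained EOFs, and the split into fitting and withheld days are not taken from the study.

```python
import numpy as np

rng = np.random.default_rng(3)
n_days, n_grid = 400, 50
# Synthetic "meteorology": two large-scale patterns plus grid-point noise.
patterns = rng.normal(size=(2, n_grid))
amplitudes = rng.normal(size=(n_days, 2))
field = amplitudes @ patterns + 0.3 * rng.normal(size=(n_days, n_grid))
pm = 10.0 + 2.0 * amplitudes[:, 0] - 1.0 * amplitudes[:, 1] + 0.5 * rng.normal(size=n_days)

train, test = slice(0, 300), slice(300, 400)   # withhold the last 100 days

# EOFs: right singular vectors of the centred training field.
mean_field = field[train].mean(axis=0)
_, _, vt = np.linalg.svd(field[train] - mean_field, full_matrices=False)
eofs = vt[:2]                                   # keep two leading EOFs (assumed)

def pcs(block):
    """Project a block of days onto the retained EOFs."""
    return (block - mean_field) @ eofs.T

# Multiple linear regression of PM on the EOF amplitudes, fitted on training days.
A_train = np.column_stack([np.ones(300), pcs(field[train])])
beta, *_ = np.linalg.lstsq(A_train, pm[train], rcond=None)

# Compare predictions with the withheld measurements.
A_test = np.column_stack([np.ones(100), pcs(field[test])])
pred = A_test @ beta
ss_res = ((pm[test] - pred) ** 2).sum()
ss_tot = ((pm[test] - pm[test].mean()) ** 2).sum()
r2_withheld = 1.0 - ss_res / ss_tot
```

The withheld-period R² plays the role of the "variability accounted for" figure quoted in the abstract, although the study's own skill measure may differ.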
Precision Interval Estimation of the Response Surface by Means of an Integrated Algorithm of Neural Network and Linear Regression

NASA Technical Reports Server (NTRS)

Lo, Ching F.

1999-01-01

The integration of Radial Basis Function Networks and Back Propagation Neural Networks with Multiple Linear Regression has been accomplished to map nonlinear response surfaces over a wide range of independent variables in the process of the Modern Design of Experiments. The integrated method is capable of estimating precision intervals, including confidence and prediction intervals. The power of the innovative method has been demonstrated by applying it to a set of wind tunnel test data to construct the response surface and estimate the precision interval.
A simulation study on Bayesian Ridge regression models for several collinearity levels

NASA Astrophysics Data System (ADS)

Efendi, Achmad; Effrihan

2017-12-01

When data are analyzed with a multiple regression model and collinearity is present, one or several predictor variables are usually omitted from the model. Sometimes, however, there are reasons (medical or economic, for instance) why all the predictors are important and should be included in the model. Ridge regression is not uncommon in research as a way to cope with collinearity. In this modeling approach, weights for the predictor variables are used in estimating the parameters. The subsequent estimation can follow the likelihood concept. A Bayesian version of the estimation is now also an alternative. This estimation method is less popular than the likelihood method because of some difficulties, such as computation. Nevertheless, with recent improvements in computational methodology, this caveat should no longer be a problem. This paper discusses a simulation process for evaluating the characteristics of Bayesian ridge regression parameter estimates. There are several simulation settings based on a variety of collinearity levels and sample sizes. The results show that the Bayesian method gives better performance for relatively small sample sizes, and for the other settings it performs similarly to the likelihood method.
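To show why ridge regression helps under collinearity, the sketch below compares the ordinary least-squares estimate with the classical closed-form ridge estimate on two nearly identical predictors. This is the non-Bayesian ridge estimator, not the paper's simulation; the data and the penalty value `lam` are assumptions chosen for illustration.

```python
import numpy as np

def ridge(X, y, lam):
    """Closed-form ridge estimate (X'X + lam * I)^(-1) X'y, no intercept."""
    p = X.shape[1]
    return np.linalg.solve(X.T @ X + lam * np.eye(p), X.T @ y)

# Two nearly collinear predictors (correlation close to 1).
rng = np.random.default_rng(2)
n = 30
x1 = rng.normal(size=n)
x2 = x1 + rng.normal(scale=0.01, size=n)
X = np.column_stack([x1, x2])
y = x1 + x2 + rng.normal(scale=0.5, size=n)

beta_ols = ridge(X, y, lam=0.0)     # lam = 0 reduces to ordinary least squares
beta_ridge = ridge(X, y, lam=1.0)   # lam = 1.0 is an arbitrary illustrative penalty
print("OLS:  ", beta_ols)
print("ridge:", beta_ridge)
```

With a Gaussian likelihood of known noise variance and independent zero-mean Gaussian priors of equal variance on the coefficients, the same formula gives the posterior mean, with `lam` equal to the ratio of noise variance to prior variance. That is one way to read the Bayesian version, though the priors the paper actually used are not given in the abstract.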

A Method for Calculating the Probability of Successfully Completing a Rocket Propulsion Ground Test

NASA Technical Reports Server (NTRS)

Messer, Bradley

2007-01-01

Propulsion ground test facilities face the daily challenge of scheduling multiple customers into limited facility space and successfully completing their propulsion test projects. Over the last decade NASA's propulsion test facilities have performed hundreds of tests, collected thousands of seconds of test data, and exceeded the capabilities of numerous test facility and test article components. A logistic regression mathematical modeling technique has been developed to predict the probability of successfully completing a rocket propulsion test. A logistic regression model is a mathematical modeling approach that can be used to describe the relationship of several independent predictor variables X_1, X_2, ..., X_k to a binary or dichotomous dependent variable Y, where Y can only be one of two possible outcomes, in this case Success or Failure of accomplishing a full duration test. The use of logistic regression modeling is not new; however, modeling propulsion ground test facilities using logistic regression is both a new and unique application of the statistical technique. Results from this type of model provide project managers with insight and confidence into the effectiveness of rocket propulsion ground testing.
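The logistic regression model described in this abstract maps a linear combination of the predictors to a probability between 0 and 1. A minimal sketch of that mapping follows; the coefficients and predictor values are hypothetical and are not taken from the NASA model.

```python
import math

def success_probability(intercept, coefs, x):
    """P(Y = 1 | x) under a logistic regression model:
    1 / (1 + exp(-(b0 + b1*x1 + ... + bk*xk)))."""
    eta = intercept + sum(b * xi for b, xi in zip(coefs, x))
    return 1.0 / (1.0 + math.exp(-eta))

# Hypothetical coefficients and predictor values, for illustration only.
b0 = -1.0
b = [0.8, -0.5]
p = success_probability(b0, b, x=[2.0, 1.0])   # linear predictor = 0.1
print(f"Estimated probability of success: {p:.3f}")
```

A linear predictor of zero corresponds to a probability of exactly one half, and each coefficient shifts the log-odds of success by a fixed amount per unit of its predictor.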
The evaluation of the National Long Term Care Demonstration. 2. Estimation methodology.

PubMed Central

Brown, R S

1988-01-01

Channeling effects were estimated by comparing the post-application experience of the treatment and control groups using multiple regression. A variety of potential threats to the validity of the results, including sample composition issues, data issues, and estimation issues, were identified and assessed. Of all the potential problems examined, the only one determined to be likely to cause widespread distortion of program impact estimates was noncomparability of the baseline data. To avoid this distortion, baseline variables judged to be noncomparably measured were excluded from use as control variables in the regression equation. (Where they existed, screen counterparts to these noncomparable baseline variables were used as substitutes.) All of the other potential problems with the sample, data, or regression estimation approach were found to have little or no actual effect on impact estimates or their interpretation. Broad implementation of special procedures, therefore, was not necessary. The study did find that, because of the frequent use of proxy respondents, the estimated effects of channeling on clients' well-being actually may reflect impacts on the well-being of the informal caregiver rather than the client. This and other isolated cases in which there was some evidence of a potential problem for specific outcome variables were identified and examined in detail in technical reports dealing with those outcomes. Where appropriate, alternative estimates were presented. PMID:3130329
Empirical analyses of plant-climate relationships for the western United States

Treesearch

Gerald E. Rehfeldt; Nicholas L. Crookston; Marcus V. Warwell; Jeffrey S. Evans

2006-01-01

The Random Forests multiple-regression tree was used to model climate profiles of 25 biotic communities of the western United States and nine of their constituent species. Analyses of the communities were based on a gridded sample of ca. 140,000 points, while those for the species used presence-absence data from ca. 120,000 locations. Independent variables included 35...
Revising the Rorschach Ego Impairment Index to Accommodate Recent Recommendations about Improving Rorschach Validity

ERIC Educational Resources Information Center

Viglione, Donald J.; Perry, William; Giromini, Luciano; Meyer, Gregory J.

2011-01-01

We used multiple regression to calculate a new Ego Impairment Index (EII-3). The aim was to incorporate changes in the component variables and distribution of the number of responses as found in the new Rorschach Performance Assessment System, while sustaining the validity and reliability of previous EIIs. The EII-3 formula was derived from a…
Spatial, spectral and temporal patterns of tropical forest cover change as observed with multiple scales of optical satellite data.

Treesearch

D.J. Hayes; W.B. Cohen

2006-01-01

This article describes the development of a methodology for scaling observations of changes in tropical forest cover to large areas at high temporal frequency from coarse-resolution satellite imagery. The approach for estimating proportional forest cover change as a continuous variable is based on a regression model that relates multispectral, multitemporal Moderate...
Associations between Resilience and the Well-Being of Mothers of Children with Autism Spectrum Disorder and Other Developmental Disabilities

ERIC Educational Resources Information Center

Halstead, Elizabeth; Ekas, Naomi; Hastings, Richard P.; Griffith, Gemma M.

2018-01-01

There is variability in the extent to which mothers are affected by the behavior problems of their children with developmental disabilities (DD). We explore whether maternal resilience functions as a protective or compensatory factor. In Studies 1 and 2, using moderated multiple regression models, we found evidence that maternal resilience…
Building "e-rater"® Scoring Models Using Machine Learning Methods. Research Report. ETS RR-16-04

ERIC Educational Resources Information Center

Chen, Jing; Fife, James H.; Bejar, Isaac I.; Rupp, André A.

2016-01-01

The "e-rater"® automated scoring engine used at Educational Testing Service (ETS) scores the writing quality of essays. In the current practice, e-rater scores are generated via a multiple linear regression (MLR) model as a linear combination of various features evaluated for each essay and human scores as the outcome variable. This…
Tumble Graphs: Avoiding Misleading End Point Extrapolation When Graphing Interactions From a Moderated Multiple Regression Analysis

ERIC Educational Resources Information Center

Bodner, Todd E.

2016-01-01

This article revisits how the end points of plotted line segments should be selected when graphing interactions involving a continuous target predictor variable. Under the standard approach, end points are chosen at ±1 or 2 standard deviations from the target predictor mean. However, when the target predictor and moderator are correlated or the…
Brazil soybean yield covariance model

NASA Technical Reports Server (NTRS)

Callis, S. L.; Sakamoto, C.

1984-01-01

A model based on multiple regression was developed to estimate soybean yields for the seven soybean-growing states of Brazil. The meteorological data of these seven states were pooled and the years 1975 to 1980 were used to model since there was no technological trend in the yields during these years. Predictor variables were derived from monthly total precipitation and monthly average temperature.
Estimating heating times of wood boards, square timbers, and logs in saturated steam by multiple regression

Treesearch

William T. Simpson

2006-01-01

Heat sterilization is used to kill insects and fungi in wood being traded internationally. Determining the time required to reach the kill temperature is difficult considering the many variables that can affect it, such as heating temperature, target center temperature, initial wood temperature, wood configuration dimensions, specific gravity, and moisture content. In...
Examining the Influence of Selected Factors on Perceived Co-Op Work-Term Quality from a Student Perspective

ERIC Educational Resources Information Center

Drewery, David; Nevison, Colleen; Pretti, T. Judene; Cormier, Lauren; Barclay, Sage; Pennaforte, Antoine

2016-01-01

This study discusses and tests a conceptual model of co-op work-term quality from a student perspective. Drawing from an earlier exploration of co-op students' perceptions of work-term quality, variables related to role characteristics, interpersonal dynamics, and organizational elements were used in a multiple linear regression analysis to…
IQ at Age Four in Relation to Maternal Alcohol Use and Smoking during Pregnancy.

ERIC Educational Resources Information Center

Streissguth, Ann Pytkowicz; And Others

1989-01-01

Multiple regression analyses on data from 421 children indicated that mother's use of more than 1.5 ounces (approximately three drinks) of alcohol per day during pregnancy was significantly related to average IQ decrement at four years of age of almost five IQ points even after adjustment for numerous variables. Readers cautioned against using…
The Multidimensionality of Multicultural Service Learning: The Variable Effects of Social Identity, Context and Pedagogy on Pre-Service Teachers' Learning

ERIC Educational Resources Information Center

Chang, Shih-pei; Anagnostopoulos, Dorothea; Omae, Hilda

2011-01-01

Multicultural service learning (MSL) seeks to develop pre-service teachers' capacities and commitment to teach diverse student populations. We use multiple regression analyses of survey data collected from 212 pre-service teachers engaged in 22 MSL sites to assess the effects of pre-service teachers' social identities, MSL contexts, and university…
Harmonic regression of Landsat time series for modeling attributes from national forest inventory data

NASA Astrophysics Data System (ADS)

Wilson, Barry T.; Knight, Joseph F.; McRoberts, Ronald E.

2018-03-01

Imagery from the Landsat Program has been used frequently as a source of auxiliary data for modeling land cover, as well as a variety of attributes associated with tree cover. With ready access to all scenes in the archive since 2008 due to the USGS Landsat Data Policy, new approaches to deriving such auxiliary data from dense Landsat time series are required. Several methods have previously been developed for use with finer temporal resolution imagery (e.g. AVHRR and MODIS), including image compositing and harmonic regression using Fourier series. The manuscript presents a study, using Minnesota, USA during the years 2009-2013 as the study area and timeframe. The study examined the relative predictive power of land cover models, in particular those related to tree cover, using predictor variables based solely on composite imagery versus those using estimated harmonic regression coefficients. The study used two common non-parametric modeling approaches (i.e. k-nearest neighbors and random forests) for fitting classification and regression models of multiple attributes measured on USFS Forest Inventory and Analysis plots using all available Landsat imagery for the study area and timeframe. The estimated Fourier coefficients developed by harmonic regression of tasseled cap transformation time series data were shown to be correlated with land cover, including tree cover. Regression models using estimated Fourier coefficients as predictor variables showed a two- to threefold increase in explained variance for a small set of continuous response variables, relative to comparable models using monthly image composites. Similarly, the overall accuracies of classification models using the estimated Fourier coefficients were approximately 10-20 percentage points higher than the models using the image composites, with corresponding individual class accuracies between six and 45 percentage points higher.
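As a rough illustration of the harmonic regression idea mentioned in this abstract, the sketch below fits a truncated Fourier series to an irregularly sampled time series by ordinary least squares. It is not the authors' implementation: the data are synthetic, and the function names, the annual period of 365.25 days, and the choice of two harmonics are assumptions made only for the example.

```python
import numpy as np

def harmonic_design(t, period, n_harmonics):
    """Design matrix: an intercept plus one cos/sin pair per harmonic."""
    cols = [np.ones_like(t, dtype=float)]
    for k in range(1, n_harmonics + 1):
        angle = 2.0 * np.pi * k * t / period
        cols.append(np.cos(angle))
        cols.append(np.sin(angle))
    return np.column_stack(cols)

def fit_harmonic(t, y, period=365.25, n_harmonics=2):
    """Least-squares estimate of the Fourier coefficients."""
    X = harmonic_design(t, period, n_harmonics)
    coef, *_ = np.linalg.lstsq(X, y, rcond=None)
    return coef

# Synthetic, irregularly spaced observation dates (days) over five years.
rng = np.random.default_rng(0)
t = np.sort(rng.uniform(0.0, 5 * 365.25, size=120))

# Hypothetical "true" coefficients: mean, then (cos, sin) for each harmonic.
true_coef = np.array([0.30, 0.15, -0.05, 0.02, 0.01])
y = harmonic_design(t, 365.25, 2) @ true_coef + rng.normal(0.0, 0.01, size=t.size)

coef = fit_harmonic(t, y)
print(np.round(coef, 3))
```

In a workflow like the one described, the estimated coefficients for each pixel would then serve as predictor variables in a downstream model, in place of monthly composite values.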
Motor excitability measurements: the influence of gender, body mass index, age and temperature in healthy controls.

PubMed

Casanova, I; Diaz, A; Pinto, S; de Carvalho, M

2014-04-01

The technique of threshold tracking to test axonal excitability gives information about nodal and internodal ion channel function. We aimed to investigate variability of the motor excitability measurements in healthy controls, taking into account age, gender, body mass index (BMI) and small changes in skin temperature. We examined the left median nerve of 47 healthy controls using the automated threshold-tracking program, QTRAC. Statistical multiple regression analysis was applied to test the relationship between nerve excitability measurements and subject variables. Comparisons between genders did not find any significant difference (P>0.2 for all comparisons). Multiple regression analysis showed that motor amplitude decreases with age and temperature, stimulus-response slope decreases with age and BMI, and that accommodation half-time decreases with age and temperature. The changes related to demographic features on TRONDE protocol parameters are small and less important than in conventional nerve conduction studies. Nonetheless, our results underscore the relevance of careful temperature control, and indicate that interpretation of stimulus-response slope and accommodation half-time should take into account age and BMI. In contrast, gender is not of major relevance to axonal threshold findings in motor nerves. Copyright © 2014 Elsevier Masson SAS. All rights reserved.
Can we "predict" long-term outcome for ambulatory transcutaneous electrical nerve stimulation in patients with chronic pain?

PubMed

Köke, Albère J; Smeets, Rob J E M; Perez, Roberto S; Kessels, Alphons; Winkens, Bjorn; van Kleef, Maarten; Patijn, Jacob

2015-03-01

Evidence for effectiveness of transcutaneous electrical nerve stimulation (TENS) is still inconclusive. As heterogeneity of chronic pain patients might be an important factor for this lack of efficacy, identifying factors for a successful long-term outcome is of great importance. A prospective study was performed to identify variables with potential predictive value for 2 outcome measures on long term (6 months); (1) continuation of TENS, and (2) a minimally clinical important pain reduction of ≥ 33%. At baseline, a set of risk factors including pain-related variables, psychological factors, and disability was measured. In a multiple logistic regression analysis, higher patient's expectations, neuropathic pain, no severe pain (< 80 mm visual analogue scale [VAS]) were independently related to long-term continuation of TENS. For the outcome "minimally clinical important pain reduction," the multiple logistic regression analysis indicated that no multisited pain (> 2 pain locations) and intermittent pain were positively and independently associated with a minimally clinical important pain reduction of ≥ 33%. The results showed that factors associated with a successful outcome in the long term are dependent on definition of successful outcome. © 2014 World Institute of Pain.
Factors associated with preventable infant death: a multiple logistic regression.

PubMed

Vidal E Silva, Sandra Maria Cunha; Tuon, Rogério Antonio; Probst, Livia Fernandes; Gondinho, Brunna Verna Castro; Pereira, Antonio Carlos; Meneghim, Marcelo de Castro; Cortellazzi, Karine Laura; Ambrosano, Glaucia Maria Bovi

2018-01-01

OBJECTIVE To identify and analyze factors associated with preventable child deaths. METHODS This analytical cross-sectional study had preventable child mortality as dependent variable. From a population of 34,284 live births, we have selected a systematic sample of 4,402 children who did not die compared to 272 children who died from preventable causes during the period studied. The independent variables were analyzed in four hierarchical blocks: sociodemographic factors, the characteristics of the mother, prenatal and delivery care, and health conditions of the patient and neonatal care. We performed a descriptive statistical analysis and estimated multiple hierarchical logistic regression models. RESULTS Approximately 35.3% of the deaths could have been prevented with the early diagnosis and treatment of diseases during pregnancy and 26.8% of them could have been prevented with better care conditions for pregnant women. CONCLUSIONS The following characteristics of the mother are determinant for the higher mortality of children before the first year of life: living in neighborhoods with an average family income lower than four minimum wages, being aged ≤ 19 years, having one or more living children, having a child with low APGAR level at the fifth minute of life, and having a child with low birth weight.
Dental calculus is associated with death from heart infarction.

PubMed

Söder, Birgitta; Meurman, Jukka H; Söder, Per-Östen

2014-01-01

We studied whether the amount of dental calculus is associated with death from heart infarction in the dental infection-atherosclerosis paradigm. Participants were 1676 healthy young Swedes followed up from 1985 to 2011. At the beginning of the study all subjects underwent oral clinical examination including dental calculus registration scored with calculus index (CI). Outcome measure was cause of death classified according to WHO International Classification of Diseases. Unpaired t-test, Chi-square tests, and multiple logistic regressions were used. Of the 1676 participants, 2.8% had died during follow-up. Women died at a mean age of 61.5 years and men at 61.7 years. The difference in the CI index score between the survivors versus deceased patients was significant by the year 2009 (P < 0.01). In multiple regression analysis of the relationship between death from heart infarction as a dependent variable and CI as independent variable, controlling for age, gender, dental visits, dental plaque, periodontal pockets, education, income, socioeconomic status, and pack-years of smoking, CI score appeared to be associated with 2.3 times the odds ratio for cardiac death. The results confirmed our study hypothesis by showing that dental calculus is indeed statistically associated with cardiac death due to infarction.
[Breast feeding and systemic blood pressure in infants].

PubMed

Hernández-González, Martha A; Díaz-De-León, Luz V; Guízar-Mendoza, Juan M; Amador-Licona, Norma; Cipriano-González, Marisol; Díaz-Pérez, Raúl; Murillo-Ortiz, Blanca O; De-la-Roca-Chiapas, José María; Solorio-Meza, Sergio Eduardo

2012-01-01

Blood pressure levels in childhood influence these levels in adulthood, and breastfeeding has been considered cardioprotective. We evaluated the association between blood pressure levels and feeding type in a group of infants. We conducted a comparative cross-sectional study in term infants with appropriate weight at birth, to compare blood pressure levels in those children with exclusive breastfeeding, mixed feeding and formula feeding. The comparison of groups was performed using ANOVA, and multiple regression analysis was used to identify variables associated with mean arterial blood pressure levels. A p value < 0.05 was considered significant. We included 20 men and 24 women per group. The formula-feeding group had higher current weight and weight gain compared with the other two groups (p < 0.05). Systolic, diastolic and mean blood pressure levels, as well as respiratory and heart rate, were higher in the exclusive formula-feeding and mixed-feeding groups than in the exclusive breastfeeding group (p < 0.05). Multiple regression analysis identified that variables associated with mean blood pressure levels were current body mass index, weight gain and formula feeding. Breastfed infants show lower blood pressure, BMI and weight gain.
Potential suicide ideation and its association with observing bullying at school.

PubMed

Rivers, Ian; Noret, Nathalie

2013-07-01

To explore those contextual factors that predict potential suicide ideation among students who observe bullying at school. A total of 1,592 students, of whom 1,009 reported having observed bullying at school, were surveyed from 14 secondary schools in the North of England. Role-related (not-involved, victim, perpetrator, 'bully-victim' and observer) and gender-wise comparisons of key variables were undertaken prior to hierarchical multiple regressions to determine those associated with potential suicide ideation. Analyses indicated that students who observed bullying behavior were significantly more likely than those not involved in bullying to report symptoms of interpersonal sensitivity, and to indicate greater helplessness and potential suicide ideation. Hierarchical multiple regression analyses indicated that, among boys, helplessness (β = .48, p < .001) followed by frequency of bullying perpetration (β = .11, p < .001), and a less supportive home climate (β = -.10, p < .004) were associated with potential suicide ideation. Helplessness was found to be the only variable associated with potential suicide ideation among girls (β = .49, p < .001). Perceived helplessness is significantly associated with potential suicide ideation among students who observe bullying at school. Copyright © 2013 Society for Adolescent Health and Medicine. Published by Elsevier Inc. All rights reserved.

Resource utilization in home health care: results of a prospective study.

PubMed

Trisolini, M G; Thomas, C P; Cashman, S B; Payne, S M

1994-01-01

Resource utilization in home health care has become an issue of concern due to rising costs and recent initiatives to develop prospective payment systems for home health care. A number of issues remain unresolved for the development of prospective reimbursement in this sector, including the types of variables to be included as payment variables and appropriate measures of resource use. This study supplements previous work on home health case-mix by analyzing the factors affecting one aspect of resource use for skilled nursing visits--visit length--and explores the usefulness of several specially collected variables which are not routinely available in administrative records. A data collection instrument was developed with a focus group of skilled nurses, identifying a range of variables hypothesized to affect visit length. Five categories of variables were studied using multiple regression analysis: provider-related; patient's socio-economic status; patient's clinical status; patient's support services; and visit-specific. The final regression model identifies 9 variables which significantly affect visit time. Five of the 9 are visit-specific variables, a significant finding since these are not routinely collected. Case-mix systems which include visit time as a measure of resource use will need to investigate visit-specific variables, as this study indicates they could have the largest influence on visit time. Two other types of resources used in home health care, supplies and security drivers, were also investigated in less detail.
Problems with change in R² as applied to theory of reasoned action research.

PubMed

Trafimow, David

2004-12-01

The paradigm of choice for theory of reasoned action research seems to depend largely on the notion of change in variance accounted for (ΔR²) as new independent variables are added to a multiple regression equation. If adding a particular independent variable of interest increases the variance in the dependent variable that can be accounted for by the list of independent variables, then the research is deemed to be 'successful', and the researcher is considered to have made a convincing argument about the importance of the new variable. In contrast to this trend, I present arguments that suggest serious problems with the paradigm, and conclude that studies on attitude-behaviour relations would advance the field of psychology to a far greater extent if researchers abandoned it.
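For readers unfamiliar with the quantity under discussion, the following minimal sketch shows how a change in R² is typically computed when a predictor is added to a nested regression. The data are simulated and the variable names (`attitude`, `subj_norm`, `intention`) are hypothetical labels chosen for illustration; nothing here reproduces the author's analysis or argument.

```python
import numpy as np

def r_squared(X, y):
    """R^2 of an ordinary least-squares fit of y on X (intercept added here)."""
    X1 = np.column_stack([np.ones(len(y)), X])
    coef, *_ = np.linalg.lstsq(X1, y, rcond=None)
    resid = y - X1 @ coef
    ss_res = float(resid @ resid)
    ss_tot = float(((y - y.mean()) ** 2).sum())
    return 1.0 - ss_res / ss_tot

# Simulated data with correlated predictors (illustrative only).
rng = np.random.default_rng(1)
n = 200
attitude = rng.normal(size=n)
subj_norm = 0.5 * attitude + rng.normal(size=n)
intention = 0.6 * attitude + 0.3 * subj_norm + rng.normal(size=n)

# Reduced model: attitude only. Full model: attitude plus subjective norm.
r2_reduced = r_squared(attitude.reshape(-1, 1), intention)
r2_full = r_squared(np.column_stack([attitude, subj_norm]), intention)
delta_r2 = r2_full - r2_reduced

print(f"R2 reduced = {r2_reduced:.3f}, R2 full = {r2_full:.3f}, delta = {delta_r2:.3f}")
```

Because the two predictors are correlated in this simulation, the increment credited to the second one depends on the order of entry, which is one reason the quantity can be hard to interpret as a measure of importance.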
Broad-scale adaptive genetic variation in alpine plants is driven by temperature and precipitation

PubMed Central

MANEL, STÉPHANIE; GUGERLI, FELIX; THUILLER, WILFRIED; ALVAREZ, NADIR; LEGENDRE, PIERRE; HOLDEREGGER, ROLF; GIELLY, LUDOVIC; TABERLET, PIERRE

2014-01-01

Identifying adaptive genetic variation is a challenging task, in particular in non-model species for which genomic information is still limited or absent. Here, we studied distribution patterns of amplified fragment length polymorphisms (AFLPs) in response to environmental variation, in 13 alpine plant species consistently sampled across the entire European Alps. Multiple linear regressions were performed between AFLP allele frequencies per site as dependent variables and two categories of independent variables, namely Moran’s eigenvector map MEM variables (to account for spatial and unaccounted environmental variation, and historical demographic processes) and environmental variables. These associations allowed the identification of 153 loci of ecological relevance. Univariate regressions between allele frequency and each environmental factor further showed that loci of ecological relevance were mainly correlated with MEM variables. We found that precipitation and temperature were the best environmental predictors, whereas topographic factors were rarely involved in environmental associations. Climatic factors, subject to rapid variation as a result of the current global warming, are known to strongly influence the fate of alpine plants. Our study shows, for the first time for a large number of species, that the same environmental variables are drivers of plant adaptation at the scale of a whole biome, here the European Alps. PMID:22680783
Regression modeling of ground-water flow

USGS Publications Warehouse

Cooley, R.L.; Naff, R.L.

1985-01-01

Nonlinear multiple regression methods are developed to model and analyze groundwater flow systems. Complete descriptions of regression methodology as applied to groundwater flow models allow scientists and engineers engaged in flow modeling to apply the methods to a wide range of problems. Organization of the text proceeds from an introduction that discusses the general topic of groundwater flow modeling, to a review of basic statistics necessary to properly apply regression techniques, and then to the main topic: exposition and use of linear and nonlinear regression to model groundwater flow. Statistical procedures are given to analyze and use the regression models. A number of exercises and answers are included to exercise the student on nearly all the methods that are presented for modeling and statistical analysis. Three computer programs implement the more complex methods. These three are a general two-dimensional, steady-state regression model for flow in an anisotropic, heterogeneous porous medium, a program to calculate a measure of model nonlinearity with respect to the regression parameters, and a program to analyze model errors in computed dependent variables such as hydraulic head. (USGS)
Uni- and multi-variable modelling of flood losses: experiences gained from the Secchia river inundation event.

NASA Astrophysics Data System (ADS)

Carisi, Francesca; Domeneghetti, Alessio; Kreibich, Heidi; Schröter, Kai; Castellarin, Attilio

2017-04-01

Flood risk is a function of flood hazard and vulnerability; therefore, its accurate assessment depends on a reliable quantification of both factors. The scientific literature proposes a number of objective and reliable methods for assessing flood hazard, yet it highlights a limited understanding of the fundamental damage processes. Loss modelling is associated with large uncertainty which is, among other factors, due to a lack of standard procedures; for instance, flood losses are often estimated based on damage models derived in completely different contexts (i.e. different countries or geographical regions) without checking their applicability, or by considering only one explanatory variable (typically water depth). We consider the Secchia river flood event of January 2014, when a sudden levee breach caused the inundation of nearly 200 km² in Northern Italy. In the aftermath of this event, local authorities collected flood loss data, together with additional information on affected private households and industrial activities (e.g. building surface area and economic value, number of company employees and others). Based on these data we implemented and compared a quadratic-regression damage function, with water depth as the only explanatory variable, and a multi-variable model that combines multiple regression trees and considers several explanatory variables (i.e. bagging decision trees). Our results show the importance of data collection, revealing that (1) a simple quadratic regression damage function based on empirical data from the study area can be significantly more accurate than literature damage models derived for a different context and (2) multi-variable modelling may outperform the uni-variable approach, yet it is more difficult to develop and apply due to a much higher demand for detailed data.
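The uni-variable approach described in this abstract can be sketched as a second-degree polynomial fit of loss against water depth. The example below is only an illustration on synthetic data: the depth range, the coefficients used to simulate relative loss, and the variable names are all assumptions, and it does not attempt the bagged regression-tree model that the authors compare against.

```python
import numpy as np

# Synthetic observations (hypothetical): water depth in metres and
# relative loss (fraction of building value), with measurement noise.
rng = np.random.default_rng(2)
depth_m = rng.uniform(0.0, 2.5, size=150)
rel_loss = (0.05 + 0.20 * depth_m - 0.03 * depth_m**2
            + rng.normal(0.0, 0.02, size=150))

# Quadratic regression: coefficients are returned highest power first.
coeffs = np.polyfit(depth_m, rel_loss, deg=2)
damage = np.poly1d(coeffs)

# In-sample root-mean-square error of the fitted damage function.
pred = damage(depth_m)
rmse = float(np.sqrt(np.mean((rel_loss - pred) ** 2)))

print("coefficients (depth^2, depth, const):", np.round(coeffs, 3))
print("RMSE:", round(rmse, 4))
```

A fair comparison with a multi-variable model would evaluate both on held-out records rather than in-sample, since flexible models tend to look better on the data they were fitted to.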
The use of generalised additive models (GAM) in dentistry.

PubMed

Helfenstein, U; Steiner, M; Menghini, G

1997-12-01

Ordinary multiple regression and logistic multiple regression are widely applied statistical methods which allow a researcher to 'explain' or 'predict' a response variable from a set of explanatory variables or predictors. In these models it is usually assumed that quantitative predictors such as age enter linearly into the model. During recent years these methods have been further developed to allow more flexibility in the way explanatory variables 'act' on a response variable. The methods are called 'generalised additive models' (GAM). The rigid linear terms characterising the association between response and predictors are replaced in an optimal way by flexible curved functions of the predictors (the 'profiles'). Plotting the 'profiles' allows the researcher to visualise easily the shape by which predictors 'act' over the whole range of values. The method facilitates detection of particular shapes such as 'bumps', 'U-shapes', 'J-shapes', 'threshold values' etc. Information about the shape of the association is not revealed by traditional methods. The shapes of the profiles may be checked by performing a Monte Carlo simulation ('bootstrapping'). After the presentation of the GAM a relevant case study is presented in order to demonstrate application and use of the method. The dependence of caries in primary teeth on a set of explanatory variables is investigated. Since GAMs may not be easily accessible to dentists, this article presents them in an introductory condensed form. It was thought that a nonmathematical summary and a worked example might encourage readers to consider the methods described. GAMs may be of great value to dentists in allowing visualisation of the shape by which predictors 'act' and obtaining a better understanding of the complex relationships between predictors and response.
Quality of life in multiple sclerosis (MS) and role of fatigue, depression, anxiety, and stress: A bicenter study from north of Iran.

PubMed

Salehpoor, Ghasem; Rezaei, Sajjad; Hosseininezhad, Mozaffar

2014-11-01

Although studies have demonstrated significant negative relationships between quality of life (QOL), fatigue, and the most common psychological symptoms (depression, anxiety, stress), the main ambiguity of previous studies on QOL is in the relative importance of these predictors. Also, there is lack of adequate knowledge about the actual contribution of each of them in the prediction of QOL dimensions. Thus, the main objective of this study is to assess the role of fatigue, depression, anxiety, and stress in relation to QOL of multiple sclerosis (MS) patients. One hundred and sixty-two MS patients completed the questionnaire on demographic variables, and then they were evaluated by the Persian versions of Short-Form Health Survey Questionnaire (SF-36), Fatigue Survey Scale (FSS), and Depression, Anxiety, Stress Scale-21 (DASS-21). Data were analyzed by Pearson correlation coefficient and hierarchical regression. Correlation analysis showed a significant relationship between QOL elements in SF-36 (physical component summary and mental component summary) and depression, fatigue, stress, and anxiety (P < 0.01). Hierarchical regression analysis indicated that among the predictor variables in the final step, fatigue, depression, and anxiety were identified as the physical component summary predictor variables. Anxiety was found to be the most powerful predictor variable amongst all (β = -0.46, P < 0.001). Furthermore, results have shown depression as the only significant mental component summary predictor variable (β = -0.39, P < 0.001). This study has highlighted the role of anxiety, fatigue, and depression in physical dimensions and the role of depression in psychological dimensions of the lives of MS patients. In addition, the findings of this study indirectly suggest that psychological interventions for reducing fatigue, depression, and anxiety can lead to improved QOL of MS patients.
Poor sleep quality and nightmares are associated with non-suicidal self-injury in adolescents.

PubMed

Liu, Xianchen; Chen, Hua; Bo, Qi-Gui; Fan, Fang; Jia, Cun-Xian

2017-03-01

Non-suicidal self-injury (NSSI) is prevalent and is associated with increased risk of suicidal behavior in adolescents. This study examined which sleep variables are associated with NSSI, independently from demographics and mental health problems in Chinese adolescents. Participants consisted of 2090 students sampled from three high schools in Shandong, China and had a mean age of 15.49 years. Participants completed a sleep and health questionnaire to report their demographic and family information, sleep duration and sleep problems, impulsiveness, hopelessness, internalizing and externalizing problems, and NSSI. A series of regression analyses were conducted to examine the associations between sleep variables and NSSI. Of the sample, 12.6 % reported having ever engaged in NSSI and 8.8 % engaged during the last year. Univariate logistic analyses demonstrated that multiple sleep variables including short sleep duration, insomnia symptoms, poor sleep quality, sleep insufficiency, unrefreshed sleep, sleep dissatisfaction, daytime sleepiness, fatigue, snoring, and nightmares were associated with increased risk of NSSI. After adjusting for demographic and mental health variables, NSSI was significantly associated with sleeping <6 h per night, poor sleep quality, sleep dissatisfaction, daytime sleepiness, and frequent nightmares. Stepwise logistic regression model demonstrated that poor sleep quality (OR = 2.18, 95 % CI = 1.37-3.47) and frequent nightmares (OR = 2.88, 95 % CI = 1.45-5.70) were significantly independently associated with NSSI. In conclusion, while multiple sleep variables are associated with NSSI, poor sleep quality and frequent nightmares are independent risk factors of NSSI. These findings may have important implications for further research of sleep self-harm mechanisms and early detection and prevention of NSSI in adolescents.
Bankfull characteristics of Ohio streams and their relation to peak streamflows

USGS Publications Warehouse

Sherwood, James M.; Huitger, Carrie A.

2005-01-01

Regional curves, simple-regression equations, and multiple-regression equations were developed to estimate bankfull width, bankfull mean depth, bankfull cross-sectional area, and bankfull discharge of rural, unregulated streams in Ohio. The methods are based on geomorphic, basin, and flood-frequency data collected at 50 study sites on unregulated natural alluvial streams in Ohio, of which 40 sites are near streamflow-gaging stations. The regional curves and simple-regression equations relate the bankfull characteristics to drainage area. The multiple-regression equations relate the bankfull characteristics to drainage area, main-channel slope, main-channel elevation index, median bed-material particle size, bankfull cross-sectional area, and local-channel slope. Average standard errors of prediction for bankfull width equations range from 20.6 to 24.8 percent; for bankfull mean depth, 18.8 to 20.6 percent; for bankfull cross-sectional area, 25.4 to 30.6 percent; and for bankfull discharge, 27.0 to 78.7 percent. The simple-regression (drainage-area only) equations have the highest average standard errors of prediction. The multiple-regression equations in which the explanatory variables included drainage area, main-channel slope, main-channel elevation index, median bed-material particle size, bankfull cross-sectional area, and local-channel slope have the lowest average standard errors of prediction. Field surveys were done at each of the 50 study sites to collect the geomorphic data. Bankfull indicators were identified and evaluated, cross-section and longitudinal profiles were surveyed, and bed- and bank-material were sampled. Field data were analyzed to determine various geomorphic characteristics such as bankfull width, bankfull mean depth, bankfull cross-sectional area, bankfull discharge, streambed slope, and bed- and bank-material particle-size distribution. 
The various geomorphic characteristics were analyzed by means of a combination of graphical and statistical techniques. The logarithms of the annual peak discharges for the 40 gaged study sites were fit by a Pearson Type III frequency distribution to develop flood-peak discharges associated with recurrence intervals of 2, 5, 10, 25, 50, and 100 years. The peak-frequency data were related to geomorphic, basin, and climatic variables by multiple-regression analysis. Simple-regression equations were developed to estimate 2-, 5-, 10-, 25-, 50-, and 100-year flood-peak discharges of rural, unregulated streams in Ohio from bankfull channel cross-sectional area. The average standard errors of prediction are 31.6, 32.6, 35.9, 41.5, 46.2, and 51.2 percent, respectively. The study and methods developed are intended to improve understanding of the relations between geomorphic, basin, and flood characteristics of streams in Ohio and to aid in the design of hydraulic structures, such as culverts and bridges, where stability of the stream and structure is an important element of the design criteria. The study was done in cooperation with the Ohio Department of Transportation and the U.S. Department of Transportation, Federal Highway Administration.
Spatial regression analysis on 32 years of total column ozone data

NASA Astrophysics Data System (ADS)

Knibbe, J. S.; van der A, R. J.; de Laat, A. T. J.

2014-08-01

Multiple-regression analyses have been performed on 32 years of total ozone column data that was spatially gridded with a 1 × 1.5° resolution. The total ozone data consist of the MSR (Multi Sensor Reanalysis; 1979-2008) and 2 years of assimilated SCIAMACHY (SCanning Imaging Absorption spectroMeter for Atmospheric CHartographY) ozone data (2009-2010). The two-dimensionality in this data set allows us to perform the regressions locally and investigate spatial patterns of regression coefficients and their explanatory power. Seasonal dependencies of ozone on regressors are included in the analysis. A new physically oriented model is developed to parameterize stratospheric ozone. Ozone variations on nonseasonal timescales are parameterized by explanatory variables describing the solar cycle, stratospheric aerosols, the quasi-biennial oscillation (QBO), El Niño-Southern Oscillation (ENSO) and stratospheric alternative halogens which are parameterized by the effective equivalent stratospheric chlorine (EESC). For several explanatory variables, seasonally adjusted versions of these explanatory variables are constructed to account for the difference in their effect on ozone throughout the year. To account for seasonal variation in ozone, explanatory variables describing the polar vortex, geopotential height, potential vorticity and average day length are included. Results of this regression model are compared to that of a similar analysis based on a more commonly applied statistically oriented model. The physically oriented model provides spatial patterns in the regression results for each explanatory variable. 
The EESC has a significant depleting effect on ozone at mid- and high latitudes, the solar cycle affects ozone positively mostly in the Southern Hemisphere, stratospheric aerosols affect ozone negatively at high northern latitudes, the effect of QBO is positive and negative in the tropics and mid- to high latitudes, respectively, and ENSO affects ozone negatively between 30° N and 30° S, particularly over the Pacific. The contribution of explanatory variables describing seasonal ozone variation is generally large at mid- to high latitudes. We observe ozone increases with potential vorticity and day length and ozone decreases with geopotential height and variable ozone effects due to the polar vortex in regions to the north and south of the polar vortices. Recovery of ozone is identified globally. However, recovery rates and uncertainties strongly depend on choices that can be made in defining the explanatory variables. The application of several trend models, each with their own pros and cons, yields a large range of recovery rate estimates. Overall these results suggest that care has to be taken in determining ozone recovery rates, in particular for the Antarctic ozone hole.
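The idea of running the regression locally in each grid cell, as described in the record above, can be sketched as follows. This is a minimal illustration using plain ordinary least squares with NumPy; it is not the authors' physically oriented model, it ignores autocorrelation and seasonal adjustment of the regressors, and the function name and array layout are assumptions made for the example.

```python
import numpy as np

def fit_gridded_regression(y, X):
    """Fit an ordinary least-squares regression independently in every grid cell.

    y : array (n_time, n_lat, n_lon), the response (e.g. total ozone column).
    X : array (n_time, n_regressors), explanatory series shared by all cells
        (hypothetical stand-ins for solar cycle, QBO, ENSO, EESC, ...).
    Returns (coef, r2): coef has shape (n_regressors + 1, n_lat, n_lon) with
    the intercept first; r2 has shape (n_lat, n_lon).
    """
    n_time, n_lat, n_lon = y.shape
    design = np.column_stack([np.ones(n_time), X])   # add intercept column
    flat = y.reshape(n_time, -1)                     # one column per grid cell
    coef, *_ = np.linalg.lstsq(design, flat, rcond=None)
    resid = flat - design @ coef
    ss_res = (resid ** 2).sum(axis=0)
    ss_tot = ((flat - flat.mean(axis=0)) ** 2).sum(axis=0)
    r2 = 1.0 - ss_res / ss_tot                       # explanatory power per cell
    return coef.reshape(-1, n_lat, n_lon), r2.reshape(n_lat, n_lon)
```

Mapping each slice of `coef` over latitude and longitude gives the kind of spatial pattern of regression coefficients the abstract refers to, and `r2` gives the corresponding map of explanatory power.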
Prediction of rectal temperature using non-invasive physiologic variable measurements in hair pregnant ewes subjected to natural conditions of heat stress.

PubMed

Vicente-Pérez, Ricardo; Avendaño-Reyes, Leonel; Mejía-Vázquez, Ángel; Álvarez-Valenzuela, F Daniel; Correa-Calderón, Abelardo; Mellado, Miguel; Meza-Herrera, Cesar A; Guerra-Liera, Juan E; Robinson, P H; Macías-Cruz, Ulises

2016-01-01

Rectal temperature (RT) is the foremost physiological variable indicating if an animal is suffering hyperthermia. However, this variable is traditionally measured by invasive methods, which may compromise animal welfare. Models to predict RT have been developed for growing pigs and lactating dairy cows, but not for pregnant heat-stressed ewes. Our aim was to develop a prediction equation for RT using non-invasive physiological variables in pregnant ewes under heat stress. A total of 192 records of respiratory frequency (RF) and hair coat temperature in various body regions (i.e., head, rump, flank, shoulder, and belly) obtained from 24 Katahdin × Pelibuey pregnant multiparous ewes were collected during the last third of gestation (i.e., d 100 to lambing) with a 15 d sampling interval. Hair coat temperatures were taken using infrared thermal imaging technology. Initially, a Pearson correlation analysis examined the relationship among variables, and then multiple linear regression analysis was used to develop the prediction equations. All predictor variables were positively correlated (P<0.01; r=0.59-0.67) with RT. The adjusted equation which best predicted RT (P<0.01; adjusted R²=56.15%; CV=0.65%) included as predictors RF and head and belly temperatures. Comparison of predicted and observed values for RT indicates a suitable agreement (P<0.01) between them with moderate accuracy (adjusted R²=56.15%) when RT was calculated with the adjusted equation. In general, the final equation does not violate any assumption of multiple regression analysis. The RT in heat-stressed pregnant ewes can be predicted with an adequate accuracy using non-invasive physiologic variables, and the final equation was: RT=35.57+0.004 (RF)+0.067 (head temperature)+0.028 (belly temperature). Copyright © 2015 Elsevier Ltd. All rights reserved.
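The final equation reported in the record above can be evaluated directly. The sketch below simply encodes the published coefficients; the function and argument names are hypothetical, the units (breaths per minute, degrees Celsius) are an assumption not stated in the abstract, and the second temperature term is read here as head temperature, matching the predictors named in the abstract.

```python
def predict_rectal_temperature(rf, head_temp, belly_temp):
    """Evaluate RT = 35.57 + 0.004*RF + 0.067*head + 0.028*belly.

    rf         : respiratory frequency (assumed breaths per minute)
    head_temp  : hair coat temperature of the head (assumed degrees Celsius)
    belly_temp : hair coat temperature of the belly (assumed degrees Celsius)
    """
    return 35.57 + 0.004 * rf + 0.067 * head_temp + 0.028 * belly_temp
```

Because the adjusted R² is about 56%, such a prediction should be read as a rough estimate rather than a substitute for a direct measurement.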
Proposing a Tentative Cut Point for the Compulsive Sexual Behavior Inventory

PubMed Central

Storholm, Erik David; Fisher, Dennis G.; Napper, Lucy E.; Reynolds, Grace L.

2015-01-01

Bivariate analyses were utilized in order to identify the relations between scores on the Compulsive Sexual Behavior Inventory (CSBI) and self-report of risky sexual behavior and drug abuse among 482 racially and ethnically diverse men and women. CSBI scores were associated with both risky sexual behavior and drug abuse among a diverse non-clinical sample, thereby providing evidence of criterion-related validity. The variables that demonstrated a high association with the CSBI were subsequently entered into a multiple regression model. Four variables (number of sexual partners in the last 30 days, self-report of trading drugs for sex, having paid for sex, and perceived chance of acquiring HIV) were retained as variables with good model fit. Receiver operating characteristic (ROC) curve analyses were conducted in order to determine the optimal tentative cut point for the CSBI. The four variables retained in the multiple regression model were utilized as exploratory gold standards in order to construct ROC curves. The ROC curves were then compared to one another in order to determine the point that maximized both sensitivity and specificity in the identification of compulsive sexual behavior with the CSBI scale. The current findings suggest that a tentative cut point of 40 may prove clinically useful in discriminating between persons who exhibit compulsive sexual behavior and those who do not. Because of the association between compulsive sexual behavior and HIV, STIs, and drug abuse, it is paramount that a psychometrically sound measure of compulsive sexual behavior is made available to all healthcare professionals working in disease prevention and other areas. PMID:21203814
Third and Fourth Degree Perineal Injury After Vaginal Delivery: Does Race Make a Difference?

PubMed Central

de Silva, Kanoe-Lehua; Tsai, Pai-Jong Stacy; Kon, Leanne M; Kessel, Bruce; Seto, Todd; Kaneshiro, Bliss

2014-01-01

Severe perineal injury (third and fourth degree laceration) at the time of vaginal delivery increases the risk of fecal incontinence, chronic perineal pain, and dyspareunia.1–5 Studies suggest the prevalence of severe perineal injury may vary by racial group.6 The purpose of the current study was to examine rates of severe perineal injury in different Asian and Pacific Islander subgroups. A retrospective cohort study was performed among all patients who had a vaginal delivery at Queens Medical Center in Honolulu, Hawai‘i between January 1, 2002 and December 31, 2003. Demographic and health related variables were obtained for each participant. Maternal race/ethnicity (Japanese, Filipino, Chinese, other Asian, Part-Hawaiian/Hawaiian, Micronesian, other Pacific Islander, Caucasian, multiracial [non-Hawaiian], and other) was self-reported by the patient at the time of admission. The significance of associations between racial/ethnic groups and demographic and health related variables was determined using chi-square tests for categorical variables and analysis of variance for continuous factors. Multiple logistic regression was performed to adjust for potential confounders when examining severe laceration rates. A total of 1842 subjects met inclusion criteria. The proportion of severe perineal lacerations did not differ significantly between racial groups. In the multiple logistic regression analysis, operative vaginal delivery was related to both race and severe perineal laceration. However, despite adjusting for this variable, race was not associated with an increased risk of having a severe laceration (P = .70). The results of this study indicate the risk of severe perineal laceration does not differ based on maternal race/ethnicity. PMID:24660124
Proposing a tentative cut point for the Compulsive Sexual Behavior Inventory.

PubMed

Storholm, Erik David; Fisher, Dennis G; Napper, Lucy E; Reynolds, Grace L; Halkitis, Perry N

2011-12-01

Bivariate analyses were utilized in order to identify the relations between scores on the Compulsive Sexual Behavior Inventory (CSBI) and self-report of risky sexual behavior and drug abuse among 482 racially and ethnically diverse men and women. CSBI scores were associated with both risky sexual behavior and drug abuse among a diverse non-clinical sample, thereby providing evidence of criterion-related validity. The variables that demonstrated a high association with the CSBI were subsequently entered into a multiple regression model. Four variables (number of sexual partners in the last 30 days, self-report of trading drugs for sex, having paid for sex, and perceived chance of acquiring HIV) were retained as variables with good model fit. Receiver operating characteristic (ROC) curve analyses were conducted in order to determine the optimal tentative cut point for the CSBI. The four variables retained in the multiple regression model were utilized as exploratory gold standards in order to construct ROC curves. The ROC curves were then compared to one another in order to determine the point that maximized both sensitivity and specificity in the identification of compulsive sexual behavior with the CSBI scale. The current findings suggest that a tentative cut point of 40 may prove clinically useful in discriminating between persons who exhibit compulsive sexual behavior and those who do not. Because of the association between compulsive sexual behavior and HIV, STIs, and drug abuse, it is paramount that a psychometrically sound measure of compulsive sexual behavior is made available to all healthcare professionals working in disease prevention and other areas.
Prediction of Biological Motion Perception Performance from Intrinsic Brain Network Regional Efficiency

PubMed Central

Wang, Zengjian; Zhang, Delong; Liang, Bishan; Chang, Song; Pan, Jinghua; Huang, Ruiwang; Liu, Ming

2016-01-01

Biological motion perception (BMP) refers to the ability to perceive the moving form of a human figure from a limited amount of stimuli, such as from a few point lights located on the joints of a moving body. BMP is commonplace and important, but there is great inter-individual variability in this ability. This study used multiple regression model analysis to explore the association between BMP performance and intrinsic brain activity, in order to investigate the neural substrates underlying inter-individual variability of BMP performance. The resting-state functional magnetic resonance imaging (rs-fMRI) and BMP performance data were collected from 24 healthy participants, for whom intrinsic brain networks were constructed, and a graph-based network efficiency metric was measured. Then, a multiple linear regression model was used to explore the association between network regional efficiency and BMP performance. We found that the local and global network efficiency of many regions was significantly correlated with BMP performance. Further analysis showed that the local efficiency rather than global efficiency could be used to explain most of the BMP inter-individual variability, and the regions involved were predominately located in the Default Mode Network (DMN). Additionally, discrimination analysis showed that the local efficiency of certain regions such as the thalamus could be used to classify BMP performance across participants. Notably, the association pattern between network nodal efficiency and BMP was different from the association pattern of static directional/gender information perception. Overall, these findings show that intrinsic brain network efficiency may be considered a neural factor that explains BMP inter-individual variability. PMID:27853427
Alterations of papilla dimensions after orthodontic closure of the maxillary midline diastema: a retrospective longitudinal study

PubMed Central

2016-01-01

Purpose: The aim of this study was to evaluate alterations of papilla dimensions after orthodontic closure of the diastema between maxillary central incisors. Methods: Sixty patients who had a visible diastema between maxillary central incisors that had been closed by orthodontic approximation were selected for this study. Various papilla dimensions were assessed on clinical photographs and study models before the orthodontic treatment and at the follow-up examination after closure of the diastema. Influences of the variables assessed before orthodontic treatment on the alterations of papilla height (PH) and papilla base thickness (PBT) were evaluated by univariate regression analysis. To analyze potential influences of the 3-dimensional papilla dimensions before orthodontic treatment on the alterations of PH and PBT, a multiple regression model was formulated including the 3-dimensional papilla dimensions as predictor variables. Results: On average, PH decreased by 0.80 mm and PBT increased after orthodontic closure of the diastema (P<0.01). Univariate regression analysis revealed that the PH (P=0.002) and PBT (P=0.047) before orthodontic treatment influenced the alteration of PH. With respect to the alteration of PBT, the diastema width (P=0.045) and PBT (P=0.000) were found to be influential factors. PBT before the orthodontic treatment significantly influenced the alteration of PBT in the multiple regression model. Conclusions: PH decreased but PBT increased after orthodontic closure of the diastema. The papilla dimensions before orthodontic treatment influenced the alterations of PH and PBT after closure of the diastema. The PBT increased more when the diastema width before the orthodontic treatment was larger. PMID:27382507
Self-reported work ability and work performance in workers with chronic nonspecific musculoskeletal pain.

PubMed

de Vries, Haitze J; Reneman, Michiel F; Groothoff, Johan W; Geertzen, Jan H B; Brouwer, Sandra

2013-03-01

To assess self-reported work ability and work performance of workers who stay at work despite chronic nonspecific musculoskeletal pain (CMP), and to explore which variables were associated with these outcomes. In a cross-sectional study we assessed work ability (Work Ability Index, single item scale 0-10) and work performance (Health and Work Performance Questionnaire, scale 0-10) among 119 workers who continued work while having CMP. Scores of work ability and work performance were categorized into excellent (10), good (9), moderate (8) and poor (0-7). Hierarchical multiple regression and logistic regression analysis was used to analyze the relation of socio-demographic, pain-related, personal- and work-related variables with work ability and work performance. Mean work ability and work performance were 7.1 and 7.7 (poor to moderate). Hierarchical multiple regression analysis revealed that higher work ability scores were associated with lower age, better general health perception, and higher pain self-efficacy beliefs (R² = 42%). Higher work performance was associated with lower age, higher pain self-efficacy beliefs, lower physical work demand category and part-time work (R² = 37%). Logistic regression analysis revealed that work ability ≥8 was significantly explained by age (OR = 0.90), general health perception (OR = 1.04) and pain self-efficacy (OR = 1.15). Work performance ≥8 was explained by pain self-efficacy (OR = 1.11). Many workers with CMP who stay at work report poor to moderate work ability and work performance. Our findings suggest that a subgroup of workers with CMP can stay at work with high work ability and performance, especially when they have high beliefs of pain self-efficacy. Our results further show that not the pain itself, but personal and work-related factors relate to work ability and work performance.
The isoform A of reticulon-4 (Nogo-A) in cerebrospinal fluid of primary brain tumor patients: influencing factors.

PubMed

Koper, Olga Martyna; Kamińska, Joanna; Milewska, Anna; Sawicki, Karol; Mariak, Zenon; Kemona, Halina; Matowicka-Karna, Joanna

2018-05-18

The influence of isoform A of reticulon-4 (Nogo-A), also known as neurite outgrowth inhibitor, on primary brain tumor development has been reported. Therefore, the aim was to evaluate Nogo-A concentrations in cerebrospinal fluid (CSF) and serum of brain tumor patients compared with non-tumoral individuals. All serum results, except for two cases, obtained both in brain tumors and non-tumoral individuals, were below the lower limit of ELISA detection. Cerebrospinal fluid Nogo-A concentrations were significantly lower in primary brain tumor patients compared to non-tumoral individuals. The univariate linear regression analysis found that if white blood cell count increases by 1 × 10³/μL, the mean cerebrospinal fluid Nogo-A concentration value decreases 1.12 times. In the multiple linear regression model, the predictor variables influencing cerebrospinal fluid Nogo-A concentrations were diagnosis, sex, and sodium level. The mean cerebrospinal fluid Nogo-A concentration value was 1.9 times higher for women in comparison to men. In the astrocytic brain tumor group, a higher sodium level occurs with lower cerebrospinal fluid Nogo-A concentrations. We found the opposite situation in non-tumoral individuals. Univariate linear regression analysis revealed that cerebrospinal fluid Nogo-A concentrations change in relation to white blood cell count. In the multiple linear regression model, we found that the predictor variables influencing CSF Nogo-A concentrations were diagnosis, sex, and sodium level. Results may be relevant to the search for cerebrospinal fluid biomarkers and potential therapeutic targets in primary brain tumor patients. Nogo-A concentrations were tested by means of enzyme-linked immunosorbent assay (ELISA).
Periodontal disease in Chinese patients with systemic lupus erythematosus.

PubMed

Zhang, Qiuxiang; Zhang, Xiaoli; Feng, Guijaun; Fu, Ting; Yin, Rulan; Zhang, Lijuan; Feng, Xingmei; Li, Liren; Gu, Zhifeng

2017-08-01

Systemic lupus erythematosus (SLE) and periodontal disease (PD) share multiple common characteristics. The aims of the present study were to evaluate the prevalence and severity of periodontal disease in Chinese SLE patients and to determine the association between SLE features and periodontal parameters. A cross-sectional study of 108 SLE patients together with 108 age- and sex-matched healthy controls was conducted. Periodontal status was assessed by two dentists independently. Sociodemographic characteristics, lifestyle factors, medication use, and clinical parameters were also assessed. The periodontal status was significantly worse in SLE patients compared to controls. In univariate logistic regression, SLE patients had a significant 2.78-fold [95% confidence interval (CI) 1.60-4.82] increase in odds of periodontitis compared to healthy controls. Adjusted for potential risk factors, patients with SLE had 13.98-fold (95% CI 5.10-38.33) increased odds compared with controls. In the multiple linear regression model, the independent variable negatively and significantly associated with gingival index was education (P = 0.005); conversely, disease activity (P < 0.001) and plaque index (P = 0.002) were positively associated. Age was the only variable independently associated with periodontitis in SLE patients in multivariate logistic regression (OR 1.348; 95% CI: 1.183-1.536, P < 0.001). Chinese SLE patients had higher odds of PD. These findings confirmed the importance of early interventions in combination with medical therapy. Close collaboration between dentists and clinicians is necessary when treating these patients.
Multiple Ordinal Regression by Maximizing the Sum of Margins

PubMed Central

Hamsici, Onur C.; Martinez, Aleix M.

2016-01-01

Human preferences are usually measured using ordinal variables. A system whose goal is to estimate the preferences of humans and their underlying decision mechanisms must learn the ordering of any given sample set. We consider the solution of this ordinal regression problem using a Support Vector Machine algorithm. Specifically, the goal is to learn a set of classifiers with common direction vectors and different biases correctly separating the ordered classes. Current algorithms are either required to solve a quadratic optimization problem, which is computationally expensive, or are based on maximizing the minimum margin (i.e., a fixed margin strategy) between a set of hyperplanes, which biases the solution to the closest margin. Another drawback of these strategies is that they are limited to ordering the classes using a single ranking variable (e.g., perceived length). In this paper, we define a multiple ordinal regression algorithm based on maximizing the sum of the margins between every consecutive class with respect to one or more rankings (e.g., perceived length and weight). We provide derivations of an efficient, easy-to-implement iterative solution using a Sequential Minimal Optimization procedure. We demonstrate the accuracy of our solutions in several datasets. In addition, we provide a key application of our algorithms in estimating human subjects’ ordinal classification of attribute associations to object categories. We show that these ordinal associations perform better than the binary one typically employed in the literature. PMID:26529784

Maternal overprotection score of the Parental Bonding Instrument predicts the outcome of cognitive behavior therapy by trainees for depression.

PubMed

Asano, Motoshi; Esaki, Kosei; Wakamatsu, Aya; Kitajima, Tomoko; Narita, Tomohiro; Naitoh, Hiroshi; Ozaki, Norio; Iwata, Nakao

2013-07-01

The purpose of this study was to predict the outcome of cognitive behavior therapy (CBT) by trainees for major depressive disorder (MDD) based on the Parental Bonding Instrument (PBI). The hypothesis was that the higher level of care and/or lower level of overprotection score would predict a favorable outcome of CBT by trainees. The subjects were all outpatients with MDD treated with CBT as a training case. All the subjects were asked to fill out the Japanese version of the PBI before commencing the course of psychotherapy. The difference between the first and the last Beck Depression Inventory (BDI) score was used to represent the improvement of the intensity of depression by CBT. In order to predict improvement (the difference of the BDI scores) as the objective variable, multiple regression analysis was performed using maternal overprotection score and baseline BDI score as the explanatory variables. The multiple regression model was significant (P = 0.0026) and partial regression coefficient for the maternal overprotection score and the baseline BDI was -0.73 (P = 0.0046) and 0.88 (P = 0.0092), respectively. Therefore, when a patient's maternal overprotection score of the PBI was lower, a better outcome of CBT was expected. The hypothesis was partially supported. This result would be useful in determining indications for CBT by trainees for patients with MDD. © 2013 The Authors. Psychiatry and Clinical Neurosciences © 2013 Japanese Society of Psychiatry and Neurology.
Inflammation, homocysteine and carotid intima-media thickness.

PubMed

Baptista, Alexandre P; Cacdocar, Sanjiva; Palmeiro, Hugo; Faísca, Marília; Carrasqueira, Herménio; Morgado, Elsa; Sampaio, Sandra; Cabrita, Ana; Silva, Ana Paula; Bernardo, Idalécio; Gome, Veloso; Neves, Pedro L

2008-01-01

Cardiovascular disease is the main cause of morbidity and mortality in chronic renal patients. Carotid intima-media thickness (CIMT) is one of the most accurate markers of atherosclerosis risk. In this study, the authors set out to evaluate a population of chronic renal patients to determine which factors are associated with an increase in intima-media thickness. We included 56 patients (F=22, M=34), with a mean age of 68.6 years, and an estimated glomerular filtration rate of 15.8 ml/min (calculated by the MDRD equation). Various laboratory and inflammatory parameters (hsCRP, IL-6 and TNF-alpha) were evaluated. All subjects underwent measurement of internal carotid artery intima-media thickness by high-resolution real-time B-mode ultrasonography using a 10 MHz linear transducer. Intima-media thickness was used as a dependent variable in a simple linear regression model, with the various laboratory parameters as independent variables. Only parameters showing a significant correlation with CIMT were evaluated in a multiple regression model: age (p=0.001), hemoglobin (p=0.03), logCRP (p=0.042), logIL-6 (p=0.004) and homocysteine (p=0.002). In the multiple regression model we found that age (p=0.001) and homocysteine (p=0.027) were independently correlated with CIMT. LogIL-6 did not reach statistical significance (p=0.057), probably due to the small population size. The authors conclude that age and homocysteine correlate with carotid intima-media thickness, and thus can be considered as markers/risk factors in chronic renal patients.
Anodic microbial community diversity as a predictor of the power output of microbial fuel cells.

PubMed

Stratford, James P; Beecroft, Nelli J; Slade, Robert C T; Grüning, André; Avignone-Rossa, Claudio

2014-03-01

The relationship between the diversity of mixed-species microbial consortia and their electrogenic potential in the anodes of microbial fuel cells was examined using different diversity measures as predictors. Identical microbial fuel cells were sampled at multiple time-points. Biofilm and suspension communities were analysed by denaturing gradient gel electrophoresis to calculate the number and relative abundance of species. Shannon and Simpson indices and richness were examined for association with power using bivariate and multiple linear regression, with biofilm DNA as an additional variable. In simple bivariate regressions, the correlation of Shannon diversity of the biofilm and power is stronger (r=0.65, p=0.001) than between power and richness (r=0.39, p=0.076), or between power and the Simpson index (r=0.5, p=0.018). Using Shannon diversity and biofilm DNA as predictors of power, a regression model can be constructed (r=0.73, p<0.001). Ecological parameters such as the Shannon index are predictive of the electrogenic potential of microbial communities. Copyright © 2014 Elsevier Ltd. All rights reserved.
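The diversity measures used as predictors in the record above can be computed from a list of species abundances. The sketch below is a minimal illustration: the function name is hypothetical, natural logarithms are assumed for the Shannon index, and the Simpson index is given in its Gini-Simpson form (one minus the sum of squared proportions), which is only one of several conventions and may differ from the one the authors used.

```python
import math

def diversity_indices(abundances):
    """Return (richness, Shannon index, Gini-Simpson index).

    abundances : non-negative counts or relative intensities, one per species
                 (e.g. band intensities from a gel; an assumption here).
    """
    counts = [a for a in abundances if a > 0]     # ignore absent species
    if not counts:
        raise ValueError("at least one species must have positive abundance")
    total = sum(counts)
    props = [c / total for c in counts]           # relative abundances
    richness = len(props)                         # number of species present
    shannon = -sum(p * math.log(p) for p in props)
    simpson = 1.0 - sum(p * p for p in props)
    return richness, shannon, simpson
```

For a perfectly even community of four species, the Shannon index equals ln 4 and the Gini-Simpson index equals 0.75; both fall to zero when a single species is present.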
Serum alpha-fetoprotein in the three trimesters of pregnancy: effects of maternal characteristics and medical history.

PubMed

Bredaki, F E; Sciorio, C; Wright, A; Wright, D; Nicolaides, K H

2015-07-01

To define the contribution of maternal variables which influence the measured level of maternal serum alpha-fetoprotein (AFP) in screening for pregnancy complications. Maternal characteristics and medical history were recorded and serum AFP was measured in women with a singleton pregnancy attending for three routine hospital visits at 11 + 0 to 13 + 6, 19 + 0 to 24 + 6 and 30 + 0 to 34 + 6 weeks' gestation. For pregnancies delivering phenotypically normal live births or stillbirths ≥ 24 weeks' gestation, variables from maternal demographic characteristics and medical history that are important in the prediction of AFP were determined from a linear mixed-effects multiple regression. Serum AFP was measured in 17 071 cases in the first trimester, 8583 in the second trimester and 8607 in the third trimester. Significant independent contributions to serum AFP were provided by gestational age, maternal weight, racial origin, gestational age at delivery and birth-weight Z-score of the neonate of the previous pregnancy and interpregnancy interval. Cigarette smoking was found to significantly affect serum AFP in the first trimester only. The machine used to measure serum AFP was also found to have a significant effect. Random-effects multiple regression analysis was used to define the contribution of maternal variables that influence the measured level of serum AFP and express the values as multiples of the median (MoMs). The model was shown to provide an adequate fit of MoM values for all covariates, both in pregnancies that developed pre-eclampsia and in those without this pregnancy complication. A model was fitted to express measured serum AFP across the three trimesters of pregnancy as MoMs, after adjusting for variables from maternal characteristics and medical history that affect this measurement. Copyright © 2015 ISUOG. Published by John Wiley & Sons Ltd.
Analysis of threats to research validity introduced by audio recording clinic visits: Selection bias, Hawthorne effect, both, or neither?

PubMed Central

Henry, Stephen G.; Jerant, Anthony; Iosif, Ana-Maria; Feldman, Mitchell D.; Cipri, Camille; Kravitz, Richard L.

2015-01-01

Objective: To identify factors associated with participant consent to record visits; to estimate effects of recording on patient-clinician interactions. Methods: Secondary analysis of data from a randomized trial studying communication about depression; participants were asked for optional consent to audio record study visits. Multiple logistic regression was used to model likelihood of patient and clinician consent. Multivariable regression and propensity score analyses were used to estimate effects of audio recording on 6 dependent variables: discussion of depressive symptoms, preventive health, and depression diagnosis; depression treatment recommendations; visit length; visit difficulty. Results: Of 867 visits involving 135 primary care clinicians, 39% were recorded. For clinicians, only working in academic settings (P=0.003) and having worked longer at their current practice (P=0.02) were associated with increased likelihood of consent. For patients, white race (P=0.002) and diabetes (P=0.03) were associated with increased likelihood of consent. Neither multivariable regression nor propensity score analyses revealed any significant effects of recording on the variables examined. Conclusion: Few clinician or patient characteristics were significantly associated with consent. Audio recording had no significant effect on any dependent variables. Practice Implications: Benefits of recording clinic visits likely outweigh the risks of bias in this setting. PMID:25837372
Post-processing method for wind speed ensemble forecast using wind speed and direction

NASA Astrophysics Data System (ADS)

Sofie Eide, Siri; Bjørnar Bremnes, John; Steinsland, Ingelin

2017-04-01

Statistical methods are widely applied to enhance the quality of both deterministic and ensemble NWP forecasts. In many situations, like wind speed forecasting, most of the predictive information is contained in one variable in the NWP models. However, in statistical calibration of deterministic forecasts it is often seen that including more variables can further improve forecast skill. For ensembles this is rarely exploited, mainly because it is generally not straightforward to include multiple variables. In this study, it is demonstrated how multiple variables can be included in Bayesian model averaging (BMA) by using a flexible regression method for estimating the conditional means. The method is applied to wind speed forecasting at 204 Norwegian stations based on wind speed and direction forecasts from the ECMWF ensemble system. At about 85% of the sites the ensemble forecasts were improved in terms of CRPS by adding wind direction as a predictor compared to using wind speed only. On average the improvements were about 5%, with the gains occurring mainly in moderate to strong wind situations. For weak wind speeds, adding wind direction had a more or less neutral impact.
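The record above scores forecasts with the continuous ranked probability score (CRPS). As a rough illustration of what that score measures, the sketch below computes the CRPS of a raw ensemble treated as an empirical distribution, using the standard identity CRPS = E|X - y| - 0.5 E|X - X'|. It does not reproduce the BMA predictive distribution used in the study, and the function name is an assumption made for the example.

```python
import numpy as np

def ensemble_crps(members, observation):
    """CRPS of an ensemble forecast against a single observation.

    members     : 1-D sequence of ensemble member values (e.g. wind speeds)
    observation : the verifying observed value
    Lower is better; for a one-member ensemble this reduces to absolute error.
    """
    x = np.asarray(members, dtype=float)
    # Mean absolute distance between members and the observation
    term_obs = np.mean(np.abs(x - observation))
    # Half the mean absolute distance between all pairs of members
    term_spread = 0.5 * np.mean(np.abs(x[:, None] - x[None, :]))
    return term_obs - term_spread
```

Averaging this score over many forecast cases, with and without an extra predictor in the calibration, gives the kind of relative comparison summarized in the abstract.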
Estimates of self, parental, and partner multiple intelligence and their relationship with personality, values, and demographic variables: a study in Britain and France.

PubMed

Swami, Viren; Furnham, Adrian; Zilkha, Susan

2009-11-01

In the present study, 151 British and 151 French participants estimated their own, their parents' and their partner's overall intelligence and 13 'multiple intelligences.' In accordance with previous studies, men rated themselves as higher on almost all measures of intelligence, but there were few cross-national differences. There were also important sex differences in ratings of parental and partner intelligence. Participants generally believed they were more intelligent than their parents but not their partners. Regressions indicated that participants believed verbal, logical-mathematical, and spatial intelligence to be the main predictors of intelligence. Regressions also showed that participants' Big Five personality scores (in particular, Extraversion and Openness), but not values or beliefs about intelligence and intelligence tests, were good predictors of intelligence. Results were discussed in terms of the influence of gender-role stereotypes.
Bayesian function-on-function regression for multilevel functional data.

PubMed

Meyer, Mark J; Coull, Brent A; Versace, Francesco; Cinciripini, Paul; Morris, Jeffrey S

2015-09-01

Medical and public health research increasingly involves the collection of complex and high dimensional data. In particular, functional data-where the unit of observation is a curve or set of curves that are finely sampled over a grid-is frequently obtained. Moreover, researchers often sample multiple curves per person resulting in repeated functional measures. A common question is how to analyze the relationship between two functional variables. We propose a general function-on-function regression model for repeatedly sampled functional data on a fine grid, presenting a simple model as well as a more extensive mixed model framework, and introducing various functional Bayesian inferential procedures that account for multiple testing. We examine these models via simulation and a data analysis with data from a study that used event-related potentials to examine how the brain processes various types of images. © 2015, The International Biometric Society.
Anxiety, affect, self-esteem, and stress: mediation and moderation effects on depression.

PubMed

Nima, Ali Al; Rosenberg, Patricia; Archer, Trevor; Garcia, Danilo

2013-01-01

Mediation analysis investigates whether a variable (i.e., mediator) changes in regard to an independent variable, in turn, affecting a dependent variable. Moderation analysis, on the other hand, investigates whether the statistical interaction between independent variables predicts a dependent variable. Although this difference between these two types of analysis is explicit in current literature, there is still confusion with regard to the mediating and moderating effects of different variables on depression. The purpose of this study was to assess the mediating and moderating effects of anxiety, stress, positive affect, and negative affect on depression. Two hundred and two university students (males = 93, females = 113) completed questionnaires assessing anxiety, stress, self-esteem, positive and negative affect, and depression. Mediation and moderation analyses were conducted using techniques based on standard multiple regression and hierarchical regression analyses. The results indicated that (i) anxiety partially mediated the effects of both stress and self-esteem upon depression, (ii) that stress partially mediated the effects of anxiety and positive affect upon depression, (iii) that stress completely mediated the effects of self-esteem on depression, and (iv) that there was a significant interaction between stress and negative affect, and between positive affect and negative affect upon depression. The study highlights different research questions that can be investigated depending on whether researchers decide to use the same variables as mediators and/or moderators.
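Moderation is commonly tested by adding a product term to a multiple regression and inspecting its coefficient. The sketch below illustrates that idea under stated assumptions: the scores are made-up, noise-free values, the variable names are hypothetical, and the small normal-equations solver stands in for whatever statistical software the authors actually used.

```python
def solve_linear_system(a, b):
    """Solve a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    aug = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            factor = aug[r][col] / aug[col][col]
            for c in range(col, n + 1):
                aug[r][c] -= factor * aug[col][c]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        s = sum(aug[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (aug[i][n] - s) / aug[i][i]
    return x


def ols(rows, y):
    """Least-squares coefficients from the normal equations (X'X) b = X'y."""
    k = len(rows[0])
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(k)] for i in range(k)]
    xty = [sum(r[i] * yi for r, yi in zip(rows, y)) for i in range(k)]
    return solve_linear_system(xtx, xty)


# Hypothetical, noise-free scores built with a known interaction of 0.3.
stress = [0.0, 1.0, 2.0, 3.0, 0.0, 1.0, 2.0, 3.0]
neg_affect = [0.0, 0.0, 1.0, 1.0, 2.0, 2.0, 3.0, 3.0]
depression = [1 + 2 * s + 0.5 * n + 0.3 * s * n
              for s, n in zip(stress, neg_affect)]

# Design matrix: intercept, two main effects, and their product term.
design = [[1.0, s, n, s * n] for s, n in zip(stress, neg_affect)]
b0, b_stress, b_neg, b_interaction = ols(design, depression)
print(b_interaction)  # a nonzero product-term coefficient indicates moderation
```

With real data the product-term coefficient would be judged against its standard error, and predictors are often mean-centered first to ease interpretation.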
Using Data Mining for Wine Quality Assessment

NASA Astrophysics Data System (ADS)

Cortez, Paulo; Teixeira, Juliana; Cerdeira, António; Almeida, Fernando; Matos, Telmo; Reis, José

Certification and quality assessment are crucial issues within the wine industry. Currently, wine quality is mostly assessed by physicochemical (e.g., alcohol levels) and sensory (e.g., human expert evaluation) tests. In this paper, we propose a data mining approach to predict wine preferences that is based on easily available analytical tests at the certification step. A large dataset is considered with white vinho verde samples from the Minho region of Portugal. Wine quality is modeled under a regression approach, which preserves the order of the grades. Explanatory knowledge is given in terms of a sensitivity analysis, which measures the response changes when a given input variable is varied through its domain. Three regression techniques were applied, under a computationally efficient procedure that performs simultaneous variable and model selection and that is guided by the sensitivity analysis. The support vector machine achieved promising results, outperforming the multiple regression and neural network methods. Such a model is useful for understanding how physicochemical tests affect the sensory preferences. Moreover, it can support the wine expert evaluations and ultimately improve the production.
Estimating the magnitude of annual peak discharges with recurrence intervals between 1.1 and 3.0 years for rural, unregulated streams in West Virginia

USGS Publications Warehouse

Wiley, Jeffrey B.; Atkins, John T.; Newell, Dawn A.

2002-01-01

Multiple and simple least-squares regression models for the log10-transformed 1.5- and 2-year recurrence intervals of peak discharges with independent variables describing the basin characteristics (log10-transformed and untransformed) for 236 streamflow-gaging stations were evaluated, and the regression residuals were plotted as areal distributions that defined three regions in West Virginia designated as East, North, and South. Regional equations for the 1.1-, 1.2-, 1.3-, 1.4-, 1.5-, 1.6-, 1.7-, 1.8-, 1.9-, 2.0-, 2.5-, and 3-year recurrence intervals of peak discharges were determined by generalized least-squares regression. Log10-transformed drainage area was the most significant independent variable for all regions. Equations developed in this study are applicable only to rural, unregulated streams within the boundaries of West Virginia. The accuracies of estimating equations are quantified by measuring the average prediction error (from 27.4 to 52.4 percent) and equivalent years of record (from 1.1 to 3.4 years).
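Regressing log10-transformed peak discharge on log10-transformed drainage area amounts to fitting a power law. The sketch below shows that transformation with ordinary least squares on made-up basin data. The study's regional equations were fitted by generalized least squares, which this simplified example does not reproduce, and all names and numbers here are hypothetical.

```python
import math


def fit_log10_power_law(areas, discharges):
    """Fit log10(Q) = intercept + slope * log10(A) by ordinary least squares.

    Back-transforming gives Q = 10**intercept * A**slope.
    """
    xs = [math.log10(a) for a in areas]
    ys = [math.log10(q) for q in discharges]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept, slope


# Hypothetical basins: drainage area and a peak discharge following 50 * A**0.7.
areas = [10.0, 50.0, 100.0, 500.0, 1000.0]
discharges = [50.0 * a ** 0.7 for a in areas]

intercept, slope = fit_log10_power_law(areas, discharges)
print(10 ** intercept, slope)  # recovers the coefficient and exponent
```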
Regression equations for disinfection by-products for the Mississippi, Ohio and Missouri rivers

USGS Publications Warehouse

Rathbun, R.E.

1996-01-01

Trihalomethane and nonpurgeable total organic-halide formation potentials were determined for the chlorination of water samples from the Mississippi, Ohio and Missouri Rivers. Samples were collected during the summer and fall of 1991 and the spring of 1992 at twelve locations on the Mississippi from New Orleans to Minneapolis, and on the Ohio and Missouri 1.6 km upstream from their confluences with the Mississippi. Formation potentials were determined as a function of pH, initial free-chlorine concentration, and reaction time. Multiple linear regression analysis of the data indicated that pH, reaction time, and the dissolved organic carbon concentration and/or the ultraviolet absorbance of the water were the most significant variables. The initial free-chlorine concentration had less significance and bromide concentration had little or no significance. Analysis of combinations of the dissolved organic carbon concentration and the ultraviolet absorbance indicated that use of the ultraviolet absorbance alone provided the best prediction of the experimental data. Regression coefficients for the variables were generally comparable to coefficients previously presented in the literature for waters from other parts of the United States.
Multiple correlates of cigarette use among high school students.

PubMed

McDermott, R J; Sarvela, P D; Hoalt, P N; Bajracharya, S M; Marty, P J; Emery, E M

1992-04-01

A cross-sectional survey research design measured factors related to cigarette use among 2,212 senior high school students. Results showed 14.3% of the sample smoked cigarettes at least occasionally, with 5.3% reporting they were daily smokers. About 12.8% indicated they were ex-smokers. Males and females smoked at almost equal rates, and the percentage of 10th grade student smokers was slightly higher (16.4%) than the percentage of juniors and seniors who smoked. Approximately 22% of Hispanic students, 15% of Caucasian students, and 4.5% of African-American students reported smoking cigarettes at least occasionally. An initial regression analysis used 21 variables to predict cigarette smoking. A more parsimonious regression model (R2 = .28), using variables from the initial regression analysis with significance levels of .01 or less, indicated the most important predictors of cigarette use were ethnic group, attitude toward females who smoke, close friends' use of cigarettes, personal use of marijuana, best friend's use of cigarettes, personal use of alcohol, and school self-esteem. Implications for school health programs are addressed.
Cardiovascular Disease Death Before Age 65 in 168 Countries Correlated Statistically with Biometrics, Socioeconomic Status, Tobacco, Gender, Exercise, Macronutrients, and Vitamin K

PubMed Central

Agutter, Paul S

2016-01-01

Background Nutrition researchers recently recognized that deficiency of vitamin K2 (menaquinone: MK-4–MK-13) is widespread and contributes to cardiovascular disease (CVD). The deficiency of vitamin K2 or vitamin K inhibition with warfarin leads to calcium deposition in the arterial blood vessels. Methods Using publicly available sources, we collected food commodity availability data and derived nutrient profiles including vitamin K2 for people from 168 countries. We also collected female and male cohort data on early death from CVD (ages 15–64 years), insufficient physical activity, tobacco, biometric CVD risk markers, socioeconomic risk factors for CVD, and gender. The outcome measures included (1) univariate correlations of early death from CVD with each risk factor, (2) a multiple regression-derived formula relating early death from CVD (dependent variable) to macronutrient profile, vitamin K1 and K2 and other risk factors (independent variables), (3) for each risk factor appearing in the multiple regression formula, the portion of CVD risk attributable to that factor, and (4) similar univariate and multivariate analyses of body mass index (BMI), fasting blood sugar (FBS) (simulated from diabetes prevalence), systolic blood pressure (SBP), and cholesterol/ HDL-C ratio (simulated from serum cholesterol) (dependent variables) and dietary and other risk factors (independent variables). Results Female and male cohorts in countries that have vitamin K2 < 5µg per 2000 kcal/day per capita (n = 70) had about 2.2 times the rate of early CVD deaths as people in countries with > 24 µg/day of vitamin K2 per 2000 kcal/day (n = 72). A multiple regression-derived formula relating early death from CVD to dietary nutrients and other risk factors accounted for about 50% of the variance between cohorts in early CVD death. 
The attributable risks of the variables in the CVD early death formula were: too much alcohol (0.38%), too little vitamin K2 (6.95%), tobacco (6.87%), high blood pressure (9.01%), air pollution (9.15%), early childhood death (3.64%), poverty (7.66%), and male gender (6.13%). Conclusions Worldwide dietary vitamin K2 data derived from food commodities add much understanding to the analysis of CVD risk factors and the etiology of CVD. Vitamin K2 in food products should be systematically quantified. Public health programs should be considered to increase the intake of vitamin K2-containing fermented plant foods such as sauerkraut, miso, and natto. PMID:27688985
Cardiovascular Disease Death Before Age 65 in 168 Countries Correlated Statistically with Biometrics, Socioeconomic Status, Tobacco, Gender, Exercise, Macronutrients, and Vitamin K.

PubMed

Cundiff, David K; Agutter, Paul S

2016-08-24

Nutrition researchers recently recognized that deficiency of vitamin K2 (menaquinone: MK-4-MK-13) is widespread and contributes to cardiovascular disease (CVD). The deficiency of vitamin K2 or vitamin K inhibition with warfarin leads to calcium deposition in the arterial blood vessels. Using publicly available sources, we collected food commodity availability data and derived nutrient profiles including vitamin K2 for people from 168 countries. We also collected female and male cohort data on early death from CVD (ages 15-64 years), insufficient physical activity, tobacco, biometric CVD risk markers, socioeconomic risk factors for CVD, and gender. The outcome measures included (1) univariate correlations of early death from CVD with each risk factor, (2) a multiple regression-derived formula relating early death from CVD (dependent variable) to macronutrient profile, vitamin K1 and K2 and other risk factors (independent variables), (3) for each risk factor appearing in the multiple regression formula, the portion of CVD risk attributable to that factor, and (4) similar univariate and multivariate analyses of body mass index (BMI), fasting blood sugar (FBS) (simulated from diabetes prevalence), systolic blood pressure (SBP), and cholesterol/ HDL-C ratio (simulated from serum cholesterol) (dependent variables) and dietary and other risk factors (independent variables). Female and male cohorts in countries that have vitamin K2 < 5µg per 2000 kcal/day per capita (n = 70) had about 2.2 times the rate of early CVD deaths as people in countries with > 24 µg/day of vitamin K2 per 2000 kcal/day (n = 72). A multiple regression-derived formula relating early death from CVD to dietary nutrients and other risk factors accounted for about 50% of the variance between cohorts in early CVD death. 
The attributable risks of the variables in the CVD early death formula were: too much alcohol (0.38%), too little vitamin K2 (6.95%), tobacco (6.87%), high blood pressure (9.01%), air pollution (9.15%), early childhood death (3.64%), poverty (7.66%), and male gender (6.13%). Worldwide dietary vitamin K2 data derived from food commodities add much understanding to the analysis of CVD risk factors and the etiology of CVD. Vitamin K2 in food products should be systematically quantified. Public health programs should be considered to increase the intake of vitamin K2-containing fermented plant foods such as sauerkraut, miso, and natto.
Estimating annual suspended-sediment loads in the northern and central Appalachian Coal region

USGS Publications Warehouse

Koltun, G.F.

1985-01-01

Multiple-regression equations were developed for estimating the annual suspended-sediment load, for a given year, from small to medium-sized basins in the northern and central parts of the Appalachian coal region. The regression analysis was performed with data for land use, basin characteristics, streamflow, rainfall, and suspended-sediment load for 15 sites in the region. Two variables, the maximum mean-daily discharge occurring within the year and the annual peak discharge, explained much of the variation in the annual suspended-sediment load. Separate equations were developed employing each of these discharge variables. Standard errors for both equations are relatively large, which suggests that future predictions will probably have a low level of precision. This level of precision, however, may be acceptable for certain purposes. It is therefore left to the user to assess whether the level of precision provided by these equations is acceptable for the intended application.
Linking family dynamics and the mental health of Colombian dementia caregivers.

PubMed

Sutter, Megan; Perrin, Paul B; Chang, Yu-Ping; Hoyos, Guillermo Ramirez; Buraye, Jaqueline Arabia; Arango-Lasprilla, Juan Carlos

2014-02-01

This cross-sectional, quantitative, self-report study examined the relationship between family dynamics (cohesion, flexibility, pathology/ functioning, communication, family satisfaction, and empathy) and mental health (depression, burden, stress, and satisfaction with life [SWL]) in 90 dementia caregivers from Colombia. Hierarchical multiple regressions controlling for caregiver demographics found that family dynamics were significantly associated with caregiver depression, stress, and SWL and marginally associated with burden. Within these regressions, empathy was uniquely associated with stress; flexibility with depression and marginally with SWL; and family communication marginally with burden and stress. Nearly all family dynamic variables were bivariately associated with caregiver mental health variables, such that caregivers had stronger mental health when their family dynamics were healthy. Family-systems interventions in global regions with high levels of familism like that in the current study may improve family empathy, flexibility, and communication, thereby producing better caregiver mental health and better informal care for people with dementia.
Results of the 2005 AORN salary survey--trends for perioperative nursing.

PubMed

Bacon, Donald

2005-12-01

AORN conducted its annual compensation survey for perioperative nurses in August 2005. A multiple regression model was used to examine how a variety of variables, including job title, education level, certification, experience, and geographic region, affect nursing compensation. This survey also examines the effect of other forms of compensation (eg, on-call compensation, overtime, bonuses, shift differential) on average base compensation rates.
Results of the 2006 AORN salary survey: trends for perioperative nursing.

PubMed

Bacon, Donald

2006-12-01

AORN conducted its annual compensation survey for perioperative nurses in August 2006. A multiple regression model was used to examine how a variety of variables, including job title, education level, certification, experience, and geographic region, affect nursing compensation. This survey also examines the effect of other forms of compensation (eg, on-call compensation, overtime, bonuses, shift differential) on average base compensation rates.
Modeling contemporary climate profiles of whitebark pine (Pinus albicaulis) and predicting responses to global warming

Treesearch

Marcus V. Warwell; Gerald E. Rehfeldt; Nicholas L. Crookston

2006-01-01

The Random Forests multiple regression tree was used to develop an empirically-based bioclimate model for the distribution of Pinus albicaulis (whitebark pine) in western North America, latitudes 31° to 51° N and longitudes 102° to 125° W. Independent variables included 35 simple expressions of temperature and precipitation and their interactions....

The Relationship of Item-Level Response Times with Test-Taker and Item Variables in an Operational CAT Environment. LSAC Research Report Series.

ERIC Educational Resources Information Center

Swygert, Kimberly A.

In this study, data from an operational computerized adaptive test (CAT) were examined in order to gather information concerning item response times in a CAT environment. The CAT under study included multiple-choice items measuring verbal, quantitative, and analytical reasoning. The analyses included the fitting of regression models describing the…
Exploring the facilitators and barriers to engagement in physical activity for people with multiple sclerosis.

PubMed

Kayes, Nicola M; McPherson, Kathryn M; Schluter, Philip; Taylor, Denise; Leete, Marta; Kolt, Gregory S

2011-01-01

To explore the relationship that cognitive behavioural and other previously identified variables have with physical activity engagement in people with multiple sclerosis (MS). This study adopted a cross-sectional questionnaire design. Participants were 282 individuals with MS. Outcome measures included the Physical Activity Disability Survey--Revised, Cognitive and Behavioural Responses to Symptoms Questionnaire, Barriers to Health Promoting Activities for Disabled Persons Scale, Multiple Sclerosis Self-efficacy Scale, Self-Efficacy for Chronic Diseases Scales and Chalder Fatigue Questionnaire. Multivariable stepwise regression analyses found that greater self-efficacy, greater reported mental fatigue and lower number of perceived barriers to physical activity accounted for a significant proportion of variance in physical activity behaviour, over that accounted for by illness-related variables. Although fear-avoidance beliefs accounted for a significant proportion of variance in the initial analyses, its effect was explained by other factors in the final multivariable analyses. Self-efficacy, mental fatigue and perceived barriers to physical activity are potentially modifiable variables which could be incorporated into interventions designed to improve physical activity engagement. Future research should explore whether a measurement tool tailored to capture beliefs about physical activity identified by people with MS would better predict participation in physical activity.
Does weather shape rodents? Climate related changes in morphology of two heteromyid species

NASA Astrophysics Data System (ADS)

Wolf, Mosheh; Friggens, Michael; Salazar-Bravo, Jorge

2009-01-01

Geographical variation in morphometric characters in heteromyid rodents has often correlated with climate gradients. Here, we used the long-term database of rodents trapped in the Sevilleta National Wildlife Refuge in New Mexico, USA to test whether significant annual changes in external morphometric characters are observed in a region with large variations in temperature and precipitation. We looked at the relationships between multiple temperature and precipitation variables and a number of morphological traits (body mass, body, tail, hind leg, and ear length) for two heteromyid rodents, Dipodomys merriami and Perognathus flavescens. Because these rodents can live multiple years in the wild, the climate variables for the year of the capture and the previous 2 years were included in the analyses. Using multiple linear regressions, we found that all of our morphometric traits, with the exception of tail length in D. merriami, had a significant relationship with one or more of the climate variables used. Our results demonstrate that effects of climate change on morphological traits occur over short periods, even in noninsular mammal populations. It is unclear, though, whether these changes are the result of morphological plasticity or natural selection.
Explanation of the variance in quality of life and activity capacity of patients with heart failure by laboratory data.

PubMed

Athanasopoulos, Leonidas V; Dritsas, Athanasios; Doll, Helen A; Cokkinos, Dennis V

2010-08-01

This study was conducted to explain the variance in quality of life (QoL) and activity capacity of patients with congestive heart failure from pathophysiological changes as estimated by laboratory data. Peak oxygen consumption (peak VO2) and ventilation (VE)/carbon dioxide output (VCO2) slope derived from cardiopulmonary exercise testing, plasma N-terminal prohormone of B-type natriuretic peptide (NT-proBNP), and echocardiographic markers [left atrium (LA), left ventricular ejection fraction (LVEF)] were measured in 62 patients with congestive heart failure, who also completed the Minnesota Living with Heart Failure Questionnaire and the Specific Activity Questionnaire. All regression models were adjusted for age and sex. On linear regression analysis, peak VO2 with P value less than 0.001, VE/VCO2 slope with P value less than 0.01, LVEF with P value less than 0.001, LA with P=0.001, and logNT-proBNP with P value less than 0.01 were found to be associated with QoL. On stepwise multiple linear regression, peak VO2 and LVEF continued to be predictive, accounting for 40% of the variability in Minnesota Living with Heart Failure Questionnaire score. On linear regression analysis, peak VO2 with P value less than 0.001, VE/VCO2 slope with P value less than 0.001, LVEF with P value less than 0.05, LA with P value less than 0.001, and logNT-proBNP with P value less than 0.001 were found to be associated with activity capacity. On stepwise multiple linear regression, peak VO2 and LA continued to be predictive, accounting for 53% of the variability in Specific Activity Questionnaire score. Peak VO2 is independently associated both with QoL and activity capacity. In addition to peak VO2, LVEF is independently associated with QoL, and LA with activity capacity.
Race-ethnicity is a strong correlate of circulating fat-soluble nutrient concentrations in a representative sample of the U.S. population.

PubMed

Schleicher, Rosemary L; Sternberg, Maya R; Pfeiffer, Christine M

2013-06-01

Sociodemographic and lifestyle factors exert important influences on nutritional status; however, information on their association with biomarkers of fat-soluble nutrients is limited, particularly in a representative sample of adults. Serum or plasma concentrations of vitamin A, vitamin E, carotenes, xanthophylls, 25-hydroxyvitamin D [25(OH)D], SFAs, MUFAs, PUFAs, and total fatty acids (tFAs) were measured in adults (aged ≥ 20 y) during all or part of NHANES 2003-2006. Simple and multiple linear regression models were used to assess 5 sociodemographic variables (age, sex, race-ethnicity, education, and income) and 5 lifestyle behaviors (smoking, alcohol consumption, BMI, physical activity, and supplement use) and their relation to biomarker concentrations. Adjustment for total serum cholesterol and lipid-altering drug use was added to the full regression model. Adjustment for latitude and season was added to the full model for 25(OH)D. Based on simple linear regression, race-ethnicity, BMI, and supplement use were significantly related to all fat-soluble biomarkers. Sociodemographic variables as a group explained 5-17% of biomarker variability, whereas together, sociodemographic and lifestyle variables explained 22-23% [25(OH)D, vitamin E, xanthophylls], 17% (vitamin A), 15% (MUFAs), 10-11% (SFAs, carotenes, tFAs), and 6% (PUFAs) of biomarker variability. Although lipid adjustment explained additional variability for all biomarkers except for 25(OH)D, it appeared to be largely independent of sociodemographic and lifestyle variables. After adjusting for sociodemographic, lifestyle, and lipid-related variables, major differences in biomarkers were associated with race-ethnicity (from -44 to 57%), smoking (up to -25%), supplement use (up to 21%), and BMI (up to -15%). Latitude and season attenuated some race-ethnicity differences. 
Of the sociodemographic and lifestyle variables examined, with or without lipid adjustment, most fat-soluble nutrient biomarkers were significantly associated with race-ethnicity.
On using summary statistics from an external calibration sample to correct for covariate measurement error.

PubMed

Guo, Ying; Little, Roderick J; McConnell, Daniel S

2012-01-01

Covariate measurement error is common in epidemiologic studies. Current methods for correcting measurement error with information from external calibration samples are insufficient to provide valid adjusted inferences. We consider the problem of estimating the regression of an outcome Y on covariates X and Z, where Y and Z are observed, X is unobserved, but a variable W that measures X with error is observed. Information about measurement error is provided in an external calibration sample where data on X and W (but not Y and Z) are recorded. We describe a method that uses summary statistics from the calibration sample to create multiple imputations of the missing values of X in the regression sample, so that the regression coefficients of Y on X and Z and associated standard errors can be estimated using simple multiple imputation combining rules, yielding valid statistical inferences under the assumption of a multivariate normal distribution. The proposed method is shown by simulation to provide better inferences than existing methods, namely the naive method, classical calibration, and regression calibration, particularly for correction for bias and achieving nominal confidence levels. We also illustrate our method with an example using linear regression to examine the relation between serum reproductive hormone concentrations and bone mineral density loss in midlife women in the Michigan Bone Health and Metabolism Study. Existing methods fail to adjust appropriately for bias due to measurement error in the regression setting, particularly when measurement error is substantial. The proposed method corrects this deficiency.
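The "simple multiple imputation combining rules" mentioned above are usually Rubin's rules. The sketch below pools one coefficient across imputed datasets under that assumption. The function name and input numbers are hypothetical, and the step that generates the imputations from calibration summary statistics is not shown.

```python
import math


def pool_rubin(estimates, variances):
    """Pool one coefficient across m imputed datasets with Rubin's rules.

    estimates: the coefficient estimated in each imputed dataset
    variances: its squared standard error in each imputed dataset
    Returns the pooled estimate and its pooled standard error.
    """
    m = len(estimates)
    pooled = sum(estimates) / m                      # mean of the estimates
    within = sum(variances) / m                      # average within-imputation variance
    between = sum((q - pooled) ** 2 for q in estimates) / (m - 1)
    total = within + (1 + 1 / m) * between           # total variance
    return pooled, math.sqrt(total)


# Hypothetical results from three imputed datasets.
estimate, std_error = pool_rubin([1.0, 2.0, 3.0], [0.5, 0.5, 0.5])
print(estimate, std_error)
```

The between-imputation term is what carries the extra uncertainty from not observing X directly; ignoring it would understate the standard error.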
DOE Office of Scientific and Technical Information (OSTI.GOV)

Yust, B.L.

The relationship between fuels used by households in a rural region of Leyte Province, the Philippines, and the variables that can affect the type and amount of fuel used were examined. Data were drawn from interviews conducted in a previous study with 150 female heads of households from 10 villages near Baybay, Leyte. Within a family-ecosystem framework, a multiple regression model was developed to identify predictors of fuel use in the households. Inputs to the system included the following independent variables representing aspects of household environments: (1) natural--geographic location of the village, (2) technical--cook stove and equipment ownership, (3) economic--distance to fuel sources and number of hectares of land owned, and (4) cultural--cooking fuel preference. Two regression equations were developed. The first used as the dependent variable the number of units of each of four specific fuels used in the household in one week: wood, coconut fronds, coconut shells, and coconut husks with shells. The second used as the dependent variable an aggregate measure, barrel oil equivalent (boe), of the quantity of all fuels used in the household in one week. The households in this study were primarily dependent on biomass fuels gathered by family members; a limited quantity of commercial fuels was used.
Climate variations and salmonellosis transmission in Adelaide, South Australia: a comparison between regression models

NASA Astrophysics Data System (ADS)

Zhang, Ying; Bi, Peng; Hiller, Janet

2008-01-01

This is the first study to identify appropriate regression models for the association between climate variation and salmonellosis transmission. A comparison between different regression models was conducted using surveillance data in Adelaide, South Australia. By using notified salmonellosis cases and climatic variables from the Adelaide metropolitan area over the period 1990-2003, four regression methods were examined: standard Poisson regression, autoregressive adjusted Poisson regression, multiple linear regression, and a seasonal autoregressive integrated moving average (SARIMA) model. Notified salmonellosis cases in 2004 were used to test the forecasting ability of the four models. Parameter estimation, goodness-of-fit and forecasting ability of the four regression models were compared. Temperatures occurring 2 weeks prior to cases were positively associated with cases of salmonellosis. Rainfall was also inversely related to the number of cases. The comparison of the goodness-of-fit and forecasting ability suggest that the SARIMA model is better than the other three regression models. Temperature and rainfall may be used as climatic predictors of salmonellosis cases in regions with climatic characteristics similar to those of Adelaide. The SARIMA model could, thus, be adopted to quantify the relationship between climate variations and salmonellosis transmission.
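Relating cases to temperatures from two weeks earlier requires aligning each week's case count with the predictor value from the lagged week before any regression is fitted. A minimal sketch of that alignment follows, assuming equally spaced weekly series. The function name and data are hypothetical and are not taken from the Adelaide surveillance records.

```python
def lagged_pairs(predictor, response, lag):
    """Pair predictor[t - lag] with response[t] for every week t >= lag."""
    if len(predictor) != len(response):
        raise ValueError("series must have the same length")
    if lag < 0 or lag >= len(predictor):
        raise ValueError("lag must be between 0 and len(series) - 1")
    return list(zip(predictor[:len(predictor) - lag], response[lag:]))


# Hypothetical weekly mean temperature (deg C) and notified case counts.
temperature = [20.0, 22.0, 25.0, 27.0, 24.0]
cases = [3, 4, 5, 6, 8]

# Each tuple is (temperature two weeks earlier, cases this week).
print(lagged_pairs(temperature, cases, 2))
```

The resulting pairs can then be passed to whichever regression model is being compared, such as a Poisson or linear model.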
Modified Regression Correlation Coefficient for Poisson Regression Model

NASA Astrophysics Data System (ADS)

Kaengthong, Nattacha; Domthong, Uthumporn

2017-09-01

This study examines indicators of predictive power for the Generalized Linear Model (GLM), which are widely used but often subject to restrictions. We are interested in the regression correlation coefficient for a Poisson regression model. This is a measure of predictive power, defined by the relationship between the dependent variable (Y) and the expected value of the dependent variable given the independent variables [E(Y|X)] for the Poisson regression model. The dependent variable is distributed as Poisson. The purpose of this research was to modify the regression correlation coefficient for the Poisson regression model. We also compare the proposed modified regression correlation coefficient with the traditional regression correlation coefficient in the case of two or more independent variables and in the presence of multicollinearity among the independent variables. The results show that the proposed regression correlation coefficient is better than the traditional regression correlation coefficient in terms of bias and root mean square error (RMSE).
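The traditional regression correlation coefficient described here is the correlation between Y and E(Y|X). The sketch below computes it for a one-predictor Poisson model with a log link. The coefficients are assumed to be already estimated, the names and numbers are hypothetical, and the authors' modified coefficient is not shown because the abstract does not specify it.

```python
import math


def pearson(u, v):
    """Pearson correlation between two equal-length sequences."""
    n = len(u)
    mean_u = sum(u) / n
    mean_v = sum(v) / n
    cov = sum((a - mean_u) * (b - mean_v) for a, b in zip(u, v))
    ss_u = sum((a - mean_u) ** 2 for a in u)
    ss_v = sum((b - mean_v) ** 2 for b in v)
    return cov / math.sqrt(ss_u * ss_v)


def regression_correlation(y, x, b0, b1):
    """Correlation between observed counts and fitted Poisson means.

    Fitted means use the log link: E(Y|X) = exp(b0 + b1 * x).
    """
    fitted = [math.exp(b0 + b1 * xi) for xi in x]
    return pearson(y, fitted)


# Hypothetical counts and assumed (already estimated) coefficients.
x = [0.0, 1.0, 2.0, 3.0, 4.0]
y = [1, 2, 2, 5, 7]
print(regression_correlation(y, x, 0.1, 0.5))
```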
The multiple imputation method: a case study involving secondary data analysis.

PubMed

Walani, Salimah R; Cleland, Charles M

2015-05-01

The aim is to illustrate, with the example of a secondary data analysis study, the use of the multiple imputation method to replace missing data. Most large public datasets have missing data, which need to be handled by researchers conducting secondary data analysis studies. Multiple imputation is a technique widely used to replace missing values while preserving the sample size and sampling variability of the data. The data source was the 2004 National Sample Survey of Registered Nurses. The authors created a model to impute missing values using the chained equation method. They used imputation diagnostics procedures and conducted regression analysis of imputed data to determine the differences between the log hourly wages of internationally educated and US-educated registered nurses. The authors used multiple imputation procedures to replace missing values in a large dataset with 29,059 observations. Five multiple imputed datasets were created. Imputation diagnostics using time series and density plots showed that imputation was successful. The authors also present an example of the use of multiple imputed datasets to conduct regression analysis to answer a substantive research question. Multiple imputation is a powerful technique for imputing missing values in large datasets while preserving the sample size and variance of the data. Even though the chained equation method involves complex statistical computations, recent innovations in software and computation have made it possible for researchers to conduct this technique on large datasets. The authors recommend nurse researchers use multiple imputation methods for handling missing data to improve the statistical power and external validity of their studies.
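The abstract does not say how the regression results from the five imputed datasets were combined. A common choice is Rubin's rules, and the sketch below assumes that approach: the pooled estimate is the mean of the per-dataset estimates, and the total variance adds the between-imputation variance (inflated by 1 + 1/m) to the average within-imputation variance. The numbers are made up for illustration and are not taken from the study.

```python
import numpy as np

def pool_rubin(estimates, variances):
    """Pool one coefficient across m imputed datasets using Rubin's rules."""
    q = np.asarray(estimates, dtype=float)   # per-dataset coefficient estimates
    u = np.asarray(variances, dtype=float)   # per-dataset squared standard errors
    m = len(q)
    q_bar = q.mean()                         # pooled point estimate
    within = u.mean()                        # average within-imputation variance
    between = q.var(ddof=1)                  # between-imputation variance
    total = within + (1.0 + 1.0 / m) * between
    return q_bar, np.sqrt(total)

# Hypothetical coefficient from m = 5 imputed datasets (illustrative values).
pooled_est, pooled_se = pool_rubin([1.0, 1.2, 0.8, 1.1, 0.9], [0.04] * 5)
print(pooled_est, pooled_se)
```

The pooled standard error is larger than any single-dataset standard error here because it also carries the uncertainty due to the missing values.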
Multiple causes of nonstationarity in the Weihe annual low-flow series

NASA Astrophysics Data System (ADS)

Xiong, Bin; Xiong, Lihua; Chen, Jie; Xu, Chong-Yu; Li, Lingqi

2018-02-01

Against the background of global climate change and local anthropogenic activities, multiple driving forces have introduced various nonstationary components into low-flow series. This has led to a high demand for low-flow frequency analysis that considers nonstationary conditions in modeling. In this study, through a nonstationary frequency analysis framework with the generalized linear model (GLM) to consider time-varying distribution parameters, multiple explanatory variables were incorporated to explain the variation in low-flow distribution parameters. These variables comprise three indices of human activities (HAs; i.e., population, POP; irrigation area, IAR; and gross domestic product, GDP) and eight indices of the climate and catchment conditions (i.e., total precipitation P, mean frequency of precipitation events λ, temperature T, potential evapotranspiration (EP), climate aridity index AIEP, base-flow index (BFI), recession constant K and the recession-related aridity index AIK). This framework was applied to model the annual minimum flow series of both Huaxian and Xianyang gauging stations in the Weihe River, China (also known as the Wei He River). The results from stepwise regression for the optimal explanatory variables show that the variables related to irrigation, recession, temperature and precipitation play an important role in modeling. Specifically, analysis of annual minimum 30-day flow in Huaxian shows that the nonstationary distribution model with any one of the explanatory variables is better than the one without explanatory variables, the nonstationary gamma distribution model with four optimal variables is the best model and AIK is of the highest relative importance among these four variables, followed by IAR, BFI and AIEP. We conclude that the incorporation of multiple indices related to low-flow generation permits tracing various driving forces. The established link in nonstationary analysis will be beneficial to analyze future occurrences of low-flow extremes in similar areas.
Multiple variables explain the variability in the decrement in VO2max during acute hypobaric hypoxia.

PubMed

Robergs, R A; Quintana, R; Parker, D L; Frankel, C C

1998-06-01

We used multiple regression analyses to determine the relationships between the decrement in sea level (SL, 760 Torr) VO2max during hypobaric hypoxia (HH) and variables that could alter or be related to the decrement in VO2max. HH conditions consisted of 682 Torr, 632 Torr, and 566 Torr, and the measured independent variables were SL-VO2max, SL lactate threshold (SL-LT), the change in hemoglobin saturation at VO2max between 760 and 566 Torr (delta SaO2max), lean body mass (LBM), and gender. Male (N = 14) and female (N = 14) subjects of varied fitness, training status, and residential altitude (1,640-2,460 m) completed cycle ergometry tests of VO2max at each HH condition under randomized and single-blinded conditions. VO2max decreased significantly from 760 Torr after 682 Torr (approximately 915 m) (3.5 +/- 0.9 to 3.4 +/- 0.8 L.min-1, P = 0.0003). Across all HH conditions, the slope of the relative decrement in VO2max (%VO2max) during HH was -9.2%/100 mm Hg (-8.1%/1000 m) with an initial decrease from 100% estimated to occur below 705 Torr (610 m). Step-wise multiple regression revealed that SL-VO2max, SL-LT, delta SaO2max, LBM, and gender each significantly combined to account for 89.03% of the variance in the decrement in VO2max (760-566 Torr) (P < 0.001). Individuals who have a combination of a large SL-VO2max, a small SL-LT (VO2, L.min-1), greater reductions in delta SaO2max, a large LBM, and are male have the greatest decrement in VO2max during HH. The unique variance explanation afforded by SL-LT, LBM, and gender suggests that issues pertaining to oxygen diffusion within skeletal muscle may add to the explanation of between subjects variability in the decrement in VO2max during HH.
Bayesian Group Bridge for Bi-level Variable Selection.

PubMed

Mallick, Himel; Yi, Nengjun

2017-06-01

A Bayesian bi-level variable selection method (BAGB: Bayesian Analysis of Group Bridge) is developed for regularized regression and classification. This new development is motivated by grouped data, where generic variables can be divided into multiple groups, with variables in the same group being mechanistically related or statistically correlated. As an alternative to frequentist group variable selection methods, BAGB incorporates structural information among predictors through a group-wise shrinkage prior. Posterior computation proceeds via an efficient MCMC algorithm. In addition to the usual ease-of-interpretation of hierarchical linear models, the Bayesian formulation produces valid standard errors, a feature that is notably absent in the frequentist framework. Empirical evidence of the attractiveness of the method is illustrated by extensive Monte Carlo simulations and real data analysis. Finally, several extensions of this new approach are presented, providing a unified framework for bi-level variable selection in general models with flexible penalties.
Community characteristics that attract physicians in Japan: a cross-sectional analysis of community demographic and economic factors.

PubMed

Matsumoto, Masatoshi; Inoue, Kazuo; Noguchi, Satomi; Toyokawa, Satoshi; Kajii, Eiji

2009-02-18

In many countries, there is a surplus of physicians in some communities and a shortage in others. Population size is known to be correlated with the number of physicians in a community, and is conventionally considered to represent the power of communities to attract physicians. However, associations between other demographic/economic variables and the number of physicians in a community have not been fully evaluated. This study seeks other parameters that correlate with the physician population and show which characteristics of a community determine its "attractiveness" to physicians. Associations between the number of physicians and selected demographic/economic/life-related variables of all of Japan's 3132 municipalities were examined. In order to exclude the confounding effect of community size, correlations between the physician-to-population ratio and other variable-to-population ratios or variable-to-area ratios were evaluated with simple correlation and multiple regression analyses. The equity of physician distribution against each variable was evaluated by the Lorenz curve and Gini index. Among the 21 variables selected, the service industry workers-to-population ratio (0.543), commercial land price (0.527), sales of goods per person (0.472), and daytime population density (0.451) were better correlated with the physician-to-population ratio than was population density (0.409). Multiple regression analysis showed that the service industry worker-to-population ratio, the daytime population density, and the elderly rate were each independently correlated with the physician-to-population ratio (standardized regression coefficient 0.393, 0.355, 0.089 respectively; each p<0.001). Equity of physician distribution was higher against service industry population (Gini index=0.26) and daytime population (0.28) than against population (0.33). Daytime population and service industry population in a municipality are better parameters of community attractiveness to physicians than population. Because attractiveness is supposed to consist of medical demand and the amenities of urban life, the two parameters may represent the amount of medical demand and/or the extent of urban amenities of the community more precisely than population does. The conventional demand-supply analysis based solely on population as the demand parameter may overestimate the inequity of the physician distribution among communities.
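To make the Lorenz curve and Gini index computation concrete, here is a minimal sketch that measures how unevenly a resource (physicians) is spread relative to a reference quantity (for example, resident or daytime population). The function name `gini_against` and the toy municipality figures are hypothetical; the abstract does not describe the authors' exact procedure, so this is only one conventional way to compute it.

```python
import numpy as np

def gini_against(resource, reference):
    """Gini index of `resource` distributed across units weighted by `reference`."""
    resource = np.asarray(resource, dtype=float)
    reference = np.asarray(reference, dtype=float)
    # Order units from the lowest to the highest resource-per-reference ratio.
    order = np.argsort(resource / reference)
    # Lorenz curve: cumulative reference share (x) vs cumulative resource share (y).
    x = np.concatenate([[0.0], np.cumsum(reference[order]) / reference.sum()])
    y = np.concatenate([[0.0], np.cumsum(resource[order]) / resource.sum()])
    # Area under the Lorenz curve by the trapezoid rule.
    area = np.sum((x[1:] - x[:-1]) * (y[1:] + y[:-1]) / 2.0)
    return 1.0 - 2.0 * area

# Toy data for four hypothetical municipalities.
physicians = [10, 40, 5, 45]
population = [10_000, 20_000, 10_000, 10_000]
print(gini_against(physicians, population))
```

A Gini index of 0 means physicians are spread exactly in proportion to the reference quantity; values closer to 1 mean they are concentrated in a few units. Swapping the `reference` argument (population, daytime population, service industry population) reproduces the kind of comparison reported in the abstract.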
Model selection with multiple regression on distance matrices leads to incorrect inferences.

PubMed

Franckowiak, Ryan P; Panasci, Michael; Jarvis, Karl J; Acuña-Rodriguez, Ian S; Landguth, Erin L; Fortin, Marie-Josée; Wagner, Helene H

2017-01-01

In landscape genetics, model selection procedures based on Information Theoretic and Bayesian principles have been used with multiple regression on distance matrices (MRM) to test the relationship between multiple vectors of pairwise genetic, geographic, and environmental distance. Using Monte Carlo simulations, we examined the ability of model selection criteria based on Akaike's information criterion (AIC), its small-sample correction (AICc), and the Bayesian information criterion (BIC) to reliably rank candidate models when applied with MRM while varying the sample size. The results showed a serious problem: all three criteria exhibit a systematic bias toward selecting unnecessarily complex models containing spurious random variables and erroneously suggest a high level of support for the incorrectly ranked best model. These problems effectively increased with increasing sample size. The failure of AIC, AICc, and BIC was likely driven by the inflated sample size and different sum-of-squares partitioned by MRM, and the resulting effect on delta values. Based on these findings, we strongly discourage the continued application of AIC, AICc, and BIC for model selection with MRM.
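For readers unfamiliar with the three criteria, the sketch below computes AIC, AICc, and BIC for a least-squares model from its residual sum of squares. These are the usual Gaussian forms up to an additive constant; the abstract itself gives no formulas, and the convention for counting the parameters `k` is an assumption here. The last lines show why the sample size is inflated in MRM: N sampled individuals yield N(N-1)/2 pairwise distances, which are not independent observations.

```python
import math

def information_criteria(rss, n, k):
    """Return (AIC, AICc, BIC) for a least-squares fit.

    rss: residual sum of squares, n: number of observations,
    k: number of estimated parameters (counting convention assumed).
    """
    base = n * math.log(rss / n)
    aic = base + 2 * k
    aicc = aic + (2 * k * (k + 1)) / (n - k - 1)   # small-sample correction
    bic = base + k * math.log(n)
    return aic, aicc, bic

# Pairwise distances grow quadratically with the number of individuals.
n_individuals = 50
n_pairs = n_individuals * (n_individuals - 1) // 2

# Hypothetical residual sum of squares, for illustration only.
aic, aicc, bic = information_criteria(rss=600.0, n=n_pairs, k=3)
print(n_pairs, aic, aicc, bic)
```

Because every criterion scales with `n`, treating all pairwise distances as independent observations magnifies the differences between candidate models, which is consistent with the inflated delta values the authors report.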
Income, housing, and fire injuries: a census tract analysis.

PubMed

Shai, Donna

2006-01-01

This study investigates the social and demographic correlates of nonfatal structural fire injury rates for the civilian population for Philadelphia census tracts during 1993-2001. The author analyzed 1,563 fire injuries by census tract using the 1990 census (STF 3) and unpublished data from the Office of the Fire Marshal of the Philadelphia Fire Department. Injury rates were calculated per 1,000 residents of a given census tract. Multiple regression was used to determine significant variables in predicting fire injuries in a given census tract over a nine-year period and interaction effects between two of these variables: age of housing and income. Multiple regression analysis indicates that older housing (prior to 1940), low income, the prevalence of vacant houses, and the ability to speak English have significant independent effects on fire injury rates in Philadelphia. In addition, the results show a significant interaction between older housing and low income. Given the finding of very high rates of fire injuries in census tracts that are both low income and have older housing, fire prevention units can take preventative measures. Fire protection devices, especially smoke alarms, should be distributed in the neighborhoods most at risk. Multiple occupancy dwellings should have sprinkler systems and fire extinguishers. Laws concerning the maintenance of older rental housing need to be strictly enforced. Vacant houses should be effectively boarded up or renovated for residential use. Fire prevention material should be distributed in a number of languages to meet local needs.
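An interaction of the kind reported here is usually modeled by adding the product of the two predictors to the regression. The sketch below does this with ordinary least squares on simulated tract-level data; the variable names, coefficient values, and noise level are all hypothetical and are not taken from the study.

```python
import numpy as np

rng = np.random.default_rng(1)
n = 400

# Simulated tract-level predictors (shares between 0 and 1, illustrative only).
old_housing = rng.uniform(0, 1, n)
low_income = rng.uniform(0, 1, n)

# Design matrix: intercept, two main effects, and their product term.
X = np.column_stack([np.ones(n), old_housing, low_income,
                     old_housing * low_income])
true_beta = np.array([0.5, 1.0, 1.5, 2.0])      # made-up coefficients
injury_rate = X @ true_beta + rng.normal(0, 0.1, n)

beta_hat, *_ = np.linalg.lstsq(X, injury_rate, rcond=None)

# With a product term, the slope for old housing depends on the income level.
slope_low = beta_hat[1] + beta_hat[3] * 0.1     # tracts with little low income
slope_high = beta_hat[1] + beta_hat[3] * 0.9    # tracts with mostly low income
print(beta_hat, slope_low, slope_high)
```

A positive product-term coefficient means the estimated effect of older housing is stronger in lower-income tracts, which is the pattern the abstract describes.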
Pursuit of STEM: Factors shaping degree completion for African American females in STEM

NASA Astrophysics Data System (ADS)

Wilkins, Ashlee N.

The primary purpose of the study was to examine secondary data from the Cooperative Institutional Research Program (CIRP) Freshman and College Senior Surveys to investigate factors shaping degree aspirations for African American female undergraduates in science, technology, engineering, and mathematics (STEM) majors. Hierarchical multiple regression was used to analyze the data and identify relationships between the independent variables and the dependent variable. The findings of the study reveal key variables that were predictive of degree completion for African American females in STEM. Father's education, SAT composite, highest degree planned, and self-perception were positive predictors for females, while the independent variable "overall sense of community among students" remained a negative predictor. Lastly, implications for education and recommendations for future research are discussed.
Influence of hydroxypropyl methylcellulose on drug release pattern of a gastroretentive floating drug delivery system using a 3² full factorial design.

PubMed

Swain, Kalpana; Pattnaik, Satyanarayan; Mallick, Subrata; Chowdary, Korla Appana

2009-01-01

In the present investigation, a controlled release gastroretentive floating drug delivery system of theophylline was developed employing response surface methodology. A 3² randomized full factorial design was developed to study the effect of formulation variables, namely the viscosity grade and content of hydroxypropyl methylcellulose (HPMC), and their interactions on response variables. The floating lag time for all nine experimental trial batches was less than 2 min, and the floatation time was more than 12 h. Theophylline release from the polymeric matrix system followed non-Fickian anomalous transport. Multiple regression analysis revealed that both viscosity and content of HPMC had a statistically significant influence on all dependent variables, but the effect of these variables was found to be nonlinear above certain threshold values.
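A full factorial design with two factors at three levels gives the nine trial batches mentioned above. The sketch below enumerates the runs and builds the design matrix for a second-order response surface model. The coded levels -1, 0, +1 and the quadratic model form are conventional assumptions; the abstract does not state the levels or the fitted equation.

```python
import itertools
import numpy as np

# Coded levels for each factor (assumed): low, middle, high.
levels = (-1, 0, 1)

# Two factors: HPMC viscosity grade (x1) and HPMC content (x2).
runs = list(itertools.product(levels, levels))

def quadratic_design_matrix(runs):
    """Columns: intercept, x1, x2, x1*x2, x1^2, x2^2."""
    x = np.asarray(runs, dtype=float)
    x1, x2 = x[:, 0], x[:, 1]
    return np.column_stack([np.ones(len(x)), x1, x2, x1 * x2, x1**2, x2**2])

D = quadratic_design_matrix(runs)
print(len(runs), D.shape, np.linalg.matrix_rank(D))
```

Given a measured response for each of the nine runs, `np.linalg.lstsq(D, response, rcond=None)` would return the six regression coefficients; the squared terms are what allow the model to capture the nonlinear effects the authors report.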
Comparison of stream invertebrate response models for bioassessment metric

USGS Publications Warehouse

Waite, Ian R.; Kennen, Jonathan G.; May, Jason T.; Brown, Larry R.; Cuffney, Thomas F.; Jones, Kimberly A.; Orlando, James L.

2012-01-01

We aggregated invertebrate data from various sources to assemble data for modeling in two ecoregions in Oregon and one in California. Our goal was to compare the performance of models developed using multiple linear regression (MLR) techniques with models developed using three relatively new techniques: classification and regression trees (CART), random forest (RF), and boosted regression trees (BRT). We used tolerance of taxa based on richness (RICHTOL) and ratio of observed to expected taxa (O/E) as response variables and land use/land cover as explanatory variables. Responses were generally linear; therefore, there was little improvement to the MLR models when compared to models using CART and RF. In general, the four modeling techniques (MLR, CART, RF, and BRT) consistently selected the same primary explanatory variables for each region. However, results from the BRT models showed significant improvement over the MLR models for each region; increases in R² from 0.09 to 0.20. The O/E metric that was derived from models specifically calibrated for Oregon consistently had lower R² values than RICHTOL for the two regions tested. Modeled O/E R² values were between 0.06 and 0.10 lower for each of the four modeling methods applied in the Willamette Valley and were between 0.19 and 0.36 points lower for the Blue Mountains. As a result, BRT models may indeed represent a good alternative to MLR for modeling species distribution relative to environmental variables.
Evaluation of accuracy of linear regression models in predicting urban stormwater discharge characteristics.

PubMed

Madarang, Krish J; Kang, Joo-Hyon

2014-06-01

Stormwater runoff has been identified as a source of pollution for the environment, especially for receiving waters. In order to quantify and manage the impacts of stormwater runoff on the environment, predictive models and mathematical models have been developed. Predictive tools such as regression models have been widely used to predict stormwater discharge characteristics. Storm event characteristics, such as antecedent dry days (ADD), have been related to response variables, such as pollutant loads and concentrations. However, whether ADD should be considered an important variable in predicting stormwater discharge characteristics has been a controversial issue among many studies. In this study, we examined the accuracy of general linear regression models in predicting discharge characteristics of roadway runoff. A total of 17 storm events were monitored in two highway segments, located in Gwangju, Korea. Data from the monitoring were used to calibrate United States Environmental Protection Agency's Storm Water Management Model (SWMM). The calibrated SWMM was simulated for 55 storm events, and the results of total suspended solid (TSS) discharge loads and event mean concentrations (EMC) were extracted. From these data, linear regression models were developed. R² and p-values of the regression of ADD for both TSS loads and EMCs were investigated. Results showed that pollutant loads were better predicted than pollutant EMC in the multiple regression models. Regression may not provide the true effect of site-specific characteristics, due to uncertainty in the data. Copyright © 2014 The Research Centre for Eco-Environmental Sciences, Chinese Academy of Sciences. Published by Elsevier B.V. All rights reserved.
