Sample records for calculated multiple regression

  1. A Simple and Convenient Method of Multiple Linear Regression to Calculate Iodine Molecular Constants

    ERIC Educational Resources Information Center

    Cooper, Paul D.

    2010-01-01

    A new procedure using a student-friendly least-squares multiple linear-regression technique utilizing a function within Microsoft Excel is described that enables students to calculate molecular constants from the vibronic spectrum of iodine. This method is advantageous pedagogically as it calculates molecular constants for ground and excited…

  2. Ridge: a computer program for calculating ridge regression estimates

    Treesearch

    Donald E. Hilt; Donald W. Seegrist

    1977-01-01

    Least-squares coefficients for multiple-regression models may be unstable when the independent variables are highly correlated. Ridge regression is a biased estimation procedure that produces stable estimates of the coefficients. Ridge regression is discussed, and a computer program for calculating the ridge coefficients is presented.

  3. A Method of Calculating Functional Independence Measure at Discharge from Functional Independence Measure Effectiveness Predicted by Multiple Regression Analysis Has a High Degree of Predictive Accuracy.

    PubMed

    Tokunaga, Makoto; Watanabe, Susumu; Sonoda, Shigeru

    2017-09-01

    Multiple linear regression analysis is often used to predict the outcome of stroke rehabilitation. However, the predictive accuracy may not be satisfactory. The objective of this study was to elucidate the predictive accuracy of a method of calculating motor Functional Independence Measure (mFIM) at discharge from mFIM effectiveness predicted by multiple regression analysis. The subjects were 505 patients with stroke who were hospitalized in a convalescent rehabilitation hospital. The formula "mFIM at discharge = mFIM effectiveness × (91 points - mFIM at admission) + mFIM at admission" was used. By including the predicted mFIM effectiveness obtained through multiple regression analysis in this formula, we obtained the predicted mFIM at discharge (A). We also used multiple regression analysis to directly predict mFIM at discharge (B). The correlation between the predicted and the measured values of mFIM at discharge was compared between A and B. The correlation coefficients were .916 for A and .878 for B. Calculating mFIM at discharge from mFIM effectiveness predicted by multiple regression analysis had a higher degree of predictive accuracy of mFIM at discharge than that directly predicted. Copyright © 2017 National Stroke Association. Published by Elsevier Inc. All rights reserved.

  4. [Application of SAS macro to evaluated multiplicative and additive interaction in logistic and Cox regression in clinical practices].

    PubMed

    Nie, Z Q; Ou, Y Q; Zhuang, J; Qu, Y J; Mai, J Z; Chen, J M; Liu, X Q

    2016-05-01

    Conditional logistic regression analysis and unconditional logistic regression analysis are commonly used in case control study, but Cox proportional hazard model is often used in survival data analysis. Most literature only refer to main effect model, however, generalized linear model differs from general linear model, and the interaction was composed of multiplicative interaction and additive interaction. The former is only statistical significant, but the latter has biological significance. In this paper, macros was written by using SAS 9.4 and the contrast ratio, attributable proportion due to interaction and synergy index were calculated while calculating the items of logistic and Cox regression interactions, and the confidence intervals of Wald, delta and profile likelihood were used to evaluate additive interaction for the reference in big data analysis in clinical epidemiology and in analysis of genetic multiplicative and additive interactions.

  5. An improved multiple linear regression and data analysis computer program package

    NASA Technical Reports Server (NTRS)

    Sidik, S. M.

    1972-01-01

    NEWRAP, an improved version of a previous multiple linear regression program called RAPIER, CREDUC, and CRSPLT, allows for a complete regression analysis including cross plots of the independent and dependent variables, correlation coefficients, regression coefficients, analysis of variance tables, t-statistics and their probability levels, rejection of independent variables, plots of residuals against the independent and dependent variables, and a canonical reduction of quadratic response functions useful in optimum seeking experimentation. A major improvement over RAPIER is that all regression calculations are done in double precision arithmetic.

  6. The Use of Multiple Regression Models to Determine if Conjoint Analysis Should Be Conducted on Aggregate Data.

    ERIC Educational Resources Information Center

    Fraas, John W.; Newman, Isadore

    1996-01-01

    In a conjoint-analysis consumer-preference study, researchers must determine whether the product factor estimates, which measure consumer preferences, should be calculated and interpreted for each respondent or collectively. Multiple regression models can determine whether to aggregate data by examining factor-respondent interaction effects. This…

  7. Sample size determination for logistic regression on a logit-normal distribution.

    PubMed

    Kim, Seongho; Heath, Elisabeth; Heilbrun, Lance

    2017-06-01

    Although the sample size for simple logistic regression can be readily determined using currently available methods, the sample size calculation for multiple logistic regression requires some additional information, such as the coefficient of determination ([Formula: see text]) of a covariate of interest with other covariates, which is often unavailable in practice. The response variable of logistic regression follows a logit-normal distribution which can be generated from a logistic transformation of a normal distribution. Using this property of logistic regression, we propose new methods of determining the sample size for simple and multiple logistic regressions using a normal transformation of outcome measures. Simulation studies and a motivating example show several advantages of the proposed methods over the existing methods: (i) no need for [Formula: see text] for multiple logistic regression, (ii) available interim or group-sequential designs, and (iii) much smaller required sample size.

  8. Multiplication factor versus regression analysis in stature estimation from hand and foot dimensions.

    PubMed

    Krishan, Kewal; Kanchan, Tanuj; Sharma, Abhilasha

    2012-05-01

    Estimation of stature is an important parameter in identification of human remains in forensic examinations. The present study is aimed to compare the reliability and accuracy of stature estimation and to demonstrate the variability in estimated stature and actual stature using multiplication factor and regression analysis methods. The study is based on a sample of 246 subjects (123 males and 123 females) from North India aged between 17 and 20 years. Four anthropometric measurements; hand length, hand breadth, foot length and foot breadth taken on the left side in each subject were included in the study. Stature was measured using standard anthropometric techniques. Multiplication factors were calculated and linear regression models were derived for estimation of stature from hand and foot dimensions. Derived multiplication factors and regression formula were applied to the hand and foot measurements in the study sample. The estimated stature from the multiplication factors and regression analysis was compared with the actual stature to find the error in estimated stature. The results indicate that the range of error in estimation of stature from regression analysis method is less than that of multiplication factor method thus, confirming that the regression analysis method is better than multiplication factor analysis in stature estimation. Copyright © 2012 Elsevier Ltd and Faculty of Forensic and Legal Medicine. All rights reserved.

  9. Prediction of hearing outcomes by multiple regression analysis in patients with idiopathic sudden sensorineural hearing loss.

    PubMed

    Suzuki, Hideaki; Tabata, Takahisa; Koizumi, Hiroki; Hohchi, Nobusuke; Takeuchi, Shoko; Kitamura, Takuro; Fujino, Yoshihisa; Ohbuchi, Toyoaki

    2014-12-01

    This study aimed to create a multiple regression model for predicting hearing outcomes of idiopathic sudden sensorineural hearing loss (ISSNHL). The participants were 205 consecutive patients (205 ears) with ISSNHL (hearing level ≥ 40 dB, interval between onset and treatment ≤ 30 days). They received systemic steroid administration combined with intratympanic steroid injection. Data were examined by simple and multiple regression analyses. Three hearing indices (percentage hearing improvement, hearing gain, and posttreatment hearing level [HLpost]) and 7 prognostic factors (age, days from onset to treatment, initial hearing level, initial hearing level at low frequencies, initial hearing level at high frequencies, presence of vertigo, and contralateral hearing level) were included in the multiple regression analysis as dependent and explanatory variables, respectively. In the simple regression analysis, the percentage hearing improvement, hearing gain, and HLpost showed significant correlation with 2, 5, and 6 of the 7 prognostic factors, respectively. The multiple correlation coefficients were 0.396, 0.503, and 0.714 for the percentage hearing improvement, hearing gain, and HLpost, respectively. Predicted values of HLpost calculated by the multiple regression equation were reliable with 70% probability with a 40-dB-width prediction interval. Prediction of HLpost by the multiple regression model may be useful to estimate the hearing prognosis of ISSNHL. © The Author(s) 2014.

  10. Epidemiologic programs for computers and calculators. A microcomputer program for multiple logistic regression by unconditional and conditional maximum likelihood methods.

    PubMed

    Campos-Filho, N; Franco, E L

    1989-02-01

    A frequent procedure in matched case-control studies is to report results from the multivariate unmatched analyses if they do not differ substantially from the ones obtained after conditioning on the matching variables. Although conceptually simple, this rule requires that an extensive series of logistic regression models be evaluated by both the conditional and unconditional maximum likelihood methods. Most computer programs for logistic regression employ only one maximum likelihood method, which requires that the analyses be performed in separate steps. This paper describes a Pascal microcomputer (IBM PC) program that performs multiple logistic regression by both maximum likelihood estimation methods, which obviates the need for switching between programs to obtain relative risk estimates from both matched and unmatched analyses. The program calculates most standard statistics and allows factoring of categorical or continuous variables by two distinct methods of contrast. A built-in, descriptive statistics option allows the user to inspect the distribution of cases and controls across categories of any given variable.

  11. Use of Thematic Mapper for water quality assessment

    NASA Technical Reports Server (NTRS)

    Horn, E. M.; Morrissey, L. A.

    1984-01-01

    The evaluation of simulated TM data obtained on an ER-2 aircraft at twenty-five predesignated sample sites for mapping water quality factors such as conductivity, pH, suspended solids, turbidity, temperature, and depth, is discussed. Using a multiple regression for the seven TM bands, an equation is developed for the suspended solids. TM bands 1, 2, 3, 4, and 6 are used with logarithm conductivity in a multiple regression. The assessment of regression equations for a high coefficient of determination (R-squared) and statistical significance is considered. Confidence intervals about the mean regression point are calculated in order to assess the robustness of the regressions used for mapping conductivity, turbidity, and suspended solids, and by regressing random subsamples of sites and comparing the resultant range of R-squared, cross validation is conducted.

  12. Mapping diffuse photosynthetically active radiation from satellite data in Thailand

    NASA Astrophysics Data System (ADS)

    Choosri, P.; Janjai, S.; Nunez, M.; Buntoung, S.; Charuchittipan, D.

    2017-12-01

    In this paper, calculation of monthly average hourly diffuse photosynthetically active radiation (PAR) using satellite data is proposed. Diffuse PAR was analyzed at four stations in Thailand. A radiative transfer model was used for calculating the diffuse PAR for cloudless sky conditions. Differences between the diffuse PAR under all sky conditions obtained from the ground-based measurements and those from the model are representative of cloud effects. Two models are developed, one describing diffuse PAR only as a function of solar zenith angle, and the second one as a multiple linear regression with solar zenith angle and satellite reflectivity acting linearly and aerosol optical depth acting in logarithmic functions. When tested with an independent data set, the multiple regression model performed best with a higher coefficient of variance R2 (0.78 vs. 0.70), lower root mean square difference (RMSD) (12.92% vs. 13.05%) and the same mean bias difference (MBD) of -2.20%. Results from the multiple regression model are used to map diffuse PAR throughout the country as monthly averages of hourly data.

  13. Obtaining Predictions from Models Fit to Multiply Imputed Data

    ERIC Educational Resources Information Center

    Miles, Andrew

    2016-01-01

    Obtaining predictions from regression models fit to multiply imputed data can be challenging because treatments of multiple imputation seldom give clear guidance on how predictions can be calculated, and because available software often does not have built-in routines for performing the necessary calculations. This research note reviews how…

  14. BIODEGRADATION PROBABILITY PROGRAM (BIODEG)

    EPA Science Inventory

    The Biodegradation Probability Program (BIODEG) calculates the probability that a chemical under aerobic conditions with mixed cultures of microorganisms will biodegrade rapidly or slowly. It uses fragment constants developed using multiple linear and non-linear regressions and d...

  15. The use of regression analysis in determining reference intervals for low hematocrit and thrombocyte count in multiple electrode aggregometry and platelet function analyzer 100 testing of platelet function.

    PubMed

    Kuiper, Gerhardus J A J M; Houben, Rik; Wetzels, Rick J H; Verhezen, Paul W M; Oerle, Rene van; Ten Cate, Hugo; Henskens, Yvonne M C; Lancé, Marcus D

    2017-11-01

    Low platelet counts and hematocrit levels hinder whole blood point-of-care testing of platelet function. Thus far, no reference ranges for MEA (multiple electrode aggregometry) and PFA-100 (platelet function analyzer 100) devices exist for low ranges. Through dilution methods of volunteer whole blood, platelet function at low ranges of platelet count and hematocrit levels was assessed on MEA for four agonists and for PFA-100 in two cartridges. Using (multiple) regression analysis, 95% reference intervals were computed for these low ranges. Low platelet counts affected MEA in a positive correlation (all agonists showed r 2 ≥ 0.75) and PFA-100 in an inverse correlation (closure times were prolonged with lower platelet counts). Lowered hematocrit did not affect MEA testing, except for arachidonic acid activation (ASPI), which showed a weak positive correlation (r 2 = 0.14). Closure time on PFA-100 testing was inversely correlated with hematocrit for both cartridges. Regression analysis revealed different 95% reference intervals in comparison with originally established intervals for both MEA and PFA-100 in low platelet or hematocrit conditions. Multiple regression analysis of ASPI and both tests on the PFA-100 for combined low platelet and hematocrit conditions revealed that only PFA-100 testing should be adjusted for both thrombocytopenia and anemia. 95% reference intervals were calculated using multiple regression analysis. However, coefficients of determination of PFA-100 were poor, and some variance remained unexplained. Thus, in this pilot study using (multiple) regression analysis, we could establish reference intervals of platelet function in anemia and thrombocytopenia conditions on PFA-100 and in thrombocytopenia conditions on MEA.

  16. Deriving percentage study weights in multi-parameter meta-analysis models: with application to meta-regression, network meta-analysis and one-stage individual participant data models.

    PubMed

    Riley, Richard D; Ensor, Joie; Jackson, Dan; Burke, Danielle L

    2017-01-01

    Many meta-analysis models contain multiple parameters, for example due to multiple outcomes, multiple treatments or multiple regression coefficients. In particular, meta-regression models may contain multiple study-level covariates, and one-stage individual participant data meta-analysis models may contain multiple patient-level covariates and interactions. Here, we propose how to derive percentage study weights for such situations, in order to reveal the (otherwise hidden) contribution of each study toward the parameter estimates of interest. We assume that studies are independent, and utilise a decomposition of Fisher's information matrix to decompose the total variance matrix of parameter estimates into study-specific contributions, from which percentage weights are derived. This approach generalises how percentage weights are calculated in a traditional, single parameter meta-analysis model. Application is made to one- and two-stage individual participant data meta-analyses, meta-regression and network (multivariate) meta-analysis of multiple treatments. These reveal percentage study weights toward clinically important estimates, such as summary treatment effects and treatment-covariate interactions, and are especially useful when some studies are potential outliers or at high risk of bias. We also derive percentage study weights toward methodologically interesting measures, such as the magnitude of ecological bias (difference between within-study and across-study associations) and the amount of inconsistency (difference between direct and indirect evidence in a network meta-analysis).

  17. The mechanical properties of high speed GTAW weld and factors of nonlinear multiple regression model under external transverse magnetic field

    NASA Astrophysics Data System (ADS)

    Lu, Lin; Chang, Yunlong; Li, Yingmin; He, Youyou

    2013-05-01

    A transverse magnetic field was introduced to the arc plasma in the process of welding stainless steel tubes by high-speed Tungsten Inert Gas Arc Welding (TIG for short) without filler wire. The influence of external magnetic field on welding quality was investigated. 9 sets of parameters were designed by the means of orthogonal experiment. The welding joint tensile strength and form factor of weld were regarded as the main standards of welding quality. A binary quadratic nonlinear regression equation was established with the conditions of magnetic induction and flow rate of Ar gas. The residual standard deviation was calculated to adjust the accuracy of regression model. The results showed that, the regression model was correct and effective in calculating the tensile strength and aspect ratio of weld. Two 3D regression models were designed respectively, and then the impact law of magnetic induction on welding quality was researched.

  18. A Modified Double Multiple Nonlinear Regression Constitutive Equation for Modeling and Prediction of High Temperature Flow Behavior of BFe10-1-2 Alloy

    NASA Astrophysics Data System (ADS)

    Cai, Jun; Wang, Kuaishe; Shi, Jiamin; Wang, Wen; Liu, Yingying

    2018-01-01

    Constitutive analysis for hot working of BFe10-1-2 alloy was carried out by using experimental stress-strain data from isothermal hot compression tests, in a wide range of temperature of 1,023 1,273 K, and strain rate range of 0.001 10 s-1. A constitutive equation based on modified double multiple nonlinear regression was proposed considering the independent effects of strain, strain rate, temperature and their interrelation. The predicted flow stress data calculated from the developed equation was compared with the experimental data. Correlation coefficient (R), average absolute relative error (AARE) and relative errors were introduced to verify the validity of the developed constitutive equation. Subsequently, a comparative study was made on the capability of strain-compensated Arrhenius-type constitutive model. The results showed that the developed constitutive equation based on modified double multiple nonlinear regression could predict flow stress of BFe10-1-2 alloy with good correlation and generalization.

  19. Simultaneous estimation of transcutaneous bilirubin, hemoglobin, and melanin based on diffuse reflectance spectroscopy

    NASA Astrophysics Data System (ADS)

    Nishidate, Izumi; Abdul, Wares MD.; Ohtsu, Mizuki; Nakano, Kazuya; Haneishi, Hideaki

    2018-02-01

    We propose a method to estimate transcutaneous bilirubin, hemoglobin, and melanin based on the diffuse reflectance spectroscopy. In the proposed method, the Monte Carlo simulation-based multiple regression analysis for an absorbance spectrum in the visible wavelength region (460-590 nm) is used to specify the concentrations of bilirubin (Cbil), oxygenated hemoglobin (Coh), deoxygenated hemoglobin (Cdh), and melanin (Cm). Using the absorbance spectrum calculated from the measured diffuse reflectance spectrum as a response variable and the extinction coefficients of bilirubin, oxygenated hemoglobin, deoxygenated hemoglobin, and melanin, as predictor variables, multiple regression analysis provides regression coefficients. Concentrations of bilirubin, oxygenated hemoglobin, deoxygenated hemoglobin, and melanin, are then determined from the regression coefficients using conversion vectors that are numerically deduced in advance by the Monte Carlo simulations for light transport in skin. Total hemoglobin concentration (Cth) and tissue oxygen saturation (StO2) are simply calculated from the oxygenated hemoglobin and deoxygenated hemoglobin. In vivo animal experiments with bile duct ligation in rats demonstrated that the estimated Cbil is increased after ligation of bile duct and reaches to around 20 mg/dl at 72 h after the onset of the ligation, which corresponds to the reference value of Cbil measured by a commercially available transcutaneous bilirubin meter. We also performed in vivo experiments with rats while varying the fraction of inspired oxygen (FiO2). Coh and Cdh decreased and increased, respectively, as FiO2 decreased. Consequently, StO2 was dramatically decreased. The results in this study indicate potential of the method for simultaneous evaluation of multiple chromophores in skin tissue.

  20. A simplified calculation procedure for mass isotopomer distribution analysis (MIDA) based on multiple linear regression.

    PubMed

    Fernández-Fernández, Mario; Rodríguez-González, Pablo; García Alonso, J Ignacio

    2016-10-01

    We have developed a novel, rapid and easy calculation procedure for Mass Isotopomer Distribution Analysis based on multiple linear regression which allows the simultaneous calculation of the precursor pool enrichment and the fraction of newly synthesized labelled proteins (fractional synthesis) using linear algebra. To test this approach, we used the peptide RGGGLK as a model tryptic peptide containing three subunits of glycine. We selected glycine labelled in two 13 C atoms ( 13 C 2 -glycine) as labelled amino acid to demonstrate that spectral overlap is not a problem in the proposed methodology. The developed methodology was tested first in vitro by changing the precursor pool enrichment from 10 to 40% of 13 C 2 -glycine. Secondly, a simulated in vivo synthesis of proteins was designed by combining the natural abundance RGGGLK peptide and 10 or 20% 13 C 2 -glycine at 1 : 1, 1 : 3 and 3 : 1 ratios. Precursor pool enrichments and fractional synthesis values were calculated with satisfactory precision and accuracy using a simple spreadsheet. This novel approach can provide a relatively rapid and easy means to measure protein turnover based on stable isotope tracers. Copyright © 2016 John Wiley & Sons, Ltd. Copyright © 2016 John Wiley & Sons, Ltd.

  1. Linear viral load increase of a single HPV-type in women with multiple HPV infections predicts progression to cervical cancer.

    PubMed

    Depuydt, Christophe E; Thys, Sofie; Beert, Johan; Jonckheere, Jef; Salembier, Geert; Bogers, Johannes J

    2016-11-01

    Persistent high-risk human papillomavirus (HPV) infection is strongly associated with development of high-grade cervical intraepithelial neoplasia or cancer (CIN3+). In single type infections, serial type-specific viral-load measurements predict the natural history of the infection. In infections with multiple HPV-types, the individual type-specific viral-load profile could distinguish progressing HPV-infections from regressing infections. A case-cohort natural history study was established using samples from untreated women with multiple HPV-infections who developed CIN3+ (n = 57) or cleared infections (n = 88). Enriched cell pellet from liquid based cytology samples were subjected to a clinically validated real-time qPCR-assay (18 HPV-types). Using serial type-specific viral-load measurements (≥3) we calculated HPV-specific slopes and coefficient of determination (R(2) ) by linear regression. For each woman slopes and R(2) were used to calculate which HPV-induced processes were ongoing (progression, regression, serial transient, transient). In transient infections with multiple HPV-types, each single HPV-type generated similar increasing (0.27copies/cell/day) and decreasing (-0.27copies/cell/day) viral-load slopes. In CIN3+, at least one of the HPV-types had a clonal progressive course (R(2)  ≥ 0.85; 0.0025copies/cell/day). In selected CIN3+ cases (n = 6), immunostaining detecting type-specific HPV 16, 31, 33, 58 and 67 RNA showed an even staining in clonal populations (CIN3+), whereas in transient virion-producing infections the RNA-staining was less in the basal layer compared to the upper layer where cells were ready to desquamate and release newly-formed virions. RNA-hybridization patterns matched the calculated ongoing processes measured by R(2) and slope in serial type-specific viral-load measurements preceding the biopsy. In women with multiple HPV-types, serial type-specific viral-load measurements predict the natural history of the different HPV-types and elucidates HPV-genotype attribution. © 2016 UICC.

  2. Functional Regression Models for Epistasis Analysis of Multiple Quantitative Traits.

    PubMed

    Zhang, Futao; Xie, Dan; Liang, Meimei; Xiong, Momiao

    2016-04-01

    To date, most genetic analyses of phenotypes have focused on analyzing single traits or analyzing each phenotype independently. However, joint epistasis analysis of multiple complementary traits will increase statistical power and improve our understanding of the complicated genetic structure of the complex diseases. Despite their importance in uncovering the genetic structure of complex traits, the statistical methods for identifying epistasis in multiple phenotypes remains fundamentally unexplored. To fill this gap, we formulate a test for interaction between two genes in multiple quantitative trait analysis as a multiple functional regression (MFRG) in which the genotype functions (genetic variant profiles) are defined as a function of the genomic position of the genetic variants. We use large-scale simulations to calculate Type I error rates for testing interaction between two genes with multiple phenotypes and to compare the power with multivariate pairwise interaction analysis and single trait interaction analysis by a single variate functional regression model. To further evaluate performance, the MFRG for epistasis analysis is applied to five phenotypes of exome sequence data from the NHLBI's Exome Sequencing Project (ESP) to detect pleiotropic epistasis. A total of 267 pairs of genes that formed a genetic interaction network showed significant evidence of epistasis influencing five traits. The results demonstrate that the joint interaction analysis of multiple phenotypes has a much higher power to detect interaction than the interaction analysis of a single trait and may open a new direction to fully uncovering the genetic structure of multiple phenotypes.

  3. Evidencing the association between swimming capacities and performance indicators in water polo: a multiple regression study.

    PubMed

    Kontic, Dean; Zenic, Natasa; Uljevic, Ognjen; Sekulic, Damir; Lesnik, Blaz

    2017-06-01

    Swimming capacities are hypothesized to be important determinants of water polo performance but there is an evident lack of studies examining different swimming capacities in relation to specific offensive and defensive performance variables in this sport. The aim of this study was to determine the relationship between five swimming capacities and six performance determinants in water polo. The sample comprised 79 high-level youth water polo players (all males, 17-18 years of age). The variables included six performance-related variables (agility in offence and defense, efficacy in offence and defense, polyvalence in offence and defense), and five swimming-capacity tests (water polo sprint test [15 m], swimming sprint test [25 m], short-distance [100 m], aerobic endurance [400 m] and an anaerobic lactate endurance test [4× 50 m]). First, multiple regressions were calculated for one-half of the sample of subjects which were then validated with the remaining half of the sample. The 25-m swim was not included in the regression analyses due to the multicollinearity with other predictors. The originally calculated regression models were validated for defensive agility (R=0.67 and R=0.55 for the original regression calculation and validation subsample, respectively) offensive agility (R=0.59 and R=0.61), and offensive efficacy (R=0.64 and R=0.58). Anaerobic lactate endurance is a significant predictor of offensive and defensive agility, while 15 m sprint significantly contributes to offensive efficacy. Swimming capacities are not found to be related to the polyvalence of the players. The most superior offensive performance can be expected from those players with a high level of anaerobic lactate endurance and advanced sprinting capacity, while anaerobic lactate endurance is recognized as most important quality in defensive duties. Future studies should observe players' polyvalence in relation to (theoretical) knowledge of technical and tactical tasks. Results reinforce the need for the cross-validation of the prediction-models in sport and exercise sciences.

  4. Spectral regression and correlation coefficients of some benzaldimines and salicylaldimines in different solvents

    NASA Astrophysics Data System (ADS)

    Hammud, Hassan H.; Ghannoum, Amer; Masoud, Mamdouh S.

    2006-02-01

    Sixteen Schiff bases obtained from the condensation of benzaldehyde or salicylaldehyde with various amines (aniline, 4-carboxyaniline, phenylhydrazine, 2,4-dinitrophenylhydrazine, ethylenediamine, hydrazine, o-phenylenediamine and 2,6-pyridinediamine) are studied with UV-vis spectroscopy to observe the effect of solvents, substituents and other structural factors on the spectra. The bands involving different electronic transitions are interpreted. Computerized analysis and multiple regression techniques were applied to calculate the regression and correlation coefficients based on the equation that relates peak position λmax to the solvent parameters that depend on the H-bonding ability, refractive index and dielectric constant of solvents.

  5. Partial F-tests with multiply imputed data in the linear regression framework via coefficient of determination.

    PubMed

    Chaurasia, Ashok; Harel, Ofer

    2015-02-10

    Tests for regression coefficients such as global, local, and partial F-tests are common in applied research. In the framework of multiple imputation, there are several papers addressing tests for regression coefficients. However, for simultaneous hypothesis testing, the existing methods are computationally intensive because they involve calculation with vectors and (inversion of) matrices. In this paper, we propose a simple method based on the scalar entity, coefficient of determination, to perform (global, local, and partial) F-tests with multiply imputed data. The proposed method is evaluated using simulated data and applied to suicide prevention data. Copyright © 2014 John Wiley & Sons, Ltd.

  6. The Digital Shoreline Analysis System (DSAS) Version 4.0 - An ArcGIS extension for calculating shoreline change

    USGS Publications Warehouse

    Thieler, E. Robert; Himmelstoss, Emily A.; Zichichi, Jessica L.; Ergul, Ayhan

    2009-01-01

    The Digital Shoreline Analysis System (DSAS) version 4.0 is a software extension to ESRI ArcGIS v.9.2 and above that enables a user to calculate shoreline rate-of-change statistics from multiple historic shoreline positions. A user-friendly interface of simple buttons and menus guides the user through the major steps of shoreline change analysis. Components of the extension and user guide include (1) instruction on the proper way to define a reference baseline for measurements, (2) automated and manual generation of measurement transects and metadata based on user-specified parameters, and (3) output of calculated rates of shoreline change and other statistical information. DSAS computes shoreline rates of change using four different methods: (1) endpoint rate, (2) simple linear regression, (3) weighted linear regression, and (4) least median of squares. The standard error, correlation coefficient, and confidence interval are also computed for the simple and weighted linear-regression methods. The results of all rate calculations are output to a table that can be linked to the transect file by a common attribute field. DSAS is intended to facilitate the shoreline change-calculation process and to provide rate-of-change information and the statistical data necessary to establish the reliability of the calculated results. The software is also suitable for any generic application that calculates positional change over time, such as assessing rates of change of glacier limits in sequential aerial photos, river edge boundaries, land-cover changes, and so on.

  7. Comparing cluster-level dynamic treatment regimens using sequential, multiple assignment, randomized trials: Regression estimation and sample size considerations.

    PubMed

    NeCamp, Timothy; Kilbourne, Amy; Almirall, Daniel

    2017-08-01

    Cluster-level dynamic treatment regimens can be used to guide sequential treatment decision-making at the cluster level in order to improve outcomes at the individual or patient-level. In a cluster-level dynamic treatment regimen, the treatment is potentially adapted and re-adapted over time based on changes in the cluster that could be impacted by prior intervention, including aggregate measures of the individuals or patients that compose it. Cluster-randomized sequential multiple assignment randomized trials can be used to answer multiple open questions preventing scientists from developing high-quality cluster-level dynamic treatment regimens. In a cluster-randomized sequential multiple assignment randomized trial, sequential randomizations occur at the cluster level and outcomes are observed at the individual level. This manuscript makes two contributions to the design and analysis of cluster-randomized sequential multiple assignment randomized trials. First, a weighted least squares regression approach is proposed for comparing the mean of a patient-level outcome between the cluster-level dynamic treatment regimens embedded in a sequential multiple assignment randomized trial. The regression approach facilitates the use of baseline covariates which is often critical in the analysis of cluster-level trials. Second, sample size calculators are derived for two common cluster-randomized sequential multiple assignment randomized trial designs for use when the primary aim is a between-dynamic treatment regimen comparison of the mean of a continuous patient-level outcome. The methods are motivated by the Adaptive Implementation of Effective Programs Trial which is, to our knowledge, the first-ever cluster-randomized sequential multiple assignment randomized trial in psychiatry.

  8. Exact Interval Estimation, Power Calculation, and Sample Size Determination in Normal Correlation Analysis

    ERIC Educational Resources Information Center

    Shieh, Gwowen

    2006-01-01

    This paper considers the problem of analysis of correlation coefficients from a multivariate normal population. A unified theorem is derived for the regression model with normally distributed explanatory variables and the general results are employed to provide useful expressions for the distributions of simple, multiple, and partial-multiple…

  9. Identifying Pedophiles "Eligible" for Community Notification under Megan's Law: A Multivariate Model for Actuarially Anchored Decisions.

    ERIC Educational Resources Information Center

    Pallone, Nathaniel J.; Hennessy, James J.; Voelbel, Gerald T.

    1998-01-01

    A scientifically sound methodology for identifying offenders about whose presence the community should be notified is demonstrated. A stepwise multiple regression was calculated among incarcerated pedophiles (N=52) including both psychological and legal data; a precision-weighted equation produced 90.4% "true positives." This methodology can be…

  10. Simulation of land use change in the three gorges reservoir area based on CART-CA

    NASA Astrophysics Data System (ADS)

    Yuan, Min

    2018-05-01

    This study proposes a new method to simulate spatiotemporal complex multiple land uses by using classification and regression tree algorithm (CART) based CA model. In this model, we use classification and regression tree algorithm to calculate land class conversion probability, and combine neighborhood factor, random factor to extract cellular transformation rules. The overall Kappa coefficient is 0.8014 and the overall accuracy is 0.8821 in the land dynamic simulation results of the three gorges reservoir area from 2000 to 2010, and the simulation results are satisfactory.

  11. Predicting flight delay based on multiple linear regression

    NASA Astrophysics Data System (ADS)

    Ding, Yi

    2017-08-01

    Delay of flight has been regarded as one of the toughest difficulties in aviation control. How to establish an effective model to handle the delay prediction problem is a significant work. To solve the problem that the flight delay is difficult to predict, this study proposes a method to model the arriving flights and a multiple linear regression algorithm to predict delay, comparing with Naive-Bayes and C4.5 approach. Experiments based on a realistic dataset of domestic airports show that the accuracy of the proposed model approximates 80%, which is further improved than the Naive-Bayes and C4.5 approach approaches. The result testing shows that this method is convenient for calculation, and also can predict the flight delays effectively. It can provide decision basis for airport authorities.

  12. Estimating interaction on an additive scale between continuous determinants in a logistic regression model.

    PubMed

    Knol, Mirjam J; van der Tweel, Ingeborg; Grobbee, Diederick E; Numans, Mattijs E; Geerlings, Mirjam I

    2007-10-01

    To determine the presence of interaction in epidemiologic research, typically a product term is added to the regression model. In linear regression, the regression coefficient of the product term reflects interaction as departure from additivity. However, in logistic regression it refers to interaction as departure from multiplicativity. Rothman has argued that interaction estimated as departure from additivity better reflects biologic interaction. So far, literature on estimating interaction on an additive scale using logistic regression only focused on dichotomous determinants. The objective of the present study was to provide the methods to estimate interaction between continuous determinants and to illustrate these methods with a clinical example. and results From the existing literature we derived the formulas to quantify interaction as departure from additivity between one continuous and one dichotomous determinant and between two continuous determinants using logistic regression. Bootstrapping was used to calculate the corresponding confidence intervals. To illustrate the theory with an empirical example, data from the Utrecht Health Project were used, with age and body mass index as risk factors for elevated diastolic blood pressure. The methods and formulas presented in this article are intended to assist epidemiologists to calculate interaction on an additive scale between two variables on a certain outcome. The proposed methods are included in a spreadsheet which is freely available at: http://www.juliuscenter.nl/additive-interaction.xls.

  13. Models for predicting the mass of lime fruits by some engineering properties.

    PubMed

    Miraei Ashtiani, Seyed-Hassan; Baradaran Motie, Jalal; Emadi, Bagher; Aghkhani, Mohammad-Hosein

    2014-11-01

    Grading fruits based on mass is important in packaging and reduces the waste, also increases the marketing value of agricultural produce. The aim of this study was mass modeling of two major cultivars of Iranian limes based on engineering attributes. Models were classified into three: 1-Single and multiple variable regressions of lime mass and dimensional characteristics. 2-Single and multiple variable regressions of lime mass and projected areas. 3-Single regression of lime mass based on its actual volume and calculated volume assumed as ellipsoid and prolate spheroid shapes. All properties considered in the current study were found to be statistically significant (ρ < 0.01). The results indicated that mass modeling of lime based on minor diameter and first projected area are the most appropriate models in the first and the second classifications, respectively. In third classification, the best model was obtained on the basis of the prolate spheroid volume. It was finally concluded that the suitable grading system of lime mass is based on prolate spheroid volume.

  14. Simple linear and multivariate regression models.

    PubMed

    Rodríguez del Águila, M M; Benítez-Parejo, N

    2011-01-01

    In biomedical research it is common to find problems in which we wish to relate a response variable to one or more variables capable of describing the behaviour of the former variable by means of mathematical models. Regression techniques are used to this effect, in which an equation is determined relating the two variables. While such equations can have different forms, linear equations are the most widely used form and are easy to interpret. The present article describes simple and multiple linear regression models, how they are calculated, and how their applicability assumptions are checked. Illustrative examples are provided, based on the use of the freely accessible R program. Copyright © 2011 SEICAP. Published by Elsevier Espana. All rights reserved.

  15. SU-F-R-20: Image Texture Features Correlate with Time to Local Failure in Lung SBRT Patients

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Andrews, M; Abazeed, M; Woody, N

    Purpose: To explore possible correlation between CT image-based texture and histogram features and time-to-local-failure in early stage non-small cell lung cancer (NSCLC) patients treated with stereotactic body radiotherapy (SBRT).Methods and Materials: From an IRB-approved lung SBRT registry for patients treated between 2009–2013 we selected 48 (20 male, 28 female) patients with local failure. Median patient age was 72.3±10.3 years. Mean time to local failure was 15 ± 7.1 months. Physician-contoured gross tumor volumes (GTV) on the planning CT images were processed and 3D gray-level co-occurrence matrix (GLCM) based texture and histogram features were calculated in Matlab. Data were exported tomore » R and a multiple linear regression model was used to examine the relationship between texture features and time-to-local-failure. Results: Multiple linear regression revealed that entropy (p=0.0233, multiple R2=0.60) from GLCM-based texture analysis and the standard deviation (p=0.0194, multiple R2=0.60) from the histogram-based features were statistically significantly correlated with the time-to-local-failure. Conclusion: Image-based texture analysis can be used to predict certain aspects of treatment outcomes of NSCLC patients treated with SBRT. We found entropy and standard deviation calculated for the GTV on the CT images displayed a statistically significant correlation with and time-to-local-failure in lung SBRT patients.« less

  16. Estimating Time to Event From Longitudinal Categorical Data: An Analysis of Multiple Sclerosis Progression.

    PubMed

    Mandel, Micha; Gauthier, Susan A; Guttmann, Charles R G; Weiner, Howard L; Betensky, Rebecca A

    2007-12-01

    The expanded disability status scale (EDSS) is an ordinal score that measures progression in multiple sclerosis (MS). Progression is defined as reaching EDSS of a certain level (absolute progression) or increasing of one point of EDSS (relative progression). Survival methods for time to progression are not adequate for such data since they do not exploit the EDSS level at the end of follow-up. Instead, we suggest a Markov transitional model applicable for repeated categorical or ordinal data. This approach enables derivation of covariate-specific survival curves, obtained after estimation of the regression coefficients and manipulations of the resulting transition matrix. Large sample theory and resampling methods are employed to derive pointwise confidence intervals, which perform well in simulation. Methods for generating survival curves for time to EDSS of a certain level, time to increase of EDSS of at least one point, and time to two consecutive visits with EDSS greater than three are described explicitly. The regression models described are easily implemented using standard software packages. Survival curves are obtained from the regression results using packages that support simple matrix calculation. We present and demonstrate our method on data collected at the Partners MS center in Boston, MA. We apply our approach to progression defined by time to two consecutive visits with EDSS greater than three, and calculate crude (without covariates) and covariate-specific curves.

  17. Evaluating the relationship between wildfire extent and nitrogen dry deposition in a boreal forest in interior Alaska

    NASA Astrophysics Data System (ADS)

    Nagano, Hirohiko; Iwata, Hiroki

    2017-03-01

    Alaska wildfires may play an important role in nitrogen (N) dry deposition in Alaskan boreal forests. Here we used annual N dry deposition data measured by CASTNET at Denali National Park (DEN417) during 1999-2013, to evaluate the relationships between wildfire extent and N dry deposition in Alaska. We established six potential factors for multiple regression analysis, including burned area within 100 km of DEN417 (BA100km) and in other distant parts of Alaska (BAAK), the sum of indexes of North Atlantic Oscillation and Arctic Oscillation (OI), number of days with negative OI (OIday), precipitation (PRCP), and number of days with PRCP (PRCPday). Multiple regression analysis was conducted for both time scales, annual (using only annual values of factors) and six-month (using annual values of BAAK and BA100km, and fire and non-fire seasons' values of other four factors) time scales. Together, BAAK, BA100km, and OIday, along with PRCPday in the case of the six-month scale, explained more than 92% of the interannual variation in N dry deposition. The influence of BA100km on N dry deposition was ten-fold greater than from BAAK; the qualitative contribution was almost zero, however, due to the small BA100km. BAAK was the leading explanatory factor, with a 15 ± 14% contribution. We further calculated N dry deposition during 1950-2013 using the obtained regression equation and long-term records for the factors. The N dry deposition calculated for 1950-2013 revealed that an increased occurrence of wildfires during the 2000s led to the maximum N dry deposition exhibited during this decade. As a result, the effect of BAAK on N dry deposition remains sufficiently large, even when large possible uncertainties (>40%) in the measurement of N dry deposition are taken into account for the multiple regression analysis.

  18. Malignant testicular tumour incidence and mortality trends

    PubMed Central

    Wojtyła-Buciora, Paulina; Więckowska, Barbara; Krzywinska-Wiewiorowska, Małgorzata; Gromadecka-Sutkiewicz, Małgorzata

    2016-01-01

    Aim of the study In Poland testicular tumours are the most frequent cancer among men aged 20–44 years. Testicular tumour incidence since the 1980s and 1990s has been diversified geographically, with an increased risk of mortality in Wielkopolska Province, which was highlighted at the turn of the 1980s and 1990s. The aim of the study was the comparative analysis of the tendencies in incidence and death rates due to malignant testicular tumours observed among men in Poland and in Wielkopolska Province. Material and methods Data from the National Cancer Registry were used for calculations. The incidence/mortality rates among men due to malignant testicular cancer as well as the tendencies in incidence/death ratio observed in Poland and Wielkopolska were established based on regression equation. The analysis was deepened by adopting the multiple linear regression model. A p-value < 0.05 was arbitrarily adopted as the criterion of statistical significance, and for multiple comparisons it was modified according to the Bonferroni adjustment to a value of p < 0.0028. Calculations were performed with the use of PQStat v1.4.8 package. Results The incidence of malignant testicular neoplasms observed among men in Poland and in Wielkopolska Province indicated a significant rising tendency. The multiple linear regression model confirmed that the year variable is a strong incidence forecast factor only within the territory of Poland. A corresponding analysis of mortality rates among men in Poland and in Wielkopolska Province did not show any statistically significant correlations. Conclusions Late diagnosis of Polish patients calls for undertaking appropriate educational activities that would facilitate earlier reporting of the patients, thus increasing their chances for recovery. Introducing preventive examinations in the regions of increased risk of testicular tumour may allow earlier diagnosis. PMID:27095941

  19. Predictive ability of a comprehensive incremental test in mountain bike marathon.

    PubMed

    Ahrend, Marc-Daniel; Schneeweiss, Patrick; Martus, Peter; Niess, Andreas M; Krauss, Inga

    2018-01-01

    Traditional performance tests in mountain bike marathon (XCM) primarily quantify aerobic metabolism and may not describe the relevant capacities in XCM. We aimed to validate a comprehensive test protocol quantifying its intermittent demands. Forty-nine athletes (38.8±9.1 years; 38 male; 11 female) performed a laboratory performance test, including an incremental test, to determine individual anaerobic threshold (IAT), peak power output (PPO) and three maximal efforts (10 s all-out sprint, 1 min maximal effort and 5 min maximal effort). Within 2 weeks, the athletes participated in one of three XCM races (n=15, n=9 and n=25). Correlations between test variables and race times were calculated separately. In addition, multiple regression models of the predictive value of laboratory outcomes were calculated for race 3 and across all races (z-transformed data). All variables were correlated with race times 1, 2 and 3: 10 s all-out sprint (r=-0.72; r=-0.59; r=-0.61), 1 min maximal effort (r=-0.85; r=-0.84; r=-0.82), 5 min maximal effort (r=-0.57; r=-0.85; r=-0.76), PPO (r=-0.77; r=-0.73; r=-0.76) and IAT (r=-0.71; r=-0.67; r=-0.68). The best-fitting multiple regression models for race 3 (r 2 =0.868) and across all races (r 2 =0.757) comprised 1 min maximal effort, IAT and body weight. Aerobic and intermittent variables correlated least strongly with race times. Their use in a multiple regression model confirmed additional explanatory power to predict XCM performance. These findings underline the usefulness of the comprehensive incremental test to predict performance in that sport more precisely.

  20. Relationship between postoperative refractive outcomes and cataract density: multiple regression analysis.

    PubMed

    Ueda, Tetsuo; Ikeda, Hitoe; Ota, Takeo; Matsuura, Toyoaki; Hara, Yoshiaki

    2010-05-01

    To evaluate the relationship between cataract density and the deviation from the predicted refraction. Department of Ophthalmology, Nara Medical University, Kashihara, Japan. Axial length (AL) was measured in eyes with mainly nuclear cataract using partial coherence interferometry (IOLMaster). The postoperative AL was measured in pseudophakic mode. The AL difference was calculated by subtracting the postoperative AL from the preoperative AL. Cataract density was measured with the pupil dilated using anterior segment Scheimpflug imaging (EAS-1000). The predicted postoperative refraction was calculated using the SRK/T formula. The subjective refraction 3 months postoperatively was also measured. The mean absolute prediction error (MAE) (mean of absolute difference between predicted postoperative refraction and spherical equivalent of postoperative subjective refraction) was calculated. The relationship between the MAE and cataract density, age, preoperative visual acuity, anterior chamber depth, corneal radius of curvature, and AL difference was evaluated using multiple regression analysis. In the 96 eyes evaluated, the MAE was correlated with cataract density (r = 0.37, P = .001) and the AL difference (r = 0.34, P = .003) but not with the other parameters. The AL difference was correlated with cataract density (r = 0.53, P<.0001). The postoperative refractive outcome was affected by cataract density. This should be taken into consideration in eyes with a higher density cataract. (c) 2010 ASCRS and ESCRS. Published by Elsevier Inc. All rights reserved.

  1. Application of third molar development and eruption models in estimating dental age in Malay sub-adults.

    PubMed

    Mohd Yusof, Mohd Yusmiaidil Putera; Cauwels, Rita; Deschepper, Ellen; Martens, Luc

    2015-08-01

    The third molar development (TMD) has been widely utilized as one of the radiographic method for dental age estimation. By using the same radiograph of the same individual, third molar eruption (TME) information can be incorporated to the TMD regression model. This study aims to evaluate the performance of dental age estimation in individual method models and the combined model (TMD and TME) based on the classic regressions of multiple linear and principal component analysis. A sample of 705 digital panoramic radiographs of Malay sub-adults aged between 14.1 and 23.8 years was collected. The techniques described by Gleiser and Hunt (modified by Kohler) and Olze were employed to stage the TMD and TME, respectively. The data was divided to develop three respective models based on the two regressions of multiple linear and principal component analysis. The trained models were then validated on the test sample and the accuracy of age prediction was compared between each model. The coefficient of determination (R²) and root mean square error (RMSE) were calculated. In both genders, adjusted R² yielded an increment in the linear regressions of combined model as compared to the individual models. The overall decrease in RMSE was detected in combined model as compared to TMD (0.03-0.06) and TME (0.2-0.8). In principal component regression, low value of adjusted R(2) and high RMSE except in male were exhibited in combined model. Dental age estimation is better predicted using combined model in multiple linear regression models. Copyright © 2015 Elsevier Ltd and Faculty of Forensic and Legal Medicine. All rights reserved.

  2. Revising the Rorschach Ego Impairment Index to Accommodate Recent Recommendations about Improving Rorschach Validity

    ERIC Educational Resources Information Center

    Viglione, Donald J.; Perry, William; Giromini, Luciano; Meyer, Gregory J.

    2011-01-01

    We used multiple regression to calculate a new Ego Impairment Index (EII-3). The aim was to incorporate changes in the component variables and distribution of the number of responses as found in the new Rorschach Performance Assessment System, while sustaining the validity and reliability of previous EIIs. The EII-3 formula was derived from a…

  3. Pumps and warmers during amnioinfusion: is either necessary?

    PubMed

    Glantz, J C; Letteney, D L

    1996-01-01

    To determine if there is evidence from published reports that the use of infusion pumps or solution warmers during amnioinfusion is beneficial. We identified all English-language amnioinfusion reports published since 1983 through Medline and references. Fourteen prospective papers with at least 40 subjects were identified. For the amnioinfusion and control groups in each study, odds ratios (OR) were calculated for cesarean delivery, fetal distress, meconium below the cords, low 5-minute Apgar score, and endometritis. Cumulative ORs were calculated using the Mantel-Haenszel inverse variance method. This process was repeated after separation into pump-gravity and warmed-unwarmed groups. Multiple regression analyses were performed. Amnioinfusion improved the ability of the fetus to tolerate labor (fetal distress OR 0.40), decreased the incidence of meconium below the cords (OR 0.16), and decreased the rate of cesarean delivery (OR 0.56). There were no demonstrable benefits associated with the use of warmers or pumps. In multiple regression analysis, infusion pumps were associated with a significantly increased risk of fetal distress (P = .01). The use of amnioinfusion is associated with a decreased risk of fetal distress, meconium below the cords, and cesarean delivery. To date, there is no demonstrable benefit using infusion pumps or solution warmers during amnioinfusion.

  4. Linear regression analysis: part 14 of a series on evaluation of scientific publications.

    PubMed

    Schneider, Astrid; Hommel, Gerhard; Blettner, Maria

    2010-11-01

    Regression analysis is an important statistical method for the analysis of medical data. It enables the identification and characterization of relationships among multiple factors. It also enables the identification of prognostically relevant risk factors and the calculation of risk scores for individual prognostication. This article is based on selected textbooks of statistics, a selective review of the literature, and our own experience. After a brief introduction of the uni- and multivariable regression models, illustrative examples are given to explain what the important considerations are before a regression analysis is performed, and how the results should be interpreted. The reader should then be able to judge whether the method has been used correctly and interpret the results appropriately. The performance and interpretation of linear regression analysis are subject to a variety of pitfalls, which are discussed here in detail. The reader is made aware of common errors of interpretation through practical examples. Both the opportunities for applying linear regression analysis and its limitations are presented.

  5. Regression modeling of ground-water flow

    USGS Publications Warehouse

    Cooley, R.L.; Naff, R.L.

    1985-01-01

    Nonlinear multiple regression methods are developed to model and analyze groundwater flow systems. Complete descriptions of regression methodology as applied to groundwater flow models allow scientists and engineers engaged in flow modeling to apply the methods to a wide range of problems. Organization of the text proceeds from an introduction that discusses the general topic of groundwater flow modeling, to a review of basic statistics necessary to properly apply regression techniques, and then to the main topic: exposition and use of linear and nonlinear regression to model groundwater flow. Statistical procedures are given to analyze and use the regression models. A number of exercises and answers are included to exercise the student on nearly all the methods that are presented for modeling and statistical analysis. Three computer programs implement the more complex methods. These three are a general two-dimensional, steady-state regression model for flow in an anisotropic, heterogeneous porous medium, a program to calculate a measure of model nonlinearity with respect to the regression parameters, and a program to analyze model errors in computed dependent variables such as hydraulic head. (USGS)

  6. Estimation of standard liver volume in Chinese adult living donors.

    PubMed

    Fu-Gui, L; Lu-Nan, Y; Bo, L; Yong, Z; Tian-Fu, W; Ming-Qing, X; Wen-Tao, W; Zhe-Yu, C

    2009-12-01

    To determine a formula predicting the standard liver volume based on body surface area (BSA) or body weight in Chinese adults. A total of 115 consecutive right-lobe living donors not including the middle hepatic vein underwent right hemi-hepatectomy. No organs were used from prisoners, and no subjects were prisoners. Donor anthropometric data including age, gender, body weight, and body height were recorded prospectively. The weights and volumes of the right lobe liver grafts were measured at the back table. Liver weights and volumes were calculated from the right lobe graft weight and volume obtained at the back table, divided by the proportion of the right lobe on computed tomography. By simple linear regression analysis and stepwise multiple linear regression analysis, we correlated calculated liver volume and body height, body weight, or body surface area. The subjects had a mean age of 35.97 +/- 9.6 years, and a female-to-male ratio of 60:55. The mean volume of the right lobe was 727.47 +/- 136.17 mL, occupying 55.59% +/- 6.70% of the whole liver by computed tomography. The volume of the right lobe was 581.73 +/- 96.137 mL, and the estimated liver volume was 1053.08 +/- 167.56 mL. Females of the same body weight showed a slightly lower liver weight. By simple linear regression analysis and stepwise multiple linear regression analysis, a formula was derived based on body weight. All formulae except the Hong Kong formula overestimated liver volume compared to this formula. The formula of standard liver volume, SLV (mL) = 11.508 x body weight (kg) + 334.024, may be applied to estimate liver volumes in Chinese adults.

  7. Anodic microbial community diversity as a predictor of the power output of microbial fuel cells.

    PubMed

    Stratford, James P; Beecroft, Nelli J; Slade, Robert C T; Grüning, André; Avignone-Rossa, Claudio

    2014-03-01

    The relationship between the diversity of mixed-species microbial consortia and their electrogenic potential in the anodes of microbial fuel cells was examined using different diversity measures as predictors. Identical microbial fuel cells were sampled at multiple time-points. Biofilm and suspension communities were analysed by denaturing gradient gel electrophoresis to calculate the number and relative abundance of species. Shannon and Simpson indices and richness were examined for association with power using bivariate and multiple linear regression, with biofilm DNA as an additional variable. In simple bivariate regressions, the correlation of Shannon diversity of the biofilm and power is stronger (r=0.65, p=0.001) than between power and richness (r=0.39, p=0.076), or between power and the Simpson index (r=0.5, p=0.018). Using Shannon diversity and biofilm DNA as predictors of power, a regression model can be constructed (r=0.73, p<0.001). Ecological parameters such as the Shannon index are predictive of the electrogenic potential of microbial communities. Copyright © 2014 Elsevier Ltd. All rights reserved.

  8. Independence of heritable influences on the food intake of free-living humans.

    PubMed

    de Castro, John M

    2002-01-01

    The time of day of meal ingestion, the number of people present at the meal, the subjective state of hunger, and the estimated before-meal contents in the stomach have been established as influences on the amount eaten in a meal and these influences have been shown to be heritable. Because these factors intercorrelate, the calculated heritabilities for some of these variables might result indirectly from their covariation with one of the other heritable variables. The independence of the heritability of the influence of these four factors was investigated with 110 identical and 102 fraternal same-sex and 53 fraternal mixed-sex adult twin pairs who were paid to maintain 7-d food-intake diaries. From the diary reports, the meal sizes were calculated and subjected to multiple regression analysis using the estimated before-meal stomach contents, the reported number of other people present, the subjective hunger ratings, and the time of day of the meal as predictors. Linear structural modeling was applied to the beta-coefficients from the multiple regression to investigate whether the heritability of the influences of these four variables was independent. Significant genetic effects were found for the beta-coefficients for all four variables, indicating that the heritability of their relationship with intake is to some extent independent and heritable. This suggests that influences of multiple factors on intake are influenced by the genes and become part of the total package of genetically determined physiologic, sociocultural, and psychological processes that regulate energy balance.

  9. Heritability of diurnal changes in food intake in free-living humans.

    PubMed

    de Castro, J M

    2001-09-01

    The time of day of meal ingestion, the number of people present at the meal, the subjective state of hunger, and the estimated before-meal contents in the stomach have been established as influences on the amount eaten in a meal, and this influence has been shown to be heritable. Because these factors intercorrelate, the possibility that the calculated heritabilities for some of these variables could result indirectly from their convariation with one of the other heritable variables was assessed. The independence of the heritability of the influence of these four factors was investigated with 110 identical and 102 fraternal same-sex and 53 fraternal mixed-sex adult twin pairs who were paid to maintain 7-d food intake diaries. From the diary reports, the meal sizes were calculated and subjected to multiple regression analysis using the estimated before-meal stomach contents, the reported number of other people present, the subjective hunger ratings, and the time of day of the meal as predictors. Linear structural modeling was applied to the beta coefficients from the multiple regression to investigate whether the heritability of the influences of these four variables was independent. Significant genetic effects were found for the beta coefficients for all four variables, indicating that the heritability of their relationship with intake is to some extent heritable. These results suggest that the influences of multiple factors on intake are influenced by the genes and become part of the total package of genetically determined physiologic, sociocultural, and psychological processes that regulate energy balance.

  10. NIMEFI: gene regulatory network inference using multiple ensemble feature importance algorithms.

    PubMed

    Ruyssinck, Joeri; Huynh-Thu, Vân Anh; Geurts, Pierre; Dhaene, Tom; Demeester, Piet; Saeys, Yvan

    2014-01-01

    One of the long-standing open challenges in computational systems biology is the topology inference of gene regulatory networks from high-throughput omics data. Recently, two community-wide efforts, DREAM4 and DREAM5, have been established to benchmark network inference techniques using gene expression measurements. In these challenges the overall top performer was the GENIE3 algorithm. This method decomposes the network inference task into separate regression problems for each gene in the network in which the expression values of a particular target gene are predicted using all other genes as possible predictors. Next, using tree-based ensemble methods, an importance measure for each predictor gene is calculated with respect to the target gene and a high feature importance is considered as putative evidence of a regulatory link existing between both genes. The contribution of this work is twofold. First, we generalize the regression decomposition strategy of GENIE3 to other feature importance methods. We compare the performance of support vector regression, the elastic net, random forest regression, symbolic regression and their ensemble variants in this setting to the original GENIE3 algorithm. To create the ensemble variants, we propose a subsampling approach which allows us to cast any feature selection algorithm that produces a feature ranking into an ensemble feature importance algorithm. We demonstrate that the ensemble setting is key to the network inference task, as only ensemble variants achieve top performance. As second contribution, we explore the effect of using rankwise averaged predictions of multiple ensemble algorithms as opposed to only one. We name this approach NIMEFI (Network Inference using Multiple Ensemble Feature Importance algorithms) and show that this approach outperforms all individual methods in general, although on a specific network a single method can perform better. An implementation of NIMEFI has been made publicly available.

  11. Novel Index (Hepatic Receptor: IHR) to Evaluate Hepatic Functional Reserve Using (99m)Tc-GSA Scintigraphy.

    PubMed

    Hasegawa, Daisuke; Onishi, Hideo; Matsutomo, Norikazu

    2016-02-01

    This study aimed to evaluate the novel index of hepatic receptor (IHR) on the regression analysis derived from time activity curve of the liver for hepatic functional reserve. Sixty patients had undergone (99m)Tc-galactosyl serum albumin ((99m)Tc-GSA) scintigraphy in the retrospective clinical study. Time activity curves for liver were obtained by region of interest (ROI) on the whole liver. A novel hepatic functional predictor was calculated with multiple regression analysis of time activity curves. In the multiple regression function, the objective variables were the indocyanine green (ICG) retention rate at 15 min, and the explanatory variables were the liver counts in 3-min intervals until end from beginning. Then, this result was defined by IHR, and we analyzed the correlation between IHR and ICG, uptake ratio of the heart at 15 minutes to that at 3 minutes (HH15), uptake ratio of the liver to the liver plus heart at 15 minutes (LHL15), and index of convexity (IOC). Regression function of IHR was derived as follows: IHR=0.025×L(6)-0.052×L(12)+0.027×L(27). The multiple regression analysis indicated that liver counts at 6 min, 12 min, and 27 min were significantly related to objective variables. The correlation coefficient between IHR and ICG was 0.774, and the correlation coefficient between ICG and conventional indices (HH15, LHL15, and IOC) were 0.837, 0.773, and 0.793, respectively. IHR had good correlation with HH15, LHL15, and IOC. The finding results suggested that IHR would provide clinical benefit for hepatic functional assessment in the (99m)Tc-GSA scintigraphy.

  12. Calculating stage duration statistics in multistage diseases.

    PubMed

    Komarova, Natalia L; Thalhauser, Craig J

    2011-01-01

    Many human diseases are characterized by multiple stages of progression. While the typical sequence of disease progression can be identified, there may be large individual variations among patients. Identifying mean stage durations and their variations is critical for statistical hypothesis testing needed to determine if treatment is having a significant effect on the progression, or if a new therapy is showing a delay of progression through a multistage disease. In this paper we focus on two methods for extracting stage duration statistics from longitudinal datasets: an extension of the linear regression technique, and a counting algorithm. Both are non-iterative, non-parametric and computationally cheap methods, which makes them invaluable tools for studying the epidemiology of diseases, with a goal of identifying different patterns of progression by using bioinformatics methodologies. Here we show that the regression method performs well for calculating the mean stage durations under a wide variety of assumptions, however, its generalization to variance calculations fails under realistic assumptions about the data collection procedure. On the other hand, the counting method yields reliable estimations for both means and variances of stage durations. Applications to Alzheimer disease progression are discussed.

  13. Paleotemperature reconstruction from mammalian phosphate δ18O records - an alternative view on data processing

    NASA Astrophysics Data System (ADS)

    Skrzypek, Grzegorz; Sadler, Rohan; Wiśniewski, Andrzej

    2017-04-01

    The stable oxygen isotope composition of phosphates (δ18O) extracted from mammalian bone and teeth material is commonly used as a proxy for paleotemperature. Historically, several different analytical and statistical procedures for determining air paleotemperatures from the measured δ18O of phosphates have been applied. This inconsistency in both stable isotope data processing and the application of statistical procedures has led to large and unwanted differences between calculated results. This study presents the uncertainty associated with two of the most commonly used regression methods: least squares inverted fit and transposed fit. We assessed the performance of these methods by designing and applying calculation experiments to multiple real-life data sets, calculating in reverse temperatures, and comparing them with true recorded values. Our calculations clearly show that the mean absolute errors are always substantially higher for the inverted fit (a causal model), with the transposed fit (a predictive model) returning mean values closer to the measured values (Skrzypek et al. 2015). The predictive models always performed better than causal models, with 12-65% lower mean absolute errors. Moreover, the least-squares regression (LSM) model is more appropriate than Reduced Major Axis (RMA) regression for calculating the environmental water stable oxygen isotope composition from phosphate signatures, as well as for calculating air temperature from the δ18O value of environmental water. The transposed fit introduces a lower overall error than the inverted fit for both the δ18O of environmental water and Tair calculations; therefore, the predictive models are more statistically efficient than the causal models in this instance. The direct comparison of paleotemperature results from different laboratories and studies may only be achieved if a single method of calculation is applied. Reference Skrzypek G., Sadler R., Wiśniewski A., 2016. Reassessment of recommendations for processing mammal phosphate δ18O data for paleotemperature reconstruction. Palaeogeography, Palaeoclimatology, Palaeoecology 446, 162-167.

  14. Catalog of Air Force Weather Technical Documents, 1941-2006

    DTIC Science & Technology

    2006-05-19

    radiosondes in current use in USA. Elementary discussion of statistical terms and concepts used for expressing accuracy or error is discussed. AWS TR 105...Techniques, Appendix B: Vorticity—An Elementary Discussion of the Concept, August 1956, 27pp. Formerly AWSM 105– 50/1A. Provides the necessary back...steps involved in ordinary multiple linear regression. Conditional probability is calculated using transnormalized variables in the multivariate normal

  15. Triglyceride glucose index and common carotid wall shear stress.

    PubMed

    Tripolino, Cesare; Irace, Concetta; Scavelli, Faustina B; de Franceschi, Maria S; Esposito, Teresa; Carallo, Claudio; Gnasso, Agostino

    2014-02-01

    Alterations in wall shear stress contribute to both clinical and subclinical atherosclerosis. Several conditions such as hypertension, diabetes, and obesity can impair shear stress, but the role of insulin resistance has never been investigated. The present study was designed to investigate whether insulin resistance assessed by TyG Index associates with wall shear stress in the common carotid artery. One hundred six individuals were enrolled. Blood pressure, lipids, glucose, and cigarette smoking were evaluated. TyG Index was calculated as log[fasting triglycerides × fasting glucose / 2]. Subjects underwent blood viscosity measurement and echo-Doppler evaluation of carotid arteries to calculate wall shear stress. The association between TyG Index and carotid wall shear stress was assessed by simple and multiple regression analyses. TyG Index was significantly and inversely associated with carotid wall shear stress both in simple (r = -0.44, P < 0.001) and multiple regression analyses accounting for age, sex, and major cardiovascular risk factors. The association was further confirmed after exclusion of subjects with diabetes, dyslipidemia, fasting blood glucose greater than 100 mg/dL, and triglycerides greater than 150 mg/dL. The present findings suggest that increasing insulin resistance, as assessed by TyG Index, associates with atherosclerosis-prone shear stress reduction in the common carotid artery.

  16. Do climate variables and human density affect Achatina fulica (Bowditch) (Gastropoda: Pulmonata) shell length, total weight and condition factor?

    PubMed

    Albuquerque, F S; Peso-Aguiar, M C; Assunção-Albuquerque, M J T; Gálvez, L

    2009-08-01

    The length-weight relationship and condition factor have been broadly investigated in snails to obtain the index of physical condition of populations and evaluate habitat quality. Herein, our goal was to describe the best predictors that explain Achatina fulica biometrical parameters and well being in a recently introduced population. From November 2001 to November 2002, monthly snail samples were collected in Lauro de Freitas City, Bahia, Brazil. Shell length and total weight were measured in the laboratory and the potential curve and condition factor were calculated. Five environmental variables were considered: temperature range, mean temperature, humidity, precipitation and human density. Multiple regressions were used to generate models including multiple predictors, via model selection approach, and then ranked with AIC criteria. Partial regressions were used to obtain the separated coefficients of determination of climate and human density models. A total of 1.460 individuals were collected, presenting a shell length range between 4.8 to 102.5 mm (mean: 42.18 mm). The relationship between total length and total weight revealed that Achatina fulica presented a negative allometric growth. Simple regression indicated that humidity has a significant influence on A. fulica total length and weight. Temperature range was the main variable that influenced the condition factor. Multiple regressions showed that climatic and human variables explain a small proportion of the variance in shell length and total weight, but may explain up to 55.7% of the condition factor variance. Consequently, we believe that the well being and biometric parameters of A. fulica can be influenced by climatic and human density factors.

  17. Predictive ability of a comprehensive incremental test in mountain bike marathon

    PubMed Central

    Schneeweiss, Patrick; Martus, Peter; Niess, Andreas M; Krauss, Inga

    2018-01-01

    Objectives Traditional performance tests in mountain bike marathon (XCM) primarily quantify aerobic metabolism and may not describe the relevant capacities in XCM. We aimed to validate a comprehensive test protocol quantifying its intermittent demands. Methods Forty-nine athletes (38.8±9.1 years; 38 male; 11 female) performed a laboratory performance test, including an incremental test, to determine individual anaerobic threshold (IAT), peak power output (PPO) and three maximal efforts (10 s all-out sprint, 1 min maximal effort and 5 min maximal effort). Within 2 weeks, the athletes participated in one of three XCM races (n=15, n=9 and n=25). Correlations between test variables and race times were calculated separately. In addition, multiple regression models of the predictive value of laboratory outcomes were calculated for race 3 and across all races (z-transformed data). Results All variables were correlated with race times 1, 2 and 3: 10 s all-out sprint (r=−0.72; r=−0.59; r=−0.61), 1 min maximal effort (r=−0.85; r=−0.84; r=−0.82), 5 min maximal effort (r=−0.57; r=−0.85; r=−0.76), PPO (r=−0.77; r=−0.73; r=−0.76) and IAT (r=−0.71; r=−0.67; r=−0.68). The best-fitting multiple regression models for race 3 (r2=0.868) and across all races (r2=0.757) comprised 1 min maximal effort, IAT and body weight. Conclusion Aerobic and intermittent variables correlated least strongly with race times. Their use in a multiple regression model confirmed additional explanatory power to predict XCM performance. These findings underline the usefulness of the comprehensive incremental test to predict performance in that sport more precisely. PMID:29387445

  18. Parameter estimation of multivariate multiple regression model using bayesian with non-informative Jeffreys’ prior distribution

    NASA Astrophysics Data System (ADS)

    Saputro, D. R. S.; Amalia, F.; Widyaningsih, P.; Affan, R. C.

    2018-05-01

    Bayesian method is a method that can be used to estimate the parameters of multivariate multiple regression model. Bayesian method has two distributions, there are prior and posterior distributions. Posterior distribution is influenced by the selection of prior distribution. Jeffreys’ prior distribution is a kind of Non-informative prior distribution. This prior is used when the information about parameter not available. Non-informative Jeffreys’ prior distribution is combined with the sample information resulting the posterior distribution. Posterior distribution is used to estimate the parameter. The purposes of this research is to estimate the parameters of multivariate regression model using Bayesian method with Non-informative Jeffreys’ prior distribution. Based on the results and discussion, parameter estimation of β and Σ which were obtained from expected value of random variable of marginal posterior distribution function. The marginal posterior distributions for β and Σ are multivariate normal and inverse Wishart. However, in calculation of the expected value involving integral of a function which difficult to determine the value. Therefore, approach is needed by generating of random samples according to the posterior distribution characteristics of each parameter using Markov chain Monte Carlo (MCMC) Gibbs sampling algorithm.

  19. Use of a pretest strategy for physical therapist assistant programs to predict success rate on the national physical therapy exam.

    PubMed

    Sloas, Stacey B; Keith, Becky; Whitehead, Malcolm T

    2013-01-01

    This study investigated a pretest strategy that identified physical therapist assistant (PTA) students who were at risk of failure on the National Physical Therapy Examination (NPTE). Program assessment data from five cohorts of PTA students (2005-2009) were used to develop a stepwise multiple regression formula that predicted first-time NPTE licensure scores. Data used included the Nelson-Denny Reading Test, grades from eight core courses, grade point average upon admission to the program, and scores from three mock NPTE exams given during the program. Pearson correlation coefficients were calculated between each of the 15 variables and NPTE scores. Stepwise multiple regression analysis was performed using data collected at the ends of the first, second, and third (final) semesters of the program. Data from the class of 2010 were then used to validate the formula. The end-of-program formula accounted for the greatest variance (57%) in predicted scores. Those students scoring below a predicted scaled score of 620 were identified to be at risk of failure of the licensure exam. These students were counseled, and a remedial plan was developed based on regression predictions prior to them sitting for the licensure exam.

  20. Adherence to preferable behavior for lipid control by high-risk dyslipidemic Japanese patients under pravastatin treatment: the APPROACH-J study.

    PubMed

    Kitagawa, Yasuhisa; Teramoto, Tamio; Daida, Hiroyuki

    2012-01-01

    We evaluated the impact of adherence to preferable behavior on serum lipid control assessed by a self-reported questionnaire in high-risk patients taking pravastatin for primary prevention of coronary artery disease. High-risk patients taking pravastatin were followed for 2 years. Questionnaire surveys comprising 21 questions, including 18 questions concerning awareness of health, and current status of diet, exercise, and drug therapy, were conducted at baseline and after 1 year. Potential domains were established by factor analysis from the results of questionnaires, and adherence scores were calculated in each domain. The relationship between adherence scores and lipid values during the 1-year treatment period was analyzed by each domain using multiple regression analysis. A total of 5,792 patients taking pravastatin were included in the analysis. Multiple regression analysis showed a significant correlation in terms of "Intake of high fat/cholesterol/sugar foods" (regression coefficient -0.58, p=0.0105) and "Adherence to instructions for drug therapy" (regression coefficient -6.61, p<0.0001). Low-density lipoprotein cholesterol (LDL-C) values were significantly lower in patients who had an increase in the adherence score in the "Awareness of health" domain compared with those with a decreased score. There was a significant correlation between high-density lipoprotein (HDL-C) values and "Awareness of health" (regression coefficient 0.26; p= 0.0037), "Preferable dietary behaviors" (regression coefficient 0.75; p<0.0001), and "Exercise" (regression coefficient 0.73; p= 0.0002). Similar relations were seen with triglycerides. In patients who have a high awareness of their health, a positive attitude toward lipid-lowering treatment including diet, exercise, and high adherence to drug therapy, is related with favorable overall lipid control even in patients under treatment with pravastatin.

  1. Regression analysis of current-status data: an application to breast-feeding.

    PubMed

    Grummer-strawn, L M

    1993-09-01

    "Although techniques for calculating mean survival time from current-status data are well known, their use in multiple regression models is somewhat troublesome. Using data on current breast-feeding behavior, this article considers a number of techniques that have been suggested in the literature, including parametric, nonparametric, and semiparametric models as well as the application of standard schedules. Models are tested in both proportional-odds and proportional-hazards frameworks....I fit [the] models to current status data on breast-feeding from the Demographic and Health Survey (DHS) in six countries: two African (Mali and Ondo State, Nigeria), two Asian (Indonesia and Sri Lanka), and two Latin American (Colombia and Peru)." excerpt

  2. Upper Gastrointestinal Complications and Cardiovascular/Gastrointestinal Risk Calculator in Patients with Myocardial Infarction Treated with Aspirin.

    PubMed

    Wen, Lei

    2017-08-20

    Aspirin is widely used for the prevention of cardiovascular and cerebrovascular diseases for the past few years. However, much attention has been paid to the adverse effects associated with aspirin such as gastrointestinal bleeding. How to weigh the benefits and hazards? The current study aimed to assess the feasibility of a cardiovascular/gastrointestinal risk calculator, AsaRiskCalculator, in predicting gastrointestinal events in Chinese patients with myocardial infarction (MI), determining unique risk factor(s) for gastrointestinal events to be considered in the calculator. The MI patients who visited Shapingba District People's Hospital between January 2012 and January 2016 were retrospectively reviewed. Based on gastroscopic data, the patients were divided into two groups: gastrointestinal and nongastrointestinal groups. Demographic and clinical data of the patients were then retrieved for statistical analysis. Univariate and multiple logistic regression analyses were used to identify independent risk factors for gastrointestinal events. The receiver operating characteristic (ROC) curves were used to assess the predictive value of AsaRiskCalculator for gastrointestinal events. A total of 400 MI patients meeting the eligibility criteria were analyzed, including 94 and 306 in the gastrointestinal and nongastrointestinal groups, respectively. The data showed that age, male gender, predicted gastrointestinal events, and Helicobacter pylori (HP) infection were positively correlated with gastrointestinal events. In multiple logistic regression analysis, predicted gastrointestinal events and HP infection were identified as risk factors for actual gastrointestinal events. HP infection was highly predictive in Chinese patients; the ROC curve indicated an area under the curve of 0.822 (95% confidence interval: 0.774-0.870). The best diagnostic cutoff point of predicted gastrointestinal events was 68.0‰, yielding sensitivity and specificity of 60.6% and 93.1%, respectively, for predicting gastrointestinal events in Chinese patients with MI. AsaRiskCalculator had a predictive value for gastrointestinal events in Chinese patients with MI. HP infection seemed to be an independent risk factor for gastrointestinal events caused by long-term aspirin treatment in Chinese patients with MI, and it should be included in the risk calculator adapted for Chinese patients.

  3. Quantifying Parkinson's disease finger-tapping severity by extracting and synthesizing finger motion properties.

    PubMed

    Sano, Yuko; Kandori, Akihiko; Shima, Keisuke; Yamaguchi, Yuki; Tsuji, Toshio; Noda, Masafumi; Higashikawa, Fumiko; Yokoe, Masaru; Sakoda, Saburo

    2016-06-01

    We propose a novel index of Parkinson's disease (PD) finger-tapping severity, called "PDFTsi," for quantifying the severity of symptoms related to the finger tapping of PD patients with high accuracy. To validate the efficacy of PDFTsi, the finger-tapping movements of normal controls and PD patients were measured by using magnetic sensors, and 21 characteristics were extracted from the finger-tapping waveforms. To distinguish motor deterioration due to PD from that due to aging, the aging effect on finger tapping was removed from these characteristics. Principal component analysis (PCA) was applied to the age-normalized characteristics, and principal components that represented the motion properties of finger tapping were calculated. Multiple linear regression (MLR) with stepwise variable selection was applied to the principal components, and PDFTsi was calculated. The calculated PDFTsi indicates that PDFTsi has a high estimation ability, namely a mean square error of 0.45. The estimation ability of PDFTsi is higher than that of the alternative method, MLR with stepwise regression selection without PCA, namely a mean square error of 1.30. This result suggests that PDFTsi can quantify PD finger-tapping severity accurately. Furthermore, the result of interpreting a model for calculating PDFTsi indicated that motion wideness and rhythm disorder are important for estimating PD finger-tapping severity.

  4. A Technique of Fuzzy C-Mean in Multiple Linear Regression Model toward Paddy Yield

    NASA Astrophysics Data System (ADS)

    Syazwan Wahab, Nur; Saifullah Rusiman, Mohd; Mohamad, Mahathir; Amira Azmi, Nur; Che Him, Norziha; Ghazali Kamardan, M.; Ali, Maselan

    2018-04-01

    In this paper, we propose a hybrid model which is a combination of multiple linear regression model and fuzzy c-means method. This research involved a relationship between 20 variates of the top soil that are analyzed prior to planting of paddy yields at standard fertilizer rates. Data used were from the multi-location trials for rice carried out by MARDI at major paddy granary in Peninsular Malaysia during the period from 2009 to 2012. Missing observations were estimated using mean estimation techniques. The data were analyzed using multiple linear regression model and a combination of multiple linear regression model and fuzzy c-means method. Analysis of normality and multicollinearity indicate that the data is normally scattered without multicollinearity among independent variables. Analysis of fuzzy c-means cluster the yield of paddy into two clusters before the multiple linear regression model can be used. The comparison between two method indicate that the hybrid of multiple linear regression model and fuzzy c-means method outperform the multiple linear regression model with lower value of mean square error.

  5. Liquid electrolyte informatics using an exhaustive search with linear regression.

    PubMed

    Sodeyama, Keitaro; Igarashi, Yasuhiko; Nakayama, Tomofumi; Tateyama, Yoshitaka; Okada, Masato

    2018-06-14

    Exploring new liquid electrolyte materials is a fundamental target for developing new high-performance lithium-ion batteries. In contrast to solid materials, disordered liquid solution properties have been less studied by data-driven information techniques. Here, we examined the estimation accuracy and efficiency of three information techniques, multiple linear regression (MLR), least absolute shrinkage and selection operator (LASSO), and exhaustive search with linear regression (ES-LiR), by using coordination energy and melting point as test liquid properties. We then confirmed that ES-LiR gives the most accurate estimation among the techniques. We also found that ES-LiR can provide the relationship between the "prediction accuracy" and "calculation cost" of the properties via a weight diagram of descriptors. This technique makes it possible to choose the balance of the "accuracy" and "cost" when the search of a huge amount of new materials was carried out.

  6. Age, Body Mass Index, and Frequency of Sexual Activity are Independent Predictors of Testosterone Deficiency in Men With Erectile Dysfunction.

    PubMed

    Pagano, Matthew J; De Fazio, Adam; Levy, Alison; RoyChoudhury, Arindam; Stahl, Peter J

    2016-04-01

    To identify clinical predictors of testosterone deficiency (TD) in men with erectile dysfunction (ED), thereby identifying subgroups that are most likely to benefit from targeted testosterone screening. Retrospective review was conducted on 498 men evaluated for ED between January 2013 and July 2014. Testing for TD by early morning serum measurement was offered to all eligible men. Patients with history of prostate cancer or testosterone replacement were excluded. Univariable linear regression was conducted to analyze 19 clinical variables for associations with serum total testosterone (TT), calculated free testosterone (cFT), and TD (T <300 ng/dL or cFT <6.5 ng/dL). Variables significant on univariable analysis were included in multiple regression models. A total of 225 men met inclusion criteria. Lower TT levels were associated with greater body mass index (BMI), less frequent sexual activity, and absence of clinical depression on multiple regression analysis. TT decreased by 49.5 ng/dL for each 5-point increase in BMI. BMI and age were the only independent predictors of cFT levels on multivariable analysis. Overall, 62 subjects (27.6%) met criteria for TD. Older age, greater BMI, and less frequent sexual activity were the only independent predictors of TD on multiple regression. We observed a 2.2-fold increase in the odds of TD for every 5-point increase in BMI, and a 1.8-fold increase for every 10 year increase in age. Men with ED and elevated BMI, advanced age, or infrequent sexual activity appear to be at high risk of TD, and such patients represent excellent potential candidates for targeted testosterone screening. Copyright © 2016 Elsevier Inc. All rights reserved.

  7. A difference in systolic blood pressure between arms is a novel predictor of the development and progression of diabetic nephropathy in patients with type 2 diabetes.

    PubMed

    Okada, Hiroshi; Fukui, Michiaki; Tanaka, Muhei; Matsumoto, Shinobu; Iwase, Hiroya; Kobayashi, Kanae; Asano, Mai; Yamazaki, Masahiro; Hasegawa, Goji; Nakamura, Naoto

    2013-10-01

    Recent studies have suggested that a difference in systolic blood pressure (SBP) between arms is associated with both vascular disease and mortality. The aim of this study was to investigate the relationship between a difference in SBP between arms and change in urinary albumin excretion or development of albuminuria in patients with type 2 diabetes. We measured SBP in 408 consecutive patients with type 2 diabetes, and calculated a difference in SBP between arms. We performed follow-up study to assess change in urinary albumin excretion or development of albuminuria, mean interval of which was 4.6 ± 1.7 years. We then evaluated the relationship of a difference in SBP between arms to diabetic nephropathy using multiple regression analysis and multiple Cox regression model. Multiple regression analyses demonstrated that a difference in SBP between arms was independently associated with change in urinary albumin excretion (β = 0.1869, P = 0.0010). Adjusted Cox regression analyses demonstrated that a difference in SBP between arms was associated with an increased hazard of development of albuminuria; hazard ratio was 1.215 (95% confidence interval 1.077-1.376). Moreover, the risk of development of albuminuria was increased in patients with a difference in SBP of equal to or more than 10 mmHg between arms; hazard ratio was 4.168 (95% confidence interval 1.478-11.70). A difference in SBP between arms could be a novel predictor of the development and progression of diabetic nephropathy in patients with type 2 diabetes. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.

  8. Trends in Mortality After Primary Cytoreductive Surgery for Ovarian Cancer: A Systematic Review and Metaregression of Randomized Clinical Trials and Observational Studies.

    PubMed

    Di Donato, Violante; Kontopantelis, Evangelos; Aletti, Giovanni; Casorelli, Assunta; Piacenti, Ilaria; Bogani, Giorgio; Lecce, Francesca; Benedetti Panici, Pierluigi

    2017-06-01

    Primary cytoreductive surgery (PDS) followed by platinum-based chemotherapy is the cornerstone of treatment and the absence of residual tumor after PDS is universally considered the most important prognostic factor. The aim of the present analysis was to evaluate trend and predictors of 30-day mortality in patients undergoing primary cytoreduction for ovarian cancer. Literature was searched for records reporting 30-day mortality after PDS. All cohorts were rated for quality. Simple and multiple Poisson regression models were used to quantify the association between 30-day mortality and the following: overall or severe complications, proportion of patients with stage IV disease, median age, year of publication, and weighted surgical complexity index. Using the multiple regression model, we calculated the risk of perioperative mortality at different levels for statistically significant covariates of interest. Simple regression identified median age and proportion of patients with stage IV disease as statistically significant predictors of 30-day mortality. When included in the multiple Poisson regression model, both remained statistically significant, with an incidence rate ratio of 1.087 for median age and 1.017 for stage IV disease. Disease stage was a strong predictor, with the risk estimated to increase from 2.8% (95% confidence interval 2.02-3.66) for stage III to 16.1% (95% confidence interval 6.18-25.93) for stage IV, for a cohort with a median age of 65 years. Metaregression demonstrated that increased age and advanced clinical stage were independently associated with an increased risk of mortality, and the combined effects of both factors greatly increased the risk.

  9. [Co-author and keyword networks and their clustering appearance in preventive medicine fields in Korea: analysis of papers in the Journal of Preventive Medicine and Public Health, 1991~2006].

    PubMed

    Jung, Minsoo; Chung, Dongjun

    2008-01-01

    This study evaluated knowledge structure and its effect factor by analysis of co-author and keyword networks in Korea's preventive medicine sector. The data was extracted from 873 papers listed in the Journal of Preventive Medicine and Public Health, and was transformed into a co-author and keyword matrix where the existence of a 'link' was judged by impact factors calculated by the weight value of the role and rate of author participation. Research achievement was dependent upon the author's status and networking index, as analyzed by neighborhood degree, multidimensional scaling, correspondence analysis, and multiple regression. Co-author networks developed as randomness network in the center of a few high-productivity researchers. In particular, closeness centrality was more developed than degree centrality. Also, power law distribution was discovered in impact factor and research productivity by college affiliation. In multiple regression, the effect of the author's role was significant in both the impact factor calculated by the participatory rate and the number of listed articles. However, the number of listed articles varied by sex. This study shows that the small world phenomenon exists in co-author and keyword networks in a journal, as in citation networks. However, the differentiation of knowledge structure in the field of preventive medicine was relatively restricted by specialization.

  10. The role of chemometrics in single and sequential extraction assays: a review. Part II. Cluster analysis, multiple linear regression, mixture resolution, experimental design and other techniques.

    PubMed

    Giacomino, Agnese; Abollino, Ornella; Malandrino, Mery; Mentasti, Edoardo

    2011-03-04

    Single and sequential extraction procedures are used for studying element mobility and availability in solid matrices, like soils, sediments, sludge, and airborne particulate matter. In the first part of this review we reported an overview on these procedures and described the applications of chemometric uni- and bivariate techniques and of multivariate pattern recognition techniques based on variable reduction to the experimental results obtained. The second part of the review deals with the use of chemometrics not only for the visualization and interpretation of data, but also for the investigation of the effects of experimental conditions on the response, the optimization of their values and the calculation of element fractionation. We will describe the principles of the multivariate chemometric techniques considered, the aims for which they were applied and the key findings obtained. The following topics will be critically addressed: pattern recognition by cluster analysis (CA), linear discriminant analysis (LDA) and other less common techniques; modelling by multiple linear regression (MLR); investigation of spatial distribution of variables by geostatistics; calculation of fractionation patterns by a mixture resolution method (Chemometric Identification of Substrates and Element Distributions, CISED); optimization and characterization of extraction procedures by experimental design; other multivariate techniques less commonly applied. Copyright © 2010 Elsevier B.V. All rights reserved.

  11. Age-, sex-, and education-specific norms for an extended CERAD Neuropsychological Assessment Battery-Results from the population-based LIFE-Adult-Study.

    PubMed

    Luck, Tobias; Pabst, Alexander; Rodriguez, Francisca S; Schroeter, Matthias L; Witte, Veronica; Hinz, Andreas; Mehnert, Anja; Engel, Christoph; Loeffler, Markus; Thiery, Joachim; Villringer, Arno; Riedel-Heller, Steffi G

    2018-05-01

    To provide new age-, sex-, and education-specific reference values for an extended version of the well-established Consortium to Establish a Registry for Alzheimer's Disease Neuropsychological Assessment Battery (CERAD-NAB) that additionally includes the Trail Making Test and the Verbal Fluency Test-S-Words. Norms were calculated based on the cognitive performances of n = 1,888 dementia-free participants (60-79 years) from the population-based German LIFE-Adult-Study. Multiple regressions were used to examine the association of the CERAD-NAB scores with age, sex, and education. In order to calculate the norms, quantile and censored quantile regression analyses were performed estimating marginal means of the test scores at 2.28, 6.68, 10, 15.87, 25, 50, 75, and 90 percentiles for age-, sex-, and education-specific subgroups. Multiple regression analyses revealed that younger age was significantly associated with better cognitive performance in 15 CERAD-NAB measures and higher education with better cognitive performance in all 17 measures. Women performed significantly better than men in 12 measures and men than women in four measures. The determined norms indicate ceiling effects for the cognitive performances in the Boston Naming, Word List Recognition, Constructional Praxis Copying, and Constructional Praxis Recall tests. The new norms for the extended CERAD-NAB will be useful for evaluating dementia-free German-speaking adults in a broad variety of relevant cognitive domains. The extended CERAD-NAB follows more closely the criteria for the new DSM-5 Mild and Major Neurocognitive Disorder. Additionally, it could be further developed to include a test for social cognition. (PsycINFO Database Record (c) 2018 APA, all rights reserved).

  12. Deriving the Intrahepatic Arteriovenous Shunt Rate from CT Images and Biochemical Data Instead of from Arterial Perfusion Scintigraphy in Hepatic Arterial Infusion Chemotherapy

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Ozaki, Toshiro, E-mail: ganronbun@amail.plala.or.jp; Seki, Hiroshi; Shiina, Makoto

    2009-09-15

    The purpose of the present study was to elucidate a method for predicting the intrahepatic arteriovenous shunt rate from computed tomography (CT) images and biochemical data, instead of from arterial perfusion scintigraphy, because adverse exacerbated systemic effects may be induced in cases where a high shunt rate exists. CT and arterial perfusion scintigraphy were performed in patients with liver metastases from gastric or colorectal cancer. Biochemical data and tumor marker levels of 33 enrolled patients were measured. The results were statistically verified by multiple regression analysis. The total metastatic hepatic tumor volume (V{sub metastasized}), residual hepatic parenchyma volume (V{sub residual};more » calculated from CT images), and biochemical data were treated as independent variables; the intrahepatic arteriovenous (IHAV) shunt rate (calculated from scintigraphy) was treated as a dependent variable. The IHAV shunt rate was 15.1 {+-} 11.9%. Based on the correlation matrixes, the best correlation coefficient of 0.84 was established between the IHAV shunt rate and V{sub metastasized} (p < 0.01). In the multiple regression analysis with the IHAV shunt rate as the dependent variable, the coefficient of determination (R{sup 2}) was 0.75, which was significant at the 0.1% level with two significant independent variables (V{sub metastasized} and V{sub residual}). The standardized regression coefficients ({beta}) of V{sub metastasized} and V{sub residual} were significant at the 0.1 and 5% levels, respectively. Based on this result, we can obtain a predicted value of IHAV shunt rate (p < 0.001) using CT images. When a high shunt rate was predicted, beneficial and consistent clinical monitoring can be initiated in, for example, hepatic arterial infusion chemotherapy.« less

  13. Isovolumic relaxation period as an index of left ventricular relaxation under different afterload conditions--comparison with the time constant of left ventricular pressure decay in the dog.

    PubMed

    Ochi, H; Ikuma, I; Toda, H; Shimada, T; Morioka, S; Moriyama, K

    1989-12-01

    In order to determine whether isovolumic relaxation period (IRP) reflects left ventricular relaxation under different afterload conditions, 17 anesthetized, open chest dogs were studied, and the left ventricular pressure decay time constant (T) was calculated. In 12 dogs, angiotensin II and nitroprusside were administered, with the heart rate constant at 90 beats/min. Multiple linear regression analysis showed that the aortic dicrotic notch pressure (AoDNP) and T were major determinants of IRP, while left ventricular end-diastolic pressure was a minor determinant. Multiple linear regression analysis, correlating T with IRP and AoDNP, did not further improve the correlation coefficient compared with that between T and IRP. We concluded that correction of the IRP by AoDNP is not necessary to predict T from additional multiple linear regression. The effects of ascending aortic constriction or angiotensin II on IRP were examined in five dogs, after pretreatment with propranolol. Aortic constriction caused a significant decrease in IRP and T, while angiotensin II produced a significant increase in IRP and T. IRP was affected by the change of afterload. However, the IRP and T values were always altered in the same direction. These results demonstrate that IRP is substituted for T and it reflects left ventricular relaxation even in different afterload conditions. We conclude that IRP is a simple parameter easily used to evaluate left ventricular relaxation in clinical situations.

  14. UCODE, a computer code for universal inverse modeling

    USGS Publications Warehouse

    Poeter, E.P.; Hill, M.C.

    1999-01-01

    This article presents the US Geological Survey computer program UCODE, which was developed in collaboration with the US Army Corps of Engineers Waterways Experiment Station and the International Ground Water Modeling Center of the Colorado School of Mines. UCODE performs inverse modeling, posed as a parameter-estimation problem, using nonlinear regression. Any application model or set of models can be used; the only requirement is that they have numerical (ASCII or text only) input and output files and that the numbers in these files have sufficient significant digits. Application models can include preprocessors and postprocessors as well as models related to the processes of interest (physical, chemical and so on), making UCODE extremely powerful for model calibration. Estimated parameters can be defined flexibly with user-specified functions. Observations to be matched in the regression can be any quantity for which a simulated equivalent value can be produced, thus simulated equivalent values are calculated using values that appear in the application model output files and can be manipulated with additive and multiplicative functions, if necessary. Prior, or direct, information on estimated parameters also can be included in the regression. The nonlinear regression problem is solved by minimizing a weighted least-squares objective function with respect to the parameter values using a modified Gauss-Newton method. Sensitivities needed for the method are calculated approximately by forward or central differences and problems and solutions related to this approximation are discussed. Statistics are calculated and printed for use in (1) diagnosing inadequate data or identifying parameters that probably cannot be estimated with the available data, (2) evaluating estimated parameter values, (3) evaluating the model representation of the actual processes and (4) quantifying the uncertainty of model simulated values. UCODE is intended for use on any computer operating system: it consists of algorithms programmed in perl, a freeware language designed for text manipulation and Fortran90, which efficiently performs numerical calculations.

  15. Multivariate research in areas of phosphorus cast-iron brake shoes manufacturing using the statistical analysis and the multiple regression equations

    NASA Astrophysics Data System (ADS)

    Kiss, I.; Cioată, V. G.; Alexa, V.; Raţiu, S. A.

    2017-05-01

    The braking system is one of the most important and complex subsystems of railway vehicles, especially when it comes for safety. Therefore, installing efficient safe brakes on the modern railway vehicles is essential. Nowadays is devoted attention to solving problems connected with using high performance brake materials and its impact on thermal and mechanical loading of railway wheels. The main factor that influences the selection of a friction material for railway applications is the performance criterion, due to the interaction between the brake block and the wheel produce complex thermos-mechanical phenomena. In this work, the investigated subjects are the cast-iron brake shoes, which are still widely used on freight wagons. Therefore, the cast-iron brake shoes - with lamellar graphite and with a high content of phosphorus (0.8-1.1%) - need a special investigation. In order to establish the optimal condition for the cast-iron brake shoes we proposed a mathematical modelling study by using the statistical analysis and multiple regression equations. Multivariate research is important in areas of cast-iron brake shoes manufacturing, because many variables interact with each other simultaneously. Multivariate visualization comes to the fore when researchers have difficulties in comprehending many dimensions at one time. Technological data (hardness and chemical composition) obtained from cast-iron brake shoes were used for this purpose. In order to settle the multiple correlation between the hardness of the cast-iron brake shoes, and the chemical compositions elements several model of regression equation types has been proposed. Because a three-dimensional surface with variables on three axes is a common way to illustrate multivariate data, in which the maximum and minimum values are easily highlighted, we plotted graphical representation of the regression equations in order to explain interaction of the variables and locate the optimal level of each variable for maximal response. For the calculation of the regression coefficients, dispersion and correlation coefficients, the software Matlab was used.

  16. NIMEFI: Gene Regulatory Network Inference using Multiple Ensemble Feature Importance Algorithms

    PubMed Central

    Ruyssinck, Joeri; Huynh-Thu, Vân Anh; Geurts, Pierre; Dhaene, Tom; Demeester, Piet; Saeys, Yvan

    2014-01-01

    One of the long-standing open challenges in computational systems biology is the topology inference of gene regulatory networks from high-throughput omics data. Recently, two community-wide efforts, DREAM4 and DREAM5, have been established to benchmark network inference techniques using gene expression measurements. In these challenges the overall top performer was the GENIE3 algorithm. This method decomposes the network inference task into separate regression problems for each gene in the network in which the expression values of a particular target gene are predicted using all other genes as possible predictors. Next, using tree-based ensemble methods, an importance measure for each predictor gene is calculated with respect to the target gene and a high feature importance is considered as putative evidence of a regulatory link existing between both genes. The contribution of this work is twofold. First, we generalize the regression decomposition strategy of GENIE3 to other feature importance methods. We compare the performance of support vector regression, the elastic net, random forest regression, symbolic regression and their ensemble variants in this setting to the original GENIE3 algorithm. To create the ensemble variants, we propose a subsampling approach which allows us to cast any feature selection algorithm that produces a feature ranking into an ensemble feature importance algorithm. We demonstrate that the ensemble setting is key to the network inference task, as only ensemble variants achieve top performance. As second contribution, we explore the effect of using rankwise averaged predictions of multiple ensemble algorithms as opposed to only one. We name this approach NIMEFI (Network Inference using Multiple Ensemble Feature Importance algorithms) and show that this approach outperforms all individual methods in general, although on a specific network a single method can perform better. An implementation of NIMEFI has been made publicly available. PMID:24667482

  17. Quantifying components of the hydrologic cycle in Virginia using chemical hydrograph separation and multiple regression analysis

    USGS Publications Warehouse

    Sanford, Ward E.; Nelms, David L.; Pope, Jason P.; Selnick, David L.

    2012-01-01

    This study by the U.S. Geological Survey, prepared in cooperation with the Virginia Department of Environmental Quality, quantifies the components of the hydrologic cycle across the Commonwealth of Virginia. Long-term, mean fluxes were calculated for precipitation, surface runoff, infiltration, total evapotranspiration (ET), riparian ET, recharge, base flow (or groundwater discharge) and net total outflow. Fluxes of these components were first estimated on a number of real-time-gaged watersheds across Virginia. Specific conductance was used to distinguish and separate surface runoff from base flow. Specific-conductance data were collected every 15 minutes at 75 real-time gages for approximately 18 months between March 2007 and August 2008. Precipitation was estimated for 1971–2000 using PRISM climate data. Precipitation and temperature from the PRISM data were used to develop a regression-based relation to estimate total ET. The proportion of watershed precipitation that becomes surface runoff was related to physiographic province and rock type in a runoff regression equation. Component flux estimates from the watersheds were transferred to flux estimates for counties and independent cities using the ET and runoff regression equations. Only 48 of the 75 watersheds yielded sufficient data, and data from these 48 were used in the final runoff regression equation. The base-flow proportion for the 48 watersheds averaged 72 percent using specific conductance, a value that was substantially higher than the 61 percent average calculated using a graphical-separation technique (the USGS program PART). Final results for the study are presented as component flux estimates for all counties and independent cities in Virginia.

  18. Geochemistry of some rare earth elements in groundwater, Vierlingsbeek, The Netherlands.

    PubMed

    Janssen, René P T; Verweij, Wilko

    2003-03-01

    Groundwater samples were taken from seven bore holes at depths ranging from 2 to 41m nearby drinking water pumping station Vierlingsbeek, The Netherlands and analysed for Y, La, Ce, Pr, Nd, Sm and Eu. Shale-normalized patterns were generally flat and showed that the observed rare earth elements (REE) were probably of natural origin. In the shallow groundwaters the REEs were light REE (LREE) enriched, probably caused by binding of LREEs to colloids. To improve understanding of the behaviour of the REE, two approaches were used: calculations of the speciation and a statistical approach. For the speciation calculations, complexation and precipitation reactions including inorganic and dissolved organic carbon (DOC) compounds, were taken into account. The REE speciation showed REE(3+), REE(SO(4))(+), REE(CO(3))(+) and REE(DOC) being the major species. Dissolution of pure REE precipitates and REE-enriched solid phases did not account for the observed REEs in groundwater. Regulation of REE concentrations by adsorption-desorption processes to Fe(III)(OH)(3) and Al(OH)(3) minerals, which were calculated to be present in nearly all groundwaters, is a probable explanation. The statistical approach (multiple linear regression) showed that pH is by far the most significant groundwater characteristic which contributes to the variation in REE concentrations. Also DOC, SO(4), Fe and Al contributed significantly, although to a much lesser extent, to the variation in REE concentrations. This is in line with the calculated REE-species in solution and REE-adsorption to iron and aluminium (hydr)oxides. Regression equations including only pH, were derived to predict REE concentrations in groundwater. External validation showed that these regression equations were reasonably successful to predict REE concentrations of groundwater of another drinking water pumping station in quite different region of The Netherlands.

  19. Multiple regression equations modelling of groundwater of Ajmer-Pushkar railway line region, Rajasthan (India).

    PubMed

    Mathur, Praveen; Sharma, Sarita; Soni, Bhupendra

    2010-01-01

    In the present work, an attempt is made to formulate multiple regression equations using all possible regressions method for groundwater quality assessment of Ajmer-Pushkar railway line region in pre- and post-monsoon seasons. Correlation studies revealed the existence of linear relationships (r 0.7) for electrical conductivity (EC), total hardness (TH) and total dissolved solids (TDS) with other water quality parameters. The highest correlation was found between EC and TDS (r = 0.973). EC showed highly significant positive correlation with Na, K, Cl, TDS and total solids (TS). TH showed highest correlation with Ca and Mg. TDS showed significant correlation with Na, K, SO4, PO4 and Cl. The study indicated that most of the contamination present was water soluble or ionic in nature. Mg was present as MgCl2; K mainly as KCl and K2SO4, and Na was present as the salts of Cl, SO4 and PO4. On the other hand, F and NO3 showed no significant correlations. The r2 values and F values (at 95% confidence limit, alpha = 0.05) for the modelled equations indicated high degree of linearity among independent and dependent variables. Also the error % between calculated and experimental values was contained within +/- 15% limit.

  20. Logistic and Multiple Regression: A Two-Pronged Approach to Accurately Estimate Cost Growth in Major DoD Weapon Systems

    DTIC Science & Technology

    2004-03-01

    Breusch - Pagan test for constant variance of the residuals. Using Microsoft Excel® we calculate a p-value of 0.841237. This high p-value, which is above...our alpha of 0.05, indicates that our residuals indeed pass the Breusch - Pagan test for constant variance. In addition to the assumption tests , we...Wilk Test for Normality – Support (Reduced) Model (OLS) Finally, we perform a Breusch - Pagan test for constant variance of the residuals. Using

  1. Multiple Correlation versus Multiple Regression.

    ERIC Educational Resources Information Center

    Huberty, Carl J.

    2003-01-01

    Describes differences between multiple correlation analysis (MCA) and multiple regression analysis (MRA), showing how these approaches involve different research questions and study designs, different inferential approaches, different analysis strategies, and different reported information. (SLD)

  2. The Detection and Interpretation of Interaction Effects between Continuous Variables in Multiple Regression.

    ERIC Educational Resources Information Center

    Jaccard, James; And Others

    1990-01-01

    Issues in the detection and interpretation of interaction effects between quantitative variables in multiple regression analysis are discussed. Recent discussions associated with problems of multicollinearity are reviewed in the context of the conditional nature of multiple regression with product terms. (TJH)

  3. Beyond Multiple Regression: Using Commonality Analysis to Better Understand R[superscript 2] Results

    ERIC Educational Resources Information Center

    Warne, Russell T.

    2011-01-01

    Multiple regression is one of the most common statistical methods used in quantitative educational research. Despite the versatility and easy interpretability of multiple regression, it has some shortcomings in the detection of suppressor variables and for somewhat arbitrarily assigning values to the structure coefficients of correlated…

  4. A CNN Regression Approach for Real-Time 2D/3D Registration.

    PubMed

    Shun Miao; Wang, Z Jane; Rui Liao

    2016-05-01

    In this paper, we present a Convolutional Neural Network (CNN) regression approach to address the two major limitations of existing intensity-based 2-D/3-D registration technology: 1) slow computation and 2) small capture range. Different from optimization-based methods, which iteratively optimize the transformation parameters over a scalar-valued metric function representing the quality of the registration, the proposed method exploits the information embedded in the appearances of the digitally reconstructed radiograph and X-ray images, and employs CNN regressors to directly estimate the transformation parameters. An automatic feature extraction step is introduced to calculate 3-D pose-indexed features that are sensitive to the variables to be regressed while robust to other factors. The CNN regressors are then trained for local zones and applied in a hierarchical manner to break down the complex regression task into multiple simpler sub-tasks that can be learned separately. Weight sharing is furthermore employed in the CNN regression model to reduce the memory footprint. The proposed approach has been quantitatively evaluated on 3 potential clinical applications, demonstrating its significant advantage in providing highly accurate real-time 2-D/3-D registration with a significantly enlarged capture range when compared to intensity-based methods.

  5. Associations of blood lead, cadmium, and mercury with estimated glomerular filtration rate in the Korean general population: Analysis of 2008-2010 Korean National Health and Nutrition Examination Survey data

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Kim, Yangho; Lee, Byung-Kook, E-mail: bklee@sch.ac.kr

    Introduction: The objective of this study was to evaluate associations between blood lead, cadmium, and mercury levels with estimated glomerular filtration rate in a general population of South Korean adults. Methods: This was a cross-sectional study based on data obtained in the Korean National Health and Nutrition Examination Survey (KNHANES) (2008-2010). The final analytical sample consisted of 5924 participants. Estimated glomerular filtration rate (eGFR) was calculated using the MDRD Study equation as an indicator of glomerular function. Results: In multiple linear regression analysis of log2-transformed blood lead as a continuous variable on eGFR, after adjusting for covariates including cadmium andmore » mercury, the difference in eGFR levels associated with doubling of blood lead were -2.624 mL/min per 1.73 m Superscript-Two (95% CI: -3.803 to -1.445). In multiple linear regression analysis using quartiles of blood lead as the independent variable, the difference in eGFR levels comparing participants in the highest versus the lowest quartiles of blood lead was -3.835 mL/min per 1.73 m Superscript-Two (95% CI: -5.730 to -1.939). In a multiple linear regression analysis using blood cadmium and mercury, as continuous or categorical variables, as independent variables, neither metal was a significant predictor of eGFR. Odds ratios (ORs) and 95% CI values for reduced eGFR calculated for log2-transformed blood metals and quartiles of the three metals showed similar trends after adjustment for covariates. Discussion: In this large, representative sample of South Korean adults, elevated blood lead level was consistently associated with lower eGFR levels and with the prevalence of reduced eGFR even in blood lead levels below 10 {mu}g/dL. In conclusion, elevated blood lead level was associated with lower eGFR in a Korean general population, supporting the role of lead as a risk factor for chronic kidney disease.« less

  6. [Study of blending method for the extracts of herbal plants].

    PubMed

    Liu, Yongsuo; Cao, Min; Chen, Yuying; Hu, Yuzhu; Wang, Yiming; Luo, Guoan

    2006-03-01

    The irregularity in herbal plant composition is influenced by multiple factors. As for quality control of traditional Chinese medicine, the most critical challenge is to ensure the dosage content uniformity. This content uniformity can be improved by blending different batches of the extracts of herbal plants. Nonlinear least-squares regression was used to calculate the blending coefficient, which means no great absolute differences allowed for all ingredients. For traditional Chinese medicines, even relatively smaller differences could present to be very important for all the ingredients. The auto-scaling pretreatment was used prior to the calculation of the blending coefficients. The pretreatment buffered the characteristics of individual data for the ingredients in different batches, so an improved auto-scaling pretreatment method was proposed. With the improved auto-scaling pretreatment, the relative. differences decreased after blending different batches of extracts of herbal plants according to the reference samples. And the content uniformity control of the specific ingredients could be achieved by the error control coefficient. In the studies for the extracts of fructus gardeniae, the relative differences of all the ingredients is less than 3% after blending different batches of the extracts. The results showed that nonlinear least-squares regression can be used to calculate the blending coefficient of the herbal plant extracts.

  7. Pareto fronts for multiobjective optimization design on materials data

    NASA Astrophysics Data System (ADS)

    Gopakumar, Abhijith; Balachandran, Prasanna; Gubernatis, James E.; Lookman, Turab

    Optimizing multiple properties simultaneously is vital in materials design. Here we apply infor- mation driven, statistical optimization strategies blended with machine learning methods, to address multi-objective optimization tasks on materials data. These strategies aim to find the Pareto front consisting of non-dominated data points from a set of candidate compounds with known character- istics. The objective is to find the pareto front in as few additional measurements or calculations as possible. We show how exploration of the data space to find the front is achieved by using uncer- tainties in predictions from regression models. We test our proposed design strategies on multiple, independent data sets including those from computations as well as experiments. These include data sets for Max phases, piezoelectrics and multicomponent alloys.

  8. False Positives in Multiple Regression: Unanticipated Consequences of Measurement Error in the Predictor Variables

    ERIC Educational Resources Information Center

    Shear, Benjamin R.; Zumbo, Bruno D.

    2013-01-01

    Type I error rates in multiple regression, and hence the chance for false positive research findings, can be drastically inflated when multiple regression models are used to analyze data that contain random measurement error. This article shows the potential for inflated Type I error rates in commonly encountered scenarios and provides new…

  9. Using Robust Standard Errors to Combine Multiple Regression Estimates with Meta-Analysis

    ERIC Educational Resources Information Center

    Williams, Ryan T.

    2012-01-01

    Combining multiple regression estimates with meta-analysis has continued to be a difficult task. A variety of methods have been proposed and used to combine multiple regression slope estimates with meta-analysis, however, most of these methods have serious methodological and practical limitations. The purpose of this study was to explore the use…

  10. Use of Multiple Regression and Use-Availability Analyses in Determining Habitat Selection by Gray Squirrels (Sciurus Carolinensis)

    Treesearch

    John W. Edwards; Susan C. Loeb; David C. Guynn

    1994-01-01

    Multiple regression and use-availability analyses are two methods for examining habitat selection. Use-availability analysis is commonly used to evaluate macrohabitat selection whereas multiple regression analysis can be used to determine microhabitat selection. We compared these techniques using behavioral observations (n = 5534) and telemetry locations (n = 2089) of...

  11. Building Regression Models: The Importance of Graphics.

    ERIC Educational Resources Information Center

    Dunn, Richard

    1989-01-01

    Points out reasons for using graphical methods to teach simple and multiple regression analysis. Argues that a graphically oriented approach has considerable pedagogic advantages in the exposition of simple and multiple regression. Shows that graphical methods may play a central role in the process of building regression models. (Author/LS)

  12. Testing Different Model Building Procedures Using Multiple Regression.

    ERIC Educational Resources Information Center

    Thayer, Jerome D.

    The stepwise regression method of selecting predictors for computer assisted multiple regression analysis was compared with forward, backward, and best subsets regression, using 16 data sets. The results indicated the stepwise method was preferred because of its practical nature, when the models chosen by different selection methods were similar…

  13. Decreasing Multicollinearity: A Method for Models with Multiplicative Functions.

    ERIC Educational Resources Information Center

    Smith, Kent W.; Sasaki, M. S.

    1979-01-01

    A method is proposed for overcoming the problem of multicollinearity in multiple regression equations where multiplicative independent terms are entered. The method is not a ridge regression solution. (JKS)

  14. Multiple-Instance Regression with Structured Data

    NASA Technical Reports Server (NTRS)

    Wagstaff, Kiri L.; Lane, Terran; Roper, Alex

    2008-01-01

    We present a multiple-instance regression algorithm that models internal bag structure to identify the items most relevant to the bag labels. Multiple-instance regression (MIR) operates on a set of bags with real-valued labels, each containing a set of unlabeled items, in which the relevance of each item to its bag label is unknown. The goal is to predict the labels of new bags from their contents. Unlike previous MIR methods, MI-ClusterRegress can operate on bags that are structured in that they contain items drawn from a number of distinct (but unknown) distributions. MI-ClusterRegress simultaneously learns a model of the bag's internal structure, the relevance of each item, and a regression model that accurately predicts labels for new bags. We evaluated this approach on the challenging MIR problem of crop yield prediction from remote sensing data. MI-ClusterRegress provided predictions that were more accurate than those obtained with non-multiple-instance approaches or MIR methods that do not model the bag structure.

  15. Prediction of erodibility in Oxisols using iron oxides, soil color and diffuse reflectance spectroscopy

    NASA Astrophysics Data System (ADS)

    Arantes Camargo, Livia; Marques, José, Jr.

    2015-04-01

    The prediction of erodibility using indirect methods such as diffuse reflectance spectroscopy could facilitate the characterization of the spatial variability in large areas and optimize implementation of conservation practices. The aim of this study was to evaluate the prediction of interrill erodibility (Ki) and rill erodibility (Kr) by means of iron oxides content and soil color using multiple linear regression and diffuse reflectance spectroscopy (DRS) using regression analysis by least squares partial (PLSR). The soils were collected from three geomorphic surfaces and analyzed for chemical, physical and mineralogical properties, plus scanned in the spectral range from the visible and infrared. Maps of spatial distribution of Ki and Kr were built with the values calculated by the calibrated models that obtained the best accuracy using geostatistics. Interrill-rill erodibility presented negative correlation with iron extracted by dithionite-citrate-bicarbonate, hematite, and chroma, confirming the influence of iron oxides in soil structural stability. Hematite and hue were the attributes that most contributed in calibration models by multiple linear regression for the prediction of Ki (R2 = 0.55) and Kr (R2 = 0.53). The diffuse reflectance spectroscopy via PLSR allowed to predict Interrill-rill erodibility with high accuracy (R2adj = 0.76, 0.81 respectively and RPD> 2.0) in the range of the visible spectrum (380-800 nm) and the characterization of the spatial variability of these attributes by geostatistics.

  16. DFT study on oxidation of HS(CH2) m SH ( m = 1-8) in oxidative desulfurization

    NASA Astrophysics Data System (ADS)

    Song, Y. Z.; Song, J. J.; Zhao, T. T.; Chen, C. Y.; He, M.; Du, J.

    2016-06-01

    Density functional theory was employed for calculation of HS(CH2) m SH ( m = 1-8) and its derivatives at B3LYP method at 6-31++g ( d, p) level. Using eigenvalues of LUMO and HOMO for HS(CH2) m SH, the standard electrode potentials were estimated by a stepwise multiple regression techniques (MLR), and obtained as E° = 1.500 + 7.167 × 10-3 HOMO-0.229 LUMO with high correlation coefficients of 0.973 and F values of 43.973.

  17. A new method for inferring carbon monoxide concentrations from gas filter radiometer data

    NASA Technical Reports Server (NTRS)

    Wallio, H. A.; Reichle, H. G., Jr.; Casas, J. C.; Gormsen, B. B.

    1981-01-01

    A method for inferring carbon monoxide concentrations from gas filter radiometer data is presented. The technique can closely approximate the results of more costly line-by-line radiative transfer calculations over a wide range of altitudes, ground temperatures, and carbon monoxide concentrations. The technique can also be used over a larger range of conditions than those used for the regression analysis. Because the influence of the carbon monoxide mixing ratio requires only addition, multiplication and a minimum of logic, the method can be implemented on very small computers or microprocessors.

  18. TI-59 Programs for Multiple Regression.

    DTIC Science & Technology

    1980-05-01

    general linear hypothesis model of full rank [ Graybill , 19611 can be written as Y = x 8 + C , s-N(O,o 2I) nxl nxk kxl nxl where Y is the vector of n...a "reduced model " solution, and confidence intervals for linear functions of the coefficients can be obtained using (x’x) and a2, based on the t...O107)l UA.LLL. Library ModuIe NASTER -Puter 0NTINA Cards 1 PROGRAM DESCRIPTION (s s 2 ror the general linear hypothesis model Y - XO + C’ calculates

  19. Factors associated with developing a fear of falling in subjects with primary open-angle glaucoma.

    PubMed

    Adachi, Sayaka; Yuki, Kenya; Awano-Tanabe, Sachiko; Ono, Takeshi; Shiba, Daisuke; Murata, Hiroshi; Asaoka, Ryo; Tsubota, Kazuo

    2018-02-13

    To investigate the relationship between clinical risk factors, including visual field (VF) defects and visual acuity, and a fear of falling, among patients with primary open-angle glaucoma (POAG). All participants answered the following question at a baseline ophthalmic examination: Are you afraid of falling? The same question was then answered every 12 months for 3 years. A binocular integrated visual field was calculated by merging a patient's monocular Humphrey field analyzer VFs, using the 'best sensitivity' method. The means of total deviation values in the whole, superior peripheral, superior central, inferior central, and inferior peripheral VFs were calculated. The relationship between these mean VF measurements, and various clinical factors, against patients' baseline fear of falling and future fear of falling was analyzed using multiple logistic regression. Among 392 POAG subjects, 342 patients (87.2%) responded to the fear of falling question at least twice in the 3 years study period. The optimal regression model for patients' baseline fear of falling included age, gender, mean of total deviation values in the inferior peripheral VF and number of previous falls. The optimal regression equation for future fear of falling included age, gender, mean of total deviation values in the inferior peripheral VF and number of previous falls. Defects in the inferior peripheral VF area are significantly related to the development of a fear of falling.

  20. Which Measurement of Blood Pressure Is More Associated With Albuminuria in Patients With Type 2 Diabetes: Central Blood Pressure or Peripheral Blood Pressure?

    PubMed

    Kitagawa, Noriyuki; Okada, Hiroshi; Tanaka, Muhei; Hashimoto, Yoshitaka; Kimura, Toshihiro; Nakano, Koji; Yamazaki, Masahiro; Hasegawa, Goji; Nakamura, Naoto; Fukui, Michiaki

    2016-08-01

    The aim of this study was to investigate whether central systolic blood pressure (SBP) was associated with albuminuria, defined as urinary albumin excretion (UAE) ≥30 mg/g creatinine, and, if so, whether the relationship of central SBP with albuminuria was stronger than that of peripheral SBP in patients with type 2 diabetes. The authors performed a cross-sectional study in 294 outpatients with type 2 diabetes. The relationship between peripheral SBP or central SBP and UAE using regression analysis was evaluated, and the odds ratios of peripheral SBP or central SBP were calculated to identify albuminuria using logistic regression model. Moreover, the area under the receiver operating characteristic curve (AUC) of central SBP was compared with that of peripheral SBP to identify albuminuria. Multiple regression analysis demonstrated that peripheral SBP (β=0.255, P<.0001) or central SBP (r=0.227, P<.0001) was associated with UAE. Multiple logistic regression analysis demonstrated that peripheral SBP (odds ratio, 1.029; 95% confidence interval, 1.016-1.043) or central SBP (odds ratio, 1.022; 95% confidence interval, 1.011-1.034) was associated with an increased odds of albuminuria. In addition, AUC of peripheral SBP was significantly greater than that of central SBP to identify albuminuria (P=0.035). Peripheral SBP is superior to central SBP in identifying albuminuria, although both peripheral and central SBP are associated with UAE in patients with type 2 diabetes. © 2016 Wiley Periodicals, Inc.

  1. Testing in Microbiome-Profiling Studies with MiRKAT, the Microbiome Regression-Based Kernel Association Test

    PubMed Central

    Zhao, Ni; Chen, Jun; Carroll, Ian M.; Ringel-Kulka, Tamar; Epstein, Michael P.; Zhou, Hua; Zhou, Jin J.; Ringel, Yehuda; Li, Hongzhe; Wu, Michael C.

    2015-01-01

    High-throughput sequencing technology has enabled population-based studies of the role of the human microbiome in disease etiology and exposure response. Distance-based analysis is a popular strategy for evaluating the overall association between microbiome diversity and outcome, wherein the phylogenetic distance between individuals’ microbiome profiles is computed and tested for association via permutation. Despite their practical popularity, distance-based approaches suffer from important challenges, especially in selecting the best distance and extending the methods to alternative outcomes, such as survival outcomes. We propose the microbiome regression-based kernel association test (MiRKAT), which directly regresses the outcome on the microbiome profiles via the semi-parametric kernel machine regression framework. MiRKAT allows for easy covariate adjustment and extension to alternative outcomes while non-parametrically modeling the microbiome through a kernel that incorporates phylogenetic distance. It uses a variance-component score statistic to test for the association with analytical p value calculation. The model also allows simultaneous examination of multiple distances, alleviating the problem of choosing the best distance. Our simulations demonstrated that MiRKAT provides correctly controlled type I error and adequate power in detecting overall association. “Optimal” MiRKAT, which considers multiple candidate distances, is robust in that it suffers from little power loss in comparison to when the best distance is used and can achieve tremendous power gain in comparison to when a poor distance is chosen. Finally, we applied MiRKAT to real microbiome datasets to show that microbial communities are associated with smoking and with fecal protease levels after confounders are controlled for. PMID:25957468

  2. A nonparametric multiple imputation approach for missing categorical data.

    PubMed

    Zhou, Muhan; He, Yulei; Yu, Mandi; Hsu, Chiu-Hsieh

    2017-06-06

    Incomplete categorical variables with more than two categories are common in public health data. However, most of the existing missing-data methods do not use the information from nonresponse (missingness) probabilities. We propose a nearest-neighbour multiple imputation approach to impute a missing at random categorical outcome and to estimate the proportion of each category. The donor set for imputation is formed by measuring distances between each missing value with other non-missing values. The distance function is calculated based on a predictive score, which is derived from two working models: one fits a multinomial logistic regression for predicting the missing categorical outcome (the outcome model) and the other fits a logistic regression for predicting missingness probabilities (the missingness model). A weighting scheme is used to accommodate contributions from two working models when generating the predictive score. A missing value is imputed by randomly selecting one of the non-missing values with the smallest distances. We conduct a simulation to evaluate the performance of the proposed method and compare it with several alternative methods. A real-data application is also presented. The simulation study suggests that the proposed method performs well when missingness probabilities are not extreme under some misspecifications of the working models. However, the calibration estimator, which is also based on two working models, can be highly unstable when missingness probabilities for some observations are extremely high. In this scenario, the proposed method produces more stable and better estimates. In addition, proper weights need to be chosen to balance the contributions from the two working models and achieve optimal results for the proposed method. We conclude that the proposed multiple imputation method is a reasonable approach to dealing with missing categorical outcome data with more than two levels for assessing the distribution of the outcome. In terms of the choices for the working models, we suggest a multinomial logistic regression for predicting the missing outcome and a binary logistic regression for predicting the missingness probability.

  3. Can Functional Cardiac Age be Predicted from ECG in a Normal Healthy Population

    NASA Technical Reports Server (NTRS)

    Schlegel, Todd; Starc, Vito; Leban, Manja; Sinigoj, Petra; Vrhovec, Milos

    2011-01-01

    In a normal healthy population, we desired to determine the most age-dependent conventional and advanced ECG parameters. We hypothesized that changes in several ECG parameters might correlate with age and together reliably characterize the functional age of the heart. Methods: An initial study population of 313 apparently healthy subjects was ultimately reduced to 148 subjects (74 men, 84 women, in the range from 10 to 75 years of age) after exclusion criteria. In all subjects, ECG recordings (resting 5-minute 12-lead high frequency ECG) were evaluated via custom software programs to calculate up to 85 different conventional and advanced ECG parameters including beat-to-beat QT and RR variability, waveform complexity, and signal-averaged, high-frequency and spatial/spatiotemporal ECG parameters. The prediction of functional age was evaluated by multiple linear regression analysis using the best 5 univariate predictors. Results: Ignoring what were ultimately small differences between males and females, the functional age was found to be predicted (R2= 0.69, P < 0.001) from a linear combination of 5 independent variables: QRS elevation in the frontal plane (p<0.001), a new repolarization parameter QTcorr (p<0.001), mean high frequency QRS amplitude (p=0.009), the variability parameter % VLF of RRV (p=0.021) and the P-wave width (p=0.10). Here, QTcorr represents the correlation between the calculated QT and the measured QT signal. Conclusions: In apparently healthy subjects with normal conventional ECGs, functional cardiac age can be estimated by multiple linear regression analysis of mostly advanced ECG results. Because some parameters in the regression formula, such as QTcorr, high frequency QRS amplitude and P-wave width also change with disease in the same direction as with increased age, increased functional age of the heart may reflect subtle age-related pathologies in cardiac electrical function that are usually hidden on conventional ECG.

  4. A Quantile Regression Approach to Understanding the Relations Between Morphological Awareness, Vocabulary, and Reading Comprehension in Adult Basic Education Students

    PubMed Central

    Tighe, Elizabeth L.; Schatschneider, Christopher

    2015-01-01

    The purpose of this study was to investigate the joint and unique contributions of morphological awareness and vocabulary knowledge at five reading comprehension levels in Adult Basic Education (ABE) students. We introduce the statistical technique of multiple quantile regression, which enabled us to assess the predictive utility of morphological awareness and vocabulary knowledge at multiple points (quantiles) along the continuous distribution of reading comprehension. To demonstrate the efficacy of our multiple quantile regression analysis, we compared and contrasted our results with a traditional multiple regression analytic approach. Our results indicated that morphological awareness and vocabulary knowledge accounted for a large portion of the variance (82-95%) in reading comprehension skills across all quantiles. Morphological awareness exhibited the greatest unique predictive ability at lower levels of reading comprehension whereas vocabulary knowledge exhibited the greatest unique predictive ability at higher levels of reading comprehension. These results indicate the utility of using multiple quantile regression to assess trajectories of component skills across multiple levels of reading comprehension. The implications of our findings for ABE programs are discussed. PMID:25351773

  5. ℓ(p)-Norm multikernel learning approach for stock market price forecasting.

    PubMed

    Shao, Xigao; Wu, Kun; Liao, Bifeng

    2012-01-01

    Linear multiple kernel learning model has been used for predicting financial time series. However, ℓ(1)-norm multiple support vector regression is rarely observed to outperform trivial baselines in practical applications. To allow for robust kernel mixtures that generalize well, we adopt ℓ(p)-norm multiple kernel support vector regression (1 ≤ p < ∞) as a stock price prediction model. The optimization problem is decomposed into smaller subproblems, and the interleaved optimization strategy is employed to solve the regression model. The model is evaluated on forecasting the daily stock closing prices of Shanghai Stock Index in China. Experimental results show that our proposed model performs better than ℓ(1)-norm multiple support vector regression model.

  6. [Association between hours of television watched, physical activity, sleep and excess weight among young adults].

    PubMed

    Martínez-Moyá, María; Navarrete-Muñoz, Eva M; García de la Hera, Manuela; Giménez-Monzo, Daniel; González-Palacios, Sandra; Valera-Gran, Desirée; Sempere-Orts, María; Vioque, Jesús

    2014-01-01

    To explore the association between excess weight or body mass index (BMI) and the time spent watching television, self-reported physical activity and sleep duration in a young adult population. We analyzed cross-sectional baseline data of 1,135 participants (17-35 years old) from the project Dieta, salud y antropometría en población universitaria (Diet, Health and Anthrompmetric Variables in Univeristy Students). Information about time spent watching television, sleep duration, self-reported physical activity and self-reported height and weight was provided by a baseline questionnaire. BMI was calculated as kg/m(2) and excess of weight was defined as ≥25. We used multiple logistic regression to explore the association between excess weight (no/yes) and independent variables, and multiple linear regression for BMI. The prevalence of excess weight was 13.7% (11.2% were overweight and 2.5% were obese). A significant positive association was found between excess weight and a greater amount of time spent watching television. Participants who reported watching television >2h a day had a higher risk of excess weight than those who watched television ≤1h a day (OR=2.13; 95%CI: 1.37-3.36; p-trend: 0.002). A lower level of physical activity was associated with an increased risk of excess weight, although the association was statistically significant only in multiple linear regression (p=0.037). No association was observed with sleep duration. A greater number of hours spent watching television and lower physical activity were significantly associated with a higher BMI in young adults. Both factors are potentially modifiable with preventive strategies. Copyright © 2013 SESPAS. Published by Elsevier Espana. All rights reserved.

  7. Multiple calibrator measurements improve accuracy and stability estimates of automated assays.

    PubMed

    Akbas, Neval; Budd, Jeffrey R; Klee, George G

    2016-01-01

    The effects of combining multiple calibrations on assay accuracy (bias) and measurement of calibration stability were investigated for total triiodothyronine (TT3), vitamin B12 and luteinizing hormone (LH) using Beckman Coulter's Access 2 analyzer. Three calibration procedures (CC1, CC2 and CC3) combined 12, 34 and 56 calibrator measurements over 1, 2, and 3 days. Bias was calculated between target values and average measured value over 3 consecutive days after calibration. Using regression analysis of calibrator measurements versus measurement date, calibration stability was determined as the maximum number of days before a calibrator measurement exceeded 5% tolerance limits. Competitive assays (TT3, vitamin B12) had positive time regression slopes, while sandwich assay (LH) had a negative slope. Bias values for TT3 were -2.49%, 1.49%, and -0.50% using CC1, CC2 and CC3 respectively, with calibrator stability of 32, 20, and 30 days. Bias values for vitamin B12 were 2.44%, 0.91%, and -0.50%, with calibrator stability of 4, 9, and 12 days. Bias values for LH were 2.26%, 1.44% and -0.29% with calibrator stability of >43, 39 and 36 days. Measured stability was more consistent across calibration procedures using percent change rather than difference from target: 26 days for TT3, 12 days for B12 and 31 days for LH. Averaging over multiple calibrations produced smaller bias, consistent with improved accuracy. Time regression slopes in percent change were unaffected by number of calibration measurements but calibrator stability measured from the target value was highly affected by the calibrator value at time zero.

  8. Loss of Load Probability Calculation for West Java Power System with Nuclear Power Plant Scenario

    NASA Astrophysics Data System (ADS)

    Azizah, I. D.; Abdullah, A. G.; Purnama, W.; Nandiyanto, A. B. D.; Shafii, M. A.

    2017-03-01

    Loss of Load Probability (LOLP) index showing the quality and performance of an electrical system. LOLP value is affected by load growth, the load duration curve, forced outage rate of the plant, number and capacity of generating units. This reliability index calculation begins with load forecasting to 2018 using multiple regression method. Scenario 1 with compositions of conventional plants produce the largest LOLP in 2017 amounted to 71.609 days / year. While the best reliability index generated in scenario 2 with the NPP amounted to 6.941 days / year in 2015. Improved reliability of systems using nuclear power more efficiently when compared to conventional plants because it also has advantages such as emission-free, inexpensive fuel costs, as well as high level of plant availability.

  9. Non-contact measurement of pulse wave velocity using RGB cameras

    NASA Astrophysics Data System (ADS)

    Nakano, Kazuya; Aoki, Yuta; Satoh, Ryota; Hoshi, Akira; Suzuki, Hiroyuki; Nishidate, Izumi

    2016-03-01

    Non-contact measurement of pulse wave velocity (PWV) using red, green, and blue (RGB) digital color images is proposed. Generally, PWV is used as the index of arteriosclerosis. In our method, changes in blood volume are calculated based on changes in the color information, and is estimated by combining multiple regression analysis (MRA) with a Monte Carlo simulation (MCS) model of the transit of light in human skin. After two pulse waves of human skins were measured using RGB cameras, and the PWV was calculated from the difference of the pulse transit time and the distance between two measurement points. The measured forehead-finger PWV (ffPWV) was on the order of m/s and became faster as the values of vital signs raised. These results demonstrated the feasibility of this method.

  10. Accounting for Multiple Births in Neonatal and Perinatal Trials: Systematic Review and Case Study

    PubMed Central

    Hibbs, Anna Maria; Black, Dennis; Palermo, Lisa; Cnaan, Avital; Luan, Xianqun; Truog, William E; Walsh, Michele C; Ballard, Roberta A

    2010-01-01

    Objectives To determine the prevalence in the neonatal literature of statistical approaches accounting for the unique clustering patterns of multiple births. To explore the sensitivity of an actual trial to several analytic approaches to multiples. Methods A systematic review of recent perinatal trials assessed the prevalence of studies accounting for clustering of multiples. The NO CLD trial served as a case study of the sensitivity of the outcome to several statistical strategies. We calculated odds ratios using non-clustered (logistic regression) and clustered (generalized estimating equations, multiple outputation) analyses. Results In the systematic review, most studies did not describe the randomization of twins and did not account for clustering. Of those studies that did, exclusion of multiples and generalized estimating equations were the most common strategies. The NO CLD study included 84 infants with a sibling enrolled in the study. Multiples were more likely than singletons to be white and were born to older mothers (p<0.01). Analyses that accounted for clustering were statistically significant; analyses assuming independence were not. Conclusions The statistical approach to multiples can influence the odds ratio and width of confidence intervals, thereby affecting the interpretation of a study outcome. A minority of perinatal studies address this issue. PMID:19969305

  11. Accounting for multiple births in neonatal and perinatal trials: systematic review and case study.

    PubMed

    Hibbs, Anna Maria; Black, Dennis; Palermo, Lisa; Cnaan, Avital; Luan, Xianqun; Truog, William E; Walsh, Michele C; Ballard, Roberta A

    2010-02-01

    To determine the prevalence in the neonatal literature of statistical approaches accounting for the unique clustering patterns of multiple births and to explore the sensitivity of an actual trial to several analytic approaches to multiples. A systematic review of recent perinatal trials assessed the prevalence of studies accounting for clustering of multiples. The Nitric Oxide to Prevent Chronic Lung Disease (NO CLD) trial served as a case study of the sensitivity of the outcome to several statistical strategies. We calculated odds ratios using nonclustered (logistic regression) and clustered (generalized estimating equations, multiple outputation) analyses. In the systematic review, most studies did not describe the random assignment of twins and did not account for clustering. Of those studies that did, exclusion of multiples and generalized estimating equations were the most common strategies. The NO CLD study included 84 infants with a sibling enrolled in the study. Multiples were more likely than singletons to be white and were born to older mothers (P < .01). Analyses that accounted for clustering were statistically significant; analyses assuming independence were not. The statistical approach to multiples can influence the odds ratio and width of confidence intervals, thereby affecting the interpretation of a study outcome. A minority of perinatal studies address this issue. Copyright 2010 Mosby, Inc. All rights reserved.

  12. Detecting outliers when fitting data with nonlinear regression – a new method based on robust nonlinear regression and the false discovery rate

    PubMed Central

    Motulsky, Harvey J; Brown, Ronald E

    2006-01-01

    Background Nonlinear regression, like linear regression, assumes that the scatter of data around the ideal curve follows a Gaussian or normal distribution. This assumption leads to the familiar goal of regression: to minimize the sum of the squares of the vertical or Y-value distances between the points and the curve. Outliers can dominate the sum-of-the-squares calculation, and lead to misleading results. However, we know of no practical method for routinely identifying outliers when fitting curves with nonlinear regression. Results We describe a new method for identifying outliers when fitting data with nonlinear regression. We first fit the data using a robust form of nonlinear regression, based on the assumption that scatter follows a Lorentzian distribution. We devised a new adaptive method that gradually becomes more robust as the method proceeds. To define outliers, we adapted the false discovery rate approach to handling multiple comparisons. We then remove the outliers, and analyze the data using ordinary least-squares regression. Because the method combines robust regression and outlier removal, we call it the ROUT method. When analyzing simulated data, where all scatter is Gaussian, our method detects (falsely) one or more outlier in only about 1–3% of experiments. When analyzing data contaminated with one or several outliers, the ROUT method performs well at outlier identification, with an average False Discovery Rate less than 1%. Conclusion Our method, which combines a new method of robust nonlinear regression with a new method of outlier identification, identifies outliers from nonlinear curve fits with reasonable power and few false positives. PMID:16526949

  13. Advanced statistics: linear regression, part II: multiple linear regression.

    PubMed

    Marill, Keith A

    2004-01-01

    The applications of simple linear regression in medical research are limited, because in most situations, there are multiple relevant predictor variables. Univariate statistical techniques such as simple linear regression use a single predictor variable, and they often may be mathematically correct but clinically misleading. Multiple linear regression is a mathematical technique used to model the relationship between multiple independent predictor variables and a single dependent outcome variable. It is used in medical research to model observational data, as well as in diagnostic and therapeutic studies in which the outcome is dependent on more than one factor. Although the technique generally is limited to data that can be expressed with a linear function, it benefits from a well-developed mathematical framework that yields unique solutions and exact confidence intervals for regression coefficients. Building on Part I of this series, this article acquaints the reader with some of the important concepts in multiple regression analysis. These include multicollinearity, interaction effects, and an expansion of the discussion of inference testing, leverage, and variable transformations to multivariate models. Examples from the first article in this series are expanded on using a primarily graphic, rather than mathematical, approach. The importance of the relationships among the predictor variables and the dependence of the multivariate model coefficients on the choice of these variables are stressed. Finally, concepts in regression model building are discussed.

  14. Modeling Polytomous Item Responses Using Simultaneously Estimated Multinomial Logistic Regression Models

    ERIC Educational Resources Information Center

    Anderson, Carolyn J.; Verkuilen, Jay; Peyton, Buddy L.

    2010-01-01

    Survey items with multiple response categories and multiple-choice test questions are ubiquitous in psychological and educational research. We illustrate the use of log-multiplicative association (LMA) models that are extensions of the well-known multinomial logistic regression model for multiple dependent outcome variables to reanalyze a set of…

  15. A Method for Calculating the Probability of Successfully Completing a Rocket Propulsion Ground Test

    NASA Technical Reports Server (NTRS)

    Messer, Bradley

    2007-01-01

    Propulsion ground test facilities face the daily challenge of scheduling multiple customers into limited facility space and successfully completing their propulsion test projects. Over the last decade NASA s propulsion test facilities have performed hundreds of tests, collected thousands of seconds of test data, and exceeded the capabilities of numerous test facility and test article components. A logistic regression mathematical modeling technique has been developed to predict the probability of successfully completing a rocket propulsion test. A logistic regression model is a mathematical modeling approach that can be used to describe the relationship of several independent predictor variables X(sub 1), X(sub 2),.., X(sub k) to a binary or dichotomous dependent variable Y, where Y can only be one of two possible outcomes, in this case Success or Failure of accomplishing a full duration test. The use of logistic regression modeling is not new; however, modeling propulsion ground test facilities using logistic regression is both a new and unique application of the statistical technique. Results from this type of model provide project managers with insight and confidence into the effectiveness of rocket propulsion ground testing.

  16. Relationship between masticatory performance using a gummy jelly and masticatory movement.

    PubMed

    Uesugi, Hanako; Shiga, Hiroshi

    2017-10-01

    The purpose of this study was to clarify the relationship between masticatory performance using a gummy jelly and masticatory movement. Thirty healthy males were asked to chew a gummy jelly on their habitual chewing side for 20s, and the parameters of masticatory performance and masticatory movement were calculated as follows. For evaluating the masticatory performance, the amount of glucose extraction during chewing of a gummy jelly was measured. For evaluating the masticatory movement, the movement of the mandibular incisal point was recorded using the MKG K6-I, and ten parameters of the movement path (opening distance and masticatory width), movement rhythm (opening time, closing time, occluding time, and cycle time), stability of movement (stability of path and stability of rhythm), and movement velocity (opening maximum velocity and closing maximum velocity) were calculated from 10 cycles of chewing beginning with the fifth cycle. The relationship between the amount of glucose extraction and parameters representing masticatory movement was investigated and then stepwise multiple linear regression analysis was performed. The amount of glucose extraction was associated with 7 parameters representing the masticatory movement. Stepwise multiple linear regression analysis showed that the opening distance, closing time, stability of rhythm, and closing maximum velocity were the most important factors affecting the glucose extraction. From these results it was suggested that there was a close relation between masticatory performance and masticatory movement, and that the masticatory performance could be increased by rhythmic, rapid and stable mastication with a large opening distance. Copyright © 2017 Japan Prosthodontic Society. Published by Elsevier Ltd. All rights reserved.

  17. Prediction of human core body temperature using non-invasive measurement methods.

    PubMed

    Niedermann, Reto; Wyss, Eva; Annaheim, Simon; Psikuta, Agnes; Davey, Sarah; Rossi, René Michel

    2014-01-01

    The measurement of core body temperature is an efficient method for monitoring heat stress amongst workers in hot conditions. However, invasive measurement of core body temperature (e.g. rectal, intestinal, oesophageal temperature) is impractical for such applications. Therefore, the aim of this study was to define relevant non-invasive measures to predict core body temperature under various conditions. We conducted two human subject studies with different experimental protocols, different environmental temperatures (10 °C, 30 °C) and different subjects. In both studies the same non-invasive measurement methods (skin temperature, skin heat flux, heart rate) were applied. A principle component analysis was conducted to extract independent factors, which were then used in a linear regression model. We identified six parameters (three skin temperatures, two skin heat fluxes and heart rate), which were included for the calculation of two factors. The predictive value of these factors for core body temperature was evaluated by a multiple regression analysis. The calculated root mean square deviation (rmsd) was in the range from 0.28 °C to 0.34 °C for all environmental conditions. These errors are similar to previous models using non-invasive measures to predict core body temperature. The results from this study illustrate that multiple physiological parameters (e.g. skin temperature and skin heat fluxes) are needed to predict core body temperature. In addition, the physiological measurements chosen in this study and the algorithm defined in this work are potentially applicable as real-time core body temperature monitoring to assess health risk in broad range of working conditions.

  18. Sources of variability in satellite-derived estimates of Phytoplankton production in the eastern tropical Pacific

    NASA Astrophysics Data System (ADS)

    Banse, Karl; Yong, Marina

    1990-05-01

    As a proxy for satellite (coastal zone color scanner) observations and concurrent measurements of primary production rates, data from 138 stations occupied seasonally during 1967-1968 in the offshore, eastern tropical Pacific were analyzed in terms of six temporal groups and four current regimes. In multiple linear regressions on column production Pt, we found that simulated satellite pigment is generally weakly correlated, but sometimes not correlated with Pt, and that incident irradiance, sea surface temperature, nitrate, transparency, and depths of mixed layer or nitracline assume little or no importance. After a proxy for the light-saturated chlorophyll-specific photosynthetic rate pmax is added, the coefficient of determination (r2) ranges from 0.55 to 0.91 (median of 0.85) for the 10 cases. In stepwise multiple linear regressions the pmax proxy is the best predictor for Pt. Pt can be calculated fairly accurately (on the average, within 10-20%) from satellite pigment, the 10% light depth, and station values (but not from regional or seasonal means) of the pmax proxy; for individual stations the precision is 35-84% (median of 57% for the 10 groupings; p = 0.05) of the means of observed values. At present, pmax cannot be estimated from space; in the data set it is not even highly correlated with irradiance, temperature, and nitrate at depth of occurrence. Therefore extant models for calculating Pt in this tropical ocean have inherent limits of accuracy as well as of precision owing to ignorance about a physiological parameter.

  19. Poor anaerobic power/capability and static balance predicted prospective musculoskeletal injuries among Soldiers of the 101st Airborne (Air Assault) Division.

    PubMed

    Nagai, Takashi; Lovalekar, Mita; Wohleber, Meleesa F; Perlsweig, Katherine A; Wirt, Michael D; Beals, Kim

    2017-11-01

    Musculoskeletal injuries have negatively impacted tactical readiness. The identification of prospective and modifiable risk factors of preventable musculoskeletal injuries can guide specific injury prevention strategies for Soldiers and health care providers. To analyze physiological and neuromuscular characteristics as predictors of preventable musculoskeletal injuries. Prospective-cohort study. A total of 491 Soldiers were enrolled and participated in the baseline laboratory testing, including body composition, aerobic capacity, anaerobic power/capacity, muscular strength, flexibility, static balance, and landing biomechanics. After reviewing their medical charts, 275 male Soldiers who met the criteria were divided into two groups: with injuries (INJ) and no injuries (NOI). Simple and multiple logistic regression analyses were used to calculate the odds ratio (OR) and significant predictors of musculoskeletal injuries (p<0.05). The final multiple logistic regression model included the static balance with eyes-closed and peak anaerobic power as predictors of future injuries (p<0.001). The current results highlighted the importance of anaerobic power/capacity and static balance. High intensity training and balance exercise should be incorporated in their physical training as countermeasures. Copyright © 2017 Sports Medicine Australia. All rights reserved.

  20. Estimation of Magnitude and Frequency of Floods for Streams on the Island of Oahu, Hawaii

    USGS Publications Warehouse

    Wong, Michael F.

    1994-01-01

    This report describes techniques for estimating the magnitude and frequency of floods for the island of Oahu. The log-Pearson Type III distribution and methodology recommended by the Interagency Committee on Water Data was used to determine the magnitude and frequency of floods at 79 gaging stations that had 11 to 72 years of record. Multiple regression analysis was used to construct regression equations to transfer the magnitude and frequency information from gaged sites to ungaged sites. Oahu was divided into three hydrologic regions to define relations between peak discharge and drainage-basin and climatic characteristics. Regression equations are provided to estimate the 2-, 5-, 10-, 25-, 50-, and 100-year peak discharges at ungaged sites. Significant basin and climatic characteristics included in the regression equations are drainage area, median annual rainfall, and the 2-year, 24-hour rainfall intensity. Drainage areas for sites used in this study ranged from 0.03 to 45.7 square miles. Standard error of prediction for the regression equations ranged from 34 to 62 percent. Peak-discharge data collected through water year 1988, geographic information system (GIS) technology, and generalized least-squares regression were used in the analyses. The use of GIS seems to be a more flexible and consistent means of defining and calculating basin and climatic characteristics than using manual methods. Standard errors of estimate for the regression equations in this report are an average of 8 percent less than those published in previous studies.

  1. Isolating and Examining Sources of Suppression and Multicollinearity in Multiple Linear Regression

    ERIC Educational Resources Information Center

    Beckstead, Jason W.

    2012-01-01

    The presence of suppression (and multicollinearity) in multiple regression analysis complicates interpretation of predictor-criterion relationships. The mathematical conditions that produce suppression in regression analysis have received considerable attention in the methodological literature but until now nothing in the way of an analytic…

  2. General Nature of Multicollinearity in Multiple Regression Analysis.

    ERIC Educational Resources Information Center

    Liu, Richard

    1981-01-01

    Discusses multiple regression, a very popular statistical technique in the field of education. One of the basic assumptions in regression analysis requires that independent variables in the equation should not be highly correlated. The problem of multicollinearity and some of the solutions to it are discussed. (Author)

  3. ℓ p-Norm Multikernel Learning Approach for Stock Market Price Forecasting

    PubMed Central

    Shao, Xigao; Wu, Kun; Liao, Bifeng

    2012-01-01

    Linear multiple kernel learning model has been used for predicting financial time series. However, ℓ 1-norm multiple support vector regression is rarely observed to outperform trivial baselines in practical applications. To allow for robust kernel mixtures that generalize well, we adopt ℓ p-norm multiple kernel support vector regression (1 ≤ p < ∞) as a stock price prediction model. The optimization problem is decomposed into smaller subproblems, and the interleaved optimization strategy is employed to solve the regression model. The model is evaluated on forecasting the daily stock closing prices of Shanghai Stock Index in China. Experimental results show that our proposed model performs better than ℓ 1-norm multiple support vector regression model. PMID:23365561

  4. A Quantile Regression Approach to Understanding the Relations Among Morphological Awareness, Vocabulary, and Reading Comprehension in Adult Basic Education Students.

    PubMed

    Tighe, Elizabeth L; Schatschneider, Christopher

    2016-07-01

    The purpose of this study was to investigate the joint and unique contributions of morphological awareness and vocabulary knowledge at five reading comprehension levels in adult basic education (ABE) students. We introduce the statistical technique of multiple quantile regression, which enabled us to assess the predictive utility of morphological awareness and vocabulary knowledge at multiple points (quantiles) along the continuous distribution of reading comprehension. To demonstrate the efficacy of our multiple quantile regression analysis, we compared and contrasted our results with a traditional multiple regression analytic approach. Our results indicated that morphological awareness and vocabulary knowledge accounted for a large portion of the variance (82%-95%) in reading comprehension skills across all quantiles. Morphological awareness exhibited the greatest unique predictive ability at lower levels of reading comprehension whereas vocabulary knowledge exhibited the greatest unique predictive ability at higher levels of reading comprehension. These results indicate the utility of using multiple quantile regression to assess trajectories of component skills across multiple levels of reading comprehension. The implications of our findings for ABE programs are discussed. © Hammill Institute on Disabilities 2014.

  5. Stepwise versus Hierarchical Regression: Pros and Cons

    ERIC Educational Resources Information Center

    Lewis, Mitzi

    2007-01-01

    Multiple regression is commonly used in social and behavioral data analysis. In multiple regression contexts, researchers are very often interested in determining the "best" predictors in the analysis. This focus may stem from a need to identify those predictors that are supportive of theory. Alternatively, the researcher may simply be interested…

  6. Role of regression analysis and variation of rheological data in calculation of pressure drop for sludge pipelines.

    PubMed

    Farno, E; Coventry, K; Slatter, P; Eshtiaghi, N

    2018-06-15

    Sludge pumps in wastewater treatment plants are often oversized due to uncertainty in calculation of pressure drop. This issue costs millions of dollars for industry to purchase and operate the oversized pumps. Besides costs, higher electricity consumption is associated with extra CO 2 emission which creates huge environmental impacts. Calculation of pressure drop via current pipe flow theory requires model estimation of flow curve data which depends on regression analysis and also varies with natural variation of rheological data. This study investigates impact of variation of rheological data and regression analysis on variation of pressure drop calculated via current pipe flow theories. Results compare the variation of calculated pressure drop between different models and regression methods and suggest on the suitability of each method. Copyright © 2018 Elsevier Ltd. All rights reserved.

  7. Optimum pelvic incidence minus lumbar lordosis value can be determined by individual pelvic incidence.

    PubMed

    Inami, Satoshi; Moridaira, Hiroshi; Takeuchi, Daisaku; Shiba, Yo; Nohara, Yutaka; Taneichi, Hiroshi

    2016-11-01

    Adult spinal deformity (ASD) classification showing that ideal pelvic incidence minus lumbar lordosis (PI-LL) value is within 10° has been received widely. But no study has focused on the optimum level of PI-LL value that reflects wide variety in PI among patients. This study was conducted to determine the optimum PI-LL value specific to an individual's PI in postoperative ASD patients. 48 postoperative ASD patients were recruited. Spino-pelvic parameters and Oswestry Disability Index (ODI) were measured at the final follow-up. Factors associated with good clinical results were determined by stepwise multiple regression model using the ODI. The patients with ODI under the 75th percentile cutoff were designated into the "good" health related quality of life (HRQOL) group. In this group, the relationship between the PI-LL and PI was assessed by regression analysis. Multiple regression analysis revealed PI-LL as significant parameters associated with ODI. Thirty-six patients with an ODI <22 points (75th percentile cutoff) were categorized into a good HRQOL group, and linear regression models demonstrated the following equation: PI-LL = 0.41PI-11.12 (r = 0.45, P = 0.0059). On the basis of this equation, in the patients with a PI = 50°, the PI-LL is 9°. Whereas in those with a PI = 30°, the optimum PI-LL is calculated to be as low as 1°. In those with a PI = 80°, PI-LL is estimated at 22°. Consequently, an optimum PI-LL is inconsistent in that it depends on the individual PI.

  8. Bayesian Regression of Thermodynamic Models of Redox Active Materials

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Johnston, Katherine

    Finding a suitable functional redox material is a critical challenge to achieving scalable, economically viable technologies for storing concentrated solar energy in the form of a defected oxide. Demonstrating e ectiveness for thermal storage or solar fuel is largely accomplished by using a thermodynamic model derived from experimental data. The purpose of this project is to test the accuracy of our regression model on representative data sets. Determining the accuracy of the model includes parameter tting the model to the data, comparing the model using di erent numbers of param- eters, and analyzing the entropy and enthalpy calculated from themore » model. Three data sets were considered in this project: two demonstrating materials for solar fuels by wa- ter splitting and the other of a material for thermal storage. Using Bayesian Inference and Markov Chain Monte Carlo (MCMC), parameter estimation was preformed on the three data sets. Good results were achieved, except some there was some deviations on the edges of the data input ranges. The evidence values were then calculated in a variety of ways and used to compare models with di erent number of parameters. It was believed that at least one of the parameters was unnecessary and comparing evidence values demonstrated that the parameter was need on one data set and not signi cantly helpful on another. The entropy was calculated by taking the derivative in one variable and integrating over another. and its uncertainty was also calculated by evaluating the entropy over multiple MCMC samples. Afterwards, all the parts were written up as a tutorial for the Uncertainty Quanti cation Toolkit (UQTk).« less

  9. Use of Empirical Estimates of Shrinkage in Multiple Regression: A Caution.

    ERIC Educational Resources Information Center

    Kromrey, Jeffrey D.; Hines, Constance V.

    1995-01-01

    The accuracy of four empirical techniques to estimate shrinkage in multiple regression was studied through Monte Carlo simulation. None of the techniques provided unbiased estimates of the population squared multiple correlation coefficient, but the normalized jackknife and bootstrap techniques demonstrated marginally acceptable performance with…

  10. Enhance-Synergism and Suppression Effects in Multiple Regression

    ERIC Educational Resources Information Center

    Lipovetsky, Stan; Conklin, W. Michael

    2004-01-01

    Relations between pairwise correlations and the coefficient of multiple determination in regression analysis are considered. The conditions for the occurrence of enhance-synergism and suppression effects when multiple determination becomes bigger than the total of squared correlations of the dependent variable with the regressors are discussed. It…

  11. Validating the absolute reliability of a fat free mass estimate equation in hemodialysis patients using near-infrared spectroscopy.

    PubMed

    Kono, Kenichi; Nishida, Yusuke; Moriyama, Yoshihumi; Taoka, Masahiro; Sato, Takashi

    2015-06-01

    The assessment of nutritional states using fat free mass (FFM) measured with near-infrared spectroscopy (NIRS) is clinically useful. This measurement should incorporate the patient's post-dialysis weight ("dry weight"), in order to exclude the effects of any change in water mass. We therefore used NIRS to investigate the regression, independent variables, and absolute reliability of FFM in dry weight. The study included 47 outpatients from the hemodialysis unit. Body weight was measured before dialysis, and FFM was measured using NIRS before and after dialysis treatment. Multiple regression analysis was used to estimate the FFM in dry weight as the dependent variable. The measured FFM before dialysis treatment (Mw-FFM), and the difference between measured and dry weight (Mw-Dw) were independent variables. We performed Bland-Altman analysis to detect errors between the statistically estimated FFM and the measured FFM after dialysis treatment. The multiple regression equation to estimate the FFM in dry weight was: Dw-FFM = 0.038 + (0.984 × Mw-FFM) + (-0.571 × [Mw-Dw]); R(2)  = 0.99). There was no systematic bias between the estimated and the measured values of FFM in dry weight. Using NIRS, FFM in dry weight can be calculated by an equation including FFM in measured weight and the difference between the measured weight and the dry weight. © 2015 The Authors. Therapeutic Apheresis and Dialysis © 2015 International Society for Apheresis.

  12. Protective Effect of HLA-DQB1 Alleles Against Alloimmunization in Patients with Sickle Cell Disease

    PubMed Central

    Tatari-Calderone, Zohreh; Gordish-Dressman, Heather; Fasano, Ross; Riggs, Michael; Fortier, Catherine; Andrew; Campbell, D.; Charron, Dominique; Gordeuk, Victor R.; Luban, Naomi L.C.; Vukmanovic, Stanislav; Tamouza, Ryad

    2015-01-01

    Background Alloimmunization or the development of alloantibodies to Red Blood Cell (RBC) antigens is considered one of the major complications after RBC transfusions in patients with sickle cell disease (SCD) and can lead to both acute and delayed hemolytic reactions. It has been suggested that polymorphisms in HLA genes, may play a role in alloimmunization. We conducted a retrospective study analyzing the influence of HLA-DRB1 and DQB1 genetic diversity on RBC-alloimmunization. Study design Two-hundred four multi-transfused SCD patients with and without RBC-alloimmunization were typed at low/medium resolution by PCR-SSO, using IMGT-HLA Database. HLA-DRB1 and DQB1 allele frequencies were analyzed using logistic regression models, and global p-value was calculated using multiple logistic regression. Results While only trends towards associations between HLA-DR diversity and alloimmunization were observed, analysis of HLA-DQ showed that HLA-DQ2 (p=0.02), -DQ3 (p=0.02) and -DQ5 (p=0.01) alleles were significantly higher in non-alloimmunized patients, likely behaving as protective alleles. In addition, multiple logistic regression analysis showed both HLA-DQ2/6 (p=0.01) and HLA-DQ5/5 (p=0.03) combinations constitute additional predictor of protective status. Conclusion Our data suggest that particular HLA-DQ alleles influence the clinical course of RBC transfusion in patients with SCD, which could pave the way towards predictive strategies. PMID:26476208

  13. Inflammation, homocysteine and carotid intima-media thickness.

    PubMed

    Baptista, Alexandre P; Cacdocar, Sanjiva; Palmeiro, Hugo; Faísca, Marília; Carrasqueira, Herménio; Morgado, Elsa; Sampaio, Sandra; Cabrita, Ana; Silva, Ana Paula; Bernardo, Idalécio; Gome, Veloso; Neves, Pedro L

    2008-01-01

    Cardiovascular disease is the main cause of morbidity and mortality in chronic renal patients. Carotid intima-media thickness (CIMT) is one of the most accurate markers of atherosclerosis risk. In this study, the authors set out to evaluate a population of chronic renal patients to determine which factors are associated with an increase in intima-media thickness. We included 56 patients (F=22, M=34), with a mean age of 68.6 years, and an estimated glomerular filtration rate of 15.8 ml/min (calculated by the MDRD equation). Various laboratory and inflammatory parameters (hsCRP, IL-6 and TNF-alpha) were evaluated. All subjects underwent measurement of internal carotid artery intima-media thickness by high-resolution real-time B-mode ultrasonography using a 10 MHz linear transducer. Intima-media thickness was used as a dependent variable in a simple linear regression model, with the various laboratory parameters as independent variables. Only parameters showing a significant correlation with CIMT were evaluated in a multiple regression model: age (p=0.001), hemoglobin (p=00.3), logCRP (p=0.042), logIL-6 (p=0.004) and homocysteine (p=0.002). In the multiple regression model we found that age (p=0.001) and homocysteine (p=0.027) were independently correlated with CIMT. LogIL-6 did not reach statistical significance (p=0.057), probably due to the small population size. The authors conclude that age and homocysteine correlate with carotid intima-media thickness, and thus can be considered as markers/risk factors in chronic renal patients.

  14. Work-related injuries involving a hand or fingers among union carpenters in Washington State, 1989 to 2008.

    PubMed

    Lipscomb, Hester J; Schoenfisch, Ashley; Cameron, Wilfrid

    2013-07-01

    We evaluated work-related injuries involving a hand or fingers and associated costs among a cohort of 24,830 carpenters between 1989 and 2008. Injury rates and rate ratios were calculated by using Poisson regression to explore higher risk on the basis of age, sex, time in the union, predominant work, and calendar time. Negative binomial regression was used to model dollars paid per claim after adjustment for inflation and discounting. Hand injuries accounted for 21.1% of reported injuries and 9.5% of paid lost time injuries. Older carpenters had proportionately more amputations, fractures, and multiple injuries, but their rates of these more severe injuries were not higher. Costs exceeded $21 million, a cost burden of $0.11 per hour worked. Older carpenters' higher proportion of serious injuries in the absence of higher rates likely reflects age-related reporting differences.

  15. New 1,6-heptadienes with pyrimidine bases attached: Syntheses and spectroscopic analyses

    NASA Astrophysics Data System (ADS)

    Hammud, Hassan H.; Ghannoum, Amer M.; Fares, Fares A.; Abramian, Lara K.; Bouhadir, Kamal H.

    2008-06-01

    A simple, high yielding synthesis leading to the functionalization of some pyrimidine bases with a 1,6-heptadienyl moiety spaced from the N - 1 position by a methylene group is described. A key step in this synthesis involves a Mitsunobu reaction by coupling 3N-benzoyluracil and 3N-benzoylthymine to 2-allyl-pent-4-en-1-ol followed by alkaline hydrolysis of the 3N-benzoyl protecting groups. This protocol should eventually lend itself to the synthesis of a host of N-alkylated nucleoside analogs. The absorption and emission properties of these pyrimidine derivatives ( 3- 6) were studied in solvents of different physical properties. Computerized analysis and multiple regression techniques were applied to calculate the regression and correlation coefficients based on the equation that relates peak position λmax to the solvent parameters that depend on the H-bonding ability, refractive index, and dielectric constant of solvents.

  16. Odontological approach to sexual dimorphism in southeastern France.

    PubMed

    Lladeres, Emilie; Saliba-Serre, Bérengère; Sastre, Julien; Foti, Bruno; Tardivo, Delphine; Adalian, Pascal

    2013-01-01

    The aim of this study was to establish a prediction formula to allow for the determination of sex among the southeastern French population using dental measurements. The sample consisted of 105 individuals (57 males and 48 females, aged between 18 and 25 years). Dental measurements were calculated using Euclidean distances, in three-dimensional space, from point coordinates obtained by a Microscribe. A multiple logistic regression analysis was performed to establish the prediction formula. Among 12 selected dental distances, a stepwise logistic regression analysis highlighted the two most significant discriminate predictors of sex: one located at the mandible and the other at the maxilla. A cutpoint was proposed to prediction of true sex. The prediction formula was then tested on a validation sample (20 males and 34 females, aged between 18 and 62 years and with a history of orthodontics or restorative care) to evaluate the accuracy of the method. © 2012 American Academy of Forensic Sciences.

  17. Three-parameter modeling of the soil sorption of acetanilide and triazine herbicide derivatives.

    PubMed

    Freitas, Mirlaine R; Matias, Stella V B G; Macedo, Renato L G; Freitas, Matheus P; Venturin, Nelson

    2014-02-01

    Herbicides have widely variable toxicity and many of them are persistent soil contaminants. Acetanilide and triazine family of herbicides have widespread use, but increasing interest for the development of new herbicides has been rising to increase their effectiveness and to diminish environmental hazard. The environmental risk of new herbicides can be accessed by estimating their soil sorption (logKoc), which is usually correlated to the octanol/water partition coefficient (logKow). However, earlier findings have shown that this correlation is not valid for some acetanilide and triazine herbicides. Thus, easily accessible quantitative structure-property relationship models are required to predict logKoc of analogues of the these compounds. Octanol/water partition coefficient, molecular weight and volume were calculated and then regressed against logKoc for two series of acetanilide and triazine herbicides using multiple linear regression, resulting in predictive and validated models.

  18. Linear and nonlinear models for predicting fish bioconcentration factors for pesticides.

    PubMed

    Yuan, Jintao; Xie, Chun; Zhang, Ting; Sun, Jinfang; Yuan, Xuejie; Yu, Shuling; Zhang, Yingbiao; Cao, Yunyuan; Yu, Xingchen; Yang, Xuan; Yao, Wu

    2016-08-01

    This work is devoted to the applications of the multiple linear regression (MLR), multilayer perceptron neural network (MLP NN) and projection pursuit regression (PPR) to quantitative structure-property relationship analysis of bioconcentration factors (BCFs) of pesticides tested on Bluegill (Lepomis macrochirus). Molecular descriptors of a total of 107 pesticides were calculated with the DRAGON Software and selected by inverse enhanced replacement method. Based on the selected DRAGON descriptors, a linear model was built by MLR, nonlinear models were developed using MLP NN and PPR. The robustness of the obtained models was assessed by cross-validation and external validation using test set. Outliers were also examined and deleted to improve predictive power. Comparative results revealed that PPR achieved the most accurate predictions. This study offers useful models and information for BCF prediction, risk assessment, and pesticide formulation. Copyright © 2016 Elsevier Ltd. All rights reserved.

  19. Estimation of instantaneous heat transfer coefficients for a direct-injection stratified-charge rotary engine

    NASA Technical Reports Server (NTRS)

    Lee, C. M.; Addy, H. E.; Bond, T. H.; Chun, K. S.; Lu, C. Y.

    1987-01-01

    The main objective of this report was to derive equations to estimate heat transfer coefficients in both the combustion chamber and coolant pasage of a rotary engine. This was accomplished by making detailed temperature and pressure measurements in a direct-injection stratified-charge rotary engine under a range of conditions. For each sppecific measurement point, the local physical properties of the fluids were calculated. Then an empirical correlation of the coefficients was derived by using a multiple regression program. This correlation expresses the Nusselt number as a function of the Prandtl number and Reynolds number.

  20. Design and testing of digitally manufactured paraffin Acrylonitrile-butadiene-styrene hybrid rocket motors

    NASA Astrophysics Data System (ADS)

    McCulley, Jonathan M.

    This research investigates the application of additive manufacturing techniques for fabricating hybrid rocket fuel grains composed of porous Acrylonitrile-butadiene-styrene impregnated with paraffin wax. The digitally manufactured ABS substrate provides mechanical support for the paraffin fuel material and serves as an additional fuel component. The embedded paraffin provides an enhanced fuel regression rate while having no detrimental effect on the thermodynamic burn properties of the fuel grain. Multiple fuel grains with various ABS-to-Paraffin mass ratios were fabricated and burned with nitrous oxide. Analytical predictions for end-to-end motor performance and fuel regression are compared against static test results. Baseline fuel grain regression calculations use an enthalpy balance energy analysis with the material and thermodynamic properties based on the mean paraffin/ABS mass fractions within the fuel grain. In support of these analytical comparisons, a novel method for propagating the fuel port burn surface was developed. In this modeling approach the fuel cross section grid is modeled as an image with white pixels representing the fuel and black pixels representing empty or burned grid cells.

  1. Multivariate meta-analysis using individual participant data

    PubMed Central

    Riley, R. D.; Price, M. J.; Jackson, D.; Wardle, M.; Gueyffier, F.; Wang, J.; Staessen, J. A.; White, I. R.

    2016-01-01

    When combining results across related studies, a multivariate meta-analysis allows the joint synthesis of correlated effect estimates from multiple outcomes. Joint synthesis can improve efficiency over separate univariate syntheses, may reduce selective outcome reporting biases, and enables joint inferences across the outcomes. A common issue is that within-study correlations needed to fit the multivariate model are unknown from published reports. However, provision of individual participant data (IPD) allows them to be calculated directly. Here, we illustrate how to use IPD to estimate within-study correlations, using a joint linear regression for multiple continuous outcomes and bootstrapping methods for binary, survival and mixed outcomes. In a meta-analysis of 10 hypertension trials, we then show how these methods enable multivariate meta-analysis to address novel clinical questions about continuous, survival and binary outcomes; treatment–covariate interactions; adjusted risk/prognostic factor effects; longitudinal data; prognostic and multiparameter models; and multiple treatment comparisons. Both frequentist and Bayesian approaches are applied, with example software code provided to derive within-study correlations and to fit the models. PMID:26099484

  2. A biological quality index for volcanic Andisols and Aridisols (Canary Islands, Spain): variations related to the ecosystem degradation.

    PubMed

    Armas, Cecilia María; Santana, Bayanor; Mora, Juan Luis; Notario, Jesús Santiago; Arbelo, Carmen Dolores; Rodríguez-Rodríguez, Antonio

    2007-05-25

    The aim of this work is to identify indicators of biological activity in soils from the Canary Islands, by studying the variation of selected biological parameters related to the processes of deforestation and accelerated soil degradation affecting the Canarian natural ecosystems. Ten plots with different degrees of maturity/degradation have been selected in three typical habitats in the Canary Islands: laurel forest, pine forest and xerophytic scrub with Andisols and Aridisols as the most common soils. The studied characteristics in each case include total organic carbon, field soil respiration, mineralized carbon after laboratory incubation, microbial biomass carbon, hot water-extractable carbon and carboxymethylcellulase, beta-d-glucosidase and dehydrogenase activities. A Biological Quality Index (BQI) has been designed on the basis of a regression model using these variables, assuming that the total soil organic carbon content is quite stable in nearly mature ecosystems. Total carbon in mature ecosystems has been related to significant biological variables (hot water-extractable carbon, soil respiration and carboxymethylcellulase, beta-d-glucosidase and dehydrogenase activities), accounting for nearly 100% of the total variance by a multiple regression analysis. The index has been calculated as the ratio of the value calculated from the regression model and the actual measured value. The obtained results show that soils in nearly mature ecosystems have BQI values close to unit, whereas those in degraded ecosystems range between 0.24 and 0.97, depending on the degradation degree.

  3. Predicting the demand of physician workforce: an international model based on "crowd behaviors".

    PubMed

    Tsai, Tsuen-Chiuan; Eliasziw, Misha; Chen, Der-Fang

    2012-03-26

    Appropriateness of physician workforce greatly influences the quality of healthcare. When facing the crisis of physician shortages, the correction of manpower always takes an extended time period, and both the public and health personnel suffer. To calculate an appropriate number of Physician Density (PD) for a specific country, this study was designed to create a PD prediction model, based on health-related data from many countries. Twelve factors that could possibly impact physicians' demand were chosen, and data of these factors from 130 countries (by reviewing 195) were extracted. Multiple stepwise-linear regression was used to derive the PD prediction model, and a split-sample cross-validation procedure was performed to evaluate the generalizability of the results. Using data from 130 countries, with the consideration of the correlation between variables, and preventing multi-collinearity, seven out of the 12 predictor variables were selected for entry into the stepwise regression procedure. The final model was: PD = (5.014 - 0.128 × proportion under age 15 years + 0.034 × life expectancy)2, with R2 of 80.4%. Using the prediction equation, 70 countries had PDs with "negative discrepancy", while 58 had PDs with "positive discrepancy". This study provided a regression-based PD model to calculate a "norm" number of PD for a specific country. A large PD discrepancy in a country indicates the needs to examine physician's workloads and their well-being, the effectiveness/efficiency of medical care, the promotion of population health and the team resource management.

  4. An Effect Size for Regression Predictors in Meta-Analysis

    ERIC Educational Resources Information Center

    Aloe, Ariel M.; Becker, Betsy Jane

    2012-01-01

    A new effect size representing the predictive power of an independent variable from a multiple regression model is presented. The index, denoted as r[subscript sp], is the semipartial correlation of the predictor with the outcome of interest. This effect size can be computed when multiple predictor variables are included in the regression model…

  5. Regression Analysis: Legal Applications in Institutional Research

    ERIC Educational Resources Information Center

    Frizell, Julie A.; Shippen, Benjamin S., Jr.; Luna, Andrew L.

    2008-01-01

    This article reviews multiple regression analysis, describes how its results should be interpreted, and instructs institutional researchers on how to conduct such analyses using an example focused on faculty pay equity between men and women. The use of multiple regression analysis will be presented as a method with which to compare salaries of…

  6. RAWS II: A MULTIPLE REGRESSION ANALYSIS PROGRAM,

    DTIC Science & Technology

    This memorandum gives instructions for the use and operation of a revised version of RAWS, a multiple regression analysis program. The program...of preprocessed data, the directed retention of variable, listing of the matrix of the normal equations and its inverse, and the bypassing of the regression analysis to provide the input variable statistics only. (Author)

  7. Incremental Net Effects in Multiple Regression

    ERIC Educational Resources Information Center

    Lipovetsky, Stan; Conklin, Michael

    2005-01-01

    A regular problem in regression analysis is estimating the comparative importance of the predictors in the model. This work considers the 'net effects', or shares of the predictors in the coefficient of the multiple determination, which is a widely used characteristic of the quality of a regression model. Estimation of the net effects can be a…

  8. Floating Data and the Problem with Illustrating Multiple Regression.

    ERIC Educational Resources Information Center

    Sachau, Daniel A.

    2000-01-01

    Discusses how to introduce basic concepts of multiple regression by creating a large-scale, three-dimensional regression model using the classroom walls and floor. Addresses teaching points that should be covered and reveals student reaction to the model. Finds that the greatest benefit of the model is the low fear, walk-through, nonmathematical…

  9. Who Will Win?: Predicting the Presidential Election Using Linear Regression

    ERIC Educational Resources Information Center

    Lamb, John H.

    2007-01-01

    This article outlines a linear regression activity that engages learners, uses technology, and fosters cooperation. Students generated least-squares linear regression equations using TI-83 Plus[TM] graphing calculators, Microsoft[C] Excel, and paper-and-pencil calculations using derived normal equations to predict the 2004 presidential election.…

  10. Using Multiple and Logistic Regression to Estimate the Median WillCost and Probability of Cost and Schedule Overrun for Program Managers

    DTIC Science & Technology

    2017-03-23

    PUBLIC RELEASE; DISTRIBUTION UNLIMITED Using Multiple and Logistic Regression to Estimate the Median Will- Cost and Probability of Cost and... Cost and Probability of Cost and Schedule Overrun for Program Managers Ryan C. Trudelle Follow this and additional works at: https://scholar.afit.edu...afit.edu. Recommended Citation Trudelle, Ryan C., "Using Multiple and Logistic Regression to Estimate the Median Will- Cost and Probability of Cost and

  11. A prediction model for lift-fan simulator performance. M.S. Thesis - Cleveland State Univ.

    NASA Technical Reports Server (NTRS)

    Yuska, J. A.

    1972-01-01

    The performance characteristics of a model VTOL lift-fan simulator installed in a two-dimensional wing are presented. The lift-fan simulator consisted of a 15-inch diameter fan driven by a turbine contained in the fan hub. The performance of the lift-fan simulator was measured in two ways: (1) the calculated momentum thrust of the fan and turbine (total thrust loading), and (2) the axial-force measured on a load cell force balance (axial-force loading). Tests were conducted over a wide range of crossflow velocities, corrected tip speeds, and wing angle of attack. A prediction modeling technique was developed to help in analyzing the performance characteristics of lift-fan simulators. A multiple linear regression analysis technique is presented which calculates prediction model equations for the dependent variables.

  12. Mapping of the DLQI scores to EQ-5D utility values using ordinal logistic regression.

    PubMed

    Ali, Faraz Mahmood; Kay, Richard; Finlay, Andrew Y; Piguet, Vincent; Kupfer, Joerg; Dalgard, Florence; Salek, M Sam

    2017-11-01

    The Dermatology Life Quality Index (DLQI) and the European Quality of Life-5 Dimension (EQ-5D) are separate measures that may be used to gather health-related quality of life (HRQoL) information from patients. The EQ-5D is a generic measure from which health utility estimates can be derived, whereas the DLQI is a specialty-specific measure to assess HRQoL. To reduce the burden of multiple measures being administered and to enable a more disease-specific calculation of health utility estimates, we explored an established mathematical technique known as ordinal logistic regression (OLR) to develop an appropriate model to map DLQI data to EQ-5D-based health utility estimates. Retrospective data from 4010 patients were randomly divided five times into two groups for the derivation and testing of the mapping model. Split-half cross-validation was utilized resulting in a total of ten ordinal logistic regression models for each of the five EQ-5D dimensions against age, sex, and all ten items of the DLQI. Using Monte Carlo simulation, predicted health utility estimates were derived and compared against those observed. This method was repeated for both OLR and a previously tested mapping methodology based on linear regression. The model was shown to be highly predictive and its repeated fitting demonstrated a stable model using OLR as well as linear regression. The mean differences between OLR-predicted health utility estimates and observed health utility estimates ranged from 0.0024 to 0.0239 across the ten modeling exercises, with an average overall difference of 0.0120 (a 1.6% underestimate, not of clinical importance). This modeling framework developed in this study will enable researchers to calculate EQ-5D health utility estimates from a specialty-specific study population, reducing patient and economic burden.

  13. Tools to Support Interpreting Multiple Regression in the Face of Multicollinearity

    PubMed Central

    Kraha, Amanda; Turner, Heather; Nimon, Kim; Zientek, Linda Reichwein; Henson, Robin K.

    2012-01-01

    While multicollinearity may increase the difficulty of interpreting multiple regression (MR) results, it should not cause undue problems for the knowledgeable researcher. In the current paper, we argue that rather than using one technique to investigate regression results, researchers should consider multiple indices to understand the contributions that predictors make not only to a regression model, but to each other as well. Some of the techniques to interpret MR effects include, but are not limited to, correlation coefficients, beta weights, structure coefficients, all possible subsets regression, commonality coefficients, dominance weights, and relative importance weights. This article will review a set of techniques to interpret MR effects, identify the elements of the data on which the methods focus, and identify statistical software to support such analyses. PMID:22457655

  14. Tools to support interpreting multiple regression in the face of multicollinearity.

    PubMed

    Kraha, Amanda; Turner, Heather; Nimon, Kim; Zientek, Linda Reichwein; Henson, Robin K

    2012-01-01

    While multicollinearity may increase the difficulty of interpreting multiple regression (MR) results, it should not cause undue problems for the knowledgeable researcher. In the current paper, we argue that rather than using one technique to investigate regression results, researchers should consider multiple indices to understand the contributions that predictors make not only to a regression model, but to each other as well. Some of the techniques to interpret MR effects include, but are not limited to, correlation coefficients, beta weights, structure coefficients, all possible subsets regression, commonality coefficients, dominance weights, and relative importance weights. This article will review a set of techniques to interpret MR effects, identify the elements of the data on which the methods focus, and identify statistical software to support such analyses.

  15. The comparison between several robust ridge regression estimators in the presence of multicollinearity and multiple outliers

    NASA Astrophysics Data System (ADS)

    Zahari, Siti Meriam; Ramli, Norazan Mohamed; Moktar, Balkiah; Zainol, Mohammad Said

    2014-09-01

    In the presence of multicollinearity and multiple outliers, statistical inference of linear regression model using ordinary least squares (OLS) estimators would be severely affected and produces misleading results. To overcome this, many approaches have been investigated. These include robust methods which were reported to be less sensitive to the presence of outliers. In addition, ridge regression technique was employed to tackle multicollinearity problem. In order to mitigate both problems, a combination of ridge regression and robust methods was discussed in this study. The superiority of this approach was examined when simultaneous presence of multicollinearity and multiple outliers occurred in multiple linear regression. This study aimed to look at the performance of several well-known robust estimators; M, MM, RIDGE and robust ridge regression estimators, namely Weighted Ridge M-estimator (WRM), Weighted Ridge MM (WRMM), Ridge MM (RMM), in such a situation. Results of the study showed that in the presence of simultaneous multicollinearity and multiple outliers (in both x and y-direction), the RMM and RIDGE are more or less similar in terms of superiority over the other estimators, regardless of the number of observation, level of collinearity and percentage of outliers used. However, when outliers occurred in only single direction (y-direction), the WRMM estimator is the most superior among the robust ridge regression estimators, by producing the least variance. In conclusion, the robust ridge regression is the best alternative as compared to robust and conventional least squares estimators when dealing with simultaneous presence of multicollinearity and outliers.

  16. Multiple tobacco product use among adults in the United States: cigarettes, cigars, electronic cigarettes, hookah, smokeless tobacco, and snus.

    PubMed

    Lee, Youn O; Hebert, Christine J; Nonnemaker, James M; Kim, Annice E

    2014-05-01

    Noncigarette tobacco products are increasingly popular. Researchers need to understand multiple tobacco product use to assess the effects of these products on population health. We estimate national prevalence and examine risk factors for multiple product use. We calculated prevalence estimates of current use patterns involving cigarettes, cigars, electronic cigarettes, hookah, smokeless tobacco, and snus using data from the 2012 RTI National Adult Tobacco Survey (N=3627), a random-digit-dial telephone survey of adults aged 18 and over. Associations between use patterns (exclusive single product and multiple products) and demographic characteristics were examined using Pearson chi-square tests and logistic regression. 32.1% of adults currently use 1 or more tobacco products; 14.9% use cigarettes exclusively, and 6.6% use one noncigarette product exclusively, 6.9% use cigarettes with another product (dual use), 1.3% use two noncigarette products, and 2.4% use three or more products (polytobacco use). Smokers who are young adult, male, never married, reside in the West, and made prior quit attempts were at risk for multiple product use. Over 10% of U.S. adults use multiple tobacco products. A better understanding of multiple product use involving combustible products, like cigars and hookah, is needed. Multiple product use may be associated with past quit attempts. Copyright © 2014 Elsevier Inc. All rights reserved.

  17. Variables Associated with Communicative Participation in People with Multiple Sclerosis: A Regression Analysis

    ERIC Educational Resources Information Center

    Baylor, Carolyn; Yorkston, Kathryn; Bamer, Alyssa; Britton, Deanna; Amtmann, Dagmar

    2010-01-01

    Purpose: To explore variables associated with self-reported communicative participation in a sample (n = 498) of community-dwelling adults with multiple sclerosis (MS). Method: A battery of questionnaires was administered online or on paper per participant preference. Data were analyzed using multiple linear backward stepwise regression. The…

  18. Power and Sample Size Calculations for Logistic Regression Tests for Differential Item Functioning

    ERIC Educational Resources Information Center

    Li, Zhushan

    2014-01-01

    Logistic regression is a popular method for detecting uniform and nonuniform differential item functioning (DIF) effects. Theoretical formulas for the power and sample size calculations are derived for likelihood ratio tests and Wald tests based on the asymptotic distribution of the maximum likelihood estimators for the logistic regression model.…

  19. Combustion performance and scale effect from N2O/HTPB hybrid rocket motor simulations

    NASA Astrophysics Data System (ADS)

    Shan, Fanli; Hou, Lingyun; Piao, Ying

    2013-04-01

    HRM code for the simulation of N2O/HTPB hybrid rocket motor operation and scale effect analysis has been developed. This code can be used to calculate motor thrust and distributions of physical properties inside the combustion chamber and nozzle during the operational phase by solving the unsteady Navier-Stokes equations using a corrected compressible difference scheme and a two-step, five species combustion model. A dynamic fuel surface regression technique and a two-step calculation method together with the gas-solid coupling are applied in the calculation of fuel regression and the determination of combustion chamber wall profile as fuel regresses. Both the calculated motor thrust from start-up to shut-down mode and the combustion chamber wall profile after motor operation are in good agreements with experimental data. The fuel regression rate equation and the relation between fuel regression rate and axial distance have been derived. Analysis of results suggests improvements in combustion performance to the current hybrid rocket motor design and explains scale effects in the variation of fuel regression rate with combustion chamber diameter.

  20. Income, housing, and fire injuries: a census tract analysis.

    PubMed

    Shai, Donna

    2006-01-01

    This study investigates the social and demographic correlates of nonfatal structural fire injury rates for the civilian population for Philadelphia census tracts during 1993-2001. The author analyzed 1,563 fire injuries by census tract using the 1990 census (STF 3) and unpublished data from the Office of the Fire Marshal of the Philadelphia Fire Department. Injury rates were calculated per 1,000 residents of a given census tract. Multiple regression was used to determine significant variables in predicting fire injuries in a given census tract over a nine-year period and interaction effects between two of these variables-age of housing and income. Multiple regression analysis indicates that older housing (prior to 1940), low income, the prevalence of vacant houses, and the ability to speak English have significant independent effects on fire injury rates in Philadelphia. In addition, the results show a significant interaction between older housing and low income. Given the finding of very high rates of fire injuries in census tracts that are both low income and have older housing, fire prevention units can take preventative measures. Fire protection devices, especially smoke alarms, should be distributed in the neighborhoods most at risk. Multiple occupancy dwellings should have sprinkler systems and fire extinguishers. Laws concerning the maintenance of older rental housing need to be strictly enforced. Vacant houses should be effectively boarded up or renovated for residential use. Fire prevention material should be distributed in a number of languages to meet local needs.

  1. [Correlation between percentage of body fat and simple anthropometric parameters in children aged 6-9 years in Guangzhou].

    PubMed

    Yan, H C; Hao, Y T; Guo, Y F; Wei, Y H; Zhang, J H; Huang, G P; Mao, L M; Zhang, Z Q

    2017-11-10

    Objective: To evaluate the accuracy of simple anthropometric parameters in diagnosing obesity in children in Guangzhou. Methods: A cross-sectional study, including 465 children aged 6-9 years, was carried out in Guangzhou. Their body height and weight, waist circumference (WC) and hip circumference were measured according to standard procedure. Body mass index (BMI), waist to hip ratio (WHR) and waist-to-height ratio (WHtR) were calculated. Body fat percentage (BF%) was determined by dual-energy X-ray absorptiometry. Multiple regression analysis was applied to evaluate the correlations between age-adjusted physical indicators and BF%, after the adjustment for age. Obesity was defined by BF%. Receiver operating characteristic (ROC) curve analyses were performed to assess the diagnostic accuracy of the indicators for childhood obesity. Area under-ROC curves (AUCs) were calculated and the best cut-off point that maximizing 'sensitivity + specificity-1' was determined. Results: BMI showed the strongest association with BF% through multiple regression analysis. For 'per-standard deviation increase' of BMI, BF% increased by 5.3% ( t =23.1, P <0.01) in boys and 4.6% ( t =17.5, P <0.01) in girls, respectively. The ROC curve analysis indicated that BMI exhibited the largest AUC in both boys (AUC=0.908) and girls (AUC=0.895). The sensitivity was 80.8% in boys and 81.8% in girls, and the specificity was 88.2% in boys and 87.1% in girls. Both the AUCs for WHtR and WC were less than 0.8 in boys and girls. WHR had the smallest AUCs (<0.8) in both boys and girls. Conclusion: BMI appeared to be a good predicator for BF% in children aged 6-9 years in Guangzhou.

  2. Low dietary diversity and micronutrient adequacy among lactating women in a peri-urban area of Nepal.

    PubMed

    Henjum, Sigrun; Torheim, Liv Elin; Thorne-Lyman, Andrew L; Chandyo, Ram; Fawzi, Wafaie W; Shrestha, Prakash S; Strand, Tor A

    2015-12-01

    The main objectives were to assess the adequacy of the micronutrient intakes of lactating women in a peri-urban area in Nepal and to describe the relationships between micronutrient intake adequacy, dietary diversity and sociodemographic variables. A cross-sectional survey was performed during 2008-2009. We used 24 h dietary recall to assess dietary intake on three non-consecutive days and calculated the probability of adequacy (PA) of the usual intake of eleven micronutrients and the overall mean probability of adequacy (MPA). A mean dietary diversity score (MDDS) was calculated of eight food groups averaged over 3 d. Multiple linear regression was used to identify the determinants of the MPA. Bhaktapur municipality, Nepal. Lactating women (n 500), 17-44 years old, randomly selected. The mean usual energy intake was 8464 (sd 1305) kJ/d (2023 (sd 312) kcal/d), while the percentage of energy from protein, fat and carbohydrates was 11 %, 13 % and 76 %, respectively. The mean usual micronutrient intakes were below the estimated average requirements for all micronutrients, with the exception of vitamin C and Zn. The MPA across eleven micronutrients was 0·19 (sd 0·16). The diet was found to be monotonous (MDDS was 3·9 (sd 1·0)) and rice contributed to about 60 % of the energy intake. The multiple regression analyses showed that MPA was positively associated with energy intake, dietary diversity, women's educational level and socio-economic status, and was higher in the winter. The low micronutrient intakes are probably explained by low dietary diversity and a low intake of micronutrient-rich foods.

  3. Predictors of postoperative outcomes of cubital tunnel syndrome treatments using multiple logistic regression analysis.

    PubMed

    Suzuki, Taku; Iwamoto, Takuji; Shizu, Kanae; Suzuki, Katsuji; Yamada, Harumoto; Sato, Kazuki

    2017-05-01

    This retrospective study was designed to investigate prognostic factors for postoperative outcomes for cubital tunnel syndrome (CubTS) using multiple logistic regression analysis with a large number of patients. Eighty-three patients with CubTS who underwent surgeries were enrolled. The following potential prognostic factors for disease severity were selected according to previous reports: sex, age, type of surgery, disease duration, body mass index, cervical lesion, presence of diabetes mellitus, Workers' Compensation status, preoperative severity, and preoperative electrodiagnostic testing. Postoperative severity of disease was assessed 2 years after surgery by Messina's criteria which is an outcome measure specifically for CubTS. Bivariate analysis was performed to select candidate prognostic factors for multiple linear regression analyses. Multiple logistic regression analysis was conducted to identify the association between postoperative severity and selected prognostic factors. Both bivariate and multiple linear regression analysis revealed only preoperative severity as an independent risk factor for poor prognosis, while other factors did not show any significant association. Although conflicting results exist regarding prognosis of CubTS, this study supports evidence from previous studies and concludes early surgical intervention portends the most favorable prognosis. Copyright © 2017 The Japanese Orthopaedic Association. Published by Elsevier B.V. All rights reserved.

  4. Children's environmental chemical exposures in the USA, NHANES 2003-2012.

    PubMed

    Hendryx, Michael; Luo, Juhua

    2018-02-01

    Children are vulnerable to environmental chemical exposures, but little is known about the extent of multiple chemical exposures among children. We analyzed biomonitoring data from five cycles (2003-2012) of the National Health and Nutrition Examination Survey (NHANES) to describe multiple chemical exposures in US children, examine levels of chemical concentrations present over time, and examine differences in chemical exposures by selected demographic groups. We analyzed data for 36 chemical analytes across five chemical classes in a sample of 4299 children aged 6-18. Classes included metals, pesticides, phthalates, phenols, and polycyclic aromatic hydrocarbons. We calculated the number and percent of chemicals detected and tested for secular trends over time in chemical concentrations. We compared log concentrations among groups defined by age, sex, race/ethnicity, and poverty using multiple linear regression models and report adjusted geometric means. Among a smaller subgroup of 733 children with data across chemical classes, we calculated the linear correlations within and between classes and conducted a principal component analysis. The percentage of children with detectable concentrations of an individual chemical ranged from 26 to 100%; the average was 93%, and 29 of 36 were detected in more than 90% of children. Concentrations of most tested chemicals were either unchanged or declined from earlier to more recent years. Many differences in concentrations were present by age, sex, poverty, and race/ethnicity categories. Within and between class correlations were all significant and positive, and the principal component analysis suggested a one factor solution, indicating that children exposed to higher levels of one chemical were exposed to higher levels of other chemicals. In conclusion, children in the USA are exposed to multiple simultaneous chemicals at uneven risk across socioeconomic and demographic groups. Further efforts to understand the effects of multiple exposures on child health and development are warranted.

  5. Extended Hansen solubility approach: naphthalene in individual solvents.

    PubMed

    Martin, A; Wu, P L; Adjei, A; Beerbower, A; Prausnitz, J M

    1981-11-01

    A multiple regression method using Hansen partial solubility parameters, delta D, delta p, and delta H, was used to reproduce the solubilities of naphthalene in pure polar and nonpolar solvents and to predict its solubility in untested solvents. The method, called the extended Hansen approach, was compared with the extended Hildebrand solubility approach and the universal-functional-group-activity-coefficient (UNIFAC) method. The Hildebrand regular solution theory was also used to calculate naphthalene solubility. Naphthalene, an aromatic molecule having no side chains or functional groups, is "well-behaved', i.e., its solubility in active solvents known to interact with drug molecules is fairly regular. Because of its simplicity, naphthalene is a suitable solute with which to initiate the difficult study of solubility phenomena. The three methods tested (Hildebrand regular solution theory was introduced only for comparison of solubilities in regular solution) yielded similar results, reproducing naphthalene solubilities within approximately 30% of literature values. In some cases, however, the error was considerably greater. The UNIFAC calculation is superior in that it requires only the solute's heat of fusion, the melting point, and a knowledge of chemical structures of solute and solvent. The extended Hansen and extended Hildebrand methods need experimental solubility data on which to carry out regression analysis. The extended Hansen approach was the method of second choice because of its adaptability to solutes and solvents from various classes. Sample calculations are included to illustrate methods of predicting solubilities in untested solvents at various temperatures. The UNIFAC method was successful in this regard.

  6. Measurement of left ventricular mass in vivo using gated nuclear magnetic resonance imaging.

    PubMed

    Florentine, M S; Grosskreutz, C L; Chang, W; Hartnett, J A; Dunn, V D; Ehrhardt, J C; Fleagle, S R; Collins, S M; Marcus, M L; Skorton, D J

    1986-07-01

    Alterations of left ventricular mass occur in a variety of congenital and acquired heart diseases. In vivo determination of left ventricular mass, using several different techniques, has been previously reported. Problems inherent in some previous methods include the use of ionizing radiation, complicated geometric assumptions and invasive techniques. We tested the ability of gated nuclear magnetic resonance imaging to determine in vivo left ventricular mass in animals. By studying both dogs (n = 9) and cats (n = 2) of various sizes, a broad range of left ventricular mass (7 to 133 g) was examined. With a 0.5 tesla superconducting nuclear magnetic resonance imaging system the left ventricle was imaged in the transaxial plane and multiple adjacent 10 mm thick slices were obtained. Endocardial and epicardial edges were manually traced in each computer-displayed image. The wall area of each image was determined by computer and the areas were summed and multiplied by the slice thickness and the specific gravity of muscle, providing calculated left ventricular mass. Calculated left ventricular mass was compared with actual postmortem left ventricular mass using linear regression analysis. An excellent relation between calculated and actual mass was found (r = 0.95; SEE = 13.1 g; regression equation: magnetic resonance mass = 0.95 X actual mass + 14.8 g). Intraobserver and interobserver reproducibility were also excellent (r = 0.99). Thus, gated nuclear magnetic resonance imaging can accurately determine in vivo left ventricular mass in anesthetized animals.

  7. The Geometry of Enhancement in Multiple Regression

    ERIC Educational Resources Information Center

    Waller, Niels G.

    2011-01-01

    In linear multiple regression, "enhancement" is said to occur when R[superscript 2] = b[prime]r greater than r[prime]r, where b is a p x 1 vector of standardized regression coefficients and r is a p x 1 vector of correlations between a criterion y and a set of standardized regressors, x. When p = 1 then b [is congruent to] r and…

  8. A metabolomic study of low estimated GFR in non-proteinuric type 2 diabetes mellitus.

    PubMed

    Ng, D P K; Salim, A; Liu, Y; Zou, L; Xu, F G; Huang, S; Leong, H; Ong, C N

    2012-02-01

    We carried out a urinary metabolomic study to gain insight into low estimated GFR (eGFR) in patients with non-proteinuric type 2 diabetes. Patients were identified as being non-proteinuric using multiple urinalyses. Cases (n = 44) with low eGFR and controls (n = 46) had eGFR values <60 and ≥60 ml min(-1) 1.73 m(-2), respectively, as calculated using the Modification of Diet in Renal Disease formula. Urine samples were analysed by liquid chromatography/mass spectrometry (LC/MS) and GC/MS. False discovery rates were used to adjust for multiple hypotheses testing, and selection of metabolites that best predicted low eGFR status was achieved using least absolute shrinkage and selection operator logistic regression. Eleven GC/MS metabolites were strongly associated with low eGFR after correction for multiple hypotheses testing (smallest adjusted p value = 2.62 × 10(-14), largest adjusted p value = 3.84 × 10(-2)). In regression analysis, octanol, oxalic acid, phosphoric acid, benzamide, creatinine, 3,5-dimethoxymandelic amide and N-acetylglutamine were selected as the best subset for prediction and allowed excellent classification of low eGFR (AUC = 0.996). In LC/MS, 19 metabolites remained significant after multiple hypotheses testing had been taken into account (smallest adjusted p value = 2.04 × 10(-4), largest adjusted p value = 4.48 × 10(-2)), and several metabolites showed stronger evidence of association relative to the uraemic toxin, indoxyl sulphate (adjusted p value = 3.03 × 10(-2)). The potential effect of confounding on the association between metabolites was excluded. Our study has yielded substantial new insight into low eGFR and provided a collection of potential urinary biomarkers for its detection.

  9. Multiple factors explain injury risk in adolescent elite athletes: Applying a biopsychosocial perspective.

    PubMed

    von Rosen, P; Frohm, A; Kottorp, A; Fridén, C; Heijne, A

    2017-12-01

    Many risk factors for injury are presented in the literature, few of those are however consistent and the majority is associated with adult and not adolescent elite athletes. The aim was to identify risk factors for injury in adolescent elite athletes, by applying a biopsychosocial approach. A total of 496 adolescent elite athletes (age range 15-19), participating in 16 different sports, were monitored repeatedly over 52 weeks using a valid questionnaire about injuries, training exposure, sleep, stress, nutrition, and competence-based self-esteem. Univariate and multiple Cox regression analyses were used to calculate hazard ratios (HR) for risk factors for first reported injury. The main finding was that an increase in training load, training intensity, and at the same time decreasing the sleep volume resulted in a higher risk for injury compared to no change in these variables (HR 2.25, 95% CI, 1.46-3.45, P<.01), which was the strongest risk factor identified. In addition, an increase by one score of competence-based self-esteem increased the hazard for injury with 1.02 (HR 95% CI, 1.00-1.04, P=.01). Based on the multiple Cox regression analysis, an athlete having the identified risk factors (Risk Index, competence-based self-esteem), with an average competence-based self-esteem score, had more than a threefold increased risk for injury (HR 3.35), compared to an athlete with a low competence-based self-esteem and no change in sleep or training volume. Our findings confirm injury occurrence as a result of multiple risk factors interacting in complex ways. © 2017 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  10. Prediction system of hydroponic plant growth and development using algorithm Fuzzy Mamdani method

    NASA Astrophysics Data System (ADS)

    Sudana, I. Made; Purnawirawan, Okta; Arief, Ulfa Mediaty

    2017-03-01

    Hydroponics is a method of farming without soil. One of the Hydroponic plants is Watercress (Nasturtium Officinale). The development and growth process of hydroponic Watercress was influenced by levels of nutrients, acidity and temperature. The independent variables can be used as input variable system to predict the value level of plants growth and development. The prediction system is using Fuzzy Algorithm Mamdani method. This system was built to implement the function of Fuzzy Inference System (Fuzzy Inference System/FIS) as a part of the Fuzzy Logic Toolbox (FLT) by using MATLAB R2007b. FIS is a computing system that works on the principle of fuzzy reasoning which is similar to humans' reasoning. Basically FIS consists of four units which are fuzzification unit, fuzzy logic reasoning unit, base knowledge unit and defuzzification unit. In addition to know the effect of independent variables on the plants growth and development that can be visualized with the function diagram of FIS output surface that is shaped three-dimensional, and statistical tests based on the data from the prediction system using multiple linear regression method, which includes multiple linear regression analysis, T test, F test, the coefficient of determination and donations predictor that are calculated using SPSS (Statistical Product and Service Solutions) software applications.

  11. [Burnout syndrome and suicide risk among primary care nurses].

    PubMed

    Tomás-Sábado, Joaquín; Maynegre-Santaulària, Montserrat; Pérez-Bartolomé, Meritxell; Alsina-Rodríguez, Marta; Quinta-Barbero, Roser; Granell-Navas, Sergi

    2010-01-01

    To observe the prevalence of the burnout syndrome and the relationship with suicide risk, self-esteem, anxiety and depression, in a sample of primary care nurses. Observational, cross-sectional and correlational study. The sample consisted of 146 nursing professionals, 131 women and 15 men, with an average age of 44.02 years (SD=10.89). Participants responded to a questionnaire which included the Spanish forms of the Maslach burnout inventory (MBI), the Plutchik Suicide Risk Scale (SR), the Kuwait University Anxiety Scale (KUAS), the Self-Rating Depression Scale (SDS) and the Rosenberg Self-esteem Scale (RSES). In the inferential statistical analysis, Pearson's r coefficients and multiple linear regression were calculated. Significant correlations between suicidal risk and anxiety, depression, self-esteem, emotional exhaustion and personal performance, were obtained. In the multiple regression analysis, depression was the main predictor of suicidal risk, followed by anxiety and emotional exhaustion. The scores obtained in burnout and suicidal risk were, in general, lower than those observed in other studies, emphasising the high level observed in personal performance, which reflects reasonable professional satisfaction. The results show the important role of working atmosphere and early recognition of mental disorders in burnout and suicidal risk prevention. Copyright (c) 2009 Elsevier España, S.L. All rights reserved.

  12. Association between the Type of Workplace and Lung Function in Copper Miners

    PubMed Central

    Gruszczyński, Leszek; Wojakowska, Anna; Ścieszka, Marek; Turczyn, Barbara; Schmidt, Edward

    2016-01-01

    The aim of the analysis was to retrospectively assess changes in lung function in copper miners depending on the type of workplace. In the groups of 225 operators, 188 welders, and 475 representatives of other jobs, spirometry was performed at the start of employment and subsequently after 10, 20, and 25 years of work. Spirometry Longitudinal Data Analysis software was used to estimate changes in group means for FEV1 and FVC. Multiple linear regression analysis was used to assess an association between workplace and lung function. Lung function assessed on the basis of calculation of longitudinal FEV1 (FVC) decline was similar in all studied groups. However, multiple linear regression model used in cross-sectional analysis revealed an association between workplace and lung function. In the group of welders, FEF75 was lower in comparison to operators and other miners as early as after 10 years of work. Simultaneously, in smoking welders, the FEV1/FVC ratio was lower than in nonsmokers (p < 0,05). The interactions between type of workplace and smoking (p < 0,05) in their effect on FVC, FEV1, PEF, and FEF50 were shown. Among underground working copper miners, the group of smoking welders is especially threatened by impairment of lung ventilatory function. PMID:27274987

  13. Autonomous motivation and quality of life as predictors of physical activity in patients with schizophrenia.

    PubMed

    Costa, Raquel; Bastos, Tânia; Probst, Michel; Seabra, André; Vilhena, Estela; Corredeira, Rui

    2018-02-08

    Being physically active is a complex behaviour in patients with schizophrenia. Several factors were identified as barriers to achieving active behaviours in this population. Therefore, the purpose of this study was to investigate among a number of barriers what predicts the most on physical activity (PA) in patients with schizophrenia. A total of 114 patients (28♀) with schizophrenia were included. Body mass index (BMI) was calculated. Autonomous and controlled motivation (Behavioural Regulation in Exercise Questionnaire - 3), self-esteem (Rosenberg Self-esteem scale), quality of life (World Health Organization Quality of Life Scale - Brief version) and functional exercise capacity (6-minute walk test - 6MWT) were evaluated. Multiple Regression Analysis was applied to assess the effect of these variables on Total PA per week (International Physical Activity Questionnaire - short version). Autonomous motivation and domains of quality of life were positively correlated with Total PA per week. Stepwise multiple regression analyses showed that of all the candidate factors to predict PA, autonomous motivation and global domain of quality of life were found as significant predictors. Our findings help to understand the importance of autonomous motivation and quality of life for PA in patients with schizophrenia. Knowledge about these predictors may provide guidance to improve PA behaviour in this population.

  14. A rotor optimization using regression analysis

    NASA Technical Reports Server (NTRS)

    Giansante, N.

    1984-01-01

    The design and development of helicopter rotors is subject to the many design variables and their interactions that effect rotor operation. Until recently, selection of rotor design variables to achieve specified rotor operational qualities has been a costly, time consuming, repetitive task. For the past several years, Kaman Aerospace Corporation has successfully applied multiple linear regression analysis, coupled with optimization and sensitivity procedures, in the analytical design of rotor systems. It is concluded that approximating equations can be developed rapidly for a multiplicity of objective and constraint functions and optimizations can be performed in a rapid and cost effective manner; the number and/or range of design variables can be increased by expanding the data base and developing approximating functions to reflect the expanded design space; the order of the approximating equations can be expanded easily to improve correlation between analyzer results and the approximating equations; gradients of the approximating equations can be calculated easily and these gradients are smooth functions reducing the risk of numerical problems in the optimization; the use of approximating functions allows the problem to be started easily and rapidly from various initial designs to enhance the probability of finding a global optimum; and the approximating equations are independent of the analysis or optimization codes used.

  15. Predicting Retention Times of Naturally Occurring Phenolic Compounds in Reversed-Phase Liquid Chromatography: A Quantitative Structure-Retention Relationship (QSRR) Approach

    PubMed Central

    Akbar, Jamshed; Iqbal, Shahid; Batool, Fozia; Karim, Abdul; Chan, Kim Wei

    2012-01-01

    Quantitative structure-retention relationships (QSRRs) have successfully been developed for naturally occurring phenolic compounds in a reversed-phase liquid chromatographic (RPLC) system. A total of 1519 descriptors were calculated from the optimized structures of the molecules using MOPAC2009 and DRAGON softwares. The data set of 39 molecules was divided into training and external validation sets. For feature selection and mapping we used step-wise multiple linear regression (SMLR), unsupervised forward selection followed by step-wise multiple linear regression (UFS-SMLR) and artificial neural networks (ANN). Stable and robust models with significant predictive abilities in terms of validation statistics were obtained with negation of any chance correlation. ANN models were found better than remaining two approaches. HNar, IDM, Mp, GATS2v, DISP and 3D-MoRSE (signals 22, 28 and 32) descriptors based on van der Waals volume, electronegativity, mass and polarizability, at atomic level, were found to have significant effects on the retention times. The possible implications of these descriptors in RPLC have been discussed. All the models are proven to be quite able to predict the retention times of phenolic compounds and have shown remarkable validation, robustness, stability and predictive performance. PMID:23203132

  16. Advanced statistics: linear regression, part I: simple linear regression.

    PubMed

    Marill, Keith A

    2004-01-01

    Simple linear regression is a mathematical technique used to model the relationship between a single independent predictor variable and a single dependent outcome variable. In this, the first of a two-part series exploring concepts in linear regression analysis, the four fundamental assumptions and the mechanics of simple linear regression are reviewed. The most common technique used to derive the regression line, the method of least squares, is described. The reader will be acquainted with other important concepts in simple linear regression, including: variable transformations, dummy variables, relationship to inference testing, and leverage. Simplified clinical examples with small datasets and graphic models are used to illustrate the points. This will provide a foundation for the second article in this series: a discussion of multiple linear regression, in which there are multiple predictor variables.

  17. Assessing the Transferability of Statistical Predictive Models for Leaf Area Index Between Two Airborne Discrete Return LiDAR Sensor Designs Within Multiple Intensely Managed Loblolly Pine Forest Locations in the South-Eastern USA

    NASA Technical Reports Server (NTRS)

    Sumnall, Matthew; Peduzzi, Alicia; Fox, Thomas R.; Wynne, Randolph H.; Thomas, Valerie A.; Cook, Bruce

    2016-01-01

    Leaf area is an important forest structural variable which serves as the primary means of mass and energy exchange within vegetated ecosystems. The objective of the current study was to determine if leaf area index (LAI) could be estimated accurately and consistently in five intensively managed pine plantation forests using two multiple-return airborne LiDAR datasets. Field measurements of LAI were made using the LiCOR LAI2000 and LAI2200 instruments within 116 plots were established of varying size and within a variety of stand conditions (i.e. stand age, nutrient regime and stem density) in North Carolina and Virginia in 2008 and 2013. A number of common LiDAR return height and intensity distribution metrics were calculated (e.g. average return height), in addition to ten indices, with two additional variants, utilized in the surrounding literature which have been used to estimate LAI and fractional cover, were calculated from return heights and intensity, for each plot extent. Each of the indices was assessed for correlation with each other, and was used as independent variables in linear regression analysis with field LAI as the dependent variable. All LiDAR derived metrics were also entered into a forward stepwise linear regression. The results from each of the indices varied from an R2 of 0.33 (S.E. 0.87) to 0.89 (S.E. 0.36). Those indices calculated using ratios of all returns produced the strongest correlations, such as the Above and Below Ratio Index (ABRI) and Laser Penetration Index 1 (LPI1). The regression model produced from a combination of three metrics did not improve correlations greatly (R2 0.90; S.E. 0.35). The results indicate that LAI can be predicted over a range of intensively managed pine plantation forest environments accurately when using different LiDAR sensor designs. Those indices which incorporated counts of specific return numbers (e.g. first returns) or return intensity correlated poorly with field measurements. There were disparities between the number of different types of returns and intensity values when comparing the results from two LiDAR sensors, indicating that predictive models developed using such metrics are not transferable between datasets with different acquisition parameters. Each of the indices were significantly correlated with one another, with one exception (LAI proxy), in particular those indices calculated from all returns, which indicates similarities in information content for those indices. It can then be argued that LiDAR indices have reached a similar stage in development to those calculated from optical-spectral sensors, but which offer a number of advantages, such as the reduction or removal of saturation issues in areas of high biomass.

  18. An appraisal of statistical procedures used in derivation of reference intervals.

    PubMed

    Ichihara, Kiyoshi; Boyd, James C

    2010-11-01

    When conducting studies to derive reference intervals (RIs), various statistical procedures are commonly applied at each step, from the planning stages to final computation of RIs. Determination of the necessary sample size is an important consideration, and evaluation of at least 400 individuals in each subgroup has been recommended to establish reliable common RIs in multicenter studies. Multiple regression analysis allows identification of the most important factors contributing to variation in test results, while accounting for possible confounding relationships among these factors. Of the various approaches proposed for judging the necessity of partitioning reference values, nested analysis of variance (ANOVA) is the likely method of choice owing to its ability to handle multiple groups and being able to adjust for multiple factors. Box-Cox power transformation often has been used to transform data to a Gaussian distribution for parametric computation of RIs. However, this transformation occasionally fails. Therefore, the non-parametric method based on determination of the 2.5 and 97.5 percentiles following sorting of the data, has been recommended for general use. The performance of the Box-Cox transformation can be improved by introducing an additional parameter representing the origin of transformation. In simulations, the confidence intervals (CIs) of reference limits (RLs) calculated by the parametric method were narrower than those calculated by the non-parametric approach. However, the margin of difference was rather small owing to additional variability in parametrically-determined RLs introduced by estimation of parameters for the Box-Cox transformation. The parametric calculation method may have an advantage over the non-parametric method in allowing identification and exclusion of extreme values during RI computation.

  19. Obesity Increases Prevalence of Colonic Adenomas at Screening Colonoscopy: A Canadian Community-Based Study

    PubMed Central

    Chen, Grant I.; Devlin, Tim; Gibbs, Alison; Murray, Iain C.; Tran, Stanley; Weigensberg, Corey

    2017-01-01

    Background and Aims Obesity is a risk factor for colorectal neoplasia. We examined the influence of obesity and metabolic syndrome (MetS) on prevalence of neoplasia at screening colonoscopy. Methods We evaluated 2020 subjects undergoing first screening colonoscopy. Body mass index (BMI) was calculated at enrolment. Hyperlipidemia (HL), hypertension (HT), and diabetes mellitus (DM) were identified. Details of colonoscopy, polypectomy, and histology were recorded. Odds for adenomas (A) and advanced adenomas (ADV) in overweight (BMI 25.1–30) and obese (BMI > 30) subjects were assessed by multinomial regression, adjusted for covariates. Analyses included relationships between HL, HT, DM, age, tobacco usage, and neoplasia. Discriminatory power of HT, HL, DM, and BMI for neoplasia was assessed by binary logistic regression. Odds were calculated for neoplasia in each colonic segment related to BMI. Results A and ADV were commoner in overweight and obese males, obese females, older subjects, and smokers. HL, HT, and DM were associated with increased odds for neoplasia, significantly for A with hypertension. BMI alone predicted neoplasia as well as HT, HL, DM, or combinations thereof. All segments of the colon were affected. Multiple polyps were particularly prevalent in the obese. Conclusions Obesity and MetS are risk factors for colonic neoplasia in a Canadian population. PMID:28781966

  20. Computer modeling of multiple-channel input signals and intermodulation losses caused by nonlinear traveling wave tube amplifiers

    NASA Technical Reports Server (NTRS)

    Stankiewicz, N.

    1982-01-01

    The multiple channel input signal to a soft limiter amplifier as a traveling wave tube is represented as a finite, linear sum of Gaussian functions in the frequency domain. Linear regression is used to fit the channel shapes to a least squares residual error. Distortions in output signal, namely intermodulation products, are produced by the nonlinear gain characteristic of the amplifier and constitute the principal noise analyzed in this study. The signal to noise ratios are calculated for various input powers from saturation to 10 dB below saturation for two specific distributions of channels. A criterion for the truncation of the series expansion of the nonlinear transfer characteristic is given. It is found that he signal to noise ratios are very sensitive to the coefficients used in this expansion. Improper or incorrect truncation of the series leads to ambiguous results in the signal to noise ratios.

  1. Noninvasive spectral imaging of skin chromophores based on multiple regression analysis aided by Monte Carlo simulation

    NASA Astrophysics Data System (ADS)

    Nishidate, Izumi; Wiswadarma, Aditya; Hase, Yota; Tanaka, Noriyuki; Maeda, Takaaki; Niizeki, Kyuichi; Aizu, Yoshihisa

    2011-08-01

    In order to visualize melanin and blood concentrations and oxygen saturation in human skin tissue, a simple imaging technique based on multispectral diffuse reflectance images acquired at six wavelengths (500, 520, 540, 560, 580 and 600nm) was developed. The technique utilizes multiple regression analysis aided by Monte Carlo simulation for diffuse reflectance spectra. Using the absorbance spectrum as a response variable and the extinction coefficients of melanin, oxygenated hemoglobin, and deoxygenated hemoglobin as predictor variables, multiple regression analysis provides regression coefficients. Concentrations of melanin and total blood are then determined from the regression coefficients using conversion vectors that are deduced numerically in advance, while oxygen saturation is obtained directly from the regression coefficients. Experiments with a tissue-like agar gel phantom validated the method. In vivo experiments with human skin of the human hand during upper limb occlusion and of the inner forearm exposed to UV irradiation demonstrated the ability of the method to evaluate physiological reactions of human skin tissue.

  2. Using the Coefficient of Determination "R"[superscript 2] to Test the Significance of Multiple Linear Regression

    ERIC Educational Resources Information Center

    Quinino, Roberto C.; Reis, Edna A.; Bessegato, Lupercio F.

    2013-01-01

    This article proposes the use of the coefficient of determination as a statistic for hypothesis testing in multiple linear regression based on distributions acquired by beta sampling. (Contains 3 figures.)

  3. Anger and depression: evidence of a possible mediating role for rumination.

    PubMed

    Balsamo, Michela

    2010-02-01

    Tendency to ruminate may mediate the relationship between anger and depression. In this preliminary study, 353 Italian community participants completed the State-Trait Anger Expression Inventory-2, the Padua Inventory's Tendency to Doubt and to Ruminate subscale, and the Beck Depression Inventory-II. Trait anger and depression were expected to have a positive relationship, and separate relationships with the tendency to ruminate. Theoretically, a new hypothesis was that the tendency to ruminate would mediate the relationship between depression and anger. Zero-order and partial correlations and a path analysis based on Baron and Kenny's method for calculating multiple regression analyses were calculated. Consistent with the hypotheses, anger and depression were strongly associated; the tendency to ruminate was significantly associated with both anger and depression; and the mediation model fit the data. Behaviors related to the tendency to ruminate could help to explain how depression is related to anger.

  4. Effect of substituents on prediction of TLC retention of tetra-dentate Schiff bases and their Copper(II) and Nickel(II) complexes.

    PubMed

    Stevanović, Nikola R; Perušković, Danica S; Gašić, Uroš M; Antunović, Vesna R; Lolić, Aleksandar Đ; Baošić, Rada M

    2017-03-01

    The objectives of this study were to gain insights into structure-retention relationships and to propose the model to estimating their retention. Chromatographic investigation of series of 36 Schiff bases and their copper(II) and nickel(II) complexes was performed under both normal- and reverse-phase conditions. Chemical structures of the compounds were characterized by molecular descriptors which are calculated from the structure and related to the chromatographic retention parameters by multiple linear regression analysis. Effects of chelation on retention parameters of investigated compounds, under normal- and reverse-phase chromatographic conditions, were analyzed by principal component analysis, quantitative structure-retention relationship and quantitative structure-activity relationship models were developed on the basis of theoretical molecular descriptors, calculated exclusively from molecular structure, and parameters of retention and lipophilicity. Copyright © 2016 John Wiley & Sons, Ltd.

  5. Air pollution forecasting in Ankara, Turkey using air pollution index and its relation to assimilative capacity of the atmosphere.

    PubMed

    Genc, D Deniz; Yesilyurt, Canan; Tuncel, Gurdal

    2010-07-01

    Spatial and temporal variations in concentrations of CO, NO, NO(2), SO(2), and PM(10), measured between 1999 and 2000, at traffic-impacted and residential stations in Ankara were investigated. Air quality in residential areas was found to be influenced by traffic activities in the city. Pollutant ratios were proven to be reliable tracers to differentiate between different sources. Air pollution index (API) of the whole city was calculated to evaluate the level of air quality in Ankara. Multiple linear regression model was developed for forecasting API in Ankara. The correlation coefficients were found to be 0.79 and 0.63 for different time periods. The assimilative capacity of Ankara atmosphere was calculated in terms of ventilation coefficient (VC). The relation between API and VC was investigated and found that the air quality in Ankara was determined by meteorology rather than emissions.

  6. Validity of Treadmill-Derived Critical Speed on Predicting 5000-Meter Track-Running Performance.

    PubMed

    Nimmerichter, Alfred; Novak, Nina; Triska, Christoph; Prinz, Bernhard; Breese, Brynmor C

    2017-03-01

    Nimmerichter, A, Novak, N, Triska, C, Prinz, B, and Breese, BC. Validity of treadmill-derived critical speed on predicting 5,000-meter track-running performance. J Strength Cond Res 31(3): 706-714, 2017-To evaluate 3 models of critical speed (CS) for the prediction of 5,000-m running performance, 16 trained athletes completed an incremental test on a treadmill to determine maximal aerobic speed (MAS) and 3 randomly ordered runs to exhaustion at the [INCREMENT]70% intensity, at 110% and 98% of MAS. Critical speed and the distance covered above CS (D') were calculated using the hyperbolic speed-time (HYP), the linear distance-time (LIN), and the linear speed inverse-time model (INV). Five thousand meter performance was determined on a 400-m running track. Individual predictions of 5,000-m running time (t = [5,000-D']/CS) and speed (s = D'/t + CS) were calculated across the 3 models in addition to multiple regression analyses. Prediction accuracy was assessed with the standard error of estimate (SEE) from linear regression analysis and the mean difference expressed in units of measurement and coefficient of variation (%). Five thousand meter running performance (speed: 4.29 ± 0.39 m·s; time: 1,176 ± 117 seconds) was significantly better than the predictions from all 3 models (p < 0.0001). The mean difference was 65-105 seconds (5.7-9.4%) for time and -0.22 to -0.34 m·s (-5.0 to -7.5%) for speed. Predictions from multiple regression analyses with CS and D' as predictor variables were not significantly different from actual running performance (-1.0 to 1.1%). The SEE across all models and predictions was approximately 65 seconds or 0.20 m·s and is therefore considered as moderate. The results of this study have shown the importance of aerobic and anaerobic energy system contribution to predict 5,000-m running performance. Using estimates of CS and D' is valuable for predicting performance over race distances of 5,000 m.

  7. 4D-LQTA-QSAR and docking study on potent Gram-negative specific LpxC inhibitors: a comparison to CoMFA modeling.

    PubMed

    Ghasemi, Jahan B; Safavi-Sohi, Reihaneh; Barbosa, Euzébio G

    2012-02-01

    A quasi 4D-QSAR has been carried out on a series of potent Gram-negative LpxC inhibitors. This approach makes use of the molecular dynamics (MD) trajectories and topology information retrieved from the GROMACS package. This new methodology is based on the generation of a conformational ensemble profile, CEP, for each compound instead of only one conformation, followed by the calculation intermolecular interaction energies at each grid point considering probes and all aligned conformations resulting from MD simulations. These interaction energies are independent variables employed in a QSAR analysis. The comparison of the proposed methodology to comparative molecular field analysis (CoMFA) formalism was performed. This methodology explores jointly the main features of CoMFA and 4D-QSAR models. Step-wise multiple linear regression was used for the selection of the most informative variables. After variable selection, multiple linear regression (MLR) and partial least squares (PLS) methods used for building the regression models. Leave-N-out cross-validation (LNO), and Y-randomization were performed in order to confirm the robustness of the model in addition to analysis of the independent test set. Best models provided the following statistics: [Formula in text] (PLS) and [Formula in text] (MLR). Docking study was applied to investigate the major interactions in protein-ligand complex with CDOCKER algorithm. Visualization of the descriptors of the best model helps us to interpret the model from the chemical point of view, supporting the applicability of this new approach in rational drug design.

  8. Age estimation standards for a Western Australian population using the coronal pulp cavity index.

    PubMed

    Karkhanis, Shalmira; Mack, Peter; Franklin, Daniel

    2013-09-10

    Age estimation is a vital aspect in creating a biological profile and aids investigators by narrowing down potentially matching identities from the available pool. In addition to routine casework, in the present global political scenario, age estimation in living individuals is required in cases of refugees, asylum seekers, human trafficking and to ascertain age of criminal responsibility. Thus robust methods that are simple, non-invasive and ethically viable are required. The aim of the present study is, therefore, to test the reliability and applicability of the coronal pulp cavity index method, for the purpose of developing age estimation standards for an adult Western Australian population. A total of 450 orthopantomograms (220 females and 230 males) of Australian individuals were analyzed. Crown and coronal pulp chamber heights were measured in the mandibular left and right premolars, and the first and second molars. These measurements were then used to calculate the tooth coronal index. Data was analyzed using paired sample t-tests to assess bilateral asymmetry followed by simple linear and multiple regressions to develop age estimation models. The most accurate age estimation based on simple linear regression model was with mandibular right first molar (SEE ±8.271 years). Multiple regression models improved age prediction accuracy considerably and the most accurate model was with bilateral first and second molars (SEE ±6.692 years). This study represents the first investigation of this method in a Western Australian population and our results indicate that the method is suitable for forensic application. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.

  9. C U L8ter: YouTube distracted driving PSAs use of behavior change theory.

    PubMed

    Steadman, Mindy; Chao, Melanie S; Strong, Jessica T; Maxwell, Martha; West, Joshua H

    2014-01-01

    To examine the inclusion of health behavior theory in distracted driving PSAs on YouTube.com. Two-hundred fifty PSAs were assessed using constructs from 4 prominent health behavior theories. A total theory score was calculated for each video. Multiple regression analysis was used to identify factors associated with higher theory scores. PSAs were generally lacking in theoretical content. Video length, use of rates/statistics, driving scenario depiction, and presence of a celebrity were positively associated with theory inclusion. Collaboration between health experts and PSA creators could be fostered to produce more theory-based distracted driving videos on YouTube.com.

  10. Determination of organic compounds in water using ultraviolet LED

    NASA Astrophysics Data System (ADS)

    Kim, Chihoon; Ji, Taeksoo; Eom, Joo Beom

    2018-04-01

    This paper describes a method of detecting organic compounds in water using an ultraviolet LED (280 nm) spectroscopy system and a photodetector. The LED spectroscopy system showed a high correlation between the concentration of the prepared potassium hydrogen phthalate and that calculated by multiple linear regression, indicating an adjusted coefficient of determination ranging from 0.953-0.993. In addition, a comparison between the performance of the spectroscopy system and the total organic carbon analyzer indicated that the difference in concentration was small. Based on the close correlation between the spectroscopy and photodetector absorbance values, organic measurement with a photodetector could be configured for monitoring.

  11. Identifying maternal and infant factors associated with newborn size in rural Bangladesh by partial least squares (PLS) regression analysis

    PubMed Central

    Rahman, Md. Jahanur; Shamim, Abu Ahmed; Klemm, Rolf D. W.; Labrique, Alain B.; Rashid, Mahbubur; Christian, Parul; West, Keith P.

    2017-01-01

    Birth weight, length and circumferences of the head, chest and arm are key measures of newborn size and health in developing countries. We assessed maternal socio-demographic factors associated with multiple measures of newborn size in a large rural population in Bangladesh using partial least squares (PLS) regression method. PLS regression, combining features from principal component analysis and multiple linear regression, is a multivariate technique with an ability to handle multicollinearity while simultaneously handling multiple dependent variables. We analyzed maternal and infant data from singletons (n = 14,506) born during a double-masked, cluster-randomized, placebo-controlled maternal vitamin A or β-carotene supplementation trial in rural northwest Bangladesh. PLS regression results identified numerous maternal factors (parity, age, early pregnancy MUAC, living standard index, years of education, number of antenatal care visits, preterm delivery and infant sex) significantly (p<0.001) associated with newborn size. Among them, preterm delivery had the largest negative influence on newborn size (Standardized β = -0.29 − -0.19; p<0.001). Scatter plots of the scores of first two PLS components also revealed an interaction between newborn sex and preterm delivery on birth size. PLS regression was found to be more parsimonious than both ordinary least squares regression and principal component regression. It also provided more stable estimates than the ordinary least squares regression and provided the effect measure of the covariates with greater accuracy as it accounts for the correlation among the covariates and outcomes. Therefore, PLS regression is recommended when either there are multiple outcome measurements in the same study, or the covariates are correlated, or both situations exist in a dataset. PMID:29261760

  12. Identifying maternal and infant factors associated with newborn size in rural Bangladesh by partial least squares (PLS) regression analysis.

    PubMed

    Kabir, Alamgir; Rahman, Md Jahanur; Shamim, Abu Ahmed; Klemm, Rolf D W; Labrique, Alain B; Rashid, Mahbubur; Christian, Parul; West, Keith P

    2017-01-01

    Birth weight, length and circumferences of the head, chest and arm are key measures of newborn size and health in developing countries. We assessed maternal socio-demographic factors associated with multiple measures of newborn size in a large rural population in Bangladesh using partial least squares (PLS) regression method. PLS regression, combining features from principal component analysis and multiple linear regression, is a multivariate technique with an ability to handle multicollinearity while simultaneously handling multiple dependent variables. We analyzed maternal and infant data from singletons (n = 14,506) born during a double-masked, cluster-randomized, placebo-controlled maternal vitamin A or β-carotene supplementation trial in rural northwest Bangladesh. PLS regression results identified numerous maternal factors (parity, age, early pregnancy MUAC, living standard index, years of education, number of antenatal care visits, preterm delivery and infant sex) significantly (p<0.001) associated with newborn size. Among them, preterm delivery had the largest negative influence on newborn size (Standardized β = -0.29 - -0.19; p<0.001). Scatter plots of the scores of first two PLS components also revealed an interaction between newborn sex and preterm delivery on birth size. PLS regression was found to be more parsimonious than both ordinary least squares regression and principal component regression. It also provided more stable estimates than the ordinary least squares regression and provided the effect measure of the covariates with greater accuracy as it accounts for the correlation among the covariates and outcomes. Therefore, PLS regression is recommended when either there are multiple outcome measurements in the same study, or the covariates are correlated, or both situations exist in a dataset.

  13. Space, race, and poverty: Spatial inequalities in walkable neighborhood amenities?

    PubMed Central

    Aldstadt, Jared; Whalen, John; White, Kellee; Castro, Marcia C.; Williams, David R.

    2017-01-01

    BACKGROUND Multiple and varied benefits have been suggested for increased neighborhood walkability. However, spatial inequalities in neighborhood walkability likely exist and may be attributable, in part, to residential segregation. OBJECTIVE Utilizing a spatial demographic perspective, we evaluated potential spatial inequalities in walkable neighborhood amenities across census tracts in Boston, MA (US). METHODS The independent variables included minority racial/ethnic population percentages and percent of families in poverty. Walkable neighborhood amenities were assessed with a composite measure. Spatial autocorrelation in key study variables were first calculated with the Global Moran’s I statistic. Then, Spearman correlations between neighborhood socio-demographic characteristics and walkable neighborhood amenities were calculated as well as Spearman correlations accounting for spatial autocorrelation. We fit ordinary least squares (OLS) regression and spatial autoregressive models, when appropriate, as a final step. RESULTS Significant positive spatial autocorrelation was found in neighborhood socio-demographic characteristics (e.g. census tract percent Black), but not walkable neighborhood amenities or in the OLS regression residuals. Spearman correlations between neighborhood socio-demographic characteristics and walkable neighborhood amenities were not statistically significant, nor were neighborhood socio-demographic characteristics significantly associated with walkable neighborhood amenities in OLS regression models. CONCLUSIONS Our results suggest that there is residential segregation in Boston and that spatial inequalities do not necessarily show up using a composite measure. COMMENTS Future research in other geographic areas (including international contexts) and using different definitions of neighborhoods (including small-area definitions) should evaluate if spatial inequalities are found using composite measures but also should use measures of specific neighborhood amenities. PMID:29046612

  14. The M Word: Multicollinearity in Multiple Regression.

    ERIC Educational Resources Information Center

    Morrow-Howell, Nancy

    1994-01-01

    Notes that existence of substantial correlation between two or more independent variables creates problems of multicollinearity in multiple regression. Discusses multicollinearity problem in social work research in which independent variables are usually intercorrelated. Clarifies problems created by multicollinearity, explains detection of…

  15. Psychological risk and protective factors for disability in chronic low back pain - a longitudinal analysis in primary care.

    PubMed

    Jegan, Nikita Roman A; Brugger, Markus; Viniol, Annika; Strauch, Konstantin; Barth, Jürgen; Baum, Erika; Leonhardt, Corinna; Becker, Annette

    2017-03-20

    Utilizing psychological resources when dealing with chronic low back pain might aid the prevention of disability. The observational study at hand examined the longitudinal impact of resilience and coping resources on disability in addition to established risk factors. Four hundred eighty four patients with chronic low back pain (>3 months) were recruited in primary care practices and followed up for one year. Resilience, coping, depression, somatization, pain and demographic variables were measured at baseline. At follow-up (participation rate 89%), data on disability was collected. We first calculated bivariate correlations of all the predictors with each other and with follow-up disability. We then used a multiple regression to evaluate the impact of all the predictors on disability together. More than half of the followed up sample showed a high degree of disability at baseline (53.7%) and had suffered for more than 10 years from pain (50.4%). Besides gender all of the predictors were bivariately associated with follow-up disability. However in the main analysis (multiple regression), disability at follow up was only predicted by baseline disability, age and somatization. There was no relationship between resilience and disability, nor between coping resources and disability. Although it is known that there are cross-sectional relationships between resilience/coping resources and disability we were not able to replicate it in the multiple regression. This can have several reasons: a) the majority of patients in our sample were much more disabled and suffered for a longer time than in other studies. Therefore our results might be limited to this specific population and resilience and coping resources might still have a protective influence in acute or subacute populations. b) We used a rather broad operationalization of resilience. There is emerging evidence that focusing on more concrete sub facets like (pain) self-efficacy and acceptance might be more beneficial. German Clinical Trial Register, DRKS00003123 (June 28th 2011).

  16. [Prediction model of health workforce and beds in county hospitals of Hunan by multiple linear regression].

    PubMed

    Ling, Ru; Liu, Jiawang

    2011-12-01

    To construct prediction model for health workforce and hospital beds in county hospitals of Hunan by multiple linear regression. We surveyed 16 counties in Hunan with stratified random sampling according to uniform questionnaires,and multiple linear regression analysis with 20 quotas selected by literature view was done. Independent variables in the multiple linear regression model on medical personnels in county hospitals included the counties' urban residents' income, crude death rate, medical beds, business occupancy, professional equipment value, the number of devices valued above 10 000 yuan, fixed assets, long-term debt, medical income, medical expenses, outpatient and emergency visits, hospital visits, actual available bed days, and utilization rate of hospital beds. Independent variables in the multiple linear regression model on county hospital beds included the the population of aged 65 and above in the counties, disposable income of urban residents, medical personnel of medical institutions in county area, business occupancy, the total value of professional equipment, fixed assets, long-term debt, medical income, medical expenses, outpatient and emergency visits, hospital visits, actual available bed days, utilization rate of hospital beds, and length of hospitalization. The prediction model shows good explanatory and fitting, and may be used for short- and mid-term forecasting.

  17. [Psychoactive substances use and health-related quality of life among school age adolescents].

    PubMed

    Vilugrón Aravena, Fabiola; Hidalgo-Rasmussen, Carlos Alejandro; Molina G, Temístocles; Gras Pérez, María Eugenia; Font-Mayolas, Silvia

    2017-12-01

    Background The use of psychoactive substances among adolescents is a major social and public health concern. Aim To analyze association of substance abuse and multiple drug use with health-related quality of life (HRQOL) in adolescents attending a high school in Valparaiso, Chile. Material and Methods Analytical cross-sectional study conducted in a sample of adolescents attending high school. HRQOL was assessed using KIDSCREEN-52 questionnaire and substance use was measured using the Global school-based student health survey. Participants had to complete online, self-administered, anonymous questionnaires. Multiple logistic regression analyses were conducted to calculate Odd ratios. Results A total of 550 adolescents aged 16 ± 1 years old completed the questionnaires. Thirty nine percent consumed alcohol during the last month, 31% smoked, 33% used marijuana and 33% admitted the use of multiple drugs. High-risk alcohol consumption was associated with a lower perception of psychological well-being, self-perception and school environment. This last dimension was affected in those who admitted marijuana use during the last month. Multiple drug use (three substances) was associated with a lower perception of physical and psychological well-being, self-perception, relationship with parents, family life and school environment. Conclusions High-risk alcohol consumption and multiple drug use (three substances) have a negative impact on the HRQOL of school age adolescents.

  18. Price Analysis of Railway Freight Transport under Marketing Mechanism

    NASA Astrophysics Data System (ADS)

    Shi, Ying; Fang, Xiaoping; Chen, Zhiya

    Regarding the problems in the reform of the railway tariff system and the pricing of the transport, by means of assaying the influence of the price elasticity on the artifice used for price, this article proposed multiple regressive model which analyzed price elasticity quantitatively. This model conclude multi-factors which influences on the price elasticity, such as the averagely railway freight charge, the averagely freight haulage of proximate supersede transportation mode, the GDP per capita in the point of origin, and a series of dummy variable which can reflect the features of some productive and consume demesne. It can calculate the price elasticity of different classes in different domains, and predict the freight traffic volume on different rate levels. It can calculate confidence-level, and evaluate the relevance of each parameter to get rid of irrelevant or little relevant variables. It supplied a good theoretical basis for directing the pricing of transport enterprises in market economic conditions, which is suitable for railway freight, passenger traffic and other transportation manner as well. SPSS (Statistical Package for the Social Science) software was used to calculate and analysis the example. This article realized the calculation by HYFX system(Ministry of Railways fund).

  19. Measuring sperm whales from their clicks: Stability of interpulse intervals and validation that they indicate whale length

    NASA Astrophysics Data System (ADS)

    Rhinelander, Marcus Q.; Dawson, Stephen M.

    2004-04-01

    Multiple pulses can often be distinguished in the clicks of sperm whales (Physeter macrocephalus). Norris and Harvey [in Animal Orientation and Navigation, NASA SP-262 (1972), pp. 397-417] proposed that this results from reflections within the head, and thus that interpulse interval (IPI) is an indicator of head length, and by extrapolation, total length. For this idea to hold, IPIs must be stable within individuals, but differ systematically among individuals of different size. IPI stability was examined in photographically identified individuals recorded repeatedly over different dives, days, and years. IPI variation among dives in a single day and days in a single year was statistically significant, although small in magnitude (it would change total length estimates by <3%). As expected, IPIs varied significantly among individuals. Most individuals showed significant increases in IPIs over several years, suggesting growth. Mean total lengths calculated from published IPI regressions were 13.1 to 16.1 m, longer than photogrammetric estimates of the same whales (12.3 to 15.3 m). These discrepancies probably arise from the paucity of large (12-16 m) whales in data used in published regressions. A new regression is offered for this size range.

  20. Factors associated with parasite dominance in fishes from Brazil.

    PubMed

    Amarante, Cristina Fernandes do; Tassinari, Wagner de Souza; Luque, Jose Luis; Pereira, Maria Julia Salim

    2016-06-14

    The present study used regression models to evaluate the existence of factors that may influence the numerical parasite dominance with an epidemiological approximation. A database including 3,746 fish specimens and their respective parasites were used to evaluate the relationship between parasite dominance and biotic characteristics inherent to the studied hosts and the parasite taxa. Multivariate, classical, and mixed effects linear regression models were fitted. The calculations were performed using R software (95% CI). In the fitting of the classical multiple linear regression model, freshwater and planktivorous fish species and body length, as well as the species of the taxa Trematoda, Monogenea, and Hirudinea, were associated with parasite dominance. However, the fitting of the mixed effects model showed that the body length of the host and the species of the taxa Nematoda, Trematoda, Monogenea, Hirudinea, and Crustacea were significantly associated with parasite dominance. Studies that consider specific biological aspects of the hosts and parasites should expand the knowledge regarding factors that influence the numerical dominance of fish in Brazil. The use of a mixed model shows, once again, the importance of the appropriate use of a model correlated with the characteristics of the data to obtain consistent results.

  1. Nonlinear-regression flow model of the Gulf Coast aquifer systems in the south-central United States

    USGS Publications Warehouse

    Kuiper, L.K.

    1994-01-01

    A multiple-regression methodology was used to help answer questions concerning model reliability, and to calibrate a time-dependent variable-density ground-water flow model of the gulf coast aquifer systems in the south-central United States. More than 40 regression models with 2 to 31 regressions parameters are used and detailed results are presented for 12 of the models. More than 3,000 values for grid-element volume-averaged head and hydraulic conductivity are used for the regression model observations. Calculated prediction interval half widths, though perhaps inaccurate due to a lack of normality of the residuals, are the smallest for models with only four regression parameters. In addition, the root-mean weighted residual decreases very little with an increase in the number of regression parameters. The various models showed considerable overlap between the prediction inter- vals for shallow head and hydraulic conductivity. Approximate 95-percent prediction interval half widths for volume-averaged freshwater head exceed 108 feet; for volume-averaged base 10 logarithm hydraulic conductivity, they exceed 0.89. All of the models are unreliable for the prediction of head and ground-water flow in the deeper parts of the aquifer systems, including the amount of flow coming from the underlying geopressured zone. Truncating the domain of solution of one model to exclude that part of the system having a ground-water density greater than 1.005 grams per cubic centimeter or to exclude that part of the systems below a depth of 3,000 feet, and setting the density to that of freshwater does not appreciably change the results for head and ground-water flow, except for locations close to the truncation surface.

  2. Multivariate meta-analysis using individual participant data.

    PubMed

    Riley, R D; Price, M J; Jackson, D; Wardle, M; Gueyffier, F; Wang, J; Staessen, J A; White, I R

    2015-06-01

    When combining results across related studies, a multivariate meta-analysis allows the joint synthesis of correlated effect estimates from multiple outcomes. Joint synthesis can improve efficiency over separate univariate syntheses, may reduce selective outcome reporting biases, and enables joint inferences across the outcomes. A common issue is that within-study correlations needed to fit the multivariate model are unknown from published reports. However, provision of individual participant data (IPD) allows them to be calculated directly. Here, we illustrate how to use IPD to estimate within-study correlations, using a joint linear regression for multiple continuous outcomes and bootstrapping methods for binary, survival and mixed outcomes. In a meta-analysis of 10 hypertension trials, we then show how these methods enable multivariate meta-analysis to address novel clinical questions about continuous, survival and binary outcomes; treatment-covariate interactions; adjusted risk/prognostic factor effects; longitudinal data; prognostic and multiparameter models; and multiple treatment comparisons. Both frequentist and Bayesian approaches are applied, with example software code provided to derive within-study correlations and to fit the models. © 2014 The Authors. Research Synthesis Methods published by John Wiley & Sons, Ltd.

  3. Multi -risk assessment at a national level in Georgia

    NASA Astrophysics Data System (ADS)

    Tsereteli, Nino; Varazanashvili, Otar; Amiranashvili, Avtandil; Tsereteli, Emili; Elizbarashvili, Elizbar; Saluqvadze, Manana; Dolodze, Jemal

    2013-04-01

    Work presented here was initiated by national GNSF project " Reducing natural disasters multiple risk: a positive factor for Georgia development " and two international projects: NATO SFP 983038 "Seismic hazard and Rusk assessment for Southern Caucasus-eastern Turkey Energy Corridors" and EMME " Earthquake Model for Middle east Region". Methodology for estimation of "general" vulnerability, hazards and multiple risk to natural hazards (namely, earthquakes, landslides, snow avalanches, flash floods, mudflows, drought, hurricanes, frost, hail) where developed for Georgia. The electronic detailed databases of natural disasters were created. These databases contain the parameters of hazardous phenomena that caused natural disasters. The magnitude and intensity scale of the mentioned disasters are reviewed and the new magnitude and intensity scales are suggested for disasters for which the corresponding formalization is not yet performed. The associated economic losses were evaluated and presented in monetary terms for these hazards. Based on the hazard inventory, an approach was developed that allowed for the calculation of an overall vulnerability value for each individual hazard type, using the Gross Domestic Product per unit area (applied to population) as the indicator for elements at risk exposed. The correlation between estimated economic losses, physical exposure and the magnitude for each of the six types of hazards has been investigated in detail by using multiple linear regression analysis. Economic losses for all past events and historical vulnerability were estimated. Finally, the spatial distribution of general vulnerability was assessed, and the expected maximum economic loss was calculated as well as a multi-risk map was set-up.

  4. SOME STATISTICAL ISSUES RELATED TO MULTIPLE LINEAR REGRESSION MODELING OF BEACH BACTERIA CONCENTRATIONS

    EPA Science Inventory

    As a fast and effective technique, the multiple linear regression (MLR) method has been widely used in modeling and prediction of beach bacteria concentrations. Among previous works on this subject, however, several issues were insufficiently or inconsistently addressed. Those is...

  5. MULTIPLE REGRESSION MODELS FOR HINDCASTING AND FORECASTING MIDSUMMER HYPOXIA IN THE GULF OF MEXICO

    EPA Science Inventory

    A new suite of multiple regression models were developed that describe the relationship between the area of bottom water hypoxia along the northern Gulf of Mexico and Mississippi-Atchafalaya River nitrate concentration, total phosphorus (TP) concentration, and discharge. Variabil...

  6. Estimate the contribution of incubation parameters influence egg hatchability using multiple linear regression analysis

    PubMed Central

    Khalil, Mohamed H.; Shebl, Mostafa K.; Kosba, Mohamed A.; El-Sabrout, Karim; Zaki, Nesma

    2016-01-01

    Aim: This research was conducted to determine the most affecting parameters on hatchability of indigenous and improved local chickens’ eggs. Materials and Methods: Five parameters were studied (fertility, early and late embryonic mortalities, shape index, egg weight, and egg weight loss) on four strains, namely Fayoumi, Alexandria, Matrouh, and Montazah. Multiple linear regression was performed on the studied parameters to determine the most influencing one on hatchability. Results: The results showed significant differences in commercial and scientific hatchability among strains. Alexandria strain has the highest significant commercial hatchability (80.70%). Regarding the studied strains, highly significant differences in hatching chick weight among strains were observed. Using multiple linear regression analysis, fertility made the greatest percent contribution (71.31%) to hatchability, and the lowest percent contributions were made by shape index and egg weight loss. Conclusion: A prediction of hatchability using multiple regression analysis could be a good tool to improve hatchability percentage in chickens. PMID:27651666

  7. Nutrient intake and use of dietary supplements among US adults with disabilities.

    PubMed

    An, Ruopeng; Chiu, Chung-Yi; Andrade, Flavia

    2015-04-01

    Physical, mental, social, and financial hurdles in adults with disabilities may limit their access to adequate nutrition. To examine the impact of dietary supplement use on daily total nutrient intake levels among US adults 20 years and older with disabilities. Study sample came from 2007-2008 and 2009-2010 waves of the National Health and Nutrition Examination Survey, a nationally representative repeated cross-sectional survey. Disability was classified into 5 categories using standardized indices. Nutrient intakes from foods and dietary supplements were calculated from 2 nonconsecutive 24-hour dietary recalls. Two-sample proportion tests and multiple logistic regressions were used to examine the adherence rates to the recommended daily nutrient intake levels between dietary supplement users and nonusers in each disability category. The association between sociodemographic characteristics and dietary supplement use was assessed using multiple logistic regressions, accounting for complex survey design. A substantial proportion of the US adult population with disabilities failed to meet dietary guidelines, with insufficient intakes of multiple nutrients. Over half of the US adults with disabilities used dietary supplements. Dietary supplement use was associated with higher adherence rates for vitamin A, vitamin B1, vitamin B2, vitamin B6, vitamin B12, vitamin C, vitamin D, vitamin E, calcium, copper, iron, magnesium, and zinc intake among adults with disabilities. Women, non-Hispanic Whites, older age, higher education, and higher household income were found to predict dietary supplement use. Proper use of dietary supplements under the guidance of health care providers may improve the nutritional status among adults with disabilities. Copyright © 2015 Elsevier Inc. All rights reserved.

  8. QSAR Analysis of 2-Amino or 2-Methyl-1-Substituted Benzimidazoles Against Pseudomonas aeruginosa

    PubMed Central

    Podunavac-Kuzmanović, Sanja O.; Cvetković, Dragoljub D.; Barna, Dijana J.

    2009-01-01

    A set of benzimidazole derivatives were tested for their inhibitory activities against the Gram-negative bacterium Pseudomonas aeruginosa and minimum inhibitory concentrations were determined for all the compounds. Quantitative structure activity relationship (QSAR) analysis was applied to fourteen of the abovementioned derivatives using a combination of various physicochemical, steric, electronic, and structural molecular descriptors. A multiple linear regression (MLR) procedure was used to model the relationships between molecular descriptors and the antibacterial activity of the benzimidazole derivatives. The stepwise regression method was used to derive the most significant models as a calibration model for predicting the inhibitory activity of this class of molecules. The best QSAR models were further validated by a leave one out technique as well as by the calculation of statistical parameters for the established theoretical models. To confirm the predictive power of the models, an external set of molecules was used. High agreement between experimental and predicted inhibitory values, obtained in the validation procedure, indicated the good quality of the derived QSAR models. PMID:19468332

  9. Quantile regression and clustering analysis of standardized precipitation index in the Tarim River Basin, Xinjiang, China

    NASA Astrophysics Data System (ADS)

    Yang, Peng; Xia, Jun; Zhang, Yongyong; Han, Jian; Wu, Xia

    2017-11-01

    Because drought is a very common and widespread natural disaster, it has attracted a great deal of academic interest. Based on 12-month time scale standardized precipitation indices (SPI12) calculated from precipitation data recorded between 1960 and 2015 at 22 weather stations in the Tarim River Basin (TRB), this study aims to identify the trends of SPI and drought duration, severity, and frequency at various quantiles and to perform cluster analysis of drought events in the TRB. The results indicated that (1) both precipitation and temperature at most stations in the TRB exhibited significant positive trends during 1960-2015; (2) multiple scales of SPIs changed significantly around 1986; (3) based on quantile regression analysis of temporal drought changes, the positive SPI slopes indicated less severe and less frequent droughts at lower quantiles, but clear variation was detected in the drought frequency; and (4) significantly different trends were found in drought frequency probably between severe droughts and drought frequency.

  10. [Association of mineral and bone disorder with increasing PWV in CKD 1-5 patients].

    PubMed

    Shiota, Jun; Watanabe, Mitsuhiro

    2007-01-01

    The association between pulse wave velocity(PWV) and chronic kidney disease mineral and bone disorder(CKD-MBD) was investigated in CKD 1-5 patients without dialysis. Pulse pressure(PP), PWV, serum Cr, non-HDL-cholesterol, Alb, Ca, Pi, calcitriol, intact-PTH and BAP were measured in sixty patients not receiving a phosphate binder or vitamin D. Using the relationship between age and baPWV in healthy subjects, we determined delta baPWV(measured baPWV-calculated baPWV) as an index for the effect of CKD-related factors. delta baPWV was significantly higher in diabetic patients (p < 0.00001). Simple regression analysis revealed that delta baPWV was positively correlated with PP (p < 0.05) and Log(intact-PTH) (p < 0.01), but negatively correlated with Log(estimated GFR) and Log(calcitriol) (p < 0.01). Multiple regression analysis revealed that delta baPWV was significantly associated with PP and calcitriol, or PP and intact-PTH. These results suggest a relationship between PWV and CKD-MBD.

  11. Mean centering, multicollinearity, and moderators in multiple regression: The reconciliation redux.

    PubMed

    Iacobucci, Dawn; Schneider, Matthew J; Popovich, Deidre L; Bakamitsos, Georgios A

    2017-02-01

    In this article, we attempt to clarify our statements regarding the effects of mean centering. In a multiple regression with predictors A, B, and A × B (where A × B serves as an interaction term), mean centering A and B prior to computing the product term can clarify the regression coefficients (which is good) and the overall model fit R 2 will remain undisturbed (which is also good).

  12. A Common Mechanism for Resistance to Oxime Reactivation of Acetylcholinesterase Inhibited by Organophosphorus Compounds

    DTIC Science & Technology

    2013-01-01

    application of the Hammett equation with the constants rph in the chemistry of organophosphorus compounds, Russ. Chem. Rev. 38 (1969) 795–811. [13...of oximes and OP compounds and the ability of oximes to reactivate OP- inhibited AChE. Multiple linear regression equations were analyzed using...phosphonate pairs, 21 oxime/ phosphoramidate pairs and 12 oxime/phosphate pairs. The best linear regression equation resulting from multiple regression anal

  13. Novel applications of multitask learning and multiple output regression to multiple genetic trait prediction.

    PubMed

    He, Dan; Kuhn, David; Parida, Laxmi

    2016-06-15

    Given a set of biallelic molecular markers, such as SNPs, with genotype values encoded numerically on a collection of plant, animal or human samples, the goal of genetic trait prediction is to predict the quantitative trait values by simultaneously modeling all marker effects. Genetic trait prediction is usually represented as linear regression models. In many cases, for the same set of samples and markers, multiple traits are observed. Some of these traits might be correlated with each other. Therefore, modeling all the multiple traits together may improve the prediction accuracy. In this work, we view the multitrait prediction problem from a machine learning angle: as either a multitask learning problem or a multiple output regression problem, depending on whether different traits share the same genotype matrix or not. We then adapted multitask learning algorithms and multiple output regression algorithms to solve the multitrait prediction problem. We proposed a few strategies to improve the least square error of the prediction from these algorithms. Our experiments show that modeling multiple traits together could improve the prediction accuracy for correlated traits. The programs we used are either public or directly from the referred authors, such as MALSAR (http://www.public.asu.edu/~jye02/Software/MALSAR/) package. The Avocado data set has not been published yet and is available upon request. dhe@us.ibm.com. © The Author 2016. Published by Oxford University Press.

  14. Simple and multiple linear regression: sample size considerations.

    PubMed

    Hanley, James A

    2016-11-01

    The suggested "two subjects per variable" (2SPV) rule of thumb in the Austin and Steyerberg article is a chance to bring out some long-established and quite intuitive sample size considerations for both simple and multiple linear regression. This article distinguishes two of the major uses of regression models that imply very different sample size considerations, neither served well by the 2SPV rule. The first is etiological research, which contrasts mean Y levels at differing "exposure" (X) values and thus tends to focus on a single regression coefficient, possibly adjusted for confounders. The second research genre guides clinical practice. It addresses Y levels for individuals with different covariate patterns or "profiles." It focuses on the profile-specific (mean) Y levels themselves, estimating them via linear compounds of regression coefficients and covariates. By drawing on long-established closed-form variance formulae that lie beneath the standard errors in multiple regression, and by rearranging them for heuristic purposes, one arrives at quite intuitive sample size considerations for both research genres. Copyright © 2016 Elsevier Inc. All rights reserved.

  15. Multiple imputation for cure rate quantile regression with censored data.

    PubMed

    Wu, Yuanshan; Yin, Guosheng

    2017-03-01

    The main challenge in the context of cure rate analysis is that one never knows whether censored subjects are cured or uncured, or whether they are susceptible or insusceptible to the event of interest. Considering the susceptible indicator as missing data, we propose a multiple imputation approach to cure rate quantile regression for censored data with a survival fraction. We develop an iterative algorithm to estimate the conditionally uncured probability for each subject. By utilizing this estimated probability and Bernoulli sample imputation, we can classify each subject as cured or uncured, and then employ the locally weighted method to estimate the quantile regression coefficients with only the uncured subjects. Repeating the imputation procedure multiple times and taking an average over the resultant estimators, we obtain consistent estimators for the quantile regression coefficients. Our approach relaxes the usual global linearity assumption, so that we can apply quantile regression to any particular quantile of interest. We establish asymptotic properties for the proposed estimators, including both consistency and asymptotic normality. We conduct simulation studies to assess the finite-sample performance of the proposed multiple imputation method and apply it to a lung cancer study as an illustration. © 2016, The International Biometric Society.

  16. Estimating basin lagtime and hydrograph-timing indexes used to characterize stormflows for runoff-quality analysis

    USGS Publications Warehouse

    Granato, Gregory E.

    2012-01-01

    A nationwide study to better define triangular-hydrograph statistics for use with runoff-quality and flood-flow studies was done by the U.S. Geological Survey (USGS) in cooperation with the Federal Highway Administration. Although the triangular hydrograph is a simple linear approximation, the cumulative distribution of stormflow with a triangular hydrograph is a curvilinear S-curve that closely approximates the cumulative distribution of stormflows from measured data. The temporal distribution of flow within a runoff event can be estimated using the basin lagtime, (which is the time from the centroid of rainfall excess to the centroid of the corresponding runoff hydrograph) and the hydrograph recession ratio (which is the ratio of the duration of the falling limb to the rising limb of the hydrograph). This report documents results of the study, methods used to estimate the variables, and electronic files that facilitate calculation of variables. Ten viable multiple-linear regression equations were developed to estimate basin lagtimes from readily determined drainage basin properties using data published in 37 stormflow studies. Regression equations using the basin lag factor (BLF, which is a variable calculated as the main-channel length, in miles, divided by the square root of the main-channel slope in feet per mile) and two variables describing development in the drainage basin were selected as the best candidates, because each equation explains about 70 percent of the variability in the data. The variables describing development are the USGS basin development factor (BDF, which is a function of the amount of channel modifications, storm sewers, and curb-and-gutter streets in a basin) and the total impervious area variable (IMPERV) in the basin. Two datasets were used to develop regression equations. The primary dataset included data from 493 sites that have values for the BLF, BDF, and IMPERV variables. This dataset was used to develop the best-fit regression equation using the BLF and BDF variables. The secondary dataset included data from 896 sites that have values for the BLF and IMPERV variables. This dataset was used to develop the best-fit regression equation using the BLF and IMPERV variables. Analysis of hydrograph recession ratios and basin characteristics for 41 sites indicated that recession ratios are random variables. Thus, recession ratios cannot be estimated quantitatively using multiple linear regression equations developed using the data available for these sites. The minimums of recession ratios for different streamgages are well characterized by a value of one. The most probable values and maximum values of recession ratios for different streamgages are, however, more variable than the minimums. The most probable values of recession ratios for the 41 streamgages analyzed ranged from 1.0 to 3.52 and had a median of 1.85. The maximum values ranged from 2.66 to 11.3 and had a median of 4.36.

  17. Undergraduate Student Motivation in Modularized Developmental Mathematics Courses

    ERIC Educational Resources Information Center

    Pachlhofer, Keith A.

    2017-01-01

    This study used the Motivated Strategies for Learning Questionnaire in modularized courses at three institutions across the nation (N = 189), and multiple regression was completed to investigate five categories of student motivation that predicted academic success and course completion. The overall multiple regression analysis was significant and…

  18. MULGRES: a computer program for stepwise multiple regression analysis

    Treesearch

    A. Jeff Martin

    1971-01-01

    MULGRES is a computer program source deck that is designed for multiple regression analysis employing the technique of stepwise deletion in the search for most significant variables. The features of the program, along with inputs and outputs, are briefly described, with a note on machine compatibility.

  19. Categorical Variables in Multiple Regression: Some Cautions.

    ERIC Educational Resources Information Center

    O'Grady, Kevin E.; Medoff, Deborah R.

    1988-01-01

    Limitations of dummy coding and nonsense coding as methods of coding categorical variables for use as predictors in multiple regression analysis are discussed. The combination of these approaches often yields estimates and tests of significance that are not intended by researchers for inclusion in their models. (SLD)

  20. Multiple Imputation of a Randomly Censored Covariate Improves Logistic Regression Analysis.

    PubMed

    Atem, Folefac D; Qian, Jing; Maye, Jacqueline E; Johnson, Keith A; Betensky, Rebecca A

    2016-01-01

    Randomly censored covariates arise frequently in epidemiologic studies. The most commonly used methods, including complete case and single imputation or substitution, suffer from inefficiency and bias. They make strong parametric assumptions or they consider limit of detection censoring only. We employ multiple imputation, in conjunction with semi-parametric modeling of the censored covariate, to overcome these shortcomings and to facilitate robust estimation. We develop a multiple imputation approach for randomly censored covariates within the framework of a logistic regression model. We use the non-parametric estimate of the covariate distribution or the semiparametric Cox model estimate in the presence of additional covariates in the model. We evaluate this procedure in simulations, and compare its operating characteristics to those from the complete case analysis and a survival regression approach. We apply the procedures to an Alzheimer's study of the association between amyloid positivity and maternal age of onset of dementia. Multiple imputation achieves lower standard errors and higher power than the complete case approach under heavy and moderate censoring and is comparable under light censoring. The survival regression approach achieves the highest power among all procedures, but does not produce interpretable estimates of association. Multiple imputation offers a favorable alternative to complete case analysis and ad hoc substitution methods in the presence of randomly censored covariates within the framework of logistic regression.

  1. The allometry of coarse root biomass: log-transformed linear regression or nonlinear regression?

    PubMed

    Lai, Jiangshan; Yang, Bo; Lin, Dunmei; Kerkhoff, Andrew J; Ma, Keping

    2013-01-01

    Precise estimation of root biomass is important for understanding carbon stocks and dynamics in forests. Traditionally, biomass estimates are based on allometric scaling relationships between stem diameter and coarse root biomass calculated using linear regression (LR) on log-transformed data. Recently, it has been suggested that nonlinear regression (NLR) is a preferable fitting method for scaling relationships. But while this claim has been contested on both theoretical and empirical grounds, and statistical methods have been developed to aid in choosing between the two methods in particular cases, few studies have examined the ramifications of erroneously applying NLR. Here, we use direct measurements of 159 trees belonging to three locally dominant species in east China to compare the LR and NLR models of diameter-root biomass allometry. We then contrast model predictions by estimating stand coarse root biomass based on census data from the nearby 24-ha Gutianshan forest plot and by testing the ability of the models to predict known root biomass values measured on multiple tropical species at the Pasoh Forest Reserve in Malaysia. Based on likelihood estimates for model error distributions, as well as the accuracy of extrapolative predictions, we find that LR on log-transformed data is superior to NLR for fitting diameter-root biomass scaling models. More importantly, inappropriately using NLR leads to grossly inaccurate stand biomass estimates, especially for stands dominated by smaller trees.

  2. Development and validation of an environmental fragility index (EFI) for the neotropical savannah biome.

    PubMed

    Macedo, Diego R; Hughes, Robert M; Kaufmann, Philip R; Callisto, Marcos

    2018-04-23

    Augmented production and transport of fine sediments resulting from increased human activities are major threats to freshwater ecosystems, including reservoirs and their ecosystem services. To support large scale assessment of the likelihood of soil erosion and reservoir sedimentation, we developed and validated an environmental fragility index (EFI) for the Brazilian neotropical savannah. The EFI was derived from measured geoclimatic controls on sediment production (rainfall, variation of elevation and slope, geology) and anthropogenic pressures (natural cover, road density, distance from roads and urban centers) in 111 catchments upstream of four large hydroelectric reservoirs. We evaluated the effectiveness of the EFI by regressing it against a relative bed stability index (LRBS) that assesses the degree to which stream sites draining into the reservoirs are affected by excess fine sediments. We developed the EFI on 111 of these sites and validated our model on the remaining 37 independent sites. We also compared the effectiveness of the EFI in predicting LRBS with that of a multiple linear regression model (via best-subset procedure) using 7 independent variables. The EFI was significantly correlated with the LRBS, with regression R 2 values of 0.32 and 0.40, respectively, in development and validation sites. Although the EFI and multiple regression explained similar amounts of variability (R 2  = 0.32 vs 0.36), the EFI had a higher F-ratio (51.6 vs 8.5) and better AICc value (333 vs 338). Because the sites were randomly selected and well-distributed across geoclimatic controlling factors, we were able to calculate spatially-explicit EFI values for all hydrologic units within the study area (~38,500 km 2 ). This model-based inference showed that over 65% of those units had high or extreme fragility. This methodology has great potential for application in the management, recovery, and preservation of hydroelectric reservoirs and streams in tropical river basins. Copyright © 2018 Elsevier B.V. All rights reserved.

  3. Modeling Laterality of the Globus Pallidus Internus in Patients With Parkinson's Disease.

    PubMed

    Sharim, Justin; Yazdi, Daniel; Baohan, Amy; Behnke, Eric; Pouratian, Nader

    2017-04-01

    Neurosurgical interventions such as deep brain stimulation surgery of the globus pallidus internus (GPi) play an important role in the treatment of medically refractory Parkinson's disease (PD), and require high targeting accuracy. Variability in the laterality of the GPi across patients with PD has not been well characterized. The aim of this report is to identify factors that may contribute to differences in position of the motor region of GPi. The charts and operative reports of 101 PD patients following deep brain stimulation surgery (70 males, aged 11-78 years) representing 201 GPi were retrospectively reviewed. Data extracted for each subject include age, gender, anterior and posterior commissures (AC-PC) distance, and third ventricular width. Multiple linear regression, stepwise regression, and relative importance of regressors analysis were performed to assess the predictive ability of these variables on GPi laterality. Multiple linear regression for target vs. third ventricular width, gender, AC-PC distance, and age were significant for normalized linear regression coefficients of 0.333 (p < 0.0001), 0.206 (p = 0.00219), 0.168 (p = 0.0119), and 0.159 (p = 0.0136), respectively. Third ventricular width, gender, AC-PC distance, and age each account for 44.06% (21.38-65.69%, 95% CI), 20.82% (10.51-35.88%), 21.46% (8.28-37.05%), and 13.66% (2.62-28.64%) of the R 2 value, respectively. Effect size calculation was significant for a change in the GPi laterality of 0.19 mm per mm of ventricular width, 0.11 mm per mm of AC-PC distance, 0.017 mm per year in age, and 0.54 mm increase for male gender. This variability highlights the limitations of indirect targeting alone, and argues for the continued use of MRI as well as intraoperative physiological testing to account for such factors that contribute to patient-specific variability in GPi localization. © 2016 International Neuromodulation Society.

  4. The relationship between hemoglobin level and the type 1 diabetic nephropathy in Anhui Han's patients.

    PubMed

    Jiang, Jun; Lei, Lan; Zhou, Xiaowan; Li, Peng; Wei, Ren

    2018-02-20

    Recent studies have shown that low hemoglobin (Hb) level promote the progression of chronic kidney disease. This study assessed the relationship between Hb level and type 1 diabetic nephropathy (DN) in Anhui Han's patients. There were a total of 236 patients diagnosed with type 1 diabetes mellitus and (T1DM) seen between January 2014 and December 2016 in our centre. Hemoglobin levels in patients with DN were compared with those without DN. The relationship between Hb level and the urinary albumin-creatinine ratio (ACR) was examined by Spearman's correlational analysis and multiple stepwise regression analysis. The binary logistic multivariate regression analysis was performed to analyze the correlated factors for type 1 DN, calculate the Odds Ratio (OR) and 95%confidence interval (CI). The predicting value of Hb level for DN was evaluated by area under receiver operation characteristic curve (AUROC) for discrimination and Hosmer-Lemeshow goodness-of-fit test for calibration. The average Hb levels in the DN group (116.1 ± 20.8 g/L) were significantly lower than the non-DN group (131.9 ± 14.4 g/L) , P < 0.001. Hb levels were independently correlated with the urinary ACR in multiple stepwise regression analysis. The logistic multivariate regression analysis showed that the Hb level (OR: 0.936, 95% CI: 0.910 to 0.963, P < 0.001) was inversely correlated with DN in patients with T1DM. In sub-analysis, low Hb level (Hb < 120g/L in female, Hb < 130g/L in male) was still negatively associated with DN in patients with T1DM. The AUROC was 0.721 (95% CI: 0.655 to 0.787) in assessing the discrimination of the Hb level for DN. The value of P was 0.593 in Hosmer-Lemeshow goodness-of-fit test. In Anhui Han's patients with T1DM, the Hb level is inversely correlated with urinary ACR and DN. This article is protected by copyright. All rights reserved.

  5. An empirical study using permutation-based resampling in meta-regression

    PubMed Central

    2012-01-01

    Background In meta-regression, as the number of trials in the analyses decreases, the risk of false positives or false negatives increases. This is partly due to the assumption of normality that may not hold in small samples. Creation of a distribution from the observed trials using permutation methods to calculate P values may allow for less spurious findings. Permutation has not been empirically tested in meta-regression. The objective of this study was to perform an empirical investigation to explore the differences in results for meta-analyses on a small number of trials using standard large sample approaches verses permutation-based methods for meta-regression. Methods We isolated a sample of randomized controlled clinical trials (RCTs) for interventions that have a small number of trials (herbal medicine trials). Trials were then grouped by herbal species and condition and assessed for methodological quality using the Jadad scale, and data were extracted for each outcome. Finally, we performed meta-analyses on the primary outcome of each group of trials and meta-regression for methodological quality subgroups within each meta-analysis. We used large sample methods and permutation methods in our meta-regression modeling. We then compared final models and final P values between methods. Results We collected 110 trials across 5 intervention/outcome pairings and 5 to 10 trials per covariate. When applying large sample methods and permutation-based methods in our backwards stepwise regression the covariates in the final models were identical in all cases. The P values for the covariates in the final model were larger in 78% (7/9) of the cases for permutation and identical for 22% (2/9) of the cases. Conclusions We present empirical evidence that permutation-based resampling may not change final models when using backwards stepwise regression, but may increase P values in meta-regression of multiple covariates for relatively small amount of trials. PMID:22587815

  6. Advanced Statistics for Exotic Animal Practitioners.

    PubMed

    Hodsoll, John; Hellier, Jennifer M; Ryan, Elizabeth G

    2017-09-01

    Correlation and regression assess the association between 2 or more variables. This article reviews the core knowledge needed to understand these analyses, moving from visual analysis in scatter plots through correlation, simple and multiple linear regression, and logistic regression. Correlation estimates the strength and direction of a relationship between 2 variables. Regression can be considered more general and quantifies the numerical relationships between an outcome and 1 or multiple variables in terms of a best-fit line, allowing predictions to be made. Each technique is discussed with examples and the statistical assumptions underlying their correct application. Copyright © 2017 Elsevier Inc. All rights reserved.

  7. The Clustering of Lifestyle Behaviours in New Zealand and their Relationship with Optimal Wellbeing.

    PubMed

    Prendergast, Kate B; Mackay, Lisa M; Schofield, Grant M

    2016-10-01

    The purpose of this research was to determine (1) associations between multiple lifestyle behaviours and optimal wellbeing and (2) the extent to which five lifestyle behaviours-sleep, physical activity, sedentary behaviour, sugary drink consumption, and fruit and vegetable intake-cluster in a national sample. A national sample of New Zealand adults participated in a web-based wellbeing survey. Five lifestyle behaviours-sleep, physical activity, sedentary behaviour, sugary drink consumption, and fruit and vegetable intake-were dichotomised into healthy (meets recommendations) and unhealthy (does not meet recommendations) categories. Optimal wellbeing was calculated using a multi-dimensional flourishing scale, and binary logistic regression analysis was used to calculate the relationship between multiple healthy behaviours and optimal wellbeing. Clustering was examined by comparing the observed and expected prevalence rates (O/E) of healthy and unhealthy two-, three-, four-, and five-behaviour combinations. Data from 9425 participants show those engaging in four to five healthy behaviours (23 %) were 4.7 (95 % confidence interval (CI) 3.8-5.7) times more likely to achieve optimal wellbeing compared to those engaging in zero to one healthy behaviour (21 %). Clustering was observed for healthy (5 %, O/E 2.0, 95 % CI 1.8-2.2) and unhealthy (5 %, O/E 2.1, 95 % CI 1.9-2.3) five-behaviour combinations and for four- and three-behaviour combinations. At the two-behaviour level, healthy fruit and vegetable intake clustered with all behaviours, except sleep which did not cluster with any behaviour. Multiple lifestyle behaviours were positively associated with optimal wellbeing. The results show lifestyle behaviours cluster, providing support for multiple behaviour lifestyle-based interventions for optimising wellbeing.

  8. The Economic Costs of Progressive Supranuclear Palsy and Multiple System Atrophy in France, Germany and the United Kingdom

    PubMed Central

    McCrone, Paul; Payan, Christine Anne Mary; Knapp, Martin; Ludolph, Albert; Agid, Yves; Leigh, P. Nigel; Bensimon, Gilbert

    2011-01-01

    Progressive supranuclear palsy (PSP) and multiple system atrophy (MSA) are progressive disabling neurological conditions usually fatal within 10 years of onset. Little is known about the economic costs of these conditions. This paper reports service use and costs from France, Germany and the UK and identifies patient characteristics that are associated with cost. 767 patients were recruited, and 760 included in the study, from 44 centres as part of the NNIPPS trial. Service use during the previous six months was measured at entry to the study and costs calculated. Mean six-month costs were calculated for 742 patients. Data on patient sociodemographic and clinical characteristics were recorded and used in regression models to identify predictors of service costs and unpaid care costs (i.e., care from family and friends). The mean six-month service costs of PSP were €24,491 in France, €30,643 in Germany and €25,655 in the UK. The costs for MSA were €28,924, €25,645 and €19,103 respectively. Unpaid care accounted for 68–76%. Formal and unpaid costs were significantly higher the more severe the illness, as indicated by the Parkinson's Plus Symptom scale. There was a significant inverse relationship between service and unpaid care costs. PMID:21931694

  9. Automated brain volumetrics in multiple sclerosis: a step closer to clinical application

    PubMed Central

    Beadnall, H N; Hatton, S N; Bader, G; Tomic, D; Silva, D G

    2016-01-01

    Background Whole brain volume (WBV) estimates in patients with multiple sclerosis (MS) correlate more robustly with clinical disability than traditional, lesion-based metrics. Numerous algorithms to measure WBV have been developed over the past two decades. We compare Structural Image Evaluation using Normalisation of Atrophy-Cross-sectional (SIENAX) to NeuroQuant and MSmetrix, for assessment of cross-sectional WBV in patients with MS. Methods MRIs from 61 patients with relapsing-remitting MS and 2 patients with clinically isolated syndrome were analysed. WBV measurements were calculated using SIENAX, NeuroQuant and MSmetrix. Statistical agreement between the methods was evaluated using linear regression and Bland-Altman plots. Precision and accuracy of WBV measurement was calculated for (1) NeuroQuant versus SIENAX and (2) MSmetrix versus SIENAX. Results Precision (Pearson's r) of WBV estimation for NeuroQuant and MSmetrix versus SIENAX was 0.983 and 0.992, respectively. Accuracy (Cb) was 0.871 and 0.994, respectively. NeuroQuant and MSmetrix showed a 5.5% and 1.0% volume difference compared with SIENAX, respectively, that was consistent across low and high values. Conclusions In the analysed population, NeuroQuant and MSmetrix both quantified cross-sectional WBV with comparable statistical agreement to SIENAX, a well-validated cross-sectional tool that has been used extensively in MS clinical studies. PMID:27071647

  10. INTRODUCTION TO A COMBINED MULTIPLE LINEAR REGRESSION AND ARMA MODELING APPROACH FOR BEACH BACTERIA PREDICTION

    EPA Science Inventory

    Due to the complexity of the processes contributing to beach bacteria concentrations, many researchers rely on statistical modeling, among which multiple linear regression (MLR) modeling is most widely used. Despite its ease of use and interpretation, there may be time dependence...

  11. Determining the Spatial and Seasonal Variability in OM/OC Ratios across the U.S. Using Multiple Regression

    EPA Science Inventory

    Data from the Interagency Monitoring of Protected Visual Environments (IMPROVE) network are used to estimate organic mass to organic carbon (OM/OC) ratios across the United States by extending previously published multiple regression techniques. Our new methodology addresses com...

  12. Analysis and Interpretation of Findings Using Multiple Regression Techniques

    ERIC Educational Resources Information Center

    Hoyt, William T.; Leierer, Stephen; Millington, Michael J.

    2006-01-01

    Multiple regression and correlation (MRC) methods form a flexible family of statistical techniques that can address a wide variety of different types of research questions of interest to rehabilitation professionals. In this article, we review basic concepts and terms, with an emphasis on interpretation of findings relevant to research questions…

  13. Tracking the Gender Pay Gap: A Case Study

    ERIC Educational Resources Information Center

    Travis, Cheryl B.; Gross, Louis J.; Johnson, Bruce A.

    2009-01-01

    This article provides a short introduction to standard considerations in the formal study of wages and illustrates the use of multiple regression and resampling simulation approaches in a case study of faculty salaries at one university. Multiple regression is especially beneficial where it provides information on strength of association, specific…

  14. Estimating air drying times of lumber with multiple regression

    Treesearch

    William T. Simpson

    2004-01-01

    In this study, the applicability of a multiple regression equation for estimating air drying times of red oak, sugar maple, and ponderosa pine lumber was evaluated. The equation allows prediction of estimated air drying times from historic weather records of temperature and relative humidity at any desired location.

  15. Using Robust Variance Estimation to Combine Multiple Regression Estimates with Meta-Analysis

    ERIC Educational Resources Information Center

    Williams, Ryan

    2013-01-01

    The purpose of this study was to explore the use of robust variance estimation for combining commonly specified multiple regression models and for combining sample-dependent focal slope estimates from diversely specified models. The proposed estimator obviates traditionally required information about the covariance structure of the dependent…

  16. Multiple Regression: A Leisurely Primer.

    ERIC Educational Resources Information Center

    Daniel, Larry G.; Onwuegbuzie, Anthony J.

    Multiple regression is a useful statistical technique when the researcher is considering situations in which variables of interest are theorized to be multiply caused. It may also be useful in those situations in which the researchers is interested in studies of predictability of phenomena of interest. This paper provides an introduction to…

  17. Using Monte Carlo Techniques to Demonstrate the Meaning and Implications of Multicollinearity

    ERIC Educational Resources Information Center

    Vaughan, Timothy S.; Berry, Kelly E.

    2005-01-01

    This article presents an in-class Monte Carlo demonstration, designed to demonstrate to students the implications of multicollinearity in a multiple regression study. In the demonstration, students already familiar with multiple regression concepts are presented with a scenario in which the "true" relationship between the response and…

  18. Assessing the Impact of Influential Observations on Multiple Regression Analysis on Human Resource Research.

    ERIC Educational Resources Information Center

    Bates, Reid A.; Holton, Elwood F., III; Burnett, Michael F.

    1999-01-01

    A case study of learning transfer demonstrates the possible effect of influential observation on linear regression analysis. A diagnostic method that tests for violation of assumptions, multicollinearity, and individual and multiple influential observations helps determine which observation to delete to eliminate bias. (SK)

  19. Use of magnetic resonance imaging to predict the body composition of pigs in vivo.

    PubMed

    Kremer, P V; Förster, M; Scholz, A M

    2013-06-01

    The objective of the study was to evaluate whether magnetic resonance imaging (MRI) offers the opportunity to reliably analyze body composition of pigs in vivo. Therefore, the relation between areas of loin eye muscle and its back fat based on MRI images were used to predict body composition values measured by dual energy X-ray absorptiometry (DXA). During the study, a total of 77 pigs were studied by MRI and DXA, with a BW ranging between 42 and 102 kg. The pigs originated from different extensive or conventional breeds or crossbreds such as Cerdo Iberico, Duroc, German Landrace, German Large White, Hampshire and Pietrain. A Siemens Magnetom Open was used for MRI in the thorax region between 13th and 14th vertebrae in order to measure the loin eye area (MRI-LA) and the above back fat area (MRI-FA) of both body sides, whereas a whole body scan was performed by DXA with a GE Lunar DPX-IQ in order to measure the amount and percentage of fat tissue (DXA-FM; DXA-%FM) and lean tissue mass (DXA-LM; DXA-%LM). A linear single regression analysis was performed to quantify the linear relationships between MRI- and DXA-derived traits. In addition, a stepwise regression procedure was carried out to calculate (multiple) regression equations between MRI and DXA variables (including BW). Single regression analyses showed high relationships between DXA-%FM and MRI-FA (R 2 = 0.89, √MSE = 2.39%), DXA-FM and MRI-FA (R 2 = 0.82, √MSE = 2757 g) and DXA-LM and MRI-LA (R 2 = 0.82, √MSE = 4018 g). Only DXA-%LM and MRI-LA did not show any relationship (R 2 = 0). As a result of the multiple regression analysis, DXA-LM and DXA-FM were both highly related to MRI-LA, MRI-FA and BW (R 2 = 0.96; √MSE = 1784 g, and R 2 = 0.95, √MSE = 1496 g). Therefore, it can be concluded that the use of MRI-derived images provides exact information about important 'carcass-traits' in pigs and may be used to reliably predict the body composition in vivo.

  20. The impact of a standardized program on short and long-term outcomes in bariatric surgery.

    PubMed

    Aird, Lisa N F; Hong, Dennis; Gmora, Scott; Breau, Ruth; Anvari, Mehran

    2017-02-01

    The purpose of this study was to determine whether there has been an improvement in short- and long-term clinical outcomes since 2010, when the Ontario Bariatric Network led a province-wide initiative to establish a standardized system of care for bariatric patients. The system includes nine bariatric centers, a centralized referral system, and a research registry. Standardization of procedures has progressed yearly, including guidelines for preoperative assessment and perioperative care. Analysis of the OBN registry data was performed by fiscal year between April 2010 and March 2015. Three-month overall postoperative complication rates and 30 day postoperative mortality were calculated. The mean percentage of weight loss at 1, 2, and 3 years postoperative, and regression of obesity-related diseases were calculated. The analysis of continuous and nominal data was performed using ANOVA, Chi-square, and McNemar's testing. A multiple logistic regression analysis was performed for factors affecting postoperative complication rate. Eight thousand and forty-three patients were included in the bariatric registry between April 2010 and March 2015. Thirty-day mortality was rare (<0.075 %) and showed no significant difference between years. Three-month overall postoperative complication rates significantly decreased with standardization (p < 0.001), as did intra-operative complication rates (p < -0.001). Regression analysis demonstrated increasing standardization to be a predictor of 3 month complication rate OR of 0.59 (95 %CI 0.41-0.85, p = 0.00385). The mean percentage of weight loss at 1, 2, and 3 years postoperative showed stability at 33.2 % (9.0 SD), 34.1 % (10.1 SD), and 32.7 % (10.1 SD), respectively. Sustained regression in obesity-related comorbidities was demonstrated at 1, 2, and 3 years postoperative. Evidence indicates the implementation of a standardized system of bariatric care has contributed to improvements in complication rates and supported prolonged weight loss and regression of obesity-related diseases in patients undergoing bariatric surgery in Ontario.

  1. [Life satisfaction and related socio-demographic factors during female midlife].

    PubMed

    Cuadros, José Luis; Pérez-Roncero, Gonzalo R; López-Baena, María Teresa; Cuadros-Celorrio, Angela M; Fernández-Alonso, Ana María

    2014-01-01

    To assess life satisfaction and related factors in middle-aged Spanish women. This was a cross-sectional study including 235 women aged 40 to 65, living in Granada (Spain), healthy companions of patients visiting the obstetrics and gynecology clinics. They completed the Diener Satisfaction with Life Scale, the Menopause Rating Scale, the Perceived Stress Scale, the Insomnia Severity Index and a sociodemographic questionnaire containing personal and partner data. Internal consistency of each tool was also calculated. Almost two-thirds (61.3%) of the women were postmenopausal, and 43.8% had abdominal obesity, 36.6% had insomnia, 18.7% had poor menopause-related quality of life, 31.9% performed regular exercise, and 5.1% had severe financial problems. Life satisfaction showed significant positive correlations (Spearman's test) with female and male age, and inverse correlations with menopause-related quality of life, perceived stress and insomnia. In the multiple linear regression analysis, high life satisfaction is positively correlated with having a partner who performed exercise, and inversely with having work problems, perceived stress and the suspicion of partner infidelity. These factors explained 40% of the variance of the multiple regression analysis for life satisfaction in middle-aged women. Life satisfaction is a construct related to perceived stress, work problems, and having a partner, while aspects of menopause and general health had no significant influence. Copyright © 2014 Elsevier España, S.L.U. All rights reserved.

  2. CALIBRATING NON-CONVEX PENALIZED REGRESSION IN ULTRA-HIGH DIMENSION.

    PubMed

    Wang, Lan; Kim, Yongdai; Li, Runze

    2013-10-01

    We investigate high-dimensional non-convex penalized regression, where the number of covariates may grow at an exponential rate. Although recent asymptotic theory established that there exists a local minimum possessing the oracle property under general conditions, it is still largely an open problem how to identify the oracle estimator among potentially multiple local minima. There are two main obstacles: (1) due to the presence of multiple minima, the solution path is nonunique and is not guaranteed to contain the oracle estimator; (2) even if a solution path is known to contain the oracle estimator, the optimal tuning parameter depends on many unknown factors and is hard to estimate. To address these two challenging issues, we first prove that an easy-to-calculate calibrated CCCP algorithm produces a consistent solution path which contains the oracle estimator with probability approaching one. Furthermore, we propose a high-dimensional BIC criterion and show that it can be applied to the solution path to select the optimal tuning parameter which asymptotically identifies the oracle estimator. The theory for a general class of non-convex penalties in the ultra-high dimensional setup is established when the random errors follow the sub-Gaussian distribution. Monte Carlo studies confirm that the calibrated CCCP algorithm combined with the proposed high-dimensional BIC has desirable performance in identifying the underlying sparsity pattern for high-dimensional data analysis.

  3. Association of dentine hypersensitivity with different risk factors - a cross sectional study.

    PubMed

    Vijaya, V; Sanjay, Venkataraam; Varghese, Rana K; Ravuri, Rajyalakshmi; Agarwal, Anil

    2013-12-01

    This study was done to assess the prevalence of Dentine hypersensitivity (DH) and its associated risk factors. This epidemiological study was done among patients coming to dental college regarding prevalence of DH. A self structured questionnaire along with clinical examination was done for assessment. Descriptive statistics were obtained and frequency distribution was calculated using Chi square test at p value <0.05. Stepwise multiple linear regression was also done to access frequency of DH with different factors. The study population was comprised of 655 participants with different age groups. Our study showed prevalence as 55% and it was more common among males. Similarly smokers and those who use hard tooth brush had more cases of DH. Step wise multiple linear regression showed that best predictor for DH was age followed by habit of smoking and type of tooth brush. Most aggravating factors were cold water (15.4%) and sweet foods (14.7%), whereas only 5% of the patients had it while brushing. A high level of dental hypersensitivity has been in this study and more common among males. A linear finding was shown with age, smoking and type of tooth brush. How to cite this article: Vijaya V, Sanjay V, Varghese RK, Ravuri R, Agarwal A. Association of Dentine Hypersensitivity with Different Risk Factors - A Cross Sectional Study. J Int Oral Health 2013;5(6):88-92 .

  4. Continuous optical measurement system of hemolysis during a photosensitization reaction using absorption spectrum

    NASA Astrophysics Data System (ADS)

    Hamada, R.; Ogawa, E.; Arai, T.

    2018-02-01

    To investigate hemolysis phenomena during a photosensitization reaction with the reaction condition continuously and simultaneously for a safety assessment of hemolysis side effect, we constructed an optical system to measure blood sample absorption spectrum during the reaction. Hemolysis degree might be under estimated in general evaluation methods because there is a constant oxygen pressure assumption in spite of oxygen depression take place. By investigating hemoglobin oxidation and oxygen desorption dynamics obtained from the contribution of the visible absorption spectrum and multiple regression analysis, both the hemolysis phenomena and its oxygen environment might be obtained with time. A 664 nm wavelength laser beam for the reaction excitation and 475-650 nm light beam for measuring the absorbance spectrum were arranged perpendicularly crossing. A quartz glass cuvette with 1×10 mm in dimensions for the spectrum measurement was located at this crossing point. A red blood cells suspension medium was arranged with low hematocrit containing 30 μg/ml talaporfin sodium. This medium was irradiated up to 40 J/cm2 . The met-hemoglobin, oxygenatedhemoglobin, and deoxygenated-hemoglobin concentrations were calculated by a multiple regression analysis from the measured spectra. We confirmed the met-hemoglobin concentration increased and oxygen saturation decreased with the irradiation time, which seems to indicate the hemolysis progression and oxygen consumption, respectively. By using our measuring system, the hemolysis progression seems to be obtained with oxygen environment information.

  5. CALIBRATING NON-CONVEX PENALIZED REGRESSION IN ULTRA-HIGH DIMENSION

    PubMed Central

    Wang, Lan; Kim, Yongdai; Li, Runze

    2014-01-01

    We investigate high-dimensional non-convex penalized regression, where the number of covariates may grow at an exponential rate. Although recent asymptotic theory established that there exists a local minimum possessing the oracle property under general conditions, it is still largely an open problem how to identify the oracle estimator among potentially multiple local minima. There are two main obstacles: (1) due to the presence of multiple minima, the solution path is nonunique and is not guaranteed to contain the oracle estimator; (2) even if a solution path is known to contain the oracle estimator, the optimal tuning parameter depends on many unknown factors and is hard to estimate. To address these two challenging issues, we first prove that an easy-to-calculate calibrated CCCP algorithm produces a consistent solution path which contains the oracle estimator with probability approaching one. Furthermore, we propose a high-dimensional BIC criterion and show that it can be applied to the solution path to select the optimal tuning parameter which asymptotically identifies the oracle estimator. The theory for a general class of non-convex penalties in the ultra-high dimensional setup is established when the random errors follow the sub-Gaussian distribution. Monte Carlo studies confirm that the calibrated CCCP algorithm combined with the proposed high-dimensional BIC has desirable performance in identifying the underlying sparsity pattern for high-dimensional data analysis. PMID:24948843

  6. Cognitive and physical functions related to the level of supervision and dependence in the toileting of stroke patients.

    PubMed

    Sato, Atsushi; Okuda, Yutaka; Fujita, Takaaki; Kimura, Norihiko; Hoshina, Noriyuki; Kato, Sayaka; Tanaka, Shigenari

    2016-01-01

    This study aimed to clarify which cognitive and physical factors are associated with the need for toileting assistance in stroke patients and to calculate cut-off values for discriminating between independent supervision and dependent toileting ability. This cross-sectional study included 163 first-stroke patients in nine convalescent rehabilitation wards. Based on their FIM Ⓡ instrument score for toileting, the patients were divided into an independent-supervision group and a dependent group. Multiple logistic regression analysis and receiver operating characteristic analysis were performed to identify factors related to toileting performance. The Minimental State Examination (MMSE); the Stroke Impairment Assessment Set (SIAS) score for the affected lower limb, speech, and visuospatial functions; and the Functional Assessment for Control of Trunk (FACT) were analyzed as independent variables. The multiple logistic regression analysis showed that the FIM Ⓡ instrument score for toileting was associated with the SIAS score for the affected lower limb function, MMSE, and FACT. On receiver operating characteristic analysis, the SIAS score for the affected lower limb function cut-off value was 8/7 points, the MMSE cut-off value was 25/24 points, and the FACT cut-off value was 14/13 points. Affected lower limb function, cognitive function, and trunk function were related with the need for toileting assistance. These cut-off values may be useful for judging whether toileting assistance is needed in stroke patients.

  7. Reference values of brachial-ankle pulse wave velocity according to age and blood pressure in a central Asia population.

    PubMed

    Yiming, Gulinuer; Zhou, Xianhui; Lv, Wenkui; Peng, Yi; Zhang, Wenhui; Cheng, Xinchun; Li, Yaodong; Xing, Qiang; Zhang, Jianghua; Zhou, Qina; Zhang, Ling; Lu, Yanmei; Wang, Hongli; Tang, Baopeng

    2017-01-01

    Brachial-ankle pulse wave velocity (baPWV), a direct measure of aortic stiffness, has increasingly become an important assessment for cardiovascular risk. The present study established the reference and normal values of baPWV in a Central Asia population in Xinjiang, China. We recruited participants from a central Asia population in Xinjiang, China. We performed multiple regression analysis to investigate the determinants of baPWV. The median and 10th-90th percentiles were calculated to establish the reference and normal values based on these categories. In total, 5,757 Han participants aged 15-88 years were included in the present study. Spearman correlation analysis showed that age (r = 0.587, p < 0.001) and mean blood pressure (MBP, r = 0.599, p <0.001) were the major factors influencing the values of baPWV in the reference population. Furthermore, in the multiple linear regression analysis, the standardized regression coefficients of age (0.445) and MBP (0.460) were much higher than those of body mass index, triglyceride, and glycemia (-0.054, 0.035, and 0.033, respectively). In the covariance analysis, after adjustment for age and MBP, only diabetes was the significant independent determinant of baPWV (p = 0.009). Thus, participants with diabetes were excluded from the reference value population. The reference values ranged from 14.3 to 25.2 m/s, and the normal values ranged from 13.9 to 21.2 m/s. This is the first study that has established the reference and normal values for baPWV according to age and blood pressure in a Central Asia population.

  8. Quality of life of patients who undergo breast reconstruction after mastectomy: effects of personality characteristics.

    PubMed

    Bellino, Silvio; Fenocchio, Marina; Zizza, Monica; Rocca, Giuseppe; Bogetti, Paolo; Bogetto, Filippo

    2011-01-01

    Reconstruction after mastectomy has become an integral part of breast cancer treatment. The effects of psychological factors on quality of life after reconstruction have been poorly investigated. The authors examined clinical and personality characteristics related to quality of life in patients receiving reconstructive surgery. All patients received immediate reconstruction and were evaluated in the week before tissue expander implantation (T0) with a semistructured interview for demographic and clinical characteristics, the Temperament and Character Inventory, the Inventory of Interpersonal Problems, the Short Form Health Survey, the Severity Item of the Clinical Global Impression, the Hamilton Depression Rating Scale, and the Hamilton Anxiety Rating Scale. Assessment with the Short Form was repeated 3 months after expander placement (T1). Statistics were calculated with univariate regression and analysis of variance. Significant variables were included in a multiple regression analysis to identify factors related to the change T1-T0 of the mean of the Short Form-transformed scores. Results were significant when p was less than or equal to 0.05. Fifty-seven women were enrolled. Results of multiple regression analysis showed that the Temperament and Character Inventory personality dimension harm avoidance and the Inventory of Interpersonal Problems domain vindictive/self-centered were significantly and independently related to the change in Short Form mean score. Personality dimensions and patterns of interpersonal functioning produce significant effects on patients' quality of life during breast reconstruction. Patients with high harm avoidance are apprehensive and doubtful. Restoration of body image could help them to reduce social anxiety and insecurity. Vindictive/self-centered patients are resentful and aggressive. Breast reconstruction could symbolize the conclusion of a reparative process and fulfill the desire of revenge on cancer.

  9. Spatial analysis and land use regression of VOCs and NO(2) from school-based urban air monitoring in Detroit/Dearborn, USA.

    PubMed

    Mukerjee, Shaibal; Smith, Luther A; Johnson, Mary M; Neas, Lucas M; Stallings, Casson A

    2009-08-01

    Passive ambient air sampling for nitrogen dioxide (NO(2)) and volatile organic compounds (VOCs) was conducted at 25 school and two compliance sites in Detroit and Dearborn, Michigan, USA during the summer of 2005. Geographic Information System (GIS) data were calculated at each of 116 schools. The 25 selected schools were monitored to assess and model intra-urban gradients of air pollutants to evaluate impact of traffic and urban emissions on pollutant levels. Schools were chosen to be statistically representative of urban land use variables such as distance to major roadways, traffic intensity around the schools, distance to nearest point sources, population density, and distance to nearest border crossing. Two approaches were used to investigate spatial variability. First, Kruskal-Wallis analyses and pairwise comparisons on data from the schools examined coarse spatial differences based on city section and distance from heavily trafficked roads. Secondly, spatial variation on a finer scale and as a response to multiple factors was evaluated through land use regression (LUR) models via multiple linear regression. For weeklong exposures, VOCs did not exhibit spatial variability by city section or distance from major roads; NO(2) was significantly elevated in a section dominated by traffic and industrial influence versus a residential section. Somewhat in contrast to coarse spatial analyses, LUR results revealed spatial gradients in NO(2) and selected VOCs across the area. The process used to select spatially representative sites for air sampling and the results of coarse and fine spatial variability of air pollutants provide insights that may guide future air quality studies in assessing intra-urban gradients.

  10. Normative data for a battery of free recall, cued recall and recognition tests in the elderly Italian population.

    PubMed

    Coluccia, Emanuele; Gamboz, Nadia; Brandimonte, Maria A

    2011-12-01

    The present study aimed to provide normative data on a large sample of the elderly Italian population (N = 464; range of age = 49-94; range of education = 3-25) on both the word and the picture versions of a battery of free recall, cued recall, and recognition tests of memory. Results from multiple regression analyses showed that both age and education were significant predictors of performance. Therefore, norms were calculated taking into account these demographic variables. The availability of normative data based on a large sample will allow a more reliable use of the battery for clinical assessment in Italian-speaking dementia population.

  11. Evaluation of the laboratory mouse model for screening topical mosquito repellents.

    PubMed

    Rutledge, L C; Gupta, R K; Wirtz, R A; Buescher, M D

    1994-12-01

    Eight commercial repellents were tested against Aedes aegypti 0 and 4 h after application in serial dilution to volunteers and laboratory mice. Results were analyzed by multiple regression of percentage of biting (probit scale) on dose (logarithmic scale) and time. Empirical correction terms for conversion of values obtained in tests on mice to values expected in tests on human volunteers were calculated from data obtained on 4 repellents and evaluated with data obtained on 4 others. Corrected values from tests on mice did not differ significantly from values obtained in tests on volunteers. Test materials used in the study were dimethyl phthalate, butopyronoxyl, butoxy polypropylene glycol, MGK Repellent 11, deet, ethyl hexanediol, Citronyl, and dibutyl phthalate.

  12. Imidazole derivatives as angiotensin II AT1 receptor blockers: Benchmarks, drug-like calculations and quantitative structure-activity relationships modeling

    NASA Astrophysics Data System (ADS)

    Alloui, Mebarka; Belaidi, Salah; Othmani, Hasna; Jaidane, Nejm-Eddine; Hochlaf, Majdi

    2018-03-01

    We performed benchmark studies on the molecular geometry, electron properties and vibrational analysis of imidazole using semi-empirical, density functional theory and post Hartree-Fock methods. These studies validated the use of AM1 for the treatment of larger systems. Then, we treated the structural, physical and chemical relationships for a series of imidazole derivatives acting as angiotensin II AT1 receptor blockers using AM1. QSAR studies were done for these imidazole derivatives using a combination of various physicochemical descriptors. A multiple linear regression procedure was used to design the relationships between molecular descriptor and the activity of imidazole derivatives. Results validate the derived QSAR model.

  13. Suicidal ideation and Attempts in North American School-Based Surveys

    PubMed Central

    Saewyc, Elizabeth M.; Skay, Carol L.; Hynds, Patricia; Pettingell, Sandra; Bearinger, Linda H.; Resnick, Michael D.; Reis, Elizabeth

    2008-01-01

    This study explored the prevalence, disparity, and cohort trends in suicidality among bisexual teens vs. heterosexual and gay/lesbian peers in 9 population-based high school surveys in Canada and the U.S. Multivariate logistic regressions were used to calculate age-adjusted odds ratios separately by gender; 95% confidence intervals tested cohort trends where surveys were repeated over multiple years. Results showed remarkable consistency: bisexual youth reported higher odds of recent suicidal ideation and attempts vs. heterosexual peers, with increasing odds in most surveys over the past decade. Results compared to gay and lesbian peers were mixed, with varying gender differences in prevalence and disparity trends in the different regions. PMID:19835039

  14. Temporal and spatial trends in nutrient and sediment loading to Lake Tahoe, California-Nevada, USA

    USGS Publications Warehouse

    Coats, Robert; Lewis, Jack; Alvarez, Nancy L.; Arneson, Patricia

    2016-01-01

    Since 1980, the Lake Tahoe Interagency Monitoring Program (LTIMP) has provided stream-discharge and water quality data—nitrogen (N), phosphorus (P), and suspended sediment—at more than 20 stations in Lake Tahoe Basin streams. To characterize the temporal and spatial patterns in nutrient and sediment loading to the lake, and improve the usefulness of the program and the existing database, we have (1) identified and corrected for sources of bias in the water quality database; (2) generated synthetic datasets for sediments and nutrients, and resampled to compare the accuracy and precision of different load calculation models; (3) using the best models, recalculated total annual loads over the period of record; (4) regressed total loads against total annual and annual maximum daily discharge, and tested for time trends in the residuals; (5) compared loads for different forms of N and P; and (6) tested constituent loads against land use-land cover (LULC) variables using multiple regression. The results show (1) N and P loads are dominated by organic N and particulate P; (2) there are significant long-term downward trends in some constituent loads of some streams; and (3) anthropogenic impervious surface is the most important LULC variable influencing water quality in basin streams. Many of our recommendations for changes in water quality monitoring and load calculation methods have been adopted by the LTIMP.

  15. Validation of a food frequency questionnaire measurement of dietary acrylamide intake using hemoglobin adducts of acrylamide and glycidamide

    PubMed Central

    Wilson, Kathryn M.; Vesper, Hubert W.; Tocco, Paula; Sampson, Laura; Rosén, Johan; Hellenäs, Karl-Erik; Törnqvist, Margareta; Willett, Walter C.

    2011-01-01

    Objective Acrylamide, a probable human carcinogen, is formed during high-heat cooking of many common foods. The validity of food frequency questionnaire (FFQ) measures of acrylamide intake has not been established. We assessed the validity of acrylamide intake calculated from an FFQ using a biomarker of acrylamide exposure. Methods We calculated acrylamide intake from an FFQ in the Nurses' Health Study II. We measured hemoglobin adducts of acrylamide and its metabolite, glycidamide, in a random sample of 296 women. Correlation and regression analyses were used to assess the relationship between acrylamide intake and adducts. Results The correlation between acrylamide intake and the sum of acrylamide and glycidamide adducts was 0.31 (95% CI: 0.20 – 0.41), adjusted for laboratory batch, energy intake, and age. Further adjustment for BMI, alcohol intake, and correction for random within-person measurement error in adducts gave a correlation of 0.34 (CI: 0.23 – 0.45). The intraclass correlation coefficient for the sum of adducts was 0.77 in blood samples collected 1 to 3 years apart in a subset of 45 women. Intake of several foods significantly predicted adducts in multiple regression. Conclusions Acrylamide intake and hemoglobin adducts of acrylamide and glycidamide were moderately correlated. Within-person consistency in adducts was high over time. PMID:18855107

  16. The influence of changes in land use and landscape patterns on soil erosion in a watershed.

    PubMed

    Zhang, Shanghong; Fan, Weiwei; Li, Yueqiang; Yi, Yujun

    2017-01-01

    It is very important to have a good understanding of the relation between soil erosion and landscape patterns so that soil and water conservation in river basins can be optimized. In this study, this relationship was explored, using the Liusha River Watershed, China, as a case study. A distributed water and sediment model based on the Soil and Water Assessment Tool (SWAT) was developed to simulate soil erosion from different land use types in each sub-basin of the Liusha River Watershed. Observed runoff and sediment data from 1985 to 2005 and land use maps from 1986, 1995, and 2000 were used to calibrate and validate the model. The erosion modulus for each sub-basin was calculated from SWAT model results using the different land use maps and 12 landscape indices were chosen and calculated to describe the land use in each sub-basin for the different years. The variations in instead of the absolute amounts of the erosion modulus and the landscape indices for each sub-basin were used as the dependent and independent variables, respectively, for the regression equations derived from multiple linear regression. The results indicated that the variations in the erosion modulus were closely related to changes in the large patch index, patch cohesion index, modified Simpson's evenness index, and the aggregation index. From the regression equation and the corresponding landscape indices, it was found that watershed erosion can be reduced by decreasing the physical connectivity between patches, improving the evenness of the landscape patch types, enriching landscape types, and enhancing the degree of aggregation between the landscape patches. These findings will be useful for water and soil conservation and for optimizing the management of watershed landscapes. Copyright © 2016 Elsevier B.V. All rights reserved.

  17. A novel simple QSAR model for the prediction of anti-HIV activity using multiple linear regression analysis.

    PubMed

    Afantitis, Antreas; Melagraki, Georgia; Sarimveis, Haralambos; Koutentis, Panayiotis A; Markopoulos, John; Igglessi-Markopoulou, Olga

    2006-08-01

    A quantitative-structure activity relationship was obtained by applying Multiple Linear Regression Analysis to a series of 80 1-[2-hydroxyethoxy-methyl]-6-(phenylthio) thymine (HEPT) derivatives with significant anti-HIV activity. For the selection of the best among 37 different descriptors, the Elimination Selection Stepwise Regression Method (ES-SWR) was utilized. The resulting QSAR model (R (2) (CV) = 0.8160; S (PRESS) = 0.5680) proved to be very accurate both in training and predictive stages.

  18. Wavelet regression model in forecasting crude oil price

    NASA Astrophysics Data System (ADS)

    Hamid, Mohd Helmie; Shabri, Ani

    2017-05-01

    This study presents the performance of wavelet multiple linear regression (WMLR) technique in daily crude oil forecasting. WMLR model was developed by integrating the discrete wavelet transform (DWT) and multiple linear regression (MLR) model. The original time series was decomposed to sub-time series with different scales by wavelet theory. Correlation analysis was conducted to assist in the selection of optimal decomposed components as inputs for the WMLR model. The daily WTI crude oil price series has been used in this study to test the prediction capability of the proposed model. The forecasting performance of WMLR model were also compared with regular multiple linear regression (MLR), Autoregressive Moving Average (ARIMA) and Generalized Autoregressive Conditional Heteroscedasticity (GARCH) using root mean square errors (RMSE) and mean absolute errors (MAE). Based on the experimental results, it appears that the WMLR model performs better than the other forecasting technique tested in this study.

  19. Multiple regression for physiological data analysis: the problem of multicollinearity.

    PubMed

    Slinker, B K; Glantz, S A

    1985-07-01

    Multiple linear regression, in which several predictor variables are related to a response variable, is a powerful statistical tool for gaining quantitative insight into complex in vivo physiological systems. For these insights to be correct, all predictor variables must be uncorrelated. However, in many physiological experiments the predictor variables cannot be precisely controlled and thus change in parallel (i.e., they are highly correlated). There is a redundancy of information about the response, a situation called multicollinearity, that leads to numerical problems in estimating the parameters in regression equations; the parameters are often of incorrect magnitude or sign or have large standard errors. Although multicollinearity can be avoided with good experimental design, not all interesting physiological questions can be studied without encountering multicollinearity. In these cases various ad hoc procedures have been proposed to mitigate multicollinearity. Although many of these procedures are controversial, they can be helpful in applying multiple linear regression to some physiological problems.

  20. Testing Mediation Using Multiple Regression and Structural Equation Modeling Analyses in Secondary Data

    ERIC Educational Resources Information Center

    Li, Spencer D.

    2011-01-01

    Mediation analysis in child and adolescent development research is possible using large secondary data sets. This article provides an overview of two statistical methods commonly used to test mediated effects in secondary analysis: multiple regression and structural equation modeling (SEM). Two empirical studies are presented to illustrate the…

  1. Conjoint Analysis: A Study of the Effects of Using Person Variables.

    ERIC Educational Resources Information Center

    Fraas, John W.; Newman, Isadore

    Three statistical techniques--conjoint analysis, a multiple linear regression model, and a multiple linear regression model with a surrogate person variable--were used to estimate the relative importance of five university attributes for students in the process of selecting a college. The five attributes include: availability and variety of…

  2. An Exploratory Study of Face-to-Face and Cyberbullying in Sixth Grade Students

    ERIC Educational Resources Information Center

    Accordino, Denise B.; Accordino, Michael P.

    2011-01-01

    In a pilot study, sixth grade students (N = 124) completed a questionnaire assessing students' experience with bullying and cyberbullying, demographic information, quality of parent-child relationship, and ways they have dealt with bullying/cyberbullying in the past. Two multiple regression analyses were conducted. The multiple regression analysis…

  3. The Use of Multiple Regression and Trend Analysis to Understand Enrollment Fluctuations. AIR Forum 1979 Paper.

    ERIC Educational Resources Information Center

    Campbell, S. Duke; Greenberg, Barry

    The development of a predictive equation capable of explaining a significant percentage of enrollment variability at Florida International University is described. A model utilizing trend analysis and a multiple regression approach to enrollment forecasting was adapted to investigate enrollment dynamics at the university. Four independent…

  4. Double Cross-Validation in Multiple Regression: A Method of Estimating the Stability of Results.

    ERIC Educational Resources Information Center

    Rowell, R. Kevin

    In multiple regression analysis, where resulting predictive equation effectiveness is subject to shrinkage, it is especially important to evaluate result replicability. Double cross-validation is an empirical method by which an estimate of invariance or stability can be obtained from research data. A procedure for double cross-validation is…

  5. Determination of water pH using absorption-based optical sensors: evaluation of different calculation methods

    NASA Astrophysics Data System (ADS)

    Wang, Hongliang; Liu, Baohua; Ding, Zhongjun; Wang, Xiangxin

    2017-02-01

    Absorption-based optical sensors have been developed for the determination of water pH. In this paper, based on the preparation of a transparent sol-gel thin film with a phenol red (PR) indicator, several calculation methods, including simple linear regression analysis, quadratic regression analysis and dual-wavelength absorbance ratio analysis, were used to calculate water pH. Results of MSSRR show that dual-wavelength absorbance ratio analysis can improve the calculation accuracy of water pH in long-term measurement.

  6. Bias due to two-stage residual-outcome regression analysis in genetic association studies.

    PubMed

    Demissie, Serkalem; Cupples, L Adrienne

    2011-11-01

    Association studies of risk factors and complex diseases require careful assessment of potential confounding factors. Two-stage regression analysis, sometimes referred to as residual- or adjusted-outcome analysis, has been increasingly used in association studies of single nucleotide polymorphisms (SNPs) and quantitative traits. In this analysis, first, a residual-outcome is calculated from a regression of the outcome variable on covariates and then the relationship between the adjusted-outcome and the SNP is evaluated by a simple linear regression of the adjusted-outcome on the SNP. In this article, we examine the performance of this two-stage analysis as compared with multiple linear regression (MLR) analysis. Our findings show that when a SNP and a covariate are correlated, the two-stage approach results in biased genotypic effect and loss of power. Bias is always toward the null and increases with the squared-correlation between the SNP and the covariate (). For example, for , 0.1, and 0.5, two-stage analysis results in, respectively, 0, 10, and 50% attenuation in the SNP effect. As expected, MLR was always unbiased. Since individual SNPs often show little or no correlation with covariates, a two-stage analysis is expected to perform as well as MLR in many genetic studies; however, it produces considerably different results from MLR and may lead to incorrect conclusions when independent variables are highly correlated. While a useful alternative to MLR under , the two -stage approach has serious limitations. Its use as a simple substitute for MLR should be avoided. © 2011 Wiley Periodicals, Inc.

  7. BAYESIAN LARGE-SCALE MULTIPLE REGRESSION WITH SUMMARY STATISTICS FROM GENOME-WIDE ASSOCIATION STUDIES1

    PubMed Central

    Zhu, Xiang; Stephens, Matthew

    2017-01-01

    Bayesian methods for large-scale multiple regression provide attractive approaches to the analysis of genome-wide association studies (GWAS). For example, they can estimate heritability of complex traits, allowing for both polygenic and sparse models; and by incorporating external genomic data into the priors, they can increase power and yield new biological insights. However, these methods require access to individual genotypes and phenotypes, which are often not easily available. Here we provide a framework for performing these analyses without individual-level data. Specifically, we introduce a “Regression with Summary Statistics” (RSS) likelihood, which relates the multiple regression coefficients to univariate regression results that are often easily available. The RSS likelihood requires estimates of correlations among covariates (SNPs), which also can be obtained from public databases. We perform Bayesian multiple regression analysis by combining the RSS likelihood with previously proposed prior distributions, sampling posteriors by Markov chain Monte Carlo. In a wide range of simulations RSS performs similarly to analyses using the individual data, both for estimating heritability and detecting associations. We apply RSS to a GWAS of human height that contains 253,288 individuals typed at 1.06 million SNPs, for which analyses of individual-level data are practically impossible. Estimates of heritability (52%) are consistent with, but more precise, than previous results using subsets of these data. We also identify many previously unreported loci that show evidence for association with height in our analyses. Software is available at https://github.com/stephenslab/rss. PMID:29399241

  8. An efficient approach to ARMA modeling of biological systems with multiple inputs and delays

    NASA Technical Reports Server (NTRS)

    Perrott, M. H.; Cohen, R. J.

    1996-01-01

    This paper presents a new approach to AutoRegressive Moving Average (ARMA or ARX) modeling which automatically seeks the best model order to represent investigated linear, time invariant systems using their input/output data. The algorithm seeks the ARMA parameterization which accounts for variability in the output of the system due to input activity and contains the fewest number of parameters required to do so. The unique characteristics of the proposed system identification algorithm are its simplicity and efficiency in handling systems with delays and multiple inputs. We present results of applying the algorithm to simulated data and experimental biological data In addition, a technique for assessing the error associated with the impulse responses calculated from estimated ARMA parameterizations is presented. The mapping from ARMA coefficients to impulse response estimates is nonlinear, which complicates any effort to construct confidence bounds for the obtained impulse responses. Here a method for obtaining a linearization of this mapping is derived, which leads to a simple procedure to approximate the confidence bounds.

  9. Do different types of school mathematics development depend on different constellations of numerical versus general cognitive abilities?

    PubMed

    Fuchs, Lynn S; Geary, David C; Compton, Donald L; Fuchs, Douglas; Hamlett, Carol L; Seethaler, Pamela M; Bryant, Joan D; Schatschneider, Christopher

    2010-11-01

    The purpose of this study was to examine the interplay between basic numerical cognition and domain-general abilities (such as working memory) in explaining school mathematics learning. First graders (N = 280; mean age = 5.77 years) were assessed on 2 types of basic numerical cognition, 8 domain-general abilities, procedural calculations, and word problems in fall and then reassessed on procedural calculations and word problems in spring. Development was indexed by latent change scores, and the interplay between numerical and domain-general abilities was analyzed by multiple regression. Results suggest that the development of different types of formal school mathematics depends on different constellations of numerical versus general cognitive abilities. When controlling for 8 domain-general abilities, both aspects of basic numerical cognition were uniquely predictive of procedural calculations and word problems development. Yet, for procedural calculations development, the additional amount of variance explained by the set of domain-general abilities was not significant, and only counting span was uniquely predictive. By contrast, for word problems development, the set of domain-general abilities did provide additional explanatory value, accounting for about the same amount of variance as the basic numerical cognition variables. Language, attentive behavior, nonverbal problem solving, and listening span were uniquely predictive.

  10. Statistical experiments using the multiple regression research for prediction of proper hardness in areas of phosphorus cast-iron brake shoes manufacturing

    NASA Astrophysics Data System (ADS)

    Kiss, I.; Cioată, V. G.; Ratiu, S. A.; Rackov, M.; Penčić, M.

    2018-01-01

    Multivariate research is important in areas of cast-iron brake shoes manufacturing, because many variables interact with each other simultaneously. This article focuses on expressing the multiple linear regression model related to the hardness assurance by the chemical composition of the phosphorous cast irons destined to the brake shoes, having in view that the regression coefficients will illustrate the unrelated contributions of each independent variable towards predicting the dependent variable. In order to settle the multiple correlations between the hardness of the cast-iron brake shoes, and their chemical compositions several regression equations has been proposed. Is searched a mathematical solution which can determine the optimum chemical composition for the hardness desirable values. Starting from the above-mentioned affirmations two new statistical experiments are effectuated related to the values of Phosphorus [P], Manganese [Mn] and Silicon [Si]. Therefore, the regression equations, which describe the mathematical dependency between the above-mentioned elements and the hardness, are determined. As result, several correlation charts will be revealed.

  11. Agility performance in high-level junior basketball players: the predictive value of anthropometrics and power qualities.

    PubMed

    Sisic, Nedim; Jelicic, Mario; Pehar, Miran; Spasic, Miodrag; Sekulic, Damir

    2016-01-01

    In basketball, anthropometric status is an important factor when identifying and selecting talents, while agility is one of the most vital motor performances. The aim of this investigation was to evaluate the influence of anthropometric variables and power capacities on different preplanned agility performances. The participants were 92 high-level, junior-age basketball players (16-17 years of age; 187.6±8.72 cm in body height, 78.40±12.26 kg in body mass), randomly divided into a validation and cross-validation subsample. The predictors set consisted of 16 anthropometric variables, three tests of power-capacities (Sargent-jump, broad-jump and medicine-ball-throw) as predictors. The criteria were three tests of agility: a T-Shape-Test; a Zig-Zag-Test, and a test of running with a 180-degree turn (T180). Forward stepwise multiple regressions were calculated for validation subsamples and then cross-validated. Cross validation included correlations between observed and predicted scores, dependent samples t-test between predicted and observed scores; and Bland Altman graphics. Analysis of the variance identified centres being advanced in most of the anthropometric indices, and medicine-ball-throw (all at P<0.05); with no significant between-position-differences for other studied motor performances. Multiple regression models originally calculated for the validation subsample were then cross-validated, and confirmed for Zig-zag-Test (R of 0.71 and 0.72 for the validation and cross-validation subsample, respectively). Anthropometrics were not strongly related to agility performance, but leg length is found to be negatively associated with performance in basketball-specific agility. Power capacities are confirmed to be an important factor in agility. The results highlighted the importance of sport-specific tests when studying pre-planned agility performance in basketball. The improvement in power capacities will probably result in an improvement in agility in basketball athletes, while anthropometric indices should be used in order to identify those athletes who can achieve superior agility performance.

  12. Are Predictive Equations for Estimating Resting Energy Expenditure Accurate in Asian Indian Male Weightlifters?

    PubMed

    Joseph, Mini; Gupta, Riddhi Das; Prema, L; Inbakumari, Mercy; Thomas, Nihal

    2017-01-01

    The accuracy of existing predictive equations to determine the resting energy expenditure (REE) of professional weightlifters remains scarcely studied. Our study aimed at assessing the REE of male Asian Indian weightlifters with indirect calorimetry and to compare the measured REE (mREE) with published equations. A new equation using potential anthropometric variables to predict REE was also evaluated. REE was measured on 30 male professional weightlifters aged between 17 and 28 years using indirect calorimetry and compared with the eight formulas predicted by Harris-Benedicts, Mifflin-St. Jeor, FAO/WHO/UNU, ICMR, Cunninghams, Owen, Katch-McArdle, and Nelson. Pearson correlation coefficient, intraclass correlation coefficient, and multiple linear regression analysis were carried out to study the agreement between the different methods, association with anthropometric variables, and to formulate a new prediction equation for this population. Pearson correlation coefficients between mREE and the anthropometric variables showed positive significance with suprailiac skinfold thickness, lean body mass (LBM), waist circumference, hip circumference, bone mineral mass, and body mass. All eight predictive equations underestimated the REE of the weightlifters when compared with the mREE. The highest mean difference was 636 kcal/day (Owen, 1986) and the lowest difference was 375 kcal/day (Cunninghams, 1980). Multiple linear regression done stepwise showed that LBM was the only significant determinant of REE in this group of sportspersons. A new equation using LBM as the independent variable for calculating REE was computed. REE for weightlifters = -164.065 + 0.039 (LBM) (confidence interval -1122.984, 794.854]. This new equation reduced the mean difference with mREE by 2.36 + 369.15 kcal/day (standard error = 67.40). The significant finding of this study was that all the prediction equations underestimated the REE. The LBM was the sole determinant of REE in this population. In the absence of indirect calorimetry, the REE equation developed by us using LBM is a better predictor for calculating REE of professional male weightlifters of this region.

  13. Comparison of the color of natural teeth measured by a colorimeter and Shade Vision System.

    PubMed

    Cho, Byeong-Hoon; Lim, Yong-Kyu; Lee, Yong-Keun

    2007-10-01

    The objectives were to measure the difference in the color and color parameters of natural teeth measured by a tristimulus colorimeter (CM, used as a reference) and Shade Vision System (SV), and to determine the influence of color parameters on the color difference between the values measured by two instruments. Color of 12 maxillary and mandibular anterior teeth was measured by CM and SV for 47 volunteers (number of teeth=564). Color parameters such as CIE L*, a* and b* values, chroma and hue angle measured by two instruments were compared. Chroma was calculated as C*ab=(a*2 = b*2)1/2, and hue angle was calculated as h degrees =arctan(b*/a*). The influence of color parameters measured by CM on the color difference (DeltaE*(ab)) between the values measured by two instruments was analyzed with multiple regression analysis (alpha=0.01). Mean DeltaE*(ab) value between the values measured by two instruments was 21.7 (+/-3.7), and the mean difference in lightness (CIE L*) and chroma was 16.2 (+/-3.9) and 13.2 (+/-3.0), respectively. Difference in hue angle was high as 132.7 (+/-53.3) degrees . Except for the hue angle, all the color parameters showed significant correlations and the coefficient of determination (r(2)) was in the range of 0.089-0.478. Based on multiple regression analysis, the standardized partial correlation coefficient (beta) of the included predictors for the color difference was -0.710 for CIE L* and -0.300 for C*(ab) (p<0.01). All the color parameters showed significant but weak correlations except for hue angle. When lightness and chroma of teeth were high, color difference between the values measured by two instruments was small. Clinical accuracy of two instruments should be investigated further.

  14. An analysis of collegiate band directors' exposure to sound pressure levels

    NASA Astrophysics Data System (ADS)

    Roebuck, Nikole Moore

    Noise-induced hearing loss (NIHL) is a significant but unfortunate common occupational hazard. The purpose of the current study was to measure the magnitude of sound pressure levels generated within a collegiate band room and determine if those sound pressure levels are of a magnitude that exceeds the policy standards and recommendations of the Occupational Safety and Health Administration (OSHA), and the National Institute of Occupational Safety and Health (NIOSH). In addition, reverberation times were measured and analyzed in order to determine the appropriateness of acoustical conditions for the band rehearsal environment. Sound pressure measurements were taken from the rehearsal of seven collegiate marching bands. Single sample t test were conducted to compare the sound pressure levels of all bands to the noise exposure standards of OSHA and NIOSH. Multiple regression analysis were conducted and analyzed in order to determine the effect of the band room's conditions on the sound pressure levels and reverberation times. Time weighted averages (TWA), noise percentage doses, and peak levels were also collected. The mean Leq for all band directors was 90.5 dBA. The total accumulated noise percentage dose for all band directors was 77.6% of the maximum allowable daily noise dose under the OSHA standard. The total calculated TWA for all band directors was 88.2% of the maximum allowable daily noise dose under the OSHA standard. The total accumulated noise percentage dose for all band directors was 152.1% of the maximum allowable daily noise dose under the NIOSH standards, and the total calculated TWA for all band directors was 93dBA of the maximum allowable daily noise dose under the NIOSH standard. Multiple regression analysis revealed that the room volume, the level of acoustical treatment and the mean room reverberation time predicted 80% of the variance in sound pressure levels in this study.

  15. Amount of balance necessary for the independence of transfer and stair-climbing in stroke inpatients.

    PubMed

    Fujita, Takaaki; Sato, Atsushi; Ohashi, Yuji; Nishiyama, Kazutaka; Ohashi, Takuro; Yamane, Kazuhiro; Yamamoto, Yuichi; Tsuchiya, Kenji; Otsuki, Koji; Tozato, Fusae

    2018-05-01

    The purpose of this study was to clarify the amount of balance necessary for the independence of transfer and stair-climbing in stroke patients. This study included 111 stroke inpatients. Simple and multiple regression analyses were conducted to establish the association between the FIM ® instrument scores for transfer or stair-climbing and Berg Balance Scale. Furthermore, receiver operating characteristic curves were used to elucidate the amount of balance necessary for the independence of transfer and stair-climbing. Simple and multiple regression analyses showed that the FIM ® instrument scores for transfer and stair-climbing were strongly associated with Berg Balance Scale. On comparison of the independent and supervision-dependent groups, Berg Balance Scale cut-off values for transfer and stair-climbing were 41/40 and 54/53 points, respectively. On comparison of the independent-supervision and dependent groups, the cut-off values for transfer and stair-climbing were 30/29 and 41/40 points, respectively. The calculated cut-off values indicated the amount of balance necessary for the independence of transfer and stair-climbing, with and without supervision, in stroke patients. Berg Balance Scale has a good discriminatory ability and cut-off values are clinically useful to determine the appropriate independence levels of transfer and stair-climbing in hospital wards. Implications for rehabilitation The Berg Balance Scale's (BBS) strong association with transfer and stair-climbing independence and performance indicates that establishing cut-off values is vitally important for the established use of the BBS clinically. The cut-off values calculated herein accurately demonstrate the level of balance necessary for transfer and stair-climbing independence, with and without supervision, in stroke patients. These criteria should be employed clinically for determining the level of independence for transfer and stair-climbing as well as for setting balance training goals aimed at improving transfer and stair-climbing.

  16. [Multiple roles and health among Korean women].

    PubMed

    Cho, Su-Jin; Jang, Soong-Nang; Cho, Sung-Il

    2008-09-01

    Most studies about multiple roles and women's health suggested that combining with paid job, being married and having children was more likely to improve health status than in case of single or traditional roles. We investigated whether there was better health outcome in multiple roles among Korean women coinciding with previous studies of other nations. Data were from the 2005 Korea National Health & Nutritional Examination Survey, a subsample of women aged 25-59 years (N=2,943). Health status was assessed for self-rated poor health, perceived stress and depression, respectively based on one questionnaire item. The age-standardized prevalence of all health outcomes were calculated by role categories and socioeconomic status. Multiple logistic regression was used to assess the association of self rated health, perceived stress, and depression with multiple roles adjusted for age, education, household income, number of children and age of children. Having multiple roles with working role was not associated with better health and psychological wellbeing. Compared to those with traditional roles, employed women more frequently experienced perceived stress, with marital and/or parental roles. Non-working single mothers suffered depression more often than women with traditional roles or other role occupancy. Socioeconomic status indicators were potent independent correlates of self-rated health and perceived stress. Employment of women with other roles did not confer additional health benefit to traditional family responsibility. Juggling of work and family responsibility appeared more stressful than traditional unemployed parental and marital role in Korean women.

  17. Multiple Myeloma and Glyphosate Use: A Re-Analysis of US Agricultural Health Study (AHS) Data

    PubMed Central

    Sorahan, Tom

    2015-01-01

    A previous publication of 57,311 pesticide applicators enrolled in the US Agricultural Health Study (AHS) produced disparate findings in relation to multiple myeloma risks in the period 1993–2001 and ever-use of glyphosate (32 cases of multiple myeloma in the full dataset of 54,315 applicators without adjustment for other variables: rate ratio (RR) 1.1, 95% confidence interval (CI) 0.5 to 2.4; 22 cases of multiple myeloma in restricted dataset of 40,719 applicators with adjustment for other variables: RR 2.6, 95% CI 0.7 to 9.4). It seemed important to determine which result should be preferred. RRs for exposed and non-exposed subjects were calculated using Poisson regression; subjects with missing data were not excluded from the main analyses. Using the full dataset adjusted for age and gender the analysis produced a RR of 1.12 (95% CI 0.50 to 2.49) for ever-use of glyphosate. Additional adjustment for lifestyle factors and use of ten other pesticides had little effect (RR 1.24, 95% CI 0.52 to 2.94). There were no statistically significant trends for multiple myeloma risks in relation to reported cumulative days (or intensity weighted days) of glyphosate use. The doubling of risk reported previously arose from the use of an unrepresentative restricted dataset and analyses of the full dataset provides no convincing evidence in the AHS for a link between multiple myeloma risk and glyphosate use. PMID:25635915

  18. Multiple myeloma and glyphosate use: a re-analysis of US Agricultural Health Study (AHS) data.

    PubMed

    Sorahan, Tom

    2015-01-28

    A previous publication of 57,311 pesticide applicators enrolled in the US Agricultural Health Study (AHS) produced disparate findings in relation to multiple myeloma risks in the period 1993-2001 and ever-use of glyphosate (32 cases of multiple myeloma in the full dataset of 54,315 applicators without adjustment for other variables: rate ratio (RR) 1.1, 95% confidence interval (CI) 0.5 to 2.4; 22 cases of multiple myeloma in restricted dataset of 40,719 applicators with adjustment for other variables: RR 2.6, 95% CI 0.7 to 9.4). It seemed important to determine which result should be preferred. RRs for exposed and non-exposed subjects were calculated using Poisson regression; subjects with missing data were not excluded from the main analyses. Using the full dataset adjusted for age and gender the analysis produced a RR of 1.12 (95% CI 0.50 to 2.49) for ever-use of glyphosate. Additional adjustment for lifestyle factors and use of ten other pesticides had little effect (RR 1.24, 95% CI 0.52 to 2.94). There were no statistically significant trends for multiple myeloma risks in relation to reported cumulative days (or intensity weighted days) of glyphosate use. The doubling of risk reported previously arose from the use of an unrepresentative restricted dataset and analyses of the full dataset provides no convincing evidence in the AHS for a link between multiple myeloma risk and glyphosate use.

  19. Assessing the use of treated waste water for irrigation agricultural lands by using soil quality indices

    NASA Astrophysics Data System (ADS)

    Arcenegui, V.; Morugán, A.; García-Orenes, F.; Zornoza, R.; Mataix-Solera, J.; Navarro, M. A.; Guerrero, C.; Mataix-Beneyto, J.

    2009-04-01

    The use of treated wastewater for the irrigation of agricultural soils is an alternative to utilizing better-quality water, especially in semiarid regions where water shortage is a very serious problem. However, this practise can modify the soil equilibrium and affect its quality. In this work two soil quality indices (models) are used to evaluate the effects of long-term irrigation with treated wastewater in soil. The models were developed studying different soil properties in undisturbed forest soils in SE Spain, and the relationships between soil parameters were established using multiple linear regressions. Model 1, that explained 92% of the variance in soil organic carbon (SOC) showed that the SOC can be calculated by the linear combination of 6 physical, chemical and biochemical properties (acid phosphatase, water holding capacity (WHC), electrical conductivity (EC), available phosphorus (P), cation exchange capacity (CEC) and aggregate stability (AS)). Model 2 explains 89% of the SOC variance, which can be calculated by means of 7 chemical and biochemical properties (urease, phosphatase, and

  20. Factors associated with mouth breathing in children with -developmental -disabilities.

    PubMed

    de Castilho, Lia Silva; Abreu, Mauro Henrique Nogueira Guimarães; de Oliveira, Renata Batista; Souza E Silva, Maria Elisa; Resende, Vera Lúcia Silva

    2016-01-01

    To investigate the prevalence and factors associated with mouth breathing among patients with developmental disabilities of a dental service. We analyzed 408 dental records. Mouth breathing was reported by the patients' parents and from direct observation. Other variables were as -follows: history of asthma, bronchitis, palate shape, pacifier use, thumb -sucking, nail biting, use of medications, gastroesophageal reflux, bruxism, gender, age, and diagnosis of the patient. Statistical analysis included descriptive analysis with ratio calculation and multiple logistic regression. Variables with p < 0.25 were included in the model to estimate the adjusted OR (95% CI), calculated by the forward stepwise method. Variables with p ​​< 0.05 were kept in the model. Being male (p = 0.016) and use of centrally acting drugs (p = 0.001) were the variables that remained in the model. Among patients with -developmental disabilities, boys and psychotropic drug users had a greater chance of being mouth breathers. © 2016 Special Care Dentistry Association and Wiley Periodicals, Inc.

  1. [Correlation of retinol binding protein 4 with 
metabolic indexes of glucose and 
lipid, bile cholesterol saturation index].

    PubMed

    Wang, Wen; Li, Nianfeng

    2015-06-01

    To measure retinol binding protein 4 (RBP4) levels in serum and bile and to analyze their relationship with insulin resistance, dyslipidemia or cholesterol saturation index (CSI).
 A total of 60 patients with gallstone were divided into a diabetes group (n=30) and a control group (n=30). The concentrations of RBP4 in serum and bile were detected by enzyme-linked immunosorbent assay (ELISA). Enzyme colorimetric method was used to measure the concentration of biliary cholesterol, bile acid and phospholipid. Biliary CSI was calculated by Carey table. Partial correlation and multiple linear regression analysis were used to evaluate the correlation between the RBP4 levels in serum or bile and the above indexes.
 The RBP4 concentrations in serum and bile in the diabetes group were significantly elevated compared with those in the control group (both P<0.01). There was no significant difference in the serum total bile acid (TBA), serum triglyceride (TG), serum high-density lipoprotein (HDL), bile TBA, bile total cholesterol (TC) , bile phospholipids and bile CSI between the 2 groups (all P>0.05); but the serum TC, low density lipoprotein (LDL), fasting blood glucose (FBG), fasting insulin (FINS), and homeostasis model assessment for insulin resistance (HOMA-IR) in the diabetes group were significantly increased compared to those in the control group (all P<0.05). The partial correlation analysis, which was adjusted by age, showed that the bile RBP4 was positively correlated with body mass index (BMI), waist circumference (WC), FINS, FBG, TC, LDL and HOMA-IR (r=0.283, 0.405, 0.685, 0.667, 0.553, 0.424 and 0.735, respectively), and the serum RBP4 was also positively correlated with the WC, FINS, FBG, TC, LDL and HOMA-IR (r=0.317, 0.734, 0.609, 0.528, 0.386 and 0.751, respectively). Stepwise multivariate linear regression analysis suggested that the HOMA-IR, BMI and WC were independently correlated with the level of bile RBP4 (multiple regression equation: Ybile RBP4=2.372XHOMA-IR+0.420XBMI+0.178XWC-26.813), and the serum RBP4 level was correlated with the HOMA-IR and WC independently (multiple regression equation: Yserum RBP4=2.832XHOMA-IR +0.235XWC-20.128). Multiple regression equations showed that HOMA-IR was the strongest correlation factor with RBP4.
 RBP4 concentrations in serum and bile in the diabetes group are significantly higher than those in the control group. HOMA-IR, BMI and WC are independently correlated with the level of bile RBP4. HOMA-IR and WC are independently correlated with the serum RBP4 level. HOMA-IR is the strongest correlation factor with RBP4. RBP4 might play an important role in the course of gallstone formation in Type 2 diabetes mellitus.

  2. Additive effects prevail: The response of biota to multiple stressors in an intensively monitored watershed.

    PubMed

    Gieswein, Alexander; Hering, Daniel; Feld, Christian K

    2017-09-01

    Freshwater ecosystems are impacted by a range of stressors arising from diverse human-caused land and water uses. Identifying the relative importance of single stressors and understanding how multiple stressors interact and jointly affect biology is crucial for River Basin Management. This study addressed multiple human-induced stressors and their effects on the aquatic flora and fauna based on data from standard WFD monitoring schemes. For altogether 1095 sites within a mountainous catchment, we used 12 stressor variables covering three different stressor groups: riparian land use, physical habitat quality and nutrient enrichment. Twenty-one biological metrics calculated from taxa lists of three organism groups (fish, benthic invertebrates and aquatic macrophytes) served as response variables. Stressor and response variables were subjected to Boosted Regression Tree (BRT) analysis to identify stressor hierarchy and stressor interactions and subsequently to Generalised Linear Regression Modelling (GLM) to quantify the stressors standardised effect size. Our results show that riverine habitat degradation was the dominant stressor group for the river fauna, notably the bed physical habitat structure. Overall, the explained variation in benthic invertebrate metrics was higher than it was in fish and macrophyte metrics. In particular, general integrative (aggregate) metrics such as % Ephemeroptera, Plecoptera and Trichoptera (EPT) taxa performed better than ecological traits (e.g. % feeding types). Overall, additive stressor effects dominated, while significant and meaningful stressor interactions were generally rare and weak. We concluded that given the type of stressor and ecological response variables addressed in this study, river basin managers do not need to bother much about complex stressor interactions, but can focus on the prevailing stressors according to the hierarchy identified. Copyright © 2017 Elsevier B.V. All rights reserved.

  3. Estimation of PM2.5 and PM10 using ground-based AOD measurements during KORUS-AQ campaign

    NASA Astrophysics Data System (ADS)

    Koo, J. H.; Kim, J.; Kim, S.; Go, S.; Lee, S.; Lee, H.; Mok, J.; Hong, J.; Lee, J.; Eck, T. F.; Holben, B. N.

    2017-12-01

    During the KORUS-AQ campaign (2 May - 12 June, 2016), aerosol optical depth (AOD) was obtained at multiple channels using various ground-based instruments at Yonsei University, Seoul: AERONET sunphotometer, SKYNET skyradiometer, Brewer spectrophotometer, and multi-filter rotating shadowband radiometer (MFRSR). At the same location, planetary boundary layer (PBL) height and vertical profile of backscattering coefficients also can be obtained based on the celiometer measurements. Using celiometer products and various AODs, we try to estimate the amount of particular matter (PM2.5 and PM10) and validate with in-situ surface PM2.5 and PM10 measurements from AIRKOREA network. Direct comparison between PM2.5 and AOD reveals that the ultraviolet(UV) channel AOD has better correlations, due to the higher sensitivity of short wavelength to the fine-mode particle. In contrast, PM10 shows the highest correlation with the near-infrared(NIR) AOD. Next, we extract the boundary-layer portion of AOD using either PBL height or vertical profile of backscattering coefficients to compare with PM2.5 and PM10. Both results enhance the correlation, but consideration of weighting factor calculated from backscattering coefficients shows larger contribution to the correlation increase. Finally, we performed the multiple linear regression to estimate PM2.5 and PM10 using AODs. Consideration of meteorology (temperature, wind speed, and relative humidity) can enhance the correlation and also O3 and NO2 consideration highly contributes to the high correlation. This finding implies the importance to consider the ambient condition of secondary aerosol formation related to the PM2.5 variation. Multiple regression model finally finds the correlation 0.7-0.8, and diminishes the wavelength-dependent correlation patterns.

  4. Estimating Causal Effects of Education Interventions Using a Two-Rating Regression Discontinuity Design: Lessons from a Simulation Study

    ERIC Educational Resources Information Center

    Porter, Kristin E.; Reardon, Sean F.; Unlu, Fatih; Bloom, Howard S.; Robinson-Cimpian, Joseph P.

    2014-01-01

    A valuable extension of the single-rating regression discontinuity design (RDD) is a multiple-rating RDD (MRRDD). To date, four main methods have been used to estimate average treatment effects at the multiple treatment frontiers of an MRRDD: the "surface" method, the "frontier" method, the "binding-score" method, and…

  5. Development of Multiple Regression Equations To Predict Fourth Graders' Achievement in Reading and Selected Content Areas.

    ERIC Educational Resources Information Center

    Hafner, Lawrence E.

    A study developed a multiple regression prediction equation for each of six selected achievement variables in a popular standardized test of achievement. Subjects, 42 fourth-grade pupils randomly selected across several classes in a large elementary school in a north Florida city, were administered several standardized tests to determine predictor…

  6. Physical and Cognitive-Affective Factors Associated with Fatigue in Individuals with Fibromyalgia: A Multiple Regression Analysis

    ERIC Educational Resources Information Center

    Muller, Veronica; Brooks, Jessica; Tu, Wei-Mo; Moser, Erin; Lo, Chu-Ling; Chan, Fong

    2015-01-01

    Purpose: The main objective of this study was to determine the extent to which physical and cognitive-affective factors are associated with fibromyalgia (FM) fatigue. Method: A quantitative descriptive design using correlation techniques and multiple regression analysis. The participants consisted of 302 members of the National Fibromyalgia &…

  7. Latent Variable Regression 4-Level Hierarchical Model Using Multisite Multiple-Cohorts Longitudinal Data. CRESST Report 801

    ERIC Educational Resources Information Center

    Choi, Kilchan

    2011-01-01

    This report explores a new latent variable regression 4-level hierarchical model for monitoring school performance over time using multisite multiple-cohorts longitudinal data. This kind of data set has a 4-level hierarchical structure: time-series observation nested within students who are nested within different cohorts of students. These…

  8. What Is Wrong with ANOVA and Multiple Regression? Analyzing Sentence Reading Times with Hierarchical Linear Models

    ERIC Educational Resources Information Center

    Richter, Tobias

    2006-01-01

    Most reading time studies using naturalistic texts yield data sets characterized by a multilevel structure: Sentences (sentence level) are nested within persons (person level). In contrast to analysis of variance and multiple regression techniques, hierarchical linear models take the multilevel structure of reading time data into account. They…

  9. Some Applied Research Concerns Using Multiple Linear Regression Analysis.

    ERIC Educational Resources Information Center

    Newman, Isadore; Fraas, John W.

    The intention of this paper is to provide an overall reference on how a researcher can apply multiple linear regression in order to utilize the advantages that it has to offer. The advantages and some concerns expressed about the technique are examined. A number of practical ways by which researchers can deal with such concerns as…

  10. A Spreadsheet Tool for Learning the Multiple Regression F-Test, T-Tests, and Multicollinearity

    ERIC Educational Resources Information Center

    Martin, David

    2008-01-01

    This note presents a spreadsheet tool that allows teachers the opportunity to guide students towards answering on their own questions related to the multiple regression F-test, the t-tests, and multicollinearity. The note demonstrates approaches for using the spreadsheet that might be appropriate for three different levels of statistics classes,…

  11. Predicting Final GPA of Graduate School Students: Comparing Artificial Neural Networking and Simultaneous Multiple Regression

    ERIC Educational Resources Information Center

    Anderson, Joan L.

    2006-01-01

    Data from graduate student applications at a large Western university were used to determine which factors were the best predictors of success in graduate school, as defined by cumulative graduate grade point average. Two statistical models were employed and compared: artificial neural networking and simultaneous multiple regression. Both models…

  12. Computational Tools for Probing Interactions in Multiple Linear Regression, Multilevel Modeling, and Latent Curve Analysis

    ERIC Educational Resources Information Center

    Preacher, Kristopher J.; Curran, Patrick J.; Bauer, Daniel J.

    2006-01-01

    Simple slopes, regions of significance, and confidence bands are commonly used to evaluate interactions in multiple linear regression (MLR) models, and the use of these techniques has recently been extended to multilevel or hierarchical linear modeling (HLM) and latent curve analysis (LCA). However, conducting these tests and plotting the…

  13. Regression Models for the Analysis of Longitudinal Gaussian Data from Multiple Sources

    PubMed Central

    O’Brien, Liam M.; Fitzmaurice, Garrett M.

    2006-01-01

    We present a regression model for the joint analysis of longitudinal multiple source Gaussian data. Longitudinal multiple source data arise when repeated measurements are taken from two or more sources, and each source provides a measure of the same underlying variable and on the same scale. This type of data generally produces a relatively large number of observations per subject; thus estimation of an unstructured covariance matrix often may not be possible. We consider two methods by which parsimonious models for the covariance can be obtained for longitudinal multiple source data. The methods are illustrated with an example of multiple informant data arising from a longitudinal interventional trial in psychiatry. PMID:15726666

  14. Differences Between Gait on Stairs and Flat Surfaces in Relation to Fall Risk and Future Falls.

    PubMed

    Wang, Kejia; Delbaere, Kim; Brodie, Matthew A D; Lovell, Nigel H; Kark, Lauren; Lord, Stephen R; Redmond, Stephen J

    2017-11-01

    We used body-worn inertial sensors to quantify differences in semi-free-living gait between stairs and on normal flat ground in older adults, and investigated the utility of assessing gait on these terrains for predicting the occurrence of multiple falls. Eighty-two community-dwelling older adults wore two inertial sensors, on the lower back and the right ankle, during several bouts of walking on flat surfaces and up and down stairs, in between rests and activities of daily living. Derived from the vertical acceleration at the lower back, step rate was calculated from the signal's fundamental frequency. Step rate variability was the width of this fundamental frequency peak from the signal's power spectral density. Movement vigor was calculated at both body locations from the signal variance. Partial Spearman correlations between gait parameters and physiological fall risk factors (components from the Physiological Profile Assessment) were calculated while controlling for age and gender. Overall, anteroposterior vigor at the lower back in stair descent was lower in subjects with longer reaction times. Older adults walked more slowly on stairs, but they were not significantly slower on flat surfaces. Using logistic regression, faster step rate in stair descent was associated with multiple prospective falls over 12 months. No significant associations were shown from gait parameters derived during walking upstairs or on flat surfaces. These results suggest that stair descent gait may provide more insight into fall risk than regular walking and stair ascent, and that further sensor-based investigation into unsupervised gait on different terrains would be valuable.

  15. Interpretation of commonly used statistical regression models.

    PubMed

    Kasza, Jessica; Wolfe, Rory

    2014-01-01

    A review of some regression models commonly used in respiratory health applications is provided in this article. Simple linear regression, multiple linear regression, logistic regression and ordinal logistic regression are considered. The focus of this article is on the interpretation of the regression coefficients of each model, which are illustrated through the application of these models to a respiratory health research study. © 2013 The Authors. Respirology © 2013 Asian Pacific Society of Respirology.

  16. Applied Multiple Linear Regression: A General Research Strategy

    ERIC Educational Resources Information Center

    Smith, Brandon B.

    1969-01-01

    Illustrates some of the basic concepts and procedures for using regression analysis in experimental design, analysis of variance, analysis of covariance, and curvilinear regression. Applications to evaluation of instruction and vocational education programs are illustrated. (GR)

  17. Total Ambient Dose Equivalent Buildup Factor Determination for Nbs04 Concrete.

    PubMed

    Duckic, Paulina; Hayes, Robert B

    2018-06-01

    Buildup factors are dimensionless multiplicative factors required by the point kernel method to account for scattered radiation through a shielding material. The accuracy of the point kernel method is strongly affected by the correspondence of analyzed parameters to experimental configurations, which is attempted to be simplified here. The point kernel method has not been found to have widespread practical use for neutron shielding calculations due to the complex neutron transport behavior through shielding materials (i.e. the variety of interaction mechanisms that neutrons may undergo while traversing the shield) as well as non-linear neutron total cross section energy dependence. In this work, total ambient dose buildup factors for NBS04 concrete are calculated in terms of neutron and secondary gamma ray transmission factors. The neutron and secondary gamma ray transmission factors are calculated using MCNP6™ code with updated cross sections. Both transmission factors and buildup factors are given in a tabulated form. Practical use of neutron transmission and buildup factors warrants rigorously calculated results with all associated uncertainties. In this work, sensitivity analysis of neutron transmission factors and total buildup factors with varying water content has been conducted. The analysis showed significant impact of varying water content in concrete on both neutron transmission factors and total buildup factors. Finally, support vector regression, a machine learning technique, has been engaged to make a model based on the calculated data for calculation of the buildup factors. The developed model can predict most of the data with 20% relative error.

  18. Risk of multiple myeloma following medication use and medical conditions: a case-control study in Connecticut women.

    PubMed

    Landgren, Ola; Zhang, Yawei; Zahm, Sheila Hoar; Inskip, Peter; Zheng, Tongzhang; Baris, Dalsu

    2006-12-01

    Certain commonly used drugs and medical conditions characterized by chronic immune dysfunction and/or antigen stimulation have been suggested to affect important pathways in multiple myeloma tumor cell growth and survival. We conducted a population-based case-control study to investigate the role of medical history in the etiology of multiple myeloma among Connecticut women. A total of 179 incident multiple myeloma cases (21-84 years, diagnosed 1996-2002) and 691 population-based controls was included in this study. Information on medical conditions, medications, and medical radiation was obtained by in-person interviews. We calculated odds ratios (OR) as measures of relative risks using logistic regression models. A reduced multiple myeloma risk was found among women who had used antilipid statin therapy [OR, 0.4; 95% confidence interval (95% CI), 0.2-0.8] or estrogen replacement therapy (OR, 0.6; 95% CI, 0.4-0.99) or who had a medical history of allergy (OR, 0.4; 95% CI, 0.3-0.7), scarlet fever (OR, 0.5; 95% CI, 0.2-0.9), or bursitis (OR, 0.4; 95% CI, 0.2-0.7). An increased risk of multiple myeloma was found among women who used prednisone (OR, 5.1; 95% CI, 1.8-14.4), insulin (OR, 3.1; 95% CI, 1.1-9.0), or gout medication (OR, 6.7; 95% CI, 1.2-38.0). If our results are confirmed, mechanistic studies examining how prior use of insulin, prednisone, and, perhaps, gout medication might promote increased occurrence of multiple myeloma and how antilipid statins, estrogen replacement therapy, and certain medical conditions might protect against multiple myeloma may provide insights to the as yet unknown etiology of multiple myeloma.

  19. Population heterogeneity in the salience of multiple risk factors for adolescent delinquency.

    PubMed

    Lanza, Stephanie T; Cooper, Brittany R; Bray, Bethany C

    2014-03-01

    To present mixture regression analysis as an alternative to more standard regression analysis for predicting adolescent delinquency. We demonstrate how mixture regression analysis allows for the identification of population subgroups defined by the salience of multiple risk factors. We identified population subgroups (i.e., latent classes) of individuals based on their coefficients in a regression model predicting adolescent delinquency from eight previously established risk indices drawn from the community, school, family, peer, and individual levels. The study included N = 37,763 10th-grade adolescents who participated in the Communities That Care Youth Survey. Standard, zero-inflated, and mixture Poisson and negative binomial regression models were considered. Standard and mixture negative binomial regression models were selected as optimal. The five-class regression model was interpreted based on the class-specific regression coefficients, indicating that risk factors had varying salience across classes of adolescents. Standard regression showed that all risk factors were significantly associated with delinquency. Mixture regression provided more nuanced information, suggesting a unique set of risk factors that were salient for different subgroups of adolescents. Implications for the design of subgroup-specific interventions are discussed. Copyright © 2014 Society for Adolescent Health and Medicine. Published by Elsevier Inc. All rights reserved.

  20. Costs of Chronic Diseases at the State Level: The Chronic Disease Cost Calculator

    PubMed Central

    Murphy, Louise B.; Khavjou, Olga A.; Li, Rui; Maylahn, Christopher M.; Tangka, Florence K.; Nurmagambetov, Tursynbek A.; Ekwueme, Donatus U.; Nwaise, Isaac; Chapman, Daniel P.; Orenstein, Diane

    2015-01-01

    Introduction Many studies have estimated national chronic disease costs, but state-level estimates are limited. The Centers for Disease Control and Prevention developed the Chronic Disease Cost Calculator (CDCC), which estimates state-level costs for arthritis, asthma, cancer, congestive heart failure, coronary heart disease, hypertension, stroke, other heart diseases, depression, and diabetes. Methods Using publicly available and restricted secondary data from multiple national data sets from 2004 through 2008, disease-attributable annual per-person medical and absenteeism costs were estimated. Total state medical and absenteeism costs were derived by multiplying per person costs from regressions by the number of people in the state treated for each disease. Medical costs were estimated for all payers and separately for Medicaid, Medicare, and private insurers. Projected medical costs for all payers (2010 through 2020) were calculated using medical costs and projected state population counts. Results Median state-specific medical costs ranged from $410 million (asthma) to $1.8 billion (diabetes); median absenteeism costs ranged from $5 million (congestive heart failure) to $217 million (arthritis). Conclusion CDCC provides methodologically rigorous chronic disease cost estimates. These estimates highlight possible areas of cost savings achievable through targeted prevention efforts or research into new interventions and treatments. PMID:26334712

  1. HIV-related ocular microangiopathic syndrome and color contrast sensitivity.

    PubMed

    Geier, S A; Hammel, G; Bogner, J R; Kronawitter, U; Berninger, T; Goebel, F D

    1994-06-01

    Color vision deficits in patients with acquired immunodeficiency syndrome (AIDS) or human immunodeficiency virus (HIV) disease were reported, and a retinal pathogenic mechanism was proposed. The purpose of this study was to evaluate the association of color vision deficits with HIV-related retinal microangiopathy. A computer graphics system was used to measure protan, deutan, and tritan color contrast sensitivity (CCS) thresholds in 60 HIV-infected patients. Retinal microangiopathy was measured by counting the number of cotton-wool spots, and conjunctival blood-flow sludging was determined. Additional predictors were CD4+ count, age, time on aerosolized pentamidine, time on zidovudine, and Walter Reed staging. The relative influence of each predictor was calculated by stepwise multiple regression analysis (inclusion criterion; incremental P value = < 0.05) using data for the right eyes (RE). The results were validated by using data for the left eyes (LE) and both eyes (BE). The only included predictors in multiple regression analyses for the RE were number of cotton-wool spots (tritan: R = .70; deutan: R = .46; and protan: R = .58; P < .0001 for all axes) and age (tritan: increment of R [Ri] = .05, P = .002; deutan: Ri = .10, P = .004; and protan: Ri = .05, P = .002). The predictors time on zidovudine (Ri = .05, P = .002) and Walter Reed staging (Ri = .03, P = .01) were additionally included in multiple regression analysis for tritan LE. The results for deutan LE were comparable to those for the RE. In the analysis for protan LE, the only included predictor was number of cotton-wool spots. In the analyses for BE, no further predictors were included. The predictors Walter Reed staging and CD4+ count showed a significant association with all three criteria in univariate analysis. Additionally, tritan CCS was significantly associated with conjunctival blood-flow sludging. CCS deficits in patients with HIV disease are primarily associated with the number of cotton-wool spots. Results of this study are in accordance with the hypothesis that CCS deficits are in a relevant part caused by neuroretinal damage secondary to HIV-related microangiopathy.

  2. Estimating Causal Effects of Education Interventions Using a Two-Rating Regression Discontinuity Design: Lessons from a Simulation Study and an Application

    ERIC Educational Resources Information Center

    Porter, Kristin E.; Reardon, Sean F.; Unlu, Fatih; Bloom, Howard S.; Cimpian, Joseph R.

    2017-01-01

    A valuable extension of the single-rating regression discontinuity design (RDD) is a multiple-rating RDD (MRRDD). To date, four main methods have been used to estimate average treatment effects at the multiple treatment frontiers of an MRRDD: the "surface" method, the "frontier" method, the "binding-score" method, and…

  3. How Variables Uncorrelated with the Dependent Variable Can Actually Make Excellent Predictors: The Important Suppressor Variable Case.

    ERIC Educational Resources Information Center

    Woolley, Kristin K.

    Many researchers are unfamiliar with suppressor variables and how they operate in multiple regression analyses. This paper describes the role suppressor variables play in a multiple regression model and provides practical examples that explain how they can change research results. A variable that when added as another predictor increases the total…

  4. Death Anxiety as a Predictor of Posttraumatic Stress Levels among Individuals with Spinal Cord Injuries

    ERIC Educational Resources Information Center

    Martz, Erin

    2004-01-01

    Because the onset of a spinal cord injury may involve a brush with death and because serious injury and disability can act as a reminder of death, death anxiety was examined as a predictor of posttraumatic stress levels among individuals with disabilities. This cross-sectional study used multiple regression and multivariate multiple regression to…

  5. Multicollinearity is a red herring in the search for moderator variables: A guide to interpreting moderated multiple regression models and a critique of Iacobucci, Schneider, Popovich, and Bakamitsos (2016).

    PubMed

    McClelland, Gary H; Irwin, Julie R; Disatnik, David; Sivan, Liron

    2017-02-01

    Multicollinearity is irrelevant to the search for moderator variables, contrary to the implications of Iacobucci, Schneider, Popovich, and Bakamitsos (Behavior Research Methods, 2016, this issue). Multicollinearity is like the red herring in a mystery novel that distracts the statistical detective from the pursuit of a true moderator relationship. We show multicollinearity is completely irrelevant for tests of moderator variables. Furthermore, readers of Iacobucci et al. might be confused by a number of their errors. We note those errors, but more positively, we describe a variety of methods researchers might use to test and interpret their moderated multiple regression models, including two-stage testing, mean-centering, spotlighting, orthogonalizing, and floodlighting without regard to putative issues of multicollinearity. We cite a number of recent studies in the psychological literature in which the researchers used these methods appropriately to test, to interpret, and to report their moderated multiple regression models. We conclude with a set of recommendations for the analysis and reporting of moderated multiple regression that should help researchers better understand their models and facilitate generalizations across studies.

  6. Risk factors for lesions of the knee menisci among workers in South Korea's national parks.

    PubMed

    Shin, Donghee; Youn, Kanwoo; Lee, Eunja; Lee, Myeongjun; Chung, Hweemin; Kim, Deokweon

    2016-01-01

    This study was designed to investigate the prevalence of the menisci lesions in national park workers and work factors affecting this prevalence. The study subjects were 698 workers who worked in 20 Korean national parks in 2014. An orthopedist visited each national park and performed physical examinations. Knee MRI was performed if the McMurray test or Apley test was positive and there was a complaint of pain in knee area. An orthopedist and a radiologist respectively read these images of the menisci using a grading system based on the MRI signals. To calculate the cumulative intensity of trekking of the workers, the mean trail distance, the difficulty of the trail, the tenure at each national parks, and the number of treks per month for each worker from the start of work until the present were investigated. Chi-square tests was performed to see if there were differences in the menisci lesions grade according to the variables. The variables used in the Chi-square test were evaluated using simple logistic regression analysis to get crude odds ratios, and adjusted odds ratios and 95 % confidence intervals were calculated using multivariate logistic regression analysis after establishing three different models according to the adjusted variables. According to the MRI signal grades of menisci, 29 % were grade 0, 11.3 % were grade 1, 46.0 % were grade 2, and 13.7 % were grade 3. The differences in the MRI signal grades of menisci according to age and the intensity of trekking as calculated by the three different methods were statistically significant. Multiple logistic regression analysis was performed for three models. In model 1, there was no statistically significant factor affecting the menisci lesions. In model 2, among the factors affecting the menisci lesions, the OR of a high cumulative intensity of trekking was 4.08 (95 % CI 1.00-16.61), and in model 3, the OR of a high cumulative intensity of trekking was 5.84 (95 % CI 1.09-31.26). The factor that most affected the menisci lesions among the workers in Korean national park was a high cumulative intensity of trekking.

  7. Computer-aided molecular modeling techniques for predicting the stability of drug cyclodextrin inclusion complexes in aqueous solutions

    NASA Astrophysics Data System (ADS)

    Faucci, Maria Teresa; Melani, Fabrizio; Mura, Paola

    2002-06-01

    Molecular modeling was used to investigate factors influencing complex formation between cyclodextrins and guest molecules and predict their stability through a theoretical model based on the search for a correlation between experimental stability constants ( Ks) and some theoretical parameters describing complexation (docking energy, host-guest contact surfaces, intermolecular interaction fields) calculated from complex structures at a minimum conformational energy, obtained through stochastic methods based on molecular dynamic simulations. Naproxen, ibuprofen, ketoprofen and ibuproxam were used as model drug molecules. Multiple Regression Analysis allowed identification of the significant factors for the complex stability. A mathematical model ( r=0.897) related log Ks with complex docking energy and lipophilic molecular fields of cyclodextrin and drug.

  8. Use of principal-component, correlation, and stepwise multiple-regression analyses to investigate selected physical and hydraulic properties of carbonate-rock aquifers

    USGS Publications Warehouse

    Brown, C. Erwin

    1993-01-01

    Correlation analysis in conjunction with principal-component and multiple-regression analyses were applied to laboratory chemical and petrographic data to assess the usefulness of these techniques in evaluating selected physical and hydraulic properties of carbonate-rock aquifers in central Pennsylvania. Correlation and principal-component analyses were used to establish relations and associations among variables, to determine dimensions of property variation of samples, and to filter the variables containing similar information. Principal-component and correlation analyses showed that porosity is related to other measured variables and that permeability is most related to porosity and grain size. Four principal components are found to be significant in explaining the variance of data. Stepwise multiple-regression analysis was used to see how well the measured variables could predict porosity and (or) permeability for this suite of rocks. The variation in permeability and porosity is not totally predicted by the other variables, but the regression is significant at the 5% significance level. ?? 1993.

  9. [Establishment of multiple regression model for virulence factors of Saccharomyces albicans by random amplified polymorphic DNA bands].

    PubMed

    Liu, Qi; Wu, Youcong; Yuan, Youhua; Bai, Li; Niu, Kun

    2011-12-01

    To research the relationship between the virulence factors of Saccharomyces albicans (S. albicans) and the random amplified polymorphic DNA (RAPD) bands of them, and establish the regression model by multiple regression analysis. Extracellular phospholipase, secreted proteinase, ability to generate germ tubes and adhere to oral mucosal cells of 92 strains of S. albicans were measured in vitro; RAPD-polymerase chain reaction (RAPD-PCR) was used to get their bands. Multiple regression for virulence factors of S. albicans and RAPD-PCR bands was established. The extracellular phospholipase activity was associated with 4 RAPD bands: 350, 450, 650 and 1 300 bp (P < 0.05); secreted proteinase activity of S. albicans was associated with 2 bands: 350 and 1 200 bp (P < 0.05); the ability of germ tube produce was associated with 2 bands: 400 and 550 bp (P < 0.05). Some RAPD bands will reflect the virulence factors of S. albicans indirectly. These bands would contain some important messages for regulation of S. albicans virulence factors.

  10. Impact of night-shift work on the prevalence of erosive esophagitis in shipyard male workers.

    PubMed

    Chung, Tae Heum; Lee, Jiho; Kim, Moon Chan

    2016-08-01

    Whether night-shift work is a risk factor for gastroesophageal reflux disease is controversial. The aim of this study was to investigate the association between night-shift work and other factors, and erosive esophagitis. A cross-sectional study with 6040 male shipyard workers was performed. Esophagogastroduodenoscopic examination and a survey about night-shift work status, lifestyle, medical history, educational status, and marital status were conducted in all workers. The odds ratios of erosive esophagitis according to night-shift work status were calculated by using the logistic regression model. The prevalence of erosive esophagitis increased in the night-shift workers [odds ratio, 95 % confidence interval: 1.41 (1.03-1.94)]. According to multiple logistic regression models, night-shift work, obesity, smoking, and alcohol consumption of ≥140 g/week were significant risk factors for erosive esophagitis. By contrast, Helicobacter pylori infection was negatively associated with erosive esophagitis. Night-shift work is suggested to be a risk factor for erosive esophagitis. Avoidance of night-shift work and lifestyle modification should be considered for prevention and management of gastroesophageal reflux disease.

  11. Actual and estimated costs of disposable materials used during surgical procedures.

    PubMed

    Toyabe, Shin-Ichi; Cao, Pengyu; Kurashima, Sachiko; Nakayama, Yukiko; Ishii, Yuko; Hosoyama, Noriko; Akazawa, Kouhei

    2005-07-01

    It is difficult to estimate precisely the costs of disposable materials used during surgical operations. To evaluate the actual costs of disposable materials, we calculated the actual costs of disposable materials used in 59 operations by taking account of costs of all disposable materials used for each operation. The costs of the disposable materials varied significantly from operation to operation (US$ 38-4230 per operation), and the median [25-percentile and 75-percentile] of the sum total of disposable material costs of a single operation was found to be US$ 686 [205 and 993]. Multiple regression analysis with a stepwise regression method showed that costs of disposable materials significantly correlated only with operation time (p<0.001). Based on the results, we propose a simple method for estimating costs of disposable materials by measuring operation time, and we found that the method gives reliable results. Since costs of disposable materials used during surgical operations are considerable, precise estimation of the costs is essential for hospital cost accounting. Our method should be useful for planning hospital administration strategies.

  12. Trace analysis of acids and bases by conductometric titration with multiparametric non-linear regression.

    PubMed

    Coelho, Lúcia H G; Gutz, Ivano G R

    2006-03-15

    A chemometric method for analysis of conductometric titration data was introduced to extend its applicability to lower concentrations and more complex acid-base systems. Auxiliary pH measurements were made during the titration to assist the calculation of the distribution of protonable species on base of known or guessed equilibrium constants. Conductivity values of each ionized or ionizable species possibly present in the sample were introduced in a general equation where the only unknown parameters were the total concentrations of (conjugated) bases and of strong electrolytes not involved in acid-base equilibria. All these concentrations were adjusted by a multiparametric nonlinear regression (NLR) method, based on the Levenberg-Marquardt algorithm. This first conductometric titration method with NLR analysis (CT-NLR) was successfully applied to simulated conductometric titration data and to synthetic samples with multiple components at concentrations as low as those found in rainwater (approximately 10 micromol L(-1)). It was possible to resolve and quantify mixtures containing a strong acid, formic acid, acetic acid, ammonium ion, bicarbonate and inert electrolyte with accuracy of 5% or better.

  13. Use of chemical indicators of beer aging for ex-post checking of storage conditions and prediction of the sensory stability of beer.

    PubMed

    Cejka, Pavel; Culík, Jiří; Horák, Tomáš; Jurková, Marie; Olšovská, Jana

    2013-12-26

    The rate of beer aging is affected by storage conditions including largely time and temperature. Although bottled beer is commonly stored for up to 1 year, sensorial damage of it is quite frequent. Therefore, a method for retrospective determination of temperature of stored beer was developed. The method is based on the determination of selected carbonyl compounds called as "aging indicators", which are formed during beer aging. The aging indicators were determined using GC-MS after precolumn derivatization with O-(2,3,4,5,6-pentaflourobenzyl)hydroxylamine hydrochloride, and their profile was correlated with the development of old flavor evolving under defined conditions (temperature, time) using both a mathematical and statistical apparatus. Three approaches, including calculation from regression graph, multiple linear regression, and neural networks, were employed. The ultimate uncertainty of the method ranged from 3.0 to 11.0 °C depending on the approach used. Furthermore, the assay was extended to include prediction of beer tendency to sensory aging from freshly bottled beer.

  14. Simultaneous multiple non-crossing quantile regression estimation using kernel constraints

    PubMed Central

    Liu, Yufeng; Wu, Yichao

    2011-01-01

    Quantile regression (QR) is a very useful statistical tool for learning the relationship between the response variable and covariates. For many applications, one often needs to estimate multiple conditional quantile functions of the response variable given covariates. Although one can estimate multiple quantiles separately, it is of great interest to estimate them simultaneously. One advantage of simultaneous estimation is that multiple quantiles can share strength among them to gain better estimation accuracy than individually estimated quantile functions. Another important advantage of joint estimation is the feasibility of incorporating simultaneous non-crossing constraints of QR functions. In this paper, we propose a new kernel-based multiple QR estimation technique, namely simultaneous non-crossing quantile regression (SNQR). We use kernel representations for QR functions and apply constraints on the kernel coefficients to avoid crossing. Both unregularised and regularised SNQR techniques are considered. Asymptotic properties such as asymptotic normality of linear SNQR and oracle properties of the sparse linear SNQR are developed. Our numerical results demonstrate the competitive performance of our SNQR over the original individual QR estimation. PMID:22190842

  15. Automated grading system for evaluation of ocular redness associated with dry eye.

    PubMed

    Rodriguez, John D; Johnston, Patrick R; Ousler, George W; Smith, Lisa M; Abelson, Mark B

    2013-01-01

    We have observed that dry eye redness is characterized by a prominence of fine horizontal conjunctival vessels in the exposed ocular surface of the interpalpebral fissure, and have incorporated this feature into the grading of redness in clinical studies of dry eye. To develop an automated method of grading dry eye-associated ocular redness in order to expand on the clinical grading system currently used. Ninety nine images from 26 dry eye subjects were evaluated by five graders using a 0-4 (in 0.5 increments) dry eye redness (Ora Calibra™ Dry Eye Redness Scale [OCDER]) scale. For the automated method, the Opencv computer vision library was used to develop software for calculating redness and horizontal conjunctival vessels (noted as "horizontality"). From original photograph, the region of interest (ROI) was selected manually using the open source ImageJ software. Total average redness intensity (Com-Red) was calculated as a single channel 8-bit image as R - 0.83G - 0.17B, where R, G and B were the respective intensities of the red, green and blue channels. The location of vessels was detected by normalizing the blue channel and selecting pixels with an intensity of less than 97% of the mean. The horizontal component (Com-Hor) was calculated by the first order Sobel derivative in the vertical direction and the score was calculated as the average blue channel image intensity of this vertical derivative. Pearson correlation coefficients, accuracy and concordance correlation coefficients (CCC) were calculated after regression and standardized regression of the dataset. The agreement (both Pearson's and CCC) among investigators using the OCDER scale was 0.67, while the agreement of investigator to computer was 0.76. A multiple regression using both redness and horizontality improved the agreement CCC from 0.66 and 0.69 to 0.76, demonstrating the contribution of vessel geometry to the overall grade. Computer analysis of a given image has 100% repeatability and zero variability from session to session. This objective means of grading ocular redness in a unified fashion has potential significance as a new clinical endpoint. In comparisons between computer and investigator, computer grading proved to be more reliable than another investigator using the OCDER scale. The best fitting model based on the present sample, and usable for future studies, was [Formula: see text] is the predicted investigator grade, and [Formula: see text] and [Formula: see text] are logarithmic transformations of the computer calculated parameters COM-Hor and COM-Red. Considering the superior repeatability, computer automated grading might be preferable to investigator grading in multicentered dry eye studies in which the subtle differences in redness incurred by treatment have been historically difficult to define.

  16. Dry season mean monthly flow and harmonic mean flow regression equations for selected ungaged basins in Arkansas

    USGS Publications Warehouse

    Breaker, Brian K.

    2015-01-01

    Equations for two regions were found to be statistically significant for developing regression equations for estimating harmonic mean flows at ungaged basins; thus, equations are applicable only to streams in those respective regions in Arkansas. Regression equations for dry season mean monthly flows are applicable only to streams located throughout Arkansas. All regression equations are applicable only to unaltered streams where flows were not significantly affected by regulation, diversion, or urbanization. The median number of years used for dry season mean monthly flow calculation was 43, and the median number of years used for harmonic mean flow calculations was 34 for region 1 and 43 for region 2.

  17. Methods for calculating confidence and credible intervals for the residual between-study variance in random effects meta-regression models

    PubMed Central

    2014-01-01

    Background Meta-regression is becoming increasingly used to model study level covariate effects. However this type of statistical analysis presents many difficulties and challenges. Here two methods for calculating confidence intervals for the magnitude of the residual between-study variance in random effects meta-regression models are developed. A further suggestion for calculating credible intervals using informative prior distributions for the residual between-study variance is presented. Methods Two recently proposed and, under the assumptions of the random effects model, exact methods for constructing confidence intervals for the between-study variance in random effects meta-analyses are extended to the meta-regression setting. The use of Generalised Cochran heterogeneity statistics is extended to the meta-regression setting and a Newton-Raphson procedure is developed to implement the Q profile method for meta-analysis and meta-regression. WinBUGS is used to implement informative priors for the residual between-study variance in the context of Bayesian meta-regressions. Results Results are obtained for two contrasting examples, where the first example involves a binary covariate and the second involves a continuous covariate. Intervals for the residual between-study variance are wide for both examples. Conclusions Statistical methods, and R computer software, are available to compute exact confidence intervals for the residual between-study variance under the random effects model for meta-regression. These frequentist methods are almost as easily implemented as their established counterparts for meta-analysis. Bayesian meta-regressions are also easily performed by analysts who are comfortable using WinBUGS. Estimates of the residual between-study variance in random effects meta-regressions should be routinely reported and accompanied by some measure of their uncertainty. Confidence and/or credible intervals are well-suited to this purpose. PMID:25196829

  18. Efficacy prediction of cevimeline in patients with Sjögren's syndrome.

    PubMed

    Yamada, Hiroyuki; Nakagawa, Yoichi; Wakamatsu, Ei; Sumida, Takayuki; Yamachika, Shigeo; Nomura, Yoshiaki; Mishima, Kenji; Saito, Ichiro

    2007-08-01

    The objective of this study was to examine the clinical and immunological factors influencing the efficacy of cevimeline hydrochloride hydrate (cevimeline) for the treatment of xerostomia in patients with Sjögren's syndrome (SS). Thirty primary SS patients who were medicated with cevimeline were enrolled in this study. Whole stimulated sialometry (WSS) was compared between pre- and posttreatment points (4 weeks after oral cevimeline administration) and the increment rate of WSS was calculated. Multiple regression was employed to examine the relative contributions of the clinical and immunological factors, including age, pretreatment WSS, duration of disease, sialography, minor salivary gland biopsy, anti-Ro/SS-A antibodies, anti-La/SS-B antibodies, and antibodies to muscarinic type 3 receptors to the posttreatment WSS. Patients with normal sialography findings, negative minor salivary gland biopsy, and absence of anti-La/SS-B antibodies had significantly higher increment rates of WSS compared with those with positive findings (p=0.042, 0.002, and 0.018, respectively). Results of the multiple regression analysis showed that sialography (coefficient=-0.867, p=0.004) and minor salivary gland biopsy (coefficient=-0.869, p=0.003) had significant associations with the posttreatment WSS. Our preliminary results demonstrated the relationship between the effect of cevimeline on saliva secretion and the degree of salivary gland destruction evaluated by sialography and histopathological findings in the labial minor salivary glands. These diagnostic approaches could provide useful prognostic information on the efficacy of cevimeline in SS patients.

  19. Chair-side detection of Prevotella Intermedia in mature dental plaque by its fluorescence.

    PubMed

    Nomura, Yoshiaki; Takeuchi, Hiroaki; Okamoto, Masaaki; Sogabe, Kaoru; Okada, Ayako; Hanada, Nobuhiro

    2017-06-01

    Prevotella intermedia/nigrescens is one of the well-known pathogens causing periodontal diseases, and the red florescence excited by the visible blue light caused by the protoporphyrin IX in the bacterial cells could be useful for the chair-side detection. The aim of this study was to evaluated levels of periodontal pathogen, especially P. intermedia in clinical samples of red fluorescent dental plaque. Thirty two supra gingival plaque samples from six individuals were measured its fluorescence at 640nm wavelength excited by 409nm. Periodontopathic bacteria were counted by the Invader PLUS PCR assay. Co-relations the fluorescence intensity and bacterial counts were analyzed by Person's correlation coefficient and simple and multiple regression analysis. Positive and negative predictive values of the fluorescence intensities for with or without P. intermedia in supragingival plaque was calculated. When relative fluorescence unit (RFU) were logarithmic transformed, statistically significant linear relations between RFU and bacterial counts were obtained for P. intermedia, Porphyromonas gingivalis and Tannerella forsythia. By the multiple regression analysis, only P. intermedia had statistically significant co-relation with fluorescence intensities. All of the fluorescent dental plaque contained P. intermedia m. In contrast, 28% of non-fluorescent plaques contained P. intermedia. To check the fluorescence dental plaque in the oral cavity could be the simple chair-side screening of the mature dental plaque before examining the periodontal pathogens especially P. intermedia by the PCR method. Copyright © 2017 Elsevier B.V. All rights reserved.

  20. Association of Dentine Hypersensitivity with Different Risk Factors – A Cross Sectional Study

    PubMed Central

    Vijaya, V; Sanjay, Venkataraam; Varghese, Rana K; Ravuri, Rajyalakshmi; Agarwal, Anil

    2013-01-01

    Background: This study was done to assess the prevalence of Dentine hypersensitivity (DH) and its associated risk factors. Materials & Methods: This epidemiological study was done among patients coming to dental college regarding prevalence of DH. A self structured questionnaire along with clinical examination was done for assessment. Descriptive statistics were obtained and frequency distribution was calculated using Chi square test at p value <0.05. Stepwise multiple linear regression was also done to access frequency of DH with different factors. Results: The study population was comprised of 655 participants with different age groups. Our study showed prevalence as 55% and it was more common among males. Similarly smokers and those who use hard tooth brush had more cases of DH. Step wise multiple linear regression showed that best predictor for DH was age followed by habit of smoking and type of tooth brush. Most aggravating factors were cold water (15.4%) and sweet foods (14.7%), whereas only 5% of the patients had it while brushing. Conclusion: A high level of dental hypersensitivity has been in this study and more common among males. A linear finding was shown with age, smoking and type of tooth brush. How to cite this article: Vijaya V, Sanjay V, Varghese RK, Ravuri R, Agarwal A. Association of Dentine Hypersensitivity with Different Risk Factors – A Cross Sectional Study. J Int Oral Health 2013;5(6):88-92 . PMID:24453451

  1. Association between red blood cell distribution width (RDW) and carotid artery atherosclerosis (CAS) in patients with primary ischemic stroke.

    PubMed

    Jia, He; Li, Huimian; Zhang, Yan; Li, Che; Hu, Yingyun; Xia, Chunfang

    2015-01-01

    The present study aimed to explore the association between RDW and CAS in patients with ischemic stroke, expecting to find a new and significant diagnosis index for clinical practice. This cross-sectional study involves 432 consecutive patients with primary ischemic stroke (within 72 h). All subjects were confirmed by magnetic resonance imaging, and underwent physical examination, laboratory tests and carotid ultrasonography check. Finally, 392 patients were included according to the exclusion criteria. The odds ratios of independent variables were calculated using stepwise multiple logistic regression. Carotid intimal-medial thickness (IMT) and RDW are both significantly different between CAS group and control group. Univariate analyses show that high-sensitive C-reactive protein (Hs-CRP) and RDW (r=0.436) are both in significantly positive association with IMT. Stepwise multiple logistic regression shows that RDW is an independent protective factor of CAS in patients with ischemic stroke. Compared with the lowest quartile, the second to fourth quartiles are 1.13 (95% CI: 1.13-3.05), 2.02 (95% CI: 1.66-4.67), and 3.10 (95% CI: 2.46-7.65), respectively. The present study suggested that RDW level were higher than non-CAS in patients with primary ischemic stroke. Our results facilitated a bridge to connect RDW with ischemic stroke and further confirmed the role of RDW in the progression of the ischemic stroke. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.

  2. QSAR studies of the bioactivity of hepatitis C virus (HCV) NS3/4A protease inhibitors by multiple linear regression (MLR) and support vector machine (SVM).

    PubMed

    Qin, Zijian; Wang, Maolin; Yan, Aixia

    2017-07-01

    In this study, quantitative structure-activity relationship (QSAR) models using various descriptor sets and training/test set selection methods were explored to predict the bioactivity of hepatitis C virus (HCV) NS3/4A protease inhibitors by using a multiple linear regression (MLR) and a support vector machine (SVM) method. 512 HCV NS3/4A protease inhibitors and their IC 50 values which were determined by the same FRET assay were collected from the reported literature to build a dataset. All the inhibitors were represented with selected nine global and 12 2D property-weighted autocorrelation descriptors calculated from the program CORINA Symphony. The dataset was divided into a training set and a test set by a random and a Kohonen's self-organizing map (SOM) method. The correlation coefficients (r 2 ) of training sets and test sets were 0.75 and 0.72 for the best MLR model, 0.87 and 0.85 for the best SVM model, respectively. In addition, a series of sub-dataset models were also developed. The performances of all the best sub-dataset models were better than those of the whole dataset models. We believe that the combination of the best sub- and whole dataset SVM models can be used as reliable lead designing tools for new NS3/4A protease inhibitors scaffolds in a drug discovery pipeline. Copyright © 2017 Elsevier Ltd. All rights reserved.

  3. Sarcopenia Impairs Prognosis of Patients with Hepatocellular Carcinoma: The Role of Liver Functional Reserve and Tumor-Related Factors in Loss of Skeletal Muscle Volume.

    PubMed

    Imai, Kenji; Takai, Koji; Watanabe, Satoshi; Hanai, Tatsunori; Suetsugu, Atsushi; Shiraki, Makoto; Shimizu, Masahito

    2017-09-22

    Sarcopenia impairs survival in patients with hepatocellular carcinoma (HCC). This study aimed to clarify the factors that contribute to decreased skeletal muscle volume in patients with HCC. The third lumbar vertebra skeletal muscle index (L3 SMI) in 351 consecutive patients with HCC was calculated to identify sarcopenia. Sarcopenia was defined as an L3 SMI value ≤ 29.0 cm²/m² for women and ≤ 36.0 cm²/m² for men. The factors affecting L3 SMI were analyzed by multiple linear regression analysis and tree-based models. Of the 351 HCC patients, 33 were diagnosed as having sarcopenia and showed poor prognosis compared with non-sarcopenia patients ( p = 0.007). However, this significant difference disappeared after the adjustments for age, sex, Child-Pugh score, maximum tumor size, tumor number, and the degree of portal vein invasion by propensity score matching analysis. Multiple linear regression analysis showed that age ( p = 0.015) and sex ( p < 0.0001) were significantly correlated with a decrease in L3 SMI. Tree-based models revealed that sex (female) is the most significant factor that affects L3 SMI. In male patients, L3 SMI was decreased by aging, increased Child-Pugh score (≥56 years), and enlarged tumor size (<56 years). Maintaining liver functional reserve and early diagnosis and therapy for HCC are vital to prevent skeletal muscle depletion and improve the prognosis of patients with HCC.

  4. Monitoring heavy metal Cr in soil based on hyperspectral data using regression analysis

    NASA Astrophysics Data System (ADS)

    Zhang, Ningyu; Xu, Fuyun; Zhuang, Shidong; He, Changwei

    2016-10-01

    Heavy metal pollution in soils is one of the most critical problems in the global ecology and environment safety nowadays. Hyperspectral remote sensing and its application is capable of high speed, low cost, less risk and less damage, and provides a good method for detecting heavy metals in soil. This paper proposed a new idea of applying regression analysis of stepwise multiple regression between the spectral data and monitoring the amount of heavy metal Cr by sample points in soil for environmental protection. In the measurement, a FieldSpec HandHeld spectroradiometer is used to collect reflectance spectra of sample points over the wavelength range of 325-1075 nm. Then the spectral data measured by the spectroradiometer is preprocessed to reduced the influence of the external factors, and the preprocessed methods include first-order differential equation, second-order differential equation and continuum removal method. The algorithms of stepwise multiple regression are established accordingly, and the accuracy of each equation is tested. The results showed that the accuracy of first-order differential equation works best, which makes it feasible to predict the content of heavy metal Cr by using stepwise multiple regression.

  5. Nonalcoholic Fatty Liver Disease: Diagnostic and Fat-Grading Accuracy of Low-Flip-Angle Multiecho Gradient-Recalled-Echo MR Imaging at 1.5 T

    PubMed Central

    Yokoo, Takeshi; Bydder, Mark; Hamilton, Gavin; Middleton, Michael S.; Gamst, Anthony C.; Wolfson, Tanya; Hassanein, Tarek; Patton, Heather M.; Lavine, Joel E.; Schwimmer, Jeffrey B.; Sirlin, Claude B.

    2009-01-01

    Purpose: To assess the accuracy of four fat quantification methods at low-flip-angle multiecho gradient-recalled-echo (GRE) magnetic resonance (MR) imaging in nonalcoholic fatty liver disease (NAFLD) by using MR spectroscopy as the reference standard. Materials and Methods: In this institutional review board–approved, HIPAA-compliant prospective study, 110 subjects (29 with biopsy-confirmed NAFLD, 50 overweight and at risk for NAFLD, and 31 healthy volunteers) (mean age, 32.6 years ± 15.6 [standard deviation]; range, 8–66 years) gave informed consent and underwent MR spectroscopy and GRE MR imaging of the liver. Spectroscopy involved a long repetition time (to suppress T1 effects) and multiple echo times (to estimate T2 effects); the reference fat fraction (FF) was calculated from T2-corrected fat and water spectral peak areas. Imaging involved a low flip angle (to suppress T1 effects) and multiple echo times (to estimate T2* effects); imaging FF was calculated by using four analysis methods of progressive complexity: dual echo, triple echo, multiecho, and multiinterference. All methods except dual echo corrected for T2* effects. The multiinterference method corrected for multiple spectral interference effects of fat. For each method, the accuracy for diagnosis of fatty liver, as defined with a spectroscopic threshold, was assessed by estimating sensitivity and specificity; fat-grading accuracy was assessed by comparing imaging and spectroscopic FF values by using linear regression. Results: Dual-echo, triple-echo, multiecho, and multiinterference methods had a sensitivity of 0.817, 0.967, 0.950, and 0.983 and a specificity of 1.000, 0.880, 1.000, and 0.880, respectively. On the basis of regression slope and intercept, the multiinterference (slope, 0.98; intercept, 0.91%) method had high fat-grading accuracy without statistically significant error (P > .05). Dual-echo (slope, 0.98; intercept, −2.90%), triple-echo (slope, 0.94; intercept, 1.42%), and multiecho (slope, 0.85; intercept, −0.15%) methods had statistically significant error (P < .05). Conclusion: Relaxation- and interference-corrected fat quantification at low-flip-angle multiecho GRE MR imaging provides high diagnostic and fat-grading accuracy in NAFLD. © RSNA, 2009 PMID:19221054

  6. Element enrichment factor calculation using grain-size distribution and functional data regression.

    PubMed

    Sierra, C; Ordóñez, C; Saavedra, A; Gallego, J R

    2015-01-01

    In environmental geochemistry studies it is common practice to normalize element concentrations in order to remove the effect of grain size. Linear regression with respect to a particular grain size or conservative element is a widely used method of normalization. In this paper, the utility of functional linear regression, in which the grain-size curve is the independent variable and the concentration of pollutant the dependent variable, is analyzed and applied to detrital sediment. After implementing functional linear regression and classical linear regression models to normalize and calculate enrichment factors, we concluded that the former regression technique has some advantages over the latter. First, functional linear regression directly considers the grain-size distribution of the samples as the explanatory variable. Second, as the regression coefficients are not constant values but functions depending on the grain size, it is easier to comprehend the relationship between grain size and pollutant concentration. Third, regularization can be introduced into the model in order to establish equilibrium between reliability of the data and smoothness of the solutions. Copyright © 2014 Elsevier Ltd. All rights reserved.

  7. Methodology for the development of normative data for Spanish-speaking pediatric populations.

    PubMed

    Rivera, D; Arango-Lasprilla, J C

    2017-01-01

    To describe the methodology utilized to calculate reliability and the generation of norms for 10 neuropsychological tests for children in Spanish-speaking countries. The study sample consisted of over 4,373 healthy children from nine countries in Latin America (Chile, Cuba, Ecuador, Guatemala, Honduras, Mexico, Paraguay, Peru, and Puerto Rico) and Spain. Inclusion criteria for all countries were to have between 6 to 17 years of age, an Intelligence Quotient of≥80 on the Test of Non-Verbal Intelligence (TONI-2), and score of <19 on the Children's Depression Inventory. Participants completed 10 neuropsychological tests. Reliability and norms were calculated for all tests. Test-retest analysis showed excellent or good- reliability on all tests (r's>0.55; p's<0.001) except M-WCST perseverative errors whose coefficient magnitude was fair. All scores were normed using multiple linear regressions and standard deviations of residual values. Age, age2, sex, and mean level of parental education (MLPE) were included as predictors in the models by country. The non-significant variables (p > 0.05) were removed and the analysis were run again. This is the largest Spanish-speaking children and adolescents normative study in the world. For the generation of normative data, the method based on linear regression models and the standard deviation of residual values was used. This method allows determination of the specific variables that predict test scores, helps identify and control for collinearity of predictive variables, and generates continuous and more reliable norms than those of traditional methods.

  8. Climate patterns as predictors of amphibians species richness and indicators of potential stress

    USGS Publications Warehouse

    Battaglin, W.; Hay, L.; McCabe, G.; Nanjappa, P.; Gallant, Alisa L.

    2005-01-01

    Amphibians occupy a range of habitats throughout the world, but species richness is greatest in regions with moist, warm climates. We modeled the statistical relations of anuran and urodele species richness with mean annual climate for the conterminous United States, and compared the strength of these relations at national and regional levels. Model variables were calculated for county and subcounty mapping units, and included 40-year (1960-1999) annual mean and mean annual climate statistics, mapping unit average elevation, mapping unit land area, and estimates of anuran and urodele species richness. Climate data were derived from more than 7,500 first-order and cooperative meteorological stations and were interpolated to the mapping units using multiple linear regression models. Anuran and urodele species richness were calculated from the United States Geological Survey's Amphibian Research and Monitoring Initiative (ARMI) National Atlas for Amphibian Distributions. The national multivariate linear regression (MLR) model of anuran species richness had an adjusted coefficient of determination (R2) value of 0.64 and the national MLR model for urodele species richness had an R2 value of 0.45. Stratifying the United States by coarse-resolution ecological regions provided models for anUrans that ranged in R2 values from 0.15 to 0.78. Regional models for urodeles had R2 values. ranging from 0.27 to 0.74. In general, regional models for anurans were more strongly influenced by temperature variables, whereas precipitation variables had a larger influence on urodele models.

  9. Calculation of Haem Iron Intake and Its Role in the Development of Iron Deficiency in Young Women from the Australian Longitudinal Study on Women's Health.

    PubMed

    Reeves, Angela J; McEvoy, Mark A; MacDonald-Wicks, Lesley K; Barker, Daniel; Attia, John; Hodge, Allison M; Patterson, Amanda J

    2017-05-19

    Total iron intake is not strongly associated with iron stores, but haem iron intake may be more predictive. Haem iron is not available in most nutrient databases, so experimentally determined haem contents were applied to an Australian Food Frequency Questionnaire (FFQ) to estimate haem iron intake in a representative sample of young women (25-30 years). The association between dietary haem iron intakes and incident self-reported diagnosed iron deficiency over six years of follow-up was examined. Haem iron contents for Australian red meats, fish, and poultry were applied to haem-containing foods in the Dietary Questionnaire for Epidemiological Studies V2 (DQESv2) FFQ. Haem iron intakes were calculated for 9076 women from the Australian Longitudinal Study on Women's Health (ALSWH) using the DQESv2 dietary data from 2003. Logistic regression was used to examine the association between haem iron intake (2003) and the incidence of iron deficiency in 2006 and 2009. Multiple logistic regression showed baseline haem iron intake was a statistically significant predictor of iron deficiency in 2006 (Odds Ratio (OR): 0.91; 95% Confidence Interval (CI): 0.84-0.99; p -value: 0.020) and 2009 (OR: 0.89; 95% CI: 0.82-0.99; p -value: 0.007). Using the energy-adjusted haem intake made little difference to the associations. Higher haem iron intake is associated with reduced odds of iron deficiency developing in young adult Australian women.

  10. Forecasting USAF JP-8 Fuel Needs

    DTIC Science & Technology

    2009-03-01

    versus complex ones. When we consider long -term forecasts, 5-years in this case, multiple regression outperforms ANN modeling within the specified...with more simple and easy-to-implement methods, versus complex ones. When we consider long -term 5-year forecasts, our multiple regression model...effort. The insight and experience was certainly appreciated. Special thanks to my Turkish peers for their continuous support and help during this long

  11. The Overall Odds Ratio as an Intuitive Effect Size Index for Multiple Logistic Regression: Examination of Further Refinements

    ERIC Educational Resources Information Center

    Le, Huy; Marcus, Justin

    2012-01-01

    This study used Monte Carlo simulation to examine the properties of the overall odds ratio (OOR), which was recently introduced as an index for overall effect size in multiple logistic regression. It was found that the OOR was relatively independent of study base rate and performed better than most commonly used R-square analogs in indexing model…

  12. Future Performance Trend Indicators: A Current Value Approach to Human Resources Accounting. Report III. Multivariate Predictions of Organizational Performance Across Time.

    ERIC Educational Resources Information Center

    Pecorella, Patricia A.; Bowers, David G.

    Multiple regression in a double cross-validated design was used to predict two performance measures (total variable expense and absence rate) by multi-month period in five industrial firms. The regressions do cross-validate, and produce multiple coefficients which display both concurrent and predictive effects, peaking 18 months to two years…

  13. Using multiple calibration sets to improve the quantitative accuracy of partial least squares (PLS) regression on open-path fourier transform infrared (OP/FT-IR) spectra of ammonia over wide concentration ranges

    USDA-ARS?s Scientific Manuscript database

    A technique of using multiple calibration sets in partial least squares regression (PLS) was proposed to improve the quantitative determination of ammonia from open-path Fourier transform infrared spectra. The spectra were measured near animal farms, and the path-integrated concentration of ammonia...

  14. FIRE: an SPSS program for variable selection in multiple linear regression analysis via the relative importance of predictors.

    PubMed

    Lorenzo-Seva, Urbano; Ferrando, Pere J

    2011-03-01

    We provide an SPSS program that implements currently recommended techniques and recent developments for selecting variables in multiple linear regression analysis via the relative importance of predictors. The approach consists of: (1) optimally splitting the data for cross-validation, (2) selecting the final set of predictors to be retained in the equation regression, and (3) assessing the behavior of the chosen model using standard indices and procedures. The SPSS syntax, a short manual, and data files related to this article are available as supplemental materials from brm.psychonomic-journals.org/content/supplemental.

  15. Standardized Regression Coefficients as Indices of Effect Sizes in Meta-Analysis

    ERIC Educational Resources Information Center

    Kim, Rae Seon

    2011-01-01

    When conducting a meta-analysis, it is common to find many collected studies that report regression analyses, because multiple regression analysis is widely used in many fields. Meta-analysis uses effect sizes drawn from individual studies as a means of synthesizing a collection of results. However, indices of effect size from regression analyses…

  16. Evaluation of the comprehensive palatability of Japanese sake paired with dishes by multiple regression analysis based on subdomains.

    PubMed

    Nakamura, Ryo; Nakano, Kumiko; Tamura, Hiroyasu; Mizunuma, Masaki; Fushiki, Tohru; Hirata, Dai

    2017-08-01

    Many factors contribute to palatability. In order to evaluate the palatability of Japanese alcohol sake paired with certain dishes by integrating multiple factors, here we applied an evaluation method previously reported for palatability of cheese by multiple regression analysis based on 3 subdomain factors (rewarding, cultural, and informational). We asked 94 Japanese participants/subjects to evaluate the palatability of sake (1st evaluation/E1 for the first cup, 2nd/E2 and 3rd/E3 for the palatability with aftertaste/afterglow of certain dishes) and to respond to a questionnaire related to 3 subdomains. In E1, 3 factors were extracted by a factor analysis, and the subsequent multiple regression analyses indicated that the palatability of sake was interpreted by mainly the rewarding. Further, the results of attribution-dissections in E1 indicated that 2 factors (rewarding and informational) contributed to the palatability. Finally, our results indicated that the palatability of sake was influenced by the dish eaten just before drinking.

  17. Species Composition at the Sub-Meter Level in Discontinuous Permafrost in Subarctic Sweden

    NASA Astrophysics Data System (ADS)

    Anderson, S. M.; Palace, M. W.; Layne, M.; Varner, R. K.; Crill, P. M.

    2013-12-01

    Northern latitudes are experiencing rapid warming. Wetlands underlain by permafrost are particularly vulnerable to warming which results in changes in vegetative cover. Specific species have been associated with greenhouse gas emissions therefore knowledge of species compositional shift allows for the systematic change and quantification of emissions and changes in such emissions. Species composition varies on the sub-meter scale based on topography and other microsite environmental parameters. This complexity and the need to scale vegetation to the landscape level proves vital in our estimation of carbon dioxide (CO2) and methane (CH4) emissions and dynamics. Stordalen Mire (68°21'N, 18°49'E) in Abisko and is located at the edge of discontinuous permafrost zone. This provides a unique opportunity to analyze multiple vegetation communities in a close proximity. To do this, we randomly selected 25 1x1 meter plots that were representative of five major cover types: Semi-wet, wet, hummock, tall graminoid, and tall shrub. We used a quadrat with 64 sub plots and measured areal percent cover for 24 species. We collected ground based remote sensing (RS) at each plot to determine species composition using an ADC-lite (near infrared, red, green) and GoPro (red, blue, green). We normalized each image based on a Teflon white chip placed in each image. Textural analysis was conducted on each image for entropy, angular second momentum, and lacunarity. A logistic regression was developed to examine vegetation cover types and remote sensing parameters. We used a multiple linear regression using forwards stepwise variable selection. We found statistical difference in species composition and diversity indices between vegetation cover types. In addition, we were able to build regression model to significantly estimate vegetation cover type as well as percent cover for specific key vegetative species. This ground-based remote sensing allows for quick quantification of vegetation cover and species and also provides the framework for scaling to satellite image data to estimate species composition and shift on the landscape level. To determine diversity within our plots we calculated species richness and Shannon Index. We found that there were statistically different species composition within each vegetation cover type and also determined which species were indicative for cover type. Our logistical regression was able to significantly classify vegetation cover types based on RS parameters. Our multiple regression analysis indicated Betunla nana (Dwarf Birch) (r2= .48, p=<0.0001) and Sphagnum (r2=0.59, p=<0.0001) were statistically significant with respect to RS parameters. We suggest that ground based remote sensing methods may provide a unique and efficient method to quantify vegetation across the landscape in northern latitude wetlands.

  18. Determination of streamflow of the Arkansas River near Bentley in south-central Kansas

    USGS Publications Warehouse

    Perry, Charles A.

    2012-01-01

    The Kansas Department of Agriculture, Division of Water Resources, requires that the streamflow of the Arkansas River just upstream from Bentley in south-central Kansas be measured or calculated before groundwater can be pumped from the well field. When the daily streamflow of the Arkansas River near Bentley is less than 165 cubic feet per second (ft3/s), pumping must be curtailed. Daily streamflow near Bentley was calculated by determining the relations between streamflow data from two reference streamgages with a concurrent record of 24 years, one located 17.2 miles (mi) upstream and one located 10.9 mi downstream, and streamflow at a temporary gage located just upstream from Bentley (Arkansas River near Bentley, Kansas). Flow-duration curves for the two reference streamgages indicate that during 1988?2011, the mean daily streamflow was less than 165 ft3/s 30 to 35 percent of the time. During extreme low-flow (drought) conditions, the reach of the Arkansas River between Hutchinson and Maize can lose flow to the adjacent alluvial aquifer, with streamflow losses as much as 1.6 cubic feet per second per mile. Three models were developed to calculate the streamflow of the Arkansas River near Bentley, Kansas. The model chosen depends on the data available and on whether the reach of the Arkansas River between Hutchinson and Maize is gaining or losing groundwater from or to the adjacent alluvial aquifer. The first model was a pair of equations developed from linear regressions of the relation between daily streamflow data from the Bentley streamgage and daily streamflow data from either the Arkansas River near Hutchinson, Kansas, station (station number 07143330) or the Arkansas River near Maize, Kansas, station (station number 07143375). The standard error of the Hutchinson-only equation was 22.8 ft3/s, and the standard error of the Maize-only equation was 22.3 ft3/s. The single-station model would be used if only one streamgage was available. In the second model, the flow gradient between the streamflow near Hutchinson and the streamflow near Maize was used to calculate the streamflow at the Bentley streamgage. This equation resulted in a standard error of 26.7 ft3/s. In the third model, a multiple regression analysis between both the daily streamflow of the Arkansas River near Hutchinson, Kansas, and the daily streamflow of the Arkansas River near Maize, Kansas, was used to calculate the streamflow at the Bentley streamgage. The multiple regression equation had a standard error of 21.2 ft3/s, which was the smallest of the standard errors for all the models. An analysis of the number of low-flow days and the number of days when the reach between Hutchinson and Maize loses flow to the adjacent alluvial aquifer indicates that the long-term trend is toward fewer days of losing conditions. This trend may indicate a long-term increase in water levels in the alluvial aquifer, which could be caused by one or more of several conditions, including an increase in rainfall, a decrease in pumping, a decrease in temperature, and an increase in streamflow upstream from the Hutchinson-to-Maize reach of the Arkansas River.

  19. REGRES: A FORTRAN-77 program to calculate nonparametric and ``structural'' parametric solutions to bivariate regression equations

    NASA Astrophysics Data System (ADS)

    Rock, N. M. S.; Duffy, T. R.

    REGRES allows a range of regression equations to be calculated for paired sets of data values in which both variables are subject to error (i.e. neither is the "independent" variable). Nonparametric regressions, based on medians of all possible pairwise slopes and intercepts, are treated in detail. Estimated slopes and intercepts are output, along with confidence limits, Spearman and Kendall rank correlation coefficients. Outliers can be rejected with user-determined stringency. Parametric regressions can be calculated for any value of λ (the ratio of the variances of the random errors for y and x)—including: (1) major axis ( λ = 1); (2) reduced major axis ( λ = variance of y/variance of x); (3) Y on Xλ = infinity; or (4) X on Y ( λ = 0) solutions. Pearson linear correlation coefficients also are output. REGRES provides an alternative to conventional isochron assessment techniques where bivariate normal errors cannot be assumed, or weighting methods are inappropriate.

  20. Correlation and simple linear regression.

    PubMed

    Eberly, Lynn E

    2007-01-01

    This chapter highlights important steps in using correlation and simple linear regression to address scientific questions about the association of two continuous variables with each other. These steps include estimation and inference, assessing model fit, the connection between regression and ANOVA, and study design. Examples in microbiology are used throughout. This chapter provides a framework that is helpful in understanding more complex statistical techniques, such as multiple linear regression, linear mixed effects models, logistic regression, and proportional hazards regression.

  1. Evaluation of the effect of alternative measurements of body weight gain and dry matter intake for the calculation of residual feed intake in growing purebred Charolais and Red Angus cattle.

    PubMed

    Kayser, W; Glaze, J B; Welch, C M; Kerley, M; Hill, R A

    2015-07-01

    The objective of this study was to determine the effects of alternative-measurements of body weight and DMI used to evaluate residual feed intake (RFI). Weaning weight (WW), ADG, and DMI were recorded on 970 growing purebred Charolais bulls (n = 519) and heifers (n = 451) and 153 Red Angus growing steers (n = 69) and heifers (n = 84) using a GrowSafe (GrowSafe, Airdrie, Alberta, Canada) system. Averages of individual DMI were calculated in 10-d increments and compared to the overall DMI to identify the magnitude of the errors associated with measuring DMI. These incremental measurements were also used in calculation of RFI, computed from the linear regression of DMI on ADG and midtest body weight0.75 (MMWT). RFI_Regress was calculated using ADG_Regress (ADG calculated as the response of BW gain and DOF) and MMWT_PWG (metabolic midweight calculated throughout the postweaning gain test), considered the control in Red Angus. A similar calculation served as control for Charolais; RFI was calculated using 2-d consecutive start and finish weights (RFI_Calc). The RFI weaning weight (RFI_WW) was calculated using ADG_WW (ADG from weaning till the final out weight of the postweaning gain test) and MMWT_WW, calculated similarly. Overall average estimated DMI was highly correlated to the measurements derived over shorter periods, with 10 d being the least correlated and 60 d being the most correlated. The ADG_Calc (calculated using 2-d consecutive start and finish weight/DOF) and ADG_WW were highly correlated in Charolais. The ADG_Regress and ADG_Calc were highly correlated, and ADG_Regress and ADG_WW were moderately correlated in Red Angus. The control measures of RFI were highly correlated with the RFI_WW in Charolais and Red Angus. The outcomes of including abbreviated period DMI in the model with the weaning weight gain measurements showed that the model using 10 d of intake (RFI WW_10) was the least correlated with the control measures. The model with 60 d of intake had the largest correlation with the control measures. The fewest measured intake days coupled with the weaning weight values providing acceptable predictive value was RFI_WW_40, being highly correlated with the control measures. As established in the literature, at least 70 d is required to accurately measure ADG. However, we conclude that a shorter period, possibly as few as 40 d is needed to accurately estimate DMI for a reliable calculation of RFI.

  2. A systematic review and meta-analysis of the effects of antibiotic consumption on antibiotic resistance

    PubMed Central

    2014-01-01

    Background Greater use of antibiotics during the past 50 years has exerted selective pressure on susceptible bacteria and may have favoured the survival of resistant strains. Existing information on antibiotic resistance patterns from pathogens circulating among community-based patients is substantially less than from hospitalized patients on whom guidelines are often based. We therefore chose to assess the relationship between the antibiotic resistance pattern of bacteria circulating in the community and the consumption of antibiotics in the community. Methods Both gray literature and published scientific literature in English and other European languages was examined. Multiple regression analysis was used to analyse whether studies found a positive relationship between antibiotic consumption and resistance. A subsequent meta-analysis and meta-regression was conducted for studies for which a common effect size measure (odds ratio) could be calculated. Results Electronic searches identified 974 studies but only 243 studies were considered eligible for inclusion by the two independent reviewers who extracted the data. A binomial test revealed a positive relationship between antibiotic consumption and resistance (p < .001) but multiple regression modelling did not produce any significant predictors of study outcome. The meta-analysis generated a significant pooled odds ratio of 2.3 (95% confidence interval 2.2 to 2.5) with a meta-regression producing several significant predictors (F(10,77) = 5.82, p < .01). Countries in southern Europe produced a stronger link between consumption and resistance than other regions. Conclusions Using a large set of studies we found that antibiotic consumption is associated with the development of antibiotic resistance. A subsequent meta-analysis, with a subsample of the studies, generated several significant predictors. Countries in southern Europe produced a stronger link between consumption and resistance than other regions so efforts at reducing antibiotic consumption may need to be strengthened in this area. Increased consumption of antibiotics may not only produce greater resistance at the individual patient level but may also produce greater resistance at the community, country, and regional levels, which can harm individual patients. PMID:24405683

  3. Factor analysis and multiple regression between topography and precipitation on Jeju Island, Korea

    NASA Astrophysics Data System (ADS)

    Um, Myoung-Jin; Yun, Hyeseon; Jeong, Chang-Sam; Heo, Jun-Haeng

    2011-11-01

    SummaryIn this study, new factors that influence precipitation were extracted from geographic variables using factor analysis, which allow for an accurate estimation of orographic precipitation. Correlation analysis was also used to examine the relationship between nine topographic variables from digital elevation models (DEMs) and the precipitation in Jeju Island. In addition, a spatial analysis was performed in order to verify the validity of the regression model. From the results of the correlation analysis, it was found that all of the topographic variables had a positive correlation with the precipitation. The relations between the variables also changed in accordance with a change in the precipitation duration. However, upon examining the correlation matrix, no significant relationship between the latitude and the aspect was found. According to the factor analysis, eight topographic variables (latitude being the exception) were found to have a direct influence on the precipitation. Three factors were then extracted from the eight topographic variables. By directly comparing the multiple regression model with the factors (model 1) to the multiple regression model with the topographic variables (model 3), it was found that model 1 did not violate the limits of statistical significance and multicollinearity. As such, model 1 was considered to be appropriate for estimating the precipitation when taking into account the topography. In the study of model 1, the multiple regression model using factor analysis was found to be the best method for estimating the orographic precipitation on Jeju Island.

  4. Use of Multiple Imputation to Estimate the Proportion of Respiratory Virus Detections Among Patients Hospitalized With Community-Acquired Pneumonia.

    PubMed

    Bozio, Catherine H; Flanders, W Dana; Finelli, Lyn; Bramley, Anna M; Reed, Carrie; Gandhi, Neel R; Vidal, Jorge E; Erdman, Dean; Levine, Min Z; Lindstrom, Stephen; Ampofo, Krow; Arnold, Sandra R; Self, Wesley H; Williams, Derek J; Grijalva, Carlos G; Anderson, Evan J; McCullers, Jonathan A; Edwards, Kathryn M; Pavia, Andrew T; Wunderink, Richard G; Jain, Seema

    2018-04-01

    Real-time polymerase chain reaction (PCR) on respiratory specimens and serology on paired blood specimens are used to determine the etiology of respiratory illnesses for research studies. However, convalescent serology is often not collected. We used multiple imputation to assign values for missing serology results to estimate virus-specific prevalence among pediatric and adult community-acquired pneumonia hospitalizations using data from an active population-based surveillance study. Presence of adenoviruses, human metapneumovirus, influenza viruses, parainfluenza virus types 1-3, and respiratory syncytial virus was defined by positive PCR on nasopharyngeal/oropharyngeal specimens or a 4-fold rise in paired serology. We performed multiple imputation by developing a multivariable regression model for each virus using data from patients with available serology results. We calculated absolute and relative differences in the proportion of each virus detected comparing the imputed to observed (nonimputed) results. Among 2222 children and 2259 adults, 98.8% and 99.5% had nasopharyngeal/oropharyngeal specimens and 43.2% and 37.5% had paired serum specimens, respectively. Imputed results increased viral etiology assignments by an absolute difference of 1.6%-4.4% and 0.8%-2.8% in children and adults, respectively; relative differences were 1.1-3.0 times higher. Multiple imputation can be used when serology results are missing, to refine virus-specific prevalence estimates, and these will likely increase estimates.

  5. Assessing Risk Prediction Models Using Individual Participant Data From Multiple Studies

    PubMed Central

    Pennells, Lisa; Kaptoge, Stephen; White, Ian R.; Thompson, Simon G.; Wood, Angela M.; Tipping, Robert W.; Folsom, Aaron R.; Couper, David J.; Ballantyne, Christie M.; Coresh, Josef; Goya Wannamethee, S.; Morris, Richard W.; Kiechl, Stefan; Willeit, Johann; Willeit, Peter; Schett, Georg; Ebrahim, Shah; Lawlor, Debbie A.; Yarnell, John W.; Gallacher, John; Cushman, Mary; Psaty, Bruce M.; Tracy, Russ; Tybjærg-Hansen, Anne; Price, Jackie F.; Lee, Amanda J.; McLachlan, Stela; Khaw, Kay-Tee; Wareham, Nicholas J.; Brenner, Hermann; Schöttker, Ben; Müller, Heiko; Jansson, Jan-Håkan; Wennberg, Patrik; Salomaa, Veikko; Harald, Kennet; Jousilahti, Pekka; Vartiainen, Erkki; Woodward, Mark; D'Agostino, Ralph B.; Bladbjerg, Else-Marie; Jørgensen, Torben; Kiyohara, Yutaka; Arima, Hisatomi; Doi, Yasufumi; Ninomiya, Toshiharu; Dekker, Jacqueline M.; Nijpels, Giel; Stehouwer, Coen D. A.; Kauhanen, Jussi; Salonen, Jukka T.; Meade, Tom W.; Cooper, Jackie A.; Cushman, Mary; Folsom, Aaron R.; Psaty, Bruce M.; Shea, Steven; Döring, Angela; Kuller, Lewis H.; Grandits, Greg; Gillum, Richard F.; Mussolino, Michael; Rimm, Eric B.; Hankinson, Sue E.; Manson, JoAnn E.; Pai, Jennifer K.; Kirkland, Susan; Shaffer, Jonathan A.; Shimbo, Daichi; Bakker, Stephan J. L.; Gansevoort, Ron T.; Hillege, Hans L.; Amouyel, Philippe; Arveiler, Dominique; Evans, Alun; Ferrières, Jean; Sattar, Naveed; Westendorp, Rudi G.; Buckley, Brendan M.; Cantin, Bernard; Lamarche, Benoît; Barrett-Connor, Elizabeth; Wingard, Deborah L.; Bettencourt, Richele; Gudnason, Vilmundur; Aspelund, Thor; Sigurdsson, Gunnar; Thorsson, Bolli; Kavousi, Maryam; Witteman, Jacqueline C.; Hofman, Albert; Franco, Oscar H.; Howard, Barbara V.; Zhang, Ying; Best, Lyle; Umans, Jason G.; Onat, Altan; Sundström, Johan; Michael Gaziano, J.; Stampfer, Meir; Ridker, Paul M.; Michael Gaziano, J.; Ridker, Paul M.; Marmot, Michael; Clarke, Robert; Collins, Rory; Fletcher, Astrid; Brunner, Eric; Shipley, Martin; Kivimäki, Mika; Ridker, Paul M.; Buring, Julie; Cook, Nancy; Ford, Ian; Shepherd, James; Cobbe, Stuart M.; Robertson, Michele; Walker, Matthew; Watson, Sarah; Alexander, Myriam; Butterworth, Adam S.; Angelantonio, Emanuele Di; Gao, Pei; Haycock, Philip; Kaptoge, Stephen; Pennells, Lisa; Thompson, Simon G.; Walker, Matthew; Watson, Sarah; White, Ian R.; Wood, Angela M.; Wormser, David; Danesh, John

    2014-01-01

    Individual participant time-to-event data from multiple prospective epidemiologic studies enable detailed investigation into the predictive ability of risk models. Here we address the challenges in appropriately combining such information across studies. Methods are exemplified by analyses of log C-reactive protein and conventional risk factors for coronary heart disease in the Emerging Risk Factors Collaboration, a collation of individual data from multiple prospective studies with an average follow-up duration of 9.8 years (dates varied). We derive risk prediction models using Cox proportional hazards regression analysis stratified by study and obtain estimates of risk discrimination, Harrell's concordance index, and Royston's discrimination measure within each study; we then combine the estimates across studies using a weighted meta-analysis. Various weighting approaches are compared and lead us to recommend using the number of events in each study. We also discuss the calculation of measures of reclassification for multiple studies. We further show that comparison of differences in predictive ability across subgroups should be based only on within-study information and that combining measures of risk discrimination from case-control studies and prospective studies is problematic. The concordance index and discrimination measure gave qualitatively similar results throughout. While the concordance index was very heterogeneous between studies, principally because of differing age ranges, the increments in the concordance index from adding log C-reactive protein to conventional risk factors were more homogeneous. PMID:24366051

  6. Assessing risk prediction models using individual participant data from multiple studies.

    PubMed

    Pennells, Lisa; Kaptoge, Stephen; White, Ian R; Thompson, Simon G; Wood, Angela M

    2014-03-01

    Individual participant time-to-event data from multiple prospective epidemiologic studies enable detailed investigation into the predictive ability of risk models. Here we address the challenges in appropriately combining such information across studies. Methods are exemplified by analyses of log C-reactive protein and conventional risk factors for coronary heart disease in the Emerging Risk Factors Collaboration, a collation of individual data from multiple prospective studies with an average follow-up duration of 9.8 years (dates varied). We derive risk prediction models using Cox proportional hazards regression analysis stratified by study and obtain estimates of risk discrimination, Harrell's concordance index, and Royston's discrimination measure within each study; we then combine the estimates across studies using a weighted meta-analysis. Various weighting approaches are compared and lead us to recommend using the number of events in each study. We also discuss the calculation of measures of reclassification for multiple studies. We further show that comparison of differences in predictive ability across subgroups should be based only on within-study information and that combining measures of risk discrimination from case-control studies and prospective studies is problematic. The concordance index and discrimination measure gave qualitatively similar results throughout. While the concordance index was very heterogeneous between studies, principally because of differing age ranges, the increments in the concordance index from adding log C-reactive protein to conventional risk factors were more homogeneous.

  7. Weather Impact on Airport Arrival Meter Fix Throughput

    NASA Technical Reports Server (NTRS)

    Wang, Yao

    2017-01-01

    Time-based flow management provides arrival aircraft schedules based on arrival airport conditions, airport capacity, required spacing, and weather conditions. In order to meet a scheduled time at which arrival aircraft can cross an airport arrival meter fix prior to entering the airport terminal airspace, air traffic controllers make regulations on air traffic. Severe weather may create an airport arrival bottleneck if one or more of airport arrival meter fixes are partially or completely blocked by the weather and the arrival demand has not been reduced accordingly. Under these conditions, aircraft are frequently being put in holding patterns until they can be rerouted. A model that predicts the weather impacted meter fix throughput may help air traffic controllers direct arrival flows into the airport more efficiently, minimizing arrival meter fix congestion. This paper presents an analysis of air traffic flows across arrival meter fixes at the Newark Liberty International Airport (EWR). Several scenarios of weather impacted EWR arrival fix flows are described. Furthermore, multiple linear regression and regression tree ensemble learning approaches for translating multiple sector Weather Impacted Traffic Indexes (WITI) to EWR arrival meter fix throughputs are examined. These weather translation models are developed and validated using the EWR arrival flight and weather data for the period of April-September in 2014. This study also compares the performance of the regression tree ensemble with traditional multiple linear regression models for estimating the weather impacted throughputs at each of the EWR arrival meter fixes. For all meter fixes investigated, the results from the regression tree ensemble weather translation models show a stronger correlation between model outputs and observed meter fix throughputs than that produced from multiple linear regression method.

  8. Determination of osteoporosis risk factors using a multiple logistic regression model in postmenopausal Turkish women.

    PubMed

    Akkus, Zeki; Camdeviren, Handan; Celik, Fatma; Gur, Ali; Nas, Kemal

    2005-09-01

    To determine the risk factors of osteoporosis using a multiple binary logistic regression method and to assess the risk variables for osteoporosis, which is a major and growing health problem in many countries. We presented a case-control study, consisting of 126 postmenopausal healthy women as control group and 225 postmenopausal osteoporotic women as the case group. The study was carried out in the Department of Physical Medicine and Rehabilitation, Dicle University, Diyarbakir, Turkey between 1999-2002. The data from the 351 participants were collected using a standard questionnaire that contains 43 variables. A multiple logistic regression model was then used to evaluate the data and to find the best regression model. We classified 80.1% (281/351) of the participants using the regression model. Furthermore, the specificity value of the model was 67% (84/126) of the control group while the sensitivity value was 88% (197/225) of the case group. We found the distribution of residual values standardized for final model to be exponential using the Kolmogorow-Smirnow test (p=0.193). The receiver operating characteristic curve was found successful to predict patients with risk for osteoporosis. This study suggests that low levels of dietary calcium intake, physical activity, education, and longer duration of menopause are independent predictors of the risk of low bone density in our population. Adequate dietary calcium intake in combination with maintaining a daily physical activity, increasing educational level, decreasing birth rate, and duration of breast-feeding may contribute to healthy bones and play a role in practical prevention of osteoporosis in Southeast Anatolia. In addition, the findings of the present study indicate that the use of multivariate statistical method as a multiple logistic regression in osteoporosis, which maybe influenced by many variables, is better than univariate statistical evaluation.

  9. Practical Guidance for Conducting Mediation Analysis With Multiple Mediators Using Inverse Odds Ratio Weighting

    PubMed Central

    Nguyen, Quynh C.; Osypuk, Theresa L.; Schmidt, Nicole M.; Glymour, M. Maria; Tchetgen Tchetgen, Eric J.

    2015-01-01

    Despite the recent flourishing of mediation analysis techniques, many modern approaches are difficult to implement or applicable to only a restricted range of regression models. This report provides practical guidance for implementing a new technique utilizing inverse odds ratio weighting (IORW) to estimate natural direct and indirect effects for mediation analyses. IORW takes advantage of the odds ratio's invariance property and condenses information on the odds ratio for the relationship between the exposure (treatment) and multiple mediators, conditional on covariates, by regressing exposure on mediators and covariates. The inverse of the covariate-adjusted exposure-mediator odds ratio association is used to weight the primary analytical regression of the outcome on treatment. The treatment coefficient in such a weighted regression estimates the natural direct effect of treatment on the outcome, and indirect effects are identified by subtracting direct effects from total effects. Weighting renders treatment and mediators independent, thereby deactivating indirect pathways of the mediators. This new mediation technique accommodates multiple discrete or continuous mediators. IORW is easily implemented and is appropriate for any standard regression model, including quantile regression and survival analysis. An empirical example is given using data from the Moving to Opportunity (1994–2002) experiment, testing whether neighborhood context mediated the effects of a housing voucher program on obesity. Relevant Stata code (StataCorp LP, College Station, Texas) is provided. PMID:25693776

  10. Right and left ventricular volumes in vitro by a new nongeometric method

    NASA Technical Reports Server (NTRS)

    Buckey, J. C.; Beattie, J. M.; Nixon, J. V.; Gaffney, F. A.; Blomqvist, C. G.

    1987-01-01

    We present an evaluation of a new nongeometric technique for calculating right and left ventricular volumes. This method calculates ventricular chamber volumes from multiple cross-sectional echocardiographic views taken from a single point as the echo beam is tilted progressively through the ventricle. Right and left ventricular volumes are calculated from both the approximate short axis and approximate apical position on 20 in vitro human hearts and compared with the actual chamber volumes. The results for both ventricles from both positions are excellent. Correlation coefficients are > 0.95 for all positions; the standard errors are in the range of 5 to 7 mL and the slopes and intercepts for the regression lines are not significantly different from 1 and 0, respectively (except for the left ventricular short-axis intercept). For all positions, approximately 6 to 8 views are needed for peak accuracy (7.5 degrees to 10 degrees separation). This approach offers several advantages. No geometric assumptions about ventricular shape are made. All images are acquired from a single point (or window), and the digitized points can be used to make a three-dimensional reconstruction of the ventricle. Also, during the calculations a volume distribution curve for the ventricle is produced. The shape of this curve can be characteristic for certain situations (ie, right ventricle, short axis) and can be used to make new simple equations for calculating volume. We conclude that this is an accurate nongeometric method for determining both right and left ventricular volumes in vitro.

  11. A Statistical Multimodel Ensemble Approach to Improving Long-Range Forecasting in Pakistan

    DTIC Science & Technology

    2012-03-01

    Impact of global warming on monsoon variability in Pakistan. J. Anim. Pl. Sci., 21, no. 1, 107–110. Gillies, S., T. Murphree, and D. Meyer, 2012...are generated by multiple regression models that relate globally distributed oceanic and atmospheric predictors to local predictands. The...generated by multiple regression models that relate globally distributed oceanic and atmospheric predictors to local predictands. The predictands are

  12. Suppression Situations in Multiple Linear Regression

    ERIC Educational Resources Information Center

    Shieh, Gwowen

    2006-01-01

    This article proposes alternative expressions for the two most prevailing definitions of suppression without resorting to the standardized regression modeling. The formulation provides a simple basis for the examination of their relationship. For the two-predictor regression, the author demonstrates that the previous results in the literature are…

  13. Reduction of shading-derived artifacts in skin chromophore imaging without measurements or assumptions about the shape of the subject

    NASA Astrophysics Data System (ADS)

    Yoshida, Kenichiro; Nishidate, Izumi; Ojima, Nobutoshi; Iwata, Kayoko

    2014-01-01

    To quantitatively evaluate skin chromophores over a wide region of curved skin surface, we propose an approach that suppresses the effect of the shading-derived error in the reflectance on the estimation of chromophore concentrations, without sacrificing the accuracy of that estimation. In our method, we use multiple regression analysis, assuming the absorbance spectrum as the response variable and the extinction coefficients of melanin, oxygenated hemoglobin, and deoxygenated hemoglobin as the predictor variables. The concentrations of melanin and total hemoglobin are determined from the multiple regression coefficients using compensation formulae (CF) based on the diffuse reflectance spectra derived from a Monte Carlo simulation. To suppress the shading-derived error, we investigated three different combinations of multiple regression coefficients for the CF. In vivo measurements with the forearm skin demonstrated that the proposed approach can reduce the estimation errors that are due to shading-derived errors in the reflectance. With the best combination of multiple regression coefficients, we estimated that the ratio of the error to the chromophore concentrations is about 10%. The proposed method does not require any measurements or assumptions about the shape of the subjects; this is an advantage over other studies related to the reduction of shading-derived errors.

  14. Quantitative assessment of cervical vertebral maturation using cone beam computed tomography in Korean girls.

    PubMed

    Byun, Bo-Ram; Kim, Yong-Il; Yamaguchi, Tetsutaro; Maki, Koutaro; Son, Woo-Sung

    2015-01-01

    This study was aimed to examine the correlation between skeletal maturation status and parameters from the odontoid process/body of the second vertebra and the bodies of third and fourth cervical vertebrae and simultaneously build multiple regression models to be able to estimate skeletal maturation status in Korean girls. Hand-wrist radiographs and cone beam computed tomography (CBCT) images were obtained from 74 Korean girls (6-18 years of age). CBCT-generated cervical vertebral maturation (CVM) was used to demarcate the odontoid process and the body of the second cervical vertebra, based on the dentocentral synchondrosis. Correlation coefficient analysis and multiple linear regression analysis were used for each parameter of the cervical vertebrae (P < 0.05). Forty-seven of 64 parameters from CBCT-generated CVM (independent variables) exhibited statistically significant correlations (P < 0.05). The multiple regression model with the greatest R (2) had six parameters (PH2/W2, UW2/W2, (OH+AH2)/LW2, UW3/LW3, D3, and H4/W4) as independent variables with a variance inflation factor (VIF) of <2. CBCT-generated CVM was able to include parameters from the second cervical vertebral body and odontoid process, respectively, for the multiple regression models. This suggests that quantitative analysis might be used to estimate skeletal maturation status.

  15. Using Regression Equations Built from Summary Data in the Psychological Assessment of the Individual Case: Extension to Multiple Regression

    ERIC Educational Resources Information Center

    Crawford, John R.; Garthwaite, Paul H.; Denham, Annie K.; Chelune, Gordon J.

    2012-01-01

    Regression equations have many useful roles in psychological assessment. Moreover, there is a large reservoir of published data that could be used to build regression equations; these equations could then be employed to test a wide variety of hypotheses concerning the functioning of individual cases. This resource is currently underused because…

  16. Expression of anaesthetic and analgesic drug target genes in excised breast tumour tissue: Association with clinical disease recurrence or metastasis.

    PubMed

    Connolly, C; Madden, S F; Buggy, D J; Gallagher, H C

    2017-01-01

    Retrospective analyses suggest anaesthetic-analgesics technique during cancer surgery may affect recurrence/metastasis. This could involve direct effects of anaesthetic-analgesic drugs on cancer cells. While μ-opioid receptor over-expression in lung tumours is associated with greater metastasis, other anaesthetic-analgesic receptor targets in cancer recurrence/metastasis remain unexplored. Therefore, we evaluated the association between genetic expression of anaesthetic-analgesic receptor targets and recurrence/metastasis, using a repository of breast cancer gene expression and matching clinical data. A list of 23 genes encoding for the most prominent anaesthetic-analgesic receptor targets was compiled. This was processed through BreastMark- an algorithm integrating gene expression data from ~17,000 samples and clinical data from >4,500 breast cancer samples. Gene expression data was dichotomized using disease-free survival (survival without recurrence) and distant disease-free survival (survival without metastasis) as end points. Hazard ratios were calculated by Cox-regression analysis. Enrichment for prognostic markers was determined by randomly choosing 23-member gene lists from all available genes, calculating how often >5 significant markers were observed and adjusting p-values for multiple testing. This was repeated 10,000 times and an empirical p-value calculated. Of 23 selected genes, 9 were significantly associated with altered rates of metastasis and 4 with recurrence on univariate analysis. Adjusting for multiple testing, 5 of these 9 genes remained significantly associated with metastasis, non with recurrence. This ratio of genes (5/23) was not significantly enriched for markers of metastasis (p = 0.07). Several anaesthetic-analgesic receptor genes were associated with metastatic spread in breast cancer. Overall there was no significant enrichment in prognostic markers of metastasis, although a trend was observed.

  17. Elbow joint angle and elbow movement velocity estimation using NARX-multiple layer perceptron neural network model with surface EMG time domain parameters.

    PubMed

    Raj, Retheep; Sivanandan, K S

    2017-01-01

    Estimation of elbow dynamics has been the object of numerous investigations. In this work a solution is proposed for estimating elbow movement velocity and elbow joint angle from Surface Electromyography (SEMG) signals. Here the Surface Electromyography signals are acquired from the biceps brachii muscle of human hand. Two time-domain parameters, Integrated EMG (IEMG) and Zero Crossing (ZC), are extracted from the Surface Electromyography signal. The relationship between the time domain parameters, IEMG and ZC with elbow angular displacement and elbow angular velocity during extension and flexion of the elbow are studied. A multiple input-multiple output model is derived for identifying the kinematics of elbow. A Nonlinear Auto Regressive with eXogenous inputs (NARX) structure based multiple layer perceptron neural network (MLPNN) model is proposed for the estimation of elbow joint angle and elbow angular velocity. The proposed NARX MLPNN model is trained using Levenberg-marquardt based algorithm. The proposed model is estimating the elbow joint angle and elbow movement angular velocity with appreciable accuracy. The model is validated using regression coefficient value (R). The average regression coefficient value (R) obtained for elbow angular displacement prediction is 0.9641 and for the elbow anglular velocity prediction is 0.9347. The Nonlinear Auto Regressive with eXogenous inputs (NARX) structure based multiple layer perceptron neural networks (MLPNN) model can be used for the estimation of angular displacement and movement angular velocity of the elbow with good accuracy.

  18. Predicting in-hospital mortality of traffic victims: A comparison between AIS-and ICD-9-CM-related injury severity scales when only ICD-9-CM is reported.

    PubMed

    Van Belleghem, Griet; Devos, Stefanie; De Wit, Liesbet; Hubloue, Ives; Lauwaert, Door; Pien, Karen; Putman, Koen

    2016-01-01

    Injury severity scores are important in the context of developing European and national goals on traffic safety, health-care benchmarking and improving patient communication. Various severity scores are available and are mostly based on Abbreviated Injury Scale (AIS) or International Classification of Diseases (ICD). The aim of this paper is to compare the predictive value for in-hospital mortality between the various severity scores if only International Classification of Diseases, 9th revision, Clinical Modification ICD-9-CM is reported. To estimate severity scores based on the AIS lexicon, ICD-9-CM codes were converted with ICD Programmes for Injury Categorization (ICDPIC) and four AIS-based severity scores were derived: Maximum AIS (MaxAIS), Injury Severity Score (ISS), New Injury Severity Score (NISS) and Exponential Injury Severity Score (EISS). Based on ICD-9-CM, six severity scores were calculated. Determined by the number of injuries taken into account and the means by which survival risk ratios (SRRs) were calculated, four different approaches were used to calculate the ICD-9-based Injury Severity Scores (ICISS). The Trauma Mortality Prediction Model (TMPM) was calculated with the ICD-9-CM-based model averaged regression coefficients (MARC) for both the single worst injury and multiple injuries. Severity scores were compared via model discrimination and calibration. Model comparisons were performed separately for the severity scores based on the single worst injury and multiple injuries. For ICD-9-based scales, estimation of area under the receiver operating characteristic curve (AUROC) ranges between 0.94 and 0.96, while AIS-based scales range between 0.72 and 0.76, respectively. The intercept in the calibration plots is not significantly different from 0 for MaxAIS, ICISS and TMPM. When only ICD-9-CM codes are reported, ICD-9-CM-based severity scores perform better than severity scores based on the conversion to AIS. Copyright © 2015 Elsevier Ltd. All rights reserved.

  19. Tumour regression of uveal melanoma after ruthenium-106 brachytherapy or stereotactic radiotherapy with gamma knife or linear accelerator.

    PubMed

    Georgopoulos, Michael; Zehetmayer, Martin; Ruhswurm, Irene; Toma-Bstaendig, Sabine; Ségur-Eltz, Nikolaus; Sacu, Stefan; Menapace, Rupert

    2003-01-01

    This study assesses differences in relative tumour regression and internal acoustic reflectivity after 3 methods of radiotherapy for uveal melanoma: (1) brachytherapy with ruthenium-106 radioactive plaques (RU), (2) fractionated high-dose gamma knife stereotactic irradiation in 2-3 fractions (GK) or (3) fractionated linear-accelerator-based stereotactic teletherapy in 5 fractions (Linac). Ultrasound measurements of tumour thickness and internal reflectivity were performed with standardised A scan pre-operatively and 3, 6, 9, 12, 18, 24 and 36 months postoperatively. Of 211 patients included in the study, 111 had a complete 3-year follow-up (RU: 41, GK: 37, Linac: 33). Differences in tumour thickness and internal reflectivity were assessed with analysis of variance, and post hoc multiple comparisons were calculated with Tukey's honestly significant difference test. Local tumour control was excellent with all 3 methods (>93%). At 36 months, relative tumour height reduction was 69, 50 and 30% after RU, GK and Linac, respectively. In all 3 treatment groups, internal reflectivity increased from about 30% initially to 60-70% 3 years after treatment. Brachytherapy with ruthenium-106 plaques results in a faster tumour regression as compared to teletherapy with gamma knife or Linac. Internal reflectivity increases comparably in all 3 groups. Besides tumour growth arrest, increasing internal reflectivity is considered as an important factor indicating successful treatment. Copyright 2003 S. Karger AG, Basel

  20. Sample size estimation for alternating logistic regressions analysis of multilevel randomized community trials of under-age drinking.

    PubMed

    Reboussin, Beth A; Preisser, John S; Song, Eun-Young; Wolfson, Mark

    2012-07-01

    Under-age drinking is an enormous public health issue in the USA. Evidence that community level structures may impact on under-age drinking has led to a proliferation of efforts to change the environment surrounding the use of alcohol. Although the focus of these efforts is to reduce drinking by individual youths, environmental interventions are typically implemented at the community level with entire communities randomized to the same intervention condition. A distinct feature of these trials is the tendency of the behaviours of individuals residing in the same community to be more alike than that of others residing in different communities, which is herein called 'clustering'. Statistical analyses and sample size calculations must account for this clustering to avoid type I errors and to ensure an appropriately powered trial. Clustering itself may also be of scientific interest. We consider the alternating logistic regressions procedure within the population-averaged modelling framework to estimate the effect of a law enforcement intervention on the prevalence of under-age drinking behaviours while modelling the clustering at multiple levels, e.g. within communities and within neighbourhoods nested within communities, by using pairwise odds ratios. We then derive sample size formulae for estimating intervention effects when planning a post-test-only or repeated cross-sectional community-randomized trial using the alternating logistic regressions procedure.

  1. Regression Commonality Analysis: A Technique for Quantitative Theory Building

    ERIC Educational Resources Information Center

    Nimon, Kim; Reio, Thomas G., Jr.

    2011-01-01

    When it comes to multiple linear regression analysis (MLR), it is common for social and behavioral science researchers to rely predominately on beta weights when evaluating how predictors contribute to a regression model. Presenting an underutilized statistical technique, this article describes how organizational researchers can use commonality…

  2. Precision Efficacy Analysis for Regression.

    ERIC Educational Resources Information Center

    Brooks, Gordon P.

    When multiple linear regression is used to develop a prediction model, sample size must be large enough to ensure stable coefficients. If the derivation sample size is inadequate, the model may not predict well for future subjects. The precision efficacy analysis for regression (PEAR) method uses a cross- validity approach to select sample sizes…

  3. Modulation of the relationship between external knee adduction moments and medial joint contact forces across subjects and activities.

    PubMed

    Trepczynski, Adam; Kutzner, Ines; Bergmann, Georg; Taylor, William R; Heller, Markus O

    2014-05-01

    The external knee adduction moment (EAM) is often considered a surrogate measure of the distribution of loads across the tibiofemoral joint during walking. This study was undertaken to quantify the relationship between the EAM and directly measured medial tibiofemoral contact forces (Fmed ) in a sample of subjects across a spectrum of activities. The EAM for 9 patients who underwent total knee replacement was calculated using inverse dynamics analysis, while telemetric implants provided Fmed for multiple repetitions of 10 activities, including walking, stair negotiation, sit-to-stand activities, and squatting. The effects of the factors "subject" and "activity" on the relationships between Fmed and EAM were quantified using mixed-effects regression analyses in terms of the root mean square error (RMSE) and the slope of the regression. Across subjects and activities a good correlation between peak EAM and Fmed values was observed, with an overall R(2) value of 0.88. However, the slope of the linear regressions varied between subjects by up to a factor of 2. At peak EAM and Fmed , the RMSE of the regression across all subjects was 35% body weight (%BW), while the maximum error was 127 %BW. The relationship between EAM and Fmed is generally good but varies considerably across subjects and activities. These findings emphasize the limitation of relying solely on the EAM to infer medial joint loading when excessive directed cocontraction of muscles exists and call for further investigations into the soft tissue-related mechanisms that modulate the internal forces at the knee. Copyright © 2014 by the American College of Rheumatology.

  4. Guidelines and Procedures for Computing Time-Series Suspended-Sediment Concentrations and Loads from In-Stream Turbidity-Sensor and Streamflow Data

    USGS Publications Warehouse

    Rasmussen, Patrick P.; Gray, John R.; Glysson, G. Douglas; Ziegler, Andrew C.

    2009-01-01

    In-stream continuous turbidity and streamflow data, calibrated with measured suspended-sediment concentration data, can be used to compute a time series of suspended-sediment concentration and load at a stream site. Development of a simple linear (ordinary least squares) regression model for computing suspended-sediment concentrations from instantaneous turbidity data is the first step in the computation process. If the model standard percentage error (MSPE) of the simple linear regression model meets a minimum criterion, this model should be used to compute a time series of suspended-sediment concentrations. Otherwise, a multiple linear regression model using paired instantaneous turbidity and streamflow data is developed and compared to the simple regression model. If the inclusion of the streamflow variable proves to be statistically significant and the uncertainty associated with the multiple regression model results in an improvement over that for the simple linear model, the turbidity-streamflow multiple linear regression model should be used to compute a suspended-sediment concentration time series. The computed concentration time series is subsequently used with its paired streamflow time series to compute suspended-sediment loads by standard U.S. Geological Survey techniques. Once an acceptable regression model is developed, it can be used to compute suspended-sediment concentration beyond the period of record used in model development with proper ongoing collection and analysis of calibration samples. Regression models to compute suspended-sediment concentrations are generally site specific and should never be considered static, but they represent a set period in a continually dynamic system in which additional data will help verify any change in sediment load, type, and source.

  5. CATEGORICAL REGRESSION ANALYSIS OF ACUTE INHALATION TOXICITY DATA FOR HYDROGEN SULFIDE

    EPA Science Inventory

    Categorical regression is one of the tools offered by the U.S. EPA for derivation of acute reference exposures (AREs), which are dose-response assessments for acute exposures to inhaled chemicals. Categorical regression is used as a meta-analytical technique to calculate probabi...

  6. USE OF LETHALITY DATA DURING CATEGORICAL REGRESSION MODELING OF ACUTE REFERENCE EXPOSURES

    EPA Science Inventory

    Categorical regression is being considered by the U.S. EPA as an additional tool for derivation of acute reference exposures (AREs) to be used for human health risk assessment for exposure to inhaled chemicals. Categorical regression is used to calculate probability-response fun...

  7. The prediction of intelligence in preschool children using alternative models to regression.

    PubMed

    Finch, W Holmes; Chang, Mei; Davis, Andrew S; Holden, Jocelyn E; Rothlisberg, Barbara A; McIntosh, David E

    2011-12-01

    Statistical prediction of an outcome variable using multiple independent variables is a common practice in the social and behavioral sciences. For example, neuropsychologists are sometimes called upon to provide predictions of preinjury cognitive functioning for individuals who have suffered a traumatic brain injury. Typically, these predictions are made using standard multiple linear regression models with several demographic variables (e.g., gender, ethnicity, education level) as predictors. Prior research has shown conflicting evidence regarding the ability of such models to provide accurate predictions of outcome variables such as full-scale intelligence (FSIQ) test scores. The present study had two goals: (1) to demonstrate the utility of a set of alternative prediction methods that have been applied extensively in the natural sciences and business but have not been frequently explored in the social sciences and (2) to develop models that can be used to predict premorbid cognitive functioning in preschool children. Predictions of Stanford-Binet 5 FSIQ scores for preschool-aged children is used to compare the performance of a multiple regression model with several of these alternative methods. Results demonstrate that classification and regression trees provided more accurate predictions of FSIQ scores than does the more traditional regression approach. Implications of these results are discussed.

  8. Can aircraft noise less than or equal 115 to dBA adversely affect reproductive outcome in USAF women?

    NASA Astrophysics Data System (ADS)

    Brubaker, P. A.

    1985-06-01

    It has been suggested, mainly through animal studies, that exposure to high noise levels may be associated with lower birth weight, reduced gestational length and other adverse reproductive outcomes. Few studies have been done on humans to show this association. The Air Force employs pregnant women in areas where there is a high potential for exposure to high noise levels. This study proposes a method to determine if there is an association between high frequency noise levels or = 115 dBA and adverse reproductive outcomes through a review of records and self-administered questionnaires in a case-comparison design. Prevelance rates will be calculated and a multiple logistic regression analysis computed for the independent variables that can affect reproduction.

  9. Chronic exposure to water pollutant trichloroethylene increased epigenetic drift in CD4(+) T cells.

    PubMed

    Gilbert, Kathleen M; Blossom, Sarah J; Erickson, Stephen W; Reisfeld, Brad; Zurlinden, Todd J; Broadfoot, Brannon; West, Kirk; Bai, Shasha; Cooney, Craig A

    2016-05-01

    Autoimmune disease and CD4(+) T-cell alterations are induced in mice exposed to the water pollutant trichloroethylene (TCE). We examined here whether TCE altered gene-specific DNA methylation in CD4(+) T cells as a possible mechanism of immunotoxicity. Naive and effector/memory CD4(+) T cells from mice exposed to TCE (0.5 mg/ml in drinking water) for 40 weeks were examined by bisulfite next-generation DNA sequencing. A probabilistic model calculated from multiple genes showed that TCE decreased methylation control in CD4(+) T cells. Data from individual genes fitted to a quadratic regression model showed that TCE increased gene-specific methylation variance in both CD4 subsets. TCE increased epigenetic drift of specific CpG sites in CD4(+) T cells.

  10. Optimization of fixture layouts of glass laser optics using multiple kernel regression.

    PubMed

    Su, Jianhua; Cao, Enhua; Qiao, Hong

    2014-05-10

    We aim to build an integrated fixturing model to describe the structural properties and thermal properties of the support frame of glass laser optics. Therefore, (a) a near global optimal set of clamps can be computed to minimize the surface shape error of the glass laser optic based on the proposed model, and (b) a desired surface shape error can be obtained by adjusting the clamping forces under various environmental temperatures based on the model. To construct the model, we develop a new multiple kernel learning method and call it multiple kernel support vector functional regression. The proposed method uses two layer regressions to group and order the data sources by the weights of the kernels and the factors of the layers. Because of that, the influences of the clamps and the temperature can be evaluated by grouping them into different layers.

  11. Prediction of anthropometric foot characteristics in children.

    PubMed

    Morrison, Stewart C; Durward, Brian R; Watt, Gordon F; Donaldson, Malcolm D C

    2009-01-01

    The establishment of growth reference values is needed in pediatric practice where pathologic conditions can have a detrimental effect on the growth and development of the pediatric foot. This study aims to use multiple regression to evaluate the effects of multiple predictor variables (height, age, body mass, and gender) on anthropometric characteristics of the peripubescent foot. Two hundred children aged 9 to 12 years were recruited, and three anthropometric measurements of the pediatric foot were recorded (foot length, forefoot width, and navicular height). Multiple regression analysis was conducted, and coefficients for gender, height, and body mass all had significant relationships for the prediction of forefoot width and foot length (P < or = .05, r > or = 0.7). The coefficients for gender and body mass were not significant for the prediction of navicular height (P > or = .05), whereas height was (P < or = .05). Normative growth reference values and prognostic regression equations are presented for the peripubescent foot.

  12. Birthweight Related Factors in Northwestern Iran: Using Quantile Regression Method.

    PubMed

    Fallah, Ramazan; Kazemnejad, Anoshirvan; Zayeri, Farid; Shoghli, Alireza

    2015-11-18

    Birthweight is one of the most important predicting indicators of the health status in adulthood. Having a balanced birthweight is one of the priorities of the health system in most of the industrial and developed countries. This indicator is used to assess the growth and health status of the infants. The aim of this study was to assess the birthweight of the neonates by using quantile regression in Zanjan province. This analytical descriptive study was carried out using pre-registered (March 2010 - March 2012) data of neonates in urban/rural health centers of Zanjan province using multiple-stage cluster sampling. Data were analyzed using multiple linear regressions andquantile regression method and SAS 9.2 statistical software. From 8456 newborn baby, 4146 (49%) were female. The mean age of the mothers was 27.1±5.4 years. The mean birthweight of the neonates was 3104 ± 431 grams. Five hundred and seventy-three patients (6.8%) of the neonates were less than 2500 grams. In all quantiles, gestational age of neonates (p<0.05), weight and educational level of the mothers (p<0.05) showed a linear significant relationship with the i of the neonates. However, sex and birth rank of the neonates, mothers age, place of residence (urban/rural) and career were not significant in all quantiles (p>0.05). This study revealed the results of multiple linear regression and quantile regression were not identical. We strictly recommend the use of quantile regression when an asymmetric response variable or data with outliers is available.

  13. Birthweight Related Factors in Northwestern Iran: Using Quantile Regression Method

    PubMed Central

    Fallah, Ramazan; Kazemnejad, Anoshirvan; Zayeri, Farid; Shoghli, Alireza

    2016-01-01

    Introduction: Birthweight is one of the most important predicting indicators of the health status in adulthood. Having a balanced birthweight is one of the priorities of the health system in most of the industrial and developed countries. This indicator is used to assess the growth and health status of the infants. The aim of this study was to assess the birthweight of the neonates by using quantile regression in Zanjan province. Methods: This analytical descriptive study was carried out using pre-registered (March 2010 - March 2012) data of neonates in urban/rural health centers of Zanjan province using multiple-stage cluster sampling. Data were analyzed using multiple linear regressions andquantile regression method and SAS 9.2 statistical software. Results: From 8456 newborn baby, 4146 (49%) were female. The mean age of the mothers was 27.1±5.4 years. The mean birthweight of the neonates was 3104 ± 431 grams. Five hundred and seventy-three patients (6.8%) of the neonates were less than 2500 grams. In all quantiles, gestational age of neonates (p<0.05), weight and educational level of the mothers (p<0.05) showed a linear significant relationship with the i of the neonates. However, sex and birth rank of the neonates, mothers age, place of residence (urban/rural) and career were not significant in all quantiles (p>0.05). Conclusion: This study revealed the results of multiple linear regression and quantile regression were not identical. We strictly recommend the use of quantile regression when an asymmetric response variable or data with outliers is available. PMID:26925889

  14. Mercury and neuromotor function among children in a rural town in Chile

    PubMed Central

    Huber, Stella Maria; Schomaker, Michael; Heumann, Christian; Schierl, Rudolf; Michalke, Bernhard; Jenni, Oskar G.; Caflisch, Jon; Muñoz, Daniel Moraga; von Ehrenstein, Ondine S.; Radon, Katja

    2016-01-01

    Background Mercury (Hg) exposure from artisanal gold mining has adverse effects on the neuromotor function in adults. However, few studies have examined this relationship in children. Objectives To investigate the impact of Hg exposure on children’s neuromotor function. Methods Cross-sectional data on Hg risk factors and demographics were collected from n = 288 children (response = 68.9%). Based on complete cases (CCs) (n = 130) and multiple imputations (n = 288), associations between fingernail Hg and four different neuromotor function components were calculated using multiple logistic regression adjusted for confounders. Results Of the children, 11.1, 14.9, 63.9, and 10.4% had pathologic pure motor skills, adaptive fine motor skills, adaptive gross motor skills, and static balance, respectively. No significant association between fingernail Hg and any neuromotor component was found. However, Hg burning in the household was significantly associated with children’s pathologic pure motor skills (OR 3.07 95% CI 1.03–9.18). Conclusion Elemental Hg exposure in the household might have adverse long-term effects on children’s pure motor skills. PMID:27078174

  15. Viscoelastic Parameters for Quantifying Liver Fibrosis: Three-Dimensional Multifrequency MR Elastography Study on Thin Liver Rat Slices

    PubMed Central

    Ronot, Maxime; Lambert, Simon A.; Wagner, Mathilde; Garteiser, Philippe; Doblas, Sabrina; Albuquerque, Miguel; Paradis, Valérie; Vilgrain, Valérie; Sinkus, Ralph; Van Beers, Bernard E.

    2014-01-01

    Objective To assess in a high-resolution model of thin liver rat slices which viscoelastic parameter at three-dimensional multifrequency MR elastography has the best diagnostic performance for quantifying liver fibrosis. Materials and Methods The study was approved by the ethics committee for animal care of our institution. Eight normal rats and 42 rats with carbon tetrachloride induced liver fibrosis were used in the study. The rats were sacrificed, their livers were resected and three-dimensional MR elastography of 5±2 mm liver slices was performed at 7T with mechanical frequencies of 500, 600 and 700 Hz. The complex shear, storage and loss moduli, and the coefficient of the frequency power law were calculated. At histopathology, fibrosis and inflammation were assessed with METAVIR score, fibrosis was further quantified with morphometry. The diagnostic value of the viscoelastic parameters for assessing fibrosis severity was evaluated with simple and multiple linear regressions, receiver operating characteristic analysis and Obuchowski measures. Results At simple regression, the shear, storage and loss moduli were associated with the severity of fibrosis. At multiple regression, the storage modulus at 600 Hz was the only parameter associated with fibrosis severity (r = 0.86, p<0.0001). This parameter had an Obuchowski measure of 0.89+/−0.03. This measure was significantly larger than that of the loss modulus (0.78+/−0.04, p = 0.028), but not than that of the complex shear modulus (0.88+/−0.03, p = 0.84). Conclusion Our high resolution, three-dimensional multifrequency MR elastography study of thin liver slices shows that the storage modulus is the viscoelastic parameter that has the best association with the severity of liver fibrosis. However, its diagnostic performance does not differ significantly from that of the complex shear modulus. PMID:24722733

  16. [Culture and quality of life assessment in Chinese populations].

    PubMed

    Xia, Ping; Li, Ning-Xiu; Liu, Chao-Jie; Lü, Yu-Bo; Zhang, Qiang; Ou, Ai-Hua

    2010-07-01

    To investigate the impact of cultural factors on quality of life (QOL) and to identify appropriate ways of dividing sub-populations for population norm-based quality of life assessment. The WHOQOL-BREF was used as a QOL instrument. Another questionnaire was developed to assess cultural values. A cross-sectional survey was undertaken in 1090 Guangzhou residents, which included 635 respondents from communities and 455 patients who visited outpatient departments of hospitals. Cronbach's a coefficients and item-domain correlation coefficients were calculated to test the reliability and validity of the WHOQOL-BREF, respectively. Student t test, ANOVA and stepwise multiple linear regression analysis were performed to identify the variables that might have an impact on the QOL. Two regression models with and without including cultural variables were constructed, and the extent of impact exerted by the cultural factors was assessed through a comparison of the change of adjusted R square values. A total of 1052 (96%) valid questionnaire were returned. The Cronbach's alpha coefficients of the WHOQOL-BREF ranged from 0.67 to 0.78. Age, education, occupation and family income were correlated with all of the domains of the WHOQOL-BREF. Chronic condition was correlated with physical, psychological, and social relationship domains of the WHOQOL-BREF. Gender was correlated with physical and psychological domains of the WHOQOL-BREF. The multiple regression analysis showed that social and demographic factors contributed to 6.3%, 13.6%, 10.4% and 8.7% of the predicted variances for the physical, psychological, social relationship, and environment domains, respectively. Social support, horizontal collectivism, vertical individualism, escape acceptance, fear of death, health value, supernatural belief had a significant impact on QOL. However, social support was the only one factor that had an impact on all of the four QOL domains. It is necessary to divide sub-cultural populations for population norm-based QOL assessment. Further research is needed to develop a practical approach to the sub-cultural population division.

  17. Variation of Annual ET Determined from Water Budgets Across Rural Southeastern Basins Differing in Forest Types

    NASA Astrophysics Data System (ADS)

    Younger, S. E.; Jackson, C. R.

    2017-12-01

    In the Southeastern United States, evapotranspiration (ET) typically accounts for 60-70% of precipitation. Watershed and plot scale experiments show that evergreen forests have higher ET rates than hardwood forests and pastures. However, some plot experiments indicate that certain hardwood species have higher ET than paired evergreens. The complexity of factors influencing ET in mixed land cover watersheds makes identifying the relative influences difficult. Previous watershed scale studies have relied on regression to understand the influences or low flow analysis to indicate growing season differences among watersheds. Existing studies in the southeast investigating ET rates for watersheds with multiple forest cover types have failed to identify a significant forest type effect, but these studies acknowledge small sample sizes. Trends of decreasing streamflow have been recognized in the region and are generally attributed to five key factors, 1.) influences from multiple droughts, 2.) changes in distribution of precipitation, 3.) reforestation of agricultural land, 4.) increasing consumptive uses, or 5.) a combination of these and other factors. This study attempts to address the influence of forest type on long term average annual streamflow and on stream low flows. Long term annual ET rates were calculated as ET = P-Q for 46 USGS gaged basins with daily data for the 1982 - 2014 water years, >40% forest cover, and no large reservoirs. Land cover data was regressed against ET to describe the relationship between each of the forest types in the National Land Cover Database. Regression analysis indicates evergreen land cover has a positive relationship with ET while deciduous and total forest have a negative relationship with ET. Low flow analysis indicates low flows tend to be lower in watersheds with more evergreen cover, and that low flows increase with increasing deciduous cover, although these relationships are noisy. This work suggests considering forest cover type improves understanding of watershed scale ET at annual and seasonal levels which is consistent with historic paired watershed experiments and some plot scale data.

  18. Suicidal ideation, risk factors, and communication with parents. An HBSC study on school children in Estonia, Lithuania, and Luxembourg.

    PubMed

    Mark, Lauraliisa; Samm, Algi; Tooding, Liina-Mai; Sisask, Merike; Aasvee, Katrin; Zaborskis, Apolinaras; Zemaitiene, Nida; Värnik, Airi

    2013-01-01

    Suicide is a leading cause of death among youth. In the year 2002, Lithuania had the 2nd, Luxembourg the 5th, and Estonia the 9th highest suicide rates among 15- to 19-year-olds across 90 countries worldwide. Suicidal ideation is a significant precursor to suicide. To report on the prevalence of and associations between suicidal ideation, smoking, alcohol consumption, physical fighting, bullying, and communication with parents among 15-year-old schoolchildren. The survey analyzes data from the 2005/2006 HBSC study from Estonia, Lithuania, and Luxembourg (N = 4,954). The risk factors were calculated through multinomial logistic regression analyses. The overall prevalence of suicidal ideation in the preceding year was 17%. Suicidal thoughts were associated with communication difficulties with parents (OR from 2.0 to 4.6) and other risk factors, especially multiple risks (OR for 4-5 concurrent risk factors from 4.5 to 13.6). Parent-child communication had a significant mediating effect by decreasing the odds for suicidality and multiple risks. The prevalence estimates were obtained by self-reports. The causal relationships need further investigation. The risk factors studied, particularly multiple risks, were associated with higher odds for suicidal ideation. Good parent-child communication is a significant resource for decreasing suicidal ideation among adolescents.

  19. A Quantum Chemical and Statistical Study of Phenolic Schiff Bases with Antioxidant Activity against DPPH Free Radical

    PubMed Central

    Anouar, El Hassane

    2014-01-01

    Phenolic Schiff bases are known as powerful antioxidants. To select the electronic, 2D and 3D descriptors responsible for the free radical scavenging ability of a series of 30 phenolic Schiff bases, a set of molecular descriptors were calculated by using B3P86 (Becke’s three parameter hybrid functional with Perdew 86 correlation functional) combined with 6-31 + G(d,p) basis set (i.e., at the B3P86/6-31 + G(d,p) level of theory). The chemometric methods, simple and multiple linear regressions (SLR and MLR), principal component analysis (PCA) and hierarchical cluster analysis (HCA) were employed to reduce the dimensionality and to investigate the relationship between the calculated descriptors and the antioxidant activity. The results showed that the antioxidant activity mainly depends on the first and second bond dissociation enthalpies of phenolic hydroxyl groups, the dipole moment and the hydrophobicity descriptors. The antioxidant activity is inversely proportional to the main descriptors. The selected descriptors discriminate the Schiff bases into active and inactive antioxidants. PMID:26784873

  20. Approach of automatic 3D geological mapping: the case of the Kovdor phoscorite-carbonatite complex, NW Russia.

    PubMed

    Kalashnikov, A O; Ivanyuk, G Yu; Mikhailova, J A; Sokharev, V A

    2017-07-31

    We have developed an approach for automatic 3D geological mapping based on conversion of chemical composition of rocks to mineral composition by logical computation. It allows to calculate mineral composition based on bulk rock chemistry, interpolate the mineral composition in the same way as chemical composition, and, finally, build a 3D geological model. The approach was developed for the Kovdor phoscorite-carbonatite complex containing the Kovdor baddeleyite-apatite-magnetite deposit. We used 4 bulk rock chemistry analyses - Fe magn , P 2 O 5 , CO 2 and SiO 2 . We used four techniques for prediction of rock types - calculation of normative mineral compositions (norms), multiple regression, artificial neural network and developed by logical evaluation. The two latter became the best. As a result, we distinguished 14 types of phoscorites (forsterite-apatite-magnetite-carbonate rock), carbonatite and host rocks. The results show good convergence with our petrographical studies of the deposit, and recent manually built maps. The proposed approach can be used as a tool of a deposit genesis reconstruction and preliminary geometallurgical modelling.

  1. Interventions to Improve Medication Adherence in Hypertensive Patients: Systematic Review and Meta-analysis.

    PubMed

    Conn, Vicki S; Ruppar, Todd M; Chase, Jo-Ana D; Enriquez, Maithe; Cooper, Pamela S

    2015-12-01

    This systematic review applied meta-analytic procedures to synthesize medication adherence interventions that focus on adults with hypertension. Comprehensive searching located trials with medication adherence behavior outcomes. Study sample, design, intervention characteristics, and outcomes were coded. Random-effects models were used in calculating standardized mean difference effect sizes. Moderator analyses were conducted using meta-analytic analogues of ANOVA and regression to explore associations between effect sizes and sample, design, and intervention characteristics. Effect sizes were calculated for 112 eligible treatment-vs.-control group outcome comparisons of 34,272 subjects. The overall standardized mean difference effect size between treatment and control subjects was 0.300. Exploratory moderator analyses revealed interventions were most effective among female, older, and moderate- or high-income participants. The most promising intervention components were those linking adherence behavior with habits, giving adherence feedback to patients, self-monitoring of blood pressure, using pill boxes and other special packaging, and motivational interviewing. The most effective interventions employed multiple components and were delivered over many days. Future research should strive for minimizing risks of bias common in this literature, especially avoiding self-report adherence measures.

  2. Evaluation of AUC(0-4) predictive methods for cyclosporine in kidney transplant patients.

    PubMed

    Aoyama, Takahiko; Matsumoto, Yoshiaki; Shimizu, Makiko; Fukuoka, Masamichi; Kimura, Toshimi; Kokubun, Hideya; Yoshida, Kazunari; Yago, Kazuo

    2005-05-01

    Cyclosporine (CyA) is the most commonly used immunosuppressive agent in patients who undergo kidney transplantation. Dosage adjustment of CyA is usually based on trough levels. Recently, trough levels have been replacing the area under the concentration-time curve during the first 4 h after CyA administration (AUC(0-4)). The aim of this study was to compare the predictive values obtained using three different methods of AUC(0-4) monitoring. AUC(0-4) was calculated from 0 to 4 h in early and stable renal transplant patients using the trapezoidal rule. The predicted AUC(0-4) was calculated using three different methods: the multiple regression equation reported by Uchida et al.; Bayesian estimation for modified population pharmacokinetic parameters reported by Yoshida et al.; and modified population pharmacokinetic parameters reported by Cremers et al. The predicted AUC(0-4) was assessed on the basis of predictive bias, precision, and correlation coefficient. The predicted AUC(0-4) values obtained using three methods through measurement of three blood samples showed small differences in predictive bias, precision, and correlation coefficient. In the prediction of AUC(0-4) measurement of one blood sample from stable renal transplant patients, the performance of the regression equation reported by Uchida depended on sampling time. On the other hand, the performance of Bayesian estimation with modified pharmacokinetic parameters reported by Yoshida through measurement of one blood sample, which is not dependent on sampling time, showed a small difference in the correlation coefficient. The prediction of AUC(0-4) using a regression equation required accurate sampling time. In this study, the prediction of AUC(0-4) using Bayesian estimation did not require accurate sampling time in the AUC(0-4) monitoring of CyA. Thus Bayesian estimation is assumed to be clinically useful in the dosage adjustment of CyA.

  3. Weighted regression analysis and interval estimators

    Treesearch

    Donald W. Seegrist

    1974-01-01

    A method for deriving the weighted least squares estimators for the parameters of a multiple regression model. Confidence intervals for expected values, and prediction intervals for the means of future samples are given.

  4. Interpreting Bivariate Regression Coefficients: Going beyond the Average

    ERIC Educational Resources Information Center

    Halcoussis, Dennis; Phillips, G. Michael

    2010-01-01

    Statistics, econometrics, investment analysis, and data analysis classes often review the calculation of several types of averages, including the arithmetic mean, geometric mean, harmonic mean, and various weighted averages. This note shows how each of these can be computed using a basic regression framework. By recognizing when a regression model…

  5. Estimation of premorbid general fluid intelligence using traditional Chinese reading performance in Taiwanese samples.

    PubMed

    Chen, Ying-Jen; Ho, Meng-Yang; Chen, Kwan-Ju; Hsu, Chia-Fen; Ryu, Shan-Jin

    2009-08-01

    The aims of the present study were to (i) investigate if traditional Chinese word reading ability can be used for estimating premorbid general intelligence; and (ii) to provide multiple regression equations for estimating premorbid performance on Raven's Standard Progressive Matrices (RSPM), using age, years of education and Chinese Graded Word Reading Test (CGWRT) scores as predictor variables. Four hundred and twenty-six healthy volunteers (201 male, 225 female), aged 16-93 years (mean +/- SD, 41.92 +/- 18.19 years) undertook the tests individually under supervised conditions. Seventy percent of subjects were randomly allocated to the derivation group (n = 296), and the rest to the validation group (n = 130). RSPM score was positively correlated with CGWRT score and years of education. RSPM and CGWRT scores and years of education were also inversely correlated with age, but the declining trend for RSPM performance against age was steeper than that for CGWRT performance. Separate multiple regression equations were derived for estimating RSPM scores using different combinations of age, years of education, and CGWRT score for both groups. The multiple regression coefficient of each equation ranged from 0.71 to 0.80 with the standard error of estimate between 7 and 8 RSPM points. When fitting the data of one group to the equations derived from its counterpart group, the cross-validation multiple regression coefficients ranged from 0.71 to 0.79. There were no significant differences in the 'predicted-obtained' RSPM discrepancies between any equations. The regression equations derived in the present study may provide a basis for estimating premorbid RSPM performance.

  6. Overall Preference of Running Shoes Can Be Predicted by Suitable Perception Factors Using a Multiple Regression Model.

    PubMed

    Tay, Cheryl Sihui; Sterzing, Thorsten; Lim, Chen Yen; Ding, Rui; Kong, Pui Wah

    2017-05-01

    This study examined (a) the strength of four individual footwear perception factors to influence the overall preference of running shoes and (b) whether these perception factors satisfied the nonmulticollinear assumption in a regression model. Running footwear must fulfill multiple functional criteria to satisfy its potential users. Footwear perception factors, such as fit and cushioning, are commonly used to guide shoe design and development, but it is unclear whether running-footwear users are able to differentiate one factor from another. One hundred casual runners assessed four running shoes on a 15-cm visual analogue scale for four footwear perception factors (fit, cushioning, arch support, and stability) as well as for overall preference during a treadmill running protocol. Diagnostic tests showed an absence of multicollinearity between factors, where values for tolerance ranged from .36 to .72, corresponding to variance inflation factors of 2.8 to 1.4. The multiple regression model of these four footwear perception variables accounted for 77.7% to 81.6% of variance in overall preference, with each factor explaining a unique part of the total variance. Casual runners were able to rate each footwear perception factor separately, thus assigning each factor a true potential to improve overall preference for the users. The results also support the use of a multiple regression model of footwear perception factors to predict overall running shoe preference. Regression modeling is a useful tool for running-shoe manufacturers to more precisely evaluate how individual factors contribute to the subjective assessment of running footwear.

  7. A population-based study on the association between rheumatoid arthritis and voice problems.

    PubMed

    Hah, J Hun; An, Soo-Youn; Sim, Songyong; Kim, So Young; Oh, Dong Jun; Park, Bumjung; Kim, Sung-Gyun; Choi, Hyo Geun

    2016-07-01

    The objective of this study was to investigate whether rheumatoid arthritis increases the frequency of organic laryngeal lesions and the subjective voice complaint rate in those with no organic laryngeal lesion. We performed a cross-sectional study using the data from 19,368 participants (418 rheumatoid arthritis patients and 18,950 controls) of the 2008-2011 Korea National Health and Nutrition Examination Survey. The associations between rheumatoid arthritis and organic laryngeal lesions/subjective voice complaints were analyzed using simple/multiple logistic regression analysis with complex sample adjusting for confounding factors, including age, sex, smoking status, stress level, and body mass index, which could provoke voice problems. Vocal nodules, vocal polyp, and vocal palsy were not associated with rheumatoid arthritis in a multiple regression analysis, and only laryngitis showed a positive association (adjusted odds ratio, 1.59; 95 % confidence interval, 1.01-2.52; P = 0.047). Rheumatoid arthritis was associated with subjective voice discomfort in a simple regression analysis, but not in a multiple regression analysis. Participants with rheumatoid arthritis were older, more often female, and had higher stress levels than those without rheumatoid arthritis. These factors were associated with subjective voice complaints in both simple and multiple regression analyses. Rheumatoid arthritis was not associated with organic laryngeal diseases except laryngitis. Rheumatoid arthritis did not increase the odds ratio for subjective voice complaints. Voice problems in participants with rheumatoid arthritis originated from the characteristics of the rheumatoid arthritis group (higher mean age, female sex, and stress level) rather than rheumatoid arthritis itself.

  8. Predicting MHC-II binding affinity using multiple instance regression

    PubMed Central

    EL-Manzalawy, Yasser; Dobbs, Drena; Honavar, Vasant

    2011-01-01

    Reliably predicting the ability of antigen peptides to bind to major histocompatibility complex class II (MHC-II) molecules is an essential step in developing new vaccines. Uncovering the amino acid sequence correlates of the binding affinity of MHC-II binding peptides is important for understanding pathogenesis and immune response. The task of predicting MHC-II binding peptides is complicated by the significant variability in their length. Most existing computational methods for predicting MHC-II binding peptides focus on identifying a nine amino acids core region in each binding peptide. We formulate the problems of qualitatively and quantitatively predicting flexible length MHC-II peptides as multiple instance learning and multiple instance regression problems, respectively. Based on this formulation, we introduce MHCMIR, a novel method for predicting MHC-II binding affinity using multiple instance regression. We present results of experiments using several benchmark datasets that show that MHCMIR is competitive with the state-of-the-art methods for predicting MHC-II binding peptides. An online web server that implements the MHCMIR method for MHC-II binding affinity prediction is freely accessible at http://ailab.cs.iastate.edu/mhcmir. PMID:20855923

  9. Multiple-Shrinkage Multinomial Probit Models with Applications to Simulating Geographies in Public Use Data.

    PubMed

    Burgette, Lane F; Reiter, Jerome P

    2013-06-01

    Multinomial outcomes with many levels can be challenging to model. Information typically accrues slowly with increasing sample size, yet the parameter space expands rapidly with additional covariates. Shrinking all regression parameters towards zero, as often done in models of continuous or binary response variables, is unsatisfactory, since setting parameters equal to zero in multinomial models does not necessarily imply "no effect." We propose an approach to modeling multinomial outcomes with many levels based on a Bayesian multinomial probit (MNP) model and a multiple shrinkage prior distribution for the regression parameters. The prior distribution encourages the MNP regression parameters to shrink toward a number of learned locations, thereby substantially reducing the dimension of the parameter space. Using simulated data, we compare the predictive performance of this model against two other recently-proposed methods for big multinomial models. The results suggest that the fully Bayesian, multiple shrinkage approach can outperform these other methods. We apply the multiple shrinkage MNP to simulating replacement values for areal identifiers, e.g., census tract indicators, in order to protect data confidentiality in public use datasets.

  10. Iodine intake by adult residents of a farming area in Iwate Prefecture, Japan, and the accuracy of estimated iodine intake calculated using the Standard Tables of Food Composition in Japan.

    PubMed

    Nakatsuka, Haruo; Chiba, Keiko; Watanabe, Takao; Sawatari, Hideyuki; Seki, Takako

    2016-11-01

    Iodine intake by adults in farming districts in Northeastern Japan was evaluated by two methods: (1) government-approved food composition tables based calculation and (2) instrumental measurement. The correlation between these two values and a regression model for the calibration of calculated values was presented. Iodine intake was calculated, using the values in the Japan Standard Tables of Food Composition (FCT), through the analysis of duplicate samples of complete 24-h food consumption for 90 adult subjects. In cases where the value for iodine content was not available in the FCT, it was assumed to be zero for that food item (calculated values). Iodine content was also measured by ICP-MS (measured values). Calculated and measured values rendered geometric means (GM) of 336 and 279 μg/day, respectively. There was no statistically significant (p > 0.05) difference between calculated and measured values. The correlation coefficient was 0.646 (p < 0.05). With this high correlation coefficient, a simple regression line can be applied to estimate measured value from calculated value. A survey of the literature suggests that the values in this study were similar to values that have been reported to date for Japan, and higher than those for other countries in Asia. Iodine intake of Japanese adults was 336 μg/day (GM, calculated) and 279 μg/day (GM, measured). Both values correlated so well, with a correlation coefficient of 0.646, that a regression model (Y = 130.8 + 1.9479X, where X and Y are measured and calculated values, respectively) could be used to calibrate calculated values.

  11. Specific interactions of alcohols and non-alcohols with a biologically active boronic acid derivative: a spectroscopic study.

    PubMed

    Geethanjali, H S; Melavanki, R M; Nagaraja, D; Patil, N R; Thipperudrappa, J; Kusanur, R A

    2016-08-01

    The photophysical properties of 4-fluoro-2-methoxyphenyl boronic acid (4FMPBA) are characterized using absorption and fluorescence techniques in series of non-alcohols and alcohols. The results are analyzed using different solvent polarity functions and Kamlet and Catalan's multiple regression approaches. The excited state dipole moment and change in dipole moment are calculated using both the solvatochromic shift method and Reichardt's microscopic solvent polarity parameter ETN. The ground state dipole moment is evaluated using quantum chemical calculations. It is found that general solute-solvent and hydrogen bond interactions are operative in this system. A red shift of ~ 9 nm in the emission spectra is observed with an increase in the solvent polarity, which depicts π→π(*) transitions, as well as the possibility of an intramolecular charge transfer (ICT) character in the emitting singlet state of 4FMPBA. The relative quantum yield, radiative and non-radiative decay constants are calculated in alkanes and alcohols using the single point method. It is found that the quantum yield of the molecule varies from 16.81% to 50.79% with the change in solvent polarity, indicating the dependence of fluorescence on the solvent environment. Copyright © 2015 John Wiley & Sons, Ltd. Copyright © 2015 John Wiley & Sons, Ltd.

  12. Kendall-Theil Robust Line (KTRLine--version 1.0)-A Visual Basic Program for Calculating and Graphing Robust Nonparametric Estimates of Linear-Regression Coefficients Between Two Continuous Variables

    USGS Publications Warehouse

    Granato, Gregory E.

    2006-01-01

    The Kendall-Theil Robust Line software (KTRLine-version 1.0) is a Visual Basic program that may be used with the Microsoft Windows operating system to calculate parameters for robust, nonparametric estimates of linear-regression coefficients between two continuous variables. The KTRLine software was developed by the U.S. Geological Survey, in cooperation with the Federal Highway Administration, for use in stochastic data modeling with local, regional, and national hydrologic data sets to develop planning-level estimates of potential effects of highway runoff on the quality of receiving waters. The Kendall-Theil robust line was selected because this robust nonparametric method is resistant to the effects of outliers and nonnormality in residuals that commonly characterize hydrologic data sets. The slope of the line is calculated as the median of all possible pairwise slopes between points. The intercept is calculated so that the line will run through the median of input data. A single-line model or a multisegment model may be specified. The program was developed to provide regression equations with an error component for stochastic data generation because nonparametric multisegment regression tools are not available with the software that is commonly used to develop regression models. The Kendall-Theil robust line is a median line and, therefore, may underestimate total mass, volume, or loads unless the error component or a bias correction factor is incorporated into the estimate. Regression statistics such as the median error, the median absolute deviation, the prediction error sum of squares, the root mean square error, the confidence interval for the slope, and the bias correction factor for median estimates are calculated by use of nonparametric methods. These statistics, however, may be used to formulate estimates of mass, volume, or total loads. The program is used to read a two- or three-column tab-delimited input file with variable names in the first row and data in subsequent rows. The user may choose the columns that contain the independent (X) and dependent (Y) variable. A third column, if present, may contain metadata such as the sample-collection location and date. The program screens the input files and plots the data. The KTRLine software is a graphical tool that facilitates development of regression models by use of graphs of the regression line with data, the regression residuals (with X or Y), and percentile plots of the cumulative frequency of the X variable, Y variable, and the regression residuals. The user may individually transform the independent and dependent variables to reduce heteroscedasticity and to linearize data. The program plots the data and the regression line. The program also prints model specifications and regression statistics to the screen. The user may save and print the regression results. The program can accept data sets that contain up to about 15,000 XY data points, but because the program must sort the array of all pairwise slopes, the program may be perceptibly slow with data sets that contain more than about 1,000 points.

  13. Quantile Regression in the Study of Developmental Sciences

    ERIC Educational Resources Information Center

    Petscher, Yaacov; Logan, Jessica A. R.

    2014-01-01

    Linear regression analysis is one of the most common techniques applied in developmental research, but only allows for an estimate of the average relations between the predictor(s) and the outcome. This study describes quantile regression, which provides estimates of the relations between the predictor(s) and outcome, but across multiple points of…

  14. Maintenance Operations in Mission Oriented Protective Posture Level IV (MOPPIV)

    DTIC Science & Technology

    1987-10-01

    Repair FADAC Printed Circuit Board ............. 6 3. Data Analysis Techniques ............................. 6 a. Multiple Linear Regression... ANALYSIS /DISCUSSION ............................... 12 1. Exa-ple of Regression Analysis ..................... 12 S2. Regression results for all tasks...6 * TABLE 9. Task Grouping for Analysis ........................ 7 "TABXLE 10. Remove/Replace H60A3 Power Pack................. 8 TABLE

  15. Interactions between cadmium and decabrominated diphenyl ether on blood cells count in rats-Multiple factorial regression analysis.

    PubMed

    Curcic, Marijana; Buha, Aleksandra; Stankovic, Sanja; Milovanovic, Vesna; Bulat, Zorica; Đukić-Ćosić, Danijela; Antonijević, Evica; Vučinić, Slavica; Matović, Vesna; Antonijevic, Biljana

    2017-02-01

    The objective of this study was to assess toxicity of Cd and BDE-209 mixture on haematological parameters in subacutely exposed rats and to determine the presence and type of interactions between these two chemicals using multiple factorial regression analysis. Furthermore, for the assessment of interaction type, an isobologram based methodology was applied and compared with multiple factorial regression analysis. Chemicals were given by oral gavage to the male Wistar rats weighing 200-240g for 28days. Animals were divided in 16 groups (8/group): control vehiculum group, three groups of rats were treated with 2.5, 7.5 or 15mg Cd/kg/day. These doses were chosen on the bases of literature data and reflect relatively high Cd environmental exposure, three groups of rats were treated with 1000, 2000 or 4000mg BDE-209/kg/bw/day, doses proved to induce toxic effects in rats. Furthermore, nine groups of animals were treated with different mixtures of Cd and BDE-209 containing doses of Cd and BDE-209 stated above. Blood samples were taken at the end of experiment and red blood cells, white blood cells and platelets counts were determined. For interaction assessment multiple factorial regression analysis and fitted isobologram approach were used. In this study, we focused on multiple factorial regression analysis as a method for interaction assessment. We also investigated the interactions between Cd and BDE-209 by the derived model for the description of the obtained fitted isobologram curves. Current study indicated that co-exposure to Cd and BDE-209 can result in significant decrease in RBC count, increase in WBC count and decrease in PLT count, when compared with controls. Multiple factorial regression analysis used for the assessment of interactions type between Cd and BDE-209 indicated synergism for the effect on RBC count and no interactions i.e. additivity for the effects on WBC and PLT counts. On the other hand, isobologram based approach showed slight antagonism for the effects on RBC and WBC while no interactions were proved for the joint effect on PLT count. These results confirm that the assessment of interactions between chemicals in the mixture greatly depends on the concept or method used for this evaluation. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.

  16. Estimation of 1RM for knee extension based on the maximal isometric muscle strength and body composition.

    PubMed

    Kanada, Yoshikiyo; Sakurai, Hiroaki; Sugiura, Yoshito; Arai, Tomoaki; Koyama, Soichiro; Tanabe, Shigeo

    2017-11-01

    [Purpose] To create a regression formula in order to estimate 1RM for knee extensors, based on the maximal isometric muscle strength measured using a hand-held dynamometer and data regarding the body composition. [Subjects and Methods] Measurement was performed in 21 healthy males in their twenties to thirties. Single regression analysis was performed, with measurement values representing 1RM and the maximal isometric muscle strength as dependent and independent variables, respectively. Furthermore, multiple regression analysis was performed, with data regarding the body composition incorporated as another independent variable, in addition to the maximal isometric muscle strength. [Results] Through single regression analysis with the maximal isometric muscle strength as an independent variable, the following regression formula was created: 1RM (kg)=0.714 + 0.783 × maximal isometric muscle strength (kgf). On multiple regression analysis, only the total muscle mass was extracted. [Conclusion] A highly accurate regression formula to estimate 1RM was created based on both the maximal isometric muscle strength and body composition. Using a hand-held dynamometer and body composition analyzer, it was possible to measure these items in a short time, and obtain clinically useful results.

  17. Data Mining Methods Applied to Flight Operations Quality Assurance Data: A Comparison to Standard Statistical Methods

    NASA Technical Reports Server (NTRS)

    Stolzer, Alan J.; Halford, Carl

    2007-01-01

    In a previous study, multiple regression techniques were applied to Flight Operations Quality Assurance-derived data to develop parsimonious model(s) for fuel consumption on the Boeing 757 airplane. The present study examined several data mining algorithms, including neural networks, on the fuel consumption problem and compared them to the multiple regression results obtained earlier. Using regression methods, parsimonious models were obtained that explained approximately 85% of the variation in fuel flow. In general data mining methods were more effective in predicting fuel consumption. Classification and Regression Tree methods reported correlation coefficients of .91 to .92, and General Linear Models and Multilayer Perceptron neural networks reported correlation coefficients of about .99. These data mining models show great promise for use in further examining large FOQA databases for operational and safety improvements.

  18. Quantum Chemically Estimated Abraham Solute Parameters Using Multiple Solvent-Water Partition Coefficients and Molecular Polarizability.

    PubMed

    Liang, Yuzhen; Xiong, Ruichang; Sandler, Stanley I; Di Toro, Dominic M

    2017-09-05

    Polyparameter Linear Free Energy Relationships (pp-LFERs), also called Linear Solvation Energy Relationships (LSERs), are used to predict many environmentally significant properties of chemicals. A method is presented for computing the necessary chemical parameters, the Abraham parameters (AP), used by many pp-LFERs. It employs quantum chemical calculations and uses only the chemical's molecular structure. The method computes the Abraham E parameter using density functional theory computed molecular polarizability and the Clausius-Mossotti equation relating the index refraction to the molecular polarizability, estimates the Abraham V as the COSMO calculated molecular volume, and computes the remaining AP S, A, and B jointly with a multiple linear regression using sixty-five solvent-water partition coefficients computed using the quantum mechanical COSMO-SAC solvation model. These solute parameters, referred to as Quantum Chemically estimated Abraham Parameters (QCAP), are further adjusted by fitting to experimentally based APs using QCAP parameters as the independent variables so that they are compatible with existing Abraham pp-LFERs. QCAP and adjusted QCAP for 1827 neutral chemicals are included. For 24 solvent-water systems including octanol-water, predicted log solvent-water partition coefficients using adjusted QCAP have the smallest root-mean-square errors (RMSEs, 0.314-0.602) compared to predictions made using APs estimated using the molecular fragment based method ABSOLV (0.45-0.716). For munition and munition-like compounds, adjusted QCAP has much lower RMSE (0.860) than does ABSOLV (4.45) which essentially fails for these compounds.

  19. Automated brain volumetrics in multiple sclerosis: a step closer to clinical application.

    PubMed

    Wang, C; Beadnall, H N; Hatton, S N; Bader, G; Tomic, D; Silva, D G; Barnett, M H

    2016-07-01

    Whole brain volume (WBV) estimates in patients with multiple sclerosis (MS) correlate more robustly with clinical disability than traditional, lesion-based metrics. Numerous algorithms to measure WBV have been developed over the past two decades. We compare Structural Image Evaluation using Normalisation of Atrophy-Cross-sectional (SIENAX) to NeuroQuant and MSmetrix, for assessment of cross-sectional WBV in patients with MS. MRIs from 61 patients with relapsing-remitting MS and 2 patients with clinically isolated syndrome were analysed. WBV measurements were calculated using SIENAX, NeuroQuant and MSmetrix. Statistical agreement between the methods was evaluated using linear regression and Bland-Altman plots. Precision and accuracy of WBV measurement was calculated for (1) NeuroQuant versus SIENAX and (2) MSmetrix versus SIENAX. Precision (Pearson's r) of WBV estimation for NeuroQuant and MSmetrix versus SIENAX was 0.983 and 0.992, respectively. Accuracy (Cb) was 0.871 and 0.994, respectively. NeuroQuant and MSmetrix showed a 5.5% and 1.0% volume difference compared with SIENAX, respectively, that was consistent across low and high values. In the analysed population, NeuroQuant and MSmetrix both quantified cross-sectional WBV with comparable statistical agreement to SIENAX, a well-validated cross-sectional tool that has been used extensively in MS clinical studies. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://www.bmj.com/company/products-services/rights-and-licensing/

  20. Cross Validation of Selection of Variables in Multiple Regression.

    DTIC Science & Technology

    1979-12-01

    55 vii CROSS VALIDATION OF SELECTION OF VARIABLES IN MULTIPLE REGRESSION I Introduction Background Long term DoD planning gcals...028545024 .31109000 BF * SS - .008700618 .0471961 Constant - .70977903 85.146786 55 had adequate predictive capabilities; the other two models (the...71ZCO F111D Control 54 73EGO FlIID Computer, General Purpose 55 73EPO FII1D Converter-Multiplexer 56 73HAO flllD Stabilizer Platform 57 73HCO F1ID

  1. Quantitative Assessment of Cervical Vertebral Maturation Using Cone Beam Computed Tomography in Korean Girls

    PubMed Central

    Byun, Bo-Ram; Kim, Yong-Il; Maki, Koutaro; Son, Woo-Sung

    2015-01-01

    This study was aimed to examine the correlation between skeletal maturation status and parameters from the odontoid process/body of the second vertebra and the bodies of third and fourth cervical vertebrae and simultaneously build multiple regression models to be able to estimate skeletal maturation status in Korean girls. Hand-wrist radiographs and cone beam computed tomography (CBCT) images were obtained from 74 Korean girls (6–18 years of age). CBCT-generated cervical vertebral maturation (CVM) was used to demarcate the odontoid process and the body of the second cervical vertebra, based on the dentocentral synchondrosis. Correlation coefficient analysis and multiple linear regression analysis were used for each parameter of the cervical vertebrae (P < 0.05). Forty-seven of 64 parameters from CBCT-generated CVM (independent variables) exhibited statistically significant correlations (P < 0.05). The multiple regression model with the greatest R 2 had six parameters (PH2/W2, UW2/W2, (OH+AH2)/LW2, UW3/LW3, D3, and H4/W4) as independent variables with a variance inflation factor (VIF) of <2. CBCT-generated CVM was able to include parameters from the second cervical vertebral body and odontoid process, respectively, for the multiple regression models. This suggests that quantitative analysis might be used to estimate skeletal maturation status. PMID:25878721

  2. Adjusted variable plots for Cox's proportional hazards regression model.

    PubMed

    Hall, C B; Zeger, S L; Bandeen-Roche, K J

    1996-01-01

    Adjusted variable plots are useful in linear regression for outlier detection and for qualitative evaluation of the fit of a model. In this paper, we extend adjusted variable plots to Cox's proportional hazards model for possibly censored survival data. We propose three different plots: a risk level adjusted variable (RLAV) plot in which each observation in each risk set appears, a subject level adjusted variable (SLAV) plot in which each subject is represented by one point, and an event level adjusted variable (ELAV) plot in which the entire risk set at each failure event is represented by a single point. The latter two plots are derived from the RLAV by combining multiple points. In each point, the regression coefficient and standard error from a Cox proportional hazards regression is obtained by a simple linear regression through the origin fit to the coordinates of the pictured points. The plots are illustrated with a reanalysis of a dataset of 65 patients with multiple myeloma.

  3. Analysis of Binary Adherence Data in the Setting of Polypharmacy: A Comparison of Different Approaches

    PubMed Central

    Esserman, Denise A.; Moore, Charity G.; Roth, Mary T.

    2009-01-01

    Older community dwelling adults often take multiple medications for numerous chronic diseases. Non-adherence to these medications can have a large public health impact. Therefore, the measurement and modeling of medication adherence in the setting of polypharmacy is an important area of research. We apply a variety of different modeling techniques (standard linear regression; weighted linear regression; adjusted linear regression; naïve logistic regression; beta-binomial (BB) regression; generalized estimating equations (GEE)) to binary medication adherence data from a study in a North Carolina based population of older adults, where each medication an individual was taking was classified as adherent or non-adherent. In addition, through simulation we compare these different methods based on Type I error rates, bias, power, empirical 95% coverage, and goodness of fit. We find that estimation and inference using GEE is robust to a wide variety of scenarios and we recommend using this in the setting of polypharmacy when adherence is dichotomously measured for multiple medications per person. PMID:20414358

  4. Selection of a Geostatistical Method to Interpolate Soil Properties of the State Crop Testing Fields using Attributes of a Digital Terrain Model

    NASA Astrophysics Data System (ADS)

    Sahabiev, I. A.; Ryazanov, S. S.; Kolcova, T. G.; Grigoryan, B. R.

    2018-03-01

    The three most common techniques to interpolate soil properties at a field scale—ordinary kriging (OK), regression kriging with multiple linear regression drift model (RK + MLR), and regression kriging with principal component regression drift model (RK + PCR)—were examined. The results of the performed study were compiled into an algorithm of choosing the most appropriate soil mapping technique. Relief attributes were used as the auxiliary variables. When spatial dependence of a target variable was strong, the OK method showed more accurate interpolation results, and the inclusion of the auxiliary data resulted in an insignificant improvement in prediction accuracy. According to the algorithm, the RK + PCR method effectively eliminates multicollinearity of explanatory variables. However, if the number of predictors is less than ten, the probability of multicollinearity is reduced, and application of the PCR becomes irrational. In that case, the multiple linear regression should be used instead.

  5. Genetic Programming Transforms in Linear Regression Situations

    NASA Astrophysics Data System (ADS)

    Castillo, Flor; Kordon, Arthur; Villa, Carlos

    The chapter summarizes the use of Genetic Programming (GP) inMultiple Linear Regression (MLR) to address multicollinearity and Lack of Fit (LOF). The basis of the proposed method is applying appropriate input transforms (model respecification) that deal with these issues while preserving the information content of the original variables. The transforms are selected from symbolic regression models with optimal trade-off between accuracy of prediction and expressional complexity, generated by multiobjective Pareto-front GP. The chapter includes a comparative study of the GP-generated transforms with Ridge Regression, a variant of ordinary Multiple Linear Regression, which has been a useful and commonly employed approach for reducing multicollinearity. The advantages of GP-generated model respecification are clearly defined and demonstrated. Some recommendations for transforms selection are given as well. The application benefits of the proposed approach are illustrated with a real industrial application in one of the broadest empirical modeling areas in manufacturing - robust inferential sensors. The chapter contributes to increasing the awareness of the potential of GP in statistical model building by MLR.

  6. A Solution to Separation and Multicollinearity in Multiple Logistic Regression

    PubMed Central

    Shen, Jianzhao; Gao, Sujuan

    2010-01-01

    In dementia screening tests, item selection for shortening an existing screening test can be achieved using multiple logistic regression. However, maximum likelihood estimates for such logistic regression models often experience serious bias or even non-existence because of separation and multicollinearity problems resulting from a large number of highly correlated items. Firth (1993, Biometrika, 80(1), 27–38) proposed a penalized likelihood estimator for generalized linear models and it was shown to reduce bias and the non-existence problems. The ridge regression has been used in logistic regression to stabilize the estimates in cases of multicollinearity. However, neither solves the problems for each other. In this paper, we propose a double penalized maximum likelihood estimator combining Firth’s penalized likelihood equation with a ridge parameter. We present a simulation study evaluating the empirical performance of the double penalized likelihood estimator in small to moderate sample sizes. We demonstrate the proposed approach using a current screening data from a community-based dementia study. PMID:20376286

  7. A Solution to Separation and Multicollinearity in Multiple Logistic Regression.

    PubMed

    Shen, Jianzhao; Gao, Sujuan

    2008-10-01

    In dementia screening tests, item selection for shortening an existing screening test can be achieved using multiple logistic regression. However, maximum likelihood estimates for such logistic regression models often experience serious bias or even non-existence because of separation and multicollinearity problems resulting from a large number of highly correlated items. Firth (1993, Biometrika, 80(1), 27-38) proposed a penalized likelihood estimator for generalized linear models and it was shown to reduce bias and the non-existence problems. The ridge regression has been used in logistic regression to stabilize the estimates in cases of multicollinearity. However, neither solves the problems for each other. In this paper, we propose a double penalized maximum likelihood estimator combining Firth's penalized likelihood equation with a ridge parameter. We present a simulation study evaluating the empirical performance of the double penalized likelihood estimator in small to moderate sample sizes. We demonstrate the proposed approach using a current screening data from a community-based dementia study.

  8. The role of social support in anxiety for persons with COPD.

    PubMed

    Dinicola, Gia; Julian, Laura; Gregorich, Steven E; Blanc, Paul D; Katz, Patricia P

    2013-02-01

    This study examined the contribution of perceived social support to the presence of anxiety in persons with chronic obstructive pulmonary disease (COPD). A cross-sectional survey sample of 452 persons with COPD (61.3% female; 53.5% older than 65; 70.8% without a college degree or higher educational achievement, and 54.8% with household income of $40,000 or less) completed a telephone survey. Measures included the anxiety subscale of the Hospital Anxiety and Depression Scale (HADS-A), 5 social support subscales from the Positive and Negative Social Exchanges (PANSE) Scale, a COPD Severity Score (CSS; a weighted algorithmic combination of symptoms and the need for various COPD medical interventions), and the Geriatric Depression Scale, Short Form (GDS-SF). Zero order correlations and a series of multiple regression analyses were calculated. Multiple regression analysis revealed that the receipt of instrumental support, feeling let down by the failure of others to provide needed help, and unsympathetic or insensitive behavior from others each positively predicted a higher level of patient anxiety in COPD patients, after controlling for demographic variables, smoking status, comorbid depression (GDS) and severity of illness (CSS). Additionally, the control variable of depression was the strongest predictor of anxiety, suggesting a high degree of co-morbidity in this sample. Anxiety and depression are serious co-morbid mental health concerns for persons with COPD. It is important to examine both positive and negative aspects of perceived social support for COPD patients and how they may impact or interact with these mental health concerns. Copyright © 2012 Elsevier Inc. All rights reserved.

  9. Risk Assessment and Prediction of Flyrock Distance by Combined Multiple Regression Analysis and Monte Carlo Simulation of Quarry Blasting

    NASA Astrophysics Data System (ADS)

    Armaghani, Danial Jahed; Mahdiyar, Amir; Hasanipanah, Mahdi; Faradonbeh, Roohollah Shirani; Khandelwal, Manoj; Amnieh, Hassan Bakhshandeh

    2016-09-01

    Flyrock is considered as one of the main causes of human injury, fatalities, and structural damage among all undesirable environmental impacts of blasting. Therefore, it seems that the proper prediction/simulation of flyrock is essential, especially in order to determine blast safety area. If proper control measures are taken, then the flyrock distance can be controlled, and, in return, the risk of damage can be reduced or eliminated. The first objective of this study was to develop a predictive model for flyrock estimation based on multiple regression (MR) analyses, and after that, using the developed MR model, flyrock phenomenon was simulated by the Monte Carlo (MC) approach. In order to achieve objectives of this study, 62 blasting operations were investigated in Ulu Tiram quarry, Malaysia, and some controllable and uncontrollable factors were carefully recorded/calculated. The obtained results of MC modeling indicated that this approach is capable of simulating flyrock ranges with a good level of accuracy. The mean of simulated flyrock by MC was obtained as 236.3 m, while this value was achieved as 238.6 m for the measured one. Furthermore, a sensitivity analysis was also conducted to investigate the effects of model inputs on the output of the system. The analysis demonstrated that powder factor is the most influential parameter on fly rock among all model inputs. It is noticeable that the proposed MR and MC models should be utilized only in the studied area and the direct use of them in the other conditions is not recommended.

  10. Diabetes and tooth loss: an analysis of data from the National Health and Nutrition Examination Survey, 2003-2004.

    PubMed

    Patel, Manthan H; Kumar, Jayanth V; Moss, Mark E

    2013-05-01

    The authors conducted an analysis of data from the National Health and Nutrition Examination Survey (NHANES) to understand the association between diabetes and tooth loss in the United States. The authors analyzed the oral examination and self-reported diabetes data obtained from the NHANES 2003-2004 cycle and included 2,508 participants representing a civilian, noninstitutionalized U.S. population 50 years and older. The authors calculated the prevalence of edentulism and the number of missing teeth among dentate people, and they used multiple regression analyses to assess the association between diabetes and tooth loss. The prevalence of edentulism was 28 percent and 14 percent among people with and without diabetes, respectively. The multiple logistic regression analysis revealed that people with diabetes were more likely to be edentulous than were those without diabetes (adjusted odds ratio = 2.25; 95 percent confidence interval, 1.19-4.21). Among dentate adults, those with diabetes had a higher number of missing teeth than did adults without diabetes (mean [standard error {SE}] = 9.8 [0.67]), mean [SE] = 6.7 [0.29]); P < .01). These study results revealed that adults with diabetes are at higher risk of experiencing tooth loss and edentulism than are adults without diabetes. One of every five cases of edentulism in the United States is linked to diabetes. Practical Implications. Although the association between diabetes and periodontal disease is well established, health care professionals also need to recognize the risk of tooth loss and its effect on quality of life among people with diabetes.

  11. Sasang Constitution as a Risk Factor for Diabetes Mellitus: A Cross-Sectional Study

    PubMed Central

    Lee, Tae-Gyu; Koh, Byunghee

    2009-01-01

    Sasang Constitutional Medicine, which is a branch of traditional Korean medicine, states that medications for diabetes should be individualized according to the patient's individual constitution. However, the effect of constitution on diabetes has not been evaluated to date. Therefore, this study was conducted to determine if constitution is an independent risk factor for diabetes by comparing the prevalence and odds ratios (ORs) of the disease according to constitution. The medical records of 1443 adults who had been examined and classified based on their constitution at Kyung Hee University Hospital in Seoul, Korea were reviewed. A chi-squared test and Fisher's exact test were used to compare the prevalence of diabetes according to constitution, and multiple logistic regression was used to calculate the ORs for diabetes. The prevalence of diabetes differed significantly according to constitution (χ2 = 36.20, df = 2, P < 0.001). Specifically, the prevalence of the disease was higher in Tae-eumin (11.4%) individuals than in Soyangin (5.0%) or Soeumin (1.7%) individuals. In addition, multiple logistic regression revealed that Tae-eumin individuals had a greater risk for diabetes than Soeumin individuals. When compared to Soeumin individuals, the adjusted ORs were 2.01 (95% CI 0.77–5.26) for Soyangin individuals and 3.96 (95% CI 1.48–10.60) for Tae-eumin individuals. These results show that constitution has a significant and independent association with diabetes, which suggests that constitution is an independent risk factor for diabetes that should be considered when attempting to detect and prevent the disease. PMID:19745018

  12. Early exposure to the combined measles-mumps-rubella vaccine and thimerosal-containing vaccines and risk of autism spectrum disorder.

    PubMed

    Uno, Yota; Uchiyama, Tokio; Kurosawa, Michiko; Aleksic, Branko; Ozaki, Norio

    2015-05-15

    This case-control study investigated the relationship between the risk of Autism Spectrum Disorder (ASD) onset, and early exposure to the combined Measles-Mumps-Rubella (MMR) vaccine and thimerosal consumption measured from vaccinations in the highly genetically homogenous Japanese population. Vaccination histories at 1, 3, 6, 12, 18, 24, and 36 months from birth were investigated in ASD cases (189 samples), and controls (224 samples) matching age and sex in each case. Crude odds ratios (ORs) and 95% confidence intervals (95% CIs) were calculated to determine relationship between MMR vaccination and ASD. The differences in mean values of the thimerosal dosage between cases and controls were analyzed using an unpaired t-test. MMR vaccination and thimerosal dosage were also investigated using a conditional multiple-regression model. There were no significant differences in MMR vaccination and thimerosal dosage between cases and controls at any age. Furthermore, the ORs (95% CIs) of MMR vaccination and thimerosal dosage associated with ASD in the conditional multiple regression model were, respectively, 0.875 (0.345-2.222) and 1.205 (0.862-1.683) at age 18 months, 0.724 (0.421-1.243) and 1.343 (0.997-1.808) at 24 months, and 1.040 (0.648-1.668) and 0.844 (0.632-1.128) at 36 months. Thus, there were no significant differences. No convincing evidence was found in this study that MMR vaccination and increasing thimerosal dose were associated with an increased risk of ASD onset. Copyright © 2014 Elsevier Ltd. All rights reserved.

  13. Does the Aristotle Score predict outcome in congenital heart surgery?

    PubMed

    Kang, Nicholas; Tsang, Victor T; Elliott, Martin J; de Leval, Marc R; Cole, Timothy J

    2006-06-01

    The Aristotle Score has been proposed as a measure of 'complexity' in congenital heart surgery, and a tool for comparing performance amongst different centres. To date, however, it remains unvalidated. We examined whether the Basic Aristotle Score was a useful predictor of mortality following open-heart surgery, and compared it to the Risk Adjustment in Congenital Heart Surgery (RACHS-1) system. We also examined the ability of the Aristotle Score to measure performance. The Basic Aristotle Score and RACHS-1 risk categories were assigned retrospectively to 1085 operations involving cardiopulmonary bypass in children less than 18 years of age. Multiple logistic regression analysis was used to determine the significance of the Aristotle Score and RACHS-1 category as independent predictors of in-hospital mortality. Operative performance was calculated using the Aristotle equation: performance = complexity x survival. Multiple logistic regression identified RACHS-1 category to be a powerful predictor of mortality (Wald 17.7, p < 0.0001), whereas Aristotle Score was only weakly associated with mortality (Wald 4.8, p = 0.03). Age at operation and bypass time were also highly significant predictors of postoperative death (Wald 13.7 and 33.8, respectively, p < 0.0001 for both). Operative performance was measured at 7.52 units. The Basic Aristotle Score was only weakly associated with postoperative mortality in this series. Operative performance appeared to be inflated by the fact that the overall complexity of cases was relatively high in this series. An alternative equation (performance = complexity/mortality) is proposed as a fairer and more logical method of risk-adjustment.

  14. The relationship between vitronectin and hepatic insulin resistance in type 2 diabetes mellitus.

    PubMed

    Cao, Yan; Li, Xinyu; Lu, Chong; Zhan, Xiaorong

    2018-05-18

    The World Health Organization (WHO) estimates that approximately 300 million people will suffer from diabetes mellitus by 2025. Type 2 diabetes mellitus (T2DM) is much more prevalent. T2DM comprises approximately 90% of diabetes mellitus cases, and it is caused by a combination of insulin resistance and inadequate compensatory insulin secretory response. In this study, we aimed to compare the plasma vitronectin (VN) levels between patients with T2DM and insulin resistance (IR) and healthy controls. Seventy patients with IR and 70 age- and body mass index (BMI)-matched healthy controls were included in the study. The insulin, Waist-to-Hip Ratio (WHR), C-peptide (CP) and VN levels of all participants were examined. The homeostasis model of assessment for insulin resistence index (HOMA-IR (CP)) formula was used to calculate insulin resistance. The levels of BMI, fasting plasma gluose (FPG), 2-hour postprandial glucose (2hPG), glycated hemoglobins (HbA1c), and HOMA-IR (CP) were significantly elevated in case group compared with controls. VN was found to be significantly decreased in case group. (VN Mean (Std): 8.55 (2.92) versus 12.88 (1.26) ng/mL p < 0.001). Multiple linear regression analysis was performed. This model explained 43.42% of the total variability of VN. Multiple linear regression analysis showed that HOMA-IR (CP) and age independently predicted VN levels. The VN may be a candidate target for the appraisal of hepatic insulin resistance in patients with T2DM.

  15. Normal reference values for bladder wall thickness on CT in a healthy population.

    PubMed

    Fananapazir, Ghaneh; Kitich, Aleksandar; Lamba, Ramit; Stewart, Susan L; Corwin, Michael T

    2018-02-01

    To determine normal bladder wall thickness on CT in patients without bladder disease. Four hundred and nineteen patients presenting for trauma with normal CTs of the abdomen and pelvis were included in our retrospective study. Bladder wall thickness was assessed, and bladder volume was measured using both the ellipsoid formula and an automated technique. Patient age, gender, and body mass index were recorded. Linear regression models were created to account for bladder volume, age, gender, and body mass index, and the multiple correlation coefficient with bladder wall thickness was computed. Bladder volume and bladder wall thickness were log-transformed to achieve approximate normality and homogeneity of variance. Variables that did not contribute substantively to the model were excluded, and a parsimonious model was created and the multiple correlation coefficient was calculated. Expected bladder wall thickness was estimated for different bladder volumes, and 1.96 standard deviation above expected provided the upper limit of normal on the log scale. Age, gender, and bladder volume were associated with bladder wall thickness (p = 0.049, 0.024, and < 0.001, respectively). The linear regression model had an R 2 of 0.52. Age and gender were negligible in contribution to the model, and a parsimonious model using only volume was created for both the ellipsoid and automated volumes (R 2  = 0.52 and 0.51, respectively). Bladder wall thickness correlates with bladder wall volume. The study provides reference bladder wall thicknesses on CT utilizing both the ellipsoid formula and automated bladder volumes.

  16. Pulmonary hypertension and computed tomography measurement of small pulmonary vessels in severe emphysema.

    PubMed

    Matsuoka, Shin; Washko, George R; Yamashiro, Tsuneo; Estepar, Raul San Jose; Diaz, Alejandro; Silverman, Edwin K; Hoffman, Eric; Fessler, Henry E; Criner, Gerard J; Marchetti, Nathaniel; Scharf, Steven M; Martinez, Fernando J; Reilly, John J; Hatabu, Hiroto

    2010-02-01

    Vascular alteration of small pulmonary vessels is one of the characteristic features of pulmonary hypertension in chronic obstructive pulmonary disease. The in vivo relationship between pulmonary hypertension and morphological alteration of the small pulmonary vessels has not been assessed in patients with severe emphysema. We evaluated the correlation of total cross-sectional area of small pulmonary vessels (CSA) assessed on computed tomography (CT) scans with the degree of pulmonary hypertension estimated by right heart catheterization. In 79 patients with severe emphysema enrolled in the National Emphysema Treatment Trial (NETT), we measured CSA less than 5 mm(2) (CSA(<5)) and 5 to 10 mm(2) (CSA(5-10)), and calculated the percentage of total CSA for the lung area (%CSA(<5) and %CSA(5-10), respectively). The correlations of %CSA(<5) and %CSA(5-10) with pulmonary arterial mean pressure (Ppa) obtained by right heart catheterization were evaluated. Multiple linear regression analysis using Ppa as the dependent outcome was also performed. The %CSA(<5) had a significant negative correlation with Ppa (r = -0.512, P < 0.0001), whereas the correlation between %CSA(5-10) and Ppa did not reach statistical significance (r = -0.196, P = 0.083). Multiple linear regression analysis showed that %CSA(<5) and diffusing capacity of carbon monoxide (DL(CO)) % predicted were independent predictors of Ppa (r(2) = 0.541): %CSA (<5) (P < 0.0001), and DL(CO) % predicted (P = 0.022). The %CSA(<5) measured on CT images is significantly correlated to Ppa in severe emphysema and can estimate the degree of pulmonary hypertension.

  17. Usefulness of the Trabecular Bone Score for assessing the risk of osteoporotic fracture.

    PubMed

    Redondo, L; Puigoriol, E; Rodríguez, J R; Peris, P; Kanterewicz, E

    2018-04-01

    The trabecular bone score (TBS) is an imaging technique that assesses the condition of the trabecular microarchitecture. Preliminary results suggest that TBS, along with the bone mineral density assessment, could improve the calculation of the osteoporotic fracture risk. The aim of this study was to analyse TBS values and their relationship with the clinical characteristics, bone mineral density and history of fractures of a cohort of posmenopausal women. We analysed 2,257 posmenopausal women from the FRODOS cohort, which was created to determine the risk factors for osteoporotic fracture through a clinical survey and bone densitometry with vertebral morphometry. TBS was applied to the densitometry images. TBS values ≤1230 were considered indicative of degraded microarchitecture. We performed a simple and multiple linear regression to determine the factors associated with this index. The mean TBS value in L1-L4 was 1.203±0.121. Some 55.3% of the women showed values indicating degraded microarchitecture. In the multiple linear regression analysis, the factors associated with low TBS values were age, weight, height, spinal T-score, glucocorticoid treatment, presence of type 2 diabetes and a history of fractures due to frailty. TBS showed microarchitecture degradation values in the participants of the FRODOS cohort and was associated with anthropometric factors, low bone mineral density values, the presence of fractures, a history of type 2 diabetes mellitus and the use of glucocorticoids. Copyright © 2017 Elsevier España, S.L.U. and Sociedad Española de Medicina Interna (SEMI). All rights reserved.

  18. Attitude and practice of physical activity and social problem-solving ability among university students.

    PubMed

    Sone, Toshimasa; Kawachi, Yousuke; Abe, Chihiro; Otomo, Yuki; Sung, Yul-Wan; Ogawa, Seiji

    2017-04-04

    Effective social problem-solving abilities can contribute to decreased risk of poor mental health. In addition, physical activity has a favorable effect on mental health. These previous studies suggest that physical activity and social problem-solving ability can interact by helping to sustain mental health. The present study aimed to determine the association between attitude and practice of physical activity and social problem-solving ability among university students. Information on physical activity and social problem-solving was collected using a self-administered questionnaire. We analyzed data from 185 students who participated in the questionnaire surveys and psychological tests. Social problem-solving as measured by the Social Problem-Solving Inventory-Revised (SPSI-R) (median score 10.85) was the dependent variable. Multiple logistic regression analysis was employed to calculate the odds ratios (ORs) and 95% confidence intervals (CIs) for higher SPSI-R according to physical activity categories. The multiple logistic regression analysis indicated that the ORs (95% CI) in reference to participants who said they never considered exercising were 2.08 (0.69-6.93), 1.62 (0.55-5.26), 2.78 (0.86-9.77), and 6.23 (1.81-23.97) for participants who did not exercise but intended to start, tried to exercise but did not, exercised but not regularly, and exercised regularly, respectively. This finding suggested that positive linear association between physical activity and social problem-solving ability (p value for linear trend < 0.01). The present findings suggest that regular physical activity or intention to start physical activity may be an effective strategy to improve social problem-solving ability.

  19. Statin Intake Is Associated With Decreased Insulin Sensitivity During Cardiac Surgery

    PubMed Central

    Sato, Hiroaki; Carvalho, George; Sato, Tamaki; Hatzakorzian, Roupen; Lattermann, Ralph; Codere-Maruyama, Takumi; Matsukawa, Takashi; Schricker, Thomas

    2012-01-01

    OBJECTIVE Surgical trauma impairs intraoperative insulin sensitivity and is associated with postoperative adverse events. Recently, preprocedural statin therapy is recommended for patients with coronary artery disease. However, statin therapy is reported to increase insulin resistance and the risk of new-onset diabetes. Thus, we investigated the association between preoperative statin therapy and intraoperative insulin sensitivity in nondiabetic, dyslipidemic patients undergoing coronary artery bypass grafting. RESEARCH DESIGN AND METHODS In this prospective, nonrandomized trial, patients taking lipophilic statins were assigned to the statin group and hypercholesterolemic patients not receiving any statins were allocated to the control group. Insulin sensitivity was assessed by the hyperinsulinemic-normoglycemic clamp technique during surgery. The mean, SD of blood glucose, and the coefficient of variation (CV) after surgery were calculated for each patient. The association between statin use and intraoperative insulin sensitivity was tested by multiple regression analysis. RESULTS We studied 120 patients. In both groups, insulin sensitivity gradually decreased during surgery with values being on average ∼20% lower in the statin than in the control group. In the statin group, the mean blood glucose in the intensive care unit was higher than in the control group (153 ± 20 vs. 140 ± 20 mg/dL; P < 0.001). The oscillation of blood glucose was larger in the statin group (SD, P < 0.001; CV, P = 0.001). Multiple regression analysis showed that statin use was independently associated with intraoperative insulin sensitivity (β = −0.16; P = 0.03). CONCLUSIONS Preoperative use of lipophilic statins is associated with increased insulin resistance during cardiac surgery in nondiabetic, dyslipidemic patients. PMID:22829524

  20. Basal-bolus insulin therapy reduces maternal triglycerides in gestational diabetes without modifying cholesteryl ester transfer protein activity.

    PubMed

    Olmos, Pablo R; Borzone, Gisella R

    2017-09-01

    Macrosomia in the offspring of overweight/obese mothers with glucose-controlled gestational diabetes mellitus (GDM) is due to excessive rise of maternal triglycerides (TG). We aimed to ascertain whether basal-bolus insulin therapy (BBIT), or other components of the treatment, could reduce TG in GDM. We studied the records of 131 singleton pregnancies with GDM, using stepwise multiple linear regression, Mann-Whitney, χ 2 , and Jonckheere-Terpstra tests. As maternal TG increased steadily during normal pregnancy, these were transformed as z-scores. The atherogenic index of plasma (AIP) was calculated as a measure of cholesteryl ester transfer protein activity. Multiple regression showed that only BBIT (but neither limitation of weight gain nor metformin) reduced maternal TG z-scores (P = 0.011). When the 131 pregnancies were split into two groups - without BBIT (n = 58; HbA1c = 5.3 ± 0.3%) and with BBIT (n = 73; HbA1c = 5.4 ± 0.6; P = 0.2005) - we observed that BBIT (n = 73) reduced maternal TG z-scores in a dose-related fashion (Jonckheere-Terpstra P = 0.03817). The atherogenic index of plasma remained within normal range in both groups. BBIT (but not weight gain control nor metformin) reduced maternal TG in mothers with glucose-controlled GDM. This beneficial effect of BBIT was not related to changes in the cholesteryl ester transfer protein activity. © 2017 Japan Society of Obstetrics and Gynecology.

  1. Climate change but not unemployment explains the changing suicidality in Thessaloniki Greece (2000-2012).

    PubMed

    Fountoulakis, Konstantinos N; Savopoulos, Christos; Zannis, Prodromos; Apostolopoulou, Martha; Fountoukidis, Ilias; Kakaletsis, Nikolaos; Kanellos, Ilias; Dimellis, Dimos; Hyphantis, Thomas; Tsikerdekis, Athanasios; Pompili, Maurizio; Hatzitolios, Apostolos I

    2016-03-15

    Recently there was a debate concerning the etiology behind attempts and completed suicides. The aim of the current study was to search for possible correlations between the rates of attempted and completed suicide and climate variables and regional unemployment per year in the county of Thessaloniki, Macedonia, northern Greece, for the years 2000-12. The regional rates of suicide and attempted suicide as well as regional unemployment were available from previous publications of the authors. The climate variables were calculated from the daily E-OBS gridded dataset which is based on observational data Only the male suicide rates correlate significantly with high mean annual temperature but not with unemployment. The multiple linear regression analysis results suggest that temperature is the only variable that determines male suicides and explains 51% of their variance. Unemployment fails to contribute significantly to the model. There seems to be a seasonal distribution for attempts with mean rates being higher for the period from May to October and the rates clearly correlate with temperature. The highest mean rates were observed during May and August and the lowest during December and February. Multiple linear regression analysis suggests that temperature also determines the female attempts rate although the explained variable is significant but very low (3-5%) Climate variables and specifically high temperature correlate both with suicide and attempted suicide rates but with a different way between males and females. The climate effect was stronger than the effect of unemployment. Copyright © 2016 Elsevier B.V. All rights reserved.

  2. The bitterness intensity of clarithromycin evaluated by a taste sensor.

    PubMed

    Tanigake, Atsu; Miyanaga, Yohko; Nakamura, Tomoko; Tsuji, Eriko; Matsuyama, Kenji; Kunitomo, Masaru; Uchida, Takahiro

    2003-11-01

    The purpose of this study was to evaluate the ability of a quantitative prediction method using a taste sensor to determine the bitterness of clarithromycin powder suspensions of various concentrations and of a commercial clarithromycin dry syrup product (Clarith dry syrup, Taisho Pharmaceutical Co., Ltd., Tokyo) containing aminoalkyl methacrylate polymer as a taste-masker. The bitterness of the clarithromycin dry syrup product dissolved in various beverages was also evaluated in gustatory sensation tests and using the taste sensor. In the sensor measurements, three variables were used to predict bitterness in single and multiple regression analysis: relative sensor output (R), the change of membrane potential caused by adsorption (CPA), and CPA/R ratio. The CPA values for channel 3 of the sensor predicted well the bitterness of clarithromycin powder suspensions and their filtered solutions. For Clarith dry syrup, the sensor output was small, suggesting that aminoalkyl methacrylate polymer was successful in almost complete masking of the bitter taste of the dry syrup product. When the bitterness intensities of mixtures of 1 g of Clarith dry syrup with 25 ml of water, coffee, tea, green tea, cocoa, milk, and a sports drink were examined, a good correlation was obtained between the results from human taste tests and the predicted values calculated on the basis of multiple regression analysis using CPA data from channel 4, and the CPA/R ratio from channel 3 of the taste sensor (r(2)=0.963, p<0.005). Co-administration of 1 g of Clarith dry syrup with an acidic sports drink was found to be the most bitter using either method.

  3. Automated grading system for evaluation of ocular redness associated with dry eye

    PubMed Central

    Rodriguez, John D; Johnston, Patrick R; Ousler, George W; Smith, Lisa M; Abelson, Mark B

    2013-01-01

    Background We have observed that dry eye redness is characterized by a prominence of fine horizontal conjunctival vessels in the exposed ocular surface of the interpalpebral fissure, and have incorporated this feature into the grading of redness in clinical studies of dry eye. Aim To develop an automated method of grading dry eye-associated ocular redness in order to expand on the clinical grading system currently used. Methods Ninety nine images from 26 dry eye subjects were evaluated by five graders using a 0–4 (in 0.5 increments) dry eye redness (Ora Calibra™ Dry Eye Redness Scale [OCDER]) scale. For the automated method, the Opencv computer vision library was used to develop software for calculating redness and horizontal conjunctival vessels (noted as “horizontality”). From original photograph, the region of interest (ROI) was selected manually using the open source ImageJ software. Total average redness intensity (Com-Red) was calculated as a single channel 8-bit image as R – 0.83G – 0.17B, where R, G and B were the respective intensities of the red, green and blue channels. The location of vessels was detected by normalizing the blue channel and selecting pixels with an intensity of less than 97% of the mean. The horizontal component (Com-Hor) was calculated by the first order Sobel derivative in the vertical direction and the score was calculated as the average blue channel image intensity of this vertical derivative. Pearson correlation coefficients, accuracy and concordance correlation coefficients (CCC) were calculated after regression and standardized regression of the dataset. Results The agreement (both Pearson’s and CCC) among investigators using the OCDER scale was 0.67, while the agreement of investigator to computer was 0.76. A multiple regression using both redness and horizontality improved the agreement CCC from 0.66 and 0.69 to 0.76, demonstrating the contribution of vessel geometry to the overall grade. Computer analysis of a given image has 100% repeatability and zero variability from session to session. Conclusion This objective means of grading ocular redness in a unified fashion has potential significance as a new clinical endpoint. In comparisons between computer and investigator, computer grading proved to be more reliable than another investigator using the OCDER scale. The best fitting model based on the present sample, and usable for future studies, was C4=−12.24+2.12C2HOR+0.88C2RED:C4 is the predicted investigator grade, and C2HOR and C2RED are logarithmic transformations of the computer calculated parameters COM-Hor and COM-Red. Considering the superior repeatability, computer automated grading might be preferable to investigator grading in multicentered dry eye studies in which the subtle differences in redness incurred by treatment have been historically difficult to define. PMID:23814457

  4. A regression technique for evaluation and quantification for water quality parameters from remote sensing data

    NASA Technical Reports Server (NTRS)

    Whitlock, C. H.; Kuo, C. Y.

    1979-01-01

    The objective of this paper is to define optical physics and/or environmental conditions under which the linear multiple-regression should be applicable. An investigation of the signal-response equations is conducted and the concept is tested by application to actual remote sensing data from a laboratory experiment performed under controlled conditions. Investigation of the signal-response equations shows that the exact solution for a number of optical physics conditions is of the same form as a linearized multiple-regression equation, even if nonlinear contributions from surface reflections, atmospheric constituents, or other water pollutants are included. Limitations on achieving this type of solution are defined.

  5. Calculation of Haem Iron Intake and Its Role in the Development of Iron Deficiency in Young Women from the Australian Longitudinal Study on Women’s Health

    PubMed Central

    Reeves, Angela J.; McEvoy, Mark A.; MacDonald-Wicks, Lesley K.; Barker, Daniel; Attia, John; Hodge, Allison M.; Patterson, Amanda J.

    2017-01-01

    Total iron intake is not strongly associated with iron stores, but haem iron intake may be more predictive. Haem iron is not available in most nutrient databases, so experimentally determined haem contents were applied to an Australian Food Frequency Questionnaire (FFQ) to estimate haem iron intake in a representative sample of young women (25–30 years). The association between dietary haem iron intakes and incident self-reported diagnosed iron deficiency over six years of follow-up was examined. Haem iron contents for Australian red meats, fish, and poultry were applied to haem-containing foods in the Dietary Questionnaire for Epidemiological Studies V2 (DQESv2) FFQ. Haem iron intakes were calculated for 9076 women from the Australian Longitudinal Study on Women’s Health (ALSWH) using the DQESv2 dietary data from 2003. Logistic regression was used to examine the association between haem iron intake (2003) and the incidence of iron deficiency in 2006 and 2009. Multiple logistic regression showed baseline haem iron intake was a statistically significant predictor of iron deficiency in 2006 (Odds Ratio (OR): 0.91; 95% Confidence Interval (CI): 0.84–0.99; p-value: 0.020) and 2009 (OR: 0.89; 95% CI: 0.82–0.99; p-value: 0.007). Using the energy-adjusted haem intake made little difference to the associations. Higher haem iron intake is associated with reduced odds of iron deficiency developing in young adult Australian women. PMID:28534830

  6. Burnout, stress and satisfaction among Australian and New Zealand radiation oncology trainees.

    PubMed

    Leung, John; Rioseco, Pilar

    2017-02-01

    To evaluate the incidence of burnout among radiation oncology trainees in Australia and New Zealand and the stress and satisfaction factors related to burnout. A survey of trainees was conducted in mid-2015. There were 42 Likert scale questions on stress, 14 Likert scale questions on satisfaction and the Maslach Burnout Inventory-Human Services Survey assessed burnout. A principal component analysis identified specific stress and satisfaction areas. Categorical variables for the stress and satisfaction factors were computed. Associations between respondent's characteristics and stress and satisfaction subscales were examined by independent sample t-tests and analysis of variance. Effect sizes were calculated using Cohens's d when significant mean differences were observed. This was also done for respondent characteristics and the three burnout subscales. Multiple regression analyses were performed. The response rate was 81.5%. The principal component analysis for stress identified five areas: demands on time, professional development/training, delivery demands, interpersonal demands and administration/organizational issues. There were no significant differences by demographic group or area of interest after P-values were adjusted for the multiple tests conducted. The principal component analysis revealed two satisfaction areas: resources/professional activities and value/delivery of services. There were no significant differences by demographic characteristics or area of interest in the level of satisfaction after P-values were adjusted for the multiple tests conducted. The burnout results revealed 49.5% of respondents scored highly in emotional exhaustion and/or depersonalization and 13.1% had burnout in all three measures. Multiple regression analysis revealed the stress subscales 'demands on time' and 'interpersonal demands' were associated with emotional exhaustion. 'Interpersonal demands' was also associated with depersonalization and correlated negatively with personal accomplishment. The satisfaction of value/delivery of services subscale was associated with higher levels of personal accomplishment. There is a significant level of burnout among radiation oncology trainees in Australia and New Zealand. Further work addressing intervention would be appropriate to reduce levels of burnout. © 2016 The Authors. Journal of Medical Imaging and Radiation Oncology published by John Wiley & Sons Australia, Ltd on behalf of The Royal Australian and New Zealand College of Radiologists.

  7. Application of back propagation artificial neural network on genetic variants in adiponectin ADIPOQ, peroxisome proliferator-activated receptor-γ, and retinoid X receptor-α genes and type 2 diabetes risk in a Chinese Han population.

    PubMed

    Shi, Hui; Lu, Ying; Du, Juan; Du, Wencong; Ye, Xinhua; Yu, Xiaofang; Ma, Jianhua; Cheng, Jinluo; Gao, Yanqin; Cao, Yuanyuan; Zhou, Ling; Li, Qian

    2012-03-01

    Our study was designed to explore the applied characteristics of the back propagation artificial neural network (BPANN) on studying the genetic variants in adipnectin ADIPOQ, peroxisome proliferator-activated receptor (PPAR)-γ, and retinoid X receptor-α (RXR-α) genes and type 2 diabetes mellitus (T2DM) risks in a Chinese Han population. We used BPANN as the fitting model based on data gathered from T2DM patients (n=913) and normal controls (n=1,001). The mean impact value (MIV) for each input variables were calculated, and the sequence of the factors according to their absolute MIVs was sorted. The results from BPANN were compared with multiple logistic regression analysis, and the generalized multifactor dimensionality reduction (GMDR) method was used to calculate the joint effects of ADIPOQ, PPAR-γ, and RXR-α genes. By BPANN analysis, the sequence according to the importance of the T2DM risk factors was in the order of serum adiponectin level, rs3856806, rs7649121, hypertension, rs3821799, rs17827276, rs12495941, rs4240711, age, rs16861194, waist circumference, rs2241767, rs2920502, rs1063539, alcohol drinking, smoking, hyperlipoproteinemia, gender, rs3132291, T2DM family history, rs4842194, rs822394, rs1801282, rs1045570, rs16861205, rs6537944, body mass index, rs266729, and rs1801282. However, compared with multiple logistic regression analysis, only 11 factors were statistically significant. After overweight and obesity were taken as environment adjustment factors into the analysis, model A2 B4 C5 C6 C8 (rs3856806, rs4240711, rs7649121, rs3821799, rs12495941) was the best model (coefficient of variation consistency=10/10, P=0.0107) in the GMDR method. These results suggested the interactions of ADIPOQ, PPAR-γ, and RXR-α genes might play a role in susceptibility to T2DM. BPANN could be used to analyze the risk factors of diseases and provide more complicated relationships between inputs and outputs.

  8. The Association between Charlson Comorbidity Index and the Medical Care Cost of Cancer: A Retrospective Study.

    PubMed

    Yoon, Seok-Jun; Kim, Eun-Jung; Seo, Hyun-Ju; Oh, In-Hwan

    2015-01-01

    This study compared comorbidity-related medical care cost associated with different types of cancer, by examining breast (N = 287), colon (N = 272), stomach (N = 614), and lung (N = 391) cancer patients undergoing surgery. Using medical benefits claims data, we calculated Charlson Comorbidity Index (CCI) and total medical cost. The effect of comorbidity on the medical care cost was investigated using multiple regression and logistic regression models and controlling for demographic characteristics and cancer stage. The treatment costs incurred by stomach and colon cancer patients were 1.05- and 1.01-fold higher, respectively, in patients with higher CCI determined. For breast cancer, the highest costs were seen in those with chronic obstructive pulmonary disease (COPD), but the increase in cost reduced as CCI increased. Colon cancer patients with diabetes mellitus and a CCI = 1 score had the highest medical costs. The lowest medical costs were incurred by lung cancer patients with COPD and a CCI = 2 score. The comorbidities had a major impact on the use of medical resources, with chronic comorbidities incurring the highest medical costs. The results indicate that comorbidities affect cancer outcomes and that they must be considered strategies mitigating cancer's economic and social impact.

  9. Internal exposure levels of typical POPs and their associations with childhood asthma in Shanghai, China.

    PubMed

    Meng, Ge; Feng, Yan; Nie, Zhiqing; Wu, Xiaomeng; Wei, Hongying; Wu, Shaowei; Yin, Yong; Wang, Yan

    2016-04-01

    Polybrominated diphenyl ethers (PBDEs), polychlorinated biphenyls (PCBs) and organochlorine pesticides (OCPs) are common persistent organic pollutants (POPs) that may be associated with childhood asthma. The concentrations of PBDEs, PCBs and OCPs were analyzed in pooled serum samples from both asthmatic and non-asthmatic children. The differences in the internal exposure levels between the case and control groups were tested (p value <0.0012). The associations between the internal exposure concentrations of the POPs and childhood asthma were estimated based on the odds ratios (ORs) calculated using logistic regression models. There were significant differences in three PBDEs, 26 PCBs and seven OCPs between the two groups, with significantly higher levels in the cases. The multiple logistic regression models demonstrated that the internal exposure concentrations of a number of the POPs (23 PCBs, p,p'-DDE and α-HCH) were positively associated with childhood asthma. Some synergistic effects were observed when the children were co-exposed to the chemicals. BDE-209 was positively associated with asthma aggravation. This study indicates the potential relationships between the internal exposure concentrations of particular POPs and the development of childhood asthma. Copyright © 2015 Elsevier Inc. All rights reserved.

  10. Combined Prediction Model of Death Toll for Road Traffic Accidents Based on Independent and Dependent Variables

    PubMed Central

    Zhong-xiang, Feng; Shi-sheng, Lu; Wei-hua, Zhang; Nan-nan, Zhang

    2014-01-01

    In order to build a combined model which can meet the variation rule of death toll data for road traffic accidents and can reflect the influence of multiple factors on traffic accidents and improve prediction accuracy for accidents, the Verhulst model was built based on the number of death tolls for road traffic accidents in China from 2002 to 2011; and car ownership, population, GDP, highway freight volume, highway passenger transportation volume, and highway mileage were chosen as the factors to build the death toll multivariate linear regression model. Then the two models were combined to be a combined prediction model which has weight coefficient. Shapley value method was applied to calculate the weight coefficient by assessing contributions. Finally, the combined model was used to recalculate the number of death tolls from 2002 to 2011, and the combined model was compared with the Verhulst and multivariate linear regression models. The results showed that the new model could not only characterize the death toll data characteristics but also quantify the degree of influence to the death toll by each influencing factor and had high accuracy as well as strong practicability. PMID:25610454

  11. Combined prediction model of death toll for road traffic accidents based on independent and dependent variables.

    PubMed

    Feng, Zhong-xiang; Lu, Shi-sheng; Zhang, Wei-hua; Zhang, Nan-nan

    2014-01-01

    In order to build a combined model which can meet the variation rule of death toll data for road traffic accidents and can reflect the influence of multiple factors on traffic accidents and improve prediction accuracy for accidents, the Verhulst model was built based on the number of death tolls for road traffic accidents in China from 2002 to 2011; and car ownership, population, GDP, highway freight volume, highway passenger transportation volume, and highway mileage were chosen as the factors to build the death toll multivariate linear regression model. Then the two models were combined to be a combined prediction model which has weight coefficient. Shapley value method was applied to calculate the weight coefficient by assessing contributions. Finally, the combined model was used to recalculate the number of death tolls from 2002 to 2011, and the combined model was compared with the Verhulst and multivariate linear regression models. The results showed that the new model could not only characterize the death toll data characteristics but also quantify the degree of influence to the death toll by each influencing factor and had high accuracy as well as strong practicability.

  12. Laboratory Headphone Studies of Human Response to Low-Amplitude Sonic Booms and Rattle Heard Indoors

    NASA Technical Reports Server (NTRS)

    Loubeau, Alexandra; Sullivan, Brenda M.; Klos, Jacob; Rathsam, Jonathan; Gavin, Joseph R.

    2013-01-01

    Human response to sonic booms heard indoors is affected by the generation of contact-induced rattle noise. The annoyance caused by sonic boom-induced rattle noise was studied in a series of psychoacoustics tests. Stimuli were divided into three categories and presented in three different studies: isolated rattles at the same calculated Perceived Level (PL), sonic booms combined with rattles with the mixed sound at a single PL, and sonic booms combined with rattles with the mixed sound at three different PL. Subjects listened to sounds over headphones and were asked to report their annoyance. Annoyance to different rattles was shown to vary significantly according to rattle object size. In addition, the combination of low-amplitude sonic booms and rattles can be more annoying than the sonic boom alone. Correlations and regression analyses for the combined sonic boom and rattle sounds identified the Moore and Glasberg Stationary Loudness (MGSL) metric as a primary predictor of annoyance for the tested sounds. Multiple linear regression models were developed to describe annoyance to the tested sounds, and simplifications for applicability to a wider range of sounds are presented.

  13. Physical Comorbidities in Depression Co-Occurring with Anxiety: A Cross Sectional Study in the Czech Primary Care System

    PubMed Central

    Winkler, Petr; Horáček, Jiří; Weissová, Aneta; Šustr, Martin; Brunovský, Martin

    2015-01-01

    Comorbidities associated with depression have been researched in a number of contexts. However, the epidemiological situation in clinical practice is understudied, especially in the post-Communist Central and Eastern Europe region. The aim of this study was to assess physical comorbidities in depression, and to identify whether there are increased odds of physical comorbidities associated with co-occurring depressive and anxiety disorders. Data on 4264 patients aged 18–98 were collected among medical doctors in the Czech Republic between 2010 and 2011. Descriptive statistics were calculated and multiple logistic regressions were performed to assess comorbidities among patients with depressive disorder. There were 51.29% of those who have a physical comorbidity, and 45.5% of those who have a comorbid anxiety disorders among patients treated with depression in Czech primary care. Results of logistic regressions show that odds of having pain, hypertension or diabetes mellitus are particularly elevated at those who have co-occurring depressive and anxiety disorder. Our findings demonstrate that comorbidities associated with depressive disorders are highly prevalent in primary health care practice, and that physical comorbidities are particularly frequent among those with co-occurring depressive and anxiety disorders. PMID:26690458

  14. Estimation of magnitude and frequency of floods for streams in Puerto Rico : new empirical models

    USGS Publications Warehouse

    Ramos-Gines, Orlando

    1999-01-01

    Flood-peak discharges and frequencies are presented for 57 gaged sites in Puerto Rico for recurrence intervals ranging from 2 to 500 years. The log-Pearson Type III distribution, the methodology recommended by the United States Interagency Committee on Water Data, was used to determine the magnitude and frequency of floods at the gaged sites having 10 to 43 years of record. A technique is presented for estimating flood-peak discharges at recurrence intervals ranging from 2 to 500 years for unregulated streams in Puerto Rico with contributing drainage areas ranging from 0.83 to 208 square miles. Loglinear multiple regression analyses, using climatic and basin characteristics and peak-discharge data from the 57 gaged sites, were used to construct regression equations to transfer the magnitude and frequency information from gaged to ungaged sites. The equations have contributing drainage area, depth-to-rock, and mean annual rainfall as the basin and climatic characteristics in estimating flood peak discharges. Examples are given to show a step-by-step procedure in calculating a 100-year flood at a gaged site, an ungaged site, a site near a gaged location, and a site between two gaged sites.

  15. Principal component regression analysis with SPSS.

    PubMed

    Liu, R X; Kuang, J; Gong, Q; Hou, X L

    2003-06-01

    The paper introduces all indices of multicollinearity diagnoses, the basic principle of principal component regression and determination of 'best' equation method. The paper uses an example to describe how to do principal component regression analysis with SPSS 10.0: including all calculating processes of the principal component regression and all operations of linear regression, factor analysis, descriptives, compute variable and bivariate correlations procedures in SPSS 10.0. The principal component regression analysis can be used to overcome disturbance of the multicollinearity. The simplified, speeded up and accurate statistical effect is reached through the principal component regression analysis with SPSS.

  16. Experimental and computational prediction of glass transition temperature of drugs.

    PubMed

    Alzghoul, Ahmad; Alhalaweh, Amjad; Mahlin, Denny; Bergström, Christel A S

    2014-12-22

    Glass transition temperature (Tg) is an important inherent property of an amorphous solid material which is usually determined experimentally. In this study, the relation between Tg and melting temperature (Tm) was evaluated using a data set of 71 structurally diverse druglike compounds. Further, in silico models for prediction of Tg were developed based on calculated molecular descriptors and linear (multilinear regression, partial least-squares, principal component regression) and nonlinear (neural network, support vector regression) modeling techniques. The models based on Tm predicted Tg with an RMSE of 19.5 K for the test set. Among the five computational models developed herein the support vector regression gave the best result with RMSE of 18.7 K for the test set using only four chemical descriptors. Hence, two different models that predict Tg of drug-like molecules with high accuracy were developed. If Tm is available, a simple linear regression can be used to predict Tg. However, the results also suggest that support vector regression and calculated molecular descriptors can predict Tg with equal accuracy, already before compound synthesis.

  17. Regression analysis for solving diagnosis problem of children's health

    NASA Astrophysics Data System (ADS)

    Cherkashina, Yu A.; Gerget, O. M.

    2016-04-01

    The paper includes results of scientific researches. These researches are devoted to the application of statistical techniques, namely, regression analysis, to assess the health status of children in the neonatal period based on medical data (hemostatic parameters, parameters of blood tests, the gestational age, vascular-endothelial growth factor) measured at 3-5 days of children's life. In this paper a detailed description of the studied medical data is given. A binary logistic regression procedure is discussed in the paper. Basic results of the research are presented. A classification table of predicted values and factual observed values is shown, the overall percentage of correct recognition is determined. Regression equation coefficients are calculated, the general regression equation is written based on them. Based on the results of logistic regression, ROC analysis was performed, sensitivity and specificity of the model are calculated and ROC curves are constructed. These mathematical techniques allow carrying out diagnostics of health of children providing a high quality of recognition. The results make a significant contribution to the development of evidence-based medicine and have a high practical importance in the professional activity of the author.

  18. RRegrs: an R package for computer-aided model selection with multiple regression models.

    PubMed

    Tsiliki, Georgia; Munteanu, Cristian R; Seoane, Jose A; Fernandez-Lozano, Carlos; Sarimveis, Haralambos; Willighagen, Egon L

    2015-01-01

    Predictive regression models can be created with many different modelling approaches. Choices need to be made for data set splitting, cross-validation methods, specific regression parameters and best model criteria, as they all affect the accuracy and efficiency of the produced predictive models, and therefore, raising model reproducibility and comparison issues. Cheminformatics and bioinformatics are extensively using predictive modelling and exhibit a need for standardization of these methodologies in order to assist model selection and speed up the process of predictive model development. A tool accessible to all users, irrespectively of their statistical knowledge, would be valuable if it tests several simple and complex regression models and validation schemes, produce unified reports, and offer the option to be integrated into more extensive studies. Additionally, such methodology should be implemented as a free programming package, in order to be continuously adapted and redistributed by others. We propose an integrated framework for creating multiple regression models, called RRegrs. The tool offers the option of ten simple and complex regression methods combined with repeated 10-fold and leave-one-out cross-validation. Methods include Multiple Linear regression, Generalized Linear Model with Stepwise Feature Selection, Partial Least Squares regression, Lasso regression, and Support Vector Machines Recursive Feature Elimination. The new framework is an automated fully validated procedure which produces standardized reports to quickly oversee the impact of choices in modelling algorithms and assess the model and cross-validation results. The methodology was implemented as an open source R package, available at https://www.github.com/enanomapper/RRegrs, by reusing and extending on the caret package. The universality of the new methodology is demonstrated using five standard data sets from different scientific fields. Its efficiency in cheminformatics and QSAR modelling is shown with three use cases: proteomics data for surface-modified gold nanoparticles, nano-metal oxides descriptor data, and molecular descriptors for acute aquatic toxicity data. The results show that for all data sets RRegrs reports models with equal or better performance for both training and test sets than those reported in the original publications. Its good performance as well as its adaptability in terms of parameter optimization could make RRegrs a popular framework to assist the initial exploration of predictive models, and with that, the design of more comprehensive in silico screening applications.Graphical abstractRRegrs is a computer-aided model selection framework for R multiple regression models; this is a fully validated procedure with application to QSAR modelling.

  19. Multiple Regression Redshift Calibration for Clusters of Galaxies

    NASA Astrophysics Data System (ADS)

    Kalinkov, M.; Kuneva, I.; Valtchanov, I.

    A new procedure for calibration of distances to ACO (Abell et al.1989) clusters of galaxies has been developed. In the previous version of the Reference Catalog of ACO Clusters of Galaxies (Kalinkov & Kuneva 1992) an attempt has been made to compare various calibration schemes. For the Version 93 we have made some refinements. Many improvements from the early days of the photometric calibration have been made --- from Rowan-Robinson (1972), Corwin (1974), Kalinkov & Kuneva (1975), Mills Hoskins (1977) to more complicated --- Leir & van den Bergh (1977), Postman et al.(1985), Kalinkov Kuneva (1985, 1986, 1990), Scaramella et al.(1991), Zucca et al. (1993). It was shown that it is impossible to use the same calibration relation for northern (A) and southern (ACO) clusters of galaxies. Therefore the calibration have to be made separately for both catalogs. Moreover it is better if one could find relations for the 274 A-clusters, studied by the authors of ACO. We use the luminosity distance for H0=100km/s/Mpc and q0 = 0.5 and we have 1200 clusters with measured redshifts. The first step is to fit log(z) on m10 (magnitude of the tenth rank galaxy) for A-clusters and on m1, m3 and m10 for ACO clusters. The second step is to take into account the K-correction and the Scott effect (Postman et al.1985) with iterative process. To avoid the initial errors of the redshift estimates in A- and ACO catalogs we adopt Hubble's law for the apparent radial distribution of galaxies in clusters. This enable us to calculate a new cluster richness from preliminary redshift estimate. This is the third step. Further continues the study of the correlation matrix between log(z) and prospective predictors --- new richness groups, BM, RS and A types, radio and X-ray fluxes, apparent separations between the first three brightest galaxies, mean population (gal/sq.deg), Multiple linear as well as nonlinear regression estimators are found. Many clusters that deviate by more than 2.5 sigmas are rejected. Each case is examined for observational errors, substructuring, foreground and background. Some of the clusters are doubtful --- most probably they have to be excluded from the catalogs. The multiple regressions allow us to estimate redshift in the range 0.02 to 0.2 with an error of 7 percent.

  20. Biostatistics Series Module 10: Brief Overview of Multivariate Methods.

    PubMed

    Hazra, Avijit; Gogtay, Nithya

    2017-01-01

    Multivariate analysis refers to statistical techniques that simultaneously look at three or more variables in relation to the subjects under investigation with the aim of identifying or clarifying the relationships between them. These techniques have been broadly classified as dependence techniques, which explore the relationship between one or more dependent variables and their independent predictors, and interdependence techniques, that make no such distinction but treat all variables equally in a search for underlying relationships. Multiple linear regression models a situation where a single numerical dependent variable is to be predicted from multiple numerical independent variables. Logistic regression is used when the outcome variable is dichotomous in nature. The log-linear technique models count type of data and can be used to analyze cross-tabulations where more than two variables are included. Analysis of covariance is an extension of analysis of variance (ANOVA), in which an additional independent variable of interest, the covariate, is brought into the analysis. It tries to examine whether a difference persists after "controlling" for the effect of the covariate that can impact the numerical dependent variable of interest. Multivariate analysis of variance (MANOVA) is a multivariate extension of ANOVA used when multiple numerical dependent variables have to be incorporated in the analysis. Interdependence techniques are more commonly applied to psychometrics, social sciences and market research. Exploratory factor analysis and principal component analysis are related techniques that seek to extract from a larger number of metric variables, a smaller number of composite factors or components, which are linearly related to the original variables. Cluster analysis aims to identify, in a large number of cases, relatively homogeneous groups called clusters, without prior information about the groups. The calculation intensive nature of multivariate analysis has so far precluded most researchers from using these techniques routinely. The situation is now changing with wider availability, and increasing sophistication of statistical software and researchers should no longer shy away from exploring the applications of multivariate methods to real-life data sets.

  1. The relationship between health-related fitness and quality of life in postmenopausal women from Southern Taiwan

    PubMed Central

    Hsu, Wei-Hsiu; Chen, Chi-lung; Kuo, Liang Tseng; Fan, Chun-Hao; Lee, Mel S; Hsu, Robert Wen-Wei

    2014-01-01

    Background Health-related fitness has been reported to be associated with improved quality of life (QoL) in the elderly. Health-related fitness is comprised of several dimensions that could be enhanced by specific training regimens. It has remained unclear how various dimensions of health-related fitness interact with QoL in postmenopausal women. Objective The purpose of the current study was to investigate the relationship between the dimensions of health-related fitness and QoL in elderly women. Methods A cohort of 408 postmenopausal women in a rural area of Taiwan was prospectively collected. Dimensions of health-related fitness, consisting of muscular strength, balance, cardiorespiratory endurance, flexibility, muscle endurance, and agility, were assessed. QoL was determined using the Short Form Health Survey (SF-36). Differences between age groups (stratified by decades) were calculated using a one-way analysis of variance (ANOVA) and multiple comparisons using a Scheffé test. A Spearman’s correlation analysis was performed to examine differences between QoL and each dimension of fitness. Multiple linear regression with forced-entry procedure was performed to evaluate the effects of health-related fitness. A P-value of <0.05 was considered statistically significant. Results Age-related decreases in health-related fitness were shown for sit-ups, back strength, grip strength, side steps, trunk extension, and agility (P<0.05). An age-related decrease in QoL, specifically in physical functioning, role limitation due to physical problems, and physical component score, was also demonstrated (P<0.05). Multiple linear regression analyses demonstrated that back strength significantly contributed to the physical component of QoL (adjusted beta of 0.268 [P<0.05]). Conclusion Back strength was positively correlated with the physical component of QoL among the examined dimensions of health-related fitness. Health-related fitness, as well as the physical component of QoL, declined with increasing age. PMID:25258526

  2. DOE Office of Scientific and Technical Information (OSTI.GOV)

    Boutilier, Justin J., E-mail: j.boutilier@mail.utoronto.ca; Lee, Taewoo; Craig, Tim

    Purpose: To develop and evaluate the clinical applicability of advanced machine learning models that simultaneously predict multiple optimization objective function weights from patient geometry for intensity-modulated radiation therapy of prostate cancer. Methods: A previously developed inverse optimization method was applied retrospectively to determine optimal objective function weights for 315 treated patients. The authors used an overlap volume ratio (OV) of bladder and rectum for different PTV expansions and overlap volume histogram slopes (OVSR and OVSB for the rectum and bladder, respectively) as explanatory variables that quantify patient geometry. Using the optimal weights as ground truth, the authors trained and appliedmore » three prediction models: logistic regression (LR), multinomial logistic regression (MLR), and weighted K-nearest neighbor (KNN). The population average of the optimal objective function weights was also calculated. Results: The OV at 0.4 cm and OVSR at 0.1 cm features were found to be the most predictive of the weights. The authors observed comparable performance (i.e., no statistically significant difference) between LR, MLR, and KNN methodologies, with LR appearing to perform the best. All three machine learning models outperformed the population average by a statistically significant amount over a range of clinical metrics including bladder/rectum V53Gy, bladder/rectum V70Gy, and dose to the bladder, rectum, CTV, and PTV. When comparing the weights directly, the LR model predicted bladder and rectum weights that had, on average, a 73% and 74% relative improvement over the population average weights, respectively. The treatment plans resulting from the LR weights had, on average, a rectum V70Gy that was 35% closer to the clinical plan and a bladder V70Gy that was 29% closer, compared to the population average weights. Similar results were observed for all other clinical metrics. Conclusions: The authors demonstrated that the KNN and MLR weight prediction methodologies perform comparably to the LR model and can produce clinical quality treatment plans by simultaneously predicting multiple weights that capture trade-offs associated with sparing multiple OARs.« less

  3. Analytical framework for reconstructing heterogeneous environmental variables from mammal community structure.

    PubMed

    Louys, Julien; Meloro, Carlo; Elton, Sarah; Ditchfield, Peter; Bishop, Laura C

    2015-01-01

    We test the performance of two models that use mammalian communities to reconstruct multivariate palaeoenvironments. While both models exploit the correlation between mammal communities (defined in terms of functional groups) and arboreal heterogeneity, the first uses a multiple multivariate regression of community structure and arboreal heterogeneity, while the second uses a linear regression of the principal components of each ecospace. The success of these methods means the palaeoenvironment of a particular locality can be reconstructed in terms of the proportions of heavy, moderate, light, and absent tree canopy cover. The linear regression is less biased, and more precisely and accurately reconstructs heavy tree canopy cover than the multiple multivariate model. However, the multiple multivariate model performs better than the linear regression for all other canopy cover categories. Both models consistently perform better than randomly generated reconstructions. We apply both models to the palaeocommunity of the Upper Laetolil Beds, Tanzania. Our reconstructions indicate that there was very little heavy tree cover at this site (likely less than 10%), with the palaeo-landscape instead comprising a mixture of light and absent tree cover. These reconstructions help resolve the previous conflicting palaeoecological reconstructions made for this site. Copyright © 2014 Elsevier Ltd. All rights reserved.

  4. Building a new predictor for multiple linear regression technique-based corrective maintenance turnaround time.

    PubMed

    Cruz, Antonio M; Barr, Cameron; Puñales-Pozo, Elsa

    2008-01-01

    This research's main goals were to build a predictor for a turnaround time (TAT) indicator for estimating its values and use a numerical clustering technique for finding possible causes of undesirable TAT values. The following stages were used: domain understanding, data characterisation and sample reduction and insight characterisation. Building the TAT indicator multiple linear regression predictor and clustering techniques were used for improving corrective maintenance task efficiency in a clinical engineering department (CED). The indicator being studied was turnaround time (TAT). Multiple linear regression was used for building a predictive TAT value model. The variables contributing to such model were clinical engineering department response time (CE(rt), 0.415 positive coefficient), stock service response time (Stock(rt), 0.734 positive coefficient), priority level (0.21 positive coefficient) and service time (0.06 positive coefficient). The regression process showed heavy reliance on Stock(rt), CE(rt) and priority, in that order. Clustering techniques revealed the main causes of high TAT values. This examination has provided a means for analysing current technical service quality and effectiveness. In doing so, it has demonstrated a process for identifying areas and methods of improvement and a model against which to analyse these methods' effectiveness.

  5. Effects of land cover, topography, and built structure on seasonal water quality at multiple spatial scales.

    PubMed

    Pratt, Bethany; Chang, Heejun

    2012-03-30

    The relationship among land cover, topography, built structure and stream water quality in the Portland Metro region of Oregon and Clark County, Washington areas, USA, is analyzed using ordinary least squares (OLS) and geographically weighted (GWR) multiple regression models. Two scales of analysis, a sectional watershed and a buffer, offered a local and a global investigation of the sources of stream pollutants. Model accuracy, measured by R(2) values, fluctuated according to the scale, season, and regression method used. While most wet season water quality parameters are associated with urban land covers, most dry season water quality parameters are related topographic features such as elevation and slope. GWR models, which take into consideration local relations of spatial autocorrelation, had stronger results than OLS regression models. In the multiple regression models, sectioned watershed results were consistently better than the sectioned buffer results, except for dry season pH and stream temperature parameters. This suggests that while riparian land cover does have an effect on water quality, a wider contributing area needs to be included in order to account for distant sources of pollutants. Copyright © 2012 Elsevier B.V. All rights reserved.

  6. Comparison of Nine Statistical Model Based Warfarin Pharmacogenetic Dosing Algorithms Using the Racially Diverse International Warfarin Pharmacogenetic Consortium Cohort Database

    PubMed Central

    Liu, Rong; Li, Xi; Zhang, Wei; Zhou, Hong-Hao

    2015-01-01

    Objective Multiple linear regression (MLR) and machine learning techniques in pharmacogenetic algorithm-based warfarin dosing have been reported. However, performances of these algorithms in racially diverse group have never been objectively evaluated and compared. In this literature-based study, we compared the performances of eight machine learning techniques with those of MLR in a large, racially-diverse cohort. Methods MLR, artificial neural network (ANN), regression tree (RT), multivariate adaptive regression splines (MARS), boosted regression tree (BRT), support vector regression (SVR), random forest regression (RFR), lasso regression (LAR) and Bayesian additive regression trees (BART) were applied in warfarin dose algorithms in a cohort from the International Warfarin Pharmacogenetics Consortium database. Covariates obtained by stepwise regression from 80% of randomly selected patients were used to develop algorithms. To compare the performances of these algorithms, the mean percentage of patients whose predicted dose fell within 20% of the actual dose (mean percentage within 20%) and the mean absolute error (MAE) were calculated in the remaining 20% of patients. The performances of these techniques in different races, as well as the dose ranges of therapeutic warfarin were compared. Robust results were obtained after 100 rounds of resampling. Results BART, MARS and SVR were statistically indistinguishable and significantly out performed all the other approaches in the whole cohort (MAE: 8.84–8.96 mg/week, mean percentage within 20%: 45.88%–46.35%). In the White population, MARS and BART showed higher mean percentage within 20% and lower mean MAE than those of MLR (all p values < 0.05). In the Asian population, SVR, BART, MARS and LAR performed the same as MLR. MLR and LAR optimally performed among the Black population. When patients were grouped in terms of warfarin dose range, all machine learning techniques except ANN and LAR showed significantly higher mean percentage within 20%, and lower MAE (all p values < 0.05) than MLR in the low- and high- dose ranges. Conclusion Overall, machine learning-based techniques, BART, MARS and SVR performed superior than MLR in warfarin pharmacogenetic dosing. Differences of algorithms’ performances exist among the races. Moreover, machine learning-based algorithms tended to perform better in the low- and high- dose ranges than MLR. PMID:26305568

  7. [Studies of marker screening efficiency and corresponding influencing factors in QTL composite interval mapping].

    PubMed

    Gao, Yong-Ming; Wan, Ping

    2002-06-01

    Screening markers efficiently is the foundation of mapping QTLs by composite interval mapping. Main and interaction markers distinguished, besides using background control for genetic variation, could also be used to construct intervals of two-way searching for mapping QTLs with epistasis, which can save a lot of calculation time. Therefore, the efficiency of marker screening would affect power and precision of QTL mapping. A doubled haploid population with 200 individuals and 5 chromosomes was constructed, with 50 markers evenly distributed at 10 cM space. Among a total of 6 QTLs, one was placed on chromosome I, two linked on chromosome II, and the other three linked on chromosome IV. QTL setting included additive effects and epistatic effects of additive x additive, the corresponding QTL interaction effects were set if data were collected under multiple environments. The heritability was assumed to be 0.5 if no special declaration. The power of marker screening by stepwise regression, forward regression, and three methods for random effect prediction, e.g. best linear unbiased prediction (BLUP), linear unbiased prediction (LUP) and adjusted unbiased prediction (AUP), was studied and compared through 100 Monte Carlo simulations. The results indicated that the marker screening power by stepwise regression at 0.1, 0.05 and 0.01 significant level changed from 2% to 68%, the power changed from 2% to 72% by forward regression. The larger the QTL effects, the higher the marker screening power. While the power of marker screening by three random effect prediction was very low, the maximum was only 13%. That suggested that regression methods were much better than those by using the approaches of random effect prediction to identify efficient markers flanking QTLs, and forward selection method was more simple and efficient. The results of simulation study on heritability showed that heightening of both general heritability and interaction heritability of genotype x environments could enhance marker screening power, the former had a greater influence on QTLs with larger main and/or epistatic effects, while the later on QTLs with small main and/or epistatic effects. The simulation of 100 times was also conducted to study the influence of different marker number and density on marker screening power. It is indicated that the marker screening power would decrease if there were too many markers, especially with high density in a mapping population, which suggested that a mapping population with definite individuals could only hold limited markers. According to the simulation study, the reasonable number of markers should not be more than individuals. The simulation study of marker screening under multiple environments showed high total power of marker screening. In order to relieve the problem that marker screening power restricted the efficiency of QTL mapping, markers identified in multiple environments could be used to construct two search intervals.

  8. Development of a Multiple Linear Regression Model to Forecast Facility Electrical Consumption at an Air Force Base.

    DTIC Science & Technology

    1981-09-01

    corresponds to the same square footage that consumed the electrical energy. 3. The basic assumptions of multiple linear regres- sion, as enumerated in...7. Data related to the sample of bases is assumed to be representative of bases in the population. Limitations Basic limitations on this research were... Ratemaking --Overview. Rand Report R-5894, Santa Monica CA, May 1977. Chatterjee, Samprit, and Bertram Price. Regression Analysis by Example. New York: John

  9. Multiple-trait structured antedependence model to study the relationship between litter size and birth weight in pigs and rabbits.

    PubMed

    David, Ingrid; Garreau, Hervé; Balmisse, Elodie; Billon, Yvon; Canario, Laurianne

    2017-01-20

    Some genetic studies need to take into account correlations between traits that are repeatedly measured over time. Multiple-trait random regression models are commonly used to analyze repeated traits but suffer from several major drawbacks. In the present study, we developed a multiple-trait extension of the structured antedependence model (SAD) to overcome this issue and validated its usefulness by modeling the association between litter size (LS) and average birth weight (ABW) over parities in pigs and rabbits. The single-trait SAD model assumes that a random effect at time [Formula: see text] can be explained by the previous values of the random effect (i.e. at previous times). The proposed multiple-trait extension of the SAD model consists in adding a cross-antedependence parameter to the single-trait SAD model. This model can be easily fitted using ASReml and the OWN Fortran program that we have developed. In comparison with the random regression model, we used our multiple-trait SAD model to analyze the LS and ABW of 4345 litters from 1817 Large White sows and 8706 litters from 2286 L-1777 does over a maximum of five successive parities. For both species, the multiple-trait SAD fitted the data better than the random regression model. The difference between AIC of the two models (AIC_random regression-AIC_SAD) were equal to 7 and 227 for pigs and rabbits, respectively. A similar pattern of heritability and correlation estimates was obtained for both species. Heritabilities were lower for LS (ranging from 0.09 to 0.29) than for ABW (ranging from 0.23 to 0.39). The general trend was a decrease of the genetic correlation for a given trait between more distant parities. Estimates of genetic correlations between LS and ABW were negative and ranged from -0.03 to -0.52 across parities. No correlation was observed between the permanent environmental effects, except between the permanent environmental effects of LS and ABW of the same parity, for which the estimate of the correlation was strongly negative (ranging from -0.57 to -0.67). We demonstrated that application of our multiple-trait SAD model is feasible for studying several traits with repeated measurements and showed that it provided a better fit to the data than the random regression model.

  10. Quantitative Structure Retention Relationships of Polychlorinated Dibenzodioxins and Dibenzofurans

    DTIC Science & Technology

    1991-08-01

    be a projection onto the X-Y plane. The algorithm for this calculation can be found in Stouch and Jurs (22), but was further refined by Rohrbaugh and...throughspace distances. WPSA2 (c) Weighted positive charged surface area. MOMH2 (c) Second major moment of inertia with hydrogens attached. CSTR 3 (d) Sum...of the models. The robust regression analysis method calculates a regression model using a least median squares algorithm which is not as susceptible

  11. Use of Multiple Imputation to Estimate the Proportion of Respiratory Virus Detections Among Patients Hospitalized With Community-Acquired Pneumonia

    PubMed Central

    Bozio, Catherine H; Flanders, W Dana; Finelli, Lyn; Bramley, Anna M; Reed, Carrie; Gandhi, Neel R; Vidal, Jorge E; Erdman, Dean; Levine, Min Z; Lindstrom, Stephen; Ampofo, Krow; Arnold, Sandra R; Self, Wesley H; Williams, Derek J; Grijalva, Carlos G; Anderson, Evan J; McCullers, Jonathan A; Edwards, Kathryn M; Pavia, Andrew T; Wunderink, Richard G; Jain, Seema

    2018-01-01

    Abstract Background Real-time polymerase chain reaction (PCR) on respiratory specimens and serology on paired blood specimens are used to determine the etiology of respiratory illnesses for research studies. However, convalescent serology is often not collected. We used multiple imputation to assign values for missing serology results to estimate virus-specific prevalence among pediatric and adult community-acquired pneumonia hospitalizations using data from an active population-based surveillance study. Methods Presence of adenoviruses, human metapneumovirus, influenza viruses, parainfluenza virus types 1–3, and respiratory syncytial virus was defined by positive PCR on nasopharyngeal/oropharyngeal specimens or a 4-fold rise in paired serology. We performed multiple imputation by developing a multivariable regression model for each virus using data from patients with available serology results. We calculated absolute and relative differences in the proportion of each virus detected comparing the imputed to observed (nonimputed) results. Results Among 2222 children and 2259 adults, 98.8% and 99.5% had nasopharyngeal/oropharyngeal specimens and 43.2% and 37.5% had paired serum specimens, respectively. Imputed results increased viral etiology assignments by an absolute difference of 1.6%–4.4% and 0.8%–2.8% in children and adults, respectively; relative differences were 1.1–3.0 times higher. Conclusions Multiple imputation can be used when serology results are missing, to refine virus-specific prevalence estimates, and these will likely increase estimates.

  12. Morphological characteristics associated with rupture risk of multiple intracranial aneurysms.

    PubMed

    Wang, Guang-Xian; Liu, Lan-Lan; Wen, Li; Cao, Yun-Xing; Pei, Yu-Chun; Zhang, Dong

    2017-10-01

    To identify the morphological parameters that are related to intracranial aneurysms (IAs) rupture using a case-control model. A total of 107 patients with multiple IAs and aneurysmal subarachnoid hemorrhage between August 2011 and February 2017 were enrolled in this study. Characteristics of IAs location, shape, neck width, perpendicular height, depth, maximum size, flow angle, parent vessel diameter (PVD), aspect ratio (AR) and size ratio (SR) were evaluated using CT angiography. Multiple logistic regression analysis was used to identify the independent risk factors associated with IAs rupture. Receiver operating characteristic curve analysis was performed on the final model, and the optimal thresholds were obtained. IAs located in the internal carotid artery (ICA) was associated with a negative risk of rupture, whereas AR, SR1 (height/PVD) and SR2 (depth/PVD) were associated with increased risk of rupture. When SR was calculated differently, the odds ratio values of these factors were also different. The receiver operating characteristic curve showed that AR, SR1 and SR2 had cut-off values of 1.01, 1.48 and 1.40, respectively. SR3 (maximum size/PVD) was not associated with IAs rupture. IAs located in the ICA are associated with a negative risk of rupture, while high AR (>1.01), SR1 (>1.48) or SR2 (>1.40) are risk factors for multiple IAs rupture. Copyright © 2017 Hainan Medical University. Production and hosting by Elsevier B.V. All rights reserved.

  13. 5 CFR 591.219 - How does OPM compute shelter price indexes?

    Code of Federal Regulations, 2014 CFR

    2014-01-01

    ... estimates in hedonic regressions (a type of multiple regression) to compute for each COLA survey area the price index for rental and/or rental equivalent units of comparable quality and size between the COLA...

  14. 5 CFR 591.219 - How does OPM compute shelter price indexes?

    Code of Federal Regulations, 2011 CFR

    2011-01-01

    ... estimates in hedonic regressions (a type of multiple regression) to compute for each COLA survey area the price index for rental and/or rental equivalent units of comparable quality and size between the COLA...

  15. 5 CFR 591.219 - How does OPM compute shelter price indexes?

    Code of Federal Regulations, 2013 CFR

    2013-01-01

    ... estimates in hedonic regressions (a type of multiple regression) to compute for each COLA survey area the price index for rental and/or rental equivalent units of comparable quality and size between the COLA...

  16. 5 CFR 591.219 - How does OPM compute shelter price indexes?

    Code of Federal Regulations, 2012 CFR

    2012-01-01

    ... estimates in hedonic regressions (a type of multiple regression) to compute for each COLA survey area the price index for rental and/or rental equivalent units of comparable quality and size between the COLA...

  17. Toward customer-centric organizational science: A common language effect size indicator for multiple linear regressions and regressions with higher-order terms.

    PubMed

    Krasikova, Dina V; Le, Huy; Bachura, Eric

    2018-06-01

    To address a long-standing concern regarding a gap between organizational science and practice, scholars called for more intuitive and meaningful ways of communicating research results to users of academic research. In this article, we develop a common language effect size index (CLβ) that can help translate research results to practice. We demonstrate how CLβ can be computed and used to interpret the effects of continuous and categorical predictors in multiple linear regression models. We also elaborate on how the proposed CLβ index is computed and used to interpret interactions and nonlinear effects in regression models. In addition, we test the robustness of the proposed index to violations of normality and provide means for computing standard errors and constructing confidence intervals around its estimates. (PsycINFO Database Record (c) 2018 APA, all rights reserved).

  18. Modeling brook trout presence and absence from landscape variables using four different analytical methods

    USGS Publications Warehouse

    Steen, Paul J.; Passino-Reader, Dora R.; Wiley, Michael J.

    2006-01-01

    As a part of the Great Lakes Regional Aquatic Gap Analysis Project, we evaluated methodologies for modeling associations between fish species and habitat characteristics at a landscape scale. To do this, we created brook trout Salvelinus fontinalis presence and absence models based on four different techniques: multiple linear regression, logistic regression, neural networks, and classification trees. The models were tested in two ways: by application to an independent validation database and cross-validation using the training data, and by visual comparison of statewide distribution maps with historically recorded occurrences from the Michigan Fish Atlas. Although differences in the accuracy of our models were slight, the logistic regression model predicted with the least error, followed by multiple regression, then classification trees, then the neural networks. These models will provide natural resource managers a way to identify habitats requiring protection for the conservation of fish species.

  19. Schistosomiasis Breeding Environment Situation Analysis in Dongting Lake Area

    NASA Astrophysics Data System (ADS)

    Li, Chuanrong; Jia, Yuanyuan; Ma, Lingling; Liu, Zhaoyan; Qian, Yonggang

    2013-01-01

    Monitoring environmental characteristics, such as vegetation, soil moisture et al., of Oncomelania hupensis (O. hupensis)’ spatial/temporal distribution is of vital importance to the schistosomiasis prevention and control. In this study, the relationship between environmental factors derived from remotely sensed data and the density of O. hupensis was analyzed by a multiple linear regression model. Secondly, spatial analysis of the regression residual was investigated by the semi-variogram method. Thirdly, spatial analysis of the regression residual and the multiple linear regression model were both employed to estimate the spatial variation of O. hupensis density. Finally, the approach was used to monitor and predict the spatial and temporal variations of oncomelania of Dongting Lake region, China. And the areas of potential O. hupensis habitats were predicted and the influence of Three Gorges Dam (TGB)project on the density of O. hupensis was analyzed.

  20. Toward a new payment system for inpatient rehabilitation. Part II: Reimbursing providers.

    PubMed

    Saitto, Carlo; Marino, Claudia; Fusco, Danilo; Arcà, Massimo; Perucci, Carlo A

    2005-09-01

    The major fault with existing reimbursement systems lies in their failure to discriminate for the effectiveness of stay, both when paying per day and when paying per episode of treatment. We sought to define an average length of effective stay and recovery trends by impairment category, to design a prospective payment system that takes into account costs and expected recovery trends, and to compare the calculated reimbursement with the predicted costs estimated in a previous study (Saitto C, Marino C, Fusco D, et al. A new prospective payment system for inpatient rehabilitation. Part I: predicting resource consumption. Med Care. 2005;43:844-855). We considered all rehabilitation admissions from 5 Italian inpatient facilities during a 12-month period for which total cost of care had already been estimated and daily cost predicted through regression model. We ascertained recovery trends by impairment category through repeated MDS-PAC schedules and factorial analysis of functional status. We defined effective stay and daily resource consumption by impairment category and used these parameters to calculate reimbursement for the admission. We compared our reimbursement with predicted cost through regression analysis and evaluated the goodness of fit through residual analysis. We calculated reimbursement for 2079 admissions. The r(2) values for the reimbursement to cost correlation ranged from 0.54 in the whole population to 0.56 for "multiple trauma" to 0.85 for "other medical disorders." The best fit was found in the central quintiles of the cost and severity distributions. For each impairment category, we determined the number of days of effective hospital stay and the trends of functional gain. We demonstrated, at least within the Italian health care system, the feasibility of a reimbursement system that matches costs with functional recovery. By linking reimbursement to effective stay adjusted for trends of functional gain, we suggest it is possible to avoid both needless cuts and extensions of hospital admissions.

  1. Marsh collapse thresholds for coastal Louisiana estimated using elevation and vegetation index data

    USGS Publications Warehouse

    Couvillion, Brady R.; Beck, Holly

    2013-01-01

    Forecasting marsh collapse in coastal Louisiana as a result of changes in sea-level rise, subsidence, and accretion deficits necessitates an understanding of thresholds beyond which inundation stress impedes marsh survival. The variability in thresholds at which different marsh types cease to occur (i.e., marsh collapse) is not well understood. We utilized remotely sensed imagery, field data, and elevation data to help gain insight into the relationships between vegetation health and inundation. A Normalized Difference Vegetation Index (NDVI) dataset was calculated using remotely sensed data at peak biomass (August) and used as a proxy for vegetation health and productivity. Statistics were calculated for NDVI values by marsh type for intermediate, brackish, and saline marsh in coastal Louisiana. Marsh-type specific NDVI values of 1.5 and 2 standard deviations below the mean were used as upper and lower limits to identify conditions indicative of collapse. As marshes seldom occur beyond these values, they are believed to represent a range within which marsh collapse is likely to occur. Inundation depth was selected as the primary candidate for evaluation of marsh collapse thresholds. Elevation relative to mean water level (MWL) was calculated by subtracting MWL from an elevation dataset compiled from multiple data types including light detection and ranging (lidar) and bathymetry. A polynomial cubic regression was used to examine a random subset of pixels to determine the relationship between elevation (relative to MWL) and NDVI. The marsh collapse uncertainty range values were found by locating the intercept of the regression line with the 1.5 and 2 standard deviations below the mean NDVI value for each marsh type. Results indicate marsh collapse uncertainty ranges of 30.7–35.8 cm below MWL for intermediate marsh, 20–25.6 cm below MWL for brackish marsh, and 16.9–23.5 cm below MWL for saline marsh. These values are thought to represent the ranges of inundation depths within which marsh collapse is probable.

  2. Modification of the USLE K factor for soil erodibility assessment on calcareous soils in Iran

    NASA Astrophysics Data System (ADS)

    Ostovari, Yaser; Ghorbani-Dashtaki, Shoja; Bahrami, Hossein-Ali; Naderi, Mehdi; Dematte, Jose Alexandre M.; Kerry, Ruth

    2016-11-01

    The measurement of soil erodibility (K) in the field is tedious, time-consuming and expensive; therefore, its prediction through pedotransfer functions (PTFs) could be far less costly and time-consuming. The aim of this study was to develop new PTFs to estimate the K factor using multiple linear regression, Mamdani fuzzy inference systems, and artificial neural networks. For this purpose, K was measured in 40 erosion plots with natural rainfall. Various soil properties including the soil particle size distribution, calcium carbonate equivalent, organic matter, permeability, and wet-aggregate stability were measured. The results showed that the mean measured K was 0.014 t h MJ- 1 mm- 1 and 2.08 times less than the estimated mean K (0.030 t h MJ- 1 mm- 1) using the USLE model. Permeability, wet-aggregate stability, very fine sand, and calcium carbonate were selected as independent variables by forward stepwise regression in order to assess the ability of multiple linear regression, Mamdani fuzzy inference systems and artificial neural networks to predict K. The calcium carbonate equivalent, which is not accounted for in the USLE model, had a significant impact on K in multiple linear regression due to its strong influence on the stability of aggregates and soil permeability. Statistical indices in validation and calibration datasets determined that the artificial neural networks method with the highest R2, lowest RMSE, and lowest ME was the best model for estimating the K factor. A strong correlation (R2 = 0.81, n = 40, p < 0.05) between the estimated K from multiple linear regression and measured K indicates that the use of calcium carbonate equivalent as a predictor variable gives a better estimation of K in areas with calcareous soils.

  3. Analysis and prediction of flow from local source in a river basin using a Neuro-fuzzy modeling tool.

    PubMed

    Aqil, Muhammad; Kita, Ichiro; Yano, Akira; Nishiyama, Soichi

    2007-10-01

    Traditionally, the multiple linear regression technique has been one of the most widely used models in simulating hydrological time series. However, when the nonlinear phenomenon is significant, the multiple linear will fail to develop an appropriate predictive model. Recently, neuro-fuzzy systems have gained much popularity for calibrating the nonlinear relationships. This study evaluated the potential of a neuro-fuzzy system as an alternative to the traditional statistical regression technique for the purpose of predicting flow from a local source in a river basin. The effectiveness of the proposed identification technique was demonstrated through a simulation study of the river flow time series of the Citarum River in Indonesia. Furthermore, in order to provide the uncertainty associated with the estimation of river flow, a Monte Carlo simulation was performed. As a comparison, a multiple linear regression analysis that was being used by the Citarum River Authority was also examined using various statistical indices. The simulation results using 95% confidence intervals indicated that the neuro-fuzzy model consistently underestimated the magnitude of high flow while the low and medium flow magnitudes were estimated closer to the observed data. The comparison of the prediction accuracy of the neuro-fuzzy and linear regression methods indicated that the neuro-fuzzy approach was more accurate in predicting river flow dynamics. The neuro-fuzzy model was able to improve the root mean square error (RMSE) and mean absolute percentage error (MAPE) values of the multiple linear regression forecasts by about 13.52% and 10.73%, respectively. Considering its simplicity and efficiency, the neuro-fuzzy model is recommended as an alternative tool for modeling of flow dynamics in the study area.

  4. Soil Cd, Cr, Cu, Ni, Pb and Zn sorption and retention models using SVM: Variable selection and competitive model.

    PubMed

    González Costa, J J; Reigosa, M J; Matías, J M; Covelo, E F

    2017-09-01

    The aim of this study was to model the sorption and retention of Cd, Cu, Ni, Pb and Zn in soils. To that extent, the sorption and retention of these metals were studied and the soil characterization was performed separately. Multiple stepwise regression was used to produce multivariate models with linear techniques and with support vector machines, all of which included 15 explanatory variables characterizing soils. When the R-squared values are represented, two different groups are noticed. Cr, Cu and Pb sorption and retention show a higher R-squared; the most explanatory variables being humified organic matter, Al oxides and, in some cases, cation-exchange capacity (CEC). The other group of metals (Cd, Ni and Zn) shows a lower R-squared, and clays are the most explanatory variables, including a percentage of vermiculite and slime. In some cases, quartz, plagioclase or hematite percentages also show some explanatory capacity. Support Vector Machine (SVM) regression shows that the different models are not as regular as in multiple regression in terms of number of variables, the regression for nickel adsorption being the one with the highest number of variables in its optimal model. On the other hand, there are cases where the most explanatory variables are the same for two metals, as it happens with Cd and Cr adsorption. A similar adsorption mechanism is thus postulated. These patterns of the introduction of variables in the model allow us to create explainability sequences. Those which are the most similar to the selectivity sequences obtained by Covelo (2005) are Mn oxides in multiple regression and change capacity in SVM. Among all the variables, the only one that is explanatory for all the metals after applying the maximum parsimony principle is the percentage of sand in the retention process. In the competitive model arising from the aforementioned sequences, the most intense competitiveness for the adsorption and retention of different metals appears between Cr and Cd, Cu and Zn in multiple regression; and between Cr and Cd in SVM regression. Copyright © 2017 Elsevier B.V. All rights reserved.

  5. Deconstructing Calculation Methods, Part 3: Multiplication

    ERIC Educational Resources Information Center

    Thompson, Ian

    2008-01-01

    In this third of a series of four articles, the author deconstructs the primary national strategy's approach to written multiplication. The approach to multiplication, as set out on pages 12 to 15 of the primary national strategy's "Guidance paper" "Calculation" (DfES, 2007), is divided into six stages: (1) mental…

  6. Practical guidance for conducting mediation analysis with multiple mediators using inverse odds ratio weighting.

    PubMed

    Nguyen, Quynh C; Osypuk, Theresa L; Schmidt, Nicole M; Glymour, M Maria; Tchetgen Tchetgen, Eric J

    2015-03-01

    Despite the recent flourishing of mediation analysis techniques, many modern approaches are difficult to implement or applicable to only a restricted range of regression models. This report provides practical guidance for implementing a new technique utilizing inverse odds ratio weighting (IORW) to estimate natural direct and indirect effects for mediation analyses. IORW takes advantage of the odds ratio's invariance property and condenses information on the odds ratio for the relationship between the exposure (treatment) and multiple mediators, conditional on covariates, by regressing exposure on mediators and covariates. The inverse of the covariate-adjusted exposure-mediator odds ratio association is used to weight the primary analytical regression of the outcome on treatment. The treatment coefficient in such a weighted regression estimates the natural direct effect of treatment on the outcome, and indirect effects are identified by subtracting direct effects from total effects. Weighting renders treatment and mediators independent, thereby deactivating indirect pathways of the mediators. This new mediation technique accommodates multiple discrete or continuous mediators. IORW is easily implemented and is appropriate for any standard regression model, including quantile regression and survival analysis. An empirical example is given using data from the Moving to Opportunity (1994-2002) experiment, testing whether neighborhood context mediated the effects of a housing voucher program on obesity. Relevant Stata code (StataCorp LP, College Station, Texas) is provided. © The Author 2015. Published by Oxford University Press on behalf of the Johns Hopkins Bloomberg School of Public Health. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  7. Neural network and multiple linear regression to predict school children dimensions for ergonomic school furniture design.

    PubMed

    Agha, Salah R; Alnahhal, Mohammed J

    2012-11-01

    The current study investigates the possibility of obtaining the anthropometric dimensions, critical to school furniture design, without measuring all of them. The study first selects some anthropometric dimensions that are easy to measure. Two methods are then used to check if these easy-to-measure dimensions can predict the dimensions critical to the furniture design. These methods are multiple linear regression and neural networks. Each dimension that is deemed necessary to ergonomically design school furniture is expressed as a function of some other measured anthropometric dimensions. Results show that out of the five dimensions needed for chair design, four can be related to other dimensions that can be measured while children are standing. Therefore, the method suggested here would definitely save time and effort and avoid the difficulty of dealing with students while measuring these dimensions. In general, it was found that neural networks perform better than multiple linear regression in the current study. Copyright © 2012 Elsevier Ltd and The Ergonomics Society. All rights reserved.

  8. Relationship between body composition and vertical ground reaction forces in obese children when walking.

    PubMed

    Villarrasa-Sapiña, Israel; Serra-Añó, Pilar; Pardo-Ibáñez, Alberto; Gonzalez, Luis-Millán; García-Massó, Xavier

    2017-01-01

    Obesity is now a serious worldwide challenge, especially in children. This condition can cause a number of different health problems, including musculoskeletal disorders, some of which are due to mechanical stress caused by excess body weight. The aim of this study was to determine the association between body composition and the vertical ground reaction force produced during walking in obese children. Sixteen children participated in the study, six females and ten males [11.5 (1.2) years old, 69.8 (15.5) kg, 1.56 (0.09) m, and 28.36 (3.74) kg/m 2 of body mass index (BMI)]. Total weight, lean mass and fat mass were measured by dual-energy X-ray absorptiometry and vertical forces while walking were obtained by a force platform. The vertical force variables analysed were impact and propulsive forces, and the rate of development of both. Multiple regression models for each vertical force parameter were calculated using the body composition variables as input. The impact force regression model was found to be positively related to the weight of obese children and negatively related to lean mass. The regression model showed lean mass was positively related to the propulsive rate. Finally, regression models for impact and propulsive force showed a direct relationship with body weight. Impact force is positively related to the weight of obese children, but lean mass helps to reduce the impact force in this population. Exercise could help obese persons to reduce their total body weight and increase their lean mass, thus reducing impact forces during sports and other activities. Copyright © 2016 Elsevier Ltd. All rights reserved.

  9. Serial measurement of type-specific human papillomavirus load enables classification of cervical intraepithelial neoplasia lesions according to occurring human papillomavirus-induced pathway.

    PubMed

    Verhelst, Stefanie; Poppe, Willy A J; Bogers, Johannes J; Depuydt, Christophe E

    2017-03-01

    This retrospective study examined whether human papillomavirus (HPV) type-specific viral load changes measured in two or three serial cervical smears are predictive for the natural evolution of HPV infections and correlate with histological grades of cervical intraepithelial neoplasia (CIN), allowing triage of HPV-positive women. A cervical histology database was used to select consecutive women with biopsy-proven CIN in 2012 who had at least two liquid-based cytology samples before the diagnosis of CIN. Before performing cytology, 18 different quantitative PCRs allowed HPV type-specific viral load measurement. Changes in HPV-specific load between measurements were assessed by linear regression, with calculation of coefficient of determination (R) and slope. All infections could be classified into one of five categories: (i) clonal progressing process (R≥0.85; positive slope), (ii) simultaneously occurring clonal progressive and transient infection, (iii) clonal regressing process (R≥0.85; negative slope), (iv) serial transient infection with latency [R<0.85; slopes (two points) between 0.0010 and -0.0010 HPV copies/cell/day], and (v) transient productive infection (R<0.85; slope: ±0.0099 HPV copies/cell/day). Three hundred and seven women with CIN were included; 124 had single-type infections and 183 had multiple HPV types. Only with three consecutive measurements could a clonal process be identified in all CIN3 cases. We could clearly demonstrate clonal regressing lesions with a persistent linear decrease in viral load (R≥0.85; -0.003 HPV copies/cell/day) in all CIN categories. Type-specific viral load increase/decrease in three consecutive measurements enabled classification of CIN lesions in clonal HPV-driven transformation (progression/regression) and nonclonal virion-productive (serial transient/transient) processes.

  10. Gene set analysis using variance component tests.

    PubMed

    Huang, Yen-Tsung; Lin, Xihong

    2013-06-28

    Gene set analyses have become increasingly important in genomic research, as many complex diseases are contributed jointly by alterations of numerous genes. Genes often coordinate together as a functional repertoire, e.g., a biological pathway/network and are highly correlated. However, most of the existing gene set analysis methods do not fully account for the correlation among the genes. Here we propose to tackle this important feature of a gene set to improve statistical power in gene set analyses. We propose to model the effects of an independent variable, e.g., exposure/biological status (yes/no), on multiple gene expression values in a gene set using a multivariate linear regression model, where the correlation among the genes is explicitly modeled using a working covariance matrix. We develop TEGS (Test for the Effect of a Gene Set), a variance component test for the gene set effects by assuming a common distribution for regression coefficients in multivariate linear regression models, and calculate the p-values using permutation and a scaled chi-square approximation. We show using simulations that type I error is protected under different choices of working covariance matrices and power is improved as the working covariance approaches the true covariance. The global test is a special case of TEGS when correlation among genes in a gene set is ignored. Using both simulation data and a published diabetes dataset, we show that our test outperforms the commonly used approaches, the global test and gene set enrichment analysis (GSEA). We develop a gene set analyses method (TEGS) under the multivariate regression framework, which directly models the interdependence of the expression values in a gene set using a working covariance. TEGS outperforms two widely used methods, GSEA and global test in both simulation and a diabetes microarray data.

  11. OS087. Maternal characteristics, mean arterial pressure and PLGF in early prediction of preeclampsia.

    PubMed

    Kuc, S; Koster, M P; Franx, A; Schielen, P C; Visser, G H

    2012-07-01

    In a previous study, we described the predictive value of first-trimester pregnancy-associated plasma protein-A (PAPP-A), free beta-subunit of human chorionic gonadotrophin (fb-hCG), Placental Growth Factor (PlGF) and A Desintegrin And Metalloproteinase 12 (ADAM12) for early onset preeclampsia (delivery <34 weeks) [1]. The objective of the current study was to obtain the predictive value of these serum makers, for both early onset PE (EOPE) and late onset PE (LOPE), combined with maternal characteristics and first-trimester maternal mean arterial blood pressure (MAP). This was a nested case-control study, using stored first-trimester maternal serum from 167 women who subsequently developed PE, and 500 uncomplicated singleton pregnancies which resulted in a live birth =>37 weeks. Maternal characteristics (i.e. medical records, parity, weight, length) MAP and pregnancy outcome (i.e. gestational age at delivery, birthweight, fetal sex) were collected for each individual and used to calculate prior risks for PE in a multiple logistic regression model. MAP values and marker levels of PAPP-A, fb-hCG, PlGF and ADAM12 were expressed as multiples of the gestation-specific normal median (MoMs). Subsequently, MoMs were log-transformed and compared between PE and controls using Student's t-tests. Posterior risks were calculated using different combinations of variables;(1) maternal characteristics, serum markers, and MAP separately (2) maternal characteristics combined with serum markers or MAP (3) maternal characteristics combined with serum markers and MAP. The model-predicted detection rates (DR) for fixed 10% false-positive rates were obtained for EOPE and LOPE with or without intra-uterine growth restriction (IUGR,birth weight <10th centile). The maternal characteristics: maternal age, weight, length, smoking status and nulliparity were discriminative between PE and control groups and therefore incorporated in the multiple logistic regression model. MoM MAP was significantly elevated (1.10 p<0.001; 1.07 p<0.001) and MoM PlGF was significantly reduced (0.95 p=0.016; 0.90 p=0.029) in the EOPE and LOPE group, respectively. The differences in markers for IUGR groups were larger. The estimated DRs of the three different models are presented in the table. This study demonstrates that first-trimester MAP and PlGF combined with maternal characteristics are promising markers in risk assessment for PE. Combination of markers proved especially useful for risk assessment for term PE. Detection rates were higher in the presence of IUGR. Copyright © 2012. Published by Elsevier B.V.

  12. Cesium and strontium loads into a combined sewer system from rainwater runoff.

    PubMed

    Kamei-Ishikawa, Nao; Yoshida, Daiki; Ito, Ayumi; Umita, Teruyuki

    2016-12-01

    In this study, combined sewage samples were taken with time in several rain events and sanitary sewage samples were taken with time in dry weather to calculate Cs and Sr loads to sewers from rainwater runoff. Cs and Sr in rainwater were present as particulate forms at first flush and the particulate Cs and Sr were mainly bound with inorganic suspended solids such as clay minerals in combined sewage samples. In addition, multiple linear regression analysis showed Cs and Sr loads from rainwater runoff could be estimated by the total amount of rainfall and antecedent dry weather days. The variation of the Sr load from rainwater to sewers was more sensitive to total amount of rainfall and antecedent dry weather days than that of the Cs load. Copyright © 2016 Elsevier Ltd. All rights reserved.

  13. Domain General Mediators of the Relation between Kindergarten Number Sense and First-Grade Mathematics Achievement

    PubMed Central

    Hassinger-Das, Brenna; Jordan, Nancy C.; Glutting, Joseph; Irwin, Casey; Dyson, Nancy

    2013-01-01

    Domain general skills that mediate the relation between kindergarten number sense and first-grade mathematics skills were investigated. Participants were 107 children who displayed low number sense in the fall of kindergarten. Controlling for background variables, multiple regression analyses showed that attention problems and executive functioning both were unique predictors of mathematics outcomes. Attention problems were more important for predicting first-grade calculation performance while executive functioning was more important for predicting first-grade performance on applied problems. Moreover, both executive functioning and attention problems were unique partial mediators of the relationship between kindergarten and first-grade mathematics skills. The results provide empirical support for developing interventions that target executive functioning and attention problems in addition to instruction in number skills for kindergartners with initial low number sense. PMID:24237789

  14. The Effect of QBO on the Total Mass Density

    NASA Astrophysics Data System (ADS)

    Saǧır, Selçuk; Atıcı, Ramazan

    2016-07-01

    The relationship between Quasi-Biennial Oscillation (QBO) measured at 10 hPa altitude and total mass density (TMD) values obtained from NRLMSIS-00 model for 90 km altitude of ionosphere known as Mesosphere-Lower Thermosphere (MLT) region is statistically investigated. For this study, multiple-regression model is used. To see the effect on TMD of QBO directions, Dummy variables are also added to model. In the result of calculations, it is observed that QBO is effected on TMD. It is determined that 69% of variations at TMD can be explainable by QBO. It is determined that the explainable ratio is at the rate of 5%. Also, it is seen that an increase/a decrease of 1 meter per second at QBO give rise to an increase/a decrease of 7,2x10-4 g/cm3 at TMD.

  15. Chronic exposure to water pollutant trichloroethylene increased epigenetic drift in CD4+ T cells

    PubMed Central

    Gilbert, Kathleen M; Blossom, Sarah J; Erickson, Stephen W; Reisfeld, Brad; Zurlinden, Todd J; Broadfoot, Brannon; West, Kirk; Bai, Shasha; Cooney, Craig A

    2016-01-01

    Aim: Autoimmune disease and CD4+ T-cell alterations are induced in mice exposed to the water pollutant trichloroethylene (TCE). We examined here whether TCE altered gene-specific DNA methylation in CD4+ T cells as a possible mechanism of immunotoxicity. Materials & methods: Naive and effector/memory CD4+ T cells from mice exposed to TCE (0.5 mg/ml in drinking water) for 40 weeks were examined by bisulfite next-generation DNA sequencing. Results: A probabilistic model calculated from multiple genes showed that TCE decreased methylation control in CD4+ T cells. Data from individual genes fitted to a quadratic regression model showed that TCE increased gene-specific methylation variance in both CD4 subsets. Conclusion: TCE increased epigenetic drift of specific CpG sites in CD4+ T cells. PMID:27092578

  16. Domain-general mediators of the relation between kindergarten number sense and first-grade mathematics achievement.

    PubMed

    Hassinger-Das, Brenna; Jordan, Nancy C; Glutting, Joseph; Irwin, Casey; Dyson, Nancy

    2014-02-01

    Domain-general skills that mediate the relation between kindergarten number sense and first-grade mathematics skills were investigated. Participants were 107 children who displayed low number sense in the fall of kindergarten. Controlling for background variables, multiple regression analyses showed that both attention problems and executive functioning were unique predictors of mathematics outcomes. Attention problems were more important for predicting first-grade calculation performance, whereas executive functioning was more important for predicting first-grade performance on applied problems. Moreover, both executive functioning and attention problems were unique partial mediators of the relationship between kindergarten and first-grade mathematics skills. The results provide empirical support for developing interventions that target executive functioning and attention problems in addition to instruction in number skills for kindergartners with initial low number sense. Copyright © 2013 Elsevier Inc. All rights reserved.

  17. Comparing methods of analysing datasets with small clusters: case studies using four paediatric datasets.

    PubMed

    Marston, Louise; Peacock, Janet L; Yu, Keming; Brocklehurst, Peter; Calvert, Sandra A; Greenough, Anne; Marlow, Neil

    2009-07-01

    Studies of prematurely born infants contain a relatively large percentage of multiple births, so the resulting data have a hierarchical structure with small clusters of size 1, 2 or 3. Ignoring the clustering may lead to incorrect inferences. The aim of this study was to compare statistical methods which can be used to analyse such data: generalised estimating equations, multilevel models, multiple linear regression and logistic regression. Four datasets which differed in total size and in percentage of multiple births (n = 254, multiple 18%; n = 176, multiple 9%; n = 10 098, multiple 3%; n = 1585, multiple 8%) were analysed. With the continuous outcome, two-level models produced similar results in the larger dataset, while generalised least squares multilevel modelling (ML GLS 'xtreg' in Stata) and maximum likelihood multilevel modelling (ML MLE 'xtmixed' in Stata) produced divergent estimates using the smaller dataset. For the dichotomous outcome, most methods, except generalised least squares multilevel modelling (ML GH 'xtlogit' in Stata) gave similar odds ratios and 95% confidence intervals within datasets. For the continuous outcome, our results suggest using multilevel modelling. We conclude that generalised least squares multilevel modelling (ML GLS 'xtreg' in Stata) and maximum likelihood multilevel modelling (ML MLE 'xtmixed' in Stata) should be used with caution when the dataset is small. Where the outcome is dichotomous and there is a relatively large percentage of non-independent data, it is recommended that these are accounted for in analyses using logistic regression with adjusted standard errors or multilevel modelling. If, however, the dataset has a small percentage of clusters greater than size 1 (e.g. a population dataset of children where there are few multiples) there appears to be less need to adjust for clustering.

  18. Validity of VO(2 max) in predicting blood volume: implications for the effect of fitness on aging

    NASA Technical Reports Server (NTRS)

    Convertino, V. A.; Ludwig, D. A.

    2000-01-01

    A multiple regression model was constructed to investigate the premise that blood volume (BV) could be predicted using several anthropometric variables, age, and maximal oxygen uptake (VO(2 max)). To test this hypothesis, age, calculated body surface area (height/weight composite), percent body fat (hydrostatic weight), and VO(2 max) were regressed on to BV using data obtained from 66 normal healthy men. Results from the evaluation of the full model indicated that the most parsimonious result was obtained when age and VO(2 max) were regressed on BV expressed per kilogram body weight. The full model accounted for 52% of the total variance in BV per kilogram body weight. Both age and VO(2 max) were related to BV in the positive direction. Percent body fat contributed <1% to the explained variance in BV when expressed in absolute BV (ml) or as BV per kilogram body weight. When the model was cross validated on 41 new subjects and BV per kilogram body weight was reexpressed as raw BV, the results indicated that the statistical model would be stable under cross validation (e.g., predictive applications) with an accuracy of +/- 1,200 ml at 95% confidence. Our results support the hypothesis that BV is an increasing function of aerobic fitness and to a lesser extent the age of the subject. The results may have implication as to a mechanism by which aerobic fitness and activity may be protective against reduced BV associated with aging.

  19. Prediction of sickness absence: development of a screening instrument

    PubMed Central

    Duijts, S F A; Kant, IJ; Landeweerd, J A; Swaen, G M H

    2006-01-01

    Objectives To develop a concise screening instrument for early identification of employees at risk for sickness absence due to psychosocial health complaints. Methods Data from the Maastricht Cohort Study on “Fatigue at Work” were used to identify items to be associated with an increased risk of sickness absence. The analytical procedures univariate logistic regression, backward stepwise linear regression, and multiple logistic regression were successively applied. For both men and women, sum scores were calculated, and sensitivity and specificity rates of different cut‐off points on the screening instrument were defined. Results In women, results suggested that feeling depressed, having a burnout, being tired, being less interested in work, experiencing obligatory change in working days, and living alone, were strong predictors of sickness absence due to psychosocial health complaints. In men, statistically significant predictors were having a history of sickness absence, compulsive thinking, being mentally fatigued, finding it hard to relax, lack of supervisor support, and having no hobbies. A potential cut‐off point of 10 on the screening instrument resulted in a sensitivity score of 41.7% for women and 38.9% for men, and a specificity score of 91.3% for women and 90.6% for men. Conclusions This study shows that it is possible to identify predictive factors for sickness absence and to develop an instrument for early identification of employees at risk for sickness absence. The results of this study increase the possibility for both employers and policymakers to implement interventions directed at the prevention of sickness absence. PMID:16698807

  20. Groundwater depth prediction in a shallow aquifer in north China by a quantile regression model

    NASA Astrophysics Data System (ADS)

    Li, Fawen; Wei, Wan; Zhao, Yong; Qiao, Jiale

    2017-01-01

    There is a close relationship between groundwater level in a shallow aquifer and the surface ecological environment; hence, it is important to accurately simulate and predict the groundwater level in eco-environmental construction projects. The multiple linear regression (MLR) model is one of the most useful methods to predict groundwater level (depth); however, the predicted values by this model only reflect the mean distribution of the observations and cannot effectively fit the extreme distribution data (outliers). The study reported here builds a prediction model of groundwater-depth dynamics in a shallow aquifer using the quantile regression (QR) method on the basis of the observed data of groundwater depth and related factors. The proposed approach was applied to five sites in Tianjin city, north China, and the groundwater depth was calculated in different quantiles, from which the optimal quantile was screened out according to the box plot method and compared to the values predicted by the MLR model. The results showed that the related factors in the five sites did not follow the standard normal distribution and that there were outliers in the precipitation and last-month (initial state) groundwater-depth factors because the basic assumptions of the MLR model could not be achieved, thereby causing errors. Nevertheless, these conditions had no effect on the QR model, as it could more effectively describe the distribution of original data and had a higher precision in fitting the outliers.

  1. Combined computational-experimental approach to predict blood-brain barrier (BBB) permeation based on "green" salting-out thin layer chromatography supported by simple molecular descriptors.

    PubMed

    Ciura, Krzesimir; Belka, Mariusz; Kawczak, Piotr; Bączek, Tomasz; Markuszewski, Michał J; Nowakowska, Joanna

    2017-09-05

    The objective of this paper is to build QSRR/QSAR model for predicting the blood-brain barrier (BBB) permeability. The obtained models are based on salting-out thin layer chromatography (SOTLC) constants and calculated molecular descriptors. Among chromatographic methods SOTLC was chosen, since the mobile phases are free of organic solvent. As consequences, there are less toxic, and have lower environmental impact compared to classical reserved phases liquid chromatography (RPLC). During the study three stationary phase silica gel, cellulose plates and neutral aluminum oxide were examined. The model set of solutes presents a wide range of log BB values, containing compounds which cross the BBB readily and molecules poorly distributed to the brain including drugs acting on the nervous system as well as peripheral acting drugs. Additionally, the comparison of three regression models: multiple linear regression (MLR), partial least-squares (PLS) and orthogonal partial least squares (OPLS) were performed. The designed QSRR/QSAR models could be useful to predict BBB of systematically synthesized newly compounds in the drug development pipeline and are attractive alternatives of time-consuming and demanding directed methods for log BB measurement. The study also shown that among several regression techniques, significant differences can be obtained in models performance, measured by R 2 and Q 2 , hence it is strongly suggested to evaluate all available options as MLR, PLS and OPLS. Copyright © 2017 Elsevier B.V. All rights reserved.

  2. Relative Motion of the WDS 05110+3203 STF 648 System, With a Protocol for Calculating Relative Motion

    NASA Astrophysics Data System (ADS)

    Wiley, E. O.

    2010-07-01

    Relative motion studies of visual double stars can be investigated using least squares regression techniques and readily accessible programs such as Microsoft Excel and a calculator. Optical pairs differ from physical pairs under most geometries in both their simple scatter plots and their regression models. A step-by-step protocol for estimating the rectilinear elements of an optical pair is presented. The characteristics of physical pairs using these techniques are discussed.

  3. Regression Techniques for Determining the Effective Impervious Area in Southern California Watersheds

    NASA Astrophysics Data System (ADS)

    Sultana, R.; Mroczek, M.; Dallman, S.; Sengupta, A.; Stein, E. D.

    2016-12-01

    The portion of the Total Impervious Area (TIA) that is hydraulically connected to the storm drainage network is called the Effective Impervious Area (EIA). The remaining fraction of impervious area, called the non-effective impervious area, drains onto pervious surfaces which do not contribute to runoff for smaller events. Using the TIA instead of EIA in models and calculations can lead to overestimates of runoff volumes peak discharges and oversizing of drainage system since it is assumed all impervious areas produce urban runoff that is directly connected to storm drains. This makes EIA a better predictor of actual runoff from urban catchments for hydraulic design of storm drain systems and modeling non-point source pollution. Compared to TIA, determining the EIA is considerably more difficult to calculate since it cannot be found by using remote sensing techniques, readily available EIA datasets, or aerial imagery interpretation alone. For this study, EIA percentages were calculated by two successive regression methods for five watersheds (with areas of 8.38 - 158mi2) located in Southern California using rainfall-runoff event data for the years 2004 - 2007. Runoff generated from the smaller storm events are considered to be emanating only from the effective impervious areas. Therefore, larger events that were considered to have runoff from both impervious and pervious surfaces were successively removed in the regression methods using a criterion of (1) 1mm and (2) a max (2 , 1mm) above the regression line. MSE is calculated from actual runoff and runoff predicted by the regression. Analysis of standard deviations showed that criterion of max (2 , 1mm) better fit the regression line and is the preferred method in predicting the EIA percentage. The estimated EIAs have shown to be approximately 78% to 43% of the TIA which shows use of EIA instead of TIA can have significant impact on the cost building urban hydraulic systems and stormwater capture devices.

  4. Isolating the cow-specific part of residual energy intake in lactating dairy cows using random regressions.

    PubMed

    Fischer, A; Friggens, N C; Berry, D P; Faverdin, P

    2018-07-01

    The ability to properly assess and accurately phenotype true differences in feed efficiency among dairy cows is key to the development of breeding programs for improving feed efficiency. The variability among individuals in feed efficiency is commonly characterised by the residual intake approach. Residual feed intake is represented by the residuals of a linear regression of intake on the corresponding quantities of the biological functions that consume (or release) energy. However, the residuals include both, model fitting and measurement errors as well as any variability in cow efficiency. The objective of this study was to isolate the individual animal variability in feed efficiency from the residual component. Two separate models were fitted, in one the standard residual energy intake (REI) was calculated as the residual of a multiple linear regression of lactation average net energy intake (NEI) on lactation average milk energy output, average metabolic BW, as well as lactation loss and gain of body condition score. In the other, a linear mixed model was used to simultaneously fit fixed linear regressions and random cow levels on the biological traits and intercept using fortnight repeated measures for the variables. This method split the predicted NEI in two parts: one quantifying the population mean intercept and coefficients, and one quantifying cow-specific deviations in the intercept and coefficients. The cow-specific part of predicted NEI was assumed to isolate true differences in feed efficiency among cows. NEI and associated energy expenditure phenotypes were available for the first 17 fortnights of lactation from 119 Holstein cows; all fed a constant energy-rich diet. Mixed models fitting cow-specific intercept and coefficients to different combinations of the aforementioned energy expenditure traits, calculated on a fortnightly basis, were compared. The variance of REI estimated with the lactation average model represented only 8% of the variance of measured NEI. Among all compared mixed models, the variance of the cow-specific part of predicted NEI represented between 53% and 59% of the variance of REI estimated from the lactation average model or between 4% and 5% of the variance of measured NEI. The remaining 41% to 47% of the variance of REI estimated with the lactation average model may therefore reflect model fitting errors or measurement errors. In conclusion, the use of a mixed model framework with cow-specific random regressions seems to be a promising method to isolate the cow-specific component of REI in dairy cows.

  5. A Method for Calculating the Probability of Successfully Completing a Rocket Propulsion Ground Test

    NASA Technical Reports Server (NTRS)

    Messer, Bradley P.

    2004-01-01

    Propulsion ground test facilities face the daily challenges of scheduling multiple customers into limited facility space and successfully completing their propulsion test projects. Due to budgetary and schedule constraints, NASA and industry customers are pushing to test more components, for less money, in a shorter period of time. As these new rocket engine component test programs are undertaken, the lack of technology maturity in the test articles, combined with pushing the test facilities capabilities to their limits, tends to lead to an increase in facility breakdowns and unsuccessful tests. Over the last five years Stennis Space Center's propulsion test facilities have performed hundreds of tests, collected thousands of seconds of test data, and broken numerous test facility and test article parts. While various initiatives have been implemented to provide better propulsion test techniques and improve the quality, reliability, and maintainability of goods and parts used in the propulsion test facilities, unexpected failures during testing still occur quite regularly due to the harsh environment in which the propulsion test facilities operate. Previous attempts at modeling the lifecycle of a propulsion component test project have met with little success. Each of the attempts suffered form incomplete or inconsistent data on which to base the models. By focusing on the actual test phase of the tests project rather than the formulation, design or construction phases of the test project, the quality and quantity of available data increases dramatically. A logistic regression model has been developed form the data collected over the last five years, allowing the probability of successfully completing a rocket propulsion component test to be calculated. A logistic regression model is a mathematical modeling approach that can be used to describe the relationship of several independent predictor variables X(sub 1), X(sub 2),..,X(sub k) to a binary or dichotomous dependent variable Y, where Y can only be one of two possible outcomes, in this case Success or Failure. Logistic regression has primarily been used in the fields of epidemiology and biomedical research, but lends itself to many other applications. As indicated the use of logistic regression is not new, however, modeling propulsion ground test facilities using logistic regression is both a new and unique application of the statistical technique. Results from the models provide project managers with insight and confidence into the affectivity of rocket engine component ground test projects. The initial success in modeling rocket propulsion ground test projects clears the way for more complex models to be developed in this area.

  6. A Method for Assessing the Quality of Model-Based Estimates of Ground Temperature and Atmospheric Moisture Using Satellite Data

    NASA Technical Reports Server (NTRS)

    Wu, Man Li C.; Schubert, Siegfried; Lin, Ching I.; Stajner, Ivanka; Einaudi, Franco (Technical Monitor)

    2000-01-01

    A method is developed for validating model-based estimates of atmospheric moisture and ground temperature using satellite data. The approach relates errors in estimates of clear-sky longwave fluxes at the top of the Earth-atmosphere system to errors in geophysical parameters. The fluxes include clear-sky outgoing longwave radiation (CLR) and radiative flux in the window region between 8 and 12 microns (RadWn). The approach capitalizes on the availability of satellite estimates of CLR and RadWn and other auxiliary satellite data, and multiple global four-dimensional data assimilation (4-DDA) products. The basic methodology employs off-line forward radiative transfer calculations to generate synthetic clear-sky longwave fluxes from two different 4-DDA data sets. Simple linear regression is used to relate the clear-sky longwave flux discrepancies to discrepancies in ground temperature ((delta)T(sub g)) and broad-layer integrated atmospheric precipitable water ((delta)pw). The slopes of the regression lines define sensitivity parameters which can be exploited to help interpret mismatches between satellite observations and model-based estimates of clear-sky longwave fluxes. For illustration we analyze the discrepancies in the clear-sky longwave fluxes between an early implementation of the Goddard Earth Observing System Data Assimilation System (GEOS2) and a recent operational version of the European Centre for Medium-Range Weather Forecasts data assimilation system. The analysis of the synthetic clear-sky flux data shows that simple linear regression employing (delta)T(sub g)) and broad layer (delta)pw provides a good approximation to the full radiative transfer calculations, typically explaining more thin 90% of the 6 hourly variance in the flux differences. These simple regression relations can be inverted to "retrieve" the errors in the geophysical parameters, Uncertainties (normalized by standard deviation) in the monthly mean retrieved parameters range from 7% for (delta)T(sub g) to approx. 20% for the lower tropospheric moisture between 500 hPa and surface. The regression relationships developed from the synthetic flux data, together with CLR and RadWn observed with the Clouds and Earth Radiant Energy System instrument, ire used to assess the quality of the GEOS2 T(sub g) and pw. Results showed that the GEOS2 T(sub g) is too cold over land, and pw in upper layers is too high over the tropical oceans and too low in the lower atmosphere.

  7. Selection of the NIR region for a regression model of the ethanol concentration in fermentation process by an online NIR and mid-IR dual-region spectrometer and 2D heterospectral correlation spectroscopy.

    PubMed

    Nishii, Takashi; Genkawa, Takuma; Watari, Masahiro; Ozaki, Yukihiro

    2012-01-01

    A new selection procedure of an informative near-infrared (NIR) region for regression model building is proposed that uses an online NIR/mid-infrared (mid-IR) dual-region spectrometer in conjunction with two-dimensional (2D) NIR/mid-IR heterospectral correlation spectroscopy. In this procedure, both NIR and mid-IR spectra of a liquid sample are acquired sequentially during a reaction process using the NIR/mid-IR dual-region spectrometer; the 2D NIR/mid-IR heterospectral correlation spectrum is subsequently calculated from the obtained spectral data set. From the calculated 2D spectrum, a NIR region is selected that includes bands of high positive correlation intensity with mid-IR bands assigned to the analyte, and used for the construction of a regression model. To evaluate the performance of this procedure, a partial least-squares (PLS) regression model of the ethanol concentration in a fermentation process was constructed. During fermentation, NIR/mid-IR spectra in the 10000 - 1200 cm(-1) region were acquired every 3 min, and a 2D NIR/mid-IR heterospectral correlation spectrum was calculated to investigate the correlation intensity between the NIR and mid-IR bands. NIR regions that include bands at 4343, 4416, 5778, 5904, and 5955 cm(-1), which result from the combinations and overtones of the C-H group of ethanol, were selected for use in the PLS regression models, by taking the correlation intensity of a mid-IR band at 2985 cm(-1) arising from the CH(3) asymmetric stretching vibration mode of ethanol as a reference. The predicted results indicate that the ethanol concentrations calculated from the PLS regression models fit well to those obtained by high-performance liquid chromatography. Thus, it can be concluded that the selection procedure using the NIR/mid-IR dual-region spectrometer combined with 2D NIR/mid-IR heterospectral correlation spectroscopy is a powerful method for the construction of a reliable regression model.

  8. Temporal trend and climate factors of hemorrhagic fever with renal syndrome epidemic in Shenyang City, China

    PubMed Central

    2011-01-01

    Background Hemorrhagic fever with renal syndrome (HFRS) is an important infectious disease caused by different species of hantaviruses. As a rodent-borne disease with a seasonal distribution, external environmental factors including climate factors may play a significant role in its transmission. The city of Shenyang is one of the most seriously endemic areas for HFRS. Here, we characterized the dynamic temporal trend of HFRS, and identified climate-related risk factors and their roles in HFRS transmission in Shenyang, China. Methods The annual and monthly cumulative numbers of HFRS cases from 2004 to 2009 were calculated and plotted to show the annual and seasonal fluctuation in Shenyang. Cross-correlation and autocorrelation analyses were performed to detect the lagged effect of climate factors on HFRS transmission and the autocorrelation of monthly HFRS cases. Principal component analysis was constructed by using climate data from 2004 to 2009 to extract principal components of climate factors to reduce co-linearity. The extracted principal components and autocorrelation terms of monthly HFRS cases were added into a multiple regression model called principal components regression model (PCR) to quantify the relationship between climate factors, autocorrelation terms and transmission of HFRS. The PCR model was compared to a general multiple regression model conducted only with climate factors as independent variables. Results A distinctly declining temporal trend of annual HFRS incidence was identified. HFRS cases were reported every month, and the two peak periods occurred in spring (March to May) and winter (November to January), during which, nearly 75% of the HFRS cases were reported. Three principal components were extracted with a cumulative contribution rate of 86.06%. Component 1 represented MinRH0, MT1, RH1, and MWV1; component 2 represented RH2, MaxT3, and MAP3; and component 3 represented MaxT2, MAP2, and MWV2. The PCR model was composed of three principal components and two autocorrelation terms. The association between HFRS epidemics and climate factors was better explained in the PCR model (F = 446.452, P < 0.001, adjusted R2 = 0.75) than in the general multiple regression model (F = 223.670, P < 0.000, adjusted R2 = 0.51). Conclusion The temporal distribution of HFRS in Shenyang varied in different years with a distinctly declining trend. The monthly trends of HFRS were significantly associated with local temperature, relative humidity, precipitation, air pressure, and wind velocity of the different previous months. The model conducted in this study will make HFRS surveillance simpler and the control of HFRS more targeted in Shenyang. PMID:22133347

  9. The extraction of simple relationships in growth factor-specific multiple-input and multiple-output systems in cell-fate decisions by backward elimination PLS regression.

    PubMed

    Akimoto, Yuki; Yugi, Katsuyuki; Uda, Shinsuke; Kudo, Takamasa; Komori, Yasunori; Kubota, Hiroyuki; Kuroda, Shinya

    2013-01-01

    Cells use common signaling molecules for the selective control of downstream gene expression and cell-fate decisions. The relationship between signaling molecules and downstream gene expression and cellular phenotypes is a multiple-input and multiple-output (MIMO) system and is difficult to understand due to its complexity. For example, it has been reported that, in PC12 cells, different types of growth factors activate MAP kinases (MAPKs) including ERK, JNK, and p38, and CREB, for selective protein expression of immediate early genes (IEGs) such as c-FOS, c-JUN, EGR1, JUNB, and FOSB, leading to cell differentiation, proliferation and cell death; however, how multiple-inputs such as MAPKs and CREB regulate multiple-outputs such as expression of the IEGs and cellular phenotypes remains unclear. To address this issue, we employed a statistical method called partial least squares (PLS) regression, which involves a reduction of the dimensionality of the inputs and outputs into latent variables and a linear regression between these latent variables. We measured 1,200 data points for MAPKs and CREB as the inputs and 1,900 data points for IEGs and cellular phenotypes as the outputs, and we constructed the PLS model from these data. The PLS model highlighted the complexity of the MIMO system and growth factor-specific input-output relationships of cell-fate decisions in PC12 cells. Furthermore, to reduce the complexity, we applied a backward elimination method to the PLS regression, in which 60 input variables were reduced to 5 variables, including the phosphorylation of ERK at 10 min, CREB at 5 min and 60 min, AKT at 5 min and JNK at 30 min. The simple PLS model with only 5 input variables demonstrated a predictive ability comparable to that of the full PLS model. The 5 input variables effectively extracted the growth factor-specific simple relationships within the MIMO system in cell-fate decisions in PC12 cells.

  10. Detection of epistatic effects with logic regression and a classical linear regression model.

    PubMed

    Malina, Magdalena; Ickstadt, Katja; Schwender, Holger; Posch, Martin; Bogdan, Małgorzata

    2014-02-01

    To locate multiple interacting quantitative trait loci (QTL) influencing a trait of interest within experimental populations, usually methods as the Cockerham's model are applied. Within this framework, interactions are understood as the part of the joined effect of several genes which cannot be explained as the sum of their additive effects. However, if a change in the phenotype (as disease) is caused by Boolean combinations of genotypes of several QTLs, this Cockerham's approach is often not capable to identify them properly. To detect such interactions more efficiently, we propose a logic regression framework. Even though with the logic regression approach a larger number of models has to be considered (requiring more stringent multiple testing correction) the efficient representation of higher order logic interactions in logic regression models leads to a significant increase of power to detect such interactions as compared to a Cockerham's approach. The increase in power is demonstrated analytically for a simple two-way interaction model and illustrated in more complex settings with simulation study and real data analysis.

  11. Calculating the habitable zones of multiple star systems with a new interactive Web site

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Müller, Tobias W. A.; Haghighipour, Nader

    We have developed a comprehensive methodology and an interactive Web site for calculating the habitable zone (HZ) of multiple star systems. Using the concept of spectral weight factor, as introduced in our previous studies of the calculations of HZ in and around binary star systems, we calculate the contribution of each star (based on its spectral energy distribution) to the total flux received at the top of the atmosphere of an Earth-like planet, and use the models of the HZ of the Sun to determine the boundaries of the HZ in multiple star systems. Our interactive Web site for carryingmore » out these calculations is publicly available at http://astro.twam.info/hz. We discuss the details of our methodology and present its application to some of the multiple star systems detected by the Kepler space telescope. We also present the instructions for using our interactive Web site, and demonstrate its capabilities by calculating the HZ for two interesting analytical solutions of the three-body problem.« less

  12. Influence of landscape-scale factors in limiting brook trout populations in Pennsylvania streams

    USGS Publications Warehouse

    Kocovsky, P.M.; Carline, R.F.

    2006-01-01

    Landscapes influence the capacity of streams to produce trout through their effect on water chemistry and other factors at the reach scale. Trout abundance also fluctuates over time; thus, to thoroughly understand how spatial factors at landscape scales affect trout populations, one must assess the changes in populations over time to provide a context for interpreting the importance of spatial factors. We used data from the Pennsylvania Fish and Boat Commission's fisheries management database to investigate spatial factors that affect the capacity of streams to support brook trout Salvelinus fontinalis and to provide models useful for their management. We assessed the relative importance of spatial and temporal variation by calculating variance components and comparing relative standard errors for spatial and temporal variation. We used binary logistic regression to predict the presence of harvestable-length brook trout and multiple linear regression to assess the mechanistic links between landscapes and trout populations and to predict population density. The variance in trout density among streams was equal to or greater than the temporal variation for several streams, indicating that differences among sites affect population density. Logistic regression models correctly predicted the absence of harvestable-length brook trout in 60% of validation samples. The r 2-value for the linear regression model predicting density was 0.3, indicating low predictive ability. Both logistic and linear regression models supported buffering capacity against acid episodes as an important mechanistic link between landscapes and trout populations. Although our models fail to predict trout densities precisely, their success at elucidating the mechanistic links between landscapes and trout populations, in concert with the importance of spatial variation, increases our understanding of factors affecting brook trout abundance and will help managers and private groups to protect and enhance populations of wild brook trout. ?? Copyright by the American Fisheries Society 2006.

  13. Regression in autistic spectrum disorders.

    PubMed

    Stefanatos, Gerry A

    2008-12-01

    A significant proportion of children diagnosed with Autistic Spectrum Disorder experience a developmental regression characterized by a loss of previously-acquired skills. This may involve a loss of speech or social responsitivity, but often entails both. This paper critically reviews the phenomena of regression in autistic spectrum disorders, highlighting the characteristics of regression, age of onset, temporal course, and long-term outcome. Important considerations for diagnosis are discussed and multiple etiological factors currently hypothesized to underlie the phenomenon are reviewed. It is argued that regressive autistic spectrum disorders can be conceptualized on a spectrum with other regressive disorders that may share common pathophysiological features. The implications of this viewpoint are discussed.

  14. Statistical power analyses using G*Power 3.1: tests for correlation and regression analyses.

    PubMed

    Faul, Franz; Erdfelder, Edgar; Buchner, Axel; Lang, Albert-Georg

    2009-11-01

    G*Power is a free power analysis program for a variety of statistical tests. We present extensions and improvements of the version introduced by Faul, Erdfelder, Lang, and Buchner (2007) in the domain of correlation and regression analyses. In the new version, we have added procedures to analyze the power of tests based on (1) single-sample tetrachoric correlations, (2) comparisons of dependent correlations, (3) bivariate linear regression, (4) multiple linear regression based on the random predictor model, (5) logistic regression, and (6) Poisson regression. We describe these new features and provide a brief introduction to their scope and handling.

  15. An empirical study of statistical properties of variance partition coefficients for multi-level logistic regression models

    USGS Publications Warehouse

    Li, Ji; Gray, B.R.; Bates, D.M.

    2008-01-01

    Partitioning the variance of a response by design levels is challenging for binomial and other discrete outcomes. Goldstein (2003) proposed four definitions for variance partitioning coefficients (VPC) under a two-level logistic regression model. In this study, we explicitly derived formulae for multi-level logistic regression model and subsequently studied the distributional properties of the calculated VPCs. Using simulations and a vegetation dataset, we demonstrated associations between different VPC definitions, the importance of methods for estimating VPCs (by comparing VPC obtained using Laplace and penalized quasilikehood methods), and bivariate dependence between VPCs calculated at different levels. Such an empirical study lends an immediate support to wider applications of VPC in scientific data analysis.

  16. A simple approach to power and sample size calculations in logistic regression and Cox regression models.

    PubMed

    Vaeth, Michael; Skovlund, Eva

    2004-06-15

    For a given regression problem it is possible to identify a suitably defined equivalent two-sample problem such that the power or sample size obtained for the two-sample problem also applies to the regression problem. For a standard linear regression model the equivalent two-sample problem is easily identified, but for generalized linear models and for Cox regression models the situation is more complicated. An approximately equivalent two-sample problem may, however, also be identified here. In particular, we show that for logistic regression and Cox regression models the equivalent two-sample problem is obtained by selecting two equally sized samples for which the parameters differ by a value equal to the slope times twice the standard deviation of the independent variable and further requiring that the overall expected number of events is unchanged. In a simulation study we examine the validity of this approach to power calculations in logistic regression and Cox regression models. Several different covariate distributions are considered for selected values of the overall response probability and a range of alternatives. For the Cox regression model we consider both constant and non-constant hazard rates. The results show that in general the approach is remarkably accurate even in relatively small samples. Some discrepancies are, however, found in small samples with few events and a highly skewed covariate distribution. Comparison with results based on alternative methods for logistic regression models with a single continuous covariate indicates that the proposed method is at least as good as its competitors. The method is easy to implement and therefore provides a simple way to extend the range of problems that can be covered by the usual formulas for power and sample size determination. Copyright 2004 John Wiley & Sons, Ltd.

  17. Obtaining appropriate interval estimates for age when multiple indicators are used: evaluation of an ad-hoc procedure.

    PubMed

    Fieuws, Steffen; Willems, Guy; Larsen-Tangmose, Sara; Lynnerup, Niels; Boldsen, Jesper; Thevissen, Patrick

    2016-03-01

    When an estimate of age is needed, typically multiple indicators are present as found in skeletal or dental information. There exists a vast literature on approaches to estimate age from such multivariate data. Application of Bayes' rule has been proposed to overcome drawbacks of classical regression models but becomes less trivial as soon as the number of indicators increases. Each of the age indicators can lead to a different point estimate ("the most plausible value for age") and a prediction interval ("the range of possible values"). The major challenge in the combination of multiple indicators is not the calculation of a combined point estimate for age but the construction of an appropriate prediction interval. Ignoring the correlation between the age indicators results in intervals being too small. Boldsen et al. (2002) presented an ad-hoc procedure to construct an approximate confidence interval without the need to model the multivariate correlation structure between the indicators. The aim of the present paper is to bring under attention this pragmatic approach and to evaluate its performance in a practical setting. This is all the more needed since recent publications ignore the need for interval estimation. To illustrate and evaluate the method, Köhler et al. (1995) third molar scores are used to estimate the age in a dataset of 3200 male subjects in the juvenile age range.

  18. Dependence of the Peak Fluxes of Solar Energetic Particles on CME 3D Parameters from STEREO and SOHO

    DOE Office of Scientific and Technical Information (OSTI.GOV)

    Park, Jinhye; Moon, Y.-J.; Lee, Harim, E-mail: jinhye@khu.ac.kr

    We investigate the relationships between the peak fluxes of 18 solar energetic particle (SEP) events and associated coronal mass ejection (CME) 3D parameters (speed, angular width, and separation angle) obtained from SOHO , and STEREO-A / B for the period from 2010 August to 2013 June. We apply the STEREO CME Analysis Tool (StereoCAT) to the SEP-associated CMEs to obtain 3D speeds and 3D angular widths. The separation angles are determined as the longitudinal angles between flaring regions and magnetic footpoints of the spacecraft, which are calculated by the assumption of a Parker spiral field. The main results are asmore » follows. (1) We find that the dependence of the SEP peak fluxes on CME 3D speed from multiple spacecraft is similar to that on CME 2D speed. (2) There is a positive correlation between SEP peak flux and 3D angular width from multiple spacecraft, which is much more evident than the relationship between SEP peak flux and 2D angular width. (3) There is a noticeable anti-correlation ( r = −0.62) between SEP peak flux and separation angle. (4) The multiple-regression method between SEP peak fluxes and CME 3D parameters shows that the longitudinal separation angle is the most important parameter, and the CME 3D speed is secondary on SEP peak flux.« less

  19. Interlaboratory comparability, bias, and precision for four laboratories measuring constituents in precipitation, November 1982-August 1983

    USGS Publications Warehouse

    Brooks, M.H.; Schroder, L.J.; Malo, B.A.

    1985-01-01

    Four laboratories were evaluated in their analysis of identical natural and simulated precipitation water samples. Interlaboratory comparability was evaluated using analysis of variance coupled with Duncan 's multiple range test, and linear-regression models describing the relations between individual laboratory analytical results for natural precipitation samples. Results of the statistical analyses indicate that certain pairs of laboratories produce different results when analyzing identical samples. Analyte bias for each laboratory was examined using analysis of variance coupled with Duncan 's multiple range test on data produced by the laboratories from the analysis of identical simulated precipitation samples. Bias for a given analyte produced by a single laboratory has been indicated when the laboratory mean for that analyte is shown to be significantly different from the mean for the most-probable analyte concentrations in the simulated precipitation samples. Ion-chromatographic methods for the determination of chloride, nitrate, and sulfate have been compared with the colorimetric methods that were also in use during the study period. Comparisons were made using analysis of variance coupled with Duncan 's multiple range test for means produced by the two methods. Analyte precision for each laboratory has been estimated by calculating a pooled variance for each analyte. Analyte estimated precisions have been compared using F-tests and differences in analyte precisions for laboratory pairs have been reported. (USGS)

  20. Relationship of pressure to be thin with gains in body weight and fat mass in adolescents.

    PubMed

    Suelter, C S; Schvey, N; Kelly, N R; Shanks, M; Thompson, K A; Mehari, R; Brady, S; Yanovski, S Z; Melby, C L; Tanofsky-Kraff, M; Yanovski, J A; Shomaker, L B

    2018-01-01

    Sociocultural pressure to be thin is commonly reported by adolescents; yet, to what extent such pressure is associated with weight gain has not been evaluated longitudinally. Examine whether pressure to be thin was positively associated with weight and fat gain in adolescents. Participants were 196 healthy adolescent (age 15 ± 1 years old) girls (65%) and boys of varying weights (BMI 25 ± 7 kg/m 2 ) studied at baseline and 1-year follow-up. At baseline, adolescents and their mothers reported pressure to be thin by questionnaire. At baseline and follow-up, BMI was calculated, and fat mass was assessed with air displacement plethysmography. Multiple regression was used to examine associations between baseline pressure to be thin and 1-year changes in BMI and fat mass. Accounting for multiple covariates, including baseline BMI or fat, adolescent-reported pressure from parents and peers and mother-reported pressure toward their teen were associated with greater gains in either adolescent BMI or fat (ps < .05). Adolescent weight status was a moderator of multiple effects (ps < .05). Parental and peer pressure to be thin were associated with increases in BMI and fat mass during adolescence, particularly in heavier adolescents. Further research is necessary to clarify how this association operates reciprocally and to identify underlying explanatory mechanisms. © 2016 World Obesity Federation.

Top