ERIC Educational Resources Information Center
Anderson, Joan L.
2006-01-01
Data from graduate student applications at a large Western university were used to determine which factors were the best predictors of success in graduate school, as defined by cumulative graduate grade point average. Two statistical models were employed and compared: artificial neural networking and simultaneous multiple regression. Both models…
Lorenzo-Seva, Urbano; Ferrando, Pere J
2011-03-01
We provide an SPSS program that implements currently recommended techniques and recent developments for selecting variables in multiple linear regression analysis via the relative importance of predictors. The approach consists of: (1) optimally splitting the data for cross-validation, (2) selecting the final set of predictors to be retained in the equation regression, and (3) assessing the behavior of the chosen model using standard indices and procedures. The SPSS syntax, a short manual, and data files related to this article are available as supplemental materials from brm.psychonomic-journals.org/content/supplemental.
Advanced statistics: linear regression, part II: multiple linear regression.
Marill, Keith A
2004-01-01
The applications of simple linear regression in medical research are limited, because in most situations, there are multiple relevant predictor variables. Univariate statistical techniques such as simple linear regression use a single predictor variable, and they often may be mathematically correct but clinically misleading. Multiple linear regression is a mathematical technique used to model the relationship between multiple independent predictor variables and a single dependent outcome variable. It is used in medical research to model observational data, as well as in diagnostic and therapeutic studies in which the outcome is dependent on more than one factor. Although the technique generally is limited to data that can be expressed with a linear function, it benefits from a well-developed mathematical framework that yields unique solutions and exact confidence intervals for regression coefficients. Building on Part I of this series, this article acquaints the reader with some of the important concepts in multiple regression analysis. These include multicollinearity, interaction effects, and an expansion of the discussion of inference testing, leverage, and variable transformations to multivariate models. Examples from the first article in this series are expanded on using a primarily graphic, rather than mathematical, approach. The importance of the relationships among the predictor variables and the dependence of the multivariate model coefficients on the choice of these variables are stressed. Finally, concepts in regression model building are discussed.
The Impact of Prior Programming Knowledge on Lecture Attendance and Final Exam
ERIC Educational Resources Information Center
Veerasamy, Ashok Kumar; D'Souza, Daryl; Lindén, Rolf; Laakso, Mikko-Jussi
2018-01-01
In this article, we report the results of the impact of prior programming knowledge (PPK) on lecture attendance (LA) and on subsequent final programming exam performance in a university level introductory programming course. This study used Spearman's rank correlation coefficient, multiple regression, Kruskal-Wallis, and Bonferroni correction…
Dipnall, Joanna F.
2016-01-01
Background Atheoretical large-scale data mining techniques using machine learning algorithms have promise in the analysis of large epidemiological datasets. This study illustrates the use of a hybrid methodology for variable selection that took account of missing data and complex survey design to identify key biomarkers associated with depression from a large epidemiological study. Methods The study used a three-step methodology amalgamating multiple imputation, a machine learning boosted regression algorithm and logistic regression, to identify key biomarkers associated with depression in the National Health and Nutrition Examination Study (2009–2010). Depression was measured using the Patient Health Questionnaire-9 and 67 biomarkers were analysed. Covariates in this study included gender, age, race, smoking, food security, Poverty Income Ratio, Body Mass Index, physical activity, alcohol use, medical conditions and medications. The final imputed weighted multiple logistic regression model included possible confounders and moderators. Results After the creation of 20 imputation data sets from multiple chained regression sequences, machine learning boosted regression initially identified 21 biomarkers associated with depression. Using traditional logistic regression methods, including controlling for possible confounders and moderators, a final set of three biomarkers were selected. The final three biomarkers from the novel hybrid variable selection methodology were red cell distribution width (OR 1.15; 95% CI 1.01, 1.30), serum glucose (OR 1.01; 95% CI 1.00, 1.01) and total bilirubin (OR 0.12; 95% CI 0.05, 0.28). Significant interactions were found between total bilirubin with Mexican American/Hispanic group (p = 0.016), and current smokers (p<0.001). Conclusion The systematic use of a hybrid methodology for variable selection, fusing data mining techniques using a machine learning algorithm with traditional statistical modelling, accounted for missing data and complex survey sampling methodology and was demonstrated to be a useful tool for detecting three biomarkers associated with depression for future hypothesis generation: red cell distribution width, serum glucose and total bilirubin. PMID:26848571
Dipnall, Joanna F; Pasco, Julie A; Berk, Michael; Williams, Lana J; Dodd, Seetal; Jacka, Felice N; Meyer, Denny
2016-01-01
Atheoretical large-scale data mining techniques using machine learning algorithms have promise in the analysis of large epidemiological datasets. This study illustrates the use of a hybrid methodology for variable selection that took account of missing data and complex survey design to identify key biomarkers associated with depression from a large epidemiological study. The study used a three-step methodology amalgamating multiple imputation, a machine learning boosted regression algorithm and logistic regression, to identify key biomarkers associated with depression in the National Health and Nutrition Examination Study (2009-2010). Depression was measured using the Patient Health Questionnaire-9 and 67 biomarkers were analysed. Covariates in this study included gender, age, race, smoking, food security, Poverty Income Ratio, Body Mass Index, physical activity, alcohol use, medical conditions and medications. The final imputed weighted multiple logistic regression model included possible confounders and moderators. After the creation of 20 imputation data sets from multiple chained regression sequences, machine learning boosted regression initially identified 21 biomarkers associated with depression. Using traditional logistic regression methods, including controlling for possible confounders and moderators, a final set of three biomarkers were selected. The final three biomarkers from the novel hybrid variable selection methodology were red cell distribution width (OR 1.15; 95% CI 1.01, 1.30), serum glucose (OR 1.01; 95% CI 1.00, 1.01) and total bilirubin (OR 0.12; 95% CI 0.05, 0.28). Significant interactions were found between total bilirubin with Mexican American/Hispanic group (p = 0.016), and current smokers (p<0.001). The systematic use of a hybrid methodology for variable selection, fusing data mining techniques using a machine learning algorithm with traditional statistical modelling, accounted for missing data and complex survey sampling methodology and was demonstrated to be a useful tool for detecting three biomarkers associated with depression for future hypothesis generation: red cell distribution width, serum glucose and total bilirubin.
Multiple linear regression analysis
NASA Technical Reports Server (NTRS)
Edwards, T. R.
1980-01-01
Program rapidly selects best-suited set of coefficients. User supplies only vectors of independent and dependent data and specifies confidence level required. Program uses stepwise statistical procedure for relating minimal set of variables to set of observations; final regression contains only most statistically significant coefficients. Program is written in FORTRAN IV for batch execution and has been implemented on NOVA 1200.
Primary Factors Related to Multiple Placements for Children in Out-of-Home Care
ERIC Educational Resources Information Center
Eggertsen, Lars
2008-01-01
Using an ecological framework, this study identified which factors related to out-of-home placements significantly influenced multiple placements for children in Utah during 2000, 2001, and 2002. Multinomial logistic regression statistical procedures and a geographical information system (GIS) were used to analyze the data. The final model…
Nakamura, Ryo; Nakano, Kumiko; Tamura, Hiroyasu; Mizunuma, Masaki; Fushiki, Tohru; Hirata, Dai
2017-08-01
Many factors contribute to palatability. In order to evaluate the palatability of Japanese alcohol sake paired with certain dishes by integrating multiple factors, here we applied an evaluation method previously reported for palatability of cheese by multiple regression analysis based on 3 subdomain factors (rewarding, cultural, and informational). We asked 94 Japanese participants/subjects to evaluate the palatability of sake (1st evaluation/E1 for the first cup, 2nd/E2 and 3rd/E3 for the palatability with aftertaste/afterglow of certain dishes) and to respond to a questionnaire related to 3 subdomains. In E1, 3 factors were extracted by a factor analysis, and the subsequent multiple regression analyses indicated that the palatability of sake was interpreted by mainly the rewarding. Further, the results of attribution-dissections in E1 indicated that 2 factors (rewarding and informational) contributed to the palatability. Finally, our results indicated that the palatability of sake was influenced by the dish eaten just before drinking.
An empirical study using permutation-based resampling in meta-regression
2012-01-01
Background In meta-regression, as the number of trials in the analyses decreases, the risk of false positives or false negatives increases. This is partly due to the assumption of normality that may not hold in small samples. Creation of a distribution from the observed trials using permutation methods to calculate P values may allow for less spurious findings. Permutation has not been empirically tested in meta-regression. The objective of this study was to perform an empirical investigation to explore the differences in results for meta-analyses on a small number of trials using standard large sample approaches verses permutation-based methods for meta-regression. Methods We isolated a sample of randomized controlled clinical trials (RCTs) for interventions that have a small number of trials (herbal medicine trials). Trials were then grouped by herbal species and condition and assessed for methodological quality using the Jadad scale, and data were extracted for each outcome. Finally, we performed meta-analyses on the primary outcome of each group of trials and meta-regression for methodological quality subgroups within each meta-analysis. We used large sample methods and permutation methods in our meta-regression modeling. We then compared final models and final P values between methods. Results We collected 110 trials across 5 intervention/outcome pairings and 5 to 10 trials per covariate. When applying large sample methods and permutation-based methods in our backwards stepwise regression the covariates in the final models were identical in all cases. The P values for the covariates in the final model were larger in 78% (7/9) of the cases for permutation and identical for 22% (2/9) of the cases. Conclusions We present empirical evidence that permutation-based resampling may not change final models when using backwards stepwise regression, but may increase P values in meta-regression of multiple covariates for relatively small amount of trials. PMID:22587815
Schistosomiasis Breeding Environment Situation Analysis in Dongting Lake Area
NASA Astrophysics Data System (ADS)
Li, Chuanrong; Jia, Yuanyuan; Ma, Lingling; Liu, Zhaoyan; Qian, Yonggang
2013-01-01
Monitoring environmental characteristics, such as vegetation, soil moisture et al., of Oncomelania hupensis (O. hupensis)’ spatial/temporal distribution is of vital importance to the schistosomiasis prevention and control. In this study, the relationship between environmental factors derived from remotely sensed data and the density of O. hupensis was analyzed by a multiple linear regression model. Secondly, spatial analysis of the regression residual was investigated by the semi-variogram method. Thirdly, spatial analysis of the regression residual and the multiple linear regression model were both employed to estimate the spatial variation of O. hupensis density. Finally, the approach was used to monitor and predict the spatial and temporal variations of oncomelania of Dongting Lake region, China. And the areas of potential O. hupensis habitats were predicted and the influence of Three Gorges Dam (TGB)project on the density of O. hupensis was analyzed.
MULTIPLE LINEAR REGRESSION FOR LAKE ICE AND LAKE TEMPERATURE CHARACTERISTICS. (R824801)
The perspectives, information and conclusions conveyed in research project abstracts, progress reports, final reports, journal abstracts and journal publications convey the viewpoints of the principal investigator and may not represent the views and policies of ORD and EPA. Concl...
Peng, Ying; Li, Su-Ning; Pei, Xuexue; Hao, Kun
2018-03-01
Amultivariate regression statisticstrategy was developed to clarify multi-components content-effect correlation ofpanaxginseng saponins extract and predict the pharmacological effect by components content. In example 1, firstly, we compared pharmacological effects between panax ginseng saponins extract and individual saponin combinations. Secondly, we examined the anti-platelet aggregation effect in seven different saponin combinations of ginsenoside Rb1, Rg1, Rh, Rd, Ra3 and notoginsenoside R1. Finally, the correlation between anti-platelet aggregation and the content of multiple components was analyzed by a partial least squares algorithm. In example 2, firstly, 18 common peaks were identified in ten different batches of panax ginseng saponins extracts from different origins. Then, we investigated the anti-myocardial ischemia reperfusion injury effects of the ten different panax ginseng saponins extracts. Finally, the correlation between the fingerprints and the cardioprotective effects was analyzed by a partial least squares algorithm. Both in example 1 and 2, the relationship between the components content and pharmacological effect was modeled well by the partial least squares regression equations. Importantly, the predicted effect curve was close to the observed data of dot marked on the partial least squares regression model. This study has given evidences that themulti-component content is a promising information for predicting the pharmacological effects of traditional Chinese medicine.
Commitment Predictors: Long-Distance versus Geographically Close Relationships
ERIC Educational Resources Information Center
Pistole, M. Carole; Roberts, Amber; Mosko, Jonathan E.
2010-01-01
In this web-based study, the authors examined long-distance relationships (LDRs) and geographically close relationships (GCRs). Two hierarchical multiple regressions (N = 138) indicated that attachment predicted LDR and GCR commitment in Step 1. Final equations indicated that high satisfaction and investments predicted LDR commitment, whereas low…
Family and school environmental predictors of sleep bruxism in children.
Rossi, Debora; Manfredini, Daniele
2013-01-01
To identify potential predictors of self-reported sleep bruxism (SB) within children's family and school environments. A total of 65 primary school children (55.4% males, mean age 9.3 ± 1.9 years) were administered a 10-item questionnaire investigating the prevalence of self-reported SB as well as nine family and school-related potential bruxism predictors. Regression analyses were performed to assess the correlation between the potential predictors and SB. A positive answer to the self-reported SB item was endorsed by 18.8% of subjects, with no sex differences. Multiple variable regression analysis identified a final model showing that having divorced parents and not falling asleep easily were the only two weak predictors of self-reported SB. The percentage of explained variance for SB by the final multiple regression model was 13.3% (Nagelkerke's R² = 0.133). While having a high specificity and a good negative predictive value, the model showed unacceptable sensitivity and positive predictive values. The resulting accuracy to predict the presence of self-reported SB was 73.8%. The present investigation suggested that, among family and school-related matters, having divorced parents and not falling asleep easily were two predictors, even if weak, of a child's self-report of SB.
Akkus, Zeki; Camdeviren, Handan; Celik, Fatma; Gur, Ali; Nas, Kemal
2005-09-01
To determine the risk factors of osteoporosis using a multiple binary logistic regression method and to assess the risk variables for osteoporosis, which is a major and growing health problem in many countries. We presented a case-control study, consisting of 126 postmenopausal healthy women as control group and 225 postmenopausal osteoporotic women as the case group. The study was carried out in the Department of Physical Medicine and Rehabilitation, Dicle University, Diyarbakir, Turkey between 1999-2002. The data from the 351 participants were collected using a standard questionnaire that contains 43 variables. A multiple logistic regression model was then used to evaluate the data and to find the best regression model. We classified 80.1% (281/351) of the participants using the regression model. Furthermore, the specificity value of the model was 67% (84/126) of the control group while the sensitivity value was 88% (197/225) of the case group. We found the distribution of residual values standardized for final model to be exponential using the Kolmogorow-Smirnow test (p=0.193). The receiver operating characteristic curve was found successful to predict patients with risk for osteoporosis. This study suggests that low levels of dietary calcium intake, physical activity, education, and longer duration of menopause are independent predictors of the risk of low bone density in our population. Adequate dietary calcium intake in combination with maintaining a daily physical activity, increasing educational level, decreasing birth rate, and duration of breast-feeding may contribute to healthy bones and play a role in practical prevention of osteoporosis in Southeast Anatolia. In addition, the findings of the present study indicate that the use of multivariate statistical method as a multiple logistic regression in osteoporosis, which maybe influenced by many variables, is better than univariate statistical evaluation.
NASA Astrophysics Data System (ADS)
Shrivastava, Prashant Kumar; Pandey, Arun Kumar
2018-06-01
Inconel-718 has found high demand in different industries due to their superior mechanical properties. The traditional cutting methods are facing difficulties for cutting these alloys due to their low thermal potential, lower elasticity and high chemical compatibility at inflated temperature. The challenges of machining and/or finishing of unusual shapes and/or sizes in these materials have also faced by traditional machining. Laser beam cutting may be applied for the miniaturization and ultra-precision cutting and/or finishing by appropriate control of different process parameter. This paper present multi-objective optimization the kerf deviation, kerf width and kerf taper in the laser cutting of Incone-718 sheet. The second order regression models have been developed for different quality characteristics by using the experimental data obtained through experimentation. The regression models have been used as objective function for multi-objective optimization based on the hybrid approach of multiple regression analysis and genetic algorithm. The comparison of optimization results to experimental results shows an improvement of 88%, 10.63% and 42.15% in kerf deviation, kerf width and kerf taper, respectively. Finally, the effects of different process parameters on quality characteristics have also been discussed.
ERIC Educational Resources Information Center
Deignan, Gerard M.; And Others
This report contains a comparative analysis of the differential effectiveness of computer-assisted instruction (CAI), programmed instructional text (PIT), and lecture methods of instruction in three medical courses--Medical Laboratory, Radiology, and Dental. The summative evaluation includes (1) multiple regression analyses conducted to predict…
Models for predicting the mass of lime fruits by some engineering properties.
Miraei Ashtiani, Seyed-Hassan; Baradaran Motie, Jalal; Emadi, Bagher; Aghkhani, Mohammad-Hosein
2014-11-01
Grading fruits based on mass is important in packaging and reduces the waste, also increases the marketing value of agricultural produce. The aim of this study was mass modeling of two major cultivars of Iranian limes based on engineering attributes. Models were classified into three: 1-Single and multiple variable regressions of lime mass and dimensional characteristics. 2-Single and multiple variable regressions of lime mass and projected areas. 3-Single regression of lime mass based on its actual volume and calculated volume assumed as ellipsoid and prolate spheroid shapes. All properties considered in the current study were found to be statistically significant (ρ < 0.01). The results indicated that mass modeling of lime based on minor diameter and first projected area are the most appropriate models in the first and the second classifications, respectively. In third classification, the best model was obtained on the basis of the prolate spheroid volume. It was finally concluded that the suitable grading system of lime mass is based on prolate spheroid volume.
The Effect of Attending Tutoring on Course Grades in Calculus I
ERIC Educational Resources Information Center
Rickard, Brian; Mills, Melissa
2018-01-01
Tutoring centres are common in universities in the United States, but there are few published studies that statistically examine the effects of tutoring on student success. This study utilizes multiple regression analysis to model the effect of tutoring attendance on final course grades in Calculus I. Our model predicted that every three visits to…
Ohlmacher, G.C.; Davis, J.C.
2003-01-01
Landslides in the hilly terrain along the Kansas and Missouri rivers in northeastern Kansas have caused millions of dollars in property damage during the last decade. To address this problem, a statistical method called multiple logistic regression has been used to create a landslide-hazard map for Atchison, Kansas, and surrounding areas. Data included digitized geology, slopes, and landslides, manipulated using ArcView GIS. Logistic regression relates predictor variables to the occurrence or nonoccurrence of landslides within geographic cells and uses the relationship to produce a map showing the probability of future landslides, given local slopes and geologic units. Results indicated that slope is the most important variable for estimating landslide hazard in the study area. Geologic units consisting mostly of shale, siltstone, and sandstone were most susceptible to landslides. Soil type and aspect ratio were considered but excluded from the final analysis because these variables did not significantly add to the predictive power of the logistic regression. Soil types were highly correlated with the geologic units, and no significant relationships existed between landslides and slope aspect. ?? 2003 Elsevier Science B.V. All rights reserved.
Regression analysis for LED color detection of visual-MIMO system
NASA Astrophysics Data System (ADS)
Banik, Partha Pratim; Saha, Rappy; Kim, Ki-Doo
2018-04-01
Color detection from a light emitting diode (LED) array using a smartphone camera is very difficult in a visual multiple-input multiple-output (visual-MIMO) system. In this paper, we propose a method to determine the LED color using a smartphone camera by applying regression analysis. We employ a multivariate regression model to identify the LED color. After taking a picture of an LED array, we select the LED array region, and detect the LED using an image processing algorithm. We then apply the k-means clustering algorithm to determine the number of potential colors for feature extraction of each LED. Finally, we apply the multivariate regression model to predict the color of the transmitted LEDs. In this paper, we show our results for three types of environmental light condition: room environmental light, low environmental light (560 lux), and strong environmental light (2450 lux). We compare the results of our proposed algorithm from the analysis of training and test R-Square (%) values, percentage of closeness of transmitted and predicted colors, and we also mention about the number of distorted test data points from the analysis of distortion bar graph in CIE1931 color space.
The effect of attending tutoring on course grades in Calculus I
NASA Astrophysics Data System (ADS)
Rickard, Brian; Mills, Melissa
2018-04-01
Tutoring centres are common in universities in the United States, but there are few published studies that statistically examine the effects of tutoring on student success. This study utilizes multiple regression analysis to model the effect of tutoring attendance on final course grades in Calculus I. Our model predicted that every three visits to the tutoring centre is correlated with an increase of a students' course grade by one per cent, after controlling for prior academic ability. We also found that for lower-achieving students, attending tutoring had a greater impact on final grades.
NASA Astrophysics Data System (ADS)
Tang, Jie; Liu, Rong; Zhang, Yue-Li; Liu, Mou-Ze; Hu, Yong-Fang; Shao, Ming-Jie; Zhu, Li-Jun; Xin, Hua-Wen; Feng, Gui-Wen; Shang, Wen-Jun; Meng, Xiang-Guang; Zhang, Li-Rong; Ming, Ying-Zi; Zhang, Wei
2017-02-01
Tacrolimus has a narrow therapeutic window and considerable variability in clinical use. Our goal was to compare the performance of multiple linear regression (MLR) and eight machine learning techniques in pharmacogenetic algorithm-based prediction of tacrolimus stable dose (TSD) in a large Chinese cohort. A total of 1,045 renal transplant patients were recruited, 80% of which were randomly selected as the “derivation cohort” to develop dose-prediction algorithm, while the remaining 20% constituted the “validation cohort” to test the final selected algorithm. MLR, artificial neural network (ANN), regression tree (RT), multivariate adaptive regression splines (MARS), boosted regression tree (BRT), support vector regression (SVR), random forest regression (RFR), lasso regression (LAR) and Bayesian additive regression trees (BART) were applied and their performances were compared in this work. Among all the machine learning models, RT performed best in both derivation [0.71 (0.67-0.76)] and validation cohorts [0.73 (0.63-0.82)]. In addition, the ideal rate of RT was 4% higher than that of MLR. To our knowledge, this is the first study to use machine learning models to predict TSD, which will further facilitate personalized medicine in tacrolimus administration in the future.
Occlusal factors are not related to self-reported bruxism.
Manfredini, Daniele; Visscher, Corine M; Guarda-Nardini, Luca; Lobbezoo, Frank
2012-01-01
To estimate the contribution of various occlusal features of the natural dentition that may identify self-reported bruxers compared to nonbruxers. Two age- and sex-matched groups of self-reported bruxers (n = 67) and self-reported nonbruxers (n = 75) took part in the study. For each patient, the following occlusal features were clinically assessed: retruded contact position (RCP) to intercuspal contact position (ICP) slide length (< 2 mm was considered normal), vertical overlap (< 0 mm was considered an anterior open bite; > 4 mm, a deep bite), horizontal overlap (> 4 mm was considered a large horizontal overlap), incisor dental midline discrepancy (< 2 mm was considered normal), and the presence of a unilateral posterior crossbite, mediotrusive interferences, and laterotrusive interferences. A multiple logistic regression model was used to identify the significant associations between the assessed occlusal features (independent variables) and self-reported bruxism (dependent variable). Accuracy values to predict self-reported bruxism were unacceptable for all occlusal variables. The only variable remaining in the final regression model was laterotrusive interferences (P = .030). The percentage of explained variance for bruxism by the final multiple regression model was 4.6%. This model including only one occlusal factor showed low positive (58.1%) and negative predictive values (59.7%), thus showing a poor accuracy to predict the presence of self-reported bruxism (59.2%). This investigation suggested that the contribution of occlusion to the differentiation between bruxers and nonbruxers is negligible. This finding supports theories that advocate a much diminished role for peripheral anatomical-structural factors in the pathogenesis of bruxism.
Gordon, Evan M.; Stollstorff, Melanie; Vaidya, Chandan J.
2012-01-01
Many researchers have noted that the functional architecture of the human brain is relatively invariant during task performance and the resting state. Indeed, intrinsic connectivity networks (ICNs) revealed by resting-state functional connectivity analyses are spatially similar to regions activated during cognitive tasks. This suggests that patterns of task-related activation in individual subjects may result from the engagement of one or more of these ICNs; however, this has not been tested. We used a novel analysis, spatial multiple regression, to test whether the patterns of activation during an N-back working memory task could be well described by a linear combination of ICNs delineated using Independent Components Analysis at rest. We found that across subjects, the cingulo-opercular Set Maintenance ICN, as well as right and left Frontoparietal Control ICNs, were reliably activated during working memory, while Default Mode and Visual ICNs were reliably deactivated. Further, involvement of Set Maintenance, Frontoparietal Control, and Dorsal Attention ICNs was sensitive to varying working memory load. Finally, the degree of left Frontoparietal Control network activation predicted response speed, while activation in both left Frontoparietal Control and Dorsal Attention networks predicted task accuracy. These results suggest that a close relationship between resting-state networks and task-evoked activation is functionally relevant for behavior, and that spatial multiple regression analysis is a suitable method for revealing that relationship. PMID:21761505
Huang, Guangzao; Yuan, Mingshun; Chen, Moliang; Li, Lei; You, Wenjie; Li, Hanjie; Cai, James J; Ji, Guoli
2017-10-07
The application of machine learning in cancer diagnostics has shown great promise and is of importance in clinic settings. Here we consider applying machine learning methods to transcriptomic data derived from tumor-educated platelets (TEPs) from individuals with different types of cancer. We aim to define a reliability measure for diagnostic purposes to increase the potential for facilitating personalized treatments. To this end, we present a novel classification method called MFRB (for Multiple Fitting Regression and Bayes decision), which integrates the process of multiple fitting regression (MFR) with Bayes decision theory. MFR is first used to map multidimensional features of the transcriptomic data into a one-dimensional feature. The probability density function of each class in the mapped space is then adjusted using the Gaussian probability density function. Finally, the Bayes decision theory is used to build a probabilistic classifier with the estimated probability density functions. The output of MFRB can be used to determine which class a sample belongs to, as well as to assign a reliability measure for a given class. The classical support vector machine (SVM) and probabilistic SVM (PSVM) are used to evaluate the performance of the proposed method with simulated and real TEP datasets. Our results indicate that the proposed MFRB method achieves the best performance compared to SVM and PSVM, mainly due to its strong generalization ability for limited, imbalanced, and noisy data.
NASA Astrophysics Data System (ADS)
Zhu, Ting-Lei; Zhao, Chang-Yin; Zhang, Ming-Jiang
2017-04-01
This paper aims to obtain an analytic approximation to the evolution of circular orbits governed by the Earth's J2 and the luni-solar gravitational perturbations. Assuming that the lunar orbital plane coincides with the ecliptic plane, Allan and Cook (Proc. R. Soc. A, Math. Phys. Eng. Sci. 280(1380):97, 1964) derived an analytic solution to the orbital plane evolution of circular orbits. Using their result as an intermediate solution, we establish an approximate analytic model with lunar orbital inclination and its node regression be taken into account. Finally, an approximate analytic expression is derived, which is accurate compared to the numerical results except for the resonant cases when the period of the reference orbit approximately equals the integer multiples (especially 1 or 2 times) of lunar node regression period.
Sloas, Stacey B; Keith, Becky; Whitehead, Malcolm T
2013-01-01
This study investigated a pretest strategy that identified physical therapist assistant (PTA) students who were at risk of failure on the National Physical Therapy Examination (NPTE). Program assessment data from five cohorts of PTA students (2005-2009) were used to develop a stepwise multiple regression formula that predicted first-time NPTE licensure scores. Data used included the Nelson-Denny Reading Test, grades from eight core courses, grade point average upon admission to the program, and scores from three mock NPTE exams given during the program. Pearson correlation coefficients were calculated between each of the 15 variables and NPTE scores. Stepwise multiple regression analysis was performed using data collected at the ends of the first, second, and third (final) semesters of the program. Data from the class of 2010 were then used to validate the formula. The end-of-program formula accounted for the greatest variance (57%) in predicted scores. Those students scoring below a predicted scaled score of 620 were identified to be at risk of failure of the licensure exam. These students were counseled, and a remedial plan was developed based on regression predictions prior to them sitting for the licensure exam.
A Hot-Deck Multiple Imputation Procedure for Gaps in Longitudinal Recurrent Event Histories
Wang, Chia-Ning; Little, Roderick; Nan, Bin; Harlow, Siobán D.
2012-01-01
Summary We propose a regression-based hot deck multiple imputation method for gaps of missing data in longitudinal studies, where subjects experience a recurrent event process and a terminal event. Examples are repeated asthma episodes and death, or menstrual periods and the menopause, as in our motivating application. Research interest concerns the onset time of a marker event, defined by the recurrent-event process, or the duration from this marker event to the final event. Gaps in the recorded event history make it difficult to determine the onset time of the marker event, and hence, the duration from onset to the final event. Simple approaches such as jumping gap times or dropping cases with gaps have obvious limitations. We propose a procedure for imputing information in the gaps by substituting information in the gap from a matched individual with a completely recorded history in the corresponding interval. Predictive Mean Matching is used to incorporate information on longitudinal characteristics of the repeated process and the final event time. Multiple imputation is used to propagate imputation uncertainty. The procedure is applied to an important data set for assessing the timing and duration of the menopausal transition. The performance of the proposed method is assessed by a simulation study. PMID:21361886
Shen, Minxue; Tan, Hongzhuan; Zhou, Shujin; Retnakaran, Ravi; Smith, Graeme N.; Davidge, Sandra T.; Trasler, Jacquetta; Walker, Mark C.; Wen, Shi Wu
2016-01-01
Background It has been reported that higher folate intake from food and supplementation is associated with decreased blood pressure (BP). The association between serum folate concentration and BP has been examined in few studies. We aim to examine the association between serum folate and BP levels in a cohort of young Chinese women. Methods We used the baseline data from a pre-conception cohort of women of childbearing age in Liuyang, China, for this study. Demographic data were collected by structured interview. Serum folate concentration was measured by immunoassay, and homocysteine, blood glucose, triglyceride and total cholesterol were measured through standardized clinical procedures. Multiple linear regression and principal component regression model were applied in the analysis. Results A total of 1,532 healthy normotensive non-pregnant women were included in the final analysis. The mean concentration of serum folate was 7.5 ± 5.4 nmol/L and 55% of the women presented with folate deficiency (< 6.8 nmol/L). Multiple linear regression and principal component regression showed that serum folate levels were inversely associated with systolic and diastolic BP, after adjusting for demographic, anthropometric, and biochemical factors. Conclusions Serum folate is inversely associated with BP in non-pregnant women of childbearing age with high prevalence of folate deficiency. PMID:27182603
A Technique of Fuzzy C-Mean in Multiple Linear Regression Model toward Paddy Yield
NASA Astrophysics Data System (ADS)
Syazwan Wahab, Nur; Saifullah Rusiman, Mohd; Mohamad, Mahathir; Amira Azmi, Nur; Che Him, Norziha; Ghazali Kamardan, M.; Ali, Maselan
2018-04-01
In this paper, we propose a hybrid model which is a combination of multiple linear regression model and fuzzy c-means method. This research involved a relationship between 20 variates of the top soil that are analyzed prior to planting of paddy yields at standard fertilizer rates. Data used were from the multi-location trials for rice carried out by MARDI at major paddy granary in Peninsular Malaysia during the period from 2009 to 2012. Missing observations were estimated using mean estimation techniques. The data were analyzed using multiple linear regression model and a combination of multiple linear regression model and fuzzy c-means method. Analysis of normality and multicollinearity indicate that the data is normally scattered without multicollinearity among independent variables. Analysis of fuzzy c-means cluster the yield of paddy into two clusters before the multiple linear regression model can be used. The comparison between two method indicate that the hybrid of multiple linear regression model and fuzzy c-means method outperform the multiple linear regression model with lower value of mean square error.
Short-term electric power demand forecasting based on economic-electricity transmission model
NASA Astrophysics Data System (ADS)
Li, Wenfeng; Bai, Hongkun; Liu, Wei; Liu, Yongmin; Wang, Yubin Mao; Wang, Jiangbo; He, Dandan
2018-04-01
Short-term electricity demand forecasting is the basic work to ensure safe operation of the power system. In this paper, a practical economic electricity transmission model (EETM) is built. With the intelligent adaptive modeling capabilities of Prognoz Platform 7.2, the econometric model consists of three industrial added value and income levels is firstly built, the electricity demand transmission model is also built. By multiple regression, moving averages and seasonal decomposition, the problem of multiple correlations between variables is effectively overcome in EETM. The validity of EETM is proved by comparison with the actual value of Henan Province. Finally, EETM model is used to forecast the electricity consumption of the 1-4 quarter of 2018.
Sanford, Ward E.; Nelms, David L.; Pope, Jason P.; Selnick, David L.
2012-01-01
This study by the U.S. Geological Survey, prepared in cooperation with the Virginia Department of Environmental Quality, quantifies the components of the hydrologic cycle across the Commonwealth of Virginia. Long-term, mean fluxes were calculated for precipitation, surface runoff, infiltration, total evapotranspiration (ET), riparian ET, recharge, base flow (or groundwater discharge) and net total outflow. Fluxes of these components were first estimated on a number of real-time-gaged watersheds across Virginia. Specific conductance was used to distinguish and separate surface runoff from base flow. Specific-conductance data were collected every 15 minutes at 75 real-time gages for approximately 18 months between March 2007 and August 2008. Precipitation was estimated for 1971–2000 using PRISM climate data. Precipitation and temperature from the PRISM data were used to develop a regression-based relation to estimate total ET. The proportion of watershed precipitation that becomes surface runoff was related to physiographic province and rock type in a runoff regression equation. Component flux estimates from the watersheds were transferred to flux estimates for counties and independent cities using the ET and runoff regression equations. Only 48 of the 75 watersheds yielded sufficient data, and data from these 48 were used in the final runoff regression equation. The base-flow proportion for the 48 watersheds averaged 72 percent using specific conductance, a value that was substantially higher than the 61 percent average calculated using a graphical-separation technique (the USGS program PART). Final results for the study are presented as component flux estimates for all counties and independent cities in Virginia.
Relationship among several measurements of slipperiness obtained in a laboratory environment.
Chang, Wen-Ruey; Chang, Chien-Chi
2018-04-01
Multiple sensing mechanisms could be used in forming responses to avoid slips, but previous studies, correlating only two parameters, revealed a limited picture of this complex system. In this study, the participants walked as fast as possible without a slip under 15 conditions of different degrees of slipperiness. The relationships among various response parameters, including perceived slipperiness rating, utilized coefficient of friction (UCOF), slipmeter measurement and kinematic parameters, were evaluated. The results showed that the UCOF, perceived rating and heel angle had higher adjusted R 2 values as dependent variables in the multiple linear regressions with the remaining variables in the final pool as independent variables. Although each variable in the final data pool could reflect some measurement of slipperiness, these three variables are more inclusive than others in representing the other variables and were bigger predictors of other variables, so they could be better candidates for measurements of slipperiness. Copyright © 2017 Elsevier Ltd. All rights reserved.
2004-03-01
Breusch - Pagan test for constant variance of the residuals. Using Microsoft Excel® we calculate a p-value of 0.841237. This high p-value, which is above...our alpha of 0.05, indicates that our residuals indeed pass the Breusch - Pagan test for constant variance. In addition to the assumption tests , we...Wilk Test for Normality – Support (Reduced) Model (OLS) Finally, we perform a Breusch - Pagan test for constant variance of the residuals. Using
Estimation of PM2.5 and PM10 using ground-based AOD measurements during KORUS-AQ campaign
NASA Astrophysics Data System (ADS)
Koo, J. H.; Kim, J.; Kim, S.; Go, S.; Lee, S.; Lee, H.; Mok, J.; Hong, J.; Lee, J.; Eck, T. F.; Holben, B. N.
2017-12-01
During the KORUS-AQ campaign (2 May - 12 June, 2016), aerosol optical depth (AOD) was obtained at multiple channels using various ground-based instruments at Yonsei University, Seoul: AERONET sunphotometer, SKYNET skyradiometer, Brewer spectrophotometer, and multi-filter rotating shadowband radiometer (MFRSR). At the same location, planetary boundary layer (PBL) height and vertical profile of backscattering coefficients also can be obtained based on the celiometer measurements. Using celiometer products and various AODs, we try to estimate the amount of particular matter (PM2.5 and PM10) and validate with in-situ surface PM2.5 and PM10 measurements from AIRKOREA network. Direct comparison between PM2.5 and AOD reveals that the ultraviolet(UV) channel AOD has better correlations, due to the higher sensitivity of short wavelength to the fine-mode particle. In contrast, PM10 shows the highest correlation with the near-infrared(NIR) AOD. Next, we extract the boundary-layer portion of AOD using either PBL height or vertical profile of backscattering coefficients to compare with PM2.5 and PM10. Both results enhance the correlation, but consideration of weighting factor calculated from backscattering coefficients shows larger contribution to the correlation increase. Finally, we performed the multiple linear regression to estimate PM2.5 and PM10 using AODs. Consideration of meteorology (temperature, wind speed, and relative humidity) can enhance the correlation and also O3 and NO2 consideration highly contributes to the high correlation. This finding implies the importance to consider the ambient condition of secondary aerosol formation related to the PM2.5 variation. Multiple regression model finally finds the correlation 0.7-0.8, and diminishes the wavelength-dependent correlation patterns.
Parameter estimation in Cox models with missing failure indicators and the OPPERA study.
Brownstein, Naomi C; Cai, Jianwen; Slade, Gary D; Bair, Eric
2015-12-30
In a prospective cohort study, examining all participants for incidence of the condition of interest may be prohibitively expensive. For example, the "gold standard" for diagnosing temporomandibular disorder (TMD) is a physical examination by a trained clinician. In large studies, examining all participants in this manner is infeasible. Instead, it is common to use questionnaires to screen for incidence of TMD and perform the "gold standard" examination only on participants who screen positively. Unfortunately, some participants may leave the study before receiving the "gold standard" examination. Within the framework of survival analysis, this results in missing failure indicators. Motivated by the Orofacial Pain: Prospective Evaluation and Risk Assessment (OPPERA) study, a large cohort study of TMD, we propose a method for parameter estimation in survival models with missing failure indicators. We estimate the probability of being an incident case for those lacking a "gold standard" examination using logistic regression. These estimated probabilities are used to generate multiple imputations of case status for each missing examination that are combined with observed data in appropriate regression models. The variance introduced by the procedure is estimated using multiple imputation. The method can be used to estimate both regression coefficients in Cox proportional hazard models as well as incidence rates using Poisson regression. We simulate data with missing failure indicators and show that our method performs as well as or better than competing methods. Finally, we apply the proposed method to data from the OPPERA study. Copyright © 2015 John Wiley & Sons, Ltd.
Multiple Correlation versus Multiple Regression.
ERIC Educational Resources Information Center
Huberty, Carl J.
2003-01-01
Describes differences between multiple correlation analysis (MCA) and multiple regression analysis (MRA), showing how these approaches involve different research questions and study designs, different inferential approaches, different analysis strategies, and different reported information. (SLD)
ERIC Educational Resources Information Center
Jaccard, James; And Others
1990-01-01
Issues in the detection and interpretation of interaction effects between quantitative variables in multiple regression analysis are discussed. Recent discussions associated with problems of multicollinearity are reviewed in the context of the conditional nature of multiple regression with product terms. (TJH)
Frndak, Seth E; Smerbeck, Audrey M; Irwin, Lauren N; Drake, Allison S; Kordovski, Victoria M; Kunker, Katrina A; Khan, Anjum L; Benedict, Ralph H B
2016-10-01
We endeavored to clarify how distinct co-occurring symptoms relate to the presence of negative work events in employed multiple sclerosis (MS) patients. Latent profile analysis (LPA) was utilized to elucidate common disability patterns by isolating patient subpopulations. Samples of 272 employed MS patients and 209 healthy controls (HC) were administered neuroperformance tests of ambulation, hand dexterity, processing speed, and memory. Regression-based norms were created from the HC sample. LPA identified latent profiles using the regression-based z-scores. Finally, multinomial logistic regression tested for negative work event differences among the latent profiles. Four profiles were identified via LPA: a common profile (55%) characterized by slightly below average performance in all domains, a broadly low-performing profile (18%), a poor motor abilities profile with average cognition (17%), and a generally high-functioning profile (9%). Multinomial regression analysis revealed that the uniformly low-performing profile demonstrated a higher likelihood of reported negative work events. Employed MS patients with co-occurring motor, memory and processing speed impairments were most likely to report a negative work event, classifying them as uniquely at risk for job loss.
The ties that bind what is known to the recall of what is new.
Nelson, D L; Zhang, N
2000-12-01
Cued recall success varies with what people know and with what they do during an episode. This paper focuses on prior knowledge and disentangles the relative effects of 10 features of words and their relationships on cued recall. Results are reported for correlational and multiple regression analyses of data obtained from free association norms and from 29 experiments. The 10 features were only weakly correlated with each other in the norms and, with notable exceptions, in the experiments. The regression analysis indicated that forward cue-to-target strength explained the most variance, followed by backward target-to-cue strength. Target connectivity and set size explained the next most variance, along with mediated cue-to-target strength. Finally, frequency, concreteness, shared associate strength, and cue set size also contributed significantly to recall. Taken together, indices of prior word knowledge explain 49% of the recall variance. Theoretically driven equations that use free association to predict cued recall were also evaluated. Each equation was designed to condense multiple indices of word interconnectivity into a single predictor.
The weighted priors approach for combining expert opinions in logistic regression experiments
Quinlan, Kevin R.; Anderson-Cook, Christine M.; Myers, Kary L.
2017-04-24
When modeling the reliability of a system or component, it is not uncommon for more than one expert to provide very different prior estimates of the expected reliability as a function of an explanatory variable such as age or temperature. Our goal in this paper is to incorporate all information from the experts when choosing a design about which units to test. Bayesian design of experiments has been shown to be very successful for generalized linear models, including logistic regression models. We use this approach to develop methodology for the case where there are several potentially non-overlapping priors under consideration.more » While multiple priors have been used for analysis in the past, they have never been used in a design context. The Weighted Priors method performs well for a broad range of true underlying model parameter choices and is more robust when compared to other reasonable design choices. Finally, we illustrate the method through multiple scenarios and a motivating example. Additional figures for this article are available in the online supplementary information.« less
The weighted priors approach for combining expert opinions in logistic regression experiments
DOE Office of Scientific and Technical Information (OSTI.GOV)
Quinlan, Kevin R.; Anderson-Cook, Christine M.; Myers, Kary L.
When modeling the reliability of a system or component, it is not uncommon for more than one expert to provide very different prior estimates of the expected reliability as a function of an explanatory variable such as age or temperature. Our goal in this paper is to incorporate all information from the experts when choosing a design about which units to test. Bayesian design of experiments has been shown to be very successful for generalized linear models, including logistic regression models. We use this approach to develop methodology for the case where there are several potentially non-overlapping priors under consideration.more » While multiple priors have been used for analysis in the past, they have never been used in a design context. The Weighted Priors method performs well for a broad range of true underlying model parameter choices and is more robust when compared to other reasonable design choices. Finally, we illustrate the method through multiple scenarios and a motivating example. Additional figures for this article are available in the online supplementary information.« less
Regression Models and Fuzzy Logic Prediction of TBM Penetration Rate
NASA Astrophysics Data System (ADS)
Minh, Vu Trieu; Katushin, Dmitri; Antonov, Maksim; Veinthal, Renno
2017-03-01
This paper presents statistical analyses of rock engineering properties and the measured penetration rate of tunnel boring machine (TBM) based on the data of an actual project. The aim of this study is to analyze the influence of rock engineering properties including uniaxial compressive strength (UCS), Brazilian tensile strength (BTS), rock brittleness index (BI), the distance between planes of weakness (DPW), and the alpha angle (Alpha) between the tunnel axis and the planes of weakness on the TBM rate of penetration (ROP). Four
De Cola, Maria Cristina; D'Aleo, Giangaetano; Sessa, Edoardo; Marino, Silvia
2015-01-01
Objective. To investigate the influence of demographic and clinical variables, such as depression, fatigue, and quantitative MRI marker on cognitive performances in a sample of patients affected by multiple sclerosis (MS). Methods. 60 MS patients (52 relapsing remitting and 8 primary progressive) underwent neuropsychological assessments using Rao's Brief Repeatable Battery of Neuropsychological Tests (BRB-N), the Beck Depression Inventory-second edition (BDI-II), and the Fatigue Severity Scale (FSS). We performed magnetic resonance imaging to all subjects using a 3 T scanner and obtained tissue-specific volumes (normalized brain volume and cortical brain volume). We used Student's t-test to compare depressed and nondepressed MS patients. Finally, we performed a multivariate regression analysis in order to assess possible predictors of patients' cognitive outcome among demographic and clinical variables. Results. 27.12% of the sample (16/59) was cognitively impaired, especially in tasks requiring attention and information processing speed. From between group comparison, we find that depressed patients had worse performances on BRB-N score, greater disability and disease duration, and brain volume decrease. According to multiple regression analysis, the BDI-II score was a significant predictor for most of the neuropsychological tests. Conclusions. Our findings suggest that the presence of depressive symptoms is an important determinant of cognitive performance in MS patients. PMID:25861633
Meijer, Kim A; Muhlert, Nils; Cercignani, Mara; Sethi, Varun; Ron, Maria A; Thompson, Alan J; Miller, David H; Chard, Declan; Geurts, Jeroen Jg; Ciccarelli, Olga
2016-10-01
While our knowledge of white matter (WM) pathology underlying cognitive impairment in relapsing remitting multiple sclerosis (MS) is increasing, equivalent understanding in those with secondary progressive (SP) MS lags behind. The aim of this study is to examine whether the extent and severity of WM tract damage differ between cognitively impaired (CI) and cognitively preserved (CP) secondary progressive multiple sclerosis (SPMS) patients. Conventional magnetic resonance imaging (MRI) and diffusion MRI were acquired from 30 SPMS patients and 32 healthy controls (HC). Cognitive domains commonly affected in MS patients were assessed. Linear regression was used to predict cognition. Diffusion measures were compared between groups using tract-based spatial statistics (TBSS). A total of 12 patients were classified as CI, and processing speed was the most commonly affected domain. The final regression model including demographic variables and radial diffusivity explained the greatest variance of cognitive performance (R 2 = 0.48, p = 0.002). SPMS patients showed widespread loss of WM integrity throughout the WM skeleton when compared with HC. When compared with CP patients, CI patients showed more extensive and severe damage of several WM tracts, including the fornix, superior longitudinal fasciculus and forceps major. Loss of WM integrity assessed using TBSS helps to explain cognitive decline in SPMS patients. © The Author(s), 2016.
Beyond Multiple Regression: Using Commonality Analysis to Better Understand R[superscript 2] Results
ERIC Educational Resources Information Center
Warne, Russell T.
2011-01-01
Multiple regression is one of the most common statistical methods used in quantitative educational research. Despite the versatility and easy interpretability of multiple regression, it has some shortcomings in the detection of suppressor variables and for somewhat arbitrarily assigning values to the structure coefficients of correlated…
Krishan, Kewal; Kanchan, Tanuj; Sharma, Abhilasha
2012-05-01
Estimation of stature is an important parameter in identification of human remains in forensic examinations. The present study is aimed to compare the reliability and accuracy of stature estimation and to demonstrate the variability in estimated stature and actual stature using multiplication factor and regression analysis methods. The study is based on a sample of 246 subjects (123 males and 123 females) from North India aged between 17 and 20 years. Four anthropometric measurements; hand length, hand breadth, foot length and foot breadth taken on the left side in each subject were included in the study. Stature was measured using standard anthropometric techniques. Multiplication factors were calculated and linear regression models were derived for estimation of stature from hand and foot dimensions. Derived multiplication factors and regression formula were applied to the hand and foot measurements in the study sample. The estimated stature from the multiplication factors and regression analysis was compared with the actual stature to find the error in estimated stature. The results indicate that the range of error in estimation of stature from regression analysis method is less than that of multiplication factor method thus, confirming that the regression analysis method is better than multiplication factor analysis in stature estimation. Copyright © 2012 Elsevier Ltd and Faculty of Forensic and Legal Medicine. All rights reserved.
Suzuki, Hideaki; Tabata, Takahisa; Koizumi, Hiroki; Hohchi, Nobusuke; Takeuchi, Shoko; Kitamura, Takuro; Fujino, Yoshihisa; Ohbuchi, Toyoaki
2014-12-01
This study aimed to create a multiple regression model for predicting hearing outcomes of idiopathic sudden sensorineural hearing loss (ISSNHL). The participants were 205 consecutive patients (205 ears) with ISSNHL (hearing level ≥ 40 dB, interval between onset and treatment ≤ 30 days). They received systemic steroid administration combined with intratympanic steroid injection. Data were examined by simple and multiple regression analyses. Three hearing indices (percentage hearing improvement, hearing gain, and posttreatment hearing level [HLpost]) and 7 prognostic factors (age, days from onset to treatment, initial hearing level, initial hearing level at low frequencies, initial hearing level at high frequencies, presence of vertigo, and contralateral hearing level) were included in the multiple regression analysis as dependent and explanatory variables, respectively. In the simple regression analysis, the percentage hearing improvement, hearing gain, and HLpost showed significant correlation with 2, 5, and 6 of the 7 prognostic factors, respectively. The multiple correlation coefficients were 0.396, 0.503, and 0.714 for the percentage hearing improvement, hearing gain, and HLpost, respectively. Predicted values of HLpost calculated by the multiple regression equation were reliable with 70% probability with a 40-dB-width prediction interval. Prediction of HLpost by the multiple regression model may be useful to estimate the hearing prognosis of ISSNHL. © The Author(s) 2014.
ERIC Educational Resources Information Center
Shear, Benjamin R.; Zumbo, Bruno D.
2013-01-01
Type I error rates in multiple regression, and hence the chance for false positive research findings, can be drastically inflated when multiple regression models are used to analyze data that contain random measurement error. This article shows the potential for inflated Type I error rates in commonly encountered scenarios and provides new…
Using Robust Standard Errors to Combine Multiple Regression Estimates with Meta-Analysis
ERIC Educational Resources Information Center
Williams, Ryan T.
2012-01-01
Combining multiple regression estimates with meta-analysis has continued to be a difficult task. A variety of methods have been proposed and used to combine multiple regression slope estimates with meta-analysis, however, most of these methods have serious methodological and practical limitations. The purpose of this study was to explore the use…
John W. Edwards; Susan C. Loeb; David C. Guynn
1994-01-01
Multiple regression and use-availability analyses are two methods for examining habitat selection. Use-availability analysis is commonly used to evaluate macrohabitat selection whereas multiple regression analysis can be used to determine microhabitat selection. We compared these techniques using behavioral observations (n = 5534) and telemetry locations (n = 2089) of...
Effects of integration time on in-water radiometric profiles.
D'Alimonte, Davide; Zibordi, Giuseppe; Kajiyama, Tamito
2018-03-05
This work investigates the effects of integration time on in-water downward irradiance E d , upward irradiance E u and upwelling radiance L u profile data acquired with free-fall hyperspectral systems. Analyzed quantities are the subsurface value and the diffuse attenuation coefficient derived by applying linear and non-linear regression schemes. Case studies include oligotrophic waters (Case-1), as well as waters dominated by Colored Dissolved Organic Matter (CDOM) and Non-Algal Particles (NAP). Assuming a 24-bit digitization, measurements resulting from the accumulation of photons over integration times varying between 8 and 2048ms are evaluated at depths corresponding to: 1) the beginning of each integration interval (Fst); 2) the end of each integration interval (Lst); 3) the averages of Fst and Lst values (Avg); and finally 4) the values weighted accounting for the diffuse attenuation coefficient of water (Wgt). Statistical figures show that the effects of integration time can bias results well above 5% as a function of the depth definition. Results indicate the validity of the Wgt depth definition and the fair applicability of the Avg one. Instead, both the Fst and Lst depths should not be adopted since they may introduce pronounced biases in E u and L u regression products for highly absorbing waters. Finally, the study reconfirms the relevance of combining multiple radiometric casts into a single profile to increase precision of regression products.
Building Regression Models: The Importance of Graphics.
ERIC Educational Resources Information Center
Dunn, Richard
1989-01-01
Points out reasons for using graphical methods to teach simple and multiple regression analysis. Argues that a graphically oriented approach has considerable pedagogic advantages in the exposition of simple and multiple regression. Shows that graphical methods may play a central role in the process of building regression models. (Author/LS)
Testing Different Model Building Procedures Using Multiple Regression.
ERIC Educational Resources Information Center
Thayer, Jerome D.
The stepwise regression method of selecting predictors for computer assisted multiple regression analysis was compared with forward, backward, and best subsets regression, using 16 data sets. The results indicated the stepwise method was preferred because of its practical nature, when the models chosen by different selection methods were similar…
Machado-Carvalhais, Helenaura P; Ramos-Jorge, Maria L; Auad, Sheyla M; Martins, Laura H P M; Paiva, Saul M; Pordeus, Isabela A
2008-10-01
The aims of this cross-sectional study were to determine the prevalence of occupational accidents with exposure to biological material among undergraduate students of dentistry and to estimate potential risk factors associated with exposure to blood. Data were collected through a self-administered questionnaire (86.4 percent return rate), which was completed by a sample of 286 undergraduate dental students (mean age 22.4 +/-2.4 years). The students were enrolled in the clinical component of the curriculum, which corresponds to the final six semesters of study. Descriptive, bivariate, simple logistic regression and multiple logistic regression (Forward Stepwise Procedure) analyses were performed. The level of statistical significance was set at 5 percent. Percutaneous and mucous exposures to potentially infectious biological material were reported by 102 individuals (35.6 percent); 26.8 percent reported the occurrence of multiple episodes of exposure. The logistic regression analyses revealed that the incomplete use of individual protection equipment (OR=3.7; 95 percent CI 1.5-9.3), disciplines where surgical procedures are carried out (OR=16.3; 95 percent CI 7.1-37.2), and handling sharp instruments (OR=4.4; 95 percent CI 2.1-9.1), more specifically, hollow-bore needles (OR=6.8; 95 percent CI 2.1-19.0), were independently associated with exposure to blood. Policies of reviewing the procedures during clinical practice are recommended in order to reduce occupational exposure.
Decreasing Multicollinearity: A Method for Models with Multiplicative Functions.
ERIC Educational Resources Information Center
Smith, Kent W.; Sasaki, M. S.
1979-01-01
A method is proposed for overcoming the problem of multicollinearity in multiple regression equations where multiplicative independent terms are entered. The method is not a ridge regression solution. (JKS)
Villarrasa-Sapiña, Israel; Álvarez-Pitti, Julio; Cabeza-Ruiz, Ruth; Redón, Pau; Lurbe, Empar; García-Massó, Xavier
2018-02-01
Excess body weight during childhood causes reduced motor functionality and problems in postural control, a negative influence which has been reported in the literature. Nevertheless, no information regarding the effect of body composition on the postural control of overweight and obese children is available. The objective of this study was therefore to establish these relationships. A cross-sectional design was used to establish relationships between body composition and postural control variables obtained in bipedal eyes-open and eyes-closed conditions in twenty-two children. Centre of pressure signals were analysed in the temporal and frequency domains. Pearson correlations were applied to establish relationships between variables. Principal component analysis was applied to the body composition variables to avoid potential multicollinearity in the regression models. These principal components were used to perform a multiple linear regression analysis, from which regression models were obtained to predict postural control. Height and leg mass were the body composition variables that showed the highest correlation with postural control. Multiple regression models were also obtained and several of these models showed a higher correlation coefficient in predicting postural control than simple correlations. These models revealed that leg and trunk mass were good predictors of postural control. More equations were found in the eyes-open than eyes-closed condition. Body weight and height are negatively correlated with postural control. However, leg and trunk mass are better postural control predictors than arm or body mass. Finally, body composition variables are more useful in predicting postural control when the eyes are open. Copyright © 2017 Elsevier Ltd. All rights reserved.
Zhao, Ni; Chen, Jun; Carroll, Ian M.; Ringel-Kulka, Tamar; Epstein, Michael P.; Zhou, Hua; Zhou, Jin J.; Ringel, Yehuda; Li, Hongzhe; Wu, Michael C.
2015-01-01
High-throughput sequencing technology has enabled population-based studies of the role of the human microbiome in disease etiology and exposure response. Distance-based analysis is a popular strategy for evaluating the overall association between microbiome diversity and outcome, wherein the phylogenetic distance between individuals’ microbiome profiles is computed and tested for association via permutation. Despite their practical popularity, distance-based approaches suffer from important challenges, especially in selecting the best distance and extending the methods to alternative outcomes, such as survival outcomes. We propose the microbiome regression-based kernel association test (MiRKAT), which directly regresses the outcome on the microbiome profiles via the semi-parametric kernel machine regression framework. MiRKAT allows for easy covariate adjustment and extension to alternative outcomes while non-parametrically modeling the microbiome through a kernel that incorporates phylogenetic distance. It uses a variance-component score statistic to test for the association with analytical p value calculation. The model also allows simultaneous examination of multiple distances, alleviating the problem of choosing the best distance. Our simulations demonstrated that MiRKAT provides correctly controlled type I error and adequate power in detecting overall association. “Optimal” MiRKAT, which considers multiple candidate distances, is robust in that it suffers from little power loss in comparison to when the best distance is used and can achieve tremendous power gain in comparison to when a poor distance is chosen. Finally, we applied MiRKAT to real microbiome datasets to show that microbial communities are associated with smoking and with fecal protease levels after confounders are controlled for. PMID:25957468
Multiple-Instance Regression with Structured Data
NASA Technical Reports Server (NTRS)
Wagstaff, Kiri L.; Lane, Terran; Roper, Alex
2008-01-01
We present a multiple-instance regression algorithm that models internal bag structure to identify the items most relevant to the bag labels. Multiple-instance regression (MIR) operates on a set of bags with real-valued labels, each containing a set of unlabeled items, in which the relevance of each item to its bag label is unknown. The goal is to predict the labels of new bags from their contents. Unlike previous MIR methods, MI-ClusterRegress can operate on bags that are structured in that they contain items drawn from a number of distinct (but unknown) distributions. MI-ClusterRegress simultaneously learns a model of the bag's internal structure, the relevance of each item, and a regression model that accurately predicts labels for new bags. We evaluated this approach on the challenging MIR problem of crop yield prediction from remote sensing data. MI-ClusterRegress provided predictions that were more accurate than those obtained with non-multiple-instance approaches or MIR methods that do not model the bag structure.
Aspects of porosity prediction using multivariate linear regression
DOE Office of Scientific and Technical Information (OSTI.GOV)
Byrnes, A.P.; Wilson, M.D.
1991-03-01
Highly accurate multiple linear regression models have been developed for sandstones of diverse compositions. Porosity reduction or enhancement processes are controlled by the fundamental variables, Pressure (P), Temperature (T), Time (t), and Composition (X), where composition includes mineralogy, size, sorting, fluid composition, etc. The multiple linear regression equation, of which all linear porosity prediction models are subsets, takes the generalized form: Porosity = C{sub 0} + C{sub 1}(P) + C{sub 2}(T) + C{sub 3}(X) + C{sub 4}(t) + C{sub 5}(PT) + C{sub 6}(PX) + C{sub 7}(Pt) + C{sub 8}(TX) + C{sub 9}(Tt) + C{sub 10}(Xt) + C{sub 11}(PTX) + C{submore » 12}(PXt) + C{sub 13}(PTt) + C{sub 14}(TXt) + C{sub 15}(PTXt). The first four primary variables are often interactive, thus requiring terms involving two or more primary variables (the form shown implies interaction and not necessarily multiplication). The final terms used may also involve simple mathematic transforms such as log X, e{sup T}, X{sup 2}, or more complex transformations such as the Time-Temperature Index (TTI). The X term in the equation above represents a suite of compositional variable and, therefore, a fully expanded equation may include a series of terms incorporating these variables. Numerous published bivariate porosity prediction models involving P (or depth) or Tt (TTI) are effective to a degree, largely because of the high degree of colinearity between p and TTI. However, all such bivariate models ignore the unique contributions of P and Tt, as well as various X terms. These simpler models become poor predictors in regions where colinear relations change, were important variables have been ignored, or where the database does not include a sufficient range or weight distribution for the critical variables.« less
Cephalometric landmark detection in dental x-ray images using convolutional neural networks
NASA Astrophysics Data System (ADS)
Lee, Hansang; Park, Minseok; Kim, Junmo
2017-03-01
In dental X-ray images, an accurate detection of cephalometric landmarks plays an important role in clinical diagnosis, treatment and surgical decisions for dental problems. In this work, we propose an end-to-end deep learning system for cephalometric landmark detection in dental X-ray images, using convolutional neural networks (CNN). For detecting 19 cephalometric landmarks in dental X-ray images, we develop a detection system using CNN-based coordinate-wise regression systems. By viewing x- and y-coordinates of all landmarks as 38 independent variables, multiple CNN-based regression systems are constructed to predict the coordinate variables from input X-ray images. First, each coordinate variable is normalized by the length of either height or width of an image. For each normalized coordinate variable, a CNN-based regression system is trained on training images and corresponding coordinate variable, which is a variable to be regressed. We train 38 regression systems with the same CNN structure on coordinate variables, respectively. Finally, we compute 38 coordinate variables with these trained systems from unseen images and extract 19 landmarks by pairing the regressed coordinates. In experiments, the public database from the Grand Challenges in Dental X-ray Image Analysis in ISBI 2015 was used and the proposed system showed promising performance by successfully locating the cephalometric landmarks within considerable margins from the ground truths.
Health related quality of life in parents of six to eight year old children with Down syndrome.
Marchal, Jan Pieter; Maurice-Stam, Heleen; Hatzmann, Janneke; van Trotsenburg, A S Paul; Grootenhuis, Martha A
2013-11-01
Raising a child with Down syndrome (DS) has been found to be associated with lowered health related quality of life (HRQoL) in the domains cognitive functioning, social functioning, daily activities and vitality. We aimed to explore which socio-demographics, child functioning and psychosocial variables were related to these HRQoL domains in parents of children with DS. Parents of 98 children with DS completed the TNO-AZL adult quality of life questionnaire (TAAQOL) and a questionnaire assessing socio-demographic, child functioning and psychosocial predictors. Using multiple linear regression analyses for each category of predictors, we selected relevant predictors for the final models. The final multiple linear regression models revealed that cognitive functioning was best predicted by the sleep of the child (β=.29, p<.01) and by the parent having given up a hobby (β=-.29, p<.01), social functioning by the quality of the partner relation (β=.34, p<.001), daily activities by the parent having to care for an ill friend or family member (β=-.31, p<.01), and vitality by the parent having enough personal time (β=.32, p<.01). Overall, psychosocial variables rather than socio-demographics or child functioning showed most consistent and powerful relations to the HRQoL domains of cognitive functioning, social functioning, daily activities and vitality. These psychosocial variables mainly related to social support and time pressure. Systematic screening of parents to detect problems timely, and interventions targeting the supportive network and the demands in time are recommended. Copyright © 2013 Elsevier Ltd. All rights reserved.
LeBourgeois, Monique K.; Giannotti, Flavia; Cortesi, Flavia; Wolfson, Amy R.; Harsh, John
2014-01-01
Objective The purpose of the study was to examine the relationship between self-reported sleep quality and sleep hygiene in Italian and American adolescents and to assess whether sleep-hygiene practices mediate the relationship between culture and sleep quality. Methods Two nonprobability samples were collected from public schools in Rome, Italy, and Hattiesburg, Mississippi. Students completed the following self-report measures: Adolescent Sleep-Wake Scale, Adolescent Sleep Hygiene Scale, Pubertal Developmental Scale, and Morningness/Eveningness Scale. Results The final sample included 776 Italian and 572 American adolescents 12 to 17 years old. Italian adolescents reported much better sleep hygiene and substantially better sleep quality than American adolescents. A moderate-to-strong linear relationship was found between sleep hygiene and sleep quality in both samples. Separate hierarchical multiple regression analyses were performed on both samples. Demographic and individual characteristics explained a significant proportion of the variance in sleep quality (Italians: 18%; Americans: 25%), and the addition of sleep-hygiene domains explained significantly more variance in sleep quality (Italians: 17%; Americans: 16%). A final hierarchical multiple regression analysis with both samples combined showed that culture (Italy versus United States) only explained 0.8% of the variance in sleep quality after controlling for sleep hygiene and all other variables. Conclusions Cross-cultural differences in sleep quality, for the most part, were due to differences in sleep-hygiene practices. Sleep hygiene is an important predictor of sleep quality in Italian and American adolescents, thus supporting the implementation and evaluation of educational programs on good sleep-hygiene practices. PMID:15866860
Tighe, Elizabeth L.; Schatschneider, Christopher
2015-01-01
The purpose of this study was to investigate the joint and unique contributions of morphological awareness and vocabulary knowledge at five reading comprehension levels in Adult Basic Education (ABE) students. We introduce the statistical technique of multiple quantile regression, which enabled us to assess the predictive utility of morphological awareness and vocabulary knowledge at multiple points (quantiles) along the continuous distribution of reading comprehension. To demonstrate the efficacy of our multiple quantile regression analysis, we compared and contrasted our results with a traditional multiple regression analytic approach. Our results indicated that morphological awareness and vocabulary knowledge accounted for a large portion of the variance (82-95%) in reading comprehension skills across all quantiles. Morphological awareness exhibited the greatest unique predictive ability at lower levels of reading comprehension whereas vocabulary knowledge exhibited the greatest unique predictive ability at higher levels of reading comprehension. These results indicate the utility of using multiple quantile regression to assess trajectories of component skills across multiple levels of reading comprehension. The implications of our findings for ABE programs are discussed. PMID:25351773
ℓ(p)-Norm multikernel learning approach for stock market price forecasting.
Shao, Xigao; Wu, Kun; Liao, Bifeng
2012-01-01
Linear multiple kernel learning model has been used for predicting financial time series. However, ℓ(1)-norm multiple support vector regression is rarely observed to outperform trivial baselines in practical applications. To allow for robust kernel mixtures that generalize well, we adopt ℓ(p)-norm multiple kernel support vector regression (1 ≤ p < ∞) as a stock price prediction model. The optimization problem is decomposed into smaller subproblems, and the interleaved optimization strategy is employed to solve the regression model. The model is evaluated on forecasting the daily stock closing prices of Shanghai Stock Index in China. Experimental results show that our proposed model performs better than ℓ(1)-norm multiple support vector regression model.
Kang, Seung-Gul; Lee, Yu Jin; Kim, Seog Ju; Lim, Weonjeong; Lee, Heon-Jeong; Park, Young-Min; Cho, In Hee; Cho, Seong-Jin; Hong, Jin Pyo
2014-02-01
The current study aims to determine the associations of insufficient sleep with suicide attempts and self-injury in a large, school-based Korean adolescent sample. A sample of 4553 middle- and high-school students (grades 7-10) was recruited in this study. Finally, 4145 students completed self-report questionnaires including items on sleep duration (weekday/weekend), self-injury, suicide attempts during the past year, the Suicidal Ideation Questionnaire (SIQ), and the Beck Depression Inventory (BDI). A multiple linear regression model showed that higher SIQ scores were associated with longer weekend catch-up sleep duration (p=0.009), higher BDI score (p<0.001), and longer time spent in a private educational institute (p=0.025). The multiple logistic regression analysis revealed that longer weekend catch-up sleep duration (p=0.011), higher BDI score (p<0.001), longer time spent in a private educational institute (p=0.046), and poorer academic record (p=0.029) were associated with suicide attempt and self-injury during the past year. The present results suggest that weekend catch-up sleep duration--which is an indicator of insufficient weekday sleep--might be associated with suicide attempts and self-injury in Korean adolescents. © 2014.
Female homicide in Rio Grande do Sul, Brazil.
Leites, Gabriela Tomedi; Meneghel, Stela Nazareth; Hirakata, Vania Noemi
2014-01-01
This study aimed to assess the female homicide rate due to aggression in Rio Grande do Sul, Brazil, using this as a "proxy" of femicide. This was an ecological study which correlated the female homicide rate due to aggression in Rio Grande do Sul, according to the 35 microregions defined by the Brazilian Institute of Geography and Statistics (IBGE), with socioeconomic and demographic variables access and health indicators. Pearson's correlation test was performed with the selected variables. After this, multiple linear regressions were performed with variables with p < 0.20. The standardized average of female homicide rate due to aggression in the period from 2003 to 2007 was 3.1 obits per 100 thousand. After multiple regression analysis, the final model included male mortality due to aggression (p = 0.016), the percentage of hospital admissions for alcohol (p = 0.005) and the proportion of ill-defined deaths (p = 0.015). The model have an explanatory power of 39% (adjusted r2 = 0.391). The results are consistent with other studies and indicate a strong relationship between structural violence in society and violence against women, in addition to a higher incidence of female deaths in places with high alcohol hospitalization.
Nagai, Takashi; Lovalekar, Mita; Wohleber, Meleesa F; Perlsweig, Katherine A; Wirt, Michael D; Beals, Kim
2017-11-01
Musculoskeletal injuries have negatively impacted tactical readiness. The identification of prospective and modifiable risk factors of preventable musculoskeletal injuries can guide specific injury prevention strategies for Soldiers and health care providers. To analyze physiological and neuromuscular characteristics as predictors of preventable musculoskeletal injuries. Prospective-cohort study. A total of 491 Soldiers were enrolled and participated in the baseline laboratory testing, including body composition, aerobic capacity, anaerobic power/capacity, muscular strength, flexibility, static balance, and landing biomechanics. After reviewing their medical charts, 275 male Soldiers who met the criteria were divided into two groups: with injuries (INJ) and no injuries (NOI). Simple and multiple logistic regression analyses were used to calculate the odds ratio (OR) and significant predictors of musculoskeletal injuries (p<0.05). The final multiple logistic regression model included the static balance with eyes-closed and peak anaerobic power as predictors of future injuries (p<0.001). The current results highlighted the importance of anaerobic power/capacity and static balance. High intensity training and balance exercise should be incorporated in their physical training as countermeasures. Copyright © 2017 Sports Medicine Australia. All rights reserved.
ERIC Educational Resources Information Center
Anderson, Carolyn J.; Verkuilen, Jay; Peyton, Buddy L.
2010-01-01
Survey items with multiple response categories and multiple-choice test questions are ubiquitous in psychological and educational research. We illustrate the use of log-multiplicative association (LMA) models that are extensions of the well-known multinomial logistic regression model for multiple dependent outcome variables to reanalyze a set of…
Lin, Chao-Cheng; Bai, Ya-Mei; Chen, Jen-Yeu; Hwang, Tzung-Jeng; Chen, Tzu-Ting; Chiu, Hung-Wen; Li, Yu-Chuan
2010-03-01
Metabolic syndrome (MetS) is an important side effect of second-generation antipsychotics (SGAs). However, many SGA-treated patients with MetS remain undetected. In this study, we trained and validated artificial neural network (ANN) and multiple logistic regression models without biochemical parameters to rapidly identify MetS in patients with SGA treatment. A total of 383 patients with a diagnosis of schizophrenia or schizoaffective disorder (DSM-IV criteria) with SGA treatment for more than 6 months were investigated to determine whether they met the MetS criteria according to the International Diabetes Federation. The data for these patients were collected between March 2005 and September 2005. The input variables of ANN and logistic regression were limited to demographic and anthropometric data only. All models were trained by randomly selecting two-thirds of the patient data and were internally validated with the remaining one-third of the data. The models were then externally validated with data from 69 patients from another hospital, collected between March 2008 and June 2008. The area under the receiver operating characteristic curve (AUC) was used to measure the performance of all models. Both the final ANN and logistic regression models had high accuracy (88.3% vs 83.6%), sensitivity (93.1% vs 86.2%), and specificity (86.9% vs 83.8%) to identify MetS in the internal validation set. The mean +/- SD AUC was high for both the ANN and logistic regression models (0.934 +/- 0.033 vs 0.922 +/- 0.035, P = .63). During external validation, high AUC was still obtained for both models. Waist circumference and diastolic blood pressure were the common variables that were left in the final ANN and logistic regression models. Our study developed accurate ANN and logistic regression models to detect MetS in patients with SGA treatment. The models are likely to provide a noninvasive tool for large-scale screening of MetS in this group of patients. (c) 2010 Physicians Postgraduate Press, Inc.
Isolating and Examining Sources of Suppression and Multicollinearity in Multiple Linear Regression
ERIC Educational Resources Information Center
Beckstead, Jason W.
2012-01-01
The presence of suppression (and multicollinearity) in multiple regression analysis complicates interpretation of predictor-criterion relationships. The mathematical conditions that produce suppression in regression analysis have received considerable attention in the methodological literature but until now nothing in the way of an analytic…
General Nature of Multicollinearity in Multiple Regression Analysis.
ERIC Educational Resources Information Center
Liu, Richard
1981-01-01
Discusses multiple regression, a very popular statistical technique in the field of education. One of the basic assumptions in regression analysis requires that independent variables in the equation should not be highly correlated. The problem of multicollinearity and some of the solutions to it are discussed. (Author)
ℓ p-Norm Multikernel Learning Approach for Stock Market Price Forecasting
Shao, Xigao; Wu, Kun; Liao, Bifeng
2012-01-01
Linear multiple kernel learning model has been used for predicting financial time series. However, ℓ 1-norm multiple support vector regression is rarely observed to outperform trivial baselines in practical applications. To allow for robust kernel mixtures that generalize well, we adopt ℓ p-norm multiple kernel support vector regression (1 ≤ p < ∞) as a stock price prediction model. The optimization problem is decomposed into smaller subproblems, and the interleaved optimization strategy is employed to solve the regression model. The model is evaluated on forecasting the daily stock closing prices of Shanghai Stock Index in China. Experimental results show that our proposed model performs better than ℓ 1-norm multiple support vector regression model. PMID:23365561
Sample size determination for logistic regression on a logit-normal distribution.
Kim, Seongho; Heath, Elisabeth; Heilbrun, Lance
2017-06-01
Although the sample size for simple logistic regression can be readily determined using currently available methods, the sample size calculation for multiple logistic regression requires some additional information, such as the coefficient of determination ([Formula: see text]) of a covariate of interest with other covariates, which is often unavailable in practice. The response variable of logistic regression follows a logit-normal distribution which can be generated from a logistic transformation of a normal distribution. Using this property of logistic regression, we propose new methods of determining the sample size for simple and multiple logistic regressions using a normal transformation of outcome measures. Simulation studies and a motivating example show several advantages of the proposed methods over the existing methods: (i) no need for [Formula: see text] for multiple logistic regression, (ii) available interim or group-sequential designs, and (iii) much smaller required sample size.
Thermal conductance measurements of bolted copper joints for SuperCDMS
Schmitt, R. L.; Tatkowski, G.; Ruschman, M.; ...
2015-04-28
Joint thermal conductance testing has been undertaken for bolted copper to copper connections from 60 mK to 26 K. This testing was performed to validate an initial design basis for the SuperCDMS experiment, where a dilution refrigerator will be coupled to a cryostat via multiple bolted connections. Copper used during testing was either gold plated or passivated with citric acid to prevent surface oxidation. Finally, the results we obtained are well fit by a power law regression of joint thermal conductance to temperature and match well with data collected during a literature review.
NASA Astrophysics Data System (ADS)
Zhao, Wei; Fan, Shaojia; Guo, Hai; Gao, Bo; Sun, Jiaren; Chen, Laiguo
2016-11-01
The quantile regression (QR) method has been increasingly introduced to atmospheric environmental studies to explore the non-linear relationship between local meteorological conditions and ozone mixing ratios. In this study, we applied QR for the first time, together with multiple linear regression (MLR), to analyze the dominant meteorological parameters influencing the mean, 10th percentile, 90th percentile and 99th percentile of maximum daily 8-h average (MDA8) ozone concentrations in 2000-2015 in Hong Kong. The dominance analysis (DA) was used to assess the relative importance of meteorological variables in the regression models. Results showed that the MLR models worked better at suburban and rural sites than at urban sites, and worked better in winter than in summer. QR models performed better in summer for 99th and 90th percentiles and performed better in autumn and winter for 10th percentile. And QR models also performed better in suburban and rural areas for 10th percentile. The top 3 dominant variables associated with MDA8 ozone concentrations, changing with seasons and regions, were frequently associated with the six meteorological parameters: boundary layer height, humidity, wind direction, surface solar radiation, total cloud cover and sea level pressure. Temperature rarely became a significant variable in any season, which could partly explain the peak of monthly average ozone concentrations in October in Hong Kong. And we found the effect of solar radiation would be enhanced during extremely ozone pollution episodes (i.e., the 99th percentile). Finally, meteorological effects on MDA8 ozone had no significant changes before and after the 2010 Asian Games.
Inami, Satoshi; Moridaira, Hiroshi; Takeuchi, Daisaku; Shiba, Yo; Nohara, Yutaka; Taneichi, Hiroshi
2016-11-01
Adult spinal deformity (ASD) classification showing that ideal pelvic incidence minus lumbar lordosis (PI-LL) value is within 10° has been received widely. But no study has focused on the optimum level of PI-LL value that reflects wide variety in PI among patients. This study was conducted to determine the optimum PI-LL value specific to an individual's PI in postoperative ASD patients. 48 postoperative ASD patients were recruited. Spino-pelvic parameters and Oswestry Disability Index (ODI) were measured at the final follow-up. Factors associated with good clinical results were determined by stepwise multiple regression model using the ODI. The patients with ODI under the 75th percentile cutoff were designated into the "good" health related quality of life (HRQOL) group. In this group, the relationship between the PI-LL and PI was assessed by regression analysis. Multiple regression analysis revealed PI-LL as significant parameters associated with ODI. Thirty-six patients with an ODI <22 points (75th percentile cutoff) were categorized into a good HRQOL group, and linear regression models demonstrated the following equation: PI-LL = 0.41PI-11.12 (r = 0.45, P = 0.0059). On the basis of this equation, in the patients with a PI = 50°, the PI-LL is 9°. Whereas in those with a PI = 30°, the optimum PI-LL is calculated to be as low as 1°. In those with a PI = 80°, PI-LL is estimated at 22°. Consequently, an optimum PI-LL is inconsistent in that it depends on the individual PI.
Multivariate Boosting for Integrative Analysis of High-Dimensional Cancer Genomic Data
Xiong, Lie; Kuan, Pei-Fen; Tian, Jianan; Keles, Sunduz; Wang, Sijian
2015-01-01
In this paper, we propose a novel multivariate component-wise boosting method for fitting multivariate response regression models under the high-dimension, low sample size setting. Our method is motivated by modeling the association among different biological molecules based on multiple types of high-dimensional genomic data. Particularly, we are interested in two applications: studying the influence of DNA copy number alterations on RNA transcript levels and investigating the association between DNA methylation and gene expression. For this purpose, we model the dependence of the RNA expression levels on DNA copy number alterations and the dependence of gene expression on DNA methylation through multivariate regression models and utilize boosting-type method to handle the high dimensionality as well as model the possible nonlinear associations. The performance of the proposed method is demonstrated through simulation studies. Finally, our multivariate boosting method is applied to two breast cancer studies. PMID:26609213
Fernandes, David Douglas Sousa; Gomes, Adriano A; Costa, Gean Bezerra da; Silva, Gildo William B da; Véras, Germano
2011-12-15
This work is concerned of evaluate the use of visible and near-infrared (NIR) range, separately and combined, to determine the biodiesel content in biodiesel/diesel blends using Multiple Linear Regression (MLR) and variable selection by Successive Projections Algorithm (SPA). Full spectrum models employing Partial Least Squares (PLS) and variables selection by Stepwise (SW) regression coupled with Multiple Linear Regression (MLR) and PLS models also with variable selection by Jack-Knife (Jk) were compared the proposed methodology. Several preprocessing were evaluated, being chosen derivative Savitzky-Golay with second-order polynomial and 17-point window for NIR and visible-NIR range, with offset correction. A total of 100 blends with biodiesel content between 5 and 50% (v/v) prepared starting from ten sample of biodiesel. In the NIR and visible region the best model was the SPA-MLR using only two and eight wavelengths with RMSEP of 0.6439% (v/v) and 0.5741 respectively, while in the visible-NIR region the best model was the SW-MLR using five wavelengths and RMSEP of 0.9533% (v/v). Results indicate that both spectral ranges evaluated showed potential for developing a rapid and nondestructive method to quantify biodiesel in blends with mineral diesel. Finally, one can still mention that the improvement in terms of prediction error obtained with the procedure for variables selection was significant. Copyright © 2011 Elsevier B.V. All rights reserved.
Mckay, Garrett; Huang, Wenxi; Romera-Castillo, Cristina; Crouch, Jenna E; Rosario-Ortiz, Fernando L; Jaffé, Rudolf
2017-05-16
The antioxidant capacity and formation of photochemically produced reactive intermediates (RI) was studied for water samples collected from the Florida Everglades with different spatial (marsh versus estuarine) and temporal (wet versus dry season) characteristics. Measured RI included triplet excited states of dissolved organic matter ( 3 DOM*), singlet oxygen ( 1 O 2 ), and the hydroxyl radical ( • OH). Single and multiple linear regression modeling were performed using a broad range of extrinsic (to predict RI formation rates, R RI ) and intrinsic (to predict RI quantum yields, Φ RI ) parameters. Multiple linear regression models consistently led to better predictions of R RI and Φ RI for our data set but poor prediction of Φ RI for a previously published data set,1 probably because the predictors are intercorrelated (Pearson's r > 0.5). Single linear regression models were built with data compiled from previously published studies (n ≈ 120) in which E2:E3, S, and Φ RI values were measured, which revealed a high degree of similarity between RI-optical property relationships across DOM samples of diverse sources. This study reveals that • OH formation is, in general, decoupled from 3 DOM* and 1 O 2 formation, providing supporting evidence that 3 DOM* is not a • OH precursor. Finally, Φ RI for 1 O 2 and 3 DOM* correlated negatively with antioxidant activity (a surrogate for electron donating capacity) for the collected samples, which is consistent with intramolecular oxidation of DOM moieties by 3 DOM*.
Estimating Interaction Effects With Incomplete Predictor Variables
Enders, Craig K.; Baraldi, Amanda N.; Cham, Heining
2014-01-01
The existing missing data literature does not provide a clear prescription for estimating interaction effects with missing data, particularly when the interaction involves a pair of continuous variables. In this article, we describe maximum likelihood and multiple imputation procedures for this common analysis problem. We outline 3 latent variable model specifications for interaction analyses with missing data. These models apply procedures from the latent variable interaction literature to analyses with a single indicator per construct (e.g., a regression analysis with scale scores). We also discuss multiple imputation for interaction effects, emphasizing an approach that applies standard imputation procedures to the product of 2 raw score predictors. We thoroughly describe the process of probing interaction effects with maximum likelihood and multiple imputation. For both missing data handling techniques, we outline centering and transformation strategies that researchers can implement in popular software packages, and we use a series of real data analyses to illustrate these methods. Finally, we use computer simulations to evaluate the performance of the proposed techniques. PMID:24707955
NASA Astrophysics Data System (ADS)
Nuccitelli, Dana; Cowtan, Kevin; Jacobs, Peter; Richardson, Mark; Way, Robert G.; Blackburn, Anne-Marie; Stolpe, Martin B.; Cook, John
2014-04-01
Lu (2013) (L13) argued that solar effects and anthropogenic halogenated gases can explain most of the observed warming of global mean surface air temperatures since 1850, with virtually no contribution from atmospheric carbon dioxide (CO2) concentrations. Here we show that this conclusion is based on assumptions about the saturation of the CO2-induced greenhouse effect that have been experimentally falsified. L13 also confuses equilibrium and transient response, and relies on data sources that have been superseeded due to known inaccuracies. Furthermore, the statistical approach of sequential linear regression artificially shifts variance onto the first predictor. L13's artificial choice of regression order and neglect of other relevant data is the fundamental cause of the incorrect main conclusion. Consideration of more modern data and a more parsimonious multiple regression model leads to contradiction with L13's statistical results. Finally, the correlation arguments in L13 are falsified by considering either the more appropriate metric of global heat accumulation, or data on longer timescales.
Tighe, Elizabeth L; Schatschneider, Christopher
2016-07-01
The purpose of this study was to investigate the joint and unique contributions of morphological awareness and vocabulary knowledge at five reading comprehension levels in adult basic education (ABE) students. We introduce the statistical technique of multiple quantile regression, which enabled us to assess the predictive utility of morphological awareness and vocabulary knowledge at multiple points (quantiles) along the continuous distribution of reading comprehension. To demonstrate the efficacy of our multiple quantile regression analysis, we compared and contrasted our results with a traditional multiple regression analytic approach. Our results indicated that morphological awareness and vocabulary knowledge accounted for a large portion of the variance (82%-95%) in reading comprehension skills across all quantiles. Morphological awareness exhibited the greatest unique predictive ability at lower levels of reading comprehension whereas vocabulary knowledge exhibited the greatest unique predictive ability at higher levels of reading comprehension. These results indicate the utility of using multiple quantile regression to assess trajectories of component skills across multiple levels of reading comprehension. The implications of our findings for ABE programs are discussed. © Hammill Institute on Disabilities 2014.
Stepwise versus Hierarchical Regression: Pros and Cons
ERIC Educational Resources Information Center
Lewis, Mitzi
2007-01-01
Multiple regression is commonly used in social and behavioral data analysis. In multiple regression contexts, researchers are very often interested in determining the "best" predictors in the analysis. This focus may stem from a need to identify those predictors that are supportive of theory. Alternatively, the researcher may simply be interested…
Tokunaga, Makoto; Watanabe, Susumu; Sonoda, Shigeru
2017-09-01
Multiple linear regression analysis is often used to predict the outcome of stroke rehabilitation. However, the predictive accuracy may not be satisfactory. The objective of this study was to elucidate the predictive accuracy of a method of calculating motor Functional Independence Measure (mFIM) at discharge from mFIM effectiveness predicted by multiple regression analysis. The subjects were 505 patients with stroke who were hospitalized in a convalescent rehabilitation hospital. The formula "mFIM at discharge = mFIM effectiveness × (91 points - mFIM at admission) + mFIM at admission" was used. By including the predicted mFIM effectiveness obtained through multiple regression analysis in this formula, we obtained the predicted mFIM at discharge (A). We also used multiple regression analysis to directly predict mFIM at discharge (B). The correlation between the predicted and the measured values of mFIM at discharge was compared between A and B. The correlation coefficients were .916 for A and .878 for B. Calculating mFIM at discharge from mFIM effectiveness predicted by multiple regression analysis had a higher degree of predictive accuracy of mFIM at discharge than that directly predicted. Copyright © 2017 National Stroke Association. Published by Elsevier Inc. All rights reserved.
Use of Empirical Estimates of Shrinkage in Multiple Regression: A Caution.
ERIC Educational Resources Information Center
Kromrey, Jeffrey D.; Hines, Constance V.
1995-01-01
The accuracy of four empirical techniques to estimate shrinkage in multiple regression was studied through Monte Carlo simulation. None of the techniques provided unbiased estimates of the population squared multiple correlation coefficient, but the normalized jackknife and bootstrap techniques demonstrated marginally acceptable performance with…
Enhance-Synergism and Suppression Effects in Multiple Regression
ERIC Educational Resources Information Center
Lipovetsky, Stan; Conklin, W. Michael
2004-01-01
Relations between pairwise correlations and the coefficient of multiple determination in regression analysis are considered. The conditions for the occurrence of enhance-synergism and suppression effects when multiple determination becomes bigger than the total of squared correlations of the dependent variable with the regressors are discussed. It…
Factors related to student performance in statistics courses in Lebanon
NASA Astrophysics Data System (ADS)
Naccache, Hiba Salim
The purpose of the present study was to identify factors that may contribute to business students in Lebanese universities having difficulty in introductory and advanced statistics courses. Two statistics courses are required for business majors at Lebanese universities. Students are not obliged to be enrolled in any math courses prior to taking statistics courses. Drawing on recent educational research, this dissertation attempted to identify the relationship between (1) students’ scores on Lebanese university math admissions tests; (2) students’ scores on a test of very basic mathematical concepts; (3) students’ scores on the survey of attitude toward statistics (SATS); (4) course performance as measured by students’ final scores in the course; and (5) their scores on the final exam. Data were collected from 561 students enrolled in multiple sections of two courses: 307 students in the introductory statistics course and 260 in the advanced statistics course in seven campuses across Lebanon over one semester. The multiple regressions results revealed four significant relationships at the introductory level: between students’ scores on the math quiz with their (1) final exam scores; (2) their final averages; (3) the Cognitive subscale of the SATS with their final exam scores; and (4) their final averages. These four significant relationships were also found at the advanced level. In addition, two more significant relationships were found between students’ final average and the two subscales of Effort (5) and Affect (6). No relationship was found between students’ scores on the admission math tests and both their final exam scores and their final averages in both the introductory and advanced level courses. On the other hand, there was no relationship between students’ scores on Lebanese admissions tests and their final achievement. Although these results were consistent across course formats and instructors, they may encourage Lebanese universities to assess the effectiveness of prerequisite math courses. Moreover, these findings may lead the Lebanese Ministry of Education to make changes to the admissions exams, course prerequisites, and course content. Finally, to enhance the attitude of students, new learning techniques, such as group work during class meetings can be helpful, and future research should aim to test the effectiveness of these pedagogical techniques on students’ attitudes toward statistics.
An Effect Size for Regression Predictors in Meta-Analysis
ERIC Educational Resources Information Center
Aloe, Ariel M.; Becker, Betsy Jane
2012-01-01
A new effect size representing the predictive power of an independent variable from a multiple regression model is presented. The index, denoted as r[subscript sp], is the semipartial correlation of the predictor with the outcome of interest. This effect size can be computed when multiple predictor variables are included in the regression model…
Regression Analysis: Legal Applications in Institutional Research
ERIC Educational Resources Information Center
Frizell, Julie A.; Shippen, Benjamin S., Jr.; Luna, Andrew L.
2008-01-01
This article reviews multiple regression analysis, describes how its results should be interpreted, and instructs institutional researchers on how to conduct such analyses using an example focused on faculty pay equity between men and women. The use of multiple regression analysis will be presented as a method with which to compare salaries of…
RAWS II: A MULTIPLE REGRESSION ANALYSIS PROGRAM,
This memorandum gives instructions for the use and operation of a revised version of RAWS, a multiple regression analysis program. The program...of preprocessed data, the directed retention of variable, listing of the matrix of the normal equations and its inverse, and the bypassing of the regression analysis to provide the input variable statistics only. (Author)
Incremental Net Effects in Multiple Regression
ERIC Educational Resources Information Center
Lipovetsky, Stan; Conklin, Michael
2005-01-01
A regular problem in regression analysis is estimating the comparative importance of the predictors in the model. This work considers the 'net effects', or shares of the predictors in the coefficient of the multiple determination, which is a widely used characteristic of the quality of a regression model. Estimation of the net effects can be a…
Floating Data and the Problem with Illustrating Multiple Regression.
ERIC Educational Resources Information Center
Sachau, Daniel A.
2000-01-01
Discusses how to introduce basic concepts of multiple regression by creating a large-scale, three-dimensional regression model using the classroom walls and floor. Addresses teaching points that should be covered and reveals student reaction to the model. Finds that the greatest benefit of the model is the low fear, walk-through, nonmathematical…
Estimating Soil Cation Exchange Capacity from Soil Physical and Chemical Properties
NASA Astrophysics Data System (ADS)
Bateni, S. M.; Emamgholizadeh, S.; Shahsavani, D.
2014-12-01
The soil Cation Exchange Capacity (CEC) is an important soil characteristic that has many applications in soil science and environmental studies. For example, CEC influences soil fertility by controlling the exchange of ions in the soil. Measurement of CEC is costly and difficult. Consequently, several studies attempted to obtain CEC from readily measurable soil physical and chemical properties such as soil pH, organic matter, soil texture, bulk density, and particle size distribution. These studies have often used multiple regression or artificial neural network models. Regression-based models cannot capture the intricate relationship between CEC and soil physical and chemical attributes and provide inaccurate CEC estimates. Although neural network models perform better than regression methods, they act like a black-box and cannot generate an explicit expression for retrieval of CEC from soil properties. In a departure with regression and neural network models, this study uses Genetic Expression Programming (GEP) and Multivariate Adaptive Regression Splines (MARS) to estimate CEC from easily measurable soil variables such as clay, pH, and OM. CEC estimates from GEP and MARS are compared with measurements at two field sites in Iran. Results show that GEP and MARS can estimate CEC accurately. Also, the MARS model performs slightly better than GEP. Finally, a sensitivity test indicates that organic matter and pH have respectively the least and the most significant impact on CEC.
2017-03-23
PUBLIC RELEASE; DISTRIBUTION UNLIMITED Using Multiple and Logistic Regression to Estimate the Median Will- Cost and Probability of Cost and... Cost and Probability of Cost and Schedule Overrun for Program Managers Ryan C. Trudelle Follow this and additional works at: https://scholar.afit.edu...afit.edu. Recommended Citation Trudelle, Ryan C., "Using Multiple and Logistic Regression to Estimate the Median Will- Cost and Probability of Cost and
Ubiquitin-Fused and/or Multiple Early Genes from Cottontail Rabbit Papillomavirus as DNA Vaccines
Leachman, Sancy A.; Shylankevich, Mark; Slade, Martin D.; Levine, Dana; K. Sundaram, Ranjini; Xiao, Wei; Bryan, Marianne; Zelterman, Daniel; Tiegelaar, Robert E.; Brandsma, Janet L.
2002-01-01
Human papillomavirus (HPV) vaccines have the potential to prevent cervical cancer by preventing HPV infection or treating premalignant disease. We previously showed that DNA vaccination with the cottontail rabbit papillomavirus (CRPV) E6 gene induced partial protection against CRPV challenge and that the vaccine's effects were greatly enhanced by priming with granulocyte-macrophage colony-stimulating factor (GM-CSF). In the present study, two additional strategies for augmenting the clinical efficacy of CRPV E6 vaccination were evaluated. The first was to fuse a ubiquitin monomer to the CRPV E6 protein to enhance antigen processing and presentation through the major histocompatibility complex class I pathway. Rabbits vaccinated with the wild-type E6 gene plus GM-CSF or with the ubiquitin-fused E6 gene formed significantly fewer papillomas than the controls. The papillomas also required a longer time to appear and grew more slowly. Finally, a significant proportion of the papillomas subsequently regressed. The ubiquitin-fused E6 vaccine was significantly more effective than the wild-type E6 vaccine plus GM-CSF priming. The second strategy was to vaccinate with multiple CRPV early genes to increase the breadth of the CRPV-specific response. DNA vaccines encoding the wild-type CRPV E1-E2, E6, or E7 protein were tested alone and in all possible combinations. All vaccines and combinations suppressed papilloma formation, slowed papilloma growth, and stimulated subsequent papilloma regression. Finally, the two strategies were merged and a combination DNA vaccine containing ubiquitin-fused versions of the CRPV E1, E2, and E7 genes was tested. This last vaccine prevented papilloma formation at all challenge sites in all rabbits, demonstrating complete protection. PMID:12097575
Prediction of reported consumption of selected fat-containing foods.
Tuorila, H; Pangborn, R M
1988-10-01
A total of 100 American females (mean age = 20.8 years) completed a questionnaire, in which their beliefs, evaluations, liking and consumption (frequency, consumption compared to others, intention to consume) of milk, cheese, ice cream, chocolate and "high-fat foods" were measured. For the design and analysis, the basic frame of reference was the Fishbein-Ajzen model of reasoned action, but the final analyses were carried out with stepwise multiple regression analysis. In addition to the components of the Fishbein-Ajzen model, beliefs and evaluations were used as independent variables. On the average, subjects reported liking all the products but not "high-fat foods", and thought that milk and cheese were "good for you" whereas the remaining items were "bad for you". Principal component analysis for beliefs revealed factors related to pleasantness/benefit aspects, to health and weight concern and to the "functionality" of the foods. In stepwise multiple regression analyses, liking was the predominant predictor of reported consumption for all the foods, but various belief factors, particularly those related to concern with weight, also significantly predicted consumption. Social factors played only a minor role. The multiple R's of the predictive functions varied from 0.49 to 0.74. The fact that all four foods studied elicited individual sets of beliefs and belief structures, and that none of them was rated similar to the generic "high-fat foods", emphasizes that consumers attach meaning to integrated food entities rather than to ingredients.
Tools to Support Interpreting Multiple Regression in the Face of Multicollinearity
Kraha, Amanda; Turner, Heather; Nimon, Kim; Zientek, Linda Reichwein; Henson, Robin K.
2012-01-01
While multicollinearity may increase the difficulty of interpreting multiple regression (MR) results, it should not cause undue problems for the knowledgeable researcher. In the current paper, we argue that rather than using one technique to investigate regression results, researchers should consider multiple indices to understand the contributions that predictors make not only to a regression model, but to each other as well. Some of the techniques to interpret MR effects include, but are not limited to, correlation coefficients, beta weights, structure coefficients, all possible subsets regression, commonality coefficients, dominance weights, and relative importance weights. This article will review a set of techniques to interpret MR effects, identify the elements of the data on which the methods focus, and identify statistical software to support such analyses. PMID:22457655
Tools to support interpreting multiple regression in the face of multicollinearity.
Kraha, Amanda; Turner, Heather; Nimon, Kim; Zientek, Linda Reichwein; Henson, Robin K
2012-01-01
While multicollinearity may increase the difficulty of interpreting multiple regression (MR) results, it should not cause undue problems for the knowledgeable researcher. In the current paper, we argue that rather than using one technique to investigate regression results, researchers should consider multiple indices to understand the contributions that predictors make not only to a regression model, but to each other as well. Some of the techniques to interpret MR effects include, but are not limited to, correlation coefficients, beta weights, structure coefficients, all possible subsets regression, commonality coefficients, dominance weights, and relative importance weights. This article will review a set of techniques to interpret MR effects, identify the elements of the data on which the methods focus, and identify statistical software to support such analyses.
NASA Astrophysics Data System (ADS)
Zahari, Siti Meriam; Ramli, Norazan Mohamed; Moktar, Balkiah; Zainol, Mohammad Said
2014-09-01
In the presence of multicollinearity and multiple outliers, statistical inference of linear regression model using ordinary least squares (OLS) estimators would be severely affected and produces misleading results. To overcome this, many approaches have been investigated. These include robust methods which were reported to be less sensitive to the presence of outliers. In addition, ridge regression technique was employed to tackle multicollinearity problem. In order to mitigate both problems, a combination of ridge regression and robust methods was discussed in this study. The superiority of this approach was examined when simultaneous presence of multicollinearity and multiple outliers occurred in multiple linear regression. This study aimed to look at the performance of several well-known robust estimators; M, MM, RIDGE and robust ridge regression estimators, namely Weighted Ridge M-estimator (WRM), Weighted Ridge MM (WRMM), Ridge MM (RMM), in such a situation. Results of the study showed that in the presence of simultaneous multicollinearity and multiple outliers (in both x and y-direction), the RMM and RIDGE are more or less similar in terms of superiority over the other estimators, regardless of the number of observation, level of collinearity and percentage of outliers used. However, when outliers occurred in only single direction (y-direction), the WRMM estimator is the most superior among the robust ridge regression estimators, by producing the least variance. In conclusion, the robust ridge regression is the best alternative as compared to robust and conventional least squares estimators when dealing with simultaneous presence of multicollinearity and outliers.
An improved multiple linear regression and data analysis computer program package
NASA Technical Reports Server (NTRS)
Sidik, S. M.
1972-01-01
NEWRAP, an improved version of a previous multiple linear regression program called RAPIER, CREDUC, and CRSPLT, allows for a complete regression analysis including cross plots of the independent and dependent variables, correlation coefficients, regression coefficients, analysis of variance tables, t-statistics and their probability levels, rejection of independent variables, plots of residuals against the independent and dependent variables, and a canonical reduction of quadratic response functions useful in optimum seeking experimentation. A major improvement over RAPIER is that all regression calculations are done in double precision arithmetic.
NASA Astrophysics Data System (ADS)
Cao, Faxian; Yang, Zhijing; Ren, Jinchang; Ling, Wing-Kuen; Zhao, Huimin; Marshall, Stephen
2017-12-01
Although the sparse multinomial logistic regression (SMLR) has provided a useful tool for sparse classification, it suffers from inefficacy in dealing with high dimensional features and manually set initial regressor values. This has significantly constrained its applications for hyperspectral image (HSI) classification. In order to tackle these two drawbacks, an extreme sparse multinomial logistic regression (ESMLR) is proposed for effective classification of HSI. First, the HSI dataset is projected to a new feature space with randomly generated weight and bias. Second, an optimization model is established by the Lagrange multiplier method and the dual principle to automatically determine a good initial regressor for SMLR via minimizing the training error and the regressor value. Furthermore, the extended multi-attribute profiles (EMAPs) are utilized for extracting both the spectral and spatial features. A combinational linear multiple features learning (MFL) method is proposed to further enhance the features extracted by ESMLR and EMAPs. Finally, the logistic regression via the variable splitting and the augmented Lagrangian (LORSAL) is adopted in the proposed framework for reducing the computational time. Experiments are conducted on two well-known HSI datasets, namely the Indian Pines dataset and the Pavia University dataset, which have shown the fast and robust performance of the proposed ESMLR framework.
ERIC Educational Resources Information Center
Baylor, Carolyn; Yorkston, Kathryn; Bamer, Alyssa; Britton, Deanna; Amtmann, Dagmar
2010-01-01
Purpose: To explore variables associated with self-reported communicative participation in a sample (n = 498) of community-dwelling adults with multiple sclerosis (MS). Method: A battery of questionnaires was administered online or on paper per participant preference. Data were analyzed using multiple linear backward stepwise regression. The…
Gotvald, Anthony J.; Barth, Nancy A.; Veilleux, Andrea G.; Parrett, Charles
2012-01-01
Methods for estimating the magnitude and frequency of floods in California that are not substantially affected by regulation or diversions have been updated. Annual peak-flow data through water year 2006 were analyzed for 771 streamflow-gaging stations (streamgages) in California having 10 or more years of data. Flood-frequency estimates were computed for the streamgages by using the expected moments algorithm to fit a Pearson Type III distribution to logarithms of annual peak flows for each streamgage. Low-outlier and historic information were incorporated into the flood-frequency analysis, and a generalized Grubbs-Beck test was used to detect multiple potentially influential low outliers. Special methods for fitting the distribution were developed for streamgages in the desert region in southeastern California. Additionally, basin characteristics for the streamgages were computed by using a geographical information system. Regional regression analysis, using generalized least squares regression, was used to develop a set of equations for estimating flows with 50-, 20-, 10-, 4-, 2-, 1-, 0.5-, and 0.2-percent annual exceedance probabilities for ungaged basins in California that are outside of the southeastern desert region. Flood-frequency estimates and basin characteristics for 630 streamgages were combined to form the final database used in the regional regression analysis. Five hydrologic regions were developed for the area of California outside of the desert region. The final regional regression equations are functions of drainage area and mean annual precipitation for four of the five regions. In one region, the Sierra Nevada region, the final equations are functions of drainage area, mean basin elevation, and mean annual precipitation. Average standard errors of prediction for the regression equations in all five regions range from 42.7 to 161.9 percent. For the desert region of California, an analysis of 33 streamgages was used to develop regional estimates of all three parameters (mean, standard deviation, and skew) of the log-Pearson Type III distribution. The regional estimates were then used to develop a set of equations for estimating flows with 50-, 20-, 10-, 4-, 2-, 1-, 0.5-, and 0.2-percent annual exceedance probabilities for ungaged basins. The final regional regression equations are functions of drainage area. Average standard errors of prediction for these regression equations range from 214.2 to 856.2 percent. Annual peak-flow data through water year 2006 were analyzed for eight streamgages in California having 10 or more years of data considered to be affected by urbanization. Flood-frequency estimates were computed for the urban streamgages by fitting a Pearson Type III distribution to logarithms of annual peak flows for each streamgage. Regression analysis could not be used to develop flood-frequency estimation equations for urban streams because of the limited number of sites. Flood-frequency estimates for the eight urban sites were graphically compared to flood-frequency estimates for 630 non-urban sites. The regression equations developed from this study will be incorporated into the U.S. Geological Survey (USGS) StreamStats program. The StreamStats program is a Web-based application that provides streamflow statistics and basin characteristics for USGS streamgages and ungaged sites of interest. StreamStats can also compute basin characteristics and provide estimates of streamflow statistics for ungaged sites when users select the location of a site along any stream in California.
Motivation and Self-Management Behavior of the Individuals With Chronic Low Back Pain.
Jung, Mi Jung; Jeong, Younhee
2016-01-01
Self-management behavior is an important component for successful pain management in individuals with chronic low back pain. Motivation has been considered as an effective way to change behavior. Because there are other physical, social, and psychological factors affecting individuals with pain, it is necessary to identify the main effect of motivation on self-management behavior without the influence of those factors. The purpose of this study was to investigate the effect of motivation on self-management in controlling pain, depression, and social support. We used a nonexperimental, cross-sectional, descriptive design with mediation analysis and included 120 participants' data in the final analysis. We also used hierarchical multiple regression to test the effect of motivation, and multiple regression analysis and Sobel test were used to examine the mediating effect. Motivation itself accounted for 23.4% of the variance in self-management, F(1, 118) = 35.003, p < .001. After controlling covariates, motivation was also a significant factor for self-management. In the mediation analysis, motivation completely mediated the relationship between education and self-management, z = 2.292, p = .021. Motivation is an important part of self-management, and self-management education is not effective without motivation. The results of our study suggest that nurses incorporate motivation in nursing intervention, rather than only giving information.
Ghorbandordinejad, Farhad; Ahmadabad, Roghayyeh Moradian
2016-06-01
This study investigated the relationship between autonomy and English language achievement among third-grade high school students as mediated by foreign language classroom anxiety in a city in the north-west of Iran. A sample of 400 students (187 males, and 213 females) was assessed for their levels of autonomy and foreign language anxiety using the Autonomy Questionnaire and Foreign Language Classroom Anxiety Scale (FLCAS), respectively. Participants' scores on their final English exam were also used as the measurement of their English achievement. The results of Pearson correlation revealed a strong correlation between learners' autonomy and their English achievement (r [Formula: see text] .406, n [Formula: see text] 400, [Formula: see text]). Also, foreign language classroom anxiety was found to be significantly and negatively correlated with English achievement (r [Formula: see text] [Formula: see text].472, n [Formula: see text] 400, [Formula: see text]). Hierarchical multiple regression was used to assess the ability of autonomy to predict language learning achievement, after controlling for the influence of anxiety. In sum, the results of hierarchical multiple regressions revealed that foreign language classroom anxiety significantly mediates the relationship between autonomy and English language achievement. Implications for both teachers and learners, and suggestions for further research are provided.
Depression in non-Korean women residing in South Korea following marriage to Korean men.
Kim, Hyun-Sil; Kim, Hun-Soo
2013-06-01
The purpose of the study was to examine the roles of acculturative stress, life satisfaction, and language literacy in depression in non-Korean women residing in South Korea following marriage to Korean men. A cross-sectional study was performed, using an anonymous, self-reporting questionnaire. A total of 173 women were selected using a proportional stratified random sampling method. The relation between acculturation, depression, language literacy, life satisfaction and socio-demographic variables and the predictors of depression among participants were analyzed. The analysis included descriptive statistics and hierarchical multiple regression. Of the participants, 9.2% had depression, which was almost twice the rate of depression found in the general Korean population. In hierarchical multiple regression analysis, acculturative stress (beta=-.325, P<.001) and life satisfaction (beta=-.282, P=.003) were significantly associated with the level of depression. This final model was statistically significant and life satisfaction, acculturative stress, language literacy accounted for 31.0% (adjusted R(2)) of the variance in the depression score (P<.001). Elevated acculturative stress and less life satisfaction were significantly associated with a higher level of depression in migrant wives in Korea. Implications for practice and research are discussed. Copyright © 2013 Elsevier Inc. All rights reserved.
Suzuki, Taku; Iwamoto, Takuji; Shizu, Kanae; Suzuki, Katsuji; Yamada, Harumoto; Sato, Kazuki
2017-05-01
This retrospective study was designed to investigate prognostic factors for postoperative outcomes for cubital tunnel syndrome (CubTS) using multiple logistic regression analysis with a large number of patients. Eighty-three patients with CubTS who underwent surgeries were enrolled. The following potential prognostic factors for disease severity were selected according to previous reports: sex, age, type of surgery, disease duration, body mass index, cervical lesion, presence of diabetes mellitus, Workers' Compensation status, preoperative severity, and preoperative electrodiagnostic testing. Postoperative severity of disease was assessed 2 years after surgery by Messina's criteria which is an outcome measure specifically for CubTS. Bivariate analysis was performed to select candidate prognostic factors for multiple linear regression analyses. Multiple logistic regression analysis was conducted to identify the association between postoperative severity and selected prognostic factors. Both bivariate and multiple linear regression analysis revealed only preoperative severity as an independent risk factor for poor prognosis, while other factors did not show any significant association. Although conflicting results exist regarding prognosis of CubTS, this study supports evidence from previous studies and concludes early surgical intervention portends the most favorable prognosis. Copyright © 2017 The Japanese Orthopaedic Association. Published by Elsevier B.V. All rights reserved.
Predicting perceptual quality of images in realistic scenario using deep filter banks
NASA Astrophysics Data System (ADS)
Zhang, Weixia; Yan, Jia; Hu, Shiyong; Ma, Yang; Deng, Dexiang
2018-03-01
Classical image perceptual quality assessment models usually resort to natural scene statistic methods, which are based on an assumption that certain reliable statistical regularities hold on undistorted images and will be corrupted by introduced distortions. However, these models usually fail to accurately predict degradation severity of images in realistic scenarios since complex, multiple, and interactive authentic distortions usually appear on them. We propose a quality prediction model based on convolutional neural network. Quality-aware features extracted from filter banks of multiple convolutional layers are aggregated into the image representation. Furthermore, an easy-to-implement and effective feature selection strategy is used to further refine the image representation and finally a linear support vector regression model is trained to map image representation into images' subjective perceptual quality scores. The experimental results on benchmark databases present the effectiveness and generalizability of the proposed model.
The Geometry of Enhancement in Multiple Regression
ERIC Educational Resources Information Center
Waller, Niels G.
2011-01-01
In linear multiple regression, "enhancement" is said to occur when R[superscript 2] = b[prime]r greater than r[prime]r, where b is a p x 1 vector of standardized regression coefficients and r is a p x 1 vector of correlations between a criterion y and a set of standardized regressors, x. When p = 1 then b [is congruent to] r and…
Personality traits and life satisfaction among online game players.
Chen, Lily Shui-Lien; Tu, Hill Hung-Jen; Wang, Edward Shih-Tse
2008-04-01
The DFC Intelligence predicts worldwide online game revenues will reach $9.8 billion by 2009, making online gaming a mainstream recreational activity. Understanding online game player personality traits is therefore important. This study researches the relationship between personality traits and life satisfaction in online game players. Taipei, Taiwan, is the study location, with questionnaire surveys conducted in cyber cafe shops. Multiple regression analysis studies the causal relationship between personality traits and life satisfaction in online game players. The result shows that neuroticism has significant negative influence on life satisfaction. Both openness and conscientiousness have significant positive influence on life satisfaction. Finally, implications for leisure practice and further research are discussed.
NASA Astrophysics Data System (ADS)
Igarashi, Masayasu; Murao, Osamu
In this paper, the authors develop a multiple regression model which estimates urban earthquake vulnerability (building collapse risk and conflagration risk) for different eras, and clarify the historical changes of urban risk in Marunouchi and Ginza Districts in Tokyo, Japan using old maps and contemporary geographic information data. Also, we compare the change of urban vulnerability of the districts with the significant historical events in Tokyo. Finally, the results are loaded onto Google Earth with timescale extension to consider the possibility of urban recovery digital archives in the era of the recent geoinformatic technologies.
Jung, Juergen
2013-01-01
We explore the determinants of inspection outcomes across 1.6 million Occupational Safety and Health Agency (OSHA) audits from 1990 through 2010. We find that discretion in enforcement differs in state and federally conducted inspections. State agencies are more sensitive to local economic conditions, finding fewer standard violations and fewer serious violations as unemployment increases. Larger companies receive greater lenience in multiple dimensions. Inspector issued fines and final fines, after negotiated reductions, are both smaller during Republican presidencies. Quantile regression analysis reveals that Presidential and Congressional party affiliations have their greatest impact on the largest negotiated reductions in fines. PMID:24659856
Color vision impairment in multiple sclerosis points to retinal ganglion cell damage.
Lampert, E J; Andorra, M; Torres-Torres, R; Ortiz-Pérez, S; Llufriu, S; Sepúlveda, M; Sola, N; Saiz, A; Sánchez-Dalmau, B; Villoslada, P; Martínez-Lapiscina, Elena H
2015-11-01
Multiple Sclerosis (MS) results in color vision impairment regardless of optic neuritis (ON). The exact location of injury remains undefined. The objective of this study is to identify the region leading to dyschromatopsia in MS patients' NON-eyes. We evaluated Spearman correlations between color vision and measures of different regions in the afferent visual pathway in 106 MS patients. Regions with significant correlations were included in logistic regression models to assess their independent role in dyschromatopsia. We evaluated color vision with Hardy-Rand-Rittler plates and retinal damage using Optical Coherence Tomography. We ran SIENAX to measure Normalized Brain Parenchymal Volume (NBPV), FIRST for thalamus volume and Freesurfer for visual cortex areas. We found moderate, significant correlations between color vision and macular retinal nerve fiber layer (rho = 0.289, p = 0.003), ganglion cell complex (GCC = GCIP) (rho = 0.353, p < 0.001), thalamus (rho = 0.361, p < 0.001), and lesion volume within the optic radiations (rho = -0.230, p = 0.030). Only GCC thickness remained significant (p = 0.023) in the logistic regression model. In the final model including lesion load and NBPV as markers of diffuse neuroaxonal damage, GCC remained associated with dyschromatopsia [OR = 0.88 95 % CI (0.80-0.97) p = 0.016]. This association remained significant when we also added sex, age, and disease duration as covariates in the regression model. Dyschromatopsia in NON-eyes is due to damage of retinal ganglion cells (RGC) in MS. Color vision can serve as a marker of RGC damage in MS.
Advanced statistics: linear regression, part I: simple linear regression.
Marill, Keith A
2004-01-01
Simple linear regression is a mathematical technique used to model the relationship between a single independent predictor variable and a single dependent outcome variable. In this, the first of a two-part series exploring concepts in linear regression analysis, the four fundamental assumptions and the mechanics of simple linear regression are reviewed. The most common technique used to derive the regression line, the method of least squares, is described. The reader will be acquainted with other important concepts in simple linear regression, including: variable transformations, dummy variables, relationship to inference testing, and leverage. Simplified clinical examples with small datasets and graphic models are used to illustrate the points. This will provide a foundation for the second article in this series: a discussion of multiple linear regression, in which there are multiple predictor variables.
Chowdhury, Nilotpal; Sapru, Shantanu
2015-01-01
Microarray analysis has revolutionized the role of genomic prognostication in breast cancer. However, most studies are single series studies, and suffer from methodological problems. We sought to use a meta-analytic approach in combining multiple publicly available datasets, while correcting for batch effects, to reach a more robust oncogenomic analysis. The aim of the present study was to find gene sets associated with distant metastasis free survival (DMFS) in systemically untreated, node-negative breast cancer patients, from publicly available genomic microarray datasets. Four microarray series (having 742 patients) were selected after a systematic search and combined. Cox regression for each gene was done for the combined dataset (univariate, as well as multivariate - adjusted for expression of Cell cycle related genes) and for the 4 major molecular subtypes. The centre and microarray batch effects were adjusted by including them as random effects variables. The Cox regression coefficients for each analysis were then ranked and subjected to a Gene Set Enrichment Analysis (GSEA). Gene sets representing protein translation were independently negatively associated with metastasis in the Luminal A and Luminal B subtypes, but positively associated with metastasis in Basal tumors. Proteinaceous extracellular matrix (ECM) gene set expression was positively associated with metastasis, after adjustment for expression of cell cycle related genes on the combined dataset. Finally, the positive association of the proliferation-related genes with metastases was confirmed. To the best of our knowledge, the results depicting mixed prognostic significance of protein translation in breast cancer subtypes are being reported for the first time. We attribute this to our study combining multiple series and performing a more robust meta-analytic Cox regression modeling on the combined dataset, thus discovering 'hidden' associations. This methodology seems to yield new and interesting results and may be used as a tool to guide new research.
Chowdhury, Nilotpal; Sapru, Shantanu
2015-01-01
Introduction Microarray analysis has revolutionized the role of genomic prognostication in breast cancer. However, most studies are single series studies, and suffer from methodological problems. We sought to use a meta-analytic approach in combining multiple publicly available datasets, while correcting for batch effects, to reach a more robust oncogenomic analysis. Aim The aim of the present study was to find gene sets associated with distant metastasis free survival (DMFS) in systemically untreated, node-negative breast cancer patients, from publicly available genomic microarray datasets. Methods Four microarray series (having 742 patients) were selected after a systematic search and combined. Cox regression for each gene was done for the combined dataset (univariate, as well as multivariate – adjusted for expression of Cell cycle related genes) and for the 4 major molecular subtypes. The centre and microarray batch effects were adjusted by including them as random effects variables. The Cox regression coefficients for each analysis were then ranked and subjected to a Gene Set Enrichment Analysis (GSEA). Results Gene sets representing protein translation were independently negatively associated with metastasis in the Luminal A and Luminal B subtypes, but positively associated with metastasis in Basal tumors. Proteinaceous extracellular matrix (ECM) gene set expression was positively associated with metastasis, after adjustment for expression of cell cycle related genes on the combined dataset. Finally, the positive association of the proliferation-related genes with metastases was confirmed. Conclusion To the best of our knowledge, the results depicting mixed prognostic significance of protein translation in breast cancer subtypes are being reported for the first time. We attribute this to our study combining multiple series and performing a more robust meta-analytic Cox regression modeling on the combined dataset, thus discovering 'hidden' associations. This methodology seems to yield new and interesting results and may be used as a tool to guide new research. PMID:26080057
NASA Astrophysics Data System (ADS)
Nishidate, Izumi; Wiswadarma, Aditya; Hase, Yota; Tanaka, Noriyuki; Maeda, Takaaki; Niizeki, Kyuichi; Aizu, Yoshihisa
2011-08-01
In order to visualize melanin and blood concentrations and oxygen saturation in human skin tissue, a simple imaging technique based on multispectral diffuse reflectance images acquired at six wavelengths (500, 520, 540, 560, 580 and 600nm) was developed. The technique utilizes multiple regression analysis aided by Monte Carlo simulation for diffuse reflectance spectra. Using the absorbance spectrum as a response variable and the extinction coefficients of melanin, oxygenated hemoglobin, and deoxygenated hemoglobin as predictor variables, multiple regression analysis provides regression coefficients. Concentrations of melanin and total blood are then determined from the regression coefficients using conversion vectors that are deduced numerically in advance, while oxygen saturation is obtained directly from the regression coefficients. Experiments with a tissue-like agar gel phantom validated the method. In vivo experiments with human skin of the human hand during upper limb occlusion and of the inner forearm exposed to UV irradiation demonstrated the ability of the method to evaluate physiological reactions of human skin tissue.
ERIC Educational Resources Information Center
Quinino, Roberto C.; Reis, Edna A.; Bessegato, Lupercio F.
2013-01-01
This article proposes the use of the coefficient of determination as a statistic for hypothesis testing in multiple linear regression based on distributions acquired by beta sampling. (Contains 3 figures.)
Lifespan development of pro- and anti-saccades: multiple regression models for point estimates.
Klein, Christoph; Foerster, Friedrich; Hartnegg, Klaus; Fischer, Burkhart
2005-12-07
The comparative study of anti- and pro-saccade task performance contributes to our functional understanding of the frontal lobes, their alterations in psychiatric or neurological populations, and their changes during the life span. In the present study, we apply regression analysis to model life span developmental effects on various pro- and anti-saccade task parameters, using data of a non-representative sample of 327 participants aged 9 to 88 years. Development up to the age of about 27 years was dominated by curvilinear rather than linear effects of age. Furthermore, the largest developmental differences were found for intra-subject variability measures and the anti-saccade task parameters. Ageing, by contrast, had the shape of a global linear decline of the investigated saccade functions, lacking the differential effects of age observed during development. While these results do support the assumption that frontal lobe functions can be distinguished from other functions by their strong and protracted development, they do not confirm the assumption of disproportionate deterioration of frontal lobe functions with ageing. We finally show that the regression models applied here to quantify life span developmental effects can also be used for individual predictions in applied research contexts or clinical practice.
Rahman, Md. Jahanur; Shamim, Abu Ahmed; Klemm, Rolf D. W.; Labrique, Alain B.; Rashid, Mahbubur; Christian, Parul; West, Keith P.
2017-01-01
Birth weight, length and circumferences of the head, chest and arm are key measures of newborn size and health in developing countries. We assessed maternal socio-demographic factors associated with multiple measures of newborn size in a large rural population in Bangladesh using partial least squares (PLS) regression method. PLS regression, combining features from principal component analysis and multiple linear regression, is a multivariate technique with an ability to handle multicollinearity while simultaneously handling multiple dependent variables. We analyzed maternal and infant data from singletons (n = 14,506) born during a double-masked, cluster-randomized, placebo-controlled maternal vitamin A or β-carotene supplementation trial in rural northwest Bangladesh. PLS regression results identified numerous maternal factors (parity, age, early pregnancy MUAC, living standard index, years of education, number of antenatal care visits, preterm delivery and infant sex) significantly (p<0.001) associated with newborn size. Among them, preterm delivery had the largest negative influence on newborn size (Standardized β = -0.29 − -0.19; p<0.001). Scatter plots of the scores of first two PLS components also revealed an interaction between newborn sex and preterm delivery on birth size. PLS regression was found to be more parsimonious than both ordinary least squares regression and principal component regression. It also provided more stable estimates than the ordinary least squares regression and provided the effect measure of the covariates with greater accuracy as it accounts for the correlation among the covariates and outcomes. Therefore, PLS regression is recommended when either there are multiple outcome measurements in the same study, or the covariates are correlated, or both situations exist in a dataset. PMID:29261760
Kabir, Alamgir; Rahman, Md Jahanur; Shamim, Abu Ahmed; Klemm, Rolf D W; Labrique, Alain B; Rashid, Mahbubur; Christian, Parul; West, Keith P
2017-01-01
Birth weight, length and circumferences of the head, chest and arm are key measures of newborn size and health in developing countries. We assessed maternal socio-demographic factors associated with multiple measures of newborn size in a large rural population in Bangladesh using partial least squares (PLS) regression method. PLS regression, combining features from principal component analysis and multiple linear regression, is a multivariate technique with an ability to handle multicollinearity while simultaneously handling multiple dependent variables. We analyzed maternal and infant data from singletons (n = 14,506) born during a double-masked, cluster-randomized, placebo-controlled maternal vitamin A or β-carotene supplementation trial in rural northwest Bangladesh. PLS regression results identified numerous maternal factors (parity, age, early pregnancy MUAC, living standard index, years of education, number of antenatal care visits, preterm delivery and infant sex) significantly (p<0.001) associated with newborn size. Among them, preterm delivery had the largest negative influence on newborn size (Standardized β = -0.29 - -0.19; p<0.001). Scatter plots of the scores of first two PLS components also revealed an interaction between newborn sex and preterm delivery on birth size. PLS regression was found to be more parsimonious than both ordinary least squares regression and principal component regression. It also provided more stable estimates than the ordinary least squares regression and provided the effect measure of the covariates with greater accuracy as it accounts for the correlation among the covariates and outcomes. Therefore, PLS regression is recommended when either there are multiple outcome measurements in the same study, or the covariates are correlated, or both situations exist in a dataset.
Sa'adeh, Hala H; Darwazeh, Razan N; Khalil, Amani A; Zyoud, Sa'ed H
2018-01-01
Hypertension is the second most common cause of chronic kidney disease (CKD). Therefore, the aims of the study were to assess the knowledge, attitudes and practices (KAP) of hypertensive patients towards prevention and early detection of CKD, and to determine the clinical and socio-demographic factors, which affect the KAP regarding prevention of CKD. A cross-sectional study was held using the CKD screening Index to assess the KAP of 374 hypertensive patients who were selected from multiple primary healthcare centers in Nablus, Palestine. The CKD Screening Index is formed of three scales. First, the knowledge scale was a dichotomous scale of 30 items, while the attitude scale used 5-point Likert-type scale for 18 items and finally the practice scale was measured using 4-point Likert-type scale for 12 items. Multiple linear regression analysis was used to determine the association between clinical and socio-demographic factors and practices. In total, 374 hypertensive patients participated in the study. The mean age of participants was 59.14 ± 10.4 years, (range 26-85). The median (interquartile range) of the knowledge, attitude, and practice scores of hypertensive patients towards prevention and early detection of CKD were 20 (16-23), 69 (65-72), and 39 (36-42), respectively. In multiple linear regression analysis, patients age < 65 years ( p < 0.001) and patients with high education level ( p = 0.009) were the only factors significantly associated with higher knowledge scores. Additionally, patients age < 65 years ( p = 0.007), patients with high income ( p = 0.005), and patients with high knowledge score ( p < 0.001) were the only factors significantly associated with higher attitude scores. Furthermore, regression analysis showed that patients with higher total knowledge ( p = 0.001) as well as higher total attitudes scores towards CKD prevention ( p < 0.001), male gender ( p = 0.048), and patients with normal body mass index (BMI) ( p = 0.026) were statistically significantly associated with higher practice score towards CKD prevention. Among hypertensive patients, higher scores for total knowledge and attitudes toward prevention, male sex, and normal BMI were associated with modestly higher scores for prevention practices. Finally the findings may encourage healthcare workers to give better counseling to improve knowledge.
The M Word: Multicollinearity in Multiple Regression.
ERIC Educational Resources Information Center
Morrow-Howell, Nancy
1994-01-01
Notes that existence of substantial correlation between two or more independent variables creates problems of multicollinearity in multiple regression. Discusses multicollinearity problem in social work research in which independent variables are usually intercorrelated. Clarifies problems created by multicollinearity, explains detection of…
Ling, Ru; Liu, Jiawang
2011-12-01
To construct prediction model for health workforce and hospital beds in county hospitals of Hunan by multiple linear regression. We surveyed 16 counties in Hunan with stratified random sampling according to uniform questionnaires,and multiple linear regression analysis with 20 quotas selected by literature view was done. Independent variables in the multiple linear regression model on medical personnels in county hospitals included the counties' urban residents' income, crude death rate, medical beds, business occupancy, professional equipment value, the number of devices valued above 10 000 yuan, fixed assets, long-term debt, medical income, medical expenses, outpatient and emergency visits, hospital visits, actual available bed days, and utilization rate of hospital beds. Independent variables in the multiple linear regression model on county hospital beds included the the population of aged 65 and above in the counties, disposable income of urban residents, medical personnel of medical institutions in county area, business occupancy, the total value of professional equipment, fixed assets, long-term debt, medical income, medical expenses, outpatient and emergency visits, hospital visits, actual available bed days, utilization rate of hospital beds, and length of hospitalization. The prediction model shows good explanatory and fitting, and may be used for short- and mid-term forecasting.
A regression-kriging model for estimation of rainfall in the Laohahe basin
NASA Astrophysics Data System (ADS)
Wang, Hong; Ren, Li L.; Liu, Gao H.
2009-10-01
This paper presents a multivariate geostatistical algorithm called regression-kriging (RK) for predicting the spatial distribution of rainfall by incorporating five topographic/geographic factors of latitude, longitude, altitude, slope and aspect. The technique is illustrated using rainfall data collected at 52 rain gauges from the Laohahe basis in northeast China during 1986-2005 . Rainfall data from 44 stations were selected for modeling and the remaining 8 stations were used for model validation. To eliminate multicollinearity, the five explanatory factors were first transformed using factor analysis with three Principal Components (PCs) extracted. The rainfall data were then fitted using step-wise regression and residuals interpolated using SK. The regression coefficients were estimated by generalized least squares (GLS), which takes the spatial heteroskedasticity between rainfall and PCs into account. Finally, the rainfall prediction based on RK was compared with that predicted from ordinary kriging (OK) and ordinary least squares (OLS) multiple regression (MR). For correlated topographic factors are taken into account, RK improves the efficiency of predictions. RK achieved a lower relative root mean square error (RMSE) (44.67%) than MR (49.23%) and OK (73.60%) and a lower bias than MR and OK (23.82 versus 30.89 and 32.15 mm) for annual rainfall. It is much more effective for the wet season than for the dry season. RK is suitable for estimation of rainfall in areas where there are no stations nearby and where topography has a major influence on rainfall.
"Photographing money" task pricing
NASA Astrophysics Data System (ADS)
Jia, Zhongxiang
2018-05-01
"Photographing money" [1]is a self-service model under the mobile Internet. The task pricing is reasonable, related to the success of the commodity inspection. First of all, we analyzed the position of the mission and the membership, and introduced the factor of membership density, considering the influence of the number of members around the mission on the pricing. Multivariate regression of task location and membership density using MATLAB to establish the mathematical model of task pricing. At the same time, we can see from the life experience that membership reputation and the intensity of the task will also affect the pricing, and the data of the task success point is more reliable. Therefore, the successful point of the task is selected, and its reputation, task density, membership density and Multiple regression of task positions, according to which a nhew task pricing program. Finally, an objective evaluation is given of the advantages and disadvantages of the established model and solution method, and the improved method is pointed out.
Hong, Soo Jung
2018-08-01
This study investigates the effects of cultural norms on family health history (FHH) communication in the American, Chinese, and Korean cultures. More particularly, this study focuses on perceived family boundaries, subjective norms, stigma beliefs, and privacy boundaries, including age and gender, that affect people's FHH communication. For data analyses, hierarchical multiple regression and logistic regression methods were employed. The results indicate that participants' subjective norms, stigma beliefs, and perceived family/privacy boundaries were positively associated with current FHH communication. Age- and gender-related privacy boundaries were negatively related to perceived privacy boundaries, however. Finally, the results show that gendered cultural identities have three-way interaction effects on two associations: (1) between perceived family boundaries and perceived privacy boundaries and (2) between perceived privacy boundaries and current FHH communication. The findings have meaningful implications for future cross-cultural studies on the roles of family systems, subjective norms, and stigma beliefs in FHH communication.
Genotype-phenotype association study via new multi-task learning model
Huo, Zhouyuan; Shen, Dinggang
2018-01-01
Research on the associations between genetic variations and imaging phenotypes is developing with the advance in high-throughput genotype and brain image techniques. Regression analysis of single nucleotide polymorphisms (SNPs) and imaging measures as quantitative traits (QTs) has been proposed to identify the quantitative trait loci (QTL) via multi-task learning models. Recent studies consider the interlinked structures within SNPs and imaging QTs through group lasso, e.g. ℓ2,1-norm, leading to better predictive results and insights of SNPs. However, group sparsity is not enough for representing the correlation between multiple tasks and ℓ2,1-norm regularization is not robust either. In this paper, we propose a new multi-task learning model to analyze the associations between SNPs and QTs. We suppose that low-rank structure is also beneficial to uncover the correlation between genetic variations and imaging phenotypes. Finally, we conduct regression analysis of SNPs and QTs. Experimental results show that our model is more accurate in prediction than compared methods and presents new insights of SNPs. PMID:29218896
Functional Capacity Evaluation in Different Societal Contexts: Results of a Multicountry Study.
Ansuategui Echeita, Jone; Bethge, Matthias; van Holland, Berry J; Gross, Douglas P; Kool, Jan; Oesch, Peter; Trippolini, Maurizio A; Chapman, Elizabeth; Cheng, Andy S K; Sellars, Robert; Spavins, Megan; Streibelt, Marco; van der Wurff, Peter; Reneman, Michiel F
2018-05-25
Purpose To examine factors associated with Functional Capacity Evaluation (FCE) results in patients with painful musculoskeletal conditions, with focus on social factors across multiple countries. Methods International cross-sectional study was performed within care as usual. Simple and multiple multilevel linear regression analyses which considered measurement's dependency within clinicians and country were conducted: FCE characteristics and biopsychosocial variables from patients and clinicians as independent variables; and FCE results (floor-to-waist lift, six-minute walk, and handgrip strength) as dependent variables. Results Data were collected for 372 patients, 54 clinicians, 18 facilities and 8 countries. Patients' height and reported pain intensity were consistently associated with every FCE result. Patients' sex, height, reported pain intensity, effort during FCE, social isolation, and disability, clinician's observed physical effort, and whether FCE test was prematurely ended were associated with lift. Patient's height, Body Mass Index, post-test heart-rate, reported pain intensity and effort during FCE, days off work, and whether FCE test was prematurely ended were associated with walk. Patient's age, sex, height, affected body area, reported pain intensity and catastrophizing, and physical work demands were associated with handgrip. Final regression models explained 38‒65% of total variance. Clinician and country random effects composed 1-39% of total residual variance in these models. Conclusion Biopsychosocial factors were associated with every FCE result across multiple countries; specifically, patients' height, reported pain intensity, clinician, and measurement country. Social factors, which had been under-researched, were consistently associated with FCE performances. Patients' FCE results should be considered from a biopsychosocial perspective, including different social contexts.
Meng, Xing; Jiang, Rongtao; Lin, Dongdong; Bustillo, Juan; Jones, Thomas; Chen, Jiayu; Yu, Qingbao; Du, Yuhui; Zhang, Yu; Jiang, Tianzi; Sui, Jing; Calhoun, Vince D.
2016-01-01
Neuroimaging techniques have greatly enhanced the understanding of neurodiversity (human brain variation across individuals) in both health and disease. The ultimate goal of using brain imaging biomarkers is to perform individualized predictions. Here we proposed a generalized framework that can predict explicit values of the targeted measures by taking advantage of joint information from multiple modalities. This framework also enables whole brain voxel-wise searching by combining multivariate techniques such as ReliefF, clustering, correlation-based feature selection and multiple regression models, which is more flexible and can achieve better prediction performance than alternative atlas-based methods. For 50 healthy controls and 47 schizophrenia patients, three kinds of features derived from resting-state fMRI (fALFF), sMRI (gray matter) and DTI (fractional anisotropy) were extracted and fed into a regression model, achieving high prediction for both cognitive scores (MCCB composite r = 0.7033, MCCB social cognition r = 0.7084) and symptomatic scores (positive and negative syndrome scale [PANSS] positive r = 0.7785, PANSS negative r = 0.7804). Moreover, the brain areas likely responsible for cognitive deficits of schizophrenia, including middle temporal gyrus, dorsolateral prefrontal cortex, striatum, cuneus and cerebellum, were located with different weights, as well as regions predicting PANSS symptoms, including thalamus, striatum and inferior parietal lobule, pinpointing the potential neuromarkers. Finally, compared to a single modality, multimodal combination achieves higher prediction accuracy and enables individualized prediction on multiple clinical measures. There is more work to be done, but the current results highlight the potential utility of multimodal brain imaging biomarkers to eventually inform clinical decision-making. PMID:27177764
NASA Astrophysics Data System (ADS)
Taştan Kırık, Özgecan
2013-12-01
This study explores the science teaching efficacy beliefs of pr-service elementary teachers and the relationship between efficacy beliefs and multiple factors such as antecedent factors (participation in extracurricular activities and number of science and science teaching methods courses taken), conceptual understanding, classroom management beliefs and science teaching attitudes. Science education majors ( n = 71) and elementary education majors ( n = 262) were compared with respect to these variables. Finally, the predictors of two constructs of science teaching efficacy beliefs, personal science teaching efficacy (PSTE) and science teaching outcome expectancy (STOE), were examined by multiple linear regression analysis. According to the results, participation in extracurricular activities has a significant but low correlation with science concept knowledge, science teaching attitudes, PSTE and STOE. In addition, there is a small but significant correlation between science concept knowledge and outcome expectancy, which leads the idea that preservice elementary teachers' conceptual understanding in science contributes to their science teaching self-efficacy. This study reveals a moderate correlation between science teaching attitudes and STOE and a high correlation between science teaching attitudes and PSTE. Additionally, although the correlation coefficient is low, the number of methodology courses was found to be one of the correlates of science teaching attitudes. Furthermore, students of both majors generally had positive self-efficacy beliefs on both the STOE and PSTE. Specifically, science education majors had higher science teaching self-efficacy than elementary education majors. Regression results showed that science teaching attitude is the major factor in predicting both PSTE and STOE for both groups.
Kuiper, Gerhardus J A J M; Houben, Rik; Wetzels, Rick J H; Verhezen, Paul W M; Oerle, Rene van; Ten Cate, Hugo; Henskens, Yvonne M C; Lancé, Marcus D
2017-11-01
Low platelet counts and hematocrit levels hinder whole blood point-of-care testing of platelet function. Thus far, no reference ranges for MEA (multiple electrode aggregometry) and PFA-100 (platelet function analyzer 100) devices exist for low ranges. Through dilution methods of volunteer whole blood, platelet function at low ranges of platelet count and hematocrit levels was assessed on MEA for four agonists and for PFA-100 in two cartridges. Using (multiple) regression analysis, 95% reference intervals were computed for these low ranges. Low platelet counts affected MEA in a positive correlation (all agonists showed r 2 ≥ 0.75) and PFA-100 in an inverse correlation (closure times were prolonged with lower platelet counts). Lowered hematocrit did not affect MEA testing, except for arachidonic acid activation (ASPI), which showed a weak positive correlation (r 2 = 0.14). Closure time on PFA-100 testing was inversely correlated with hematocrit for both cartridges. Regression analysis revealed different 95% reference intervals in comparison with originally established intervals for both MEA and PFA-100 in low platelet or hematocrit conditions. Multiple regression analysis of ASPI and both tests on the PFA-100 for combined low platelet and hematocrit conditions revealed that only PFA-100 testing should be adjusted for both thrombocytopenia and anemia. 95% reference intervals were calculated using multiple regression analysis. However, coefficients of determination of PFA-100 were poor, and some variance remained unexplained. Thus, in this pilot study using (multiple) regression analysis, we could establish reference intervals of platelet function in anemia and thrombocytopenia conditions on PFA-100 and in thrombocytopenia conditions on MEA.
Li, Michael Jonathan; Distefano, Anthony; Mouttapa, Michele; Gill, Jasmeet K
2014-02-01
The present study aimed to determine whether the experience of bias-motivated bullying was associated with behaviors known to increase the risk of HIV infection among young men who have sex with men (YMSM) aged 18-29, and to assess whether the psychosocial problems moderated this relationship. Using an Internet-based direct marketing approach in sampling, we recruited 545 YMSM residing in the USA to complete an online questionnaire. Multiple linear regression analyses tested three regression models where we controlled for sociodemographics. The first model indicated that bullying during high school was associated with unprotected receptive anal intercourse within the past 12 months, while the second model indicated that bullying after high school was associated with engaging in anal intercourse while under the influence of drugs or alcohol in the past 12 months. In the final regression model, our composite measure of HIV risk behavior was found to be associated with lifetime verbal harassment. None of the psychosocial problems measured in this study - depression, low self-esteem, and internalized homonegativity - moderated any of the associations between bias-motivated bullying victimization and HIV risk behaviors in our regression models. Still, these findings provide novel evidence that bullying prevention programs in schools and communities should be included in comprehensive approaches to HIV prevention among YMSM.
As a fast and effective technique, the multiple linear regression (MLR) method has been widely used in modeling and prediction of beach bacteria concentrations. Among previous works on this subject, however, several issues were insufficiently or inconsistently addressed. Those is...
MULTIPLE REGRESSION MODELS FOR HINDCASTING AND FORECASTING MIDSUMMER HYPOXIA IN THE GULF OF MEXICO
A new suite of multiple regression models were developed that describe the relationship between the area of bottom water hypoxia along the northern Gulf of Mexico and Mississippi-Atchafalaya River nitrate concentration, total phosphorus (TP) concentration, and discharge. Variabil...
Khalil, Mohamed H.; Shebl, Mostafa K.; Kosba, Mohamed A.; El-Sabrout, Karim; Zaki, Nesma
2016-01-01
Aim: This research was conducted to determine the most affecting parameters on hatchability of indigenous and improved local chickens’ eggs. Materials and Methods: Five parameters were studied (fertility, early and late embryonic mortalities, shape index, egg weight, and egg weight loss) on four strains, namely Fayoumi, Alexandria, Matrouh, and Montazah. Multiple linear regression was performed on the studied parameters to determine the most influencing one on hatchability. Results: The results showed significant differences in commercial and scientific hatchability among strains. Alexandria strain has the highest significant commercial hatchability (80.70%). Regarding the studied strains, highly significant differences in hatching chick weight among strains were observed. Using multiple linear regression analysis, fertility made the greatest percent contribution (71.31%) to hatchability, and the lowest percent contributions were made by shape index and egg weight loss. Conclusion: A prediction of hatchability using multiple regression analysis could be a good tool to improve hatchability percentage in chickens. PMID:27651666
Mean centering, multicollinearity, and moderators in multiple regression: The reconciliation redux.
Iacobucci, Dawn; Schneider, Matthew J; Popovich, Deidre L; Bakamitsos, Georgios A
2017-02-01
In this article, we attempt to clarify our statements regarding the effects of mean centering. In a multiple regression with predictors A, B, and A × B (where A × B serves as an interaction term), mean centering A and B prior to computing the product term can clarify the regression coefficients (which is good) and the overall model fit R 2 will remain undisturbed (which is also good).
2013-01-01
application of the Hammett equation with the constants rph in the chemistry of organophosphorus compounds, Russ. Chem. Rev. 38 (1969) 795–811. [13...of oximes and OP compounds and the ability of oximes to reactivate OP- inhibited AChE. Multiple linear regression equations were analyzed using...phosphonate pairs, 21 oxime/ phosphoramidate pairs and 12 oxime/phosphate pairs. The best linear regression equation resulting from multiple regression anal
He, Dan; Kuhn, David; Parida, Laxmi
2016-06-15
Given a set of biallelic molecular markers, such as SNPs, with genotype values encoded numerically on a collection of plant, animal or human samples, the goal of genetic trait prediction is to predict the quantitative trait values by simultaneously modeling all marker effects. Genetic trait prediction is usually represented as linear regression models. In many cases, for the same set of samples and markers, multiple traits are observed. Some of these traits might be correlated with each other. Therefore, modeling all the multiple traits together may improve the prediction accuracy. In this work, we view the multitrait prediction problem from a machine learning angle: as either a multitask learning problem or a multiple output regression problem, depending on whether different traits share the same genotype matrix or not. We then adapted multitask learning algorithms and multiple output regression algorithms to solve the multitrait prediction problem. We proposed a few strategies to improve the least square error of the prediction from these algorithms. Our experiments show that modeling multiple traits together could improve the prediction accuracy for correlated traits. The programs we used are either public or directly from the referred authors, such as MALSAR (http://www.public.asu.edu/~jye02/Software/MALSAR/) package. The Avocado data set has not been published yet and is available upon request. dhe@us.ibm.com. © The Author 2016. Published by Oxford University Press.
Simple and multiple linear regression: sample size considerations.
Hanley, James A
2016-11-01
The suggested "two subjects per variable" (2SPV) rule of thumb in the Austin and Steyerberg article is a chance to bring out some long-established and quite intuitive sample size considerations for both simple and multiple linear regression. This article distinguishes two of the major uses of regression models that imply very different sample size considerations, neither served well by the 2SPV rule. The first is etiological research, which contrasts mean Y levels at differing "exposure" (X) values and thus tends to focus on a single regression coefficient, possibly adjusted for confounders. The second research genre guides clinical practice. It addresses Y levels for individuals with different covariate patterns or "profiles." It focuses on the profile-specific (mean) Y levels themselves, estimating them via linear compounds of regression coefficients and covariates. By drawing on long-established closed-form variance formulae that lie beneath the standard errors in multiple regression, and by rearranging them for heuristic purposes, one arrives at quite intuitive sample size considerations for both research genres. Copyright © 2016 Elsevier Inc. All rights reserved.
Multiple imputation for cure rate quantile regression with censored data.
Wu, Yuanshan; Yin, Guosheng
2017-03-01
The main challenge in the context of cure rate analysis is that one never knows whether censored subjects are cured or uncured, or whether they are susceptible or insusceptible to the event of interest. Considering the susceptible indicator as missing data, we propose a multiple imputation approach to cure rate quantile regression for censored data with a survival fraction. We develop an iterative algorithm to estimate the conditionally uncured probability for each subject. By utilizing this estimated probability and Bernoulli sample imputation, we can classify each subject as cured or uncured, and then employ the locally weighted method to estimate the quantile regression coefficients with only the uncured subjects. Repeating the imputation procedure multiple times and taking an average over the resultant estimators, we obtain consistent estimators for the quantile regression coefficients. Our approach relaxes the usual global linearity assumption, so that we can apply quantile regression to any particular quantile of interest. We establish asymptotic properties for the proposed estimators, including both consistency and asymptotic normality. We conduct simulation studies to assess the finite-sample performance of the proposed multiple imputation method and apply it to a lung cancer study as an illustration. © 2016, The International Biometric Society.
Undergraduate Student Motivation in Modularized Developmental Mathematics Courses
ERIC Educational Resources Information Center
Pachlhofer, Keith A.
2017-01-01
This study used the Motivated Strategies for Learning Questionnaire in modularized courses at three institutions across the nation (N = 189), and multiple regression was completed to investigate five categories of student motivation that predicted academic success and course completion. The overall multiple regression analysis was significant and…
MULGRES: a computer program for stepwise multiple regression analysis
A. Jeff Martin
1971-01-01
MULGRES is a computer program source deck that is designed for multiple regression analysis employing the technique of stepwise deletion in the search for most significant variables. The features of the program, along with inputs and outputs, are briefly described, with a note on machine compatibility.
Categorical Variables in Multiple Regression: Some Cautions.
ERIC Educational Resources Information Center
O'Grady, Kevin E.; Medoff, Deborah R.
1988-01-01
Limitations of dummy coding and nonsense coding as methods of coding categorical variables for use as predictors in multiple regression analysis are discussed. The combination of these approaches often yields estimates and tests of significance that are not intended by researchers for inclusion in their models. (SLD)
Multiple Imputation of a Randomly Censored Covariate Improves Logistic Regression Analysis.
Atem, Folefac D; Qian, Jing; Maye, Jacqueline E; Johnson, Keith A; Betensky, Rebecca A
2016-01-01
Randomly censored covariates arise frequently in epidemiologic studies. The most commonly used methods, including complete case and single imputation or substitution, suffer from inefficiency and bias. They make strong parametric assumptions or they consider limit of detection censoring only. We employ multiple imputation, in conjunction with semi-parametric modeling of the censored covariate, to overcome these shortcomings and to facilitate robust estimation. We develop a multiple imputation approach for randomly censored covariates within the framework of a logistic regression model. We use the non-parametric estimate of the covariate distribution or the semiparametric Cox model estimate in the presence of additional covariates in the model. We evaluate this procedure in simulations, and compare its operating characteristics to those from the complete case analysis and a survival regression approach. We apply the procedures to an Alzheimer's study of the association between amyloid positivity and maternal age of onset of dementia. Multiple imputation achieves lower standard errors and higher power than the complete case approach under heavy and moderate censoring and is comparable under light censoring. The survival regression approach achieves the highest power among all procedures, but does not produce interpretable estimates of association. Multiple imputation offers a favorable alternative to complete case analysis and ad hoc substitution methods in the presence of randomly censored covariates within the framework of logistic regression.
Learning to be cruel?: exploring the onset and frequency of animal cruelty.
Hensley, Christopher; Tallichet, Suzanne E
2005-02-01
Few studies have examined how animal cruelty is learned within a specific social context among incarcerated individuals. Using data from 261 inmates, this study specifically addressed how demographic characteristics and childhood experiences with animal abuse may have affected the recurrence and onset of childhood and adolescent cruelty as a learned behavior. Multiple regression analyses revealed that inmates who experienced animal cruelty at a younger age were more likely to demonstrate recurrent animal cruelty themselves. In addition, respondents who observed a friend abuse animals were more likely to hurt or kill animals more frequently. Finally, inmates who were younger when they first witnessed animal cruelty also hurt or killed animals at a younger age.
DOE Office of Scientific and Technical Information (OSTI.GOV)
Holmes, R.W.
1986-10-10
The present study was designed to establish quantitative relationships between lake air-equilibrated pH, alkalinity, and diatoms occurring in the surface sediments in high-elevation Sierra Nevada Lakes. These relationships provided the necessary information to develop predictive equations relating lake pH to the composition of surface-sediment diatom assemblages in 27 study lakes. Using the Hustedt diatom pH classification system, Index B of Renberg and Hellberg, and multiple linear regression analysis, two equations were developed which predict lake pH from the relative abundance of sediment diatoms occurring in each of four diatom pH groupings.
Air Pollutants, Climate, and the Prevalence of Pediatric Asthma in Urban Areas of China
Zhang, Juanjuan; Yan, Li; Fu, Wenlong; Yi, Jing; Chen, Yuzhi; Liu, Chuanhe; Xu, Dongqun; Wang, Qiang
2016-01-01
Background. Prevalence of childhood asthma varies significantly among regions, while its reasons are not clear yet with only a few studies reporting relevant causes for this variation. Objective. To investigate the potential role of city-average levels of air pollutants and climatic factors in order to distinguish differences in asthma prevalence in China and explain their reasons. Methods. Data pertaining to 10,777 asthmatic patients were obtained from the third nationwide survey of childhood asthma in China's urban areas. Annual mean concentrations of air pollutants and other climatic factors were obtained for the same period from several government departments. Data analysis was implemented with descriptive statistics, Pearson correlation coefficient, and multiple regression analysis. Results. Pearson correlation analysis showed that the situation of childhood asthma was strongly linked with SO2, relative humidity, and hours of sunshine (p < 0.05). Multiple regression analysis indicated that, among the predictor variables in the final step, SO2 was found to be the most powerful predictor variable amongst all (β = −19.572, p < 0.05). Furthermore, results had shown that hours of sunshine (β = −0.014, p < 0.05) was a significant component summary predictor variable. Conclusion. The findings of this study do not suggest that air pollutants or climate, at least in terms of children, plays a major role in explaining regional differences in asthma prevalence in China. PMID:27556031
An Application of Robust Method in Multiple Linear Regression Model toward Credit Card Debt
NASA Astrophysics Data System (ADS)
Amira Azmi, Nur; Saifullah Rusiman, Mohd; Khalid, Kamil; Roslan, Rozaini; Sufahani, Suliadi; Mohamad, Mahathir; Salleh, Rohayu Mohd; Hamzah, Nur Shamsidah Amir
2018-04-01
Credit card is a convenient alternative replaced cash or cheque, and it is essential component for electronic and internet commerce. In this study, the researchers attempt to determine the relationship and significance variables between credit card debt and demographic variables such as age, household income, education level, years with current employer, years at current address, debt to income ratio and other debt. The provided data covers 850 customers information. There are three methods that applied to the credit card debt data which are multiple linear regression (MLR) models, MLR models with least quartile difference (LQD) method and MLR models with mean absolute deviation method. After comparing among three methods, it is found that MLR model with LQD method became the best model with the lowest value of mean square error (MSE). According to the final model, it shows that the years with current employer, years at current address, household income in thousands and debt to income ratio are positively associated with the amount of credit debt. Meanwhile variables for age, level of education and other debt are negatively associated with amount of credit debt. This study may serve as a reference for the bank company by using robust methods, so that they could better understand their options and choice that is best aligned with their goals for inference regarding to the credit card debt.
Physical victimization, gender identity and suicide risk among transgender men and women.
Barboza, Gia Elise; Dominguez, Silvia; Chance, Elena
2016-12-01
We investigated whether being attacked physically due to one's gender identity or expression was associated with suicide risk among trans men and women living in Virginia. The sample consisted of 350 transgender men and women who participated in the Virginia Transgender Health Initiative Survey (THIS). Multivariate multinomial logistic regression was used to explore the competing outcomes associated with suicidal risk. Thirty-seven percent of trans men and women experienced at least one physical attack since the age of 13. On average, individuals experienced 3.97 (SD = 2.86) physical attacks; among these about half were attributed to one's gender identity or expression (mean = 2.08, SD = 1.96). In the multivariate multinomial regression, compared to those with no risk, being physically attacked increased the odds of both attempting and contemplating suicide regardless of gender attribution. Nevertheless, the relative impact of physical victimization on suicidal behavior was higher among those who were targeted on the basis of their gender identity or expression. Finally, no significant association was found between multiple measures of institutional discrimination and suicide risk once discriminatory and non-discriminatory physical victimization was taken into account. Trans men and women experience high levels of physical abuse and face multiple forms of discrimination. They are also at an increased risk for suicidal tendencies. Interventions that help transindividuals cope with discrimination and physical victimization simultaneously may be more effective in saving lives.
Fu, Liya; Wang, You-Gan
2011-02-15
Environmental data usually include measurements, such as water quality data, which fall below detection limits, because of limitations of the instruments or of certain analytical methods used. The fact that some responses are not detected needs to be properly taken into account in statistical analysis of such data. However, it is well-known that it is challenging to analyze a data set with detection limits, and we often have to rely on the traditional parametric methods or simple imputation methods. Distributional assumptions can lead to biased inference and justification of distributions is often not possible when the data are correlated and there is a large proportion of data below detection limits. The extent of bias is usually unknown. To draw valid conclusions and hence provide useful advice for environmental management authorities, it is essential to develop and apply an appropriate statistical methodology. This paper proposes rank-based procedures for analyzing non-normally distributed data collected at different sites over a period of time in the presence of multiple detection limits. To take account of temporal correlations within each site, we propose an optimal linear combination of estimating functions and apply the induced smoothing method to reduce the computational burden. Finally, we apply the proposed method to the water quality data collected at Susquehanna River Basin in United States of America, which clearly demonstrates the advantages of the rank regression models.
Advanced Statistics for Exotic Animal Practitioners.
Hodsoll, John; Hellier, Jennifer M; Ryan, Elizabeth G
2017-09-01
Correlation and regression assess the association between 2 or more variables. This article reviews the core knowledge needed to understand these analyses, moving from visual analysis in scatter plots through correlation, simple and multiple linear regression, and logistic regression. Correlation estimates the strength and direction of a relationship between 2 variables. Regression can be considered more general and quantifies the numerical relationships between an outcome and 1 or multiple variables in terms of a best-fit line, allowing predictions to be made. Each technique is discussed with examples and the statistical assumptions underlying their correct application. Copyright © 2017 Elsevier Inc. All rights reserved.
He, Jie; Zhao, Yunfeng; Zhao, Jingli; Gao, Jin; Han, Dandan; Xu, Pao; Yang, Runqing
2017-11-02
Because of their high economic importance, growth traits in fish are under continuous improvement. For growth traits that are recorded at multiple time-points in life, the use of univariate and multivariate animal models is limited because of the variable and irregular timing of these measures. Thus, the univariate random regression model (RRM) was introduced for the genetic analysis of dynamic growth traits in fish breeding. We used a multivariate random regression model (MRRM) to analyze genetic changes in growth traits recorded at multiple time-point of genetically-improved farmed tilapia. Legendre polynomials of different orders were applied to characterize the influences of fixed and random effects on growth trajectories. The final MRRM was determined by optimizing the univariate RRM for the analyzed traits separately via penalizing adaptively the likelihood statistical criterion, which is superior to both the Akaike information criterion and the Bayesian information criterion. In the selected MRRM, the additive genetic effects were modeled by Legendre polynomials of three orders for body weight (BWE) and body length (BL) and of two orders for body depth (BD). By using the covariance functions of the MRRM, estimated heritabilities were between 0.086 and 0.628 for BWE, 0.155 and 0.556 for BL, and 0.056 and 0.607 for BD. Only heritabilities for BD measured from 60 to 140 days of age were consistently higher than those estimated by the univariate RRM. All genetic correlations between growth time-points exceeded 0.5 for either single or pairwise time-points. Moreover, correlations between early and late growth time-points were lower. Thus, for phenotypes that are measured repeatedly in aquaculture, an MRRM can enhance the efficiency of the comprehensive selection for BWE and the main morphological traits.
Botto, Fernando; Obregon, Sebastian; Rubinstein, Fernando; Scuteri, Angelo; Nilsson, Peter M; Kotliar, Carol
2018-03-01
The main objective was to estimate the frequency of early vascular aging (EVA) in a sample of subjects from Latin America, with emphasis in young adults. We included 1416 subjects from 12 countries in Latin America who provided information about lifestyle, cardiovascular risk factors (CVRF), and anthropometrics. We measured pulse wave velocity (PWV) as a marker of arterial stiffness, and blood pressure (BP) using an oscillometric device (Mobil-O-Graph). To determine the frequency of EVA, we used multiple linear regression to estimate each subject's PWV expected for his/her age and systolic BP, and compared with observed values to obtain standardized residuals (z-scores). We defined EVA when z-score was ≥1.96. Finally, a multivariable logistic regression analysis was performed to determine baseline characteristics associated with EVA. Mean age was 49.9 ± 15.5 years, male gender was 50.3%. Mean PWV was 7.52 m/s (SD 1.97), mean systolic BP was 125.3 mmHg (SD 16.7) and mean diastolic BP was 78.9 mmHg (SD 12.2). The frequency of EVA was 5.7% in the total population, 9.8% in adults of 40 years or less and 18.7% in those 30 years or less. In these young adults, multiple logistic regression analyses demonstrated that dyslipidemia and hypertension showed an independent association with EVA, and smoking a borderline association (p = 0.07). In conclusion, the frequency of EVA in a sample from Latin America was around 6%, with higher rates in young adults. These results would support the search of CVRF and EVA during early adulthood.
Aryanpur, Mahshid; Masjedi, Mohammad Reza; Mortaz, Esmaeil; Hosseini, Mostafa; Jamaati, Hmidreza; Tabarsi, Payam; Soori, Hamid; Heydari, Gholam Reza; Kazempour-Dizaji, Mehdi; Emami, Habib; Mozafarian, Alireza
2016-01-01
Several studies have shown that smoking, as a modifiable risk factor, can affect tuberculosis (TB) in different aspects such as enhancing development of TB infection, activation of latent TB and its related mortality. Since willingness to quit smoking is a critical stage, which may lead to quit attempts, being aware of smokers' intention to quit and the related predictors can provide considerable advantages. In this cross-sectional study, subjects were recruited via a multi-stage cluster sampling method. Sampling was performed during 2012-2014 among pulmonary TB (PTB) patients referred to health centers in Tehran implementing the directly observed treatment short course (DOTS) strategy and a TB referral center. Data analysis was conducted using SPSS version 22 and the factors influencing quit intention were assessed using bivariate regression and multiple logistic regression models. In this study 1,127 newly diagnosed PTB patients were studied; from which 284 patients (22%) were current smokers. When diagnosed with TB, 59 (23.8%) smokers quit smoking. Among the remaining 189 (76.2%) patients who continued smoking, 52.4% had intention to quit. In the final multiple logistic regression model, living in urban areas (OR=8.81, P=0.003), having an office job (OR= 7.34, P=0.001), being single (OR=4.89, P=0.016) and a one unit increase in the motivation degree (OR=2.60, P<0.001) were found to increase the intention to quit smoking. The study found that PTB patients who continued smoking had remarkable intention to quit. Thus, it is recommended that smoking cessation interventions should be started at the time of TB diagnosis. Understanding the associated factors can guide the consultants to predict patients' intention to quit and select the most proper management to facilitate smoking cessation for each patient.
Bayesian LASSO, scale space and decision making in association genetics.
Pasanen, Leena; Holmström, Lasse; Sillanpää, Mikko J
2015-01-01
LASSO is a penalized regression method that facilitates model fitting in situations where there are as many, or even more explanatory variables than observations, and only a few variables are relevant in explaining the data. We focus on the Bayesian version of LASSO and consider four problems that need special attention: (i) controlling false positives, (ii) multiple comparisons, (iii) collinearity among explanatory variables, and (iv) the choice of the tuning parameter that controls the amount of shrinkage and the sparsity of the estimates. The particular application considered is association genetics, where LASSO regression can be used to find links between chromosome locations and phenotypic traits in a biological organism. However, the proposed techniques are relevant also in other contexts where LASSO is used for variable selection. We separate the true associations from false positives using the posterior distribution of the effects (regression coefficients) provided by Bayesian LASSO. We propose to solve the multiple comparisons problem by using simultaneous inference based on the joint posterior distribution of the effects. Bayesian LASSO also tends to distribute an effect among collinear variables, making detection of an association difficult. We propose to solve this problem by considering not only individual effects but also their functionals (i.e. sums and differences). Finally, whereas in Bayesian LASSO the tuning parameter is often regarded as a random variable, we adopt a scale space view and consider a whole range of fixed tuning parameters, instead. The effect estimates and the associated inference are considered for all tuning parameters in the selected range and the results are visualized with color maps that provide useful insights into data and the association problem considered. The methods are illustrated using two sets of artificial data and one real data set, all representing typical settings in association genetics.
Sritara, C; Thakkinstian, A; Ongphiphadhanakul, B; Chailurkit, L; Chanprasertyothin, S; Ratanachaiwong, W; Vathesatogkit, P; Sritara, P
2014-05-01
Using mediation analysis, a causal relationship between the AHSG gene and bone mineral density (BMD) through fetuin-A and body mass index (BMI) mediators was suggested. Fetuin-A, a multifunctional protein of hepatic origin, is associated with bone mineral density. It is unclear if this association is causal. This study aimed at clarification of this issue. A cross-sectional study was conducted among 1,741 healthy workers from the Electricity Generating Authority of Thailand (EGAT) cohort. The alpha-2-Heremans-Schmid glycoprotein (AHSG) rs2248690 gene was genotyped. Three mediation models were constructed using seemingly unrelated regression analysis. First, the ln[fetuin-A] group was regressed on the AHSG gene. Second, the BMI group was regressed on the AHSG gene and the ln[fetuin-A] group. Finally, the BMD model was constructed by fitting BMD on two mediators (ln[fetuin-A] and BMI) and the independent AHSG variable. All three analyses were adjusted for confounders. The prevalence of the minor T allele for the AHSG locus was 15.2%. The AHSG locus was highly related to serum fetuin-A levels (P < 0.001). Multiple mediation analyses showed that AHSG was significantly associated with BMD through the ln[fetuin-A] and BMI pathway, with beta coefficients of 0.0060 (95% CI 0.0038, 0.0083) and 0.0030 (95% CI 0.0020, 0.0045) at the total hip and lumbar spine, respectively. About 27.3 and 26.0% of total genetic effects on hip and spine BMD, respectively, were explained by the mediation effects of fetuin-A and BMI. Our study suggested evidence of a causal relationship between the AHSG gene and BMD through fetuin-A and BMI mediators.
Use of Thematic Mapper for water quality assessment
NASA Technical Reports Server (NTRS)
Horn, E. M.; Morrissey, L. A.
1984-01-01
The evaluation of simulated TM data obtained on an ER-2 aircraft at twenty-five predesignated sample sites for mapping water quality factors such as conductivity, pH, suspended solids, turbidity, temperature, and depth, is discussed. Using a multiple regression for the seven TM bands, an equation is developed for the suspended solids. TM bands 1, 2, 3, 4, and 6 are used with logarithm conductivity in a multiple regression. The assessment of regression equations for a high coefficient of determination (R-squared) and statistical significance is considered. Confidence intervals about the mean regression point are calculated in order to assess the robustness of the regressions used for mapping conductivity, turbidity, and suspended solids, and by regressing random subsamples of sites and comparing the resultant range of R-squared, cross validation is conducted.
Due to the complexity of the processes contributing to beach bacteria concentrations, many researchers rely on statistical modeling, among which multiple linear regression (MLR) modeling is most widely used. Despite its ease of use and interpretation, there may be time dependence...
Data from the Interagency Monitoring of Protected Visual Environments (IMPROVE) network are used to estimate organic mass to organic carbon (OM/OC) ratios across the United States by extending previously published multiple regression techniques. Our new methodology addresses com...
Analysis and Interpretation of Findings Using Multiple Regression Techniques
ERIC Educational Resources Information Center
Hoyt, William T.; Leierer, Stephen; Millington, Michael J.
2006-01-01
Multiple regression and correlation (MRC) methods form a flexible family of statistical techniques that can address a wide variety of different types of research questions of interest to rehabilitation professionals. In this article, we review basic concepts and terms, with an emphasis on interpretation of findings relevant to research questions…
Tracking the Gender Pay Gap: A Case Study
ERIC Educational Resources Information Center
Travis, Cheryl B.; Gross, Louis J.; Johnson, Bruce A.
2009-01-01
This article provides a short introduction to standard considerations in the formal study of wages and illustrates the use of multiple regression and resampling simulation approaches in a case study of faculty salaries at one university. Multiple regression is especially beneficial where it provides information on strength of association, specific…
Estimating air drying times of lumber with multiple regression
William T. Simpson
2004-01-01
In this study, the applicability of a multiple regression equation for estimating air drying times of red oak, sugar maple, and ponderosa pine lumber was evaluated. The equation allows prediction of estimated air drying times from historic weather records of temperature and relative humidity at any desired location.
Using Robust Variance Estimation to Combine Multiple Regression Estimates with Meta-Analysis
ERIC Educational Resources Information Center
Williams, Ryan
2013-01-01
The purpose of this study was to explore the use of robust variance estimation for combining commonly specified multiple regression models and for combining sample-dependent focal slope estimates from diversely specified models. The proposed estimator obviates traditionally required information about the covariance structure of the dependent…
Multiple Regression: A Leisurely Primer.
ERIC Educational Resources Information Center
Daniel, Larry G.; Onwuegbuzie, Anthony J.
Multiple regression is a useful statistical technique when the researcher is considering situations in which variables of interest are theorized to be multiply caused. It may also be useful in those situations in which the researchers is interested in studies of predictability of phenomena of interest. This paper provides an introduction to…
Using Monte Carlo Techniques to Demonstrate the Meaning and Implications of Multicollinearity
ERIC Educational Resources Information Center
Vaughan, Timothy S.; Berry, Kelly E.
2005-01-01
This article presents an in-class Monte Carlo demonstration, designed to demonstrate to students the implications of multicollinearity in a multiple regression study. In the demonstration, students already familiar with multiple regression concepts are presented with a scenario in which the "true" relationship between the response and…
ERIC Educational Resources Information Center
Bates, Reid A.; Holton, Elwood F., III; Burnett, Michael F.
1999-01-01
A case study of learning transfer demonstrates the possible effect of influential observation on linear regression analysis. A diagnostic method that tests for violation of assumptions, multicollinearity, and individual and multiple influential observations helps determine which observation to delete to eliminate bias. (SK)
Afantitis, Antreas; Melagraki, Georgia; Sarimveis, Haralambos; Koutentis, Panayiotis A; Markopoulos, John; Igglessi-Markopoulou, Olga
2006-08-01
A quantitative-structure activity relationship was obtained by applying Multiple Linear Regression Analysis to a series of 80 1-[2-hydroxyethoxy-methyl]-6-(phenylthio) thymine (HEPT) derivatives with significant anti-HIV activity. For the selection of the best among 37 different descriptors, the Elimination Selection Stepwise Regression Method (ES-SWR) was utilized. The resulting QSAR model (R (2) (CV) = 0.8160; S (PRESS) = 0.5680) proved to be very accurate both in training and predictive stages.
Amagasa, Takashi; Nakayama, Takeo
2013-08-01
To clarify how long working hours affect the likelihood of current and future depression. Using data from four repeated measurements collected from 218 clerical workers, four models associating work-related factors to the depressive mood scale were established. The final model was constructed after comparing and testing the goodness-of-fit index using structural equation modeling. Multiple logistic regression analysis was also performed. The final model showed the best fit (normed fit index = 0.908; goodness-of-fit index = 0.936; root-mean-square error of approximation = 0.018). Its standardized total effect indicated that long working hours affected depression at the time of evaluation and 1 to 3 years later. The odds ratio for depression risk was 14.7 in employees who were not long-hours overworked according to the initial survey but who were long-hours overworked according to the second survey. Long working hours increase current and future risks of depression.
Riley, Richard D; Ensor, Joie; Jackson, Dan; Burke, Danielle L
2017-01-01
Many meta-analysis models contain multiple parameters, for example due to multiple outcomes, multiple treatments or multiple regression coefficients. In particular, meta-regression models may contain multiple study-level covariates, and one-stage individual participant data meta-analysis models may contain multiple patient-level covariates and interactions. Here, we propose how to derive percentage study weights for such situations, in order to reveal the (otherwise hidden) contribution of each study toward the parameter estimates of interest. We assume that studies are independent, and utilise a decomposition of Fisher's information matrix to decompose the total variance matrix of parameter estimates into study-specific contributions, from which percentage weights are derived. This approach generalises how percentage weights are calculated in a traditional, single parameter meta-analysis model. Application is made to one- and two-stage individual participant data meta-analyses, meta-regression and network (multivariate) meta-analysis of multiple treatments. These reveal percentage study weights toward clinically important estimates, such as summary treatment effects and treatment-covariate interactions, and are especially useful when some studies are potential outliers or at high risk of bias. We also derive percentage study weights toward methodologically interesting measures, such as the magnitude of ecological bias (difference between within-study and across-study associations) and the amount of inconsistency (difference between direct and indirect evidence in a network meta-analysis).
Nie, Z Q; Ou, Y Q; Zhuang, J; Qu, Y J; Mai, J Z; Chen, J M; Liu, X Q
2016-05-01
Conditional logistic regression analysis and unconditional logistic regression analysis are commonly used in case control study, but Cox proportional hazard model is often used in survival data analysis. Most literature only refer to main effect model, however, generalized linear model differs from general linear model, and the interaction was composed of multiplicative interaction and additive interaction. The former is only statistical significant, but the latter has biological significance. In this paper, macros was written by using SAS 9.4 and the contrast ratio, attributable proportion due to interaction and synergy index were calculated while calculating the items of logistic and Cox regression interactions, and the confidence intervals of Wald, delta and profile likelihood were used to evaluate additive interaction for the reference in big data analysis in clinical epidemiology and in analysis of genetic multiplicative and additive interactions.
Wavelet regression model in forecasting crude oil price
NASA Astrophysics Data System (ADS)
Hamid, Mohd Helmie; Shabri, Ani
2017-05-01
This study presents the performance of wavelet multiple linear regression (WMLR) technique in daily crude oil forecasting. WMLR model was developed by integrating the discrete wavelet transform (DWT) and multiple linear regression (MLR) model. The original time series was decomposed to sub-time series with different scales by wavelet theory. Correlation analysis was conducted to assist in the selection of optimal decomposed components as inputs for the WMLR model. The daily WTI crude oil price series has been used in this study to test the prediction capability of the proposed model. The forecasting performance of WMLR model were also compared with regular multiple linear regression (MLR), Autoregressive Moving Average (ARIMA) and Generalized Autoregressive Conditional Heteroscedasticity (GARCH) using root mean square errors (RMSE) and mean absolute errors (MAE). Based on the experimental results, it appears that the WMLR model performs better than the other forecasting technique tested in this study.
Multiple regression for physiological data analysis: the problem of multicollinearity.
Slinker, B K; Glantz, S A
1985-07-01
Multiple linear regression, in which several predictor variables are related to a response variable, is a powerful statistical tool for gaining quantitative insight into complex in vivo physiological systems. For these insights to be correct, all predictor variables must be uncorrelated. However, in many physiological experiments the predictor variables cannot be precisely controlled and thus change in parallel (i.e., they are highly correlated). There is a redundancy of information about the response, a situation called multicollinearity, that leads to numerical problems in estimating the parameters in regression equations; the parameters are often of incorrect magnitude or sign or have large standard errors. Although multicollinearity can be avoided with good experimental design, not all interesting physiological questions can be studied without encountering multicollinearity. In these cases various ad hoc procedures have been proposed to mitigate multicollinearity. Although many of these procedures are controversial, they can be helpful in applying multiple linear regression to some physiological problems.
ERIC Educational Resources Information Center
Li, Spencer D.
2011-01-01
Mediation analysis in child and adolescent development research is possible using large secondary data sets. This article provides an overview of two statistical methods commonly used to test mediated effects in secondary analysis: multiple regression and structural equation modeling (SEM). Two empirical studies are presented to illustrate the…
A Simple and Convenient Method of Multiple Linear Regression to Calculate Iodine Molecular Constants
ERIC Educational Resources Information Center
Cooper, Paul D.
2010-01-01
A new procedure using a student-friendly least-squares multiple linear-regression technique utilizing a function within Microsoft Excel is described that enables students to calculate molecular constants from the vibronic spectrum of iodine. This method is advantageous pedagogically as it calculates molecular constants for ground and excited…
Conjoint Analysis: A Study of the Effects of Using Person Variables.
ERIC Educational Resources Information Center
Fraas, John W.; Newman, Isadore
Three statistical techniques--conjoint analysis, a multiple linear regression model, and a multiple linear regression model with a surrogate person variable--were used to estimate the relative importance of five university attributes for students in the process of selecting a college. The five attributes include: availability and variety of…
An Exploratory Study of Face-to-Face and Cyberbullying in Sixth Grade Students
ERIC Educational Resources Information Center
Accordino, Denise B.; Accordino, Michael P.
2011-01-01
In a pilot study, sixth grade students (N = 124) completed a questionnaire assessing students' experience with bullying and cyberbullying, demographic information, quality of parent-child relationship, and ways they have dealt with bullying/cyberbullying in the past. Two multiple regression analyses were conducted. The multiple regression analysis…
ERIC Educational Resources Information Center
Campbell, S. Duke; Greenberg, Barry
The development of a predictive equation capable of explaining a significant percentage of enrollment variability at Florida International University is described. A model utilizing trend analysis and a multiple regression approach to enrollment forecasting was adapted to investigate enrollment dynamics at the university. Four independent…
ERIC Educational Resources Information Center
Fraas, John W.; Newman, Isadore
1996-01-01
In a conjoint-analysis consumer-preference study, researchers must determine whether the product factor estimates, which measure consumer preferences, should be calculated and interpreted for each respondent or collectively. Multiple regression models can determine whether to aggregate data by examining factor-respondent interaction effects. This…
Double Cross-Validation in Multiple Regression: A Method of Estimating the Stability of Results.
ERIC Educational Resources Information Center
Rowell, R. Kevin
In multiple regression analysis, where resulting predictive equation effectiveness is subject to shrinkage, it is especially important to evaluate result replicability. Double cross-validation is an empirical method by which an estimate of invariance or stability can be obtained from research data. A procedure for double cross-validation is…
Vicente-Pérez, Ricardo; Avendaño-Reyes, Leonel; Mejía-Vázquez, Ángel; Álvarez-Valenzuela, F Daniel; Correa-Calderón, Abelardo; Mellado, Miguel; Meza-Herrera, Cesar A; Guerra-Liera, Juan E; Robinson, P H; Macías-Cruz, Ulises
2016-01-01
Rectal temperature (RT) is the foremost physiological variable indicating if an animal is suffering hyperthermia. However, this variable is traditionally measured by invasive methods, which may compromise animal welfare. Models to predict RT have been developed for growing pigs and lactating dairy cows, but not for pregnant heat-stressed ewes. Our aim was to develop a prediction equation for RT using non-invasive physiological variables in pregnant ewes under heat stress. A total of 192 records of respiratory frequency (RF) and hair coat temperature in various body regions (i.e., head, rump, flank, shoulder, and belly) obtained from 24 Katahdin × Pelibuey pregnant multiparous ewes were collected during the last third of gestation (i.e., d 100 to lambing) with a 15 d sampling interval. Hair coat temperatures were taken using infrared thermal imaging technology. Initially, a Pearson correlation analysis examined the relationship among variables, and then multiple linear regression analysis was used to develop the prediction equations. All predictor variables were positively correlated (P<0.01; r=0.59-0.67) with RT. The adjusted equation which best predicted RT (P<0.01; Radj(2)=56.15%; CV=0.65%) included as predictors RF and head and belly temperatures. Comparison of predicted and observed values for RT indicates a suitable agreement (P<0.01) between them with moderate accuracy (Radj(2)=56.15%) when RT was calculated with the adjusted equation. In general, the final equation does not violate any assumption of multiple regression analysis. The RT in heat-stressed pregnant ewes can be predicted with an adequate accuracy using non-invasive physiologic variables, and the final equation was: RT=35.57+0.004 (RF)+0.067 (heat temperature)+0.028 (belly temperature). Copyright © 2015 Elsevier Ltd. All rights reserved.
Information-theoretic metric as a tool to investigate nonclassical correlations
NASA Astrophysics Data System (ADS)
Rudolph, Alexander L.; Lamine, Brahim; Joyce, Michael; Vignolles, Hélène; Consiglio, David
2014-06-01
We report on a project to introduce interactive learning strategies (ILS) to physics classes at the Université Pierre et Marie Curie, one of the leading science universities in France. In Spring 2012, instructors in two large introductory classes, first-year, second-semester mechanics, and second-year introductory electricity and magnetism, enrolling approximately 500 and 250 students, respectively, introduced ILS into some, but not all, of the sections of each class. The specific ILS utilized were think-pair-share questions and Peer Instruction in the main lecture classrooms, and University of Washington Tutorials for Introductory Physics in recitation sections. Pre- and postinstruction assessments [Force Concept Inventory (FCI) and Conceptual Survey of Electricity and Magnetism (CSEM), respectively] were given, along with a series of demographic questions. Since not all lecture or recitation sections in these classes used ILS, we were able to compare the results of the FCI and CSEM between interactive and noninteractive classes taught simultaneously with the same curriculum. We also analyzed final exam results, as well as the results of student and instructor attitude surveys between classes. In our analysis, we argue that multiple linear regression modeling is superior to other common analysis tools, including normalized gain. Our results show that ILS are effective at improving student learning by all measures used: research-validated concept inventories and final exam scores, on both conceptual and traditional problem-solving questions. Multiple linear regression analysis reveals that interactivity in the classroom is a significant predictor of student learning, showing a similar or stronger relationship with student learning than such ascribed characteristics as parents’ education, and achieved characteristics such as grade point average and hours studied per week. Analysis of student and instructor attitudes shows that both groups believe that ILS improve student learning in the physics classroom and increase student engagement and motivation. All of the instructors who used ILS in this study plan to continue their use.
NASA Astrophysics Data System (ADS)
Li, Xiao Ju; Yao, Kun; Dai, Jun Yu; Song, Yun Long
2018-05-01
The underground space, also known as the “fourth dimension” of the city, reflects the efficient use of urban development intensive. Urban traffic link tunnel is a typical underground limited-length space. Due to the geographical location, the special structure of space and the curvature of the tunnel, high-temperature smoke can easily form the phenomenon of “smoke turning” and the fire risk is extremely high. This paper takes an urban traffic link tunnel as an example to focus on the relationship between curvature and the temperature near the fire source, and use the pyrosim built different curvature fire model to analyze the influence of curvature on the temperature of the fire, then using SPSS Multivariate regression analysis simulate curvature of the tunnel and fire temperature data. Finally, a prediction model of urban traffic link tunnel curvature on fire temperature was proposed. The regression model analysis and test show that the curvature is negatively correlated with the tunnel temperature. This model is feasible and can provide a theoretical reference for the urban traffic link tunnel fire protection design and the preparation of the evacuation plan. And also, it provides some reference for other related curved tunnel curvature design and smoke control measures.
Picco, Louisa; Pang, Shirlene; Lau, Ying Wen; Jeyagurunathan, Anitha; Satghare, Pratika; Abdin, Edimansyah; Vaingankar, Janhavi Ajit; Lim, Susan; Poh, Chee Lien; Chong, Siow Ann; Subramaniam, Mythily
2016-12-30
This study aimed to: (i) determine the prevalence, socio-demographic and clinical correlates of internalized stigma and (ii) explore the association between internalized stigma and quality of life, general functioning, hope and self-esteem, among a multi-ethnic Asian population of patients with mental disorders. This cross-sectional, survey recruited adult patients (n=280) who were seeking treatment at outpatient and affiliated clinics of the only tertiary psychiatric hospital in Singapore. Internalized stigma was measured using the Internalized Stigma of Mental Illness scale. 43.6% experienced moderate to high internalized stigma. After making adjustments in multiple logistic regression analysis, results revealed there were no significant socio-demographic or clinical correlates relating to internalized stigma. Individual logistic regression models found a negative relationship between quality of life, self-esteem, general functioning and internalized stigma whereby lower scores were associated with higher internalized stigma. In the final regression model, which included all psychosocial variables together, self-esteem was the only variable significantly and negatively associated with internalized stigma. The results of this study contribute to our understanding of the role internalized stigma plays in patients with mental illness, and the impact it can have on psychosocial aspects of their lives. Copyright © 2016 The Authors. Published by Elsevier Ireland Ltd.. All rights reserved.
Zhao, Zeng-hui; Wang, Wei-ming; Gao, Xin; Yan, Ji-xing
2013-01-01
According to the geological characteristics of Xinjiang Ili mine in western area of China, a physical model of interstratified strata composed of soft rock and hard coal seam was established. Selecting the tunnel position, deformation modulus, and strength parameters of each layer as influencing factors, the sensitivity coefficient of roadway deformation to each parameter was firstly analyzed based on a Mohr-Columb strain softening model and nonlinear elastic-plastic finite element analysis. Then the effect laws of influencing factors which showed high sensitivity were further discussed. Finally, a regression model for the relationship between roadway displacements and multifactors was obtained by equivalent linear regression under multiple factors. The results show that the roadway deformation is highly sensitive to the depth of coal seam under the floor which should be considered in the layout of coal roadway; deformation modulus and strength of coal seam and floor have a great influence on the global stability of tunnel; on the contrary, roadway deformation is not sensitive to the mechanical parameters of soft roof; roadway deformation under random combinations of multi-factors can be deduced by the regression model. These conclusions provide theoretical significance to the arrangement and stability maintenance of coal roadway. PMID:24459447
Bilgili, Mehmet; Sahin, Besir; Sangun, Levent
2013-01-01
The aim of this study is to estimate the soil temperatures of a target station using only the soil temperatures of neighboring stations without any consideration of the other variables or parameters related to soil properties. For this aim, the soil temperatures were measured at depths of 5, 10, 20, 50, and 100 cm below the earth surface at eight measuring stations in Turkey. Firstly, the multiple nonlinear regression analysis was performed with the "Enter" method to determine the relationship between the values of target station and neighboring stations. Then, the stepwise regression analysis was applied to determine the best independent variables. Finally, an artificial neural network (ANN) model was developed to estimate the soil temperature of a target station. According to the derived results for the training data set, the mean absolute percentage error and correlation coefficient ranged from 1.45% to 3.11% and from 0.9979 to 0.9986, respectively, while corresponding ranges of 1.685-3.65% and 0.9988-0.9991, respectively, were obtained based on the testing data set. The obtained results show that the developed ANN model provides a simple and accurate prediction to determine the soil temperature. In addition, the missing data at the target station could be determined within a high degree of accuracy.
Yang, Xuejiao; Deng, Shuifeng; Li, Zuohong; Li, Fei; Zhuo, Yehong
2015-01-01
Background To evaluate the efficacy and safety of the Ahmed glaucoma valve (AGV) and the risk factors associated with AGV implantation failure in a population of Chinese patients with refractory glaucoma. Method In total, 79 eyes with refractory glaucoma from 79 patients treated in our institution from November 2007 to November 2010 were enrolled in this retrospective study. The demographic data, preoperative and postoperative intraocular pressures (IOPs), best corrected visual acuity (BCVA), number of anti-glaucoma medications used, completed and qualified surgery success rates and postoperative complications were recorded to evaluate the outcomes of AGV implantation. Factors that were associated with implant failure were determined using Cox proportional hazard regression model analysis and multiple linear regression analysis. Principle Findings The average follow-up time was 12.7±5.8 months (mean±SD). We observed a significant reduction in the mean IOP from 39.9±12.6 mm Hg before surgery to 19.3±9.6 mm Hg at the final follow-up. The complete success rate was 59.5%, and the qualified success rate was 83.5%. The number of previous surgeries was negatively correlated with qualified success rate (P<0.05, OR=0.736, 95% CI 0.547-0.99). Patients with previous trabeculectomy were more likely to use multiple anti-glaucoma drugs to control IOP (P<0.01). The primary complication was determined to be a flat anterior chamber (AC). Conclusion AGV implantation was safe and effective for the management of refractory glaucoma. Patients with a greater number of previous surgeries were more likely to experience surgical failure, and patients with previous trabeculectomy were more likely to use multiple anti-glaucoma drugs to control postoperative IOP. PMID:25996991
Zhu, Yingting; Wei, Yantao; Yang, Xuejiao; Deng, Shuifeng; Li, Zuohong; Li, Fei; Zhuo, Yehong
2015-01-01
To evaluate the efficacy and safety of the Ahmed glaucoma valve (AGV) and the risk factors associated with AGV implantation failure in a population of Chinese patients with refractory glaucoma. In total, 79 eyes with refractory glaucoma from 79 patients treated in our institution from November 2007 to November 2010 were enrolled in this retrospective study. The demographic data, preoperative and postoperative intraocular pressures (IOPs), best corrected visual acuity (BCVA), number of anti-glaucoma medications used, completed and qualified surgery success rates and postoperative complications were recorded to evaluate the outcomes of AGV implantation. Factors that were associated with implant failure were determined using Cox proportional hazard regression model analysis and multiple linear regression analysis. The average follow-up time was 12.7±5.8 months (mean±SD). We observed a significant reduction in the mean IOP from 39.9±12.6 mm Hg before surgery to 19.3±9.6 mm Hg at the final follow-up. The complete success rate was 59.5%, and the qualified success rate was 83.5%. The number of previous surgeries was negatively correlated with qualified success rate (P<0.05, OR=0.736, 95% CI 0.547-0.99). Patients with previous trabeculectomy were more likely to use multiple anti-glaucoma drugs to control IOP (P<0.01). The primary complication was determined to be a flat anterior chamber (AC). AGV implantation was safe and effective for the management of refractory glaucoma. Patients with a greater number of previous surgeries were more likely to experience surgical failure, and patients with previous trabeculectomy were more likely to use multiple anti-glaucoma drugs to control postoperative IOP.
Ridge: a computer program for calculating ridge regression estimates
Donald E. Hilt; Donald W. Seegrist
1977-01-01
Least-squares coefficients for multiple-regression models may be unstable when the independent variables are highly correlated. Ridge regression is a biased estimation procedure that produces stable estimates of the coefficients. Ridge regression is discussed, and a computer program for calculating the ridge coefficients is presented.
Zhu, Xiang; Stephens, Matthew
2017-01-01
Bayesian methods for large-scale multiple regression provide attractive approaches to the analysis of genome-wide association studies (GWAS). For example, they can estimate heritability of complex traits, allowing for both polygenic and sparse models; and by incorporating external genomic data into the priors, they can increase power and yield new biological insights. However, these methods require access to individual genotypes and phenotypes, which are often not easily available. Here we provide a framework for performing these analyses without individual-level data. Specifically, we introduce a “Regression with Summary Statistics” (RSS) likelihood, which relates the multiple regression coefficients to univariate regression results that are often easily available. The RSS likelihood requires estimates of correlations among covariates (SNPs), which also can be obtained from public databases. We perform Bayesian multiple regression analysis by combining the RSS likelihood with previously proposed prior distributions, sampling posteriors by Markov chain Monte Carlo. In a wide range of simulations RSS performs similarly to analyses using the individual data, both for estimating heritability and detecting associations. We apply RSS to a GWAS of human height that contains 253,288 individuals typed at 1.06 million SNPs, for which analyses of individual-level data are practically impossible. Estimates of heritability (52%) are consistent with, but more precise, than previous results using subsets of these data. We also identify many previously unreported loci that show evidence for association with height in our analyses. Software is available at https://github.com/stephenslab/rss. PMID:29399241
Kayes, Nicola M; McPherson, Kathryn M; Schluter, Philip; Taylor, Denise; Leete, Marta; Kolt, Gregory S
2011-01-01
To explore the relationship that cognitive behavioural and other previously identified variables have with physical activity engagement in people with multiple sclerosis (MS). This study adopted a cross-sectional questionnaire design. Participants were 282 individuals with MS. Outcome measures included the Physical Activity Disability Survey--Revised, Cognitive and Behavioural Responses to Symptoms Questionnaire, Barriers to Health Promoting Activities for Disabled Persons Scale, Multiple Sclerosis Self-efficacy Scale, Self-Efficacy for Chronic Diseases Scales and Chalder Fatigue Questionnaire. Multivariable stepwise regression analyses found that greater self-efficacy, greater reported mental fatigue and lower number of perceived barriers to physical activity accounted for a significant proportion of variance in physical activity behaviour, over that accounted for by illness-related variables. Although fear-avoidance beliefs accounted for a significant proportion of variance in the initial analyses, its effect was explained by other factors in the final multivariable analyses. Self-efficacy, mental fatigue and perceived barriers to physical activity are potentially modifiable variables which could be incorporated into interventions designed to improve physical activity engagement. Future research should explore whether a measurement tool tailored to capture beliefs about physical activity identified by people with MS would better predict participation in physical activity.
NASA Astrophysics Data System (ADS)
Kiss, I.; Cioată, V. G.; Ratiu, S. A.; Rackov, M.; Penčić, M.
2018-01-01
Multivariate research is important in areas of cast-iron brake shoes manufacturing, because many variables interact with each other simultaneously. This article focuses on expressing the multiple linear regression model related to the hardness assurance by the chemical composition of the phosphorous cast irons destined to the brake shoes, having in view that the regression coefficients will illustrate the unrelated contributions of each independent variable towards predicting the dependent variable. In order to settle the multiple correlations between the hardness of the cast-iron brake shoes, and their chemical compositions several regression equations has been proposed. Is searched a mathematical solution which can determine the optimum chemical composition for the hardness desirable values. Starting from the above-mentioned affirmations two new statistical experiments are effectuated related to the values of Phosphorus [P], Manganese [Mn] and Silicon [Si]. Therefore, the regression equations, which describe the mathematical dependency between the above-mentioned elements and the hardness, are determined. As result, several correlation charts will be revealed.
ERIC Educational Resources Information Center
Porter, Kristin E.; Reardon, Sean F.; Unlu, Fatih; Bloom, Howard S.; Robinson-Cimpian, Joseph P.
2014-01-01
A valuable extension of the single-rating regression discontinuity design (RDD) is a multiple-rating RDD (MRRDD). To date, four main methods have been used to estimate average treatment effects at the multiple treatment frontiers of an MRRDD: the "surface" method, the "frontier" method, the "binding-score" method, and…
ERIC Educational Resources Information Center
Hafner, Lawrence E.
A study developed a multiple regression prediction equation for each of six selected achievement variables in a popular standardized test of achievement. Subjects, 42 fourth-grade pupils randomly selected across several classes in a large elementary school in a north Florida city, were administered several standardized tests to determine predictor…
ERIC Educational Resources Information Center
Muller, Veronica; Brooks, Jessica; Tu, Wei-Mo; Moser, Erin; Lo, Chu-Ling; Chan, Fong
2015-01-01
Purpose: The main objective of this study was to determine the extent to which physical and cognitive-affective factors are associated with fibromyalgia (FM) fatigue. Method: A quantitative descriptive design using correlation techniques and multiple regression analysis. The participants consisted of 302 members of the National Fibromyalgia &…
ERIC Educational Resources Information Center
Choi, Kilchan
2011-01-01
This report explores a new latent variable regression 4-level hierarchical model for monitoring school performance over time using multisite multiple-cohorts longitudinal data. This kind of data set has a 4-level hierarchical structure: time-series observation nested within students who are nested within different cohorts of students. These…
ERIC Educational Resources Information Center
Richter, Tobias
2006-01-01
Most reading time studies using naturalistic texts yield data sets characterized by a multilevel structure: Sentences (sentence level) are nested within persons (person level). In contrast to analysis of variance and multiple regression techniques, hierarchical linear models take the multilevel structure of reading time data into account. They…
Some Applied Research Concerns Using Multiple Linear Regression Analysis.
ERIC Educational Resources Information Center
Newman, Isadore; Fraas, John W.
The intention of this paper is to provide an overall reference on how a researcher can apply multiple linear regression in order to utilize the advantages that it has to offer. The advantages and some concerns expressed about the technique are examined. A number of practical ways by which researchers can deal with such concerns as…
A Spreadsheet Tool for Learning the Multiple Regression F-Test, T-Tests, and Multicollinearity
ERIC Educational Resources Information Center
Martin, David
2008-01-01
This note presents a spreadsheet tool that allows teachers the opportunity to guide students towards answering on their own questions related to the multiple regression F-test, the t-tests, and multicollinearity. The note demonstrates approaches for using the spreadsheet that might be appropriate for three different levels of statistics classes,…
ERIC Educational Resources Information Center
Preacher, Kristopher J.; Curran, Patrick J.; Bauer, Daniel J.
2006-01-01
Simple slopes, regions of significance, and confidence bands are commonly used to evaluate interactions in multiple linear regression (MLR) models, and the use of these techniques has recently been extended to multilevel or hierarchical linear modeling (HLM) and latent curve analysis (LCA). However, conducting these tests and plotting the…
Fernandez-Lozano, Carlos; Gestal, Marcos; Munteanu, Cristian R; Dorado, Julian; Pazos, Alejandro
2016-01-01
The design of experiments and the validation of the results achieved with them are vital in any research study. This paper focuses on the use of different Machine Learning approaches for regression tasks in the field of Computational Intelligence and especially on a correct comparison between the different results provided for different methods, as those techniques are complex systems that require further study to be fully understood. A methodology commonly accepted in Computational intelligence is implemented in an R package called RRegrs. This package includes ten simple and complex regression models to carry out predictive modeling using Machine Learning and well-known regression algorithms. The framework for experimental design presented herein is evaluated and validated against RRegrs. Our results are different for three out of five state-of-the-art simple datasets and it can be stated that the selection of the best model according to our proposal is statistically significant and relevant. It is of relevance to use a statistical approach to indicate whether the differences are statistically significant using this kind of algorithms. Furthermore, our results with three real complex datasets report different best models than with the previously published methodology. Our final goal is to provide a complete methodology for the use of different steps in order to compare the results obtained in Computational Intelligence problems, as well as from other fields, such as for bioinformatics, cheminformatics, etc., given that our proposal is open and modifiable.
Gestal, Marcos; Munteanu, Cristian R.; Dorado, Julian; Pazos, Alejandro
2016-01-01
The design of experiments and the validation of the results achieved with them are vital in any research study. This paper focuses on the use of different Machine Learning approaches for regression tasks in the field of Computational Intelligence and especially on a correct comparison between the different results provided for different methods, as those techniques are complex systems that require further study to be fully understood. A methodology commonly accepted in Computational intelligence is implemented in an R package called RRegrs. This package includes ten simple and complex regression models to carry out predictive modeling using Machine Learning and well-known regression algorithms. The framework for experimental design presented herein is evaluated and validated against RRegrs. Our results are different for three out of five state-of-the-art simple datasets and it can be stated that the selection of the best model according to our proposal is statistically significant and relevant. It is of relevance to use a statistical approach to indicate whether the differences are statistically significant using this kind of algorithms. Furthermore, our results with three real complex datasets report different best models than with the previously published methodology. Our final goal is to provide a complete methodology for the use of different steps in order to compare the results obtained in Computational Intelligence problems, as well as from other fields, such as for bioinformatics, cheminformatics, etc., given that our proposal is open and modifiable. PMID:27920952
Regression Models for the Analysis of Longitudinal Gaussian Data from Multiple Sources
O’Brien, Liam M.; Fitzmaurice, Garrett M.
2006-01-01
We present a regression model for the joint analysis of longitudinal multiple source Gaussian data. Longitudinal multiple source data arise when repeated measurements are taken from two or more sources, and each source provides a measure of the same underlying variable and on the same scale. This type of data generally produces a relatively large number of observations per subject; thus estimation of an unstructured covariance matrix often may not be possible. We consider two methods by which parsimonious models for the covariance can be obtained for longitudinal multiple source data. The methods are illustrated with an example of multiple informant data arising from a longitudinal interventional trial in psychiatry. PMID:15726666
Interpretation of commonly used statistical regression models.
Kasza, Jessica; Wolfe, Rory
2014-01-01
A review of some regression models commonly used in respiratory health applications is provided in this article. Simple linear regression, multiple linear regression, logistic regression and ordinal logistic regression are considered. The focus of this article is on the interpretation of the regression coefficients of each model, which are illustrated through the application of these models to a respiratory health research study. © 2013 The Authors. Respirology © 2013 Asian Pacific Society of Respirology.
Applied Multiple Linear Regression: A General Research Strategy
ERIC Educational Resources Information Center
Smith, Brandon B.
1969-01-01
Illustrates some of the basic concepts and procedures for using regression analysis in experimental design, analysis of variance, analysis of covariance, and curvilinear regression. Applications to evaluation of instruction and vocational education programs are illustrated. (GR)
DOE Office of Scientific and Technical Information (OSTI.GOV)
Kim, Yangho; Lee, Byung-Kook, E-mail: bklee@sch.ac.kr
Introduction: The objective of this study was to evaluate associations between blood lead, cadmium, and mercury levels with estimated glomerular filtration rate in a general population of South Korean adults. Methods: This was a cross-sectional study based on data obtained in the Korean National Health and Nutrition Examination Survey (KNHANES) (2008-2010). The final analytical sample consisted of 5924 participants. Estimated glomerular filtration rate (eGFR) was calculated using the MDRD Study equation as an indicator of glomerular function. Results: In multiple linear regression analysis of log2-transformed blood lead as a continuous variable on eGFR, after adjusting for covariates including cadmium andmore » mercury, the difference in eGFR levels associated with doubling of blood lead were -2.624 mL/min per 1.73 m Superscript-Two (95% CI: -3.803 to -1.445). In multiple linear regression analysis using quartiles of blood lead as the independent variable, the difference in eGFR levels comparing participants in the highest versus the lowest quartiles of blood lead was -3.835 mL/min per 1.73 m Superscript-Two (95% CI: -5.730 to -1.939). In a multiple linear regression analysis using blood cadmium and mercury, as continuous or categorical variables, as independent variables, neither metal was a significant predictor of eGFR. Odds ratios (ORs) and 95% CI values for reduced eGFR calculated for log2-transformed blood metals and quartiles of the three metals showed similar trends after adjustment for covariates. Discussion: In this large, representative sample of South Korean adults, elevated blood lead level was consistently associated with lower eGFR levels and with the prevalence of reduced eGFR even in blood lead levels below 10 {mu}g/dL. In conclusion, elevated blood lead level was associated with lower eGFR in a Korean general population, supporting the role of lead as a risk factor for chronic kidney disease.« less
Altmann, Vivian; Schumacher-Schuh, Artur F; Rieck, Mariana; Callegari-Jacques, Sidia M; Rieder, Carlos R M; Hutz, Mara H
2016-04-01
Levodopa is first-line treatment of Parkinson's disease motor symptoms but, dose response is highly variable. Therefore, the aim of this study was to determine how much levodopa dose could be explained by biological, pharmacological and genetic factors. A total of 224 Parkinson's disease patients were genotyped for SV2C and SLC6A3 polymorphisms by allelic discrimination assays. Comedication, demographic and clinical data were also assessed. All variables with p < 0.20 were included in a multiple regression analysis for dose prediction. The final model explained 23% of dose variation (F = 11.54; p < 0.000001). Although a good prediction model was obtained, it still needs to be tested in an independent sample to be validated.
Personality and emotional intelligence in teacher burnout.
Pishghadam, Reza; Sahebjam, Samaneh
2012-03-01
This paper aims to investigate the relationship between teacher's personality types, emotional intelligence and burnout and to predict the burnout levels of 147 teachers in the city of Mashhad (Iran). To this end, we have used three inventories: Maslach Burnout Inventory (MBI), NEO Five Factor Inventory (NEO-FFI), and Emotional Quotient Inventory (EQ-I). We used Homogeneity Analysis and Multiple Linear Regression to analyze the data. The results exhibited a significant relationship between personality types and emotional intelligence and the three dimensions of burnout. It was indicated that the best predictors for emotional exhaustion were neuroticism and extroversion, for depersonalization were intrapersonal scale of emotional intelligence and agreeableness, and for personal accomplishment were interpersonal scale and conscientiousness. Finally, the results were discussed in the context of teacher burnout.
Holtz, Carol; Sowell, Richard; VanBrackle, Lewis; Velasquez, Gabriela; Hernandez-Alonso, Virginia
2014-01-01
This quantitative study explored the level of Quality of Life (QoL) in indigenous Mexican women and identified psychosocial factors that significantly influenced their QoL, using face-to-face interviews with 101 women accessing care in an HIV clinic in Oaxaca, Mexico. Variables included demographic characteristics, levels of depression, coping style, family functioning, HIV-related beliefs, and QoL. Descriptive statistics were used to analyze participant characteristics, and women's scores on data collection instruments. Pearson's R correlational statistics were used to determine the level of significance between study variables. Multiple regression analysis examined all variables that were significantly related to QoL. Pearson's correlational analysis of relationships between Spirituality, Educating Self about HIV, Family Functioning, Emotional Support, Physical Care, and Staying Positive demonstrated positive correlation to QoL. Stigma, depression, and avoidance coping were significantly and negatively associated with QoL. The final regression model indicated that depression and avoidance coping were the best predictor variables for QoL. Copyright © 2014 Association of Nurses in AIDS Care. Published by Elsevier Inc. All rights reserved.
Population heterogeneity in the salience of multiple risk factors for adolescent delinquency.
Lanza, Stephanie T; Cooper, Brittany R; Bray, Bethany C
2014-03-01
To present mixture regression analysis as an alternative to more standard regression analysis for predicting adolescent delinquency. We demonstrate how mixture regression analysis allows for the identification of population subgroups defined by the salience of multiple risk factors. We identified population subgroups (i.e., latent classes) of individuals based on their coefficients in a regression model predicting adolescent delinquency from eight previously established risk indices drawn from the community, school, family, peer, and individual levels. The study included N = 37,763 10th-grade adolescents who participated in the Communities That Care Youth Survey. Standard, zero-inflated, and mixture Poisson and negative binomial regression models were considered. Standard and mixture negative binomial regression models were selected as optimal. The five-class regression model was interpreted based on the class-specific regression coefficients, indicating that risk factors had varying salience across classes of adolescents. Standard regression showed that all risk factors were significantly associated with delinquency. Mixture regression provided more nuanced information, suggesting a unique set of risk factors that were salient for different subgroups of adolescents. Implications for the design of subgroup-specific interventions are discussed. Copyright © 2014 Society for Adolescent Health and Medicine. Published by Elsevier Inc. All rights reserved.
Influences on Academic Achievement Across High and Low Income Countries: A Re-Analysis of IEA Data.
ERIC Educational Resources Information Center
Heyneman, S.; Loxley, W.
Previous international studies of science achievement put the data through a process of winnowing to decide which variables to keep in the final regressions. Variables were allowed to enter the final regressions if they met a minimum beta coefficient criterion of 0.05 averaged across rich and poor countries alike. The criterion was an average…
ERIC Educational Resources Information Center
Porter, Kristin E.; Reardon, Sean F.; Unlu, Fatih; Bloom, Howard S.; Cimpian, Joseph R.
2017-01-01
A valuable extension of the single-rating regression discontinuity design (RDD) is a multiple-rating RDD (MRRDD). To date, four main methods have been used to estimate average treatment effects at the multiple treatment frontiers of an MRRDD: the "surface" method, the "frontier" method, the "binding-score" method, and…
ERIC Educational Resources Information Center
Woolley, Kristin K.
Many researchers are unfamiliar with suppressor variables and how they operate in multiple regression analyses. This paper describes the role suppressor variables play in a multiple regression model and provides practical examples that explain how they can change research results. A variable that when added as another predictor increases the total…
ERIC Educational Resources Information Center
Martz, Erin
2004-01-01
Because the onset of a spinal cord injury may involve a brush with death and because serious injury and disability can act as a reminder of death, death anxiety was examined as a predictor of posttraumatic stress levels among individuals with disabilities. This cross-sectional study used multiple regression and multivariate multiple regression to…
Regression Analysis of Top of Descent Location for Idle-thrust Descents
NASA Technical Reports Server (NTRS)
Stell, Laurel; Bronsvoort, Jesper; McDonald, Greg
2013-01-01
In this paper, multiple regression analysis is used to model the top of descent (TOD) location of user-preferred descent trajectories computed by the flight management system (FMS) on over 1000 commercial flights into Melbourne, Australia. The independent variables cruise altitude, final altitude, cruise Mach, descent speed, wind, and engine type were also recorded or computed post-operations. Both first-order and second-order models are considered, where cross-validation, hypothesis testing, and additional analysis are used to compare models. This identifies the models that should give the smallest errors if used to predict TOD location for new data in the future. A model that is linear in TOD altitude, final altitude, descent speed, and wind gives an estimated standard deviation of 3.9 nmi for TOD location given the trajec- tory parameters, which means about 80% of predictions would have error less than 5 nmi in absolute value. This accuracy is better than demonstrated by other ground automation predictions using kinetic models. Furthermore, this approach would enable online learning of the model. Additional data or further knowl- edge of algorithms is necessary to conclude definitively that no second-order terms are appropriate. Possible applications of the linear model are described, including enabling arriving aircraft to fly optimized descents computed by the FMS even in congested airspace. In particular, a model for TOD location that is linear in the independent variables would enable decision support tool human-machine interfaces for which a kinetic approach would be computationally too slow.
McClelland, Gary H; Irwin, Julie R; Disatnik, David; Sivan, Liron
2017-02-01
Multicollinearity is irrelevant to the search for moderator variables, contrary to the implications of Iacobucci, Schneider, Popovich, and Bakamitsos (Behavior Research Methods, 2016, this issue). Multicollinearity is like the red herring in a mystery novel that distracts the statistical detective from the pursuit of a true moderator relationship. We show multicollinearity is completely irrelevant for tests of moderator variables. Furthermore, readers of Iacobucci et al. might be confused by a number of their errors. We note those errors, but more positively, we describe a variety of methods researchers might use to test and interpret their moderated multiple regression models, including two-stage testing, mean-centering, spotlighting, orthogonalizing, and floodlighting without regard to putative issues of multicollinearity. We cite a number of recent studies in the psychological literature in which the researchers used these methods appropriately to test, to interpret, and to report their moderated multiple regression models. We conclude with a set of recommendations for the analysis and reporting of moderated multiple regression that should help researchers better understand their models and facilitate generalizations across studies.
Climatological Modeling of Monthly Air Temperature and Precipitation in Egypt through GIS Techniques
NASA Astrophysics Data System (ADS)
El Kenawy, A.
2009-09-01
This paper describes a method for modeling and mapping four climatic variables (maximum temperature, minimum temperature, mean temperature and total precipitation) in Egypt using a multiple regression approach implemented in a GIS environment. In this model, a set of variables including latitude, longitude, elevation within a distance of 5, 10 and 15 km, slope, aspect, distance to the Mediterranean Sea, distance to the Red Sea, distance to the Nile, ratio between land and water masses within a radius of 5, 10, 15 km, the Normalized Difference Vegetation Index (NDVI), the Normalized Difference Water Index (NDWI), the Normalized Difference Temperature Index (NDTI) and reflectance are included as independent variables. These variables were integrated as raster layers in MiraMon software at a spatial resolution of 1 km. Climatic variables were considered as dependent variables and averaged from quality controlled and homogenized 39 series distributing across the entire country during the period of (1957-2006). For each climatic variable, digital and objective maps were finally obtained using the multiple regression coefficients at monthly, seasonal and annual timescale. The accuracy of these maps were assessed through cross-validation between predicted and observed values using a set of statistics including coefficient of determination (R2), root mean square error (RMSE), mean absolute error (MAE), mean bias Error (MBE) and D Willmott statistic. These maps are valuable in the sense of spatial resolution as well as the number of observatories involved in the current analysis.
Palomo, M J; Quintanilla, R; Izquierdo, M D; Mogas, T; Paramio, M T
2016-12-01
This work analyses the changes that caprine spermatozoa undergo during in vitro fertilization (IVF) of in vitro matured prepubertal goat oocytes and their relationship with IVF outcome, in order to obtain an effective model that allows prediction of in vitro fertility on the basis of semen assessment. The evolution of several sperm parameters (motility, viability and acrosomal integrity) during IVF and their relationship with three IVF outcome criteria (total penetration, normal penetration and cleavage rates) were studied in a total of 56 IVF replicates. Moderate correlation coefficients between some sperm parameters and IVF outcome were observed. In addition, stepwise multiple regression analyses were conducted that considered three grouping of sperm parameters as potential explanatory variables of the three IVF outcome criteria. The proportion of IVF outcome variation that can be explained by the fitted models ranged from 0.62 to 0.86, depending upon the trait analysed and the variables considered. Seven out of 32 sperm parameters were selected as partial covariates in at least one of the nine multiple regression models. Among these, progressive sperm motility assessed immediately after swim-up, the percentage of dead sperm with intact acrosome and the incidence of acrosome reaction both determined just before the gamete co-culture, and finally the proportion of viable spermatozoa at 17 h post-insemination were the most frequently selected sperm parameters. Nevertheless, the predictive ability of these models must be confirmed in a larger sample size experiment.
Work related stress and blood glucose levels.
Sancini, A; Ricci, S; Tomei, F; Sacco, C; Pacchiarotti, A; Nardone, N; Ricci, P; Suppi, A; De Cesare, D P; Anzelmo, V; Giubilati, R; Pimpinella, B; Rosati, M V; Tomei, G
2017-01-01
The aim of the study is to evaluate work-related subjective stress in a group of workers on a major Italian company in the field of healthcare through the administration of a valid "questionnaire-tool indicator" (HSE Indicator Tool), and to analyze any correlation between stress levels taken from questionnaire scores and blood glucose values. We studied a final sample consisting of 241 subjects with different tasks. The HSE questionnaire - made up of 35 items (divided into 7 organizational dimensions) with 5 possible answers - has been distributed to all the subjects in occasion of the health surveillance examinations provided by law. The questionnaire was then analyzed using its specific software to process the results related to the 7 dimensions. These results were compared using the Pearson correlation and multiple linear regression with the blood glucose values obtained from each subject. From the analysis of the data the following areas resulted critical, in other words linked to an intermediate (yellow area) or high (red area) condition of stress: sustain from managers, sustain from colleagues, quality of relationships and professional changes. A significant positive correlation (p <0.05) between the mean values of all critical areas and the concentrations of glucose values have been highlighted with the correlation index of Pearson. Multiple linear regression confirmed these findings, showing that the critical dimensions resulting from the questionnaire were the significant variables that can increase the levels of blood glucose. The preliminary results indicate that perceived work stress can be statistically associated with increased levels of blood glucose.
Hwang, In Cheol; Ahn, Hong Yup; Park, Sang Min; Shim, Jae Yong; Kim, Kyoung Kon
2013-03-01
There is scant research concerning the prediction of imminent death, and current studies simply list events "that have already occurred" around 48 h of the death. We sought to determine what events herald the onset of dying process using the length of time from "any change" to death. This is a prospective observational study with chart audit. Inclusion criteria were terminal cancer patients who passed away in a palliative care unit. The analysis was limited to 181 patients who had medical records for their final week. Commonly observed events in the terminally ill were determined and their significant changes were defined beforehand. We selected the statistically significant changes by multiple logistic regression analysis and evaluated their predictive values for "death within 48 h." The median age was 67 years and there were 103 male patients. After adjusting for age, sex, primary cancer site, metastatic site, and cancer treatment, multiple logistic regression analyses for association between the events and "death within 48 h" revealed some significant changes: confused mental state, decreased blood pressure, increased pulse pressure, low oxygen saturation, death rattle, and decreased conscious level. The events that had higher predictability for death within 48 h were decreased blood pressure and low oxygen saturation, and the positive and negative predictive values of their combination were 95.0 and 81.4%, respectively. The most reliable events to predict impending death were decreased blood pressure and low oxygen saturation.
Choi, Kang; Im, Hyoungjune; Kim, Joohan; Choi, Kwang H; Jon, Duk-In; Hong, Hyunju; Hong, Narei; Lee, Eunjung; Seok, Jeong-Ho
2013-11-01
Early-life stress (ELS) may mediate adjustment problems while resilience may protect individuals against adjustment problems during military service. We investigated the relationship of ELS and resilience with adjustment problem factor scores in the Korea Military Personality Test (KMPT) in candidates for the military service. Four hundred and sixty-one candidates participated in this study. Vulnerability traits for military adjustment, ELS, and resilience were assessed using the KMPT, the Korean Early-Life Abuse Experience Questionnaire, and the Resilience Quotient Test, respectively. Data were analyzed using multiple linear regression analyses. The final model of the multiple linear regression analyses explained 30.2 % of the total variances of the sum of the adjustment problem factor scores of the KMPT. Neglect and exposure to domestic violence had a positive association with the total adjustment problem factor scores of the KMPT, but emotion control, impulse control, and optimism factor scores as well as education and occupational status were inversely associated with the total military adjustment problem score. ELS and resilience are important modulating factors in adjusting to military service. We suggest that neglect and exposure to domestic violence during early life may increase problem with adjustment, but capacity to control emotion and impulse as well as optimistic attitude may play protective roles in adjustment to military life. The screening procedures for ELS and the development of psychological interventions may be helpful for young adults to adjust to military service.
Jia, He; Li, Huimian; Zhang, Yan; Li, Che; Hu, Yingyun; Xia, Chunfang
2015-01-01
The present study aimed to explore the association between RDW and CAS in patients with ischemic stroke, expecting to find a new and significant diagnosis index for clinical practice. This cross-sectional study involves 432 consecutive patients with primary ischemic stroke (within 72 h). All subjects were confirmed by magnetic resonance imaging, and underwent physical examination, laboratory tests and carotid ultrasonography check. Finally, 392 patients were included according to the exclusion criteria. The odds ratios of independent variables were calculated using stepwise multiple logistic regression. Carotid intimal-medial thickness (IMT) and RDW are both significantly different between CAS group and control group. Univariate analyses show that high-sensitive C-reactive protein (Hs-CRP) and RDW (r=0.436) are both in significantly positive association with IMT. Stepwise multiple logistic regression shows that RDW is an independent protective factor of CAS in patients with ischemic stroke. Compared with the lowest quartile, the second to fourth quartiles are 1.13 (95% CI: 1.13-3.05), 2.02 (95% CI: 1.66-4.67), and 3.10 (95% CI: 2.46-7.65), respectively. The present study suggested that RDW level were higher than non-CAS in patients with primary ischemic stroke. Our results facilitated a bridge to connect RDW with ischemic stroke and further confirmed the role of RDW in the progression of the ischemic stroke. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.
O'Brien, Celia Laird; Thomas, John X; Green, Marianne M
2018-01-01
Medical educators struggle to find effective ways to assess essential competencies such as communication, professionalism, and teamwork. Portfolio-based assessment provides one method of addressing this problem by allowing faculty reviewers to judge performance, as based on a longitudinal record of student behavior. At the Feinberg School of Medicine, the portfolio system measures behavioral competence using multiple assessments collected over time. This study examines whether a preclerkship portfolio review is a valid method of identifying problematic student behavior affecting later performance in clerkships. The authors divided students into two groups based on a summative preclerkship portfolio review in 2014: students who had concerning behavior in one or more competencies and students progressing satisfactorily. They compared how students in these groups later performed on two clerkship outcomes as of October 2015: final grades in required clerkships, and performance on a clerkship clinical composite score. They used Mann-Whitney tests and multiple linear regression to examine the relationship between portfolio review results and clerkship outcomes. They used USMLE Step 1 to control for knowledge acquisition. Students with concerning behavior preclerkship received significantly lower clerkship grades than students progressing satisfactorily (P = .002). They also scored significantly lower on the clinical composite score (P < .001). Regression analysis indicated concerning behavior was associated with lower clinical composite scores, even after controlling for knowledge acquisition. The results show a preclerkship portfolio review can identify behaviors that impact clerkship performance. A comprehensive portfolio system is a valid way to measure behavioral competencies.
Estimating the Biodegradability of Treated Sewage Samples Using Synchronous Fluorescence Spectra
Lai, Tien M.; Shin, Jae-Ki; Hur, Jin
2011-01-01
Synchronous fluorescence spectra (SFS) and the first derivative spectra of the influent versus the effluent wastewater samples were compared and the use of fluorescence indices is suggested as a means to estimate the biodegradability of the effluent wastewater. Three distinct peaks were identified from the SFS of the effluent wastewater samples. Protein-like fluorescence (PLF) was reduced, whereas fulvic and/or humic-like fluorescence (HLF) were enhanced, suggesting that the two fluorescence characteristics may represent biodegradable and refractory components, respectively. Five fluorescence indices were selected for the biodegradability estimation based on the spectral features changing from the influent to the effluent. Among the selected indices, the relative distribution of PLF to the total fluorescence area of SFS (Index II) exhibited the highest correlation coefficient with total organic carbon (TOC)-based biodegradability, which was even higher than those obtained with the traditional oxygen demand-based parameters. A multiple regression analysis using Index II and the area ratio of PLF to HLF (Index III) demonstrated the enhancement of the correlations from 0.558 to 0.711 for TOC-based biodegradability. The multiple regression equation finally obtained was 0.148 × Index II − 4.964 × Index III − 0.001 and 0.046 × Index II − 1.128 × Index III + 0.026. The fluorescence indices proposed here are expected to be utilized for successful development of real-time monitoring using a simple fluorescence sensing device for the biodegradability of treated sewage. PMID:22164023
Estimating the biodegradability of treated sewage samples using synchronous fluorescence spectra.
Lai, Tien M; Shin, Jae-Ki; Hur, Jin
2011-01-01
Synchronous fluorescence spectra (SFS) and the first derivative spectra of the influent versus the effluent wastewater samples were compared and the use of fluorescence indices is suggested as a means to estimate the biodegradability of the effluent wastewater. Three distinct peaks were identified from the SFS of the effluent wastewater samples. Protein-like fluorescence (PLF) was reduced, whereas fulvic and/or humic-like fluorescence (HLF) were enhanced, suggesting that the two fluorescence characteristics may represent biodegradable and refractory components, respectively. Five fluorescence indices were selected for the biodegradability estimation based on the spectral features changing from the influent to the effluent. Among the selected indices, the relative distribution of PLF to the total fluorescence area of SFS (Index II) exhibited the highest correlation coefficient with total organic carbon (TOC)-based biodegradability, which was even higher than those obtained with the traditional oxygen demand-based parameters. A multiple regression analysis using Index II and the area ratio of PLF to HLF (Index III) demonstrated the enhancement of the correlations from 0.558 to 0.711 for TOC-based biodegradability. The multiple regression equation finally obtained was 0.148 × Index II - 4.964 × Index III - 0.001 and 0.046 × Index II - 1.128 × Index III + 0.026. The fluorescence indices proposed here are expected to be utilized for successful development of real-time monitoring using a simple fluorescence sensing device for the biodegradability of treated sewage.
Raggi, Alberto; Giovannetti, Ambra Mara; Schiavolin, Silvia; Brambilla, Laura; Brenna, Greta; Confalonieri, Paolo Agostino; Cortese, Francesca; Frangiamore, Rita; Leonardi, Matilde; Mantegazza, Renato Emilio; Moscatelli, Marco; Ponzio, Michela; Torri Clerici, Valentina; Zaratin, Paola; De Torres, Laura
2018-04-16
This cross-sectional study aims to identify the predictors of work-related difficulties in a sample of employed persons with multiple sclerosis as addressed with the Multiple Sclerosis Questionnaire for Job Difficulties. Hierarchical linear regression analysis was conducted to identify predictors of work difficulties: predictors included demographic variables (age, formal education), disease duration and severity, perceived disability and psychological variables (cognitive dysfunction, depression and anxiety). The targets were the questionnaire's overall score and its six subscales. A total of 177 participants (108 females, aged 21-63) were recruited. Age, perceived disability and depression were direct and significant predictors of the questionnaire total score, and the final model explained 43.7% of its variation. The models built on the questionnaire's subscales show that perceived disability and depression were direct and significant predictors of most of its subscales. Our results show that, among patients with multiple sclerosis, those who were older, with higher perceived disability and higher depression symptoms have more and more severe work-related difficulties. The Multiple Sclerosis Questionnaire for Job Difficulties can be fruitfully exploited to plan tailored actions to limit the likelihood of near-future job loss in persons of working age with multiple sclerosis. Implications for rehabilitation Difficulties with work are common among people with multiple sclerosis and are usually addressed in terms of unemployment or job loss. The Multiple Sclerosis Questionnaire for Job Difficulties is a disease-specific questionnaire developed to address the amount and severity of work-related difficulties. We found that work-related difficulties were associated to older age, higher perceived disability and depressive symptoms. Mental health issues and perceived disability should be consistently included in future research targeting work-related difficulties.
Brown, C. Erwin
1993-01-01
Correlation analysis in conjunction with principal-component and multiple-regression analyses were applied to laboratory chemical and petrographic data to assess the usefulness of these techniques in evaluating selected physical and hydraulic properties of carbonate-rock aquifers in central Pennsylvania. Correlation and principal-component analyses were used to establish relations and associations among variables, to determine dimensions of property variation of samples, and to filter the variables containing similar information. Principal-component and correlation analyses showed that porosity is related to other measured variables and that permeability is most related to porosity and grain size. Four principal components are found to be significant in explaining the variance of data. Stepwise multiple-regression analysis was used to see how well the measured variables could predict porosity and (or) permeability for this suite of rocks. The variation in permeability and porosity is not totally predicted by the other variables, but the regression is significant at the 5% significance level. ?? 1993.
Liu, Qi; Wu, Youcong; Yuan, Youhua; Bai, Li; Niu, Kun
2011-12-01
To research the relationship between the virulence factors of Saccharomyces albicans (S. albicans) and the random amplified polymorphic DNA (RAPD) bands of them, and establish the regression model by multiple regression analysis. Extracellular phospholipase, secreted proteinase, ability to generate germ tubes and adhere to oral mucosal cells of 92 strains of S. albicans were measured in vitro; RAPD-polymerase chain reaction (RAPD-PCR) was used to get their bands. Multiple regression for virulence factors of S. albicans and RAPD-PCR bands was established. The extracellular phospholipase activity was associated with 4 RAPD bands: 350, 450, 650 and 1 300 bp (P < 0.05); secreted proteinase activity of S. albicans was associated with 2 bands: 350 and 1 200 bp (P < 0.05); the ability of germ tube produce was associated with 2 bands: 400 and 550 bp (P < 0.05). Some RAPD bands will reflect the virulence factors of S. albicans indirectly. These bands would contain some important messages for regulation of S. albicans virulence factors.
Simultaneous multiple non-crossing quantile regression estimation using kernel constraints
Liu, Yufeng; Wu, Yichao
2011-01-01
Quantile regression (QR) is a very useful statistical tool for learning the relationship between the response variable and covariates. For many applications, one often needs to estimate multiple conditional quantile functions of the response variable given covariates. Although one can estimate multiple quantiles separately, it is of great interest to estimate them simultaneously. One advantage of simultaneous estimation is that multiple quantiles can share strength among them to gain better estimation accuracy than individually estimated quantile functions. Another important advantage of joint estimation is the feasibility of incorporating simultaneous non-crossing constraints of QR functions. In this paper, we propose a new kernel-based multiple QR estimation technique, namely simultaneous non-crossing quantile regression (SNQR). We use kernel representations for QR functions and apply constraints on the kernel coefficients to avoid crossing. Both unregularised and regularised SNQR techniques are considered. Asymptotic properties such as asymptotic normality of linear SNQR and oracle properties of the sparse linear SNQR are developed. Our numerical results demonstrate the competitive performance of our SNQR over the original individual QR estimation. PMID:22190842
Monitoring heavy metal Cr in soil based on hyperspectral data using regression analysis
NASA Astrophysics Data System (ADS)
Zhang, Ningyu; Xu, Fuyun; Zhuang, Shidong; He, Changwei
2016-10-01
Heavy metal pollution in soils is one of the most critical problems in the global ecology and environment safety nowadays. Hyperspectral remote sensing and its application is capable of high speed, low cost, less risk and less damage, and provides a good method for detecting heavy metals in soil. This paper proposed a new idea of applying regression analysis of stepwise multiple regression between the spectral data and monitoring the amount of heavy metal Cr by sample points in soil for environmental protection. In the measurement, a FieldSpec HandHeld spectroradiometer is used to collect reflectance spectra of sample points over the wavelength range of 325-1075 nm. Then the spectral data measured by the spectroradiometer is preprocessed to reduced the influence of the external factors, and the preprocessed methods include first-order differential equation, second-order differential equation and continuum removal method. The algorithms of stepwise multiple regression are established accordingly, and the accuracy of each equation is tested. The results showed that the accuracy of first-order differential equation works best, which makes it feasible to predict the content of heavy metal Cr by using stepwise multiple regression.
Valentijn, Pim P; Vrijhoef, Hubertus J M; Ruwaard, Dirk; de Bont, Antoinette; Arends, Rosa Y; Bruijnzeels, Marc A
2015-01-22
Forming partnerships is a prominent strategy used to promote integrated service delivery across health and social service systems. Evidence about the collaboration process upon which partnerships evolve has rarely been addressed in an integrated-care setting. This study explores the longitudinal relationship of the collaboration process and the influence on the final perceived success of a partnership in such a setting. The collaboration process through which partnerships evolve is based on a conceptual framework which identifies five themes: shared ambition, interests and mutual gains, relationship dynamics, organisational dynamics and process management. Fifty-nine out of 69 partnerships from a national programme in the Netherlands participated in this survey study. At baseline, 338 steering committee members responded, and they returned 320 questionnaires at follow-up. Multiple-regression-analyses were conducted to explore the relationship between the baseline as well as the change in the collaboration process and the final success of the partnerships. Mutual gains and process management were the most significant baseline predictors for the final success of the partnership. A positive change in the relationship dynamics had a significant effect on the final success of a partnership. Insight into the collaboration process of integrated primary care partnerships offers a potentially powerful way of predicting their success. Our findings underscore the importance of monitoring the collaboration process during the development of the partnerships in order to achieve their full collaborative advantage.
Forecasting USAF JP-8 Fuel Needs
2009-03-01
versus complex ones. When we consider long -term forecasts, 5-years in this case, multiple regression outperforms ANN modeling within the specified...with more simple and easy-to-implement methods, versus complex ones. When we consider long -term 5-year forecasts, our multiple regression model...effort. The insight and experience was certainly appreciated. Special thanks to my Turkish peers for their continuous support and help during this long
ERIC Educational Resources Information Center
Le, Huy; Marcus, Justin
2012-01-01
This study used Monte Carlo simulation to examine the properties of the overall odds ratio (OOR), which was recently introduced as an index for overall effect size in multiple logistic regression. It was found that the OOR was relatively independent of study base rate and performed better than most commonly used R-square analogs in indexing model…
ERIC Educational Resources Information Center
Pecorella, Patricia A.; Bowers, David G.
Multiple regression in a double cross-validated design was used to predict two performance measures (total variable expense and absence rate) by multi-month period in five industrial firms. The regressions do cross-validate, and produce multiple coefficients which display both concurrent and predictive effects, peaking 18 months to two years…
USDA-ARS?s Scientific Manuscript database
A technique of using multiple calibration sets in partial least squares regression (PLS) was proposed to improve the quantitative determination of ammonia from open-path Fourier transform infrared spectra. The spectra were measured near animal farms, and the path-integrated concentration of ammonia...
Jena, Subhransu S; Alexander, Mathew; Aaron, Sanjith; Mathew, Vivek; Thomas, Maya Mary; Patil, Anil K; Sivadasan, Ajith; Muthusamy, Karthik; Mani, Sunithi; Rebekah, J Grace
2015-01-01
Multiple sclerosis (MS) has a spectrum of heterogeneity, as seen in western and eastern hemispheres, in the clinical features, topography of involvement and differences in natural history. To study the clinical spectrum, imaging, and electrophysiological as well as cerebrospinal fluid (CSF) characteristics and correlate them with outcome. Retrospective analysis of MS patients during a period of 20 years. Cases were selected according to recent McDonald's criteria (2010), They were managed in the Department of Neurology, Christian Medical College, Vellore. Chi-square and Fisher's exact tests were used for categorical variables. Multiple binary logistic regressions were done to assess significance. Kaplan-Meier curves were drawn to estimate the time to irreversible disability. A total of 157 patients with female preponderance (55%) were included. The inter quartile range duration of follow-up was 9.1 (8.2, 11) years for 114 patients, who were included for final outcome analysis. Relapsing remitting MS (RRMS) (54.1%) was the most common type of MS seen. RRMS had a significantly better outcome (odds ratio: 0.12, 95% confidence interval: 0.02-0.57, P = 0.008) compared to progressive form of MS (primary progressive, secondary progressive). The Expanded Disability Status Scale score of patients at presentation and at final follow-up was 4.4 ± 1.31 and 4.1 ± 2.31, respectively. During the first presentation, polysymptomatic manifestations like motor and sphincteric involvement, incomplete recovery from the first attack; and, during the disease course, bowel, bladder, cerebellar and pyramidal affliction, predicted a worse outcome. A high incidence of optico-spinal presentation, predominance of RRMS and a low yield on cerebrospinal fluid (CSF) studies are the major findings of our study. A notable feature was the analysis of prognostic markers of disability.
Standardized Regression Coefficients as Indices of Effect Sizes in Meta-Analysis
ERIC Educational Resources Information Center
Kim, Rae Seon
2011-01-01
When conducting a meta-analysis, it is common to find many collected studies that report regression analyses, because multiple regression analysis is widely used in many fields. Meta-analysis uses effect sizes drawn from individual studies as a means of synthesizing a collection of results. However, indices of effect size from regression analyses…
Regression Analysis of Stage Variability for West-Central Florida Lakes
Sacks, Laura A.; Ellison, Donald L.; Swancar, Amy
2008-01-01
The variability in a lake's stage depends upon many factors, including surface-water flows, meteorological conditions, and hydrogeologic characteristics near the lake. An understanding of the factors controlling lake-stage variability for a population of lakes may be helpful to water managers who set regulatory levels for lakes. The goal of this study is to determine whether lake-stage variability can be predicted using multiple linear regression and readily available lake and basin characteristics defined for each lake. Regressions were evaluated for a recent 10-year period (1996-2005) and for a historical 10-year period (1954-63). Ground-water pumping is considered to have affected stage at many of the 98 lakes included in the recent period analysis, and not to have affected stage at the 20 lakes included in the historical period analysis. For the recent period, regression models had coefficients of determination (R2) values ranging from 0.60 to 0.74, and up to five explanatory variables. Standard errors ranged from 21 to 37 percent of the average stage variability. Net leakage was the most important explanatory variable in regressions describing the full range and low range in stage variability for the recent period. The most important explanatory variable in the model predicting the high range in stage variability was the height over median lake stage at which surface-water outflow would occur. Other explanatory variables in final regression models for the recent period included the range in annual rainfall for the period and several variables related to local and regional hydrogeology: (1) ground-water pumping within 1 mile of each lake, (2) the amount of ground-water inflow (by category), (3) the head gradient between the lake and the Upper Floridan aquifer, and (4) the thickness of the intermediate confining unit. Many of the variables in final regression models are related to hydrogeologic characteristics, underscoring the importance of ground-water exchange in controlling the stage of karst lakes in Florida. Regression equations were used to predict lake-stage variability for the recent period for 12 additional lakes, and the median difference between predicted and observed values ranged from 11 to 23 percent. Coefficients of determination for the historical period were considerably lower (maximum R2 of 0.28) than for the recent period. Reasons for these low R2 values are probably related to the small number of lakes (20) with stage data for an equivalent time period that were unaffected by ground-water pumping, the similarity of many of the lake types (large surface-water drainage lakes), and the greater uncertainty in defining historical basin characteristics. The lack of lake-stage data unaffected by ground-water pumping and the poor regression results obtained for that group of lakes limit the ability to predict natural lake-stage variability using this method in west-central Florida.
Correlation and simple linear regression.
Eberly, Lynn E
2007-01-01
This chapter highlights important steps in using correlation and simple linear regression to address scientific questions about the association of two continuous variables with each other. These steps include estimation and inference, assessing model fit, the connection between regression and ANOVA, and study design. Examples in microbiology are used throughout. This chapter provides a framework that is helpful in understanding more complex statistical techniques, such as multiple linear regression, linear mixed effects models, logistic regression, and proportional hazards regression.
Regression Model Term Selection for the Analysis of Strain-Gage Balance Calibration Data
NASA Technical Reports Server (NTRS)
Ulbrich, Norbert Manfred; Volden, Thomas R.
2010-01-01
The paper discusses the selection of regression model terms for the analysis of wind tunnel strain-gage balance calibration data. Different function class combinations are presented that may be used to analyze calibration data using either a non-iterative or an iterative method. The role of the intercept term in a regression model of calibration data is reviewed. In addition, useful algorithms and metrics originating from linear algebra and statistics are recommended that will help an analyst (i) to identify and avoid both linear and near-linear dependencies between regression model terms and (ii) to make sure that the selected regression model of the calibration data uses only statistically significant terms. Three different tests are suggested that may be used to objectively assess the predictive capability of the final regression model of the calibration data. These tests use both the original data points and regression model independent confirmation points. Finally, data from a simplified manual calibration of the Ames MK40 balance is used to illustrate the application of some of the metrics and tests to a realistic calibration data set.
Final height in elite male artistic gymnasts.
Georgopoulos, Neoklis A; Theodoropoulou, Anastasia; Roupas, Nikolaos D; Armeni, Anastasia K; Koukkou, Eftychia; Leglise, Michel; Markou, Kostas B
2012-01-01
Elite male artistic gymnasts (AG) are exposed to high levels of physical and psychological stress during adolescence and experience a significant late maturation in both linear growth and pubertal development. The aim of the present study was to determine the impact of intensive physical training on the adult final height in elite male AG. This study is unique in character, as all variables were measured on the field of competition. The study was prospective and longitudinal; however, the current analysis of data is cross-sectional. Data from 86 elite male AG were obtained during the gymnastics competitions of European and World Championships. Clinical evaluation included height and weight measurements, as well as assessment of pubic hair and genital development according to Tanner's stages of pubertal development. The laboratory investigation included determination of skeletal maturation. All athletes completed a questionnaire that included questions on personal (onset and intensity of training, number of competitions per year) and family data (paternal and maternal heights). Male AG were below the 50th percentile for both final height and weight. Elite male AG had final height standard deviation score (SDS) lower than their genetic predisposition. Final height SDS was correlated positively with target height SDS (r = 0.430, p < 0.001) and weight SDS (r = 0.477, p < 0.001) and negatively to the intensity of training (r = -0.252, p = 0.022). The main factors influencing final height, by multiple regression analysis were weight SDS (p < 0.001) and target height SDS (p = 0.003). In elite maleAG, final height falls short of genetic predisposition, still well within normal limits. Considering medical and psychological risks in general, and based on the results of this research project, the International Federation of Gymnastics has increased the age limit for participants in international gymnastics competitions by 1 year.
Factor analysis and multiple regression between topography and precipitation on Jeju Island, Korea
NASA Astrophysics Data System (ADS)
Um, Myoung-Jin; Yun, Hyeseon; Jeong, Chang-Sam; Heo, Jun-Haeng
2011-11-01
SummaryIn this study, new factors that influence precipitation were extracted from geographic variables using factor analysis, which allow for an accurate estimation of orographic precipitation. Correlation analysis was also used to examine the relationship between nine topographic variables from digital elevation models (DEMs) and the precipitation in Jeju Island. In addition, a spatial analysis was performed in order to verify the validity of the regression model. From the results of the correlation analysis, it was found that all of the topographic variables had a positive correlation with the precipitation. The relations between the variables also changed in accordance with a change in the precipitation duration. However, upon examining the correlation matrix, no significant relationship between the latitude and the aspect was found. According to the factor analysis, eight topographic variables (latitude being the exception) were found to have a direct influence on the precipitation. Three factors were then extracted from the eight topographic variables. By directly comparing the multiple regression model with the factors (model 1) to the multiple regression model with the topographic variables (model 3), it was found that model 1 did not violate the limits of statistical significance and multicollinearity. As such, model 1 was considered to be appropriate for estimating the precipitation when taking into account the topography. In the study of model 1, the multiple regression model using factor analysis was found to be the best method for estimating the orographic precipitation on Jeju Island.
Weather Impact on Airport Arrival Meter Fix Throughput
NASA Technical Reports Server (NTRS)
Wang, Yao
2017-01-01
Time-based flow management provides arrival aircraft schedules based on arrival airport conditions, airport capacity, required spacing, and weather conditions. In order to meet a scheduled time at which arrival aircraft can cross an airport arrival meter fix prior to entering the airport terminal airspace, air traffic controllers make regulations on air traffic. Severe weather may create an airport arrival bottleneck if one or more of airport arrival meter fixes are partially or completely blocked by the weather and the arrival demand has not been reduced accordingly. Under these conditions, aircraft are frequently being put in holding patterns until they can be rerouted. A model that predicts the weather impacted meter fix throughput may help air traffic controllers direct arrival flows into the airport more efficiently, minimizing arrival meter fix congestion. This paper presents an analysis of air traffic flows across arrival meter fixes at the Newark Liberty International Airport (EWR). Several scenarios of weather impacted EWR arrival fix flows are described. Furthermore, multiple linear regression and regression tree ensemble learning approaches for translating multiple sector Weather Impacted Traffic Indexes (WITI) to EWR arrival meter fix throughputs are examined. These weather translation models are developed and validated using the EWR arrival flight and weather data for the period of April-September in 2014. This study also compares the performance of the regression tree ensemble with traditional multiple linear regression models for estimating the weather impacted throughputs at each of the EWR arrival meter fixes. For all meter fixes investigated, the results from the regression tree ensemble weather translation models show a stronger correlation between model outputs and observed meter fix throughputs than that produced from multiple linear regression method.
Nguyen, Quynh C.; Osypuk, Theresa L.; Schmidt, Nicole M.; Glymour, M. Maria; Tchetgen Tchetgen, Eric J.
2015-01-01
Despite the recent flourishing of mediation analysis techniques, many modern approaches are difficult to implement or applicable to only a restricted range of regression models. This report provides practical guidance for implementing a new technique utilizing inverse odds ratio weighting (IORW) to estimate natural direct and indirect effects for mediation analyses. IORW takes advantage of the odds ratio's invariance property and condenses information on the odds ratio for the relationship between the exposure (treatment) and multiple mediators, conditional on covariates, by regressing exposure on mediators and covariates. The inverse of the covariate-adjusted exposure-mediator odds ratio association is used to weight the primary analytical regression of the outcome on treatment. The treatment coefficient in such a weighted regression estimates the natural direct effect of treatment on the outcome, and indirect effects are identified by subtracting direct effects from total effects. Weighting renders treatment and mediators independent, thereby deactivating indirect pathways of the mediators. This new mediation technique accommodates multiple discrete or continuous mediators. IORW is easily implemented and is appropriate for any standard regression model, including quantile regression and survival analysis. An empirical example is given using data from the Moving to Opportunity (1994–2002) experiment, testing whether neighborhood context mediated the effects of a housing voucher program on obesity. Relevant Stata code (StataCorp LP, College Station, Texas) is provided. PMID:25693776
A Statistical Multimodel Ensemble Approach to Improving Long-Range Forecasting in Pakistan
2012-03-01
Impact of global warming on monsoon variability in Pakistan. J. Anim. Pl. Sci., 21, no. 1, 107–110. Gillies, S., T. Murphree, and D. Meyer, 2012...are generated by multiple regression models that relate globally distributed oceanic and atmospheric predictors to local predictands. The...generated by multiple regression models that relate globally distributed oceanic and atmospheric predictors to local predictands. The predictands are
Suppression Situations in Multiple Linear Regression
ERIC Educational Resources Information Center
Shieh, Gwowen
2006-01-01
This article proposes alternative expressions for the two most prevailing definitions of suppression without resorting to the standardized regression modeling. The formulation provides a simple basis for the examination of their relationship. For the two-predictor regression, the author demonstrates that the previous results in the literature are…
Yang, Xiaowei; Nie, Kun
2008-03-15
Longitudinal data sets in biomedical research often consist of large numbers of repeated measures. In many cases, the trajectories do not look globally linear or polynomial, making it difficult to summarize the data or test hypotheses using standard longitudinal data analysis based on various linear models. An alternative approach is to apply the approaches of functional data analysis, which directly target the continuous nonlinear curves underlying discretely sampled repeated measures. For the purposes of data exploration, many functional data analysis strategies have been developed based on various schemes of smoothing, but fewer options are available for making causal inferences regarding predictor-outcome relationships, a common task seen in hypothesis-driven medical studies. To compare groups of curves, two testing strategies with good power have been proposed for high-dimensional analysis of variance: the Fourier-based adaptive Neyman test and the wavelet-based thresholding test. Using a smoking cessation clinical trial data set, this paper demonstrates how to extend the strategies for hypothesis testing into the framework of functional linear regression models (FLRMs) with continuous functional responses and categorical or continuous scalar predictors. The analysis procedure consists of three steps: first, apply the Fourier or wavelet transform to the original repeated measures; then fit a multivariate linear model in the transformed domain; and finally, test the regression coefficients using either adaptive Neyman or thresholding statistics. Since a FLRM can be viewed as a natural extension of the traditional multiple linear regression model, the development of this model and computational tools should enhance the capacity of medical statistics for longitudinal data.
NASA Astrophysics Data System (ADS)
Yoshida, Kenichiro; Nishidate, Izumi; Ojima, Nobutoshi; Iwata, Kayoko
2014-01-01
To quantitatively evaluate skin chromophores over a wide region of curved skin surface, we propose an approach that suppresses the effect of the shading-derived error in the reflectance on the estimation of chromophore concentrations, without sacrificing the accuracy of that estimation. In our method, we use multiple regression analysis, assuming the absorbance spectrum as the response variable and the extinction coefficients of melanin, oxygenated hemoglobin, and deoxygenated hemoglobin as the predictor variables. The concentrations of melanin and total hemoglobin are determined from the multiple regression coefficients using compensation formulae (CF) based on the diffuse reflectance spectra derived from a Monte Carlo simulation. To suppress the shading-derived error, we investigated three different combinations of multiple regression coefficients for the CF. In vivo measurements with the forearm skin demonstrated that the proposed approach can reduce the estimation errors that are due to shading-derived errors in the reflectance. With the best combination of multiple regression coefficients, we estimated that the ratio of the error to the chromophore concentrations is about 10%. The proposed method does not require any measurements or assumptions about the shape of the subjects; this is an advantage over other studies related to the reduction of shading-derived errors.
Byun, Bo-Ram; Kim, Yong-Il; Yamaguchi, Tetsutaro; Maki, Koutaro; Son, Woo-Sung
2015-01-01
This study was aimed to examine the correlation between skeletal maturation status and parameters from the odontoid process/body of the second vertebra and the bodies of third and fourth cervical vertebrae and simultaneously build multiple regression models to be able to estimate skeletal maturation status in Korean girls. Hand-wrist radiographs and cone beam computed tomography (CBCT) images were obtained from 74 Korean girls (6-18 years of age). CBCT-generated cervical vertebral maturation (CVM) was used to demarcate the odontoid process and the body of the second cervical vertebra, based on the dentocentral synchondrosis. Correlation coefficient analysis and multiple linear regression analysis were used for each parameter of the cervical vertebrae (P < 0.05). Forty-seven of 64 parameters from CBCT-generated CVM (independent variables) exhibited statistically significant correlations (P < 0.05). The multiple regression model with the greatest R (2) had six parameters (PH2/W2, UW2/W2, (OH+AH2)/LW2, UW3/LW3, D3, and H4/W4) as independent variables with a variance inflation factor (VIF) of <2. CBCT-generated CVM was able to include parameters from the second cervical vertebral body and odontoid process, respectively, for the multiple regression models. This suggests that quantitative analysis might be used to estimate skeletal maturation status.
ERIC Educational Resources Information Center
Crawford, John R.; Garthwaite, Paul H.; Denham, Annie K.; Chelune, Gordon J.
2012-01-01
Regression equations have many useful roles in psychological assessment. Moreover, there is a large reservoir of published data that could be used to build regression equations; these equations could then be employed to test a wide variety of hypotheses concerning the functioning of individual cases. This resource is currently underused because…
Violent video games and delinquent behavior in adolescents: A risk factor perspective.
Exelmans, Liese; Custers, Kathleen; Van den Bulck, Jan
2015-05-01
Over the years, criminological research has identified a number of risk factors that contribute to the development of aggressive and delinquent behavior. Although studies have identified media violence in general and violent video gaming in particular as significant predictors of aggressive behavior, exposure to violent video games has been largely omitted from the risk factor literature on delinquent behavior. This cross-sectional study therefore investigates the relationship between violent video game play and adolescents' delinquent behavior using a risk factor approach. An online survey was completed by 3,372 Flemish adolescents, aged 12-18 years old. Data were analyzed by means of negative binomial regression modelling. Results indicated a significant contribution of violent video games in delinquent behavior over and beyond multiple known risk variables (peer delinquency, sensation seeking, prior victimization, and alienation). Moreover, the final model that incorporated the gaming genres proved to be significantly better than the model without the gaming genres. Results provided support for a cumulative and multiplicative risk model for delinquent behavior. Aggr. Behav. 41:267-279, 2015. © 2015 Wiley Periodicals, Inc. © 2015 Wiley Periodicals, Inc.
Prevalence and predictors of dysphagia in Iranian patients with multiple sclerosis
Tarameshlu, Maryam; Azimi, Amir Reza; Ghelichi, Leila; Ansari, Noureddin Nakhostin
2017-01-01
Background: Dysphagia is frequently observed in patients with multiple sclerosis (MS). Dysphagia and its complications are common causes of morbidity and mortality in final stages of MS disease. This study aimed at determining the prevalence of dysphagia in Iranian patients with MS and identifying predictors associated with dysphagia. Methods: A total of 230 MS patients were enrolled in this cross-sectional study. Dysphagia was evaluated using Mann Assessment of Swallowing Ability (MASA). Demographic characteristics (age and gender), duration of the disease, disease course, and Expanded Disability Status Scale (EDSS) were recorded for all participants. Results: In total, dysphagia was found in 85 participants (37%) with mild to severe dysphagia (mild 50.6%; moderate 29.4%; and severe 20%). The logistic regression model demonstrated that disability status in EDSS (OR= 2.1; 95% CI 0.5-1.2) and disease duration (OR= 2.3; 95% CI 0.4-1.1) predicts a high risk for dysphagia in MS patients. Conclusion: Dysphagia is prevalent in Iranian patients with MS. Disability level and disease duration are significant predictors of dysphagia after MS.
Graffelman, Jan; van Eeuwijk, Fred
2005-12-01
The scatter plot is a well known and easily applicable graphical tool to explore relationships between two quantitative variables. For the exploration of relations between multiple variables, generalisations of the scatter plot are useful. We present an overview of multivariate scatter plots focussing on the following situations. Firstly, we look at a scatter plot for portraying relations between quantitative variables within one data matrix. Secondly, we discuss a similar plot for the case of qualitative variables. Thirdly, we describe scatter plots for the relationships between two sets of variables where we focus on correlations. Finally, we treat plots of the relationships between multiple response and predictor variables, focussing on the matrix of regression coefficients. We will present both known and new results, where an important original contribution concerns a procedure for the inclusion of scales for the variables in multivariate scatter plots. We provide software for drawing such scales. We illustrate the construction and interpretation of the plots by means of examples on data collected in a genomic research program on taste in tomato.
NASA Astrophysics Data System (ADS)
Varun, Sajja; Reddy, Kalakada Bhargav Bal; Vardhan Reddy, R. R. Vishnu
2016-09-01
In this research work, development of a multi response optimization technique has been undertaken, using traditional desirability analysis and non-traditional particle swarm optimization techniques (for different customer's priorities) in wire electrical discharge machining (WEDM). Monel 400 has been selected as work material for experimentation. The effect of key process parameters such as pulse on time (TON), pulse off time (TOFF), peak current (IP), wire feed (WF) were on material removal rate (MRR) and surface roughness(SR) in WEDM operation were investigated. Further, the responses such as MRR and SR were modelled empirically through regression analysis. The developed models can be used by the machinists to predict the MRR and SR over a wide range of input parameters. The optimization of multiple responses has been done for satisfying the priorities of multiple users by using Taguchi-desirability function method and particle swarm optimization technique. The analysis of variance (ANOVA) is also applied to investigate the effect of influential parameters. Finally, the confirmation experiments were conducted for the optimal set of machining parameters, and the betterment has been proved.
Svenson, Gary R; Ostergren, Per-Olof; Merlo, Juan; Råstam, Lennart
2002-12-01
The aim of this study was to gain an understanding of consistent condom use. We took the perspective that condom use involves the ability to handle situational risks influenced at multiple levels, including the individual, dyadic, and social. The hypothesis was that action control, as measured by self-regulation, implementation intentions, and self-efficacy, was the primary determinant. The study was conducted at part of a community-based intervention at a major university (36,000 students). Data was collected using a validated questionnaire mailed to a random sample of students (n = 493, response rate = 71.5%). Statistical analysis included logistic regression models that successively included background, individual, dyadic, and social variables. In the final model, consistent condom use was higher among students with strong implementation intentions, high self-regulation and positive peer norms. The results contribute new knowledge on action control in predicting sexual risk behaviors and lends support to the conceptualization and analysis of HIV/sexually transmitted infection prevention at multiple levels of influence.
Bitter Melon Reduces Head and Neck Squamous Cell Carcinoma Growth by Targeting c-Met Signaling
Nerurkar, Pratibha; Gonzalez, Juan G.; Crawford, Susan; Varvares, Mark; Ray, Ratna B.
2013-01-01
Head and neck squamous cell carcinoma (HNSCC) remains difficult to treat, and despite of advances in treatment, the overall survival rate has only modestly improved over the past several years. Thus, there is an urgent need for additional therapeutic modalities. We hypothesized that treatment of HNSCC cells with a dietary product such as bitter melon extract (BME) modulates multiple signaling pathways and regresses HNSCC tumor growth in a preclinical model. We observed a reduced cell proliferation in HNSCC cell lines. The mechanistic studies reveal that treatment of BME in HNSCC cells inhibited c-Met signaling pathway. We also observed that BME treatment in HNSCC reduced phosphoStat3, c-myc and Mcl-1 expression, downstream signaling molecules of c-Met. Furthermore, BME treatment in HNSCC cells modulated the expression of key cell cycle progression molecules leading to halted cell growth. Finally, BME feeding in mice bearing HNSCC xenograft tumor resulted in an inhibition of tumor growth and c-Met expression. Together, our results suggested that BME treatment in HNSCC cells modulates multiple signaling pathways and may have therapeutic potential for treating HNSCC. PMID:24147107
Winters, Eric R; Petosa, Rick L; Charlton, Thomas E
2003-06-01
To examine whether knowledge of high school students' actions of self-regulation, and perceptions of self-efficacy to overcome exercise barriers, social situation, and outcome expectation will predict non-school related moderate and vigorous physical exercise. High school students enrolled in introductory Physical Education courses completed questionnaires that targeted selected Social Cognitive Theory variables. They also self-reported their typical "leisure-time" exercise participation using a standardized questionnaire. Bivariate correlation statistic and hierarchical regression were conducted on reports of moderate and vigorous exercise frequency. Each predictor variable was significantly associated with measures of moderate and vigorous exercise frequency. All predictor variables were significant in the final regression model used to explain vigorous exercise. After controlling for the effects of gender, the psychosocial variables explained 29% of variance in vigorous exercise frequency. Three of four predictor variables were significant in the final regression equation used to explain moderate exercise. The final regression equation accounted for 11% of variance in moderate exercise frequency. Professionals who attempt to increase the prevalence of physical exercise through educational methods should focus on the psychosocial variables utilized in this study.
Functional Regression Models for Epistasis Analysis of Multiple Quantitative Traits.
Zhang, Futao; Xie, Dan; Liang, Meimei; Xiong, Momiao
2016-04-01
To date, most genetic analyses of phenotypes have focused on analyzing single traits or analyzing each phenotype independently. However, joint epistasis analysis of multiple complementary traits will increase statistical power and improve our understanding of the complicated genetic structure of the complex diseases. Despite their importance in uncovering the genetic structure of complex traits, the statistical methods for identifying epistasis in multiple phenotypes remains fundamentally unexplored. To fill this gap, we formulate a test for interaction between two genes in multiple quantitative trait analysis as a multiple functional regression (MFRG) in which the genotype functions (genetic variant profiles) are defined as a function of the genomic position of the genetic variants. We use large-scale simulations to calculate Type I error rates for testing interaction between two genes with multiple phenotypes and to compare the power with multivariate pairwise interaction analysis and single trait interaction analysis by a single variate functional regression model. To further evaluate performance, the MFRG for epistasis analysis is applied to five phenotypes of exome sequence data from the NHLBI's Exome Sequencing Project (ESP) to detect pleiotropic epistasis. A total of 267 pairs of genes that formed a genetic interaction network showed significant evidence of epistasis influencing five traits. The results demonstrate that the joint interaction analysis of multiple phenotypes has a much higher power to detect interaction than the interaction analysis of a single trait and may open a new direction to fully uncovering the genetic structure of multiple phenotypes.
Raj, Retheep; Sivanandan, K S
2017-01-01
Estimation of elbow dynamics has been the object of numerous investigations. In this work a solution is proposed for estimating elbow movement velocity and elbow joint angle from Surface Electromyography (SEMG) signals. Here the Surface Electromyography signals are acquired from the biceps brachii muscle of human hand. Two time-domain parameters, Integrated EMG (IEMG) and Zero Crossing (ZC), are extracted from the Surface Electromyography signal. The relationship between the time domain parameters, IEMG and ZC with elbow angular displacement and elbow angular velocity during extension and flexion of the elbow are studied. A multiple input-multiple output model is derived for identifying the kinematics of elbow. A Nonlinear Auto Regressive with eXogenous inputs (NARX) structure based multiple layer perceptron neural network (MLPNN) model is proposed for the estimation of elbow joint angle and elbow angular velocity. The proposed NARX MLPNN model is trained using Levenberg-marquardt based algorithm. The proposed model is estimating the elbow joint angle and elbow movement angular velocity with appreciable accuracy. The model is validated using regression coefficient value (R). The average regression coefficient value (R) obtained for elbow angular displacement prediction is 0.9641 and for the elbow anglular velocity prediction is 0.9347. The Nonlinear Auto Regressive with eXogenous inputs (NARX) structure based multiple layer perceptron neural networks (MLPNN) model can be used for the estimation of angular displacement and movement angular velocity of the elbow with good accuracy.
Upper extremity disorders in heavy industry workers in Greece.
Tsouvaltzidou, Thomaella; Alexopoulos, Evangelos; Fragkakis, Ioannis; Jelastopulu, Eleni
2017-06-18
To investigate the disability due to musculoskeletal disorders of the upper extremities in heavy industry workers. The population under study consisted of 802 employees, both white- and blue-collar, working in a shipyard industry in Athens, Greece. Data were collected through the distribution of questionnaires and the recording of individual and job-related characteristics during the period 2006-2009. The questionnaires used were the Quick Disabilities of the Arm, Shoulder and Hand (QD) Outcome Measure, the Work Ability Index (WAI) and the Short-Form-36 (SF-36) Health Survey. The QD was divided into three parameters - movement restrictions in everyday activities, work and sports/music activities - and the SF-36 into two items, physical and emotional. Multiple linear regression analysis was performed by means of the SPSS v.22 for Windows Statistical Package. The answers given by the participants for the QD did not reveal great discomfort regarding the execution of manual tasks, with the majority of the participants scoring under 5%, meaning no disability. After conducting multiple linear regression, age revealed a positive association with the parameter of restrictions in everyday activities (b = 0.64, P = 0.000). Basic education showed a statistically significant association regarding restrictions during leisure activities, with b = 2.140 ( P = 0.029) for compulsory education graduates. WAI's final score displayed negative charging in the regression analysis of all three parameters, with b = -0.142 ( P = 0.0), b = -0.099 ( P = 0.055) and b = -0.376 ( P = 0.001) respectively, while the physical and emotional components of SF-36 associated with movement restrictions only in daily activities and work. The participants' specialty made no statistically significant associations with any of the three parameters of the QD. Increased musculoskeletal disorders of the upper extremity are associated with older age, lower basic education and physical and mental/emotional health and reduced working ability.
Tardy, Claudine; Goffinet, Marine; Boubekeur, Nadia; Ackermann, Rose; Sy, Gavin; Bluteau, Alice; Cholez, Guy; Keyserling, Constance; Lalwani, Narendra; Paolini, John F; Dasseux, Jean-Louis; Barbaras, Ronald; Baron, Rudi
2014-01-01
CER-001 is a novel engineered HDL-mimetic comprised of recombinant human apoA-I and phospholipids that was designed to mimic the beneficial properties of nascent pre-β HDL. In this study, we have evaluated the capacity of CER-001 to perform reverse lipid transport in single dose studies as well as to regress atherosclerosis in LDLr(-/-) mice after short-term multiple-dose infusions. CER-001 induced cholesterol efflux from macrophages and exhibited anti-inflammatory response similar to natural HDL. Studies with HUVEC demonstrated CER-001 at a concentration of 500 μg/mL completely suppressed the secretion of cytokines IL-6, IL-8, GM-CSF and MCP-1. Following infusion of CER-001 (10mg/kg) in C57Bl/6J mice, we observed a transient increase in the mobilization of unesterified cholesterol in HDL particles containing recombinant human apoA-I. Finally we show that cholesterol elimination was stimulated in CER-001 treated animals as demonstrated by the increased cholesterol concentration in liver and feces. In a familial hypercholesterolemia mouse model (LDL-receptor deficient mice), the infusion of CER-001 caused 17% and 32% reductions in plaque size, 17% and 23% reductions in lipid content after 5 and 10 doses given every 2 days, respectively. Also, there was an 80% reduction in macrophage content in the plaque following 5 doses, and decreased VCAM-1 expression by 16% and 22% in the plaque following 5 and 10 intravenous doses of CER-001, respectively. These data demonstrate that CER-001 rapidly enhances reverse lipid transport in the mouse, reducing vascular inflammation and promoting regression of diet-induced atherosclerosis in LDLr(-/-) mice upon a short-term multiple dose treatment. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
Attributions and self-efficacy for physical activity in multiple sclerosis.
Nickel, D; Spink, K; Andersen, M; Knox, K
2014-01-01
Self-efficacy is an important predictor of health-related physical activity in multiple sclerosis (MS). While past experiences are believed to influence efficacy beliefs, the explanations individuals provide for these experiences also may be critical. Our objective was to test the hypothesis that perceived success or failure to accumulate 150 min of physical activity in the previous week would moderate the relationship between the attributional dimension of stability and self-efficacy to exercise in the future. Forty-two adults with MS participated in this cross-sectional descriptive study. Participants completed questions assessing physical activity, perceived outcome for meeting the recommended level of endurance activity, attributions for the outcome, and exercise self-efficacy. Results from hierarchical multiple regression revealed a significant main effect for perceived outcome predicting self-efficacy that was qualified by a significant interaction. The final model, which included perceived outcome, stability, and the interaction term, predicted 37% of the variance in exercise self-efficacy, F (3, 38) = 7.27, p = .001. Our findings suggest that the best prediction of self-efficacy in the MS population may include the interaction of specific attributional dimensions with success/failure at meeting the recommended physical activity dose. Attributions may be another target for interventions aimed at increasing the physical activity in MS.
Regression Commonality Analysis: A Technique for Quantitative Theory Building
ERIC Educational Resources Information Center
Nimon, Kim; Reio, Thomas G., Jr.
2011-01-01
When it comes to multiple linear regression analysis (MLR), it is common for social and behavioral science researchers to rely predominately on beta weights when evaluating how predictors contribute to a regression model. Presenting an underutilized statistical technique, this article describes how organizational researchers can use commonality…
Precision Efficacy Analysis for Regression.
ERIC Educational Resources Information Center
Brooks, Gordon P.
When multiple linear regression is used to develop a prediction model, sample size must be large enough to ensure stable coefficients. If the derivation sample size is inadequate, the model may not predict well for future subjects. The precision efficacy analysis for regression (PEAR) method uses a cross- validity approach to select sample sizes…
Rasmussen, Patrick P.; Gray, John R.; Glysson, G. Douglas; Ziegler, Andrew C.
2009-01-01
In-stream continuous turbidity and streamflow data, calibrated with measured suspended-sediment concentration data, can be used to compute a time series of suspended-sediment concentration and load at a stream site. Development of a simple linear (ordinary least squares) regression model for computing suspended-sediment concentrations from instantaneous turbidity data is the first step in the computation process. If the model standard percentage error (MSPE) of the simple linear regression model meets a minimum criterion, this model should be used to compute a time series of suspended-sediment concentrations. Otherwise, a multiple linear regression model using paired instantaneous turbidity and streamflow data is developed and compared to the simple regression model. If the inclusion of the streamflow variable proves to be statistically significant and the uncertainty associated with the multiple regression model results in an improvement over that for the simple linear model, the turbidity-streamflow multiple linear regression model should be used to compute a suspended-sediment concentration time series. The computed concentration time series is subsequently used with its paired streamflow time series to compute suspended-sediment loads by standard U.S. Geological Survey techniques. Once an acceptable regression model is developed, it can be used to compute suspended-sediment concentration beyond the period of record used in model development with proper ongoing collection and analysis of calibration samples. Regression models to compute suspended-sediment concentrations are generally site specific and should never be considered static, but they represent a set period in a continually dynamic system in which additional data will help verify any change in sediment load, type, and source.
Chlamydia trachomatis IgM seropositivity during pregnancy and assessment of its risk factors.
Rahman, M; Chowdhury, S B; Akhtar, N; Jahan, M; Jahan, M K; Jebunnahar, S
2014-01-01
The study was undertaken to determine socio-demographic and reproductive risk factors associated with Chlamydia trachomaties IgM seropositivity during pregnancy. This cross sectional comparative study was carried out in the obstetrics outdoor of Bangabandhu Sheikh Mujib Medical University (BSMMU), Dhaka, Bangladesh in collaboration with the department of Virology between the periods from July 2007 to December 2008. Pregnant women at their first visit to the hospital were approached consecutively and asked to complete a questionnaire and 2cc blood was collected from each subject for Chlamydia trachomatis IgM antibody testing using ELISA method. The study population was divided into two groups according to the presence and absence of serum Chlamydia trachomatis IgM antibody. Finally socio-demographic and reproductive risk factors were compared between the groups. Among 172 women the sero-prevalence of Chlamydia IgM was 41%. The multiple logistic regression model (step wise) finally extracted for characteristics correlated with seropositivity. Ten years or less (≤SSC) education (OR 2.6 95% CI 1.1to 5.9), history of adverse pregnancy outcome (OR 2.8 95% CI 1.2 to 6.5) and multiple sex partner of husband (OR 4.1 95% CI 1.2 to 14.8) were associated with chlamydia infection. The use of condom (OR 0.28 95% CI 0.12 to 0.63) was associated with decreased risk of infection. Chlamydia trachomatis infection during pregnancy is associated with risk factors on the basis of which selective screening can be done.
The prediction of intelligence in preschool children using alternative models to regression.
Finch, W Holmes; Chang, Mei; Davis, Andrew S; Holden, Jocelyn E; Rothlisberg, Barbara A; McIntosh, David E
2011-12-01
Statistical prediction of an outcome variable using multiple independent variables is a common practice in the social and behavioral sciences. For example, neuropsychologists are sometimes called upon to provide predictions of preinjury cognitive functioning for individuals who have suffered a traumatic brain injury. Typically, these predictions are made using standard multiple linear regression models with several demographic variables (e.g., gender, ethnicity, education level) as predictors. Prior research has shown conflicting evidence regarding the ability of such models to provide accurate predictions of outcome variables such as full-scale intelligence (FSIQ) test scores. The present study had two goals: (1) to demonstrate the utility of a set of alternative prediction methods that have been applied extensively in the natural sciences and business but have not been frequently explored in the social sciences and (2) to develop models that can be used to predict premorbid cognitive functioning in preschool children. Predictions of Stanford-Binet 5 FSIQ scores for preschool-aged children is used to compare the performance of a multiple regression model with several of these alternative methods. Results demonstrate that classification and regression trees provided more accurate predictions of FSIQ scores than does the more traditional regression approach. Implications of these results are discussed.
Optimization of fixture layouts of glass laser optics using multiple kernel regression.
Su, Jianhua; Cao, Enhua; Qiao, Hong
2014-05-10
We aim to build an integrated fixturing model to describe the structural properties and thermal properties of the support frame of glass laser optics. Therefore, (a) a near global optimal set of clamps can be computed to minimize the surface shape error of the glass laser optic based on the proposed model, and (b) a desired surface shape error can be obtained by adjusting the clamping forces under various environmental temperatures based on the model. To construct the model, we develop a new multiple kernel learning method and call it multiple kernel support vector functional regression. The proposed method uses two layer regressions to group and order the data sources by the weights of the kernels and the factors of the layers. Because of that, the influences of the clamps and the temperature can be evaluated by grouping them into different layers.
Prediction of anthropometric foot characteristics in children.
Morrison, Stewart C; Durward, Brian R; Watt, Gordon F; Donaldson, Malcolm D C
2009-01-01
The establishment of growth reference values is needed in pediatric practice where pathologic conditions can have a detrimental effect on the growth and development of the pediatric foot. This study aims to use multiple regression to evaluate the effects of multiple predictor variables (height, age, body mass, and gender) on anthropometric characteristics of the peripubescent foot. Two hundred children aged 9 to 12 years were recruited, and three anthropometric measurements of the pediatric foot were recorded (foot length, forefoot width, and navicular height). Multiple regression analysis was conducted, and coefficients for gender, height, and body mass all had significant relationships for the prediction of forefoot width and foot length (P < or = .05, r > or = 0.7). The coefficients for gender and body mass were not significant for the prediction of navicular height (P > or = .05), whereas height was (P < or = .05). Normative growth reference values and prognostic regression equations are presented for the peripubescent foot.
Birthweight Related Factors in Northwestern Iran: Using Quantile Regression Method.
Fallah, Ramazan; Kazemnejad, Anoshirvan; Zayeri, Farid; Shoghli, Alireza
2015-11-18
Birthweight is one of the most important predicting indicators of the health status in adulthood. Having a balanced birthweight is one of the priorities of the health system in most of the industrial and developed countries. This indicator is used to assess the growth and health status of the infants. The aim of this study was to assess the birthweight of the neonates by using quantile regression in Zanjan province. This analytical descriptive study was carried out using pre-registered (March 2010 - March 2012) data of neonates in urban/rural health centers of Zanjan province using multiple-stage cluster sampling. Data were analyzed using multiple linear regressions andquantile regression method and SAS 9.2 statistical software. From 8456 newborn baby, 4146 (49%) were female. The mean age of the mothers was 27.1±5.4 years. The mean birthweight of the neonates was 3104 ± 431 grams. Five hundred and seventy-three patients (6.8%) of the neonates were less than 2500 grams. In all quantiles, gestational age of neonates (p<0.05), weight and educational level of the mothers (p<0.05) showed a linear significant relationship with the i of the neonates. However, sex and birth rank of the neonates, mothers age, place of residence (urban/rural) and career were not significant in all quantiles (p>0.05). This study revealed the results of multiple linear regression and quantile regression were not identical. We strictly recommend the use of quantile regression when an asymmetric response variable or data with outliers is available.
Birthweight Related Factors in Northwestern Iran: Using Quantile Regression Method
Fallah, Ramazan; Kazemnejad, Anoshirvan; Zayeri, Farid; Shoghli, Alireza
2016-01-01
Introduction: Birthweight is one of the most important predicting indicators of the health status in adulthood. Having a balanced birthweight is one of the priorities of the health system in most of the industrial and developed countries. This indicator is used to assess the growth and health status of the infants. The aim of this study was to assess the birthweight of the neonates by using quantile regression in Zanjan province. Methods: This analytical descriptive study was carried out using pre-registered (March 2010 - March 2012) data of neonates in urban/rural health centers of Zanjan province using multiple-stage cluster sampling. Data were analyzed using multiple linear regressions andquantile regression method and SAS 9.2 statistical software. Results: From 8456 newborn baby, 4146 (49%) were female. The mean age of the mothers was 27.1±5.4 years. The mean birthweight of the neonates was 3104 ± 431 grams. Five hundred and seventy-three patients (6.8%) of the neonates were less than 2500 grams. In all quantiles, gestational age of neonates (p<0.05), weight and educational level of the mothers (p<0.05) showed a linear significant relationship with the i of the neonates. However, sex and birth rank of the neonates, mothers age, place of residence (urban/rural) and career were not significant in all quantiles (p>0.05). Conclusion: This study revealed the results of multiple linear regression and quantile regression were not identical. We strictly recommend the use of quantile regression when an asymmetric response variable or data with outliers is available. PMID:26925889
Contribution of neurocognition to 18-month employment outcomes in first-episode psychosis.
Karambelas, George J; Cotton, Sue M; Farhall, John; Killackey, Eóin; Allott, Kelly A
2017-10-27
To examine whether baseline neurocognition predicts vocational outcomes over 18 months in patients with first-episode psychosis enrolled in a randomized controlled trial of Individual Placement and Support or treatment as usual. One-hundred and thirty-four first-episode psychosis participants completed an extensive neurocognitive battery. Principal axis factor analysis using PROMAX rotation was used to determine the underlying structure of the battery. Setwise (hierarchical) multiple linear and logistic regressions were used to examine predictors of (1) total hours employed over 18 months and (2) employment status, respectively. Neurocognition factors were entered in the models after accounting for age, gender, premorbid IQ, negative symptoms, treatment group allocation and employment status at baseline. Five neurocognitive factors were extracted: (1) processing speed, (2) verbal learning and memory, (3) knowledge and reasoning, (4) attention and working memory and (5) visual organization and memory. Employment status over 18 months was not significantly predicted by any of the predictors in the final model. Total hours employed over 18 months were significantly predicted by gender (P = .027), negative symptoms (P = .032) and verbal learning and memory (P = .040). Every step of the regression model was a significant predictor of total hours worked overall (final model: P = .013). Verbal learning and memory, negative symptoms and gender were implicated in duration of employment in first-episode psychosis. The other neurocognitive domains did not significantly contribute to the prediction of vocational outcomes over 18 months. Interventions targeting verbal memory may improve vocational outcomes in early psychosis. © 2017 John Wiley & Sons Australia, Ltd.
Woodhouse, Lisa J; Manning, Lisa; Potter, John F; Berge, Eivind; Sprigg, Nikola; Wardlaw, Joanna; Lees, Kennedy R; Bath, Philip M; Robinson, Thompson G
2017-05-01
Over 50% of patients are already taking blood pressure-lowering therapy on hospital admission for acute stroke. An individual patient data meta-analysis from randomized controlled trials was undertaken to determine the effect of continuation versus temporarily stopping preexisting antihypertensive medication in acute stroke. Key databases were searched for trials against the following inclusion criteria: randomized design; stroke onset ≤48 hours; investigating the effect of continuation versus stopping prestroke antihypertensive medication; and follow-up of ≥2 weeks. Two randomized controlled trials were identified and included in this meta-analysis of individual patient data from 2860 patients with ≤48 hours of acute stroke. Risk of bias in each study was low. In adjusted logistic regression and multiple regression analyses (using random effects), we found no significant association between continuation of prestroke antihypertensive therapy (versus stopping) and risk of death or dependency at final follow-up: odds ratio 0.96 (95% confidence interval, 0.80-1.14). No significant associations were found between continuation (versus stopping) of therapy and secondary outcomes at final follow-up. Analyses for death and dependency in prespecified subgroups revealed no significant associations with continuation versus temporarily stopping therapy, with the exception of patients randomized ≤12 hours, in whom a difference favoring stopping treatment met statistical significance. We found no significant benefit with continuation of antihypertensive treatment in the acute stroke period. Therefore, there is no urgency to administer preexisting antihypertensive therapy in the first few hours or days after stroke, unless indicated for other comorbid conditions. © 2017 American Heart Association, Inc.
Weighted regression analysis and interval estimators
Donald W. Seegrist
1974-01-01
A method for deriving the weighted least squares estimators for the parameters of a multiple regression model. Confidence intervals for expected values, and prediction intervals for the means of future samples are given.
Stroop Color-Word Interference Test: Normative data for Spanish-speaking pediatric population.
Rivera, D; Morlett-Paredes, A; Peñalver Guia, A I; Irías Escher, M J; Soto-Añari, M; Aguayo Arelis, A; Rute-Pérez, S; Rodríguez-Lorenzana, A; Rodríguez-Agudelo, Y; Albaladejo-Blázquez, N; García de la Cadena, C; Ibáñez-Alfonso, J A; Rodriguez-Irizarry, W; García-Guerrero, C E; Delgado-Mejía, I D; Padilla-López, A; Vergara-Moragues, E; Barrios Nevado, M D; Saracostti Schwartzman, M; Arango-Lasprilla, J C
2017-01-01
To generate normative data for the Stroop Word-Color Interference test in Spanish-speaking pediatric populations. The sample consisted of 4,373 healthy children from nine countries in Latin America (Chile, Cuba, Ecuador, Guatemala, Honduras, Mexico, Paraguay, Peru, and Puerto Rico) and Spain. Each participant was administered the Stroop Word-Color Interference test as part of a larger neuropsychological battery. The Stroop Word, Stroop Color, Stroop Word-Color, and Stroop Interference scores were normed using multiple linear regressions and standard deviations of residual values. Age, age2, sex, and mean level of parental education (MLPE) were included as predictors in the analyses. The final multiple linear regression models showed main effects for age on all scores, except on Stroop Interference for Guatemala, such that scores increased linearly as a function of age. Age2 affected Stroop Word scores for all countries, Stroop Color scores for Ecuador, Mexico, Peru, and Spain; Stroop Word-Color scores for Ecuador, Mexico, and Paraguay; and Stroop Interference scores for Cuba, Guatemala, and Spain. MLPE affected Stroop Word scores for Chile, Mexico, and Puerto Rico; Stroop Color scores for Mexico, Puerto Rico, and Spain; Stroop Word-Color scores for Ecuador, Guatemala, Mexico, Puerto Rico and Spain; and Stroop-Interference scores for Ecuador, Mexico, and Spain. Sex affected Stroop Word scores for Spain, Stroop Color scores for Mexico, and Stroop Interference for Honduras. This is the largest Spanish-speaking pediatric normative study in the world, and it will allow neuropsychologists from these countries to have a more accurate approach to interpret the Stroop Word-Color Interference test in pediatric populations.
Predictors of Early Reading Skill in 5-Year-Old Children With Hearing Loss Who Use Spoken Language
Ching, Teresa Y.C.; Crowe, Kathryn; Day, Julia; Seeto, Mark
2013-01-01
This research investigated the concurrent association between early reading skills and phonological awareness (PA), print knowledge, language, cognitive, and demographic variables in 101 5-year-old children with prelingual hearing losses ranging from mild to profound who communicated primarily using spoken language. All participants were fitted with hearing aids (n = 71) or cochlear implants (n = 30). They completed standardized assessments of PA, receptive vocabulary, letter knowledge, word and non-word reading, passage comprehension, math reasoning, and nonverbal cognitive ability. Multiple regressions revealed that PA (assessed using judgments of similarity based on words’ initial or final sounds) made a significant, independent contribution to children’s early reading ability (for both letters and words/non-words) after controlling for variation in receptive vocabulary, nonverbal cognitive ability, and a range of demographic variables (including gender, degree of hearing loss, communication mode, type of sensory device, age at fitting of sensory devices, and level of maternal education). Importantly, the relationship between PA and reading was specific to reading and did not generalize to another academic ability, math reasoning. Additional multiple regressions showed that letter knowledge (names or sounds) was superior in children whose mothers had undertaken post-secondary education, and that better receptive vocabulary was associated with less severe hearing loss, use of a cochlear implant, and earlier age at implant switch-on. Earlier fitting of hearing aids or cochlear implants was not, however, significantly associated with better PA or reading outcomes in this cohort of children, most of whom were fitted with sensory devices before 3 years of age. PMID:24563553
Myung, Seung-Kwon; Seo, Hong Gwan; Cheong, Yoo-Seock; Park, Sohee; Lee, Wonkyong B; Fong, Geoffrey T
2012-01-01
Background Few studies have reported the factors associated with intention to quit smoking among Korean adult smokers. This study aimed to examine sociodemographic characteristics, smoking-related beliefs, and smoking-restriction variables associated with intention to quit smoking among Korean adult smokers. Methods We used data from the International Tobacco Control Korea Survey, which was conducted from November through December 2005 by using random-digit dialing and computer-assisted telephone interviewing of male and female smokers aged 19 years or older in 16 metropolitan areas and provinces of Korea. We performed univariate analysis and multiple logistic regression analysis to identify predictors of intention to quit. Results A total of 995 respondents were included in the final analysis. Of those, 74.9% (n = 745) intended to quit smoking. In univariate analyses, smokers with an intention to quit were younger, smoked fewer cigarettes per day, had a higher annual income, were more educated, were more likely to have a religious affiliation, drank less alcohol per week, were less likely to have self-exempting beliefs, and were more likely to have self-efficacy beliefs regarding quitting, to believe that smoking had damaged their health, and to report that smoking was never allowed anywhere in their home. In multiple logistic regression analysis, higher education level, having a religious affiliation, and a higher self-efficacy regarding quitting were significantly associated with intention to quit. Conclusions Sociodemographic factors, smoking-related beliefs, and smoking restrictions at home were associated with intention to quit smoking among Korean adults. PMID:22186157
Myung, Seung-Kwon; Seo, Hong Gwan; Cheong, Yoo-Seock; Park, Sohee; Lee, Wonkyong B; Fong, Geoffrey T
2012-01-01
Few studies have reported the factors associated with intention to quit smoking among Korean adult smokers. This study aimed to examine sociodemographic characteristics, smoking-related beliefs, and smoking-restriction variables associated with intention to quit smoking among Korean adult smokers. We used data from the International Tobacco Control Korea Survey, which was conducted from November through December 2005 by using random-digit dialing and computer-assisted telephone interviewing of male and female smokers aged 19 years or older in 16 metropolitan areas and provinces of Korea. We performed univariate analysis and multiple logistic regression analysis to identify predictors of intention to quit. A total of 995 respondents were included in the final analysis. Of those, 74.9% (n = 745) intended to quit smoking. In univariate analyses, smokers with an intention to quit were younger, smoked fewer cigarettes per day, had a higher annual income, were more educated, were more likely to have a religious affiliation, drank less alcohol per week, were less likely to have self-exempting beliefs, and were more likely to have self-efficacy beliefs regarding quitting, to believe that smoking had damaged their health, and to report that smoking was never allowed anywhere in their home. In multiple logistic regression analysis, higher education level, having a religious affiliation, and a higher self-efficacy regarding quitting were significantly associated with intention to quit. Sociodemographic factors, smoking-related beliefs, and smoking restrictions at home were associated with intention to quit smoking among Korean adults.
NASA Astrophysics Data System (ADS)
Andreasen, Daniel; Edmund, Jens M.; Zografos, Vasileios; Menze, Bjoern H.; Van Leemput, Koen
2016-03-01
In radiotherapy treatment planning that is only based on magnetic resonance imaging (MRI), the electron density information usually obtained from computed tomography (CT) must be derived from the MRI by synthesizing a so-called pseudo CT (pCT). This is a non-trivial task since MRI intensities are neither uniquely nor quantitatively related to electron density. Typical approaches involve either a classification or regression model requiring specialized MRI sequences to solve intensity ambiguities, or an atlas-based model necessitating multiple registrations between atlases and subject scans. In this work, we explore a machine learning approach for creating a pCT of the pelvic region from conventional MRI sequences without using atlases. We use a random forest provided with information about local texture, edges and spatial features derived from the MRI. This helps to solve intensity ambiguities. Furthermore, we use the concept of auto-context by sequentially training a number of classification forests to create and improve context features, which are finally used to train a regression forest for pCT prediction. We evaluate the pCT quality in terms of the voxel-wise error and the radiologic accuracy as measured by water-equivalent path lengths. We compare the performance of our method against two baseline pCT strategies, which either set all MRI voxels in the subject equal to the CT value of water, or in addition transfer the bone volume from the real CT. We show an improved performance compared to both baseline pCTs suggesting that our method may be useful for MRI-only radiotherapy.
Kobuse, Hiroe; Morishima, Toshitaka; Tanaka, Masayuki; Murakami, Genki; Hirose, Masahiro; Imanaka, Yuichi
2014-06-01
To develop a reliable and valid questionnaire that can distinguish features of organizational culture for patient safety across subgroups such as hospitals, professions, management/non-management positions and units/wards. We developed a Hospital Organizational Culture Questionnaire based on a conceptual framework incorporating items from a review of existing literature. The questionnaire was administered to hospital staff including doctors, nurses, allied health personnel, and administrative staff at six public hospitals in Japan. Reliability and validity were assessed through exploratory factor analysis, multitrait scaling analysis, Cronbach's alpha coefficient and multiple regression analysis using staff-perceived achievement of safety as the response variable. Discriminative power across subgroups was assessed with radar chart profiling. Of the 3304 hospital staff surveyed, 2924 (88.5%) responded. After exploratory factor analysis and multitrait analysis, the finalized questionnaire was composed of 24 items in the following eight dimensions: improvement orientation, passion for mission, professional growth, resource allocation prioritization, inter-sectional collaboration, responsibility and authority, teamwork, and information sharing. Construct validity and internal consistency of dimensions were confirmed with multitrait analysis and Cronbach's alpha coefficients, respectively. Multiple regression analysis showed that improvement orientation, passion for mission, resource allocation prioritization and information sharing were significantly associated with higher achievement in safety practices. Our questionnaire tool was able to distinguish features of safety culture among different subgroups. Our questionnaire demonstrated excellent validity and reliability, and revealed distinct cultural patterns among different subgroups. Quantitative assessment of organizational safety culture with this tool may further the understanding of associated characteristics of each subgroup and provide insight into organizational readiness for patient safety improvement. © 2014 John Wiley & Sons, Ltd.
Modeling the impact of COPD on the brain.
Borson, Soo; Scanlan, James; Friedman, Seth; Zuhr, Elizabeth; Fields, Julie; Aylward, Elizabeth; Mahurin, Rodney; Richards, Todd; Anzai, Yoshimi; Yukawa, Michi; Yeh, Shingshing
2008-01-01
Previous studies have shown that COPD adversely affects distant organs and body systems, including the brain. This pilot study aims to model the relationships between respiratory insufficiency and domains related to brain function, including low mood, subtly impaired cognition, systemic inflammation, and brain structural and neurochemical abnormalities. Nine healthy controls were compared with 18 age- and education-matched medically stable-COPD patients, half of whom were oxygen-dependent. Measures included depression, anxiety, cognition, health status, spirometry, oximetry at rest and during 6-minute walk, and resting plasma cytokines and soluble receptors, brain MRI, and MR spectroscopy in regions relevant to mood and cognition. ANOVA was used to compare controls with patients and with COPD subgroups (oxygen users [n = 9] and nonusers [n = 9]), and only variables showing group differences at p < or = 0.05 were included in multiple regressions controlling for age, gender, and education to develop the final model. Controls and COPD patients differed significantly in global cognition and memory, mood, and soluble TNFR1 levels but not brain structural or neurochemical measures. Multiple regressions identified pathways linking disease severity with impaired performance on sensitive cognitive processing measures, mediated through oxygen dependence, and with systemic inflammation (TNFR1), related through poor 6-minute walk performance. Oxygen desaturation with activity was related to indicators of brain tissue damage (increased frontal choline, which in turn was associated with subcortical white matter attenuation). This empirically derived model provides a conceptual framework for future studies of clinical interventions to protect the brain in patients with COPD, such as earlier oxygen supplementation for patients with desaturation during everyday activities.
Modeling the impact of COPD on the brain
Borson, Soo; Scanlan, James; Friedman, Seth; Zuhr, Elizabeth; Fields, Julie; Aylward, Elizabeth; Mahurin, Rodney; Richards, Todd; Anzai, Yoshimi; Yukawa, Michi; Yeh, Shingshing
2008-01-01
Previous studies have shown that COPD adversely affects distant organs and body systems, including the brain. This pilot study aims to model the relationships between respiratory insufficiency and domains related to brain function, including low mood, subtly impaired cognition, systemic inflammation, and brain structural and neurochemical abnormalities. Nine healthy controls were compared with 18 age- and education-matched medically stable COPD patients, half of whom were oxygen-dependent. Measures included depression, anxiety, cognition, health status, spirometry, oximetry at rest and during 6-minute walk, and resting plasma cytokines and soluble receptors, brain MRI, and MR spectroscopy in regions relevant to mood and cognition. ANOVA was used to compare controls with patients and with COPD subgroups (oxygen users [n = 9] and nonusers [n = 9]), and only variables showing group differences at p ≤ 0.05 were included in multiple regressions controlling for age, gender, and education to develop the final model. Controls and COPD patients differed significantly in global cognition and memory, mood, and soluble TNFR1 levels but not brain structural or neurochemical measures. Multiple regressions identified pathways linking disease severity with impaired performance on sensitive cognitive processing measures, mediated through oxygen dependence, and with systemic inflammation (TNFR1), related through poor 6-minute walk performance. Oxygen desaturation with activity was related to indicators of brain tissue damage (increased frontal choline, which in turn was associated with subcortical white matter attenuation). This empirically derived model provides a conceptual framework for future studies of clinical interventions to protect the brain in patients with COPD, such as earlier oxygen supplementation for patients with desaturation during everyday activities. PMID:18990971
Chen, Ying-Jen; Ho, Meng-Yang; Chen, Kwan-Ju; Hsu, Chia-Fen; Ryu, Shan-Jin
2009-08-01
The aims of the present study were to (i) investigate if traditional Chinese word reading ability can be used for estimating premorbid general intelligence; and (ii) to provide multiple regression equations for estimating premorbid performance on Raven's Standard Progressive Matrices (RSPM), using age, years of education and Chinese Graded Word Reading Test (CGWRT) scores as predictor variables. Four hundred and twenty-six healthy volunteers (201 male, 225 female), aged 16-93 years (mean +/- SD, 41.92 +/- 18.19 years) undertook the tests individually under supervised conditions. Seventy percent of subjects were randomly allocated to the derivation group (n = 296), and the rest to the validation group (n = 130). RSPM score was positively correlated with CGWRT score and years of education. RSPM and CGWRT scores and years of education were also inversely correlated with age, but the declining trend for RSPM performance against age was steeper than that for CGWRT performance. Separate multiple regression equations were derived for estimating RSPM scores using different combinations of age, years of education, and CGWRT score for both groups. The multiple regression coefficient of each equation ranged from 0.71 to 0.80 with the standard error of estimate between 7 and 8 RSPM points. When fitting the data of one group to the equations derived from its counterpart group, the cross-validation multiple regression coefficients ranged from 0.71 to 0.79. There were no significant differences in the 'predicted-obtained' RSPM discrepancies between any equations. The regression equations derived in the present study may provide a basis for estimating premorbid RSPM performance.
Tay, Cheryl Sihui; Sterzing, Thorsten; Lim, Chen Yen; Ding, Rui; Kong, Pui Wah
2017-05-01
This study examined (a) the strength of four individual footwear perception factors to influence the overall preference of running shoes and (b) whether these perception factors satisfied the nonmulticollinear assumption in a regression model. Running footwear must fulfill multiple functional criteria to satisfy its potential users. Footwear perception factors, such as fit and cushioning, are commonly used to guide shoe design and development, but it is unclear whether running-footwear users are able to differentiate one factor from another. One hundred casual runners assessed four running shoes on a 15-cm visual analogue scale for four footwear perception factors (fit, cushioning, arch support, and stability) as well as for overall preference during a treadmill running protocol. Diagnostic tests showed an absence of multicollinearity between factors, where values for tolerance ranged from .36 to .72, corresponding to variance inflation factors of 2.8 to 1.4. The multiple regression model of these four footwear perception variables accounted for 77.7% to 81.6% of variance in overall preference, with each factor explaining a unique part of the total variance. Casual runners were able to rate each footwear perception factor separately, thus assigning each factor a true potential to improve overall preference for the users. The results also support the use of a multiple regression model of footwear perception factors to predict overall running shoe preference. Regression modeling is a useful tool for running-shoe manufacturers to more precisely evaluate how individual factors contribute to the subjective assessment of running footwear.
A population-based study on the association between rheumatoid arthritis and voice problems.
Hah, J Hun; An, Soo-Youn; Sim, Songyong; Kim, So Young; Oh, Dong Jun; Park, Bumjung; Kim, Sung-Gyun; Choi, Hyo Geun
2016-07-01
The objective of this study was to investigate whether rheumatoid arthritis increases the frequency of organic laryngeal lesions and the subjective voice complaint rate in those with no organic laryngeal lesion. We performed a cross-sectional study using the data from 19,368 participants (418 rheumatoid arthritis patients and 18,950 controls) of the 2008-2011 Korea National Health and Nutrition Examination Survey. The associations between rheumatoid arthritis and organic laryngeal lesions/subjective voice complaints were analyzed using simple/multiple logistic regression analysis with complex sample adjusting for confounding factors, including age, sex, smoking status, stress level, and body mass index, which could provoke voice problems. Vocal nodules, vocal polyp, and vocal palsy were not associated with rheumatoid arthritis in a multiple regression analysis, and only laryngitis showed a positive association (adjusted odds ratio, 1.59; 95 % confidence interval, 1.01-2.52; P = 0.047). Rheumatoid arthritis was associated with subjective voice discomfort in a simple regression analysis, but not in a multiple regression analysis. Participants with rheumatoid arthritis were older, more often female, and had higher stress levels than those without rheumatoid arthritis. These factors were associated with subjective voice complaints in both simple and multiple regression analyses. Rheumatoid arthritis was not associated with organic laryngeal diseases except laryngitis. Rheumatoid arthritis did not increase the odds ratio for subjective voice complaints. Voice problems in participants with rheumatoid arthritis originated from the characteristics of the rheumatoid arthritis group (higher mean age, female sex, and stress level) rather than rheumatoid arthritis itself.
Predicting MHC-II binding affinity using multiple instance regression
EL-Manzalawy, Yasser; Dobbs, Drena; Honavar, Vasant
2011-01-01
Reliably predicting the ability of antigen peptides to bind to major histocompatibility complex class II (MHC-II) molecules is an essential step in developing new vaccines. Uncovering the amino acid sequence correlates of the binding affinity of MHC-II binding peptides is important for understanding pathogenesis and immune response. The task of predicting MHC-II binding peptides is complicated by the significant variability in their length. Most existing computational methods for predicting MHC-II binding peptides focus on identifying a nine amino acids core region in each binding peptide. We formulate the problems of qualitatively and quantitatively predicting flexible length MHC-II peptides as multiple instance learning and multiple instance regression problems, respectively. Based on this formulation, we introduce MHCMIR, a novel method for predicting MHC-II binding affinity using multiple instance regression. We present results of experiments using several benchmark datasets that show that MHCMIR is competitive with the state-of-the-art methods for predicting MHC-II binding peptides. An online web server that implements the MHCMIR method for MHC-II binding affinity prediction is freely accessible at http://ailab.cs.iastate.edu/mhcmir. PMID:20855923
Burgette, Lane F; Reiter, Jerome P
2013-06-01
Multinomial outcomes with many levels can be challenging to model. Information typically accrues slowly with increasing sample size, yet the parameter space expands rapidly with additional covariates. Shrinking all regression parameters towards zero, as often done in models of continuous or binary response variables, is unsatisfactory, since setting parameters equal to zero in multinomial models does not necessarily imply "no effect." We propose an approach to modeling multinomial outcomes with many levels based on a Bayesian multinomial probit (MNP) model and a multiple shrinkage prior distribution for the regression parameters. The prior distribution encourages the MNP regression parameters to shrink toward a number of learned locations, thereby substantially reducing the dimension of the parameter space. Using simulated data, we compare the predictive performance of this model against two other recently-proposed methods for big multinomial models. The results suggest that the fully Bayesian, multiple shrinkage approach can outperform these other methods. We apply the multiple shrinkage MNP to simulating replacement values for areal identifiers, e.g., census tract indicators, in order to protect data confidentiality in public use datasets.
Abdelhamid, Mahmoud; Mosharafa, Ashraf A; Ibrahim, Hamdy; Selim, Hany M; Hamed, Mohamed; Elghoneimy, Mohamed N; Salem, Hosny K; Abdelazim, Mohamed S; Badawy, Hesham
2016-11-01
To evaluate the ability of noncontrast CT parameters (stone size, stone attenuation, and skin-to-stone distance [SSD]) to predict the outcome of extracorporeal shockwave lithotripsy (SWL) in a prospective cohort of patients with renal and upper ureteric stones. Patients with stones 5 to 20 mm were prospectively enrolled from 2011 to 2014. Patients had NCCT with recording of stone size, stone mean attenuation, and SSD, as well as various stone and patient parameters. The numbers of needed sessions as well as the final outcome were determined, with SWL failure defined as residual fragments >3 mm. Predictors of SWL failure were assessed by multiple regression analysis. Two hundred twenty patients (mean ± standard deviation [SD] age 41.5 ± 12.4 years) underwent SWL. Mean ± SD stone size was 11.3 ± 4.1 mm, while mean ± SD stone attenuation was 795.1 ± 340.4 HU. Mean ± SD SSD was 9.4 ± 2.1 cm. The average number of sessions was 1.64. SWL was effective in 186 (84.5%) patients (group A), while 34 (15.5%) patients had significant residual fragments (>3 mm). On univariate analysis, predictors of SWL failure included stone attenuation >1000 HU, older age, higher body mass index, higher attenuation value, larger stone size, and longer SSD. Increased SSD and higher stone attenuation retained their significance as independent predictors of SWL failure (p < 0.05) on multiple regression analysis both after first session and as final SWL outcome. A positive correlation was found between number of SWL sessions and mean stone attenuation (r = 0.6, p < 0.001) and SSD (r = 4, p < 0.001). Stone mean attenuation and SSD on noncontrast CT are significant independent predictors of SWL outcome in patients with renal and ureteric stones. These parameters should be included in clinical decision algorithms for patients with urolithiasis. For patients with stones having mean attenuation of >1000 HU and/or large SSDs, alternatives to SWL should be considered.
Li, Qian-Qian; Zhang, Da-Jun; Guo, Lan-Ting; Feng, Zheng-Zhi; Wu, Ming-Xia
2007-09-01
To explore the status and influencing factors on anxiety sensitivity among middle school students in Chongqing. 58 classes from 12 schools were randomly selected in four administrative districts of Chongqing city. A total number of 2700 students was included for final analysis including 48.5% from junior high school and 51.5% from senior high school students with 49.2% boys and 50.8% girls. The Chinese version of the Anxiety Sensitivity Index-Revision, Adolescent Self-Rating Life Events Check List (ASLEC) and State-Trait Anxiety Inventory (STAI) were used. (1) There was no significant difference between grade groups (P = 0.49). (2) The level of girl's anxiety sensitivity was always higher than boy's (P < 0.001). (3) Data from multiple linear regression showed that the influential factors to the degree of anxiety sensitivity were: state of anxiety, trait anxiety, life events, sex, stress from learning, etc (standard coefficients of regression were 0.258, 0.163, 0.112, 0.093, 0.124, -0.096, 0.096). The major influential factors of anxiety sensitivity would include: sex, stress from learning, life events, interpersonal relationship, state of anxiety and trait anxiety.
Cuddy, L L; Thompson, W F
1992-01-01
In a probe-tone experiment, two groups of listeners--one trained, the other untrained, in traditional music theory--rated the goodness of fit of each of the 12 notes of the chromatic scale to four-voice harmonic sequences. Sequences were 12 simplified excerpts from Bach chorales, 4 nonmodulating, and 8 modulating. Modulations occurred either one or two steps in either the clockwise or the counterclockwise direction on the cycle of fifths. A consistent pattern of probe-tone ratings was obtained for each sequence, with no significant differences between listener groups. Two methods of analysis (Fourier analysis and regression analysis) revealed a directional asymmetry in the perceived key movement conveyed by modulating sequences. For a given modulation distance, modulations in the counterclockwise direction effected a clearer shift in tonal organization toward the final key than did clockwise modulations. The nature of the directional asymmetry was consistent with results reported for identification and rating of key change in the sequences (Thompson & Cuddy, 1989a). Further, according to the multiple-regression analysis, probe-tone ratings did not merely reflect the distribution of tones in the sequence. Rather, ratings were sensitive to the temporal structure of the tonal organization in the sequence.
Zhong-xiang, Feng; Shi-sheng, Lu; Wei-hua, Zhang; Nan-nan, Zhang
2014-01-01
In order to build a combined model which can meet the variation rule of death toll data for road traffic accidents and can reflect the influence of multiple factors on traffic accidents and improve prediction accuracy for accidents, the Verhulst model was built based on the number of death tolls for road traffic accidents in China from 2002 to 2011; and car ownership, population, GDP, highway freight volume, highway passenger transportation volume, and highway mileage were chosen as the factors to build the death toll multivariate linear regression model. Then the two models were combined to be a combined prediction model which has weight coefficient. Shapley value method was applied to calculate the weight coefficient by assessing contributions. Finally, the combined model was used to recalculate the number of death tolls from 2002 to 2011, and the combined model was compared with the Verhulst and multivariate linear regression models. The results showed that the new model could not only characterize the death toll data characteristics but also quantify the degree of influence to the death toll by each influencing factor and had high accuracy as well as strong practicability. PMID:25610454
Feng, Zhong-xiang; Lu, Shi-sheng; Zhang, Wei-hua; Zhang, Nan-nan
2014-01-01
In order to build a combined model which can meet the variation rule of death toll data for road traffic accidents and can reflect the influence of multiple factors on traffic accidents and improve prediction accuracy for accidents, the Verhulst model was built based on the number of death tolls for road traffic accidents in China from 2002 to 2011; and car ownership, population, GDP, highway freight volume, highway passenger transportation volume, and highway mileage were chosen as the factors to build the death toll multivariate linear regression model. Then the two models were combined to be a combined prediction model which has weight coefficient. Shapley value method was applied to calculate the weight coefficient by assessing contributions. Finally, the combined model was used to recalculate the number of death tolls from 2002 to 2011, and the combined model was compared with the Verhulst and multivariate linear regression models. The results showed that the new model could not only characterize the death toll data characteristics but also quantify the degree of influence to the death toll by each influencing factor and had high accuracy as well as strong practicability.
[Gaussian process regression and its application in near-infrared spectroscopy analysis].
Feng, Ai-Ming; Fang, Li-Min; Lin, Min
2011-06-01
Gaussian process (GP) is applied in the present paper as a chemometric method to explore the complicated relationship between the near infrared (NIR) spectra and ingredients. After the outliers were detected by Monte Carlo cross validation (MCCV) method and removed from dataset, different preprocessing methods, such as multiplicative scatter correction (MSC), smoothing and derivate, were tried for the best performance of the models. Furthermore, uninformative variable elimination (UVE) was introduced as a variable selection technique and the characteristic wavelengths obtained were further employed as input for modeling. A public dataset with 80 NIR spectra of corn was introduced as an example for evaluating the new algorithm. The optimal models for oil, starch and protein were obtained by the GP regression method. The performance of the final models were evaluated according to the root mean square error of calibration (RMSEC), root mean square error of cross-validation (RMSECV), root mean square error of prediction (RMSEP) and correlation coefficient (r). The models give good calibration ability with r values above 0.99 and the prediction ability is also satisfactory with r values higher than 0.96. The overall results demonstrate that GP algorithm is an effective chemometric method and is promising for the NIR analysis.
Predicting Homelessness among Emerging Adults Aging Out of Foster Care.
Shah, Melissa Ford; Liu, Qinghua; Mark Eddy, J; Barkan, Susan; Marshall, David; Mancuso, David; Lucenko, Barbara; Huber, Alice
2017-09-01
This study examines risk and protective factors associated with experiencing homelessness in the year after "aging out" of foster care. Using a state-level integrated administrative database, we identified 1,202 emerging adults in Washington State who exited foster care between July 2010 and June 2012. Initial bivariate analyses were conducted to assess the association between candidate predictive factors and an indicator of homelessness in a 12-month follow-up period. After deploying a stepwise regression process, the final logistic regression model included 15 predictive factors. Youth who were parents, who had recently experienced housing instability, or who were African American had approximately twice the odds of experiencing homelessness in the year after exiting foster care. In addition, youth who had experienced disrupted adoptions, had multiple foster care placements (especially in congregate care settings), or had been involved with the juvenile justice system were more likely to become homeless. In contrast, youth were less likely to experience homelessness if they had ever been placed with a relative while in foster care or had a high cumulative grade point average relative to their peers. © Society for Community Research and Action 2016.
Ertefaie, Ashkan; Shortreed, Susan; Chakraborty, Bibhas
2016-06-15
Q-learning is a regression-based approach that uses longitudinal data to construct dynamic treatment regimes, which are sequences of decision rules that use patient information to inform future treatment decisions. An optimal dynamic treatment regime is composed of a sequence of decision rules that indicate how to optimally individualize treatment using the patients' baseline and time-varying characteristics to optimize the final outcome. Constructing optimal dynamic regimes using Q-learning depends heavily on the assumption that regression models at each decision point are correctly specified; yet model checking in the context of Q-learning has been largely overlooked in the current literature. In this article, we show that residual plots obtained from standard Q-learning models may fail to adequately check the quality of the model fit. We present a modified Q-learning procedure that accommodates residual analyses using standard tools. We present simulation studies showing the advantage of the proposed modification over standard Q-learning. We illustrate this new Q-learning approach using data collected from a sequential multiple assignment randomized trial of patients with schizophrenia. Copyright © 2016 John Wiley & Sons, Ltd. Copyright © 2016 John Wiley & Sons, Ltd.
Gallagher, Jennifer E; Patel, Resmi; Donaldson, Nora; Wilson, Nairn HF
2007-01-01
Background Dental graduates are joining a profession experiencing changes in systems of care, funding and skill mix. Research into the motivation and expectations of the emerging workforce is vital to inform professional and policy decisions. The objective of this research was to investigate final year dental students' perceived motivation for their choice of career in relation to sex, ethnicity and mode of entry. Methods Self-administered questionnaire survey of all final year dental students at King's College London. Data were entered into SPSS; statistical analysis included Chi Squared tests for linear association, multiple regression, factor analysis and logistic regression. Results A response of 90% (n = 126) was achieved. The majority were aged 23 years (59%), female (58%) and Asian (70%). One in 10 were mature students. Eighty per cent identified 11 or more 'important' or 'very important' influences, the most common of which were related to features of the job: 'regular working hours' (91%), 'degree leading to recognised job' (90%) and 'job security' (90%). There were significant differences in important influences by sex (males > females: 'able to run own business'; females > males: 'a desire to work with people'), ethnic group (Asians > white: 'wish to provide public service', 'influence of friends', 'desire to work in healthcare', having 'tried an alternative career/course' and 'work experience') and mode of entry (mature > early entry: 'a desire to work with people'). Multivariate analysis suggested 61% of the variation in influences is explained by five factors: the 'professional job' (31%), 'healthcare-people' (11%), 'academic-scientific' (8%), 'careers-advising' (6%), and 'family/friends' (6%). The single major influence on choice of career was a 'desire to work with people'; Indian students were twice as likely to report this as white or other ethnic groups. Conclusion Final year dental students report a wide range of important influences on their choice of dentistry, with variation by sex, ethnicity and mode of entry in relation to individual influences. Features of the 'professional job', followed by 'healthcare and people' were the most important underlying factors influencing choice of career. PMID:17573967
Quantile Regression in the Study of Developmental Sciences
ERIC Educational Resources Information Center
Petscher, Yaacov; Logan, Jessica A. R.
2014-01-01
Linear regression analysis is one of the most common techniques applied in developmental research, but only allows for an estimate of the average relations between the predictor(s) and the outcome. This study describes quantile regression, which provides estimates of the relations between the predictor(s) and outcome, but across multiple points of…
Maintenance Operations in Mission Oriented Protective Posture Level IV (MOPPIV)
1987-10-01
Repair FADAC Printed Circuit Board ............. 6 3. Data Analysis Techniques ............................. 6 a. Multiple Linear Regression... ANALYSIS /DISCUSSION ............................... 12 1. Exa-ple of Regression Analysis ..................... 12 S2. Regression results for all tasks...6 * TABLE 9. Task Grouping for Analysis ........................ 7 "TABXLE 10. Remove/Replace H60A3 Power Pack................. 8 TABLE
Curcic, Marijana; Buha, Aleksandra; Stankovic, Sanja; Milovanovic, Vesna; Bulat, Zorica; Đukić-Ćosić, Danijela; Antonijević, Evica; Vučinić, Slavica; Matović, Vesna; Antonijevic, Biljana
2017-02-01
The objective of this study was to assess toxicity of Cd and BDE-209 mixture on haematological parameters in subacutely exposed rats and to determine the presence and type of interactions between these two chemicals using multiple factorial regression analysis. Furthermore, for the assessment of interaction type, an isobologram based methodology was applied and compared with multiple factorial regression analysis. Chemicals were given by oral gavage to the male Wistar rats weighing 200-240g for 28days. Animals were divided in 16 groups (8/group): control vehiculum group, three groups of rats were treated with 2.5, 7.5 or 15mg Cd/kg/day. These doses were chosen on the bases of literature data and reflect relatively high Cd environmental exposure, three groups of rats were treated with 1000, 2000 or 4000mg BDE-209/kg/bw/day, doses proved to induce toxic effects in rats. Furthermore, nine groups of animals were treated with different mixtures of Cd and BDE-209 containing doses of Cd and BDE-209 stated above. Blood samples were taken at the end of experiment and red blood cells, white blood cells and platelets counts were determined. For interaction assessment multiple factorial regression analysis and fitted isobologram approach were used. In this study, we focused on multiple factorial regression analysis as a method for interaction assessment. We also investigated the interactions between Cd and BDE-209 by the derived model for the description of the obtained fitted isobologram curves. Current study indicated that co-exposure to Cd and BDE-209 can result in significant decrease in RBC count, increase in WBC count and decrease in PLT count, when compared with controls. Multiple factorial regression analysis used for the assessment of interactions type between Cd and BDE-209 indicated synergism for the effect on RBC count and no interactions i.e. additivity for the effects on WBC and PLT counts. On the other hand, isobologram based approach showed slight antagonism for the effects on RBC and WBC while no interactions were proved for the joint effect on PLT count. These results confirm that the assessment of interactions between chemicals in the mixture greatly depends on the concept or method used for this evaluation. Copyright © 2016 Elsevier Ireland Ltd. All rights reserved.
Kanada, Yoshikiyo; Sakurai, Hiroaki; Sugiura, Yoshito; Arai, Tomoaki; Koyama, Soichiro; Tanabe, Shigeo
2017-11-01
[Purpose] To create a regression formula in order to estimate 1RM for knee extensors, based on the maximal isometric muscle strength measured using a hand-held dynamometer and data regarding the body composition. [Subjects and Methods] Measurement was performed in 21 healthy males in their twenties to thirties. Single regression analysis was performed, with measurement values representing 1RM and the maximal isometric muscle strength as dependent and independent variables, respectively. Furthermore, multiple regression analysis was performed, with data regarding the body composition incorporated as another independent variable, in addition to the maximal isometric muscle strength. [Results] Through single regression analysis with the maximal isometric muscle strength as an independent variable, the following regression formula was created: 1RM (kg)=0.714 + 0.783 × maximal isometric muscle strength (kgf). On multiple regression analysis, only the total muscle mass was extracted. [Conclusion] A highly accurate regression formula to estimate 1RM was created based on both the maximal isometric muscle strength and body composition. Using a hand-held dynamometer and body composition analyzer, it was possible to measure these items in a short time, and obtain clinically useful results.
NASA Technical Reports Server (NTRS)
Stolzer, Alan J.; Halford, Carl
2007-01-01
In a previous study, multiple regression techniques were applied to Flight Operations Quality Assurance-derived data to develop parsimonious model(s) for fuel consumption on the Boeing 757 airplane. The present study examined several data mining algorithms, including neural networks, on the fuel consumption problem and compared them to the multiple regression results obtained earlier. Using regression methods, parsimonious models were obtained that explained approximately 85% of the variation in fuel flow. In general data mining methods were more effective in predicting fuel consumption. Classification and Regression Tree methods reported correlation coefficients of .91 to .92, and General Linear Models and Multilayer Perceptron neural networks reported correlation coefficients of about .99. These data mining models show great promise for use in further examining large FOQA databases for operational and safety improvements.
DiFrancesco, Robin; Rosenkranz, Susan L.; Taylor, Charlene R.; Pande, Poonam G.; Siminski, Suzanne M.; Jenny, Richard W.; Morse, Gene D.
2013-01-01
Among National Institutes of Health (NIH) HIV Research Networks conducting multicenter trials, samples from protocols that span several years are analyzed at multiple clinical pharmacology laboratories (CPLs) for multiple antiretrovirals (ARV). Drug assay data are, in turn, entered into study-specific datasets that are used for pharmacokinetic analyses, merged to conduct cross-protocol pharmacokinetic analysis and integrated with pharmacogenomics research to investigate pharmacokinetic-pharmacogenetic associations. The CPLs participate in a semi-annual proficiency testing (PT) program implemented by the Clinical Pharmacology Quality Assurance (CPQA) program. Using results from multiple PT rounds, longitudinal analyses of recovery are reflective of accuracy and precision within/across laboratories. The objectives of this longitudinal analysis of PT across multiple CPLs were to develop and test statistical models that longitudinally: (1)assess the precision and accuracy of concentrations reported by individual CPLs; (2)determine factors associated with round-specific and long-term assay accuracy, precision and bias using a new regression model. A measure of absolute recovery is explored as a simultaneous measure of accuracy and precision. Overall, the analysis outcomes assured 97% accuracy (±20% of the final target concentration of all (21)drug concentration results reported for clinical trial samples by multiple CPLs).Using the CLIA acceptance of meeting criteria for ≥2/3 consecutive rounds, all ten laboratories that participated in three or more rounds per analyte maintained CLIA proficiency. Significant associations were present between magnitude of error and CPL (Kruskal Wallis [KW]p<0.001), and ARV (KW p<0.001). PMID:24052065
DiFrancesco, Robin; Rosenkranz, Susan L; Taylor, Charlene R; Pande, Poonam G; Siminski, Suzanne M; Jenny, Richard W; Morse, Gene D
2013-10-01
Among National Institutes of Health HIV Research Networks conducting multicenter trials, samples from protocols that span several years are analyzed at multiple clinical pharmacology laboratories (CPLs) for multiple antiretrovirals. Drug assay data are, in turn, entered into study-specific data sets that are used for pharmacokinetic analyses, merged to conduct cross-protocol pharmacokinetic analysis, and integrated with pharmacogenomics research to investigate pharmacokinetic-pharmacogenetic associations. The CPLs participate in a semiannual proficiency testing (PT) program implemented by the Clinical Pharmacology Quality Assurance program. Using results from multiple PT rounds, longitudinal analyses of recovery are reflective of accuracy and precision within/across laboratories. The objectives of this longitudinal analysis of PT across multiple CPLs were to develop and test statistical models that longitudinally: (1) assess the precision and accuracy of concentrations reported by individual CPLs and (2) determine factors associated with round-specific and long-term assay accuracy, precision, and bias using a new regression model. A measure of absolute recovery is explored as a simultaneous measure of accuracy and precision. Overall, the analysis outcomes assured 97% accuracy (±20% of the final target concentration of all (21) drug concentration results reported for clinical trial samples by multiple CPLs). Using the Clinical Laboratory Improvement Act acceptance of meeting criteria for ≥2/3 consecutive rounds, all 10 laboratories that participated in 3 or more rounds per analyte maintained Clinical Laboratory Improvement Act proficiency. Significant associations were present between magnitude of error and CPL (Kruskal-Wallis P < 0.001) and antiretroviral (Kruskal-Wallis P < 0.001).
Robinson, J J; Wharrad, H
2001-05-01
The relationship between attendance at birth and maternal mortality rates: an exploration of United Nations' data sets including the ratios of physicians and nurses to population, GNP per capita and female literacy. This is the third and final paper drawing on data taken from United Nations (UN) data sets. The first paper examined the global distribution of health professionals (as measured by ratios of physicians and nurses to population), and its relationship to gross national product per capita (GNP) (Wharrad & Robinson 1999). The second paper explored the relationships between the global distribution of physicians and nurses, GNP, female literacy and the health outcome indicators of infant and under five mortality rates (IMR and u5MR) (Robinson & Wharrad 2000). In the present paper, the global distribution of health professionals is explored in relation to maternal mortality rates (MMRs). The proportion of births attended by medical and nonmedical staff defined as "attendance at birth by trained personnel" (physicians, nurses, midwives or primary health care workers trained in midwifery skills), is included as an additional independent variable in the regression analyses, together with the ratio of physicians and nurses to population, female literacy and GNP. To extend our earlier analyses by considering the relationships between the global distribution of health professionals (ratios of physicians and nurses to population, and the proportion of births attended by trained health personnel), GNP, female literacy and MMR.
Rong, S S; Feng, M Y; Wang, N; Meng, H; Thomas, R; Fan, S; Wang, R; Wang, X; Tang, X; Liang, Y B
2013-03-01
To evaluate the association between early and late postoperative intraocular pressure (IOP) and determine if early postoperative IOP can predict the surgical outcome. A total of 165 consecutive patients with primary angle-closure glaucoma (PACG) undergoing primary mitomycin-C-augmented trabeculectomy underwent a comprehensive eye examination before surgery and were followed-up on days 1, 7, 14, and 30, and months 3, 6, 12, and 18. IOPs on days 1, 7, 14, and 30 were stratified into groups A (<10 mm Hg), B (≥10 and <15 mm Hg), C (≥15 and <20 mm Hg), and D (≥20 mm Hg). Differences between groups were analyzed using analysis of variance (ANOVA) and Fisher's exact test. Multivariable regression was used to exam the predictive ability of early IOP for final outcome. The mean age was 62.5±7.9 years and 41.21% (n=68) were males. Stratified by IOP on days 1, 7, 14, and 30, respectively, mean IOPs at month 18 were different among groups A, B, C, and D (ANOVA, P=0.047, P=0.033, P=0.008, and P<0.001, respectively). Once the IOPs were settled with interventions on day 7 a higher IOP level was associated with decreasing success rate under different outcome definitions, final IOP <15 mm Hg (Fisher's exact P=0.001) and <20 mm Hg (P=0.039) without medication. Multiple regression showed early IOP predicted final IOP independently from baseline variables. A cutoff value of 13.5 mm Hg on day 7 achieved an accuracy of 80.0 and 57.1% in predicting IOP<15 mm Hg without medication and failure after surgery, respectively. The IOP at 18 months following primary antifibrotic-augmented trabeculectomy in PACG patients is associated with and predicted by the postoperative IOPs at 1 month. Control of early IOP to 13.5 or less may provide better outcomes.
Cross Validation of Selection of Variables in Multiple Regression.
1979-12-01
55 vii CROSS VALIDATION OF SELECTION OF VARIABLES IN MULTIPLE REGRESSION I Introduction Background Long term DoD planning gcals...028545024 .31109000 BF * SS - .008700618 .0471961 Constant - .70977903 85.146786 55 had adequate predictive capabilities; the other two models (the...71ZCO F111D Control 54 73EGO FlIID Computer, General Purpose 55 73EPO FII1D Converter-Multiplexer 56 73HAO flllD Stabilizer Platform 57 73HCO F1ID
Byun, Bo-Ram; Kim, Yong-Il; Maki, Koutaro; Son, Woo-Sung
2015-01-01
This study was aimed to examine the correlation between skeletal maturation status and parameters from the odontoid process/body of the second vertebra and the bodies of third and fourth cervical vertebrae and simultaneously build multiple regression models to be able to estimate skeletal maturation status in Korean girls. Hand-wrist radiographs and cone beam computed tomography (CBCT) images were obtained from 74 Korean girls (6–18 years of age). CBCT-generated cervical vertebral maturation (CVM) was used to demarcate the odontoid process and the body of the second cervical vertebra, based on the dentocentral synchondrosis. Correlation coefficient analysis and multiple linear regression analysis were used for each parameter of the cervical vertebrae (P < 0.05). Forty-seven of 64 parameters from CBCT-generated CVM (independent variables) exhibited statistically significant correlations (P < 0.05). The multiple regression model with the greatest R 2 had six parameters (PH2/W2, UW2/W2, (OH+AH2)/LW2, UW3/LW3, D3, and H4/W4) as independent variables with a variance inflation factor (VIF) of <2. CBCT-generated CVM was able to include parameters from the second cervical vertebral body and odontoid process, respectively, for the multiple regression models. This suggests that quantitative analysis might be used to estimate skeletal maturation status. PMID:25878721
NeCamp, Timothy; Kilbourne, Amy; Almirall, Daniel
2017-08-01
Cluster-level dynamic treatment regimens can be used to guide sequential treatment decision-making at the cluster level in order to improve outcomes at the individual or patient-level. In a cluster-level dynamic treatment regimen, the treatment is potentially adapted and re-adapted over time based on changes in the cluster that could be impacted by prior intervention, including aggregate measures of the individuals or patients that compose it. Cluster-randomized sequential multiple assignment randomized trials can be used to answer multiple open questions preventing scientists from developing high-quality cluster-level dynamic treatment regimens. In a cluster-randomized sequential multiple assignment randomized trial, sequential randomizations occur at the cluster level and outcomes are observed at the individual level. This manuscript makes two contributions to the design and analysis of cluster-randomized sequential multiple assignment randomized trials. First, a weighted least squares regression approach is proposed for comparing the mean of a patient-level outcome between the cluster-level dynamic treatment regimens embedded in a sequential multiple assignment randomized trial. The regression approach facilitates the use of baseline covariates which is often critical in the analysis of cluster-level trials. Second, sample size calculators are derived for two common cluster-randomized sequential multiple assignment randomized trial designs for use when the primary aim is a between-dynamic treatment regimen comparison of the mean of a continuous patient-level outcome. The methods are motivated by the Adaptive Implementation of Effective Programs Trial which is, to our knowledge, the first-ever cluster-randomized sequential multiple assignment randomized trial in psychiatry.
Adjusted variable plots for Cox's proportional hazards regression model.
Hall, C B; Zeger, S L; Bandeen-Roche, K J
1996-01-01
Adjusted variable plots are useful in linear regression for outlier detection and for qualitative evaluation of the fit of a model. In this paper, we extend adjusted variable plots to Cox's proportional hazards model for possibly censored survival data. We propose three different plots: a risk level adjusted variable (RLAV) plot in which each observation in each risk set appears, a subject level adjusted variable (SLAV) plot in which each subject is represented by one point, and an event level adjusted variable (ELAV) plot in which the entire risk set at each failure event is represented by a single point. The latter two plots are derived from the RLAV by combining multiple points. In each point, the regression coefficient and standard error from a Cox proportional hazards regression is obtained by a simple linear regression through the origin fit to the coordinates of the pictured points. The plots are illustrated with a reanalysis of a dataset of 65 patients with multiple myeloma.
Esserman, Denise A.; Moore, Charity G.; Roth, Mary T.
2009-01-01
Older community dwelling adults often take multiple medications for numerous chronic diseases. Non-adherence to these medications can have a large public health impact. Therefore, the measurement and modeling of medication adherence in the setting of polypharmacy is an important area of research. We apply a variety of different modeling techniques (standard linear regression; weighted linear regression; adjusted linear regression; naïve logistic regression; beta-binomial (BB) regression; generalized estimating equations (GEE)) to binary medication adherence data from a study in a North Carolina based population of older adults, where each medication an individual was taking was classified as adherent or non-adherent. In addition, through simulation we compare these different methods based on Type I error rates, bias, power, empirical 95% coverage, and goodness of fit. We find that estimation and inference using GEE is robust to a wide variety of scenarios and we recommend using this in the setting of polypharmacy when adherence is dichotomously measured for multiple medications per person. PMID:20414358
NASA Astrophysics Data System (ADS)
Sahabiev, I. A.; Ryazanov, S. S.; Kolcova, T. G.; Grigoryan, B. R.
2018-03-01
The three most common techniques to interpolate soil properties at a field scale—ordinary kriging (OK), regression kriging with multiple linear regression drift model (RK + MLR), and regression kriging with principal component regression drift model (RK + PCR)—were examined. The results of the performed study were compiled into an algorithm of choosing the most appropriate soil mapping technique. Relief attributes were used as the auxiliary variables. When spatial dependence of a target variable was strong, the OK method showed more accurate interpolation results, and the inclusion of the auxiliary data resulted in an insignificant improvement in prediction accuracy. According to the algorithm, the RK + PCR method effectively eliminates multicollinearity of explanatory variables. However, if the number of predictors is less than ten, the probability of multicollinearity is reduced, and application of the PCR becomes irrational. In that case, the multiple linear regression should be used instead.
Genetic Programming Transforms in Linear Regression Situations
NASA Astrophysics Data System (ADS)
Castillo, Flor; Kordon, Arthur; Villa, Carlos
The chapter summarizes the use of Genetic Programming (GP) inMultiple Linear Regression (MLR) to address multicollinearity and Lack of Fit (LOF). The basis of the proposed method is applying appropriate input transforms (model respecification) that deal with these issues while preserving the information content of the original variables. The transforms are selected from symbolic regression models with optimal trade-off between accuracy of prediction and expressional complexity, generated by multiobjective Pareto-front GP. The chapter includes a comparative study of the GP-generated transforms with Ridge Regression, a variant of ordinary Multiple Linear Regression, which has been a useful and commonly employed approach for reducing multicollinearity. The advantages of GP-generated model respecification are clearly defined and demonstrated. Some recommendations for transforms selection are given as well. The application benefits of the proposed approach are illustrated with a real industrial application in one of the broadest empirical modeling areas in manufacturing - robust inferential sensors. The chapter contributes to increasing the awareness of the potential of GP in statistical model building by MLR.
A Solution to Separation and Multicollinearity in Multiple Logistic Regression
Shen, Jianzhao; Gao, Sujuan
2010-01-01
In dementia screening tests, item selection for shortening an existing screening test can be achieved using multiple logistic regression. However, maximum likelihood estimates for such logistic regression models often experience serious bias or even non-existence because of separation and multicollinearity problems resulting from a large number of highly correlated items. Firth (1993, Biometrika, 80(1), 27–38) proposed a penalized likelihood estimator for generalized linear models and it was shown to reduce bias and the non-existence problems. The ridge regression has been used in logistic regression to stabilize the estimates in cases of multicollinearity. However, neither solves the problems for each other. In this paper, we propose a double penalized maximum likelihood estimator combining Firth’s penalized likelihood equation with a ridge parameter. We present a simulation study evaluating the empirical performance of the double penalized likelihood estimator in small to moderate sample sizes. We demonstrate the proposed approach using a current screening data from a community-based dementia study. PMID:20376286
A Solution to Separation and Multicollinearity in Multiple Logistic Regression.
Shen, Jianzhao; Gao, Sujuan
2008-10-01
In dementia screening tests, item selection for shortening an existing screening test can be achieved using multiple logistic regression. However, maximum likelihood estimates for such logistic regression models often experience serious bias or even non-existence because of separation and multicollinearity problems resulting from a large number of highly correlated items. Firth (1993, Biometrika, 80(1), 27-38) proposed a penalized likelihood estimator for generalized linear models and it was shown to reduce bias and the non-existence problems. The ridge regression has been used in logistic regression to stabilize the estimates in cases of multicollinearity. However, neither solves the problems for each other. In this paper, we propose a double penalized maximum likelihood estimator combining Firth's penalized likelihood equation with a ridge parameter. We present a simulation study evaluating the empirical performance of the double penalized likelihood estimator in small to moderate sample sizes. We demonstrate the proposed approach using a current screening data from a community-based dementia study.
Chen, Chunhui; Chen, Chuansheng; Moyzis, Robert; Stern, Hal; He, Qinghua; Li, He; Li, Jin; Zhu, Bi; Dong, Qi
2011-01-01
Traditional behavioral genetic studies (e.g., twin, adoption studies) have shown that human personality has moderate to high heritability, but recent molecular behavioral genetic studies have failed to identify quantitative trait loci (QTL) with consistent effects. The current study adopted a multi-step approach (ANOVA followed by multiple regression and permutation) to assess the cumulative effects of multiple QTLs. Using a system-level (dopamine system) genetic approach, we investigated a personality trait deeply rooted in the nervous system (the Highly Sensitive Personality, HSP). 480 healthy Chinese college students were given the HSP scale and genotyped for 98 representative polymorphisms in all major dopamine neurotransmitter genes. In addition, two environment factors (stressful life events and parental warmth) that have been implicated for their contributions to personality development were included to investigate their relative contributions as compared to genetic factors. In Step 1, using ANOVA, we identified 10 polymorphisms that made statistically significant contributions to HSP. In Step 2, these polymorphism's main effects and interactions were assessed using multiple regression. This model accounted for 15% of the variance of HSP (p<0.001). Recent stressful life events accounted for an additional 2% of the variance. Finally, permutation analyses ascertained the probability of obtaining these findings by chance to be very low, p ranging from 0.001 to 0.006. Dividing these loci by the subsystems of dopamine synthesis, degradation/transport, receptor and modulation, we found that the modulation and receptor subsystems made the most significant contribution to HSP. The results of this study demonstrate the utility of a multi-step neuronal system-level approach in assessing genetic contributions to individual differences in human behavior. It can potentially bridge the gap between the high heritability estimates based on traditional behavioral genetics and the lack of reproducible genetic effects observed currently from molecular genetic studies.
Barzegar, Rahim; Moghaddam, Asghar Asghari; Deo, Ravinesh; Fijani, Elham; Tziritis, Evangelos
2018-04-15
Constructing accurate and reliable groundwater risk maps provide scientifically prudent and strategic measures for the protection and management of groundwater. The objectives of this paper are to design and validate machine learning based-risk maps using ensemble-based modelling with an integrative approach. We employ the extreme learning machines (ELM), multivariate regression splines (MARS), M5 Tree and support vector regression (SVR) applied in multiple aquifer systems (e.g. unconfined, semi-confined and confined) in the Marand plain, North West Iran, to encapsulate the merits of individual learning algorithms in a final committee-based ANN model. The DRASTIC Vulnerability Index (VI) ranged from 56.7 to 128.1, categorized with no risk, low and moderate vulnerability thresholds. The correlation coefficient (r) and Willmott's Index (d) between NO 3 concentrations and VI were 0.64 and 0.314, respectively. To introduce improvements in the original DRASTIC method, the vulnerability indices were adjusted by NO 3 concentrations, termed as the groundwater contamination risk (GCR). Seven DRASTIC parameters utilized as the model inputs and GCR values utilized as the outputs of individual machine learning models were served in the fully optimized committee-based ANN-predictive model. The correlation indicators demonstrated that the ELM and SVR models outperformed the MARS and M5 Tree models, by virtue of a larger d and r value. Subsequently, the r and d metrics for the ANN-committee based multi-model in the testing phase were 0.8889 and 0.7913, respectively; revealing the superiority of the integrated (or ensemble) machine learning models when compared with the original DRASTIC approach. The newly designed multi-model ensemble-based approach can be considered as a pragmatic step for mapping groundwater contamination risks of multiple aquifer systems with multi-model techniques, yielding the high accuracy of the ANN committee-based model. Copyright © 2017 Elsevier B.V. All rights reserved.
Chen, Chunhui; Chen, Chuansheng; Moyzis, Robert; Stern, Hal; He, Qinghua; Li, He; Li, Jin; Zhu, Bi; Dong, Qi
2011-01-01
Traditional behavioral genetic studies (e.g., twin, adoption studies) have shown that human personality has moderate to high heritability, but recent molecular behavioral genetic studies have failed to identify quantitative trait loci (QTL) with consistent effects. The current study adopted a multi-step approach (ANOVA followed by multiple regression and permutation) to assess the cumulative effects of multiple QTLs. Using a system-level (dopamine system) genetic approach, we investigated a personality trait deeply rooted in the nervous system (the Highly Sensitive Personality, HSP). 480 healthy Chinese college students were given the HSP scale and genotyped for 98 representative polymorphisms in all major dopamine neurotransmitter genes. In addition, two environment factors (stressful life events and parental warmth) that have been implicated for their contributions to personality development were included to investigate their relative contributions as compared to genetic factors. In Step 1, using ANOVA, we identified 10 polymorphisms that made statistically significant contributions to HSP. In Step 2, these polymorphism's main effects and interactions were assessed using multiple regression. This model accounted for 15% of the variance of HSP (p<0.001). Recent stressful life events accounted for an additional 2% of the variance. Finally, permutation analyses ascertained the probability of obtaining these findings by chance to be very low, p ranging from 0.001 to 0.006. Dividing these loci by the subsystems of dopamine synthesis, degradation/transport, receptor and modulation, we found that the modulation and receptor subsystems made the most significant contribution to HSP. The results of this study demonstrate the utility of a multi-step neuronal system-level approach in assessing genetic contributions to individual differences in human behavior. It can potentially bridge the gap between the high heritability estimates based on traditional behavioral genetics and the lack of reproducible genetic effects observed currently from molecular genetic studies. PMID:21765900
Race, law, and health: Examination of 'Stand Your Ground' and defendant convictions in Florida.
Ackermann, Nicole; Goodman, Melody S; Gilbert, Keon; Arroyo-Johnson, Cassandra; Pagano, Marcello
2015-10-01
Previous analyses of Stand Your Ground (SYG) cases have been primarily descriptive. We examine the relationship between race of the victim and conviction of the defendant in SYG cases in Florida from 2005 to 2013. Using a regression analytic approach, we allow for simultaneous examination of multiple factors to better understand existing interrelationships. Data was obtained from the Tampa Bay Times SYG database (237 cases) which was supplemented with available online court documents and/or news reports. After excluding cases which were, still pending as of January 2015; had multiple outcomes (because of multiple suspects); and missing information on race of victim and weapon of victim, our final analytic sample has 204 cases. We chose whether the case resulted in a conviction as the outcome. We develop logistic regression models using significant bivariate predictors as candidates. These include race of the victim (White, non-White), whether the defendant could have retreated from the situation, whether the defendant pursued the victim, if the victim was unarmed, and who was the initiator of the confrontation. We find race of the victim to be a significant predictor of case outcome in this data set. After controlling for other variables, the defendant is two times (OR = 2.1, 95% CI [1.07, 4.10]) more likely to be convicted in a case that involves White victims compared to those involving non-White victims. Our results depict a disturbing message: SYG legislation in Florida has a quantifiable racial bias that reveals a leniency in convictions if the victim is non-White, which provides evidence towards unequal treatment under the law. Rather than attempting to hide the outcomes of these laws, as was done in Florida, other states with SYG laws should carry out similar analyses to see if their manifestations are the same as those in Florida, and all should remediate any injustices found. Copyright © 2015 Elsevier Ltd. All rights reserved.
Sullivan, Timothy; Aberg, Judith
2017-01-01
Abstract Background The timely identification of carbapenem resistance is essential in the management of patients with Klebsiella pneumoniae bloodstream infection (BSI). An algorithm using electronic medical record (EMR) data to quickly predict resistance could potentially help guide therapy until more definitive resistance testing results are available. Methods All cases of K. pneumoniae BSI at Mount Sinai Hospital from September 2012 through September 2016 were identified. Cases of persistent BSI or recurrent BSI within 2 weeks were included only once. Patients with recurrent BSI after more than 2 weeks of negative blood cultures were considered distinct cases and included more than once. Carbapenem resistance was defined as an imipenem minimum inhibitory concentration of ≥2 μg/ml. Extensive EMR data for each patient were compiled into a relational database using SQLite. Possible risk factors for carbapenem resistance were queried from the database and analyzed via univariate methods. Significant factors were then entered into a multiple logistic regression model in a forward stepwise approach using SPSS. Results A total of 613 cases of K. pneumoniae BSI were identified in 540 unique patients. The overall incidence of imipenem resistance was 10% (61 cases). Significant markers of resistance included in the final model were (1) prior colonization with imipenem-resistant Klebsiella pneumoniae; (2) hospital unit (defined as high-risk unit, low-risk unit, and emergency department); (3) total inpatient days in the previous 5 years; (4) total days of oral or parenteral antibiotics in the past 2 years; and (5) age >60 years old (Figure 1). The model generated a receiver operating characteristic curve with an area under the curve of 0.75 (Figure 2). At a cut point of 0.083, the model correctly predicted 72% of imipenem-resistant cases while incorrectly labeling 32% of susceptible cases as resistant (Sn = 72%, Sp = 63%, Figure 3). Conclusion A multiple logistic regression model using EMR data can generate immediate, clinically useful predictions of carbapenem resistance in patients with K. pneumoniae BSI. Larger data sets are needed to improve and validate these findings. Figure 1. Algorithm variables Figure 2. Receiver operating characteristic curve Figure 3. Classification table Disclosures All authors: No reported disclosures.
NASA Technical Reports Server (NTRS)
Whitlock, C. H.; Kuo, C. Y.
1979-01-01
The objective of this paper is to define optical physics and/or environmental conditions under which the linear multiple-regression should be applicable. An investigation of the signal-response equations is conducted and the concept is tested by application to actual remote sensing data from a laboratory experiment performed under controlled conditions. Investigation of the signal-response equations shows that the exact solution for a number of optical physics conditions is of the same form as a linearized multiple-regression equation, even if nonlinear contributions from surface reflections, atmospheric constituents, or other water pollutants are included. Limitations on achieving this type of solution are defined.
Wijekoon, Chandrani Nirmala; Amaratunge, Heshan; de Silva, Yashica; Senanayake, Solith; Jayawardane, Pradeepa; Senarath, Upul
2017-09-25
Emotional intelligence (EI) has been linked with academic and professional success. Such data are scarce in Sri Lanka. This study was conducted to describe the pattern of EI, to determine its predictors and to determine the effect of EI on academic performance at the final MBBS examination, in medical undergraduates of a Sri Lankan university. This is a cross-sectional study in a selected university, involving those who did final MBBS examination in 2016. Consecutive sampling was done. EI was assessed with self-administered Genos Emotional Intelligence Full Version (7 domains; 70 questions equally weighted; total score 350). Socio-demographic data were obtained using a self-administered questionnaire. Academic performance was assessed with final MBBS results in the first attempt. Of 148 eligible students 130 responded (response rate-88%); 61.5% were females; mean age was 26.3 ± 1 years. Mean total EI score was 241.5 (females-245.5, males-235.1; p = 0.045).Among different domains, mean score was highest for Emotional Self-Awareness (36.8/50) and lowest for Emotional Expression (32.6/50). Multiple linear regression analysis indicated that having good family support (p = 0.002), socializing well in university (p = 0.024) and being satisfied with facilities available for learning (p = 0.002), were independent predictors of EI. At the final MBBS examination 51.6% obtained classes, 31.5% passed the examination without classes and 16.9% got repeated. Females had better academic performance than males (p = 0.009). Mean EI of second-class upper division, second-class lower division, pass and repeat groups were 249.4, 246.6, 240.2 and 226.9, respectively (with one-way ANOVA p = 0.015). After adjusting for gender, ordinal regression analysis indicated that, total EI score was an independent predictor of final MBBS results [β-0.018 (95% CI 0.005-0.031); p = 0.006]. In the study population, both EI and academic performance were higher among females. Independent of gender, academic performance was better in those who were more emotionally intelligent. Several psychosocial factors were found to be independent predictors of EI. These results suggest that emotional skills development might enhance academic performance of medical undergraduates in Sri Lanka. Further research is needed in this under-explored area.
Xu, Wei; Chen, Da-Wei; Jin, Yan-Bin; Dong, Zhen-Jun; Zhang, Wei-Jiang; Chen, Jin-Wen; Yang, Shu-Mei; Wang, Jian-Rong
2015-02-01
[Purpose] The aim of this study was to determine fall incidence and explore clinical factors of falls among older Chinese veterans in military communities. [Subjects and Methods] We carried out a 12-month prospective study among 13 military communities in Beijing, China. Fall events were obtained by self-report to military community liaisons and monthly telephone interviews by researchers. [Results] Among the final sample of 447 older veterans, 86 fell once, 25 fell twice or more, and 152 falls occurred altogether. The incidence of falls and fallers were 342/1,000 person-years and 249/1,000 person-years. In Cox regression models, independent clinical factors associated with falls were visual acuity (RR=0.47), stroke (RR=2.43), lumbar diseases (RR=1.73), sedatives (RR=1.80), fall history in the past 6 months (RR=2.77), multiple chronic diseases (RR=1.53), multiple medications (RR=1.34), and five-repetition sit-to-stand test score (RR=1.41). Hearing acuity was close to being statistically significant. [Conclusion] The incidences of falls and fallers among older Chinese veterans were lower than those of Hong Kong and western countries. The clinical risk factors of falls were poor senses, stroke, lumbar diseases, taking sedatives, fall history in the past 6 months, having multiple chronic diseases, taking multiple medications, and poor physical function. The preventive strategies targeting the above risk factors are very significant for reducing falls.
Xu, Wei; Chen, Da-Wei; Jin, Yan-Bin; Dong, Zhen-Jun; Zhang, Wei-Jiang; Chen, Jin-Wen; Yang, Shu-Mei; Wang, Jian-Rong
2015-01-01
[Purpose] The aim of this study was to determine fall incidence and explore clinical factors of falls among older Chinese veterans in military communities. [Subjects and Methods] We carried out a 12-month prospective study among 13 military communities in Beijing, China. Fall events were obtained by self-report to military community liaisons and monthly telephone interviews by researchers. [Results] Among the final sample of 447 older veterans, 86 fell once, 25 fell twice or more, and 152 falls occurred altogether. The incidence of falls and fallers were 342/1,000 person-years and 249/1,000 person-years. In Cox regression models, independent clinical factors associated with falls were visual acuity (RR=0.47), stroke (RR=2.43), lumbar diseases (RR=1.73), sedatives (RR=1.80), fall history in the past 6 months (RR=2.77), multiple chronic diseases (RR=1.53), multiple medications (RR=1.34), and five-repetition sit-to-stand test score (RR=1.41). Hearing acuity was close to being statistically significant. [Conclusion] The incidences of falls and fallers among older Chinese veterans were lower than those of Hong Kong and western countries. The clinical risk factors of falls were poor senses, stroke, lumbar diseases, taking sedatives, fall history in the past 6 months, having multiple chronic diseases, taking multiple medications, and poor physical function. The preventive strategies targeting the above risk factors are very significant for reducing falls. PMID:25729162
Morphological characteristics associated with rupture risk of multiple intracranial aneurysms.
Wang, Guang-Xian; Liu, Lan-Lan; Wen, Li; Cao, Yun-Xing; Pei, Yu-Chun; Zhang, Dong
2017-10-01
To identify the morphological parameters that are related to intracranial aneurysms (IAs) rupture using a case-control model. A total of 107 patients with multiple IAs and aneurysmal subarachnoid hemorrhage between August 2011 and February 2017 were enrolled in this study. Characteristics of IAs location, shape, neck width, perpendicular height, depth, maximum size, flow angle, parent vessel diameter (PVD), aspect ratio (AR) and size ratio (SR) were evaluated using CT angiography. Multiple logistic regression analysis was used to identify the independent risk factors associated with IAs rupture. Receiver operating characteristic curve analysis was performed on the final model, and the optimal thresholds were obtained. IAs located in the internal carotid artery (ICA) was associated with a negative risk of rupture, whereas AR, SR1 (height/PVD) and SR2 (depth/PVD) were associated with increased risk of rupture. When SR was calculated differently, the odds ratio values of these factors were also different. The receiver operating characteristic curve showed that AR, SR1 and SR2 had cut-off values of 1.01, 1.48 and 1.40, respectively. SR3 (maximum size/PVD) was not associated with IAs rupture. IAs located in the ICA are associated with a negative risk of rupture, while high AR (>1.01), SR1 (>1.48) or SR2 (>1.40) are risk factors for multiple IAs rupture. Copyright © 2017 Hainan Medical University. Production and hosting by Elsevier B.V. All rights reserved.
RRegrs: an R package for computer-aided model selection with multiple regression models.
Tsiliki, Georgia; Munteanu, Cristian R; Seoane, Jose A; Fernandez-Lozano, Carlos; Sarimveis, Haralambos; Willighagen, Egon L
2015-01-01
Predictive regression models can be created with many different modelling approaches. Choices need to be made for data set splitting, cross-validation methods, specific regression parameters and best model criteria, as they all affect the accuracy and efficiency of the produced predictive models, and therefore, raising model reproducibility and comparison issues. Cheminformatics and bioinformatics are extensively using predictive modelling and exhibit a need for standardization of these methodologies in order to assist model selection and speed up the process of predictive model development. A tool accessible to all users, irrespectively of their statistical knowledge, would be valuable if it tests several simple and complex regression models and validation schemes, produce unified reports, and offer the option to be integrated into more extensive studies. Additionally, such methodology should be implemented as a free programming package, in order to be continuously adapted and redistributed by others. We propose an integrated framework for creating multiple regression models, called RRegrs. The tool offers the option of ten simple and complex regression methods combined with repeated 10-fold and leave-one-out cross-validation. Methods include Multiple Linear regression, Generalized Linear Model with Stepwise Feature Selection, Partial Least Squares regression, Lasso regression, and Support Vector Machines Recursive Feature Elimination. The new framework is an automated fully validated procedure which produces standardized reports to quickly oversee the impact of choices in modelling algorithms and assess the model and cross-validation results. The methodology was implemented as an open source R package, available at https://www.github.com/enanomapper/RRegrs, by reusing and extending on the caret package. The universality of the new methodology is demonstrated using five standard data sets from different scientific fields. Its efficiency in cheminformatics and QSAR modelling is shown with three use cases: proteomics data for surface-modified gold nanoparticles, nano-metal oxides descriptor data, and molecular descriptors for acute aquatic toxicity data. The results show that for all data sets RRegrs reports models with equal or better performance for both training and test sets than those reported in the original publications. Its good performance as well as its adaptability in terms of parameter optimization could make RRegrs a popular framework to assist the initial exploration of predictive models, and with that, the design of more comprehensive in silico screening applications.Graphical abstractRRegrs is a computer-aided model selection framework for R multiple regression models; this is a fully validated procedure with application to QSAR modelling.
Fear of falling in older adults living at home: associated factors.
Vitorino, Luciano Magalhães; Teixeira, Carla Araujo Bastos; Boas, Eliandra Laís Vilas; Pereira, Rúbia Lopes; Santos, Naiana Oliveira Dos; Rozendo, Célia Alves
2017-04-10
To identify the factors associated with the fear of falling in the older adultliving at home. Cross-sectional study with probabilistic sampling of older adultenrolled in two Family Health Strategies (FHS). The fear of falling was measured by the Brazilian version of the Falls Efficacy Scale-International and by a household questionnairethat contained the explanatory variables. Multiple Linear Regression using the stepwise selection technique and the Generalized Linear Models were used in the statistical analyses. A total of170 older adultsparticipated in the research, 85 from each FHS. The majority (57.1%) aged between 60 and 69; 67.6% were female; 46.1% fell once in the last year. The majority of the older adults(66.5%) had highfear of falling. In the final multiple linear regression model, it was identified that a higher number of previous falls, female gender, older age, and worse health self-assessment explained 37% of the fear of falling among the older adult. The findings reinforce the need to assess the fear of falling among the older adultliving at home, in conjunction with the development and use ofstrategies based on modifiable factors by professionalsto reduce falls and improve health status, which may contribute to the reduction of the fear of falling among the older adult. Identificar os fatores associados ao medo de cair em idosos residentes no domicílio. Estudo transversal com amostragem probabilística de idosos cadastrados em duas Estratégias Saúde da Família (ESF). O medo de cair foi avaliado pela versão brasileira da escala Falls Efficacy Scale International e por um inquérito domiciliar que continha as variáveis explicativas.A Regressão Linear Múltipla por meio da técnica stepwise selectione osModelos Lineares Generalizados foram utilizados nas análises estatísticas. Participaram da pesquisa170 idosos, 85 de cada ESF. A maioria (57,1%) tinha entre 60 e 69 anos de idade; 67,6% eram do sexo feminino; 46,1% tiveram queda no último ano. A maioria dos idosos (66,5%) tinha elevado medo de cair. No modelo final de regressão multivariada, identificou-se que maior número de quedas anteriores, sexo feminino, idade mais avançada, e pior autoavaliação de saúde explicaram 37% do medo de cair entre os idosos. Os achados reforçam a necessidade da avaliação do medo de cair entre os idosos que residem no próprio domicílio, assim como o desenvolvimento e a utilização de estratégias pelos profissionais voltadas para os fatores modificáveis,de modo a reduzir as quedas e melhorar o estado de saúde, o que pode contribuir para a diminuição do medo de cair entre os idosos.
Louys, Julien; Meloro, Carlo; Elton, Sarah; Ditchfield, Peter; Bishop, Laura C
2015-01-01
We test the performance of two models that use mammalian communities to reconstruct multivariate palaeoenvironments. While both models exploit the correlation between mammal communities (defined in terms of functional groups) and arboreal heterogeneity, the first uses a multiple multivariate regression of community structure and arboreal heterogeneity, while the second uses a linear regression of the principal components of each ecospace. The success of these methods means the palaeoenvironment of a particular locality can be reconstructed in terms of the proportions of heavy, moderate, light, and absent tree canopy cover. The linear regression is less biased, and more precisely and accurately reconstructs heavy tree canopy cover than the multiple multivariate model. However, the multiple multivariate model performs better than the linear regression for all other canopy cover categories. Both models consistently perform better than randomly generated reconstructions. We apply both models to the palaeocommunity of the Upper Laetolil Beds, Tanzania. Our reconstructions indicate that there was very little heavy tree cover at this site (likely less than 10%), with the palaeo-landscape instead comprising a mixture of light and absent tree cover. These reconstructions help resolve the previous conflicting palaeoecological reconstructions made for this site. Copyright © 2014 Elsevier Ltd. All rights reserved.
Cruz, Antonio M; Barr, Cameron; Puñales-Pozo, Elsa
2008-01-01
This research's main goals were to build a predictor for a turnaround time (TAT) indicator for estimating its values and use a numerical clustering technique for finding possible causes of undesirable TAT values. The following stages were used: domain understanding, data characterisation and sample reduction and insight characterisation. Building the TAT indicator multiple linear regression predictor and clustering techniques were used for improving corrective maintenance task efficiency in a clinical engineering department (CED). The indicator being studied was turnaround time (TAT). Multiple linear regression was used for building a predictive TAT value model. The variables contributing to such model were clinical engineering department response time (CE(rt), 0.415 positive coefficient), stock service response time (Stock(rt), 0.734 positive coefficient), priority level (0.21 positive coefficient) and service time (0.06 positive coefficient). The regression process showed heavy reliance on Stock(rt), CE(rt) and priority, in that order. Clustering techniques revealed the main causes of high TAT values. This examination has provided a means for analysing current technical service quality and effectiveness. In doing so, it has demonstrated a process for identifying areas and methods of improvement and a model against which to analyse these methods' effectiveness.
Pratt, Bethany; Chang, Heejun
2012-03-30
The relationship among land cover, topography, built structure and stream water quality in the Portland Metro region of Oregon and Clark County, Washington areas, USA, is analyzed using ordinary least squares (OLS) and geographically weighted (GWR) multiple regression models. Two scales of analysis, a sectional watershed and a buffer, offered a local and a global investigation of the sources of stream pollutants. Model accuracy, measured by R(2) values, fluctuated according to the scale, season, and regression method used. While most wet season water quality parameters are associated with urban land covers, most dry season water quality parameters are related topographic features such as elevation and slope. GWR models, which take into consideration local relations of spatial autocorrelation, had stronger results than OLS regression models. In the multiple regression models, sectioned watershed results were consistently better than the sectioned buffer results, except for dry season pH and stream temperature parameters. This suggests that while riparian land cover does have an effect on water quality, a wider contributing area needs to be included in order to account for distant sources of pollutants. Copyright © 2012 Elsevier B.V. All rights reserved.
1981-09-01
corresponds to the same square footage that consumed the electrical energy. 3. The basic assumptions of multiple linear regres- sion, as enumerated in...7. Data related to the sample of bases is assumed to be representative of bases in the population. Limitations Basic limitations on this research were... Ratemaking --Overview. Rand Report R-5894, Santa Monica CA, May 1977. Chatterjee, Samprit, and Bertram Price. Regression Analysis by Example. New York: John
Villarrasa-Sapiña, Israel; Serra-Añó, Pilar; Pardo-Ibáñez, Alberto; Gonzalez, Luis-Millán; García-Massó, Xavier
2017-01-01
Obesity is now a serious worldwide challenge, especially in children. This condition can cause a number of different health problems, including musculoskeletal disorders, some of which are due to mechanical stress caused by excess body weight. The aim of this study was to determine the association between body composition and the vertical ground reaction force produced during walking in obese children. Sixteen children participated in the study, six females and ten males [11.5 (1.2) years old, 69.8 (15.5) kg, 1.56 (0.09) m, and 28.36 (3.74) kg/m 2 of body mass index (BMI)]. Total weight, lean mass and fat mass were measured by dual-energy X-ray absorptiometry and vertical forces while walking were obtained by a force platform. The vertical force variables analysed were impact and propulsive forces, and the rate of development of both. Multiple regression models for each vertical force parameter were calculated using the body composition variables as input. The impact force regression model was found to be positively related to the weight of obese children and negatively related to lean mass. The regression model showed lean mass was positively related to the propulsive rate. Finally, regression models for impact and propulsive force showed a direct relationship with body weight. Impact force is positively related to the weight of obese children, but lean mass helps to reduce the impact force in this population. Exercise could help obese persons to reduce their total body weight and increase their lean mass, thus reducing impact forces during sports and other activities. Copyright © 2016 Elsevier Ltd. All rights reserved.
Fitzpatrick, Cole D; Rakasi, Saritha; Knodler, Michael A
2017-01-01
Speed is one of the most important factors in traffic safety as higher speeds are linked to increased crash risk and higher injury severities. Nearly a third of fatal crashes in the United States are designated as "speeding-related", which is defined as either "the driver behavior of exceeding the posted speed limit or driving too fast for conditions." While many studies have utilized the speeding-related designation in safety analyses, no studies have examined the underlying accuracy of this designation. Herein, we investigate the speeding-related crash designation through the development of a series of logistic regression models that were derived from the established speeding-related crash typologies and validated using a blind review, by multiple researchers, of 604 crash narratives. The developed logistic regression model accurately identified crashes which were not originally designated as speeding-related but had crash narratives that suggested speeding as a causative factor. Only 53.4% of crashes designated as speeding-related contained narratives which described speeding as a causative factor. Further investigation of these crashes revealed that the driver contributing code (DCC) of "driving too fast for conditions" was being used in three separate situations. Additionally, this DCC was also incorrectly used when "exceeding the posted speed limit" would likely have been a more appropriate designation. Finally, it was determined that the responding officer only utilized one DCC in 82% of crashes not designated as speeding-related but contained a narrative indicating speed as a contributing causal factor. The use of logistic regression models based upon speeding-related crash typologies offers a promising method by which all possible speeding-related crashes could be identified. Published by Elsevier Ltd.
NASA Astrophysics Data System (ADS)
Emamgolizadeh, S.; Bateni, S. M.; Shahsavani, D.; Ashrafi, T.; Ghorbani, H.
2015-10-01
The soil cation exchange capacity (CEC) is one of the main soil chemical properties, which is required in various fields such as environmental and agricultural engineering as well as soil science. In situ measurement of CEC is time consuming and costly. Hence, numerous studies have used traditional regression-based techniques to estimate CEC from more easily measurable soil parameters (e.g., soil texture, organic matter (OM), and pH). However, these models may not be able to adequately capture the complex and highly nonlinear relationship between CEC and its influential soil variables. In this study, Genetic Expression Programming (GEP) and Multivariate Adaptive Regression Splines (MARS) were employed to estimate CEC from more readily measurable soil physical and chemical variables (e.g., OM, clay, and pH) by developing functional relations. The GEP- and MARS-based functional relations were tested at two field sites in Iran. Results showed that GEP and MARS can provide reliable estimates of CEC. Also, it was found that the MARS model (with root-mean-square-error (RMSE) of 0.318 Cmol+ kg-1 and correlation coefficient (R2) of 0.864) generated slightly better results than the GEP model (with RMSE of 0.270 Cmol+ kg-1 and R2 of 0.807). The performance of GEP and MARS models was compared with two existing approaches, namely artificial neural network (ANN) and multiple linear regression (MLR). The comparison indicated that MARS and GEP outperformed the MLP model, but they did not perform as good as ANN. Finally, a sensitivity analysis was conducted to determine the most and the least influential variables affecting CEC. It was found that OM and pH have the most and least significant effect on CEC, respectively.
The effects of climate change on harp seals (Pagophilus groenlandicus).
Johnston, David W; Bowers, Matthew T; Friedlaender, Ari S; Lavigne, David M
2012-01-01
Harp seals (Pagophilus groenlandicus) have evolved life history strategies to exploit seasonal sea ice as a breeding platform. As such, individuals are prepared to deal with fluctuations in the quantity and quality of ice in their breeding areas. It remains unclear, however, how shifts in climate may affect seal populations. The present study assesses the effects of climate change on harp seals through three linked analyses. First, we tested the effects of short-term climate variability on young-of-the year harp seal mortality using a linear regression of sea ice cover in the Gulf of St. Lawrence against stranding rates of dead harp seals in the region during 1992 to 2010. A similar regression of stranding rates and North Atlantic Oscillation (NAO) index values was also conducted. These analyses revealed negative correlations between both ice cover and NAO conditions and seal mortality, indicating that lighter ice cover and lower NAO values result in higher mortality. A retrospective cross-correlation analysis of NAO conditions and sea ice cover from 1978 to 2011 revealed that NAO-related changes in sea ice may have contributed to the depletion of seals on the east coast of Canada during 1950 to 1972, and to their recovery during 1973 to 2000. This historical retrospective also reveals opposite links between neonatal mortality in harp seals in the Northeast Atlantic and NAO phase. Finally, an assessment of the long-term trends in sea ice cover in the breeding regions of harp seals across the entire North Atlantic during 1979 through 2011 using multiple linear regression models and mixed effects linear regression models revealed that sea ice cover in all harp seal breeding regions has been declining by as much as 6 percent per decade over the time series of available satellite data.
The Effects of Climate Change on Harp Seals (Pagophilus groenlandicus)
Johnston, David W.; Bowers, Matthew T.; Friedlaender, Ari S.; Lavigne, David M.
2012-01-01
Harp seals (Pagophilus groenlandicus) have evolved life history strategies to exploit seasonal sea ice as a breeding platform. As such, individuals are prepared to deal with fluctuations in the quantity and quality of ice in their breeding areas. It remains unclear, however, how shifts in climate may affect seal populations. The present study assesses the effects of climate change on harp seals through three linked analyses. First, we tested the effects of short-term climate variability on young-of-the year harp seal mortality using a linear regression of sea ice cover in the Gulf of St. Lawrence against stranding rates of dead harp seals in the region during 1992 to 2010. A similar regression of stranding rates and North Atlantic Oscillation (NAO) index values was also conducted. These analyses revealed negative correlations between both ice cover and NAO conditions and seal mortality, indicating that lighter ice cover and lower NAO values result in higher mortality. A retrospective cross-correlation analysis of NAO conditions and sea ice cover from 1978 to 2011 revealed that NAO-related changes in sea ice may have contributed to the depletion of seals on the east coast of Canada during 1950 to 1972, and to their recovery during 1973 to 2000. This historical retrospective also reveals opposite links between neonatal mortality in harp seals in the Northeast Atlantic and NAO phase. Finally, an assessment of the long-term trends in sea ice cover in the breeding regions of harp seals across the entire North Atlantic during 1979 through 2011 using multiple linear regression models and mixed effects linear regression models revealed that sea ice cover in all harp seal breeding regions has been declining by as much as 6 percent per decade over the time series of available satellite data. PMID:22238591
David, Ingrid; Garreau, Hervé; Balmisse, Elodie; Billon, Yvon; Canario, Laurianne
2017-01-20
Some genetic studies need to take into account correlations between traits that are repeatedly measured over time. Multiple-trait random regression models are commonly used to analyze repeated traits but suffer from several major drawbacks. In the present study, we developed a multiple-trait extension of the structured antedependence model (SAD) to overcome this issue and validated its usefulness by modeling the association between litter size (LS) and average birth weight (ABW) over parities in pigs and rabbits. The single-trait SAD model assumes that a random effect at time [Formula: see text] can be explained by the previous values of the random effect (i.e. at previous times). The proposed multiple-trait extension of the SAD model consists in adding a cross-antedependence parameter to the single-trait SAD model. This model can be easily fitted using ASReml and the OWN Fortran program that we have developed. In comparison with the random regression model, we used our multiple-trait SAD model to analyze the LS and ABW of 4345 litters from 1817 Large White sows and 8706 litters from 2286 L-1777 does over a maximum of five successive parities. For both species, the multiple-trait SAD fitted the data better than the random regression model. The difference between AIC of the two models (AIC_random regression-AIC_SAD) were equal to 7 and 227 for pigs and rabbits, respectively. A similar pattern of heritability and correlation estimates was obtained for both species. Heritabilities were lower for LS (ranging from 0.09 to 0.29) than for ABW (ranging from 0.23 to 0.39). The general trend was a decrease of the genetic correlation for a given trait between more distant parities. Estimates of genetic correlations between LS and ABW were negative and ranged from -0.03 to -0.52 across parities. No correlation was observed between the permanent environmental effects, except between the permanent environmental effects of LS and ABW of the same parity, for which the estimate of the correlation was strongly negative (ranging from -0.57 to -0.67). We demonstrated that application of our multiple-trait SAD model is feasible for studying several traits with repeated measurements and showed that it provided a better fit to the data than the random regression model.
NASA Technical Reports Server (NTRS)
Parsons, Vickie s.
2009-01-01
The request to conduct an independent review of regression models, developed for determining the expected Launch Commit Criteria (LCC) External Tank (ET)-04 cycle count for the Space Shuttle ET tanking process, was submitted to the NASA Engineering and Safety Center NESC on September 20, 2005. The NESC team performed an independent review of regression models documented in Prepress Regression Analysis, Tom Clark and Angela Krenn, 10/27/05. This consultation consisted of a peer review by statistical experts of the proposed regression models provided in the Prepress Regression Analysis. This document is the consultation's final report.
5 CFR 591.219 - How does OPM compute shelter price indexes?
Code of Federal Regulations, 2014 CFR
2014-01-01
... estimates in hedonic regressions (a type of multiple regression) to compute for each COLA survey area the price index for rental and/or rental equivalent units of comparable quality and size between the COLA...
5 CFR 591.219 - How does OPM compute shelter price indexes?
Code of Federal Regulations, 2011 CFR
2011-01-01
... estimates in hedonic regressions (a type of multiple regression) to compute for each COLA survey area the price index for rental and/or rental equivalent units of comparable quality and size between the COLA...
5 CFR 591.219 - How does OPM compute shelter price indexes?
Code of Federal Regulations, 2013 CFR
2013-01-01
... estimates in hedonic regressions (a type of multiple regression) to compute for each COLA survey area the price index for rental and/or rental equivalent units of comparable quality and size between the COLA...
5 CFR 591.219 - How does OPM compute shelter price indexes?
Code of Federal Regulations, 2012 CFR
2012-01-01
... estimates in hedonic regressions (a type of multiple regression) to compute for each COLA survey area the price index for rental and/or rental equivalent units of comparable quality and size between the COLA...
Krasikova, Dina V; Le, Huy; Bachura, Eric
2018-06-01
To address a long-standing concern regarding a gap between organizational science and practice, scholars called for more intuitive and meaningful ways of communicating research results to users of academic research. In this article, we develop a common language effect size index (CLβ) that can help translate research results to practice. We demonstrate how CLβ can be computed and used to interpret the effects of continuous and categorical predictors in multiple linear regression models. We also elaborate on how the proposed CLβ index is computed and used to interpret interactions and nonlinear effects in regression models. In addition, we test the robustness of the proposed index to violations of normality and provide means for computing standard errors and constructing confidence intervals around its estimates. (PsycINFO Database Record (c) 2018 APA, all rights reserved).
Steen, Paul J.; Passino-Reader, Dora R.; Wiley, Michael J.
2006-01-01
As a part of the Great Lakes Regional Aquatic Gap Analysis Project, we evaluated methodologies for modeling associations between fish species and habitat characteristics at a landscape scale. To do this, we created brook trout Salvelinus fontinalis presence and absence models based on four different techniques: multiple linear regression, logistic regression, neural networks, and classification trees. The models were tested in two ways: by application to an independent validation database and cross-validation using the training data, and by visual comparison of statewide distribution maps with historically recorded occurrences from the Michigan Fish Atlas. Although differences in the accuracy of our models were slight, the logistic regression model predicted with the least error, followed by multiple regression, then classification trees, then the neural networks. These models will provide natural resource managers a way to identify habitats requiring protection for the conservation of fish species.
Factors contributing to practice variation in post-stroke rehabilitation.
Lee, A J; Huber, J H; Stason, W B
1997-01-01
OBJECTIVE: To analyze geographic variability in the utilization and cost of post-stroke medical care using multiple linear regression. DATA SOURCES/STUDY SETTING: A 20 percent random sample of Medicare beneficiaries with an admission to an acute care hospital for stroke during the first six months of 1991, supplemented by data from their Medicare claims and beneficiary records, the Medicare Cost Reports for hospitals and nursing homes, and the Area Resource File. STUDY DESIGN: Weighted least squares regression is used to analyze variations in post-stroke practice patterns across 151 MSAs (Metropolitan Statistical Areas). Average post-stroke costs, utilization rates, and facility lengths of stay are regressed on patient and market characteristics. DATA COLLECTION/EXTRACTION METHODS: For a six-month post-stroke interval, beneficiary-level post-stroke costs and service utilization are averaged by MSA. Variables describing market conditions are then added to these MSA-level records. PRINCIPAL FINDINGS: Patient variables rarely explain more than a third of practice variation, and often they explain substantially less than that. Market variables (with some exception) tend to be relatively less important. Finally, one-half to two-thirds of the practice variation across MSAs is unexplained by the patient and market factors measured in our data. CONCLUSIONS: A substantial portion of inter-MSA variability in utilization and intensity of post-stroke rehabilitation services cannot be explained by differences in patient characteristics. Given the large practice differences observed across MSAs, it seems unlikely that unmeasured patient differences can account for much more of the practice differences. PMID:9180616
Xue, Dan; Yin, Jingyuan
2014-05-01
In this study, we explored the potential applications of the Ozone Monitoring Instrument (OMI) satellite sensor in air pollution research. The OMI planetary boundary layer sulfur dioxide (SO2_PBL) column density and daily average surface SO2 concentration of Shanghai from 2004 to 2012 were analyzed. After several consecutive years of increase, the surface SO2 concentration finally declined in 2007. It was higher in winter than in other seasons. The coefficient between daily average surface SO2 concentration and SO2_PBL was only 0.316. But SO2_PBL was found to be a highly significant predictor of the surface SO2 concentration using the simple regression model. Five meteorological factors were considered in this study, among them, temperature, dew point, relative humidity, and wind speed were negatively correlated with surface SO2 concentration, while pressure was positively correlated. Furthermore, it was found that dew point was a more effective predictor than temperature. When these meteorological factors were used in multiple regression, the determination coefficient reached 0.379. The relationship of the surface SO2 concentration and meteorological factors was seasonally dependent. In summer and autumn, the regression model performed better than in spring and winter. The surface SO2 concentration predicting method proposed in this study can be easily adapted for other regions, especially most useful for those having no operational air pollution forecasting services or having sparse ground monitoring networks.
Predicting the demand of physician workforce: an international model based on "crowd behaviors".
Tsai, Tsuen-Chiuan; Eliasziw, Misha; Chen, Der-Fang
2012-03-26
Appropriateness of physician workforce greatly influences the quality of healthcare. When facing the crisis of physician shortages, the correction of manpower always takes an extended time period, and both the public and health personnel suffer. To calculate an appropriate number of Physician Density (PD) for a specific country, this study was designed to create a PD prediction model, based on health-related data from many countries. Twelve factors that could possibly impact physicians' demand were chosen, and data of these factors from 130 countries (by reviewing 195) were extracted. Multiple stepwise-linear regression was used to derive the PD prediction model, and a split-sample cross-validation procedure was performed to evaluate the generalizability of the results. Using data from 130 countries, with the consideration of the correlation between variables, and preventing multi-collinearity, seven out of the 12 predictor variables were selected for entry into the stepwise regression procedure. The final model was: PD = (5.014 - 0.128 × proportion under age 15 years + 0.034 × life expectancy)2, with R2 of 80.4%. Using the prediction equation, 70 countries had PDs with "negative discrepancy", while 58 had PDs with "positive discrepancy". This study provided a regression-based PD model to calculate a "norm" number of PD for a specific country. A large PD discrepancy in a country indicates the needs to examine physician's workloads and their well-being, the effectiveness/efficiency of medical care, the promotion of population health and the team resource management.
Effect of antenatal corticosteroids on fetal growth and gestational age at birth.
Murphy, Kellie E; Willan, Andrew R; Hannah, Mary E; Ohlsson, Arne; Kelly, Edmond N; Matthews, Stephen G; Saigal, Saroj; Asztalos, Elizabeth; Ross, Susan; Delisle, Marie-France; Amankwah, Kofi; Guselle, Patricia; Gafni, Amiram; Lee, Shoo K; Armson, B Anthony
2012-05-01
To estimate the effect of multiple courses of antenatal corticosteroids on neonatal size, controlling for gestational age at birth and other confounders, and to determine whether there was a dose-response relationship between number of courses of antenatal corticosteroids and neonatal size. This is a secondary analysis of the Multiple Courses of Antenatal Corticosteroids for Preterm Birth Study, a double-blind randomized controlled trial of single compared with multiple courses of antenatal corticosteroids in women at risk for preterm birth and in which fetuses administered multiple courses of antenatal corticosteroids weighed less, were shorter, and had smaller head circumferences at birth. All women (n=1,858) and children (n=2,304) enrolled in the Multiple Courses of Antenatal Corticosteroids for Preterm Birth Study were included in the current analysis. Multiple linear regression analyses were undertaken. Compared with placebo, neonates in the antenatal corticosteroids group were born earlier (estimated difference and confidence interval [CI]: -0.428 weeks, CI -0.10264 to -0.75336; P=.01). Controlling for gestational age at birth and confounding factors, multiple courses of antenatal corticosteroids were associated with a decrease in birth weight (-33.50 g, CI -66.27120 to -0.72880; P=.045), length (-0.339 cm, CI -0.6212 to -0.05676]; P=.019), and head circumference (-0.296 cm, -0.45672 to -0.13528; P<.001). For each additional course of antenatal corticosteroids, there was a trend toward an incremental decrease in birth weight, length, and head circumference. Fetuses exposed to multiple courses of antenatal corticosteroids were smaller at birth. The reduction in size was partially attributed to being born at an earlier gestational age but also was attributed to decreased fetal growth. Finally, a dose-response relationship exists between the number of corticosteroid courses and a decrease in fetal growth. The long-term effect of these findings is unknown. ClinicalTrials.gov, www.clinicaltrials.gov, NCT00187382. II.
Modification of the USLE K factor for soil erodibility assessment on calcareous soils in Iran
NASA Astrophysics Data System (ADS)
Ostovari, Yaser; Ghorbani-Dashtaki, Shoja; Bahrami, Hossein-Ali; Naderi, Mehdi; Dematte, Jose Alexandre M.; Kerry, Ruth
2016-11-01
The measurement of soil erodibility (K) in the field is tedious, time-consuming and expensive; therefore, its prediction through pedotransfer functions (PTFs) could be far less costly and time-consuming. The aim of this study was to develop new PTFs to estimate the K factor using multiple linear regression, Mamdani fuzzy inference systems, and artificial neural networks. For this purpose, K was measured in 40 erosion plots with natural rainfall. Various soil properties including the soil particle size distribution, calcium carbonate equivalent, organic matter, permeability, and wet-aggregate stability were measured. The results showed that the mean measured K was 0.014 t h MJ- 1 mm- 1 and 2.08 times less than the estimated mean K (0.030 t h MJ- 1 mm- 1) using the USLE model. Permeability, wet-aggregate stability, very fine sand, and calcium carbonate were selected as independent variables by forward stepwise regression in order to assess the ability of multiple linear regression, Mamdani fuzzy inference systems and artificial neural networks to predict K. The calcium carbonate equivalent, which is not accounted for in the USLE model, had a significant impact on K in multiple linear regression due to its strong influence on the stability of aggregates and soil permeability. Statistical indices in validation and calibration datasets determined that the artificial neural networks method with the highest R2, lowest RMSE, and lowest ME was the best model for estimating the K factor. A strong correlation (R2 = 0.81, n = 40, p < 0.05) between the estimated K from multiple linear regression and measured K indicates that the use of calcium carbonate equivalent as a predictor variable gives a better estimation of K in areas with calcareous soils.
Aqil, Muhammad; Kita, Ichiro; Yano, Akira; Nishiyama, Soichi
2007-10-01
Traditionally, the multiple linear regression technique has been one of the most widely used models in simulating hydrological time series. However, when the nonlinear phenomenon is significant, the multiple linear will fail to develop an appropriate predictive model. Recently, neuro-fuzzy systems have gained much popularity for calibrating the nonlinear relationships. This study evaluated the potential of a neuro-fuzzy system as an alternative to the traditional statistical regression technique for the purpose of predicting flow from a local source in a river basin. The effectiveness of the proposed identification technique was demonstrated through a simulation study of the river flow time series of the Citarum River in Indonesia. Furthermore, in order to provide the uncertainty associated with the estimation of river flow, a Monte Carlo simulation was performed. As a comparison, a multiple linear regression analysis that was being used by the Citarum River Authority was also examined using various statistical indices. The simulation results using 95% confidence intervals indicated that the neuro-fuzzy model consistently underestimated the magnitude of high flow while the low and medium flow magnitudes were estimated closer to the observed data. The comparison of the prediction accuracy of the neuro-fuzzy and linear regression methods indicated that the neuro-fuzzy approach was more accurate in predicting river flow dynamics. The neuro-fuzzy model was able to improve the root mean square error (RMSE) and mean absolute percentage error (MAPE) values of the multiple linear regression forecasts by about 13.52% and 10.73%, respectively. Considering its simplicity and efficiency, the neuro-fuzzy model is recommended as an alternative tool for modeling of flow dynamics in the study area.
González Costa, J J; Reigosa, M J; Matías, J M; Covelo, E F
2017-09-01
The aim of this study was to model the sorption and retention of Cd, Cu, Ni, Pb and Zn in soils. To that extent, the sorption and retention of these metals were studied and the soil characterization was performed separately. Multiple stepwise regression was used to produce multivariate models with linear techniques and with support vector machines, all of which included 15 explanatory variables characterizing soils. When the R-squared values are represented, two different groups are noticed. Cr, Cu and Pb sorption and retention show a higher R-squared; the most explanatory variables being humified organic matter, Al oxides and, in some cases, cation-exchange capacity (CEC). The other group of metals (Cd, Ni and Zn) shows a lower R-squared, and clays are the most explanatory variables, including a percentage of vermiculite and slime. In some cases, quartz, plagioclase or hematite percentages also show some explanatory capacity. Support Vector Machine (SVM) regression shows that the different models are not as regular as in multiple regression in terms of number of variables, the regression for nickel adsorption being the one with the highest number of variables in its optimal model. On the other hand, there are cases where the most explanatory variables are the same for two metals, as it happens with Cd and Cr adsorption. A similar adsorption mechanism is thus postulated. These patterns of the introduction of variables in the model allow us to create explainability sequences. Those which are the most similar to the selectivity sequences obtained by Covelo (2005) are Mn oxides in multiple regression and change capacity in SVM. Among all the variables, the only one that is explanatory for all the metals after applying the maximum parsimony principle is the percentage of sand in the retention process. In the competitive model arising from the aforementioned sequences, the most intense competitiveness for the adsorption and retention of different metals appears between Cr and Cd, Cu and Zn in multiple regression; and between Cr and Cd in SVM regression. Copyright © 2017 Elsevier B.V. All rights reserved.
Importance of initial and final state effects for azimuthal correlations in p + Pb collisions
Greif, Moritz; Greiner, Carsten; Schenke, Bjorn; ...
2017-11-27
In this work, we investigate the relative importance of initial and final state effects on azimuthal correlations of gluons in low and high multiplicity p+Pb collisions. To achieve this, we couple Yang-Mills dynamics of pre-equilibrium gluon fields (IP-GLASMA) to a perturbative QCD based parton cascade for the final state evolution (BAMPS) on an event-by-event basis. We find that signatures of both the initial state correlations and final state interactions are seen in azimuthal correlation observables, such as v 2 {2PC} (p T), their strength depending on the event multiplicity and transverse momentum. Initial state correlations dominate v 2 {2PC} (pmore » T) in low multiplicity events for transverse momenta p T > 2 GeV. Lastly, while final state interactions are dominant in high multiplicity events, initial state correlations affect v 2 {2PC} (p T) for p T > 2 GeV as well as the pT integrated v 2 {2PC}.« less
Importance of initial and final state effects for azimuthal correlations in p + Pb collisions
DOE Office of Scientific and Technical Information (OSTI.GOV)
Greif, Moritz; Greiner, Carsten; Schenke, Bjorn
In this work, we investigate the relative importance of initial and final state effects on azimuthal correlations of gluons in low and high multiplicity p+Pb collisions. To achieve this, we couple Yang-Mills dynamics of pre-equilibrium gluon fields (IP-GLASMA) to a perturbative QCD based parton cascade for the final state evolution (BAMPS) on an event-by-event basis. We find that signatures of both the initial state correlations and final state interactions are seen in azimuthal correlation observables, such as v 2 {2PC} (p T), their strength depending on the event multiplicity and transverse momentum. Initial state correlations dominate v 2 {2PC} (pmore » T) in low multiplicity events for transverse momenta p T > 2 GeV. Lastly, while final state interactions are dominant in high multiplicity events, initial state correlations affect v 2 {2PC} (p T) for p T > 2 GeV as well as the pT integrated v 2 {2PC}.« less
Nguyen, Quynh C; Osypuk, Theresa L; Schmidt, Nicole M; Glymour, M Maria; Tchetgen Tchetgen, Eric J
2015-03-01
Despite the recent flourishing of mediation analysis techniques, many modern approaches are difficult to implement or applicable to only a restricted range of regression models. This report provides practical guidance for implementing a new technique utilizing inverse odds ratio weighting (IORW) to estimate natural direct and indirect effects for mediation analyses. IORW takes advantage of the odds ratio's invariance property and condenses information on the odds ratio for the relationship between the exposure (treatment) and multiple mediators, conditional on covariates, by regressing exposure on mediators and covariates. The inverse of the covariate-adjusted exposure-mediator odds ratio association is used to weight the primary analytical regression of the outcome on treatment. The treatment coefficient in such a weighted regression estimates the natural direct effect of treatment on the outcome, and indirect effects are identified by subtracting direct effects from total effects. Weighting renders treatment and mediators independent, thereby deactivating indirect pathways of the mediators. This new mediation technique accommodates multiple discrete or continuous mediators. IORW is easily implemented and is appropriate for any standard regression model, including quantile regression and survival analysis. An empirical example is given using data from the Moving to Opportunity (1994-2002) experiment, testing whether neighborhood context mediated the effects of a housing voucher program on obesity. Relevant Stata code (StataCorp LP, College Station, Texas) is provided. © The Author 2015. Published by Oxford University Press on behalf of the Johns Hopkins Bloomberg School of Public Health. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
ERIC Educational Resources Information Center
Cepeda-Cuervo, Edilberto; Núñez-Antón, Vicente
2013-01-01
In this article, a proposed Bayesian extension of the generalized beta spatial regression models is applied to the analysis of the quality of education in Colombia. We briefly revise the beta distribution and describe the joint modeling approach for the mean and dispersion parameters in the spatial regression models' setting. Finally, we motivate…
Agha, Salah R; Alnahhal, Mohammed J
2012-11-01
The current study investigates the possibility of obtaining the anthropometric dimensions, critical to school furniture design, without measuring all of them. The study first selects some anthropometric dimensions that are easy to measure. Two methods are then used to check if these easy-to-measure dimensions can predict the dimensions critical to the furniture design. These methods are multiple linear regression and neural networks. Each dimension that is deemed necessary to ergonomically design school furniture is expressed as a function of some other measured anthropometric dimensions. Results show that out of the five dimensions needed for chair design, four can be related to other dimensions that can be measured while children are standing. Therefore, the method suggested here would definitely save time and effort and avoid the difficulty of dealing with students while measuring these dimensions. In general, it was found that neural networks perform better than multiple linear regression in the current study. Copyright © 2012 Elsevier Ltd and The Ergonomics Society. All rights reserved.
NASA Astrophysics Data System (ADS)
Cai, Jun; Wang, Kuaishe; Shi, Jiamin; Wang, Wen; Liu, Yingying
2018-01-01
Constitutive analysis for hot working of BFe10-1-2 alloy was carried out by using experimental stress-strain data from isothermal hot compression tests, in a wide range of temperature of 1,023 1,273 K, and strain rate range of 0.001 10 s-1. A constitutive equation based on modified double multiple nonlinear regression was proposed considering the independent effects of strain, strain rate, temperature and their interrelation. The predicted flow stress data calculated from the developed equation was compared with the experimental data. Correlation coefficient (R), average absolute relative error (AARE) and relative errors were introduced to verify the validity of the developed constitutive equation. Subsequently, a comparative study was made on the capability of strain-compensated Arrhenius-type constitutive model. The results showed that the developed constitutive equation based on modified double multiple nonlinear regression could predict flow stress of BFe10-1-2 alloy with good correlation and generalization.
Marston, Louise; Peacock, Janet L; Yu, Keming; Brocklehurst, Peter; Calvert, Sandra A; Greenough, Anne; Marlow, Neil
2009-07-01
Studies of prematurely born infants contain a relatively large percentage of multiple births, so the resulting data have a hierarchical structure with small clusters of size 1, 2 or 3. Ignoring the clustering may lead to incorrect inferences. The aim of this study was to compare statistical methods which can be used to analyse such data: generalised estimating equations, multilevel models, multiple linear regression and logistic regression. Four datasets which differed in total size and in percentage of multiple births (n = 254, multiple 18%; n = 176, multiple 9%; n = 10 098, multiple 3%; n = 1585, multiple 8%) were analysed. With the continuous outcome, two-level models produced similar results in the larger dataset, while generalised least squares multilevel modelling (ML GLS 'xtreg' in Stata) and maximum likelihood multilevel modelling (ML MLE 'xtmixed' in Stata) produced divergent estimates using the smaller dataset. For the dichotomous outcome, most methods, except generalised least squares multilevel modelling (ML GH 'xtlogit' in Stata) gave similar odds ratios and 95% confidence intervals within datasets. For the continuous outcome, our results suggest using multilevel modelling. We conclude that generalised least squares multilevel modelling (ML GLS 'xtreg' in Stata) and maximum likelihood multilevel modelling (ML MLE 'xtmixed' in Stata) should be used with caution when the dataset is small. Where the outcome is dichotomous and there is a relatively large percentage of non-independent data, it is recommended that these are accounted for in analyses using logistic regression with adjusted standard errors or multilevel modelling. If, however, the dataset has a small percentage of clusters greater than size 1 (e.g. a population dataset of children where there are few multiples) there appears to be less need to adjust for clustering.
Potts, Tiffany M; Nguyen, Jacqueline L; Ghai, Kanika; Li, Kathy; Perlmuter, Lawrence
2015-04-15
To investigate whether perceptions of task difficulty on neuropsychological tests predicted academic achievement after controlling for glucose levels and depression. Participants were type 1 diabetic adolescents, with a mean age = 12.5 years (23 females and 16 males), seen at a northwest suburban Chicago hospital. The sample population was free of co-morbid clinical health conditions. Subjects completed a three-part neuropsychological battery including the Digit Symbol Task, Trail Making Test, and Controlled Oral Word Association test. Following each task, individuals rated task difficulty and then completed a depression inventory. Performance on these three tests is reflective of neuropsychological status in relation to glucose control. Blood glucose levels were measured immediately prior to and after completing the neuropsychological battery using a glucose meter. HbA1c levels were obtained from medical records. Academic performance was based on self-reported grades in Math, Science, and English. Data was analyzed using multiple regression models to evaluate the associations between academic performance, perception of task difficulty, and glucose control. Perceptions of difficulty on a neuropsychological battery significantly predicted academic performance after accounting for glucose control and depression. Perceptions of difficulty on the neuropsychological tests were inversely correlated with academic performance (r = -0.48), while acute (blood glucose) and long-term glucose levels increased along with perceptions of task difficulty (r = 0.47). Additionally, higher depression scores were associated with poorer academic performance (r = -0.43). With the first regression analysis, perception of difficulty on the neuropsychological tasks contributed to 8% of the variance in academic performance after controlling for peripheral blood glucose and depression. In the second regression analysis, perception of difficulty accounted for 11% of the variance after accounting for academic performance and depression. The final regression analysis indicated that perception of difficulty increased with peripheral blood glucose, contributing to 22% of the variance. Most importantly, after controlling for perceptions of task difficulty, academic performance no longer predicted glucose levels. Finally, subjects who found the cognitive battery difficult were likely to have poor academic grades. Perceptions of difficulty on neurological tests exhibited a significant association with academic achievement, indicating that deficits in this skill may lead to academic disadvantage in diabetic patients.
ERIC Educational Resources Information Center
Petrowsky, Michael C.
This paper analyzes the results of a pilot study at Glendale Community College (Arizona) to assess the effectiveness of a comprehensive multiple choice final exam in the macroeconomic principles course. The "pilot project" involved the administration of a 50-question multiple choice exam to 71 students in three macroeconomics sections.…
Curran, Janet H.; Barth, Nancy A.; Veilleux, Andrea G.; Ourso, Robert T.
2016-03-16
Estimates of the magnitude and frequency of floods are needed across Alaska for engineering design of transportation and water-conveyance structures, flood-insurance studies, flood-plain management, and other water-resource purposes. This report updates methods for estimating flood magnitude and frequency in Alaska and conterminous basins in Canada. Annual peak-flow data through water year 2012 were compiled from 387 streamgages on unregulated streams with at least 10 years of record. Flood-frequency estimates were computed for each streamgage using the Expected Moments Algorithm to fit a Pearson Type III distribution to the logarithms of annual peak flows. A multiple Grubbs-Beck test was used to identify potentially influential low floods in the time series of peak flows for censoring in the flood frequency analysis.For two new regional skew areas, flood-frequency estimates using station skew were computed for stations with at least 25 years of record for use in a Bayesian least-squares regression analysis to determine a regional skew value. The consideration of basin characteristics as explanatory variables for regional skew resulted in improvements in precision too small to warrant the additional model complexity, and a constant model was adopted. Regional Skew Area 1 in eastern-central Alaska had a regional skew of 0.54 and an average variance of prediction of 0.45, corresponding to an effective record length of 22 years. Regional Skew Area 2, encompassing coastal areas bordering the Gulf of Alaska, had a regional skew of 0.18 and an average variance of prediction of 0.12, corresponding to an effective record length of 59 years. Station flood-frequency estimates for study sites in regional skew areas were then recomputed using a weighted skew incorporating the station skew and regional skew. In a new regional skew exclusion area outside the regional skew areas, the density of long-record streamgages was too sparse for regional analysis and station skew was used for all estimates. Final station flood frequency estimates for all study streamgages are presented for the 50-, 20-, 10-, 4-, 2-, 1-, 0.5-, and 0.2-percent annual exceedance probabilities.Regional multiple-regression analysis was used to produce equations for estimating flood frequency statistics from explanatory basin characteristics. Basin characteristics, including physical and climatic variables, were updated for all study streamgages using a geographical information system and geospatial source data. Screening for similar-sized nested basins eliminated hydrologically redundant sites, and screening for eligibility for analysis of explanatory variables eliminated regulated peaks, outburst peaks, and sites with indeterminate basin characteristics. An ordinary least‑squares regression used flood-frequency statistics and basin characteristics for 341 streamgages (284 in Alaska and 57 in Canada) to determine the most suitable combination of basin characteristics for a flood-frequency regression model and to explore regional grouping of streamgages for explaining variability in flood-frequency statistics across the study area. The most suitable model for explaining flood frequency used drainage area and mean annual precipitation as explanatory variables for the entire study area as a region. Final regression equations for estimating the 50-, 20-, 10-, 4-, 2-, 1-, 0.5-, and 0.2-percent annual exceedance probability discharge in Alaska and conterminous basins in Canada were developed using a generalized least-squares regression. The average standard error of prediction for the regression equations for the various annual exceedance probabilities ranged from 69 to 82 percent, and the pseudo-coefficient of determination (pseudo-R2) ranged from 85 to 91 percent.The regional regression equations from this study were incorporated into the U.S. Geological Survey StreamStats program for a limited area of the State—the Cook Inlet Basin. StreamStats is a national web-based geographic information system application that facilitates retrieval of streamflow statistics and associated information. StreamStats retrieves published data for gaged sites and, for user-selected ungaged sites, delineates drainage areas from topographic and hydrographic data, computes basin characteristics, and computes flood frequency estimates using the regional regression equations.
Gromisch, Elizabeth S; Portnoy, Jeffrey G; Foley, Frederick W
2018-05-15
Cognitive impairment is a prevalent and often intrusive problem among persons with multiple sclerosis (PwMS). Valid and reliable assessments, including quick screening measures, are crucial. The Brief International Cognitive Assessment for MS (BICAMS) was developed for this reason. While it lends itself to use in locations where formal neuropsychological resources might be limited, it does not include measures of verbal fluency or executive functioning, domains assessed as part of the Minimal Assessment of Cognitive Function in MS (MACFIMS). Given previous evidence that shortened MACFIMS measures have strong criterion validity, this study aimed to determine which of these should be included in the abbreviated MACFIMS (aMACFIMS), and how the aMACFIMS compares to the BICAMS. One hundred forty-seven PwMS were included in the analyses. A stepwise logistic regression was used to determine the measures in the aMACFIMS. Receiver-operating-characteristic (ROC) curves assessed the classification accuracy, sensitivity, and specificity. The batteries' sensitivity, specificity, and predictive values were then compared. Compared to the BICAMS, the final aMACFIMS had higher specificity (87% versus 72%) and positive predictive value (86% versus 77%), but lower sensitivity (71% versus 81%). The aMACFIMS has several benefits, including reduced administration time and the addition of a verbal fluency/executive functioning measure. Copyright © 2018 Elsevier B.V. All rights reserved.
D'Ambrosio, Alessandro; Pagani, Elisabetta; Riccitelli, Gianna C; Colombo, Bruno; Rodegher, Mariaemma; Falini, Andrea; Comi, Giancarlo; Filippi, Massimo; Rocca, Maria A
2017-08-01
To investigate the role of cerebellar sub-regions on motor and cognitive performance in multiple sclerosis (MS) patients. Whole and sub-regional cerebellar volumes, brain volumes, T2 hyperintense lesion volumes (LV), and motor performance scores were obtained from 95 relapse-onset MS patients and 32 healthy controls (HC). MS patients also underwent an evaluation of working memory and processing speed functions. Cerebellar anterior and posterior lobes were segmented using the Spatially Unbiased Infratentorial Toolbox (SUIT) from Statistical Parametric Mapping (SPM12). Multivariate linear regression models assessed the relationship between magnetic resonance imaging (MRI) measures and motor/cognitive scores. Compared to HC, only secondary progressive multiple sclerosis (SPMS) patients had lower cerebellar volumes (total and posterior cerebellum). In MS patients, lower anterior cerebellar volume and brain T2 LV predicted worse motor performance, whereas lower posterior cerebellar volume and brain T2 LV predicted poor cognitive performance. Global measures of brain volume and infratentorial T2 LV were not selected by the final multivariate models. Cerebellar volumetric abnormalities are likely to play an important contribution to explain motor and cognitive performance in MS patients. Consistently with functional mapping studies, cerebellar posterior-inferior volume accounted for variance in cognitive measures, whereas anterior cerebellar volume accounted for variance in motor performance, supporting the assessment of cerebellar damage at sub-regional level.
77 FR 3121 - Program Integrity: Gainful Employment-Debt Measures; Correction
Federal Register 2010, 2011, 2012, 2013, 2014
2012-01-23
...On June 13, 2011, the Secretary of Education (Secretary) published a notice of final regulations in the Federal Register for Program Integrity: Gainful Employment--Debt Measures (Gainful Employment--Debt Measures) (76 FR 34386). In the preamble of the final regulations, we used the wrong data to calculate the percent of total variance in institutions' repayment rates that may be explained by race/ethnicity. Our intent was to use the data that included all minority students per institution. However, we mistakenly used the data for a subset of minority students per institution. We have now recalculated the total variance using the data that includes all minority students. Through this document, we correct, in the preamble of the Gainful Employment--Debt Measures final regulations, the errors resulting from this misapplication. We do not change the regression analysis model itself; we are using the same model with the appropriate data. Through this notice we also correct, in the preamble of the Gainful Employment--Debt Measures final regulations, our description of one component of the regression analysis. The preamble referred to use of an institutional variable measuring acceptance rates. This description was incorrect; in fact we used an institutional variable measuring retention rates. Correcting this language does not change the regression analysis model itself or the variance explained by the model. The text of the final regulations remains unchanged.
Akimoto, Yuki; Yugi, Katsuyuki; Uda, Shinsuke; Kudo, Takamasa; Komori, Yasunori; Kubota, Hiroyuki; Kuroda, Shinya
2013-01-01
Cells use common signaling molecules for the selective control of downstream gene expression and cell-fate decisions. The relationship between signaling molecules and downstream gene expression and cellular phenotypes is a multiple-input and multiple-output (MIMO) system and is difficult to understand due to its complexity. For example, it has been reported that, in PC12 cells, different types of growth factors activate MAP kinases (MAPKs) including ERK, JNK, and p38, and CREB, for selective protein expression of immediate early genes (IEGs) such as c-FOS, c-JUN, EGR1, JUNB, and FOSB, leading to cell differentiation, proliferation and cell death; however, how multiple-inputs such as MAPKs and CREB regulate multiple-outputs such as expression of the IEGs and cellular phenotypes remains unclear. To address this issue, we employed a statistical method called partial least squares (PLS) regression, which involves a reduction of the dimensionality of the inputs and outputs into latent variables and a linear regression between these latent variables. We measured 1,200 data points for MAPKs and CREB as the inputs and 1,900 data points for IEGs and cellular phenotypes as the outputs, and we constructed the PLS model from these data. The PLS model highlighted the complexity of the MIMO system and growth factor-specific input-output relationships of cell-fate decisions in PC12 cells. Furthermore, to reduce the complexity, we applied a backward elimination method to the PLS regression, in which 60 input variables were reduced to 5 variables, including the phosphorylation of ERK at 10 min, CREB at 5 min and 60 min, AKT at 5 min and JNK at 30 min. The simple PLS model with only 5 input variables demonstrated a predictive ability comparable to that of the full PLS model. The 5 input variables effectively extracted the growth factor-specific simple relationships within the MIMO system in cell-fate decisions in PC12 cells.
Running, Alice; Hildreth, Laura
2017-03-01
To examine the effectiveness of a bio-energy intervention on self-reported stress for a convenience sample of University students, faculty, and staff during finals week. We hypothesized that participants would report a decrease in stress after a 20 minute bio-energy intervention. A quasi-experimental, single-group, pretest-posttest design was used. Thirty-nine faculty, staff, and students participated. Participants served as their own controls. A specific technique was provided by each bio-energy practitioner for 20 minutes after participants had completed a visual analogue scale identifying level of stress and listing two positive and negative behaviors they were currently using in response to stress. A one-sample t test indicates that bio-energy therapy significantly reduces stress, t(35) = 7.74, p < .0001. A multiple regression analysis further indicates that the decrease in stress levels is significantly greater for higher initial stress levels, t(31) = 4.748, p < .0001); decreases in stress are significantly greater for faculty and staff compared to students, t(31) = -2.223, p = .034; and decreases in stress levels are marginally significantly higher for older participants, t(31) =1.946, p = .061. Bio-energy therapy may have benefit in reducing stress for faculty, staff, and students during final examination week. Further research is needed.
Kashfi, H.; Yazdani, A. R.; Latifi, M.; Shirani Bidabadi, F.
2011-01-01
The purpose of this research is to study any effects of managerial strategies on prevention of ketosis metabolic disorder in transition period in Shahroud commercial dairy farms. For this purpose, a questionnaire was prepared in order to obtain required information about the performance of these managerial strategies, performance costs, involvement situation with disorders relying upon clinical signs and treatment and health records, producing and economic situation, and fertility rate and its costs. The considered managerial guidelines include body condition score management or type evaluation in transition period, increase in dry matter intake close to parturition, using propylene glycol, using niacin, and high-quality feeding (the importance of feed quality) in transition period. Finally and upon arrangement of data, it was possible to study any effects of mentioned managerial strategies on related variants through multiple linear regressions. Furthermore, in order to study any relation among variables, we considered Pearson correlation coefficients as well. Finally, it was revealed that any application of managerial strategies for prevention from Ketosis in transition period has a significant effect in betterment of managerial and economic parameters. PMID:23738102
Detection of epistatic effects with logic regression and a classical linear regression model.
Malina, Magdalena; Ickstadt, Katja; Schwender, Holger; Posch, Martin; Bogdan, Małgorzata
2014-02-01
To locate multiple interacting quantitative trait loci (QTL) influencing a trait of interest within experimental populations, usually methods as the Cockerham's model are applied. Within this framework, interactions are understood as the part of the joined effect of several genes which cannot be explained as the sum of their additive effects. However, if a change in the phenotype (as disease) is caused by Boolean combinations of genotypes of several QTLs, this Cockerham's approach is often not capable to identify them properly. To detect such interactions more efficiently, we propose a logic regression framework. Even though with the logic regression approach a larger number of models has to be considered (requiring more stringent multiple testing correction) the efficient representation of higher order logic interactions in logic regression models leads to a significant increase of power to detect such interactions as compared to a Cockerham's approach. The increase in power is demonstrated analytically for a simple two-way interaction model and illustrated in more complex settings with simulation study and real data analysis.
Statistics in biomedical laboratory and clinical science: applications, issues and pitfalls.
Ludbrook, John
2008-01-01
This review is directed at biomedical scientists who want to gain a better understanding of statistics: what tests to use, when, and why. In my view, even during the planning stage of a study it is very important to seek the advice of a qualified biostatistician. When designing and analyzing a study, it is important to construct and test global hypotheses, rather than to make multiple tests on the data. If the latter cannot be avoided, it is essential to control the risk of making false-positive inferences by applying multiple comparison procedures. For comparing two means or two proportions, it is best to use exact permutation tests rather then the better known, classical, ones. For comparing many means, analysis of variance, often of a complex type, is the most powerful approach. The correlation coefficient should never be used to compare the performances of two methods of measurement, or two measures, because it does not detect bias. Instead the Altman-Bland method of differences or least-products linear regression analysis should be preferred. Finally, the educational value to investigators of interaction with a biostatistician, before, during and after a study, cannot be overemphasized. (c) 2007 S. Karger AG, Basel.
Hunter-Gatherer Inter-Band Interaction Rates: Implications for Cumulative Culture
Hill, Kim R.; Wood, Brian M.; Baggio, Jacopo; Hurtado, A. Magdalena; Boyd, Robert T.
2014-01-01
Our species exhibits spectacular success due to cumulative culture. While cognitive evolution of social learning mechanisms may be partially responsible for adaptive human culture, features of early human social structure may also play a role by increasing the number potential models from which to learn innovations. We present interview data on interactions between same-sex adult dyads of Ache and Hadza hunter-gatherers living in multiple distinct residential bands (20 Ache bands; 42 Hadza bands; 1201 dyads) throughout a tribal home range. Results show high probabilities (5%–29% per year) of cultural and cooperative interactions between randomly chosen adults. Multiple regression suggests that ritual relationships increase interaction rates more than kinship, and that affinal kin interact more often than dyads with no relationship. These may be important features of human sociality. Finally, yearly interaction rates along with survival data allow us to estimate expected lifetime partners for a variety of social activities, and compare those to chimpanzees. Hadza and Ache men are estimated to observe over 300 men making tools in a lifetime, whereas male chimpanzees interact with only about 20 other males in a lifetime. High intergroup interaction rates in ancestral humans may have promoted the evolution of cumulative culture. PMID:25047714
On the effect of model parameters on forecast objects
NASA Astrophysics Data System (ADS)
Marzban, Caren; Jones, Corinne; Li, Ning; Sandgathe, Scott
2018-04-01
Many physics-based numerical models produce a gridded, spatial field of forecasts, e.g., a temperature map
. The field for some quantities generally consists of spatially coherent and disconnected objects
. Such objects arise in many problems, including precipitation forecasts in atmospheric models, eddy currents in ocean models, and models of forest fires. Certain features of these objects (e.g., location, size, intensity, and shape) are generally of interest. Here, a methodology is developed for assessing the impact of model parameters on the features of forecast objects. The main ingredients of the methodology include the use of (1) Latin hypercube sampling for varying the values of the model parameters, (2) statistical clustering algorithms for identifying objects, (3) multivariate multiple regression for assessing the impact of multiple model parameters on the distribution (across the forecast domain) of object features, and (4) methods for reducing the number of hypothesis tests and controlling the resulting errors. The final output
of the methodology is a series of box plots and confidence intervals that visually display the sensitivities. The methodology is demonstrated on precipitation forecasts from a mesoscale numerical weather prediction model.
Evolution of accelerometer methods for physical activity research.
Troiano, Richard P; McClain, James J; Brychta, Robert J; Chen, Kong Y
2014-07-01
The technology and application of current accelerometer-based devices in physical activity (PA) research allow the capture and storage or transmission of large volumes of raw acceleration signal data. These rich data not only provide opportunities to improve PA characterisation, but also bring logistical and analytic challenges. We discuss how researchers and developers from multiple disciplines are responding to the analytic challenges and how advances in data storage, transmission and big data computing will minimise logistical challenges. These new approaches also bring the need for several paradigm shifts for PA researchers, including a shift from count-based approaches and regression calibrations for PA energy expenditure (PAEE) estimation to activity characterisation and EE estimation based on features extracted from raw acceleration signals. Furthermore, a collaborative approach towards analytic methods is proposed to facilitate PA research, which requires a shift away from multiple independent calibration studies. Finally, we make the case for a distinction between PA represented by accelerometer-based devices and PA assessed by self-report. Published by the BMJ Publishing Group Limited. For permission to use (where not already granted under a licence) please go to http://group.bmj.com/group/rights-licensing/permissions.
Nugent, Linda E; Wallston, Kenneth A
2016-12-01
Modified social learning theory (MSLT) applied to health predicts that health behavior is a multiplicative function of health value and perceptions of control over health. The self-management behaviors of persons with Type 2 diabetes mellitus, internal diabetes locus of control (IDLC), diabetes self-efficacy (DSE), and health value (HV) were assessed with an index of diabetes self-care activities in 107 patients receiving insulin. Multiple regression analysis showed DSE as the only MSLT construct that correlated with the index of diabetes self-care behaviors (β = .21, p < .05). While the predicted three-way interaction of IDLC × DSE × HV was significant (∆R 2 = 4.5 %, p < .05) in the final step of the hierarchical model, the pattern of the findings only partially supported MSLT. Instead of finding that patients who were simultaneously high on all three predictors scored highest on the behavioral index, we found that patients who were low on all three constructs reported the least amount of diabetes self-care behavior. Implications for further modification of MSLT and its applications to clinical practice are discussed.
Hunter-gatherer inter-band interaction rates: implications for cumulative culture.
Hill, Kim R; Wood, Brian M; Baggio, Jacopo; Hurtado, A Magdalena; Boyd, Robert T
2014-01-01
Our species exhibits spectacular success due to cumulative culture. While cognitive evolution of social learning mechanisms may be partially responsible for adaptive human culture, features of early human social structure may also play a role by increasing the number potential models from which to learn innovations. We present interview data on interactions between same-sex adult dyads of Ache and Hadza hunter-gatherers living in multiple distinct residential bands (20 Ache bands; 42 Hadza bands; 1201 dyads) throughout a tribal home range. Results show high probabilities (5%-29% per year) of cultural and cooperative interactions between randomly chosen adults. Multiple regression suggests that ritual relationships increase interaction rates more than kinship, and that affinal kin interact more often than dyads with no relationship. These may be important features of human sociality. Finally, yearly interaction rates along with survival data allow us to estimate expected lifetime partners for a variety of social activities, and compare those to chimpanzees. Hadza and Ache men are estimated to observe over 300 men making tools in a lifetime, whereas male chimpanzees interact with only about 20 other males in a lifetime. High intergroup interaction rates in ancestral humans may have promoted the evolution of cumulative culture.
Ali, Farrah; Khan, Rehan; Khan, Abdul Quaiyoom; Lateef, Md Abdul; Maqbool, Tahir; Sultana, Sarwat
2014-07-01
Cancer is the final outcome of a plethora of events. Targeting the proliferation or inducing programmed cell death in a proliferating population is a major standpoint in the cancer therapy. However, proliferation is regulated by several cellular and immunologic processes. This study reports the inhibition of proliferation by augmenting immune surveillance, silencing acute inflammation, and inducing p53-mediated apoptosis of skin cancer by 3 promising medicinal extracts. We used the well-characterized model for experimental skin carcinogenesis in mice for 32 weeks to study the chemopreventive effect of the methanolic extracts of Trigonella foenumgraecum, Eclipta alba, and Calendula officinalis. All 3 extracts reduced the number, incidence, and multiplicity of tumors, which was confirmed by the pathologic studies that showed regressed tumors. There was a significant reduction in the PCNA+ nuclei in all treatment groups 32 weeks after the initiation. Mechanistic studies revealed that proliferative population in tumors is diminished by the restoration of the endogenous antioxidant defense, inhibition of the stress-related signal-transducing element NFκB, reduction of inflammation, enhancement of immunosurveillance of the genetically mutated cells, along with silencing of the cell cycle progression signals. Finally, all 3 medicinal extracts induced stable expression of p53 within the tumors, confirmed by the CFDA-Cy3 apoptosis assay. Results of our study confirm that these extracts not only limit the rate of proliferation by inhibition of the processes integral to cancer development but also induce stable cytoplasmic expression of p53-mediated apoptosis, leading to fewer and regressed tumors in mice. © The Author(s) 2013.
Shan, Zhi; Deng, Guoying; Li, Jipeng; Li, Yangyang; Zhang, Yongxing; Zhao, Qinghua
2013-01-01
This study investigates the neck/shoulder pain (NSP) and low back pain (LBP) among current high school students in Shanghai and explores the relationship between these pains and their possible influences, including digital products, physical activity, and psychological status. An anonymous self-assessment was administered to 3,600 students across 30 high schools in Shanghai. This questionnaire examined the prevalence of NSP and LBP and the level of physical activity as well as the use of mobile phones, personal computers (PC) and tablet computers (Tablet). The CES-D (Center for Epidemiological Studies Depression) scale was also included in the survey. The survey data were analyzed using the chi-square test, univariate logistic analyses and a multivariate logistic regression model. Three thousand sixteen valid questionnaires were received including 1,460 (48.41%) from male respondents and 1,556 (51.59%) from female respondents. The high school students in this study showed NSP and LBP rates of 40.8% and 33.1%, respectively, and the prevalence of both influenced by the student's grade, use of digital products, and mental status; these factors affected the rates of NSP and LBP to varying degrees. The multivariate logistic regression analysis revealed that Gender, grade, soreness after exercise, PC using habits, tablet use, sitting time after school and academic stress entered the final model of NSP, while the final model of LBP consisted of gender, grade, soreness after exercise, PC using habits, mobile phone use, sitting time after school, academic stress and CES-D score. High school students in Shanghai showed high prevalence of NSP and LBP that were closely related to multiple factors. Appropriate interventions should be implemented to reduce the occurrences of NSP and LBP.
Predicting landslide vegetation in patches on landscape gradients in Puerto Rico
Myster, R.W.; Thomlinson, J.R.; Larsen, M.C.
1997-01-01
We explored the predictive value of common landscape characteristics for landslide vegetative stages in the Luquillo Experimental Forest of Puerto Rico using four different analyses. Maximum likelihood logistic regression showed that aspect, age, and substrate type could be used to predict vegetative structural stage. In addition it showed that the structural complexity of the vegetation was greater in landslides (1) facing the southeast (away from the dominant wind direction of recent hurricanes), (2) that were older, and (3) that had volcaniclastic rather than dioritic substrate. Multiple regression indicated that both elevation and age could be used to predict the current vegetation, and that vegetation complexity was greater both at lower elevation and in older landslides. Pearson product-moment correlation coefficients showed that (1) the presence of volcaniclastic substrate in landslides was negatively correlated with aspect, age, and elevation, (2) that road association and age were positively correlated, and (3) that slope was negatively correlated with area. Finally, principal components analysis showed that landslides were differentiated on axes defined primarily by age, aspect class, and elevation in the positive direction, and by volcaniclastic substrate in the negative direction. Because several statistical techniques indicated that age, aspect, elevation, and substrate were important in determining vegetation complexity on landslides, we conclude that landslide succession is influenced by variation in these landscape traits. In particular, we would expect to find more successional development on landslides which are older, face away from hurricane winds, are at lower elevation, and are on volcaniclastic substrate. Finally, our results lead into a hierarchical conceptual model of succession on landscapes where the biota respond first to either gradients or disturbance depending on their relative severity, and then to more local biotic mechanisms such as dispersal, predation and competition.
Psychological well-being in individuals with mild cognitive impairment.
Gates, Nicola; Valenzuela, Michael; Sachdev, Perminder S; Singh, Maria A Fiatarone
2014-01-01
Cognitive impairments associated with aging and dementia are major sources of burden, deterioration in life quality, and reduced psychological well-being (PWB). Preventative measures to both reduce incident disease and improve PWB in those afflicted are increasingly targeting individuals with mild cognitive impairment (MCI) at early disease stage. However, there is very limited information regarding the relationships between early cognitive changes and memory concern, and life quality and PWB in adults with MCI; furthermore, PWB outcomes are too commonly overlooked in intervention trials. The purpose of this study was therefore to empirically test a theoretical model of PWB in MCI in order to inform clinical intervention. Baseline data from a convenience sample of 100 community-dwelling adults diagnosed with MCI enrolled in the Study of Mental Activity and Regular Training (SMART) trial were collected. A series of regression analyses were performed to develop a reduced model, then hierarchical regression with the Baron Kenny test of mediation derived the final three-tiered model of PWB. Significant predictors of PWB were subjective memory concern, cognitive function, evaluations of quality of life, and negative affect, with a final model explaining 61% of the variance of PWB in MCI. Our empirical findings support a theoretical tiered model of PWB in MCI and contribute to an understanding of the way in which early subtle cognitive deficits impact upon PWB. Multiple targets and entry points for clinical intervention were identified. These include improving the cognitive difficulties associated with MCI. Additionally, these highlight the importance of reducing memory concern, addressing low mood, and suggest that improving a person's quality of life may attenuate the negative effects of depression and anxiety on PWB in this cohort.
Ensemble predictive model for more accurate soil organic carbon spectroscopic estimation
NASA Astrophysics Data System (ADS)
Vašát, Radim; Kodešová, Radka; Borůvka, Luboš
2017-07-01
A myriad of signal pre-processing strategies and multivariate calibration techniques has been explored in attempt to improve the spectroscopic prediction of soil organic carbon (SOC) over the last few decades. Therefore, to come up with a novel, more powerful, and accurate predictive approach to beat the rank becomes a challenging task. However, there may be a way, so that combine several individual predictions into a single final one (according to ensemble learning theory). As this approach performs best when combining in nature different predictive algorithms that are calibrated with structurally different predictor variables, we tested predictors of two different kinds: 1) reflectance values (or transforms) at each wavelength and 2) absorption feature parameters. Consequently we applied four different calibration techniques, two per each type of predictors: a) partial least squares regression and support vector machines for type 1, and b) multiple linear regression and random forest for type 2. The weights to be assigned to individual predictions within the ensemble model (constructed as a weighted average) were determined by an automated procedure that ensured the best solution among all possible was selected. The approach was tested at soil samples taken from surface horizon of four sites differing in the prevailing soil units. By employing the ensemble predictive model the prediction accuracy of SOC improved at all four sites. The coefficient of determination in cross-validation (R2cv) increased from 0.849, 0.611, 0.811 and 0.644 (the best individual predictions) to 0.864, 0.650, 0.824 and 0.698 for Site 1, 2, 3 and 4, respectively. Generally, the ensemble model affected the final prediction so that the maximal deviations of predicted vs. observed values of the individual predictions were reduced, and thus the correlation cloud became thinner as desired.
Tsujiuchi, Takuya; Yamaguchi, Maya; Masuda, Kazutaka; Tsuchida, Marisa; Inomata, Tadashi; Kumano, Hiroaki; Kikuchi, Yasushi; Augusterfer, Eugene F; Mollica, Richard F
2016-01-01
This study investigated post-traumatic stress symptoms in relation to the population affected by the Fukushima Nuclear Disaster, one year after the disaster. Additionally, we investigated social factors, such as forced displacement, which we hypothesize contributed to the high prevalence of post-traumatic stress. Finally, we report of written narratives that were collected from the impacted population. Using the Impact of Event Scale-Revised (IES-R), questionnaires were sent to 2,011 households of those displaced from Fukushima prefecture living temporarily in Saitama prefecture. Of the 490 replies; 350 met the criteria for inclusion in the study. Multiple logistic regression analysis was performed to examine several characteristics and variables of social factors as predictors of probable post-traumatic stress disorder, PTSD. The mean score of IES-R was 36.15±21.55, with 59.4% having scores of 30 or higher, thus indicating a probable PTSD. No significant differences in percentages of high-risk subjects were found among sex, age, evacuation area, housing damages, tsunami affected, family split-up, and acquaintance support. By the result of multiple logistic regression analysis, the significant predictors of probable PTSD were chronic physical diseases (OR = 1.97), chronic mental diseases (OR = 6.25), worries about livelihood (OR = 2.27), lost jobs (OR = 1.71), lost social ties (OR = 2.27), and concerns about compensation (OR = 3.74). Although there are limitations in assuming a diagnosis of PTSD based on self-report IES-R, our findings indicate that there was a high-risk of PTSD strongly related to the nuclear disaster and its consequent evacuation and displacement. Therefore, recovery efforts must focus not only on medical and psychological treatment alone, but also on social and economic issues related to the displacement, as well.
Zhong, Buqing; Liang, Tao; Wang, Lingqing; Li, Kexin
2014-08-15
An extensive soil survey was conducted to study pollution sources and delineate contamination of heavy metals in one of the metalliferous industrial bases, in the karst areas of southwest China. A total of 597 topsoil samples were collected and the concentrations of five heavy metals, namely Cd, As (metalloid), Pb, Hg and Cr were analyzed. Stochastic models including a conditional inference tree (CIT) and a finite mixture distribution model (FMDM) were applied to identify the sources and partition the contribution from natural and anthropogenic sources for heavy metal in topsoils of the study area. Regression trees for Cd, As, Pb and Hg were proved to depend mostly on indicators of anthropogenic activities such as industrial type and distance from urban area, while the regression tree for Cr was found to be mainly influenced by the geogenic characteristics. The FMDM analysis showed that the geometric means of modeled background values for Cd, As, Pb, Hg and Cr were close to their background values previously reported in the study area, while the contamination of Cd and Hg were widespread in the study area, imposing potentially detrimental effects on organisms through the food chain. Finally, the probabilities of single and multiple heavy metals exceeding the threshold values derived from the FMDM were estimated using indicator kriging (IK) and multivariate indicator kriging (MVIK). The high probabilities exceeding the thresholds of heavy metals were associated with metalliferous production and atmospheric deposition of heavy metals transported from the urban and industrial areas. Geostatistics coupled with stochastic models provide an effective way to delineate multiple heavy metal pollution to facilitate improved environmental management. Copyright © 2014 Elsevier B.V. All rights reserved.
Lueangpiansamut, Juthamas; Chatrchaiwiwatana, Supaporn; Muktabhant, Benja; Inthalohit, Warangkana
2012-08-01
To evaluate relationship between dental caries status, nutritional status, snack foods, and sugar-sweetened beverages consumption among primary schoolchildren grade 4-6 in Na Klang district, Nongbua Lampoo province, Thailand in 2011. The subjects included 111 children (57 boys and 54 girls), aged 11 and 12 years, who were studying in grades 4 to 6 in the year 2011. The data were collected through questionnaires, interview, and oral examination. Results were obtained by means of descriptive, bivariate, and multiple logistic regression analyses. Prevalence of dental caries in the children was 82.9% with the mean DMFT of 2.28. The dental caries prevalence in permanent and primary dentitions was 69.4% and 34.2%, respectively. About 10.2% of the children were underweight, 13.0% were obese, and 7.5% were stunting. Findings from the final multiple logistic regression models showed that weight-for-age malnutrition as well as eating sweets before bedtime were significantly related to dental caries in primary dentition, with the adjusted odds ratio (95% CI) being 6.68 (1.57, 28.41) and 5.34 (1.60, 17.77), respectively. Family income was significantly related to permanent dental caries with the odds ratio (95% CI) being 9.60 (1.89, 48.59). Nutritional status is associated with dental caries among these elementary schoolchildren. Larger studies extending to cover other elementary schools in Na Klang district should be conducted so that the results will be representative of all elementary schools in Na Klang district, Nongbua Lampoo province.
Salehpoor, Ghasem; Rezaei, Sajjad; Hosseininezhad, Mozaffar
2014-11-01
Although studies have demonstrated significant negative relationships between quality of life (QOL), fatigue, and the most common psychological symptoms (depression, anxiety, stress), the main ambiguity of previous studies on QOL is in the relative importance of these predictors. Also, there is lack of adequate knowledge about the actual contribution of each of them in the prediction of QOL dimensions. Thus, the main objective of this study is to assess the role of fatigue, depression, anxiety, and stress in relation to QOL of multiple sclerosis (MS) patients. One hundred and sixty-two MS patients completed the questionnaire on demographic variables, and then they were evaluated by the Persian versions of Short-Form Health Survey Questionnaire (SF-36), Fatigue Survey Scale (FSS), and Depression, Anxiety, Stress Scale-21 (DASS-21). Data were analyzed by Pearson correlation coefficient and hierarchical regression. Correlation analysis showed a significant relationship between QOL elements in SF-36 (physical component summary and mental component summary) and depression, fatigue, stress, and anxiety (P < 0.01). Hierarchical regression analysis indicated that among the predictor variables in the final step, fatigue, depression, and anxiety were identified as the physical component summary predictor variables. Anxiety was found to be the most powerful predictor variable amongst all (β = -0.46, P < 0.001). Furthermore, results have shown depression as the only significant mental component summary predictor variable (β = -0.39, P < 0.001). This study has highlighted the role of anxiety, fatigue, and depression in physical dimensions and the role of depression in psychological dimensions of the lives of MS patients. In addition, the findings of this study indirectly suggest that psychological interventions for reducing fatigue, depression, and anxiety can lead to improved QOL of MS patients.
Min, Jung-Ah; Lee, Chang-Uk; Chae, Jeong-Ho
2015-01-01
Few studies have investigated the role of protective factors for suicidal ideation, which include resilience and social support among psychiatric patients with depression and/or anxiety disorders who are at increased risk of suicide. Demographic data, history of childhood maltreatment, and levels of depression, anxiety, problematic alcohol use, resilience, perceived social support, and current suicidal ideation were collected from a total of 436 patients diagnosed with depression and/or anxiety disorders. Hierarchical multiple logistic regression analyses were used to identify the independent and interaction effects of potentially influencing factors. Moderate-severe suicidal ideation was reported in 24.5% of our sample. After controlling for relevant covariates, history of emotional neglect and sexual abuse, low resilience, and high depression and anxiety symptoms were sequentially included in the model. In the final model, high depression (adjusted odds ratio (OR)=9.33, confidence interval (CI) 3.99-21.77) and anxiety (adjusted OR=2.62, CI=1.24-5.53) were independently associated with moderate-severe suicidal ideation among risk factors whereas resilience was not. In the multiple logistic regression model that examined interaction effects between risk and protective factors, the interactions between resilience and depression (p<.001) and between resilience and anxiety were significant (p=.021). A higher level of resilience was protective against moderate-severe suicide ideation among those with higher levels of depression or anxiety symptoms. Our results indicate that resilience potentially moderates the risk of depression and anxiety symptoms on suicidal ideation in patients with depression and/or anxiety disorders. Assessment of resilience and intervention focused on resilience enhancement is suggested for suicide prevention. Copyright © 2014 Elsevier Inc. All rights reserved.
Factors affecting Korean nursing student empowerment in clinical practice.
Ahn, Yang-Heui; Choi, Jihea
2015-12-01
Understanding the phenomenon of nursing student empowerment in clinical practice is important. Investigating the cognition of empowerment and identifying predictors are necessary to enhance nursing student empowerment in clinical practice. To identify empowerment predictors for Korean nursing students in clinical practice based on studies by Bradbury-Jones et al. and Spreitzer. A cross-sectional design was used for this study. This study was performed in three nursing colleges in Korea, all of which had similar baccalaureate nursing curricula. Three hundred seven junior or senior nursing students completed a survey designed to measure factors that were hypothesized to influence nursing student empowerment in clinical practice. Data were collected from November to December 2011. Study variables included self-esteem, clinical decision making, being valued as a learner, satisfaction regarding practice with a team member, perception on professor/instructor/clinical preceptor attitude, and total number of clinical practice fields. Data were analyzed using stepwise multiple regression analyses. All of the hypothesized study variables were significantly correlated to nursing student empowerment. Stepwise multiple regression analysis revealed that clinical decision making in nursing (t=7.59, p<0.001), being valued as a learner (t=6.24, p<0.001), self-esteem (t=3.62, p<0.001), and total number of clinical practice fields (t=2.06, p=0.040). The explanatory power of these predictors was 35% (F=40.71, p<0.001). Enhancing nursing student empowerment in clinical practice will be possible by using educational strategies to improve nursing student clinical decision making. Simultaneously, attitudes of nurse educators are also important to ensure that nursing students are treated as valued learners and to increase student self-esteem in clinical practice. Finally, diverse clinical practice field environments should be considered to enhance experience. Copyright © 2015 Elsevier Ltd. All rights reserved.
Connizzo, Brianne K; Adams, Sheila M; Adams, Thomas H; Jawad, Abbas F; Birk, David E; Soslowsky, Louis J
2016-06-14
Recent advances in technology have allowed for the measurement of dynamic processes (re-alignment, crimp, deformation, sliding), but only a limited number of studies have investigated their relationship with mechanical properties. The overall objective of this study was to investigate the role of composition, structure, and the dynamic response to load in predicting tendon mechanical properties in a multi-level fashion mimicking native hierarchical collagen structure. Multiple linear regression models were investigated to determine the relationships between composition/structure, dynamic processes, and mechanical properties. Mediation was then used to determine if dynamic processes mediated structure-function relationships. Dynamic processes were strong predictors of mechanical properties. These predictions were location-dependent, with the insertion site utilizing all four dynamic responses and the midsubstance responding primarily with fibril deformation and sliding. In addition, dynamic processes were moderately predicted by composition and structure in a regionally-dependent manner. Finally, dynamic processes were partial mediators of the relationship between composition/structure and mechanical function, and results suggested that mediation is likely shared between multiple dynamic processes. In conclusion, the mechanical properties at the midsubstance of the tendon are controlled primarily by fibril structure and this region responds to load via fibril deformation and sliding. Conversely, the mechanical function at the insertion site is controlled by many other important parameters and the region responds to load via all four dynamic mechanisms. Overall, this study presents a strong foundation on which to design future experimental and modeling efforts in order to fully understand the complex structure-function relationships present in tendon. Copyright © 2016 Elsevier Ltd. All rights reserved.
Regression in autistic spectrum disorders.
Stefanatos, Gerry A
2008-12-01
A significant proportion of children diagnosed with Autistic Spectrum Disorder experience a developmental regression characterized by a loss of previously-acquired skills. This may involve a loss of speech or social responsitivity, but often entails both. This paper critically reviews the phenomena of regression in autistic spectrum disorders, highlighting the characteristics of regression, age of onset, temporal course, and long-term outcome. Important considerations for diagnosis are discussed and multiple etiological factors currently hypothesized to underlie the phenomenon are reviewed. It is argued that regressive autistic spectrum disorders can be conceptualized on a spectrum with other regressive disorders that may share common pathophysiological features. The implications of this viewpoint are discussed.
Statistical power analyses using G*Power 3.1: tests for correlation and regression analyses.
Faul, Franz; Erdfelder, Edgar; Buchner, Axel; Lang, Albert-Georg
2009-11-01
G*Power is a free power analysis program for a variety of statistical tests. We present extensions and improvements of the version introduced by Faul, Erdfelder, Lang, and Buchner (2007) in the domain of correlation and regression analyses. In the new version, we have added procedures to analyze the power of tests based on (1) single-sample tetrachoric correlations, (2) comparisons of dependent correlations, (3) bivariate linear regression, (4) multiple linear regression based on the random predictor model, (5) logistic regression, and (6) Poisson regression. We describe these new features and provide a brief introduction to their scope and handling.
Warnick, Elizabeth; Dearden, Kirk A; Slater, Sharon; Butrón, Betzabé; Lanata, Claudio F; Huffman, Sandra L
2004-01-01
To test the hypothesis that social marketing improves women's awareness and consumption of multivitamin and mineral supplements. Formative research and baseline and final surveys using a multistaged stratified cluster sample. Department of Santa Cruz, Bolivia. Women 15 to 49 years old (n=1709 at baseline and n=1735 at final survey). Social marketing campaign using radio and television spots. Awareness and use of multivitamins, including VitalDía, the brand promoted as part of this social marketing campaign. Cross-tabulations to assess changes over time in awareness and use of multivitamins. Logistic regression analyses to identify determinants of multivitamin use. The campaign increased women's awareness and use of multiple supplements, including VitalDía. Awareness of multiple supplements nearly doubled among women with 6 to 8 years of schooling, tripled among women with 4 to 5 years of education, and more than quadrupled among women with less than 4 years of schooling. After 9 months of social marketing, 11% of women had taken VitalDía one or more times, 7% had taken it at least once in the last 3 months, and 4% had used it one or more times in the last month. Improvements in the use of VitalDía were evident for women of all socioeconomic and educational levels, with the greatest increases occurring in the least advantaged groups. Additionally, women who had a positive perception of the benefits of multivitamins were 1.7 times (95% confidence interval 1.2-2.3; P <.01) more likely than women who did not have a positive perception to ever use VitalDía, once the effects of social class were adjusted. Social marketing of multiple supplements reached resource-poor women and can be used to bridge gaps in access, improve awareness of supplementation as an option, and increase the likelihood that women will try supplements.
Interpret with caution: multicollinearity in multiple regression of cognitive data.
Morrison, Catriona M
2003-08-01
Shibihara and Kondo in 2002 reported a reanalysis of the 1997 Kanji picture-naming data of Yamazaki, Ellis, Morrison, and Lambon-Ralph in which independent variables were highly correlated. Their addition of the variable visual familiarity altered the previously reported pattern of results, indicating that visual familiarity, but not age of acquisition, was important in predicting Kanji naming speed. The present paper argues that caution should be taken when drawing conclusions from multiple regression analyses in which the independent variables are so highly correlated, as such multicollinearity can lead to unreliable output.
STATLIB: NSWC Library of Statistical Programs and Subroutines
1989-08-01
Uncorrelated Weighted Polynomial Regression 41 .WEPORC Correlated Weighted Polynomial Regression 45 MROP Multiple Regression Using Orthogonal Polynomials ...could not and should not be con- NSWC TR 89-97 verted to the new general purpose computer (the current CDC 995). Some were designed tu compute...personal computers. They are referred to as SPSSPC+, BMDPC, and SASPC and in general are less comprehensive than their mainframe counterparts. The basic
Influence of an injury reduction program on injury and fitness outcomes among soldiers
Knapik, J; Bullock, S; Canada, S; Toney, E; Wells, J; Hoedebecke, E; Jones, B
2004-01-01
Objective: This study evaluated the influence of a multiple injury control intervention on injury and physical fitness outcomes among soldiers attending United States Army Ordnance School Advanced Individual Training. Methods: The study design was quasiexperimental involving a historical control group (n = 2559) that was compared to a multiple intervention group (n = 1283). Interventions in the multiple intervention group included modified physical training, injury education, and a unit based injury surveillance system (UBISS). The management responsible for training independently formed an Injury Control Advisory Committee that examined surveillance reports from the UBISS and recommended changes to training. On arrival at school, individual soldiers completed a demographics and lifestyle questionnaire and took an army physical fitness test (APFT: push-ups, sit-ups, and two mile run). Injuries among soldiers were tracked by a clinic based injury surveillance system that was separate from the UBISS. Soldiers completed a final APFT eight weeks after arrival at school. Results: Cox regression (survival analysis) was used to examine differences in time to the first injury while controlling for group differences in demographics, lifestyle characteristics, and physical fitness. The adjusted relative risk of a time loss injury was 1.5 (95% confidence interval 1.2 to 1.8) times higher in the historical control men and 1.8 (95% confidence interval 1.1 to 2.8) times higher in the historical control women compared with the multiple intervention men and women, respectively. After correcting for the lower initial fitness of the multiple intervention group, there were no significant differences between the multiple intervention and historical control groups in terms of improvements in push-ups, sit-ups, or two mile run performance. Conclusions: This multiple intervention program contributed to a reduction in injuries while improvements in physical fitness were similar to a traditional physical training program previously used at the school. PMID:14760025
Lobier, Muriel A.; Peyrin, Carole; Pichat, Cédric; Le Bas, Jean-François; Valdois, Sylviane
2014-01-01
The visual attention (VA) span deficit hypothesis of developmental dyslexia posits that impaired multiple element processing can be responsible for poor reading outcomes. In VA span impaired dyslexic children, poor performance on letter report tasks is associated with reduced parietal activations for multiple letter processing. While this hints towards a non-specific, attention-based dysfunction, it is still unclear whether reduced parietal activity generalizes to other types of stimuli. Furthermore, putative links between reduced parietal activity and reduced ventral occipito-temporal (vOT) in dyslexia have yet to be explored. Using functional magnetic resonance imaging, we measured brain activity in 12 VA span impaired dyslexic adults and 12 adult skilled readers while they carried out a categorization task on single or multiple alphanumeric or non-alphanumeric characters. While healthy readers activated parietal areas more strongly for multiple than single element processing (right-sided for alphanumeric and bilateral for non-alphanumeric), similar stronger multiple element right parietal activations were absent for dyslexic participants. Contrasts between skilled and dyslexic readers revealed significantly reduced right superior parietal lobule (SPL) activity for dyslexic readers regardless of stimuli type. Using a priori anatomically defined regions of interest, we showed that neural activity was reduced for dyslexic participants in both SPL and vOT bilaterally. Finally, we used multiple regressions to test whether SPL activity was related to vOT activity in each group. In the left hemisphere, SPL activity covaried with vOT activity for both normal and dyslexic readers. In contrast, in the right hemisphere, SPL activity covaried with vOT activity only for dyslexic readers. These results bring critical support to the VA interpretation of the VA Span deficit. In addition, they offer a new insight on how deficits in automatic vOT based word recognition could arise in developmental dyslexia. PMID:25071509
Sandquist, Mary K; Clee, Mark S; Patel, Smruti K; Howard, Kelli A; Yunger, Toni; Nagaraj, Usha D; Jones, Blaise V; Fei, Lin; Vadivelu, Sudhakar; Wong, Hector R
2017-07-01
This study was intended to describe and correlate the neuroimaging findings in pediatric patients after sepsis. Retrospective chart review. Single tertiary care PICU. Patients admitted to Cincinnati Children's Hospital Medical Center with a discharge diagnosis of sepsis or septic shock between 2004 and 2013 were crossmatched with patients who underwent neuroimaging during the same time period. All neuroimaging studies that occurred during or subsequent to a septic event were reviewed, and all new imaging findings were recorded and classified. As many patients experienced multiple septic events and/or had multiple neuroimaging studies after sepsis, our statistical analysis utilized the most recent or "final" imaging study available for each patient so that only brain imaging findings that persisted were included. A total of 389 children with sepsis and 1,705 concurrent or subsequent neuroimaging studies were included in the study. Median age at first septic event was 3.4 years (interquartile range, 0.7-11.5). Median time from first sepsis event to final neuroimaging was 157 days (interquartile range, 10-1,054). The most common indications for final imaging were follow-up (21%), altered mental status (18%), and fever/concern for infection (15%). Sixty-three percentage (n = 243) of final imaging studies demonstrated abnormal findings, the most common of which were volume loss (39%) and MRI signal and/or CT attenuation abnormalities (21%). On multivariable logistic regression, highest Pediatric Risk of Mortality score and presence of oncologic diagnosis/organ transplantation were independently associated with any abnormal final neuroimaging study findings (odds ratio, 1.032; p = 0.048 and odds ratio, 1.632; p = 0.041), although early timing of neuroimaging demonstrated a negative association (odds ratio, 0.606; p = 0.039). The most common abnormal finding of volume loss was independently associated with highest Pediatric Risk of Mortality score (odds ratio, 1.037; p = 0.016) and oncologic diagnosis/organ transplantation (odds ratio, 2.207; p = 0.001) and was negatively associated with early timing of neuroimaging (odds ratio, 0.575; p = 0.037). The majority of pediatric patients with sepsis and concurrent or subsequent neuroimaging have abnormal neuroimaging findings. The implications of this high incidence for long-term neurologic outcomes and follow-up require further exploration.
Seaman, Shaun R; Hughes, Rachael A
2018-06-01
Estimating the parameters of a regression model of interest is complicated by missing data on the variables in that model. Multiple imputation is commonly used to handle these missing data. Joint model multiple imputation and full-conditional specification multiple imputation are known to yield imputed data with the same asymptotic distribution when the conditional models of full-conditional specification are compatible with that joint model. We show that this asymptotic equivalence of imputation distributions does not imply that joint model multiple imputation and full-conditional specification multiple imputation will also yield asymptotically equally efficient inference about the parameters of the model of interest, nor that they will be equally robust to misspecification of the joint model. When the conditional models used by full-conditional specification multiple imputation are linear, logistic and multinomial regressions, these are compatible with a restricted general location joint model. We show that multiple imputation using the restricted general location joint model can be substantially more asymptotically efficient than full-conditional specification multiple imputation, but this typically requires very strong associations between variables. When associations are weaker, the efficiency gain is small. Moreover, full-conditional specification multiple imputation is shown to be potentially much more robust than joint model multiple imputation using the restricted general location model to mispecification of that model when there is substantial missingness in the outcome variable.
The positive and negative consequences of multiple-choice testing.
Roediger, Henry L; Marsh, Elizabeth J
2005-09-01
Multiple-choice tests are commonly used in educational settings but with unknown effects on students' knowledge. The authors examined the consequences of taking a multiple-choice test on a later general knowledge test in which students were warned not to guess. A large positive testing effect was obtained: Prior testing of facts aided final cued-recall performance. However, prior testing also had negative consequences. Prior reading of a greater number of multiple-choice lures decreased the positive testing effect and increased production of multiple-choice lures as incorrect answers on the final test. Multiple-choice testing may inadvertently lead to the creation of false knowledge.
Mohd Yusof, Mohd Yusmiaidil Putera; Cauwels, Rita; Deschepper, Ellen; Martens, Luc
2015-08-01
The third molar development (TMD) has been widely utilized as one of the radiographic method for dental age estimation. By using the same radiograph of the same individual, third molar eruption (TME) information can be incorporated to the TMD regression model. This study aims to evaluate the performance of dental age estimation in individual method models and the combined model (TMD and TME) based on the classic regressions of multiple linear and principal component analysis. A sample of 705 digital panoramic radiographs of Malay sub-adults aged between 14.1 and 23.8 years was collected. The techniques described by Gleiser and Hunt (modified by Kohler) and Olze were employed to stage the TMD and TME, respectively. The data was divided to develop three respective models based on the two regressions of multiple linear and principal component analysis. The trained models were then validated on the test sample and the accuracy of age prediction was compared between each model. The coefficient of determination (R²) and root mean square error (RMSE) were calculated. In both genders, adjusted R² yielded an increment in the linear regressions of combined model as compared to the individual models. The overall decrease in RMSE was detected in combined model as compared to TMD (0.03-0.06) and TME (0.2-0.8). In principal component regression, low value of adjusted R(2) and high RMSE except in male were exhibited in combined model. Dental age estimation is better predicted using combined model in multiple linear regression models. Copyright © 2015 Elsevier Ltd and Faculty of Forensic and Legal Medicine. All rights reserved.
On the method of Ermakov and Zolotukhin for multiple integration
NASA Technical Reports Server (NTRS)
Cranley, R.; Patterson, T. N. L.
1971-01-01
By introducing the idea of pseudo-implementation, a practical assessment of the method for multiple integration is made. The performance of the method is found to be unimpressive in comparison with a recent regression method.
Criteria for the use of regression analysis for remote sensing of sediment and pollutants
NASA Technical Reports Server (NTRS)
Whitlock, C. H.; Kuo, C. Y.; Lecroy, S. R.
1982-01-01
An examination of limitations, requirements, and precision of the linear multiple-regression technique for quantification of marine environmental parameters is conducted. Both environmental and optical physics conditions have been defined for which an exact solution to the signal response equations is of the same form as the multiple regression equation. Various statistical parameters are examined to define a criteria for selection of an unbiased fit when upwelled radiance values contain error and are correlated with each other. Field experimental data are examined to define data smoothing requirements in order to satisfy the criteria of Daniel and Wood (1971). Recommendations are made concerning improved selection of ground-truth locations to maximize variance and to minimize physical errors associated with the remote sensing experiment.
Introduction to the use of regression models in epidemiology.
Bender, Ralf
2009-01-01
Regression modeling is one of the most important statistical techniques used in analytical epidemiology. By means of regression models the effect of one or several explanatory variables (e.g., exposures, subject characteristics, risk factors) on a response variable such as mortality or cancer can be investigated. From multiple regression models, adjusted effect estimates can be obtained that take the effect of potential confounders into account. Regression methods can be applied in all epidemiologic study designs so that they represent a universal tool for data analysis in epidemiology. Different kinds of regression models have been developed in dependence on the measurement scale of the response variable and the study design. The most important methods are linear regression for continuous outcomes, logistic regression for binary outcomes, Cox regression for time-to-event data, and Poisson regression for frequencies and rates. This chapter provides a nontechnical introduction to these regression models with illustrating examples from cancer research.
Quijada-Morín, Natalia; Williams, Pascale; Rivas-Gonzalo, Julián C; Doco, Thierry; Escribano-Bailón, M Teresa
2014-07-01
The influence of the proanthocyanidic, polysaccharide and oligosaccharide composition on astringency perception of Tempranillo wines has been evaluated. Statistical analyses revealed the existence of relationships between chemical composition and perceived astringency. Proanthocyanidic subunit distribution had the strongest contribution to the multiple linear regression (MLR) model. Polysaccharide families showed clear opposition to astringency perception according to principal component analysis (PCA) results, being stronger for mannoproteins and rhamnogalacturonan-II (RG-II), but only Polysaccharides Rich in Arabinose and Galactose (PRAGs) were considered in the final fitted MLR model, which explained 96.8% of the variability observed in the data. Oligosaccharides did not show a clear opposition, revealing that structure and size of carbohydrates are important for astringency perception. Mannose and galactose residues in the oligosaccharide fraction are positively related to astringency perception, probably because its presence is consequence of the degradation of polysaccharides. Copyright © 2014 Elsevier Ltd. All rights reserved.
Martinez-Fiestas, Myriam; Rodríguez-Garzón, Ignacio; Delgado-Padial, Antonio; Lucas-Ruiz, Valeriano
2017-09-01
This article presents a cross-cultural study on perceived risk in the construction industry. Worker samples from three different countries were studied: Spain, Peru and Nicaragua. The main goal was to explain how construction workers perceive their occupational hazard and to analyze how this is related to their national culture. The model used to measure perceived risk was the psychometric paradigm. The results show three very similar profiles, indicating that risk perception is independent of nationality. A cultural analysis was conducted using the Hofstede model. The results of this analysis and the relation to perceived risk showed that risk perception in construction is independent of national culture. Finally, a multiple lineal regression analysis was conducted to determine what qualitative attributes could predict the global quantitative size of risk perception. All of the findings have important implications regarding the management of safety in the workplace.
[Predictors of employment intention for mentally disabled persons].
Han, Sang-Sook; Han, Jeong Hye; Yun, Eun Kyoung
2008-08-01
This study was conducted to determine the predictors of employment intention for mentally disabled persons. Mentally disabled persons who had participated in rehabilitation programs in one of 16 mental health centers and 9 community rehabilitation centers located in Seoul and Kyunggi province were recruited for this study. A random sampling method was used and 414 respondents were used for final analysis. Data was analyzed by Pearson's correlation, and stepwise multiple regression using the SPSS Win 14.0. The predictors influencing employment intention of the mentally disabled person were observed as employment desire (beta=.48), guardian's expectation (beta=.26), professional's support (beta=.23), financial management (beta=.10), eating habits (beta=.07), and quality of life (beta=-.01). Six factors explained 61.1% of employment intention of mentally disabled persons. The employment intention of a mentally disabled person was influenced by employment desire, diet self-efficacy, guardian's expectation, professional's support, quality of life, financial management and eating habits.
Megalopoulos, Fivos A; Ochsenkuehn-Petropoulou, Maria T
2015-01-01
A statistical model based on multiple linear regression is developed, to estimate the bromine residual that can be expected after the bromination of cooling water. Make-up water sampled from a power plant in the Greek territory was used for the creation of the various cooling water matrices under investigation. The amount of bromine fed to the circuit, as well as other important operational parameters such as concentration at the cooling tower, temperature, organic load and contact time are taken as the independent variables. It is found that the highest contribution to the model's predictive ability comes from cooling water's organic load concentration, followed by the amount of bromine fed to the circuit, the water's mean temperature, the duration of the bromination period and finally its conductivity. Comparison of the model results with the experimental data confirms its ability to predict residual bromine given specific bromination conditions.
Bayesian Group Bridge for Bi-level Variable Selection.
Mallick, Himel; Yi, Nengjun
2017-06-01
A Bayesian bi-level variable selection method (BAGB: Bayesian Analysis of Group Bridge) is developed for regularized regression and classification. This new development is motivated by grouped data, where generic variables can be divided into multiple groups, with variables in the same group being mechanistically related or statistically correlated. As an alternative to frequentist group variable selection methods, BAGB incorporates structural information among predictors through a group-wise shrinkage prior. Posterior computation proceeds via an efficient MCMC algorithm. In addition to the usual ease-of-interpretation of hierarchical linear models, the Bayesian formulation produces valid standard errors, a feature that is notably absent in the frequentist framework. Empirical evidence of the attractiveness of the method is illustrated by extensive Monte Carlo simulations and real data analysis. Finally, several extensions of this new approach are presented, providing a unified framework for bi-level variable selection in general models with flexible penalties.
Long-term treatment of an addictive personality.
Seymour, Peter M
2003-01-01
There is infrequent discussion of long-term psychotherapy of persons with addiction, particularly in the self-psychology literature. In addition, some question whether long-term psychotherapy can be helpful in severe psychiatric disorders. The author describes the treatment of a woman with multiple diagnoses, including bulimia and alcohol and drug addiction, which took place over a period of almost 7 years. These issues are addressed from a self-psychological perspective, with progression of the treatment from early facilitation of a selfobject transference to more intense selfobject transference-countertransference states. Behavioral interventions (e.g., recommendation of inpatient chemical dependency treatment) are also discussed. The author describes the patient's dramatic progress and subsequent regression. Finally, there is a discussion of the addiction from self-psychological and biological perspectives of this woman's particular developmental and treatment issues, as well as a discussion of the confrontation and limit setting in a self-psychologically oriented treatment.
Functional genomic Landscape of Human Breast Cancer drivers, vulnerabilities, and resistance
Marcotte, Richard; Sayad, Azin; Brown, Kevin R.; Sanchez-Garcia, Felix; Reimand, Jüri; Haider, Maliha; Virtanen, Carl; Bradner, James E.; Bader, Gary D.; Mills, Gordon B.; Pe’er, Dana; Moffat, Jason; Neel, Benjamin G.
2016-01-01
Summary Large-scale genomic studies have identified multiple somatic aberrations in breast cancer, including copy number alterations, and point mutations. Still, identifying causal variants and emergent vulnerabilities that arise as a consequence of genetic alterations remain major challenges. We performed whole genome shRNA “dropout screens” on 77 breast cancer cell lines. Using a hierarchical linear regression algorithm to score our screen results and integrate them with accompanying detailed genetic and proteomic information, we identify vulnerabilities in breast cancer, including candidate “drivers,” and reveal general functional genomic properties of cancer cells. Comparisons of gene essentiality with drug sensitivity data suggest potential resistance mechanisms, effects of existing anti-cancer drugs, and opportunities for combination therapy. Finally, we demonstrate the utility of this large dataset by identifying BRD4 as a potential target in luminal breast cancer, and PIK3CA mutations as a resistance determinant for BET-inhibitors. PMID:26771497
Patounakis, George; Hill, Micah J
2018-06-01
The purpose of the current review is to describe the common pitfalls in design and statistical analysis of reproductive medicine studies. It serves to guide both authors and reviewers toward reducing the incidence of spurious statistical results and erroneous conclusions. The large amount of data gathered in IVF cycles leads to problems with multiplicity, multicollinearity, and over fitting of regression models. Furthermore, the use of the word 'trend' to describe nonsignificant results has increased in recent years. Finally, methods to accurately account for female age in infertility research models are becoming more common and necessary. The pitfalls of study design and analysis reviewed provide a framework for authors and reviewers to approach clinical research in the field of reproductive medicine. By providing a more rigorous approach to study design and analysis, the literature in reproductive medicine will have more reliable conclusions that can stand the test of time.
Core OCD Symptoms: Exploration of Specificity and Relations with Psychopathology
Stasik, Sara M.; Naragon-Gainey, Kristin; Chmielewski, Michael; Watson, David
2012-01-01
Obsessive-compulsive disorder (OCD) is a heterogeneous condition, comprised of multiple symptom domains. This study used aggregate composite scales representing three core OCD dimensions (Checking, Cleaning, Rituals), as well as Hoarding, to examine the discriminant validity, diagnostic specificity, and predictive ability of OCD symptom scales. The core OCD scales demonstrated strong patterns of convergent and discriminant validity – suggesting that these dimensions are distinct from other self-reported symptoms – whereas hoarding symptoms correlated just as strongly with OCD and non-OCD symptoms in most analyses. Across analyses, our results indicated that Checking is a particularly strong, specific marker of OCD diagnosis, whereas the specificity of Cleaning and Hoarding to OCD was less strong. Finally, the OCD Checking scale was the only significant predictor of OCD diagnosis in logistic regression analyses. Results are discussed with regard to the importance of assessing OCD symptom dimensions separately and implications for classification. PMID:23026094
Iakova, Maria; Ballabeni, Pierluigi; Erhart, Peter; Seichert, Nikola; Luthi, François; Dériaz, Olivier
2012-12-01
This study aimed to identify self-perception variables which may predict return to work (RTW) in orthopedic trauma patients 2 years after rehabilitation. A prospective cohort investigated 1,207 orthopedic trauma inpatients, hospitalised in rehabilitation, clinics at admission, discharge, and 2 years after discharge. Information on potential predictors was obtained from self administered questionnaires. Multiple logistic regression models were applied. In the final model, a higher likelihood of RTW was predicted by: better general health and lower pain at admission; health and pain improvements during hospitalisation; lower impact of event (IES-R) avoidance behaviour score; higher IES-R hyperarousal score, higher SF-36 mental score and low perceived severity of the injury. RTW is not only predicted by perceived health, pain and severity of the accident at the beginning of a rehabilitation program, but also by the changes in pain and health perceptions observed during hospitalisation.
Eruption of the permanent maxillary canines in relation to mandibular second molar maturity.
Perinetti, Giuseppe; Callovi, Marilena; Salgarello, Stefano; Biasotto, Matteo; Contardo, Luca
2013-07-01
To evaluate the timing of spontaneous maxillary canine eruption in relation to stages of mandibular second molar maturation. Potential confounding effects from such factors as age, growth phase, and facial features were also explored. A sample of 106 healthy subjects (48 females and 58 males; age range, 9.4-14.3 years) with both permanent maxillary canines during the final phase of intraoral eruption were included. Mandibular second molar maturation (stages E to H) was assessed according to the method of Demirjian. Skeletal maturity was determined using the cervical vertebral maturational (CVM) method. Facial vertical and sagittal relationships were evaluated by recording the Sella-Nasion/mandibular plane (SN/MP) angle and the ANB angle. An ordered multiple logistic regression was run to evaluate adjusted correlation of each parameter with the mandibular second molar maturational stage. Overall, the prevalence of the different second molar maturational stages was 36.8%, 37.8%, and 27.4% for stages E, F and G, respectively. According to the regression model, this relation was not influenced by sex, CVM stage, SN/MP angle, and ANB angle. Irrespective of sex, growth phase, and facial features, the maturational stage of the mandibular second molar may be a reliable indicator for the timing of spontaneous eruption of the maxillary canine.
[Factors affecting the DAPI fluorescence direct count in the tidal river sediment].
Chen, Chen; Huang, Shan; Wu, Qun-he; Li, Rui-yi; Zhang, Ren-duo
2010-08-01
The factors affecting the DAPI (4', 6-diamidino-2-phenylidole) fluorescence direct count in the tidal river sediment were examined. Sediment samples were collected from the Guangzhou section of the Pearl River. Besides sediment texture and organic matter, an improved staining procedure and the involved parameters were analyzed. Results showed that the procedure with the sediment with 2000 fold dilution and ultrasonic water bath for 10 min, and with a final DAPI concentration of 10 microg x mL(-1) and staining time for more than 30 min produced the optimum results of DAPI direct count in the sediment. The total bacterial number was correlated to the proportion of the non-nucleoid-containing cells to the total bacterial number (r = 0.587, p = 0.004). The organic matter content also correlated to the ration. The clay content had a strong correlation with the organic matter, through which the clay content also affected the ratio. A multiple regression analysis between the ration versus the organic matter, the total bacterial number, and the clay content showed that the regression equation fit the measure values satisfactorily (r = 0.694). These results indicated that the above factors needed to be considered in the applications of the DAPI fluorescence direct counting method to the tidal river sediment.
2013-01-01
Objectives. Global perceptions of stress (GPS) have major implications for mental and physical health, and stress in midlife may influence adaptation in later life. Thus, it is important to determine the unique and interactive effects of diverse influences of role stress (at work or in personal relationships), loneliness, life events, time pressure, caregiving, finances, discrimination, and neighborhood circumstances on these GPS. Method. Exploratory regression trees and random forests were used to examine complex interactions among myriad events and chronic stressors in middle-aged participants’ (N = 410; mean age = 52.12) GPS. Results. Different role and domain stressors were influential at high and low levels of loneliness. Varied combinations of these stressors resulting in similar levels of perceived stress are also outlined as examples of equifinality. Loneliness emerged as an important predictor across trees. Discussion. Exploring multiple stressors simultaneously provides insights into the diversity of stressor combinations across individuals—even those with similar levels of global perceived stress—and answers theoretical mandates to better understand the influence of stress by sampling from many domain and role stressors. Further, the unique influences of each predictor relative to the others inform theory and applied work. Finally, examples of equifinality and multifinality call for targeted interventions. PMID:23341437
Short communication: Effect of heat stress on nonreturn rate of Italian Holstein cows.
Biffani, S; Bernabucci, U; Vitali, A; Lacetera, N; Nardone, A
2016-07-01
The data set consisted of 1,016,856 inseminations of 191,012 first, second, and third parity Holstein cows from 484 farms. Data were collected from year 2001 through 2007 and included meteorological data from 35 weather stations. Nonreturn rate at 56 d after first insemination (NR56) was considered. A logit model was used to estimate the effect of temperature-humidity index (THI) on reproduction across parities. Then, least squares means were used to detect the THI breakpoints using a 2-phase linear regression procedure. Finally, a multiple-trait threshold model was used to estimate variance components for NR56 in first and second parity cows. A dummy regression variable (t) was used to estimate NR56 decline due to heat stress. The NR56, both for first and second parity cows, was significantly (unfavorable) affected by THI from 4 d before 5 d after the insemination date. Additive genetic variances for NR56 increased from first to second parity both for general and heat stress effect. Genetic correlations between general and heat stress effects were -0.31 for first parity and -0.45 for second parity cows. Copyright © 2016 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.
Gao, Jinghong; Chen, Xiaojun; Woodward, Alistair; Liu, Xiaobo; Wu, Haixia; Lu, Yaogui; Li, Liping; Liu, Qiyong
2016-01-01
Few studies examined the associations of meteorological factors with road traffic injuries (RTIs). The purpose of the present study was to quantify the contributions of meteorological factors to RTI cases treated at a tertiary level hospital in Shantou city, China. A time-series diagram was employed to illustrate the time trends and seasonal variation of RTIs, and correlation analysis and multiple linear regression analysis were conducted to investigate the relationships between meteorological parameters and RTIs. RTIs followed a seasonal pattern as more cases occurred during summer and winter months. RTIs are positively correlated with temperature and sunshine duration, while negatively associated with wind speed. Temperature, sunshine hour and wind speed were included in the final linear model with regression coefficients of 0.65 (t = 2.36, P = 0.019), 2.23 (t = 2.72, P = 0.007) and −27.66 (t = −5.67, P < 0.001), respectively, accounting for 19.93% of the total variation of RTI cases. The findings can help us better understand the associations between meteorological factors and RTIs, and with potential contributions to the development and implementation of regional level evidence-based weather-responsive traffic management system in the future. PMID:27853316
Assessment of sleep quality and correlates in a large cohort of Colombian women around menopause.
Monterrosa-Castro, Alvaro; Marrugo-Flórez, Martha; Romero-Pérez, Ivette; Fernández-Alonso, Ana M; Chedraui, Peter; Pérez-López, Faustino R
2013-04-01
The aim of this study was to determine the relationship between self-reported sleep quality, menopausal symptom intensity, and correlates (including ethnicity) among middle-aged women. The present cross-sectional study involved 1,078 Colombian women aged 40 to 59 years who completed the Pittsburgh Sleep Quality Index (PSQI), the Menopause Rating Scale (MRS), and a general questionnaire exploring sociodemographic data. The median [interquartile range] age of the whole sample was 49.0 [9.0] years. Among the participants, 45.4% were postmenopausal, 57.2% had increased body mass index values, 13.9% were black, 20.7% had hypertension, 74.1% had a stable partner, and 3.8% used hormone therapy. The prevalence of poor sleep quality was 57.1% (PSQI global score ≥5). Significant correlations between PSQI global scores and MRS total and subscale scores were found. Multiple linear regression analysis found that higher PSQI scores (poorer quality of sleep) correlated with higher MRS psychological and somatic subscale scores (more severe symptoms), smoking habit, and hypertension. Menopause status and black ethnicity were excluded from the final regression model. Despite study limitations, poor sleep quality is highly prevalent in this large middle-aged Colombian female sample and is related to menopausal symptom severity, tobacco use, and presence of hypertension.
Cunningham, Jennifer; Wallston, Kenneth A; Wilkins, Consuelo H; Hull, Pamela C; Miller, Stephania T
2015-12-01
This study describes the development and psychometric evaluation of HPV Clinical Trial Survey for Parents with Children Aged 9 to 15 (CTSP-HPV) using traditional instrument development methods and community engagement principles. An expert panel and parental input informed survey content and parents recommended study design changes (e.g., flyer wording). A convenience sample of 256 parents completed the final survey measuring parental willingness to consent to HPV clinical trial (CT) participation and other factors hypothesized to influence willingness (e.g., HPV vaccine benefits). Cronbach's a, Spearman correlations, and multiple linear regression were used to estimate internal consistency, convergent and discriminant validity, and predictively validity, respectively. Internal reliability was confirmed for all scales (a ≥ 0.70.). Parental willingness was positively associated (p < 0.05) with trust in medical researchers, adolescent CT knowledge, HPV vaccine benefits, advantages of adolescent CTs (r range 0.33-0.42), supporting convergent validity. Moderate discriminant construct validity was also demonstrated. Regression results indicate reasonable predictive validity with the six scales accounting for 31% of the variance in parents' willingness. This instrument can inform interventions based on factors that influence parental willingness, which may lead to the eventual increase in trial participation. Further psychometric testing is warranted. © 2015 Wiley Periodicals, Inc.
Crawford, John R; Garthwaite, Paul H; Denham, Annie K; Chelune, Gordon J
2012-12-01
Regression equations have many useful roles in psychological assessment. Moreover, there is a large reservoir of published data that could be used to build regression equations; these equations could then be employed to test a wide variety of hypotheses concerning the functioning of individual cases. This resource is currently underused because (a) not all psychologists are aware that regression equations can be built not only from raw data but also using only basic summary data for a sample, and (b) the computations involved are tedious and prone to error. In an attempt to overcome these barriers, Crawford and Garthwaite (2007) provided methods to build and apply simple linear regression models using summary statistics as data. In the present study, we extend this work to set out the steps required to build multiple regression models from sample summary statistics and the further steps required to compute the associated statistics for drawing inferences concerning an individual case. We also develop, describe, and make available a computer program that implements these methods. Although there are caveats associated with the use of the methods, these need to be balanced against pragmatic considerations and against the alternative of either entirely ignoring a pertinent data set or using it informally to provide a clinical "guesstimate." Upgraded versions of earlier programs for regression in the single case are also provided; these add the point and interval estimates of effect size developed in the present article.
Multiple linear regression models are often used to predict levels of fecal indicator bacteria (FIB) in recreational swimming waters based on independent variables (IVs) such as meteorologic, hydrodynamic, and water-quality measures. The IVs used for these analyses are traditiona...
Campos-Filho, N; Franco, E L
1989-02-01
A frequent procedure in matched case-control studies is to report results from the multivariate unmatched analyses if they do not differ substantially from the ones obtained after conditioning on the matching variables. Although conceptually simple, this rule requires that an extensive series of logistic regression models be evaluated by both the conditional and unconditional maximum likelihood methods. Most computer programs for logistic regression employ only one maximum likelihood method, which requires that the analyses be performed in separate steps. This paper describes a Pascal microcomputer (IBM PC) program that performs multiple logistic regression by both maximum likelihood estimation methods, which obviates the need for switching between programs to obtain relative risk estimates from both matched and unmatched analyses. The program calculates most standard statistics and allows factoring of categorical or continuous variables by two distinct methods of contrast. A built-in, descriptive statistics option allows the user to inspect the distribution of cases and controls across categories of any given variable.
Ng, Kar Yong; Awang, Norhashidah
2018-01-06
Frequent haze occurrences in Malaysia have made the management of PM 10 (particulate matter with aerodynamic less than 10 μm) pollution a critical task. This requires knowledge on factors associating with PM 10 variation and good forecast of PM 10 concentrations. Hence, this paper demonstrates the prediction of 1-day-ahead daily average PM 10 concentrations based on predictor variables including meteorological parameters and gaseous pollutants. Three different models were built. They were multiple linear regression (MLR) model with lagged predictor variables (MLR1), MLR model with lagged predictor variables and PM 10 concentrations (MLR2) and regression with time series error (RTSE) model. The findings revealed that humidity, temperature, wind speed, wind direction, carbon monoxide and ozone were the main factors explaining the PM 10 variation in Peninsular Malaysia. Comparison among the three models showed that MLR2 model was on a same level with RTSE model in terms of forecasting accuracy, while MLR1 model was the worst.
Carvalho, Carlos; Gomes, Danielo G.; Agoulmine, Nazim; de Souza, José Neuman
2011-01-01
This paper proposes a method based on multivariate spatial and temporal correlation to improve prediction accuracy in data reduction for Wireless Sensor Networks (WSN). Prediction of data not sent to the sink node is a technique used to save energy in WSNs by reducing the amount of data traffic. However, it may not be very accurate. Simulations were made involving simple linear regression and multiple linear regression functions to assess the performance of the proposed method. The results show a higher correlation between gathered inputs when compared to time, which is an independent variable widely used for prediction and forecasting. Prediction accuracy is lower when simple linear regression is used, whereas multiple linear regression is the most accurate one. In addition to that, our proposal outperforms some current solutions by about 50% in humidity prediction and 21% in light prediction. To the best of our knowledge, we believe that we are probably the first to address prediction based on multivariate correlation for WSN data reduction. PMID:22346626
Army College Fund Cost-Effectiveness Study
1990-11-01
Section A.2 presents a theory of enlistment supply to provide a basis for specifying the regression model , The model Is specified in Section A.3, which...Supplementary materials are included in the final four sections. Section A.6 provides annual trends in the regression model variables. Estimates of the model ...millions, A.S. ESTIMATION OF A YOUTH EARNINGS FORECASTING MODEL Civilian pay is an important explanatory variable in the regression model . Previous
Nolan, Bernard T.; Fienen, Michael N.; Lorenz, David L.
2015-01-01
We used a statistical learning framework to evaluate the ability of three machine-learning methods to predict nitrate concentration in shallow groundwater of the Central Valley, California: boosted regression trees (BRT), artificial neural networks (ANN), and Bayesian networks (BN). Machine learning methods can learn complex patterns in the data but because of overfitting may not generalize well to new data. The statistical learning framework involves cross-validation (CV) training and testing data and a separate hold-out data set for model evaluation, with the goal of optimizing predictive performance by controlling for model overfit. The order of prediction performance according to both CV testing R2 and that for the hold-out data set was BRT > BN > ANN. For each method we identified two models based on CV testing results: that with maximum testing R2 and a version with R2 within one standard error of the maximum (the 1SE model). The former yielded CV training R2 values of 0.94–1.0. Cross-validation testing R2 values indicate predictive performance, and these were 0.22–0.39 for the maximum R2 models and 0.19–0.36 for the 1SE models. Evaluation with hold-out data suggested that the 1SE BRT and ANN models predicted better for an independent data set compared with the maximum R2 versions, which is relevant to extrapolation by mapping. Scatterplots of predicted vs. observed hold-out data obtained for final models helped identify prediction bias, which was fairly pronounced for ANN and BN. Lastly, the models were compared with multiple linear regression (MLR) and a previous random forest regression (RFR) model. Whereas BRT results were comparable to RFR, MLR had low hold-out R2 (0.07) and explained less than half the variation in the training data. Spatial patterns of predictions by the final, 1SE BRT model agreed reasonably well with previously observed patterns of nitrate occurrence in groundwater of the Central Valley.
The influence of patient factors on femoral rotation after total hip arthroplasty.
Tezuka, Taro; Inaba, Yutaka; Kobayashi, Naomi; Choe, Hyonmin; Higashihira, Syota; Saito, Tomoyuki
2018-06-09
A postoperative change in femoral rotation following total hip arthroplasty (THA) might be the cause of dislocation due to the change in combined anteversion. However, very few studies have evaluated the femoral rotation angle following THA, or the factors that influence femoral rotation. We aimed to evaluate changes in femoral rotation after THA, and to investigate preoperative patient factors that influence femoral rotation after THA. This study involved 211 hips treated with primary THA. We used computed tomography to measure the femoral rotation angle before and one week after THA. In addition, multiple regression analysis was performed to evaluate preoperative patient factors that could influence femoral rotation after THA. The femoral rotation angle was 0.2 ± 14° externally before surgery and 4.4 ± 12° internally after surgery (p < 0.001). Multiple regression analysis revealed that sex (β = 0.19; p = 0.003), age (β = 0.15; p = 0.017), preoperative anatomical femoral anteversion (β = - 0.25; p = 0.002), and preoperative femoral rotation angle (β = 0.36; p < 0.001) were significantly associated with the postoperative femoral rotation angle. The final model of the regression formula was described by the following equation: [postoperative femoral rotation angle = 5.41 × sex (female: 0, male: 1) + 0.15 × age - 0.22 × preoperative anatomical femoral anteversion + 0.33 × preoperative femoral rotation angle - 10.1]. The current study showed the mean internal change of 4.6° in the femoral rotation angle one week after THA. Sex, age, preoperative anatomical femoral anteversion and preoperative femoral rotation were associated with postoperative femoral rotation. The patients who were male, older, and who exhibited lesser preoperative anatomical femoral anteversion or greater preoperative femoral rotation angles, tended to demonstrate an externally rotated femur after THA. Conversely, patients who were female, younger, and who exhibited greater preoperative anatomical femoral anteversion or lesser preoperative femoral rotation angles, tended to demonstrate an internal rotation of the femur after THA.
Allan, Bruce D; Hassan, Hala; Ieong, Alvin
2015-05-01
To describe and evaluate a new multiple regression-derived nomogram for myopic wavefront laser in situ keratomileusis (LASIK). Moorfields Eye Hospital, London, United Kingdom. Prospective comparative case series. Multiple regression modeling was used to derive a simplified formula for adjusting attempted spherical correction in myopic LASIK. An adaptation of Thibos' power vector method was then applied to derive adjustments to attempted cylindrical correction in eyes with 1.0 diopter (D) or more of preoperative cylinder. These elements were combined in a new nomogram (nomogram II). The 3-month refractive results for myopic wavefront LASIK (spherical equivalent ≤11.0 D; cylinder ≤4.5 D) were compared between 299 consecutive eyes treated using the earlier nomogram (nomogram I) in 2009 and 2010 and 414 eyes treated using nomogram II in 2011 and 2012. There was no significant difference in treatment accuracy (variance in the postoperative manifest refraction spherical equivalent error) between nomogram I and nomogram II (P = .73, Bartlett test). Fewer patients treated with nomogram II had more than 0.5 D of residual postoperative astigmatism (P = .0001, Fisher exact test). There was no significant coupling between adjustments to the attempted cylinder and the achieved sphere (P = .18, t test). Discarding marginal influences from a multiple regression-derived nomogram for myopic wavefront LASIK had no clinically significant effect on treatment accuracy. Thibos' power vector method can be used to guide adjustments to the treatment cylinder alongside nomograms designed to optimize postoperative spherical equivalent results in myopic LASIK. mentioned. Copyright © 2015 ASCRS and ESCRS. Published by Elsevier Inc. All rights reserved.
Almalki, Mohammed J; FitzGerald, Gerry; Clark, Michele
2012-09-12
Quality of work life (QWL) has been found to influence the commitment of health professionals, including nurses. However, reliable information on QWL and turnover intention of primary health care (PHC) nurses is limited. The aim of this study was to examine the relationship between QWL and turnover intention of PHC nurses in Saudi Arabia. A cross-sectional survey was used in this study. Data were collected using Brooks' survey of Quality of Nursing Work Life, the Anticipated Turnover Scale and demographic data questions. A total of 508 PHC nurses in the Jazan Region, Saudi Arabia, completed the questionnaire (RR = 87%). Descriptive statistics, t-test, ANOVA, General Linear Model (GLM) univariate analysis, standard multiple regression, and hierarchical multiple regression were applied for analysis using SPSS v17 for Windows. Findings suggested that the respondents were dissatisfied with their work life, with almost 40% indicating a turnover intention from their current PHC centres. Turnover intention was significantly related to QWL. Using standard multiple regression, 26% of the variance in turnover intention was explained by QWL, p < 0.001, with R2 = .263. Further analysis using hierarchical multiple regression found that the total variance explained by the model as a whole (demographics and QWL) was 32.1%, p < 0.001. QWL explained an additional 19% of the variance in turnover intention, after controlling for demographic variables. Creating and maintaining a healthy work life for PHC nurses is very important to improve their work satisfaction, reduce turnover, enhance productivity and improve nursing care outcomes.
2012-01-01
Background Quality of work life (QWL) has been found to influence the commitment of health professionals, including nurses. However, reliable information on QWL and turnover intention of primary health care (PHC) nurses is limited. The aim of this study was to examine the relationship between QWL and turnover intention of PHC nurses in Saudi Arabia. Methods A cross-sectional survey was used in this study. Data were collected using Brooks’ survey of Quality of Nursing Work Life, the Anticipated Turnover Scale and demographic data questions. A total of 508 PHC nurses in the Jazan Region, Saudi Arabia, completed the questionnaire (RR = 87%). Descriptive statistics, t-test, ANOVA, General Linear Model (GLM) univariate analysis, standard multiple regression, and hierarchical multiple regression were applied for analysis using SPSS v17 for Windows. Results Findings suggested that the respondents were dissatisfied with their work life, with almost 40% indicating a turnover intention from their current PHC centres. Turnover intention was significantly related to QWL. Using standard multiple regression, 26% of the variance in turnover intention was explained by QWL, p < 0.001, with R2 = .263. Further analysis using hierarchical multiple regression found that the total variance explained by the model as a whole (demographics and QWL) was 32.1%, p < 0.001. QWL explained an additional 19% of the variance in turnover intention, after controlling for demographic variables. Conclusions Creating and maintaining a healthy work life for PHC nurses is very important to improve their work satisfaction, reduce turnover, enhance productivity and improve nursing care outcomes. PMID:22970764
Risk factors for autistic regression: results of an ambispective cohort study.
Zhang, Ying; Xu, Qiong; Liu, Jing; Li, She-chang; Xu, Xiu
2012-08-01
A subgroup of children diagnosed with autism experience developmental regression featured by a loss of previously acquired abilities. The pathogeny of autistic regression is unknown, although many risk factors likely exist. To better characterize autistic regression and investigate the association between autistic regression and potential influencing factors in Chinese autistic children, we conducted an ambispective study with a cohort of 170 autistic subjects. Analyses by multiple logistic regression showed significant correlations between autistic regression and febrile seizures (OR = 3.53, 95% CI = 1.17-10.65, P = .025), as well as with a family history of neuropsychiatric disorders (OR = 3.62, 95% CI = 1.35-9.71, P = .011). This study suggests that febrile seizures and family history of neuropsychiatric disorders are correlated with autistic regression.
NASA Astrophysics Data System (ADS)
Sharudin, R. W.; AbdulBari Ali, S.; Zulkarnain, M.; Shukri, M. A.
2018-05-01
This study reports on the integration of Artificial Neural Network (ANNs) with experimental data in predicting the solubility of carbon dioxide (CO2) blowing agent in SEBS by generating highest possible value for Regression coefficient (R2). Basically, foaming of thermoplastic elastomer with CO2 is highly affected by the CO2 solubility. The ability of ANN in predicting interpolated data of CO2 solubility was investigated by comparing training results via different method of network training. Regards to the final prediction result for CO2 solubility by ANN, the prediction trend (output generate) was corroborated with the experimental results. The obtained result of different method of training showed the trend of output generated by Gradient Descent with Momentum & Adaptive LR (traingdx) required longer training time and required more accurate input to produce better output with final Regression Value of 0.88. However, it goes vice versa with Levenberg-Marquardt (trainlm) technique as it produced better output in quick detention time with final Regression Value of 0.91.
Miozzo, Michele; Pulvermüller, Friedemann; Hauk, Olaf
2015-01-01
The time course of brain activation during word production has become an area of increasingly intense investigation in cognitive neuroscience. The predominant view has been that semantic and phonological processes are activated sequentially, at about 150 and 200–400 ms after picture onset. Although evidence from prior studies has been interpreted as supporting this view, these studies were arguably not ideally suited to detect early brain activation of semantic and phonological processes. We here used a multiple linear regression approach to magnetoencephalography (MEG) analysis of picture naming in order to investigate early effects of variables specifically related to visual, semantic, and phonological processing. This was combined with distributed minimum-norm source estimation and region-of-interest analysis. Brain activation associated with visual image complexity appeared in occipital cortex at about 100 ms after picture presentation onset. At about 150 ms, semantic variables became physiologically manifest in left frontotemporal regions. In the same latency range, we found an effect of phonological variables in the left middle temporal gyrus. Our results demonstrate that multiple linear regression analysis is sensitive to early effects of multiple psycholinguistic variables in picture naming. Crucially, our results suggest that access to phonological information might begin in parallel with semantic processing around 150 ms after picture onset. PMID:25005037
ERIC Educational Resources Information Center
Kobrin, Jennifer L.; Sinharay, Sandip; Haberman, Shelby J.; Chajewski, Michael
2011-01-01
This study examined the adequacy of a multiple linear regression model for predicting first-year college grade point average (FYGPA) using SAT[R] scores and high school grade point average (HSGPA). A variety of techniques, both graphical and statistical, were used to examine if it is possible to improve on the linear regression model. The results…
Determining Sample Size for Accurate Estimation of the Squared Multiple Correlation Coefficient.
ERIC Educational Resources Information Center
Algina, James; Olejnik, Stephen
2000-01-01
Discusses determining sample size for estimation of the squared multiple correlation coefficient and presents regression equations that permit determination of the sample size for estimating this parameter for up to 20 predictor variables. (SLD)
Musuku, Adrien; Tan, Aimin; Awaiye, Kayode; Trabelsi, Fethi
2013-09-01
Linear calibration is usually performed using eight to ten calibration concentration levels in regulated LC-MS bioanalysis because a minimum of six are specified in regulatory guidelines. However, we have previously reported that two-concentration linear calibration is as reliable as or even better than using multiple concentrations. The purpose of this research is to compare two-concentration with multiple-concentration linear calibration through retrospective data analysis of multiple bioanalytical projects that were conducted in an independent regulated bioanalytical laboratory. A total of 12 bioanalytical projects were randomly selected: two validations and two studies for each of the three most commonly used types of sample extraction methods (protein precipitation, liquid-liquid extraction, solid-phase extraction). When the existing data were retrospectively linearly regressed using only the lowest and the highest concentration levels, no extra batch failure/QC rejection was observed and the differences in accuracy and precision between the original multi-concentration regression and the new two-concentration linear regression are negligible. Specifically, the differences in overall mean apparent bias (square root of mean individual bias squares) are within the ranges of -0.3% to 0.7% and 0.1-0.7% for the validations and studies, respectively. The differences in mean QC concentrations are within the ranges of -0.6% to 1.8% and -0.8% to 2.5% for the validations and studies, respectively. The differences in %CV are within the ranges of -0.7% to 0.9% and -0.3% to 0.6% for the validations and studies, respectively. The average differences in study sample concentrations are within the range of -0.8% to 2.3%. With two-concentration linear regression, an average of 13% of time and cost could have been saved for each batch together with 53% of saving in the lead-in for each project (the preparation of working standard solutions, spiking, and aliquoting). Furthermore, examples are given as how to evaluate the linearity over the entire concentration range when only two concentration levels are used for linear regression. To conclude, two-concentration linear regression is accurate and robust enough for routine use in regulated LC-MS bioanalysis and it significantly saves time and cost as well. Copyright © 2013 Elsevier B.V. All rights reserved.
Flood characteristics of Alaskan streams
Lamke, R.D.
1979-01-01
Peak discharge data for Alaskan streams are summarized and analyzed. Multiple-regression equations relating peak discharge magnitude and frequency to climatic and physical characteristics of 260 gaged basins were determined in order to estimate average recurrence interval of floods at ungaged sites. These equations are for 1.25-, 2-, 5-, 10-, 25-, and 50-year average recurrence intervals. In this report, Alaska was divided into two regions, one having a maritime climate with fall and winter rains and floods, the other having spring and summer floods of a variety or combinations of causes. Average standard errors of the six multiple-regression equations for these two regions were 48 and 74 percent, respectively. Maximum recorded floods at more than 400 sites throughout Alaska are tabulated. Maps showing lines of equal intensity of the principal climatic variables found to be significant (mean annual precipitation and mean minimum January temperature), and location of the 260 sites used in the multiple-regression analyses are included. Little flood data have been collected in western and arctic Alaska, and the predictive equations are therefore less reliable for those areas. (Woodard-USGS)
Suresh, Arumuganainar; Choi, Hong Lim
2011-10-01
Swine waste land application has increased due to organic fertilization, but excess application in an arable system can cause environmental risk. Therefore, in situ characterizations of such resources are important prior to application. To explore this, 41 swine slurry samples were collected from Korea, and wide differences were observed in the physico-biochemical properties. However, significant (P<0.001) multiple property correlations (R²) were obtained between nutrients with specific gravity (SG), electrical conductivity (EC), total solids (TS) and pH. The different combinations of hydrometer, EC meter, drying oven and pH meter were found useful to estimate Mn, Fe, Ca, K, Al, Na, N and 5-day biochemical oxygen demands (BOD₅) at improved R² values of 0.83, 0.82, 0.77, 0.75, 0.67, 0.47, 0.88 and 0.70, respectively. The results from this study suggest that multiple property regressions can facilitate the prediction of micronutrients and organic matter much better than a single property regression for livestock waste. Copyright © 2011 Elsevier Ltd. All rights reserved.
Mutter, Brigitte; Alcorn, Mark B; Welsh, Marilyn
2006-06-01
This study of the relationship between theory of mind and executive function examined whether on the false-belief task age differences between 3 and 5 ears of age are related to development of working-memory capacity and inhibitory processes. 72 children completed tasks measuring false belief, working memory, and inhibition. Significant age effects were observed for false-belief and working-memory performance, as well as for the false-alarm and perseveration measures of inhibition. A simultaneous multiple linear regression specified the contribution of age, inhibition, and working memory to the prediction of false-belief performance. This model was significant, explaining a total of 36% of the variance. To examine the independent contributions of the working-memory and inhibition variables, after controlling for age, two hierarchical multiple linear regressions were conducted. These multiple regression analyses indicate that working memory and inhibition make small, overlapping contributions to false-belief performance after accounting for age, but that working memory, as measured in this study, is a somewhat better predictor of false-belief understanding than is inhibition.
Mapping diffuse photosynthetically active radiation from satellite data in Thailand
NASA Astrophysics Data System (ADS)
Choosri, P.; Janjai, S.; Nunez, M.; Buntoung, S.; Charuchittipan, D.
2017-12-01
In this paper, calculation of monthly average hourly diffuse photosynthetically active radiation (PAR) using satellite data is proposed. Diffuse PAR was analyzed at four stations in Thailand. A radiative transfer model was used for calculating the diffuse PAR for cloudless sky conditions. Differences between the diffuse PAR under all sky conditions obtained from the ground-based measurements and those from the model are representative of cloud effects. Two models are developed, one describing diffuse PAR only as a function of solar zenith angle, and the second one as a multiple linear regression with solar zenith angle and satellite reflectivity acting linearly and aerosol optical depth acting in logarithmic functions. When tested with an independent data set, the multiple regression model performed best with a higher coefficient of variance R2 (0.78 vs. 0.70), lower root mean square difference (RMSD) (12.92% vs. 13.05%) and the same mean bias difference (MBD) of -2.20%. Results from the multiple regression model are used to map diffuse PAR throughout the country as monthly averages of hourly data.
Clifford support vector machines for classification, regression, and recurrence.
Bayro-Corrochano, Eduardo Jose; Arana-Daniel, Nancy
2010-11-01
This paper introduces the Clifford support vector machines (CSVM) as a generalization of the real and complex-valued support vector machines using the Clifford geometric algebra. In this framework, we handle the design of kernels involving the Clifford or geometric product. In this approach, one redefines the optimization variables as multivectors. This allows us to have a multivector as output. Therefore, we can represent multiple classes according to the dimension of the geometric algebra in which we work. We show that one can apply CSVM for classification and regression and also to build a recurrent CSVM. The CSVM is an attractive approach for the multiple input multiple output processing of high-dimensional geometric entities. We carried out comparisons between CSVM and the current approaches to solve multiclass classification and regression. We also study the performance of the recurrent CSVM with experiments involving time series. The authors believe that this paper can be of great use for researchers and practitioners interested in multiclass hypercomplex computing, particularly for applications in complex and quaternion signal and image processing, satellite control, neurocomputation, pattern recognition, computer vision, augmented virtual reality, robotics, and humanoids.
A general equation to obtain multiple cut-off scores on a test from multinomial logistic regression.
Bersabé, Rosa; Rivas, Teresa
2010-05-01
The authors derive a general equation to compute multiple cut-offs on a total test score in order to classify individuals into more than two ordinal categories. The equation is derived from the multinomial logistic regression (MLR) model, which is an extension of the binary logistic regression (BLR) model to accommodate polytomous outcome variables. From this analytical procedure, cut-off scores are established at the test score (the predictor variable) at which an individual is as likely to be in category j as in category j+1 of an ordinal outcome variable. The application of the complete procedure is illustrated by an example with data from an actual study on eating disorders. In this example, two cut-off scores on the Eating Attitudes Test (EAT-26) scores are obtained in order to classify individuals into three ordinal categories: asymptomatic, symptomatic and eating disorder. Diagnoses were made from the responses to a self-report (Q-EDD) that operationalises DSM-IV criteria for eating disorders. Alternatives to the MLR model to set multiple cut-off scores are discussed.
Partial least squares (PLS) analysis offers a number of advantages over the more traditionally used regression analyses applied in landscape ecology, particularly for determining the associations among multiple constituents of surface water and landscape configuration. Common dat...
An Update of the Bodeker Scientific Vertically Resolved, Global, Gap-Free Ozone Database
NASA Astrophysics Data System (ADS)
Kremser, S.; Bodeker, G. E.; Lewis, J.; Hassler, B.
2016-12-01
High vertical resolution ozone measurements from multiple satellite-based instruments have been merged with measurements from the global ozonesonde network to calculate monthly mean ozone values in 5º latitude zones. Ozone number densities and ozone mixing ratios are provided on 70 altitude levels (1 to 70 km) and on 70 pressure levels spaced approximately 1 km apart (878.4 hPa to 0.046 hPa). These data are sparse and do not cover the entire globe or altitude range. To provide a gap-free database, a least squares regression model is fitted to these data and then evaluated globally. By applying a single fit at each level, and using the approach of allowing the regression fits to change only slightly from one level to the next, the regression is less sensitive to measurement anomalies at individual stations or to individual satellite-based instruments. Particular attention is paid to ensuring that the low ozone abundances in the polar regions are captured. This presentation reports on updates to an earlier version of the vertically resolved ozone database, including the incorporation of new ozone measurements and new techniques for combining the data. Compared to previous versions of the database, particular attention is paid to avoiding spatial and temporal sampling biases and tracing uncertainties through to the final product. This updated database, developed within the New Zealand Deep South National Science Challenge, is suitable for assessing ozone fields from chemistry-climate model simulations or for providing the ozone boundary conditions for global climate model simulations that do not treat stratospheric chemistry interactively.
Space, race, and poverty: Spatial inequalities in walkable neighborhood amenities?
Aldstadt, Jared; Whalen, John; White, Kellee; Castro, Marcia C.; Williams, David R.
2017-01-01
BACKGROUND Multiple and varied benefits have been suggested for increased neighborhood walkability. However, spatial inequalities in neighborhood walkability likely exist and may be attributable, in part, to residential segregation. OBJECTIVE Utilizing a spatial demographic perspective, we evaluated potential spatial inequalities in walkable neighborhood amenities across census tracts in Boston, MA (US). METHODS The independent variables included minority racial/ethnic population percentages and percent of families in poverty. Walkable neighborhood amenities were assessed with a composite measure. Spatial autocorrelation in key study variables were first calculated with the Global Moran’s I statistic. Then, Spearman correlations between neighborhood socio-demographic characteristics and walkable neighborhood amenities were calculated as well as Spearman correlations accounting for spatial autocorrelation. We fit ordinary least squares (OLS) regression and spatial autoregressive models, when appropriate, as a final step. RESULTS Significant positive spatial autocorrelation was found in neighborhood socio-demographic characteristics (e.g. census tract percent Black), but not walkable neighborhood amenities or in the OLS regression residuals. Spearman correlations between neighborhood socio-demographic characteristics and walkable neighborhood amenities were not statistically significant, nor were neighborhood socio-demographic characteristics significantly associated with walkable neighborhood amenities in OLS regression models. CONCLUSIONS Our results suggest that there is residential segregation in Boston and that spatial inequalities do not necessarily show up using a composite measure. COMMENTS Future research in other geographic areas (including international contexts) and using different definitions of neighborhoods (including small-area definitions) should evaluate if spatial inequalities are found using composite measures but also should use measures of specific neighborhood amenities. PMID:29046612
Kasprzyk, Danuta; Tshimanga, Mufuta; Hamilton, Deven T; Gorn, Gerald J; Montaño, Daniel E
2018-02-01
Male circumcision (MC) significantly reduces HIV acquisition among men, leading WHO/UNAIDS to recommend high HIV and low MC prevalence countries circumcise 80% of adolescents and men age 15-49. Despite significant investment to increase MC capacity only 27% of the goal has been achieved in Zimbabwe. To increase adoption, research to create evidence-based messages is greatly needed. The Integrated Behavioral Model (IBM) was used to investigate factors affecting MC motivation among adolescents. Based on qualitative elicitation study results a survey was designed and administered to a representative sample of 802 adolescent boys aged 13-17 in two urban and two rural areas in Zimbabwe. Multiple regression analysis found all six IBM constructs (2 attitude, 2 social influence, 2 personal agency) significantly explained MC intention (R 2 = 0.55). Stepwise regression analysis of beliefs underlying each IBM belief-based construct found 9 behavioral, 6 injunctive norm, 2 descriptive norm, 5 efficacy, and 8 control beliefs significantly explained MC intention. A final stepwise regression of all the significant IBM construct beliefs identified 12 key beliefs best explaining intention. Similar analyses were carried out with subgroups of adolescents by urban-rural and age. Different sets of behavioral, normative, efficacy, and control beliefs were significant for each sub-group. This study demonstrates the application of theory-driven research to identify evidence-based targets for the design of effective MC messages for interventions to increase adolescents' motivation. Incorporating these findings into communication campaigns is likely to improve demand for MC.
Malosetti, Marcos; Ribaut, Jean-Marcel; van Eeuwijk, Fred A.
2013-01-01
Genotype-by-environment interaction (GEI) is an important phenomenon in plant breeding. This paper presents a series of models for describing, exploring, understanding, and predicting GEI. All models depart from a two-way table of genotype by environment means. First, a series of descriptive and explorative models/approaches are presented: Finlay–Wilkinson model, AMMI model, GGE biplot. All of these approaches have in common that they merely try to group genotypes and environments and do not use other information than the two-way table of means. Next, factorial regression is introduced as an approach to explicitly introduce genotypic and environmental covariates for describing and explaining GEI. Finally, QTL modeling is presented as a natural extension of factorial regression, where marker information is translated into genetic predictors. Tests for regression coefficients corresponding to these genetic predictors are tests for main effect QTL expression and QTL by environment interaction (QEI). QTL models for which QEI depends on environmental covariables form an interesting model class for predicting GEI for new genotypes and new environments. For realistic modeling of genotypic differences across multiple environments, sophisticated mixed models are necessary to allow for heterogeneity of genetic variances and correlations across environments. The use and interpretation of all models is illustrated by an example data set from the CIMMYT maize breeding program, containing environments differing in drought and nitrogen stress. To help readers to carry out the statistical analyses, GenStat® programs, 15th Edition and Discovery® version, are presented as “Appendix.” PMID:23487515
Occlusal wear and occlusal condition in a convenience sample of young adults.
Van't Spijker, A; Kreulen, C M; Bronkhorst, E M; Creugers, N H J
2015-01-01
To study progression of tooth wear quantitatively in a convenient sample of young adults and to assess possible correlations with occlusal conditions. Twenty-eight dental students participated in a three-year follow up study on tooth wear. Visible wear facets on full arch gypsum casts were assessed using a flatbed scanner and measuring software. Regression analyses were used to assess possible associations between the registered occlusal conditions 'occlusal guidance scheme', 'vertical overbite', 'horizontal overbite', 'depth of sagittal curve', 'canine Angle class relation', 'history of orthodontic treatment', and 'self-reported grinding/clenching' (independent variables) and increase of wear facets (dependent variable). Mean increase in facet surface areas ranged from 1.2 mm2 (premolars, incisors) to 3.4 mm2 (molars); the relative increase ranged from 15% to 23%. Backward regression analysis showed no significant relation for 'group function', 'vertical overbite', 'depth of sagittal curve', 'history of orthodontic treatment' nor 'self-reported clenching. The final multiple linear regression model showed significant associations amongst 'anterior protected articulation' and 'horizontal overbite' and increase of facet surface areas. For all teeth combined, only 'anterior protected articulation' had a significant effect. 'Self reported grinding' did not have a significant effect (p>0.07). In this study 'anterior protected articulation' and 'horizontal overbite', were significantly associated with the progression of tooth wear. Self reported grinding was not significantly associated with progression of tooth wear. Occlusal conditions such as anterior protected articulation and horizontal overbite seem to have an effect on the progression of occlusal tooth wear in this convenient sample of young adults. Copyright © 2014 Elsevier Ltd. All rights reserved.
Welch, J.E.; Lund, L.J.
1989-01-01
A soil column study was conducted to assess the movement of Zn in sewage-sludge-amended soils. Varables investigated were soil properties, irrigation water quality, and soil moisture level. Bulk samples of the surface layer of six soil series were packed into columns, 10.2 cm in diameter and 110 cm in length. An anaerobically digested municipal sewage sludge was incorporated into the top 20 cm of each column at a rate of 300 mg ha-1. The columns were maintained at moisture levels of saturation and unsaturation and were leached with two waters of different quality. At the termination of leaching, the columns were cut open and the soil was sectioned and analyzed. Zinc movement was evaluated by mass balance accounting and correlation and regression analysis. Zinc movement in the unsaturated columns ranged from 3 to 30 cm, with a mean of 10 cm. The difference in irrigation water quality did not have an effect on Zn movement. Most of the Zn applied to the unsaturated columns remained in the sludge-amended soil layer (96.1 to 99.6%, with a mean of 98.1%). The major portion of Zn leached from the sludge-amended soil layer accumulated in the 0- to 3-cm depth (35.7 to 100%, with a mean of 73.6%). The mean final soil pH values decreased in the order: saturated columns = sludge-amended soil layer > untreated soils > unsaturated columns. Total Zn leached from the sludge-amended soil layer was correlated negatively at P = 0.001 with final pH (r = -0.85). Depth of Zn movement was correlated negatively at P = 0.001 with final pH (r = -0.91). Multiple linear regression analysis showed that the final pH accounted for 72% of the variation in the total amounts of Zn leached from the sludge-amended soil layer of the unsaturated columns and accounted for 82% of the variation in the depth of Zn movement among the unsaturated columns. A significant correlation was not found between Zn and organic carbon in soil solutions, but a negative correlation significant at P = 0.001 was found between pH and Zn (r = -0.61).
Viswanathan, M; Pearl, D L; Taboada, E N; Parmley, E J; Mutschall, S K; Jardine, C M
2017-05-01
Using data collected from a cross-sectional study of 25 farms (eight beef, eight swine and nine dairy) in 2010, we assessed clustering of molecular subtypes of C. jejuni based on a Campylobacter-specific 40 gene comparative genomic fingerprinting assay (CGF40) subtypes, using unweighted pair-group method with arithmetic mean (UPGMA) analysis, and multiple correspondence analysis. Exact logistic regression was used to determine which genes differentiate wildlife and livestock subtypes in our study population. A total of 33 bovine livestock (17 beef and 16 dairy), 26 wildlife (20 raccoon (Procyon lotor), five skunk (Mephitis mephitis) and one mouse (Peromyscus spp.) C. jejuni isolates were subtyped using CGF40. Dendrogram analysis, based on UPGMA, showed distinct branches separating bovine livestock and mammalian wildlife isolates. Furthermore, two-dimensional multiple correspondence analysis was highly concordant with dendrogram analysis showing clear differentiation between livestock and wildlife CGF40 subtypes. Based on multilevel logistic regression models with a random intercept for farm of origin, we found that isolates in general, and raccoons more specifically, were significantly more likely to be part of the wildlife branch. Exact logistic regression conducted gene by gene revealed 15 genes that were predictive of whether an isolate was of wildlife or bovine livestock isolate origin. Both multiple correspondence analysis and exact logistic regression revealed that in most cases, the presence of a particular gene (13 of 15) was associated with an isolate being of livestock rather than wildlife origin. In conclusion, the evidence gained from dendrogram analysis, multiple correspondence analysis and exact logistic regression indicates that mammalian wildlife carry CGF40 subtypes of C. jejuni distinct from those carried by bovine livestock. Future studies focused on source attribution of C. jejuni in human infections will help determine whether wildlife transmit Campylobacter jejuni directly to humans. © 2016 Blackwell Verlag GmbH.
Ono, Tomohiro; Nakamura, Mitsuhiro; Hirose, Yoshinori; Kitsuda, Kenji; Ono, Yuka; Ishigaki, Takashi; Hiraoka, Masahiro
2017-09-01
To estimate the lung tumor position from multiple anatomical features on four-dimensional computed tomography (4D-CT) data sets using single regression analysis (SRA) and multiple regression analysis (MRA) approach and evaluate an impact of the approach on internal target volume (ITV) for stereotactic body radiotherapy (SBRT) of the lung. Eleven consecutive lung cancer patients (12 cases) underwent 4D-CT scanning. The three-dimensional (3D) lung tumor motion exceeded 5 mm. The 3D tumor position and anatomical features, including lung volume, diaphragm, abdominal wall, and chest wall positions, were measured on 4D-CT images. The tumor position was estimated by SRA using each anatomical feature and MRA using all anatomical features. The difference between the actual and estimated tumor positions was defined as the root-mean-square error (RMSE). A standard partial regression coefficient for the MRA was evaluated. The 3D lung tumor position showed a high correlation with the lung volume (R = 0.92 ± 0.10). Additionally, ITVs derived from SRA and MRA approaches were compared with ITV derived from contouring gross tumor volumes on all 10 phases of the 4D-CT (conventional ITV). The RMSE of the SRA was within 3.7 mm in all directions. Also, the RMSE of the MRA was within 1.6 mm in all directions. The standard partial regression coefficient for the lung volume was the largest and had the most influence on the estimated tumor position. Compared with conventional ITV, average percentage decrease of ITV were 31.9% and 38.3% using SRA and MRA approaches, respectively. The estimation accuracy of lung tumor position was improved by the MRA approach, which provided smaller ITV than conventional ITV. © 2017 The Authors. Journal of Applied Clinical Medical Physics published by Wiley Periodicals, Inc. on behalf of American Association of Physicists in Medicine.
Using the Ridge Regression Procedures to Estimate the Multiple Linear Regression Coefficients
NASA Astrophysics Data System (ADS)
Gorgees, HazimMansoor; Mahdi, FatimahAssim
2018-05-01
This article concerns with comparing the performance of different types of ordinary ridge regression estimators that have been already proposed to estimate the regression parameters when the near exact linear relationships among the explanatory variables is presented. For this situations we employ the data obtained from tagi gas filling company during the period (2008-2010). The main result we reached is that the method based on the condition number performs better than other methods since it has smaller mean square error (MSE) than the other stated methods.
ERIC Educational Resources Information Center
Carter, David S.
1979-01-01
There are a variety of formulas for reducing the positive bias which occurs in estimating R squared in multiple regression or correlation equations. Five different formulas are evaluated in a Monte Carlo study, and recommendations are made. (JKS)
Estimating Optimal Transformations for Multiple Regression and Correlation.
1982-07-01
S w.EECTli1Z"", , J OCT 0 11982 u! !for Public its... .. . ESTIMATING OPTIMAL TRANSFORMATIONS FOR MULTIPLE REGRESSION AND CORRELATION by Leo...in the plot lb of *(yk) versus 1 < k < 200. Figure lc is a plot of $*(xk) versus xk. These plots clearly suggest the transformati " s 6(y) = log(y) and...direct .814 .022 ACE .808 .031 -13- Figure la6L ’ ’ I . . . S " ’ ’ . . I ’ 6- - - .4...... Co o • . o ’ 0 0.2 0.4 0.5 0.8 1 Fi gure lb2 2 2 // II / / -/
Bark analysis as a guide to cassava nutrition in Sierra Leone
DOE Office of Scientific and Technical Information (OSTI.GOV)
Godfrey-Sam-Aggrey, W.; Garber, M.J.
1979-01-01
Cassava main stem barks from two experiments in which similar fertilizers were applied directly in a 2/sup 5/ confounded factorial design were analyzed and the bark nutrients used as a guide to cassava nutrition. The application of multiple regression analysis to the respective root yields and bark nutrient concentrations enable nutrient levels and optimum adjusted root yields to be derived. Differences in bark nutrient concentrations reflected soil fertility levels. Bark analysis and the application of multiple regression analysis to root yields and bark nutrients appear to be useful tools for predicting fertilizer recommendations for cassava production.
NASA Astrophysics Data System (ADS)
Shastri, Niket; Pathak, Kamlesh
2018-05-01
The water vapor content in atmosphere plays very important role in climate. In this paper the application of GPS signal in meteorology is discussed, which is useful technique that is used to estimate the perceptible water vapor of atmosphere. In this paper various algorithms like artificial neural network, support vector machine and multiple linear regression are use to predict perceptible water vapor. The comparative studies in terms of root mean square error and mean absolute errors are also carried out for all the algorithms.
NASA Astrophysics Data System (ADS)
Shi, Jinfei; Zhu, Songqing; Chen, Ruwen
2017-12-01
An order selection method based on multiple stepwise regressions is proposed for General Expression of Nonlinear Autoregressive model which converts the model order problem into the variable selection of multiple linear regression equation. The partial autocorrelation function is adopted to define the linear term in GNAR model. The result is set as the initial model, and then the nonlinear terms are introduced gradually. Statistics are chosen to study the improvements of both the new introduced and originally existed variables for the model characteristics, which are adopted to determine the model variables to retain or eliminate. So the optimal model is obtained through data fitting effect measurement or significance test. The simulation and classic time-series data experiment results show that the method proposed is simple, reliable and can be applied to practical engineering.
Regression Analysis with Dummy Variables: Use and Interpretation.
ERIC Educational Resources Information Center
Hinkle, Dennis E.; Oliver, J. Dale
1986-01-01
Multiple regression analysis (MRA) may be used when both continuous and categorical variables are included as independent research variables. The use of MRA with categorical variables involves dummy coding, that is, assigning zeros and ones to levels of categorical variables. Caution is urged in results interpretation. (Author/CH)
Applied Statistics: From Bivariate through Multivariate Techniques [with CD-ROM
ERIC Educational Resources Information Center
Warner, Rebecca M.
2007-01-01
This book provides a clear introduction to widely used topics in bivariate and multivariate statistics, including multiple regression, discriminant analysis, MANOVA, factor analysis, and binary logistic regression. The approach is applied and does not require formal mathematics; equations are accompanied by verbal explanations. Students are asked…
REGRESSION MODELS THAT RELATE STREAMS TO WATERSHEDS: COPING WITH NUMEROUS, COLLINEAR PEDICTORS
GIS efforts can produce a very large number of watershed variables (climate, land use/land cover and topography, all defined for multiple areas of influence) that could serve as candidate predictors in a regression model of reach-scale stream features. Invariably, many of these ...
Identifying the Factors That Influence Change in SEBD Using Logistic Regression Analysis
ERIC Educational Resources Information Center
Camilleri, Liberato; Cefai, Carmel
2013-01-01
Multiple linear regression and ANOVA models are widely used in applications since they provide effective statistical tools for assessing the relationship between a continuous dependent variable and several predictors. However these models rely heavily on linearity and normality assumptions and they do not accommodate categorical dependent…
A Constrained Linear Estimator for Multiple Regression
ERIC Educational Resources Information Center
Davis-Stober, Clintin P.; Dana, Jason; Budescu, David V.
2010-01-01
"Improper linear models" (see Dawes, Am. Psychol. 34:571-582, "1979"), such as equal weighting, have garnered interest as alternatives to standard regression models. We analyze the general circumstances under which these models perform well by recasting a class of "improper" linear models as "proper" statistical models with a single predictor. We…
Cooperative Control of Multiple Unmanned Autonomous Vehicles
2005-06-03
I I Final Report 4. TITLE AND SUBTITLE 5. FUNDING NUMBERS Cooperative Control of Multiple Unmanned Autonomous Vehicles F49620-01-1-0337 6. AUTHOR(S... Autonomous Vehicles Final Report Kendall E. Nygard Department of Computer Science and Operations Research North Dakota State University Fargo, ND 58105-5164
Regression of a vaginal leiomyoma after ovariohysterectomy in a dog: a case report.
Sathya, Suresh; Linn, Kathleen
2014-01-01
An 11 yr old female mixed-breed Siberian husky was presented with a history of sanguineous vaginal discharge, swelling of the perineal area, decreased appetite, and lethargy. A single, large vaginal leiomyoma and multiple mammary tumors were diagnosed. Mastectomy and ovariohysterectomy were performed. The vaginal leiomyoma regressed completely after ovariohysterectomy. This is the first reported case of spontaneous regression of a vaginal leiomyoma after ovariohysterectomy in a dog.
Chung, Yuh-Jin; Jung, Woo-Chul
2017-01-01
In the distribution service industry, sales people often experience multiple occupational stressors such as excessive emotional labor, workplace mistreatment, and job insecurity. The present study aimed to explore the associations of these stressors with depressive symptoms among women sales workers at a clothing shopping mall in Korea. A cross sectional study was conducted on 583 women who consist of clothing sales workers and manual workers using a structured questionnaire to assess demographic factors, occupational stressors, and depressive symptoms. Multiple regression analyses were performed to explore the association of these stressors with depressive symptoms. Scores for job stress subscales such as job demand, job control, and job insecurity were higher among sales workers than among manual workers (p < 0.01). The multiple regression analysis revealed the association between occupation and depressive symptoms after controlling for age, educational level, cohabiting status, and occupational stressors (sβ = 0.08, p = 0.04). A significant interaction effect between occupation and social support was also observed in this model (sβ = −0.09, p = 0.02). The multiple regression analysis stratified by occupation showed that job demand, job insecurity, and workplace mistreatment were significantly associated with depressive symptoms in both occupations (p < 0.05), although the strength of statistical associations were slightly different. We found negative associations of social support (sβ = −0.22, p < 0.01) and emotional effort (sβ = −0.17, p < 0.01) with depressive symptoms in another multiple regression model for sales workers. Emotional dissonance (sβ = 0.23, p < 0.01) showed positive association with depressive symptoms in this model. The result of this study indicated that reducing occupational stressors would be effective for women sales workers to prevent depressive symptoms. In particular, promoting social support could be the most effective way to promote women sales workers’ mental health. PMID:29168777
Chung, Yuh-Jin; Jung, Woo-Chul; Kim, Hyunjoo; Cho, Seong-Sik
2017-11-23
In the distribution service industry, sales people often experience multiple occupational stressors such as excessive emotional labor, workplace mistreatment, and job insecurity. The present study aimed to explore the associations of these stressors with depressive symptoms among women sales workers at a clothing shopping mall in Korea. A cross sectional study was conducted on 583 women who consist of clothing sales workers and manual workers using a structured questionnaire to assess demographic factors, occupational stressors, and depressive symptoms. Multiple regression analyses were performed to explore the association of these stressors with depressive symptoms. Scores for job stress subscales such as job demand, job control, and job insecurity were higher among sales workers than among manual workers ( p < 0.01). The multiple regression analysis revealed the association between occupation and depressive symptoms after controlling for age, educational level, cohabiting status, and occupational stressors (sβ = 0.08, p = 0.04). A significant interaction effect between occupation and social support was also observed in this model (sβ = -0.09, p = 0.02). The multiple regression analysis stratified by occupation showed that job demand, job insecurity, and workplace mistreatment were significantly associated with depressive symptoms in both occupations ( p < 0.05), although the strength of statistical associations were slightly different. We found negative associations of social support (sβ = -0.22, p < 0.01) and emotional effort (sβ = -0.17, p < 0.01) with depressive symptoms in another multiple regression model for sales workers. Emotional dissonance (sβ = 0.23, p < 0.01) showed positive association with depressive symptoms in this model. The result of this study indicated that reducing occupational stressors would be effective for women sales workers to prevent depressive symptoms. In particular, promoting social support could be the most effective way to promote women sales workers' mental health.
ERIC Educational Resources Information Center
Braten, Ivar; Stromso, Helge I.
2010-01-01
In this study, law students (n = 49) read multiple authentic documents presenting conflicting information on the topic of climate change and responded to verification tasks assessing their superficial as well as their deeper-level within- and across-documents comprehension. Hierarchical multiple regression analyses showed that even after variance…
Agger, Sean A.; Marney, Luke C.; Hoofnagle, Andrew N.
2011-01-01
BACKGROUND If liquid-chromatography–multiple-reaction–monitoring mass spectrometry (LC-MRM/MS) could be used in the large-scale preclinical verification of putative biomarkers, it would obviate the need for the development of expensive immunoassays. In addition, the translation of novel biomarkers to clinical use would be accelerated if the assays used in preclinical studies were the same as those used in the clinical laboratory. To validate this approach, we developed a multiplexed assay for the quantification of 2 clinically well-known biomarkers in human plasma, apolipoprotein A-I and apolipoprotein B (apoA-I and apoB). METHODS We used PeptideAtlas to identify candidate peptides. Human samples were denatured with urea or trifluoroethanol, reduced and alkylated, and digested with trypsin. We compared reversed-phase chromatographic separation of peptides with normal flow and microflow, and we normalized endogenous peptide peak areas to internal standard peptides. We evaluated different methods of calibration and compared the final method with a nephelometric immunoassay. RESULTS We developed a final method using trifluoroethanol denaturation, 21-h digestion, normal flow chromatography-electrospray ionization, and calibration with a single normal human plasma sample. For samples injected in duplicate, the method had intraassay CVs <6% and interassay CVs <12% for both proteins, and compared well with immunoassay (n = 47; Deming regression, LC-MRM/MS = 1.17 × immunoassay – 36.6; Sx|y = 10.3 for apoA-I and LC-MRM/MS = 1.21 × immunoassay + 7.0; Sx|y = 7.9 for apoB). CONCLUSIONS Multiplexed quantification of proteins in human plasma/serum by LC-MRM/MS is possible and compares well with clinically useful immunoassays. The potential application of single-point calibration to large clinical studies could simplify efforts to reduce day-to-day digestion variability. PMID:20923952
Deep ensemble learning of sparse regression models for brain disease diagnosis.
Suk, Heung-Il; Lee, Seong-Whan; Shen, Dinggang
2017-04-01
Recent studies on brain imaging analysis witnessed the core roles of machine learning techniques in computer-assisted intervention for brain disease diagnosis. Of various machine-learning techniques, sparse regression models have proved their effectiveness in handling high-dimensional data but with a small number of training samples, especially in medical problems. In the meantime, deep learning methods have been making great successes by outperforming the state-of-the-art performances in various applications. In this paper, we propose a novel framework that combines the two conceptually different methods of sparse regression and deep learning for Alzheimer's disease/mild cognitive impairment diagnosis and prognosis. Specifically, we first train multiple sparse regression models, each of which is trained with different values of a regularization control parameter. Thus, our multiple sparse regression models potentially select different feature subsets from the original feature set; thereby they have different powers to predict the response values, i.e., clinical label and clinical scores in our work. By regarding the response values from our sparse regression models as target-level representations, we then build a deep convolutional neural network for clinical decision making, which thus we call 'Deep Ensemble Sparse Regression Network.' To our best knowledge, this is the first work that combines sparse regression models with deep neural network. In our experiments with the ADNI cohort, we validated the effectiveness of the proposed method by achieving the highest diagnostic accuracies in three classification tasks. We also rigorously analyzed our results and compared with the previous studies on the ADNI cohort in the literature. Copyright © 2017 Elsevier B.V. All rights reserved.
Deep ensemble learning of sparse regression models for brain disease diagnosis
Suk, Heung-Il; Lee, Seong-Whan; Shen, Dinggang
2018-01-01
Recent studies on brain imaging analysis witnessed the core roles of machine learning techniques in computer-assisted intervention for brain disease diagnosis. Of various machine-learning techniques, sparse regression models have proved their effectiveness in handling high-dimensional data but with a small number of training samples, especially in medical problems. In the meantime, deep learning methods have been making great successes by outperforming the state-of-the-art performances in various applications. In this paper, we propose a novel framework that combines the two conceptually different methods of sparse regression and deep learning for Alzheimer’s disease/mild cognitive impairment diagnosis and prognosis. Specifically, we first train multiple sparse regression models, each of which is trained with different values of a regularization control parameter. Thus, our multiple sparse regression models potentially select different feature subsets from the original feature set; thereby they have different powers to predict the response values, i.e., clinical label and clinical scores in our work. By regarding the response values from our sparse regression models as target-level representations, we then build a deep convolutional neural network for clinical decision making, which thus we call ‘ Deep Ensemble Sparse Regression Network.’ To our best knowledge, this is the first work that combines sparse regression models with deep neural network. In our experiments with the ADNI cohort, we validated the effectiveness of the proposed method by achieving the highest diagnostic accuracies in three classification tasks. We also rigorously analyzed our results and compared with the previous studies on the ADNI cohort in the literature. PMID:28167394
Yan, Chao-Gan; Craddock, R. Cameron; Zuo, Xi-Nian; Zang, Yu-Feng; Milham, Michael P.
2014-01-01
As researchers increase their efforts to characterize variations in the functional connectome across studies and individuals, concerns about the many sources of nuisance variation present and their impact on resting state fMRI (R-fMRI) measures continue to grow. Although substantial within-site variation can exist, efforts to aggregate data across multiple sites such as the 1000 Functional Connectomes Project (FCP) and International Neuroimaging Data-sharing Initiative (INDI) datasets amplify these concerns. The present work draws upon standardization approaches commonly used in the microarray gene expression literature, and to a lesser extent recent imaging studies, and compares them with respect to their impact on relationships between common R-fMRI measures and nuisance variables (e.g., imaging site, motion), as well as phenotypic variables of interest (age, sex). Standardization approaches differed with regard to whether they were applied post-hoc vs. during pre-processing, and at the individual vs. group level; additionally they varied in whether they addressed additive effects vs. additive + multiplicative effects, and were parametric vs. non-parametric. While all standardization approaches were effective at reducing undesirable relationships with nuisance variables, post-hoc approaches were generally more effective than global signal regression (GSR). Across approaches, correction for additive effects (global mean) appeared to be more important than for multiplicative effects (global SD) for all R-fMRI measures, with the exception of amplitude of low frequency fluctuations (ALFF). Group-level post-hoc standardizations for mean-centering and variance-standardization were found to be advantageous in their ability to avoid the introduction of artifactual relationships with standardization parameters; though results between individual and group-level post-hoc approaches were highly similar overall. While post-hoc standardization procedures drastically increased test–retest (TRT) reliability for ALFF, modest reductions were observed for other measures after post-hoc standardizations—a phenomena likely attributable to the separation of voxel-wise from global differences among subjects (global mean and SD demonstrated moderate TRT reliability for these measures). Finally, the present work calls into question previous observations of increased anatomical specificity for GSR over mean centering, and draws attention to the near equivalence of global and gray matter signal regression. PMID:23631983
Aung, Wint Yan; Massoumzadeh, Parinaz; Najmi, Safa; Salter, Amber; Heaps, Jodi; Benzinger, Tammie L S; Mar, Soe
2018-01-01
There are no clinical features or biomarkers that can reliably differentiate acute disseminated encephalomyelitis from multiple sclerosis at the first demyelination attack. Consequently, a final diagnosis is sometimes delayed by months and years of follow-up. Early treatment for multiple sclerosis is recommended to reduce long-term disability. Therefore, we intend to explore neuroimaging biomarkers that can reliably distinguish between the two diagnoses. We reviewed prospectively collected clinical, standard MRI and diffusion tensor imaging data from 12 pediatric patients who presented with acute demyelination with and without encephalopathy. Patients were followed for an average of 6.5 years to determine the accuracy of final diagnosis. Final diagnosis was determined using 2013 International Pediatric MS Study Group criteria. Control subjects consisted of four age-matched healthy individuals for each patient. The study population consisted of six patients with central nervous system demyelination with encephalopathy with a presumed diagnosis of acute disseminated encephalomyelitis and six without encephalopathy with a presumed diagnosis of multiple sclerosis or clinically isolated syndrome at high risk for multiple sclerosis. During follow-up, two patients with initial diagnosis of acute disseminated encephalomyelitis were later diagnosed with multiple sclerosis. Diffusion tensor imaging region of interest analysis of baseline scans showed differences between final diagnosis of multiple sclerosis and acute disseminated encephalomyelitis patients, whereby low fractional anisotropy and high radial diffusivity occurred in multiple sclerosis patients compared with acute disseminated encephalomyelitis patients and the age-matched controls. Fractional anisotropy and radial diffusivity measures may have the potential to serve as biomarkers for distinguishing acute disseminated encephalomyelitis from multiple sclerosis at the onset. Copyright © 2017 Elsevier Inc. All rights reserved.
Multi -risk assessment at a national level in Georgia
NASA Astrophysics Data System (ADS)
Tsereteli, Nino; Varazanashvili, Otar; Amiranashvili, Avtandil; Tsereteli, Emili; Elizbarashvili, Elizbar; Saluqvadze, Manana; Dolodze, Jemal
2013-04-01
Work presented here was initiated by national GNSF project " Reducing natural disasters multiple risk: a positive factor for Georgia development " and two international projects: NATO SFP 983038 "Seismic hazard and Rusk assessment for Southern Caucasus-eastern Turkey Energy Corridors" and EMME " Earthquake Model for Middle east Region". Methodology for estimation of "general" vulnerability, hazards and multiple risk to natural hazards (namely, earthquakes, landslides, snow avalanches, flash floods, mudflows, drought, hurricanes, frost, hail) where developed for Georgia. The electronic detailed databases of natural disasters were created. These databases contain the parameters of hazardous phenomena that caused natural disasters. The magnitude and intensity scale of the mentioned disasters are reviewed and the new magnitude and intensity scales are suggested for disasters for which the corresponding formalization is not yet performed. The associated economic losses were evaluated and presented in monetary terms for these hazards. Based on the hazard inventory, an approach was developed that allowed for the calculation of an overall vulnerability value for each individual hazard type, using the Gross Domestic Product per unit area (applied to population) as the indicator for elements at risk exposed. The correlation between estimated economic losses, physical exposure and the magnitude for each of the six types of hazards has been investigated in detail by using multiple linear regression analysis. Economic losses for all past events and historical vulnerability were estimated. Finally, the spatial distribution of general vulnerability was assessed, and the expected maximum economic loss was calculated as well as a multi-risk map was set-up.
Quantile Regression in the Study of Developmental Sciences
Petscher, Yaacov; Logan, Jessica A. R.
2014-01-01
Linear regression analysis is one of the most common techniques applied in developmental research, but only allows for an estimate of the average relations between the predictor(s) and the outcome. This study describes quantile regression, which provides estimates of the relations between the predictor(s) and outcome, but across multiple points of the outcome’s distribution. Using data from the High School and Beyond and U.S. Sustained Effects Study databases, quantile regression is demonstrated and contrasted with linear regression when considering models with: (a) one continuous predictor, (b) one dichotomous predictor, (c) a continuous and a dichotomous predictor, and (d) a longitudinal application. Results from each example exhibited the differential inferences which may be drawn using linear or quantile regression. PMID:24329596
Depressive disorder in pregnant Latin women: does intimate partner violence matter?
Fonseca-Machado, Mariana de Oliveira; Alves, Lisiane Camargo; Monteiro, Juliana Cristina Dos Santos; Stefanello, Juliana; Nakano, Ana Márcia Spanó; Haas, Vanderlei José; Gomes-Sponholz, Flávia
2015-05-01
To identify the association of antenatal depressive symptoms with intimate partner violence during the current pregnancy in Brazilian women. Intimate partner violence is an important risk factor for antenatal depression. To the authors' knowledge, there has been no study to date that assessed the association between intimate partner violence during pregnancy and antenatal depressive symptoms among Brazilian women. Cross-sectional study. Three hundred and fifty-eight pregnant women were enrolled in the study. The Edinburgh Postnatal Depression Scale and an adapted version of the instrument used in the World Health Organization Multi-country Study on Women's Health and Domestic Violence were used to measure antenatal depressive symptoms and psychological, physical and sexual acts of intimate partner violence during the current pregnancy respectively. Multiple logistic regression and multiple linear regression were used for data analysis. The prevalence of antenatal depressive symptoms, as determined by the cut-off score of 12 in the Edinburgh Postnatal Depression Scale, was 28·2% (101). Of the participants, 63 (17·6%) reported some type of intimate partner violence during pregnancy. Among them, 60 (95·2%) reported suffering psychological violence, 23 (36·5%) physical violence and one (1·6%) sexual violence. Multiple logistic regression and multiple linear regression indicated that antenatal depressive symptoms are extremely associated with intimate partner violence during pregnancy. Among Brazilian women, exposure to intimate partner violence during pregnancy increases the chances of experiencing antenatal depressive symptoms. Clinical nurses and nurses midwifes should pay attention to the particularities of Brazilian women, especially with regard to the occurrence of intimate partner violence, whose impacts on the mental health of this population are extremely significant, both during the gestational period and postpartum. © 2015 John Wiley & Sons Ltd.
Simple to complex modeling of breathing volume using a motion sensor.
John, Dinesh; Staudenmayer, John; Freedson, Patty
2013-06-01
To compare simple and complex modeling techniques to estimate categories of low, medium, and high ventilation (VE) from ActiGraph™ activity counts. Vertical axis ActiGraph™ GT1M activity counts, oxygen consumption and VE were measured during treadmill walking and running, sports, household chores and labor-intensive employment activities. Categories of low (<19.3 l/min), medium (19.3 to 35.4 l/min) and high (>35.4 l/min) VEs were derived from activity intensity classifications (light <2.9 METs, moderate 3.0 to 5.9 METs and vigorous >6.0 METs). We examined the accuracy of two simple techniques (multiple regression and activity count cut-point analyses) and one complex (random forest technique) modeling technique in predicting VE from activity counts. Prediction accuracy of the complex random forest technique was marginally better than the simple multiple regression method. Both techniques accurately predicted VE categories almost 80% of the time. The multiple regression and random forest techniques were more accurate (85 to 88%) in predicting medium VE. Both techniques predicted the high VE (70 to 73%) with greater accuracy than low VE (57 to 60%). Actigraph™ cut-points for light, medium and high VEs were <1381, 1381 to 3660 and >3660 cpm. There were minor differences in prediction accuracy between the multiple regression and the random forest technique. This study provides methods to objectively estimate VE categories using activity monitors that can easily be deployed in the field. Objective estimates of VE should provide a better understanding of the dose-response relationship between internal exposure to pollutants and disease. Copyright © 2013 Elsevier B.V. All rights reserved.
Functional capacity following univentricular repair--midterm outcome.
Sen, Supratim; Bandyopadhyay, Biswajit; Eriksson, Peter; Chattopadhyay, Amitabha
2012-01-01
Previous studies have seldom compared functional capacity in children following Fontan procedure alongside those with Glenn operation as destination therapy. We hypothesized that Fontan circulation enables better midterm submaximal exercise capacity as compared to Glenn physiology and evaluated this using the 6-minute walk test. Fifty-seven children aged 5-18 years with Glenn (44) or Fontan (13) operations were evaluated with standard 6-minute walk protocols. Baseline SpO(2) was significantly lower in Glenn patients younger than 10 years compared to Fontan counterparts and similar in the two groups in older children. Postexercise SpO(2) fell significantly in Glenn patients compared to the Fontan group. There was no statistically significant difference in baseline, postexercise, or postrecovery heart rates (HRs), or 6-minute walk distances in the two groups. Multiple regression analysis revealed lower resting HR, higher resting SpO(2) , and younger age at latest operation to be significant determinants of longer 6-minute walk distance. Multiple regression analysis also established that younger age at operation, higher resting SpO(2) , Fontan operation, lower resting HR, and lower postexercise HR were significant determinants of higher postexercise SpO(2) . Younger age at operation and exercise, lower resting HR and postexercise HR, higher resting SpO(2) and postexercise SpO(2) , and dominant ventricular morphology being left ventricular or indeterminate/mixed had significant association with better 6-minute work on multiple regression analysis. Lower resting HR had linear association with longer 6-minute walk distances in the Glenn patients. Compared to Glenn physiology, Fontan operation did not have better submaximal exercise capacity assessed by walk distance or work on multiple regression analysis. Lower resting HR, higher resting SpO(2) , and younger age at operation were factors uniformly associated with better submaximal exercise capacity. © 2012 Wiley Periodicals, Inc.
NASA Astrophysics Data System (ADS)
Mekanik, F.; Imteaz, M. A.; Gato-Trinidad, S.; Elmahdi, A.
2013-10-01
In this study, the application of Artificial Neural Networks (ANN) and Multiple regression analysis (MR) to forecast long-term seasonal spring rainfall in Victoria, Australia was investigated using lagged El Nino Southern Oscillation (ENSO) and Indian Ocean Dipole (IOD) as potential predictors. The use of dual (combined lagged ENSO-IOD) input sets for calibrating and validating ANN and MR Models is proposed to investigate the simultaneous effect of past values of these two major climate modes on long-term spring rainfall prediction. The MR models that did not violate the limits of statistical significance and multicollinearity were selected for future spring rainfall forecast. The ANN was developed in the form of multilayer perceptron using Levenberg-Marquardt algorithm. Both MR and ANN modelling were assessed statistically using mean square error (MSE), mean absolute error (MAE), Pearson correlation (r) and Willmott index of agreement (d). The developed MR and ANN models were tested on out-of-sample test sets; the MR models showed very poor generalisation ability for east Victoria with correlation coefficients of -0.99 to -0.90 compared to ANN with correlation coefficients of 0.42-0.93; ANN models also showed better generalisation ability for central and west Victoria with correlation coefficients of 0.68-0.85 and 0.58-0.97 respectively. The ability of multiple regression models to forecast out-of-sample sets is compatible with ANN for Daylesford in central Victoria and Kaniva in west Victoria (r = 0.92 and 0.67 respectively). The errors of the testing sets for ANN models are generally lower compared to multiple regression models. The statistical analysis suggest the potential of ANN over MR models for rainfall forecasting using large scale climate modes.
Tanaka, N; Kunihiro, Y; Kubo, M; Kawano, R; Oishi, K; Ueda, K; Gondo, T
2018-05-29
To identify characteristic high-resolution computed tomography (CT) findings for individual collagen vascular disease (CVD)-related interstitial pneumonias (IPs). The HRCT findings of 187 patients with CVD, including 55 patients with rheumatoid arthritis (RA), 50 with systemic sclerosis (SSc), 46 with polymyositis/dermatomyositis (PM/DM), 15 with mixed connective tissue disease, 11 with primary Sjögren's syndrome, and 10 with systemic lupus erythematosus, were evaluated. Lung parenchymal abnormalities were compared among CVDs using χ 2 test, Kruskal-Wallis test, and multiple logistic regression analysis. A CT-pathology correlation was performed in 23 patients. In RA-IP, honeycombing was identified as the significant indicator based on multiple logistic regression analyses. Traction bronchiectasis (81.8%) was further identified as the most frequent finding based on χ 2 test. In SSc IP, lymph node enlargement and oesophageal dilatation were identified as the indicators based on multiple logistic regression analyses, and ground-glass opacity (GGO) was the most extensive based on Kruskal-Wallis test, which reflects the higher frequency of the pathological nonspecific interstitial pneumonia (NSIP) pattern present in the CT-pathology correlation. In PM/DM IP, airspace consolidation and the absence of honeycombing were identified as the indicators based on multiple logistic regression analyses, and predominance of consolidation over GGO (32.6%) and predominant subpleural distribution of GGO/consolidation (41.3%) were further identified as the most frequent findings based on χ 2 test, which reflects the higher frequency of the pathological NSIP and/or the organising pneumonia patterns present in the CT-pathology correlation. Several characteristic high-resolution CT findings with utility for estimating underlying CVD were identified. Copyright © 2018 The Royal College of Radiologists. Published by Elsevier Ltd. All rights reserved.
Dong, J Q; Zhang, X Y; Wang, S Z; Jiang, X F; Zhang, K; Ma, G W; Wu, M Q; Li, H; Zhang, H
2018-01-01
Plasma very low-density lipoprotein (VLDL) can be used to select for low body fat or abdominal fat (AF) in broilers, but its correlation with AF is limited. We investigated whether any other biochemical indicator can be used in combination with VLDL for a better selective effect. Nineteen plasma biochemical indicators were measured in male chickens from the Northeast Agricultural University broiler lines divergently selected for AF content (NEAUHLF) in the fed state at 46 and 48 d of age. The average concentration of every parameter for the 2 d was used for statistical analysis. Levels of these 19 plasma biochemical parameters were compared between the lean and fat lines. The phenotypic correlations between these plasma biochemical indicators and AF traits were analyzed. Then, multiple linear regression models were constructed to select the best model used for selecting against AF content. and the heritabilities of plasma indicators contained in the best models were estimated. The results showed that 11 plasma biochemical indicators (triglycerides, total bile acid, total protein, globulin, albumin/globulin, aspartate transaminase, alanine transaminase, gamma-glutamyl transpeptidase, uric acid, creatinine, and VLDL) differed significantly between the lean and fat lines (P < 0.01), and correlated significantly with AF traits (P < 0.05). The best multiple linear regression models based on albumin/globulin, VLDL, triglycerides, globulin, total bile acid, and uric acid, had higher R2 (0.73) than the model based only on VLDL (0.21). The plasma parameters included in the best models had moderate heritability estimates (0.21 ≤ h2 ≤ 0.43). These results indicate that these multiple linear regression models can be used to select for lean broiler chickens. © 2017 Poultry Science Association Inc.
NASA Astrophysics Data System (ADS)
Jones, William I.
This study examined the understanding of nature of science among participants in their final year of a 4-year undergraduate teacher education program at a Midwest liberal arts university. The Logic Model Process was used as an integrative framework to focus the collection, organization, analysis, and interpretation of the data for the purpose of (1) describing participant understanding of NOS and (2) to identify participant characteristics and teacher education program features related to those understandings. The Views of Nature of Science Questionnaire form C (VNOS-C) was used to survey participant understanding of 7 target aspects of Nature of Science (NOS). A rubric was developed from a review of the literature to categorize and score participant understanding of the target aspects of NOS. Participants' high school and college transcripts, planning guides for their respective teacher education program majors, and science content and science teaching methods course syllabi were examined to identify and categorize participant characteristics and teacher education program features. The R software (R Project for Statistical Computing, 2010) was used to conduct an exploratory analysis to determine correlations of the antecedent and transaction predictor variables with participants' scores on the 7 target aspects of NOS. Fourteen participant characteristics and teacher education program features were moderately and significantly ( p < .01) correlated with participant scores on the target aspects of NOS. The 6 antecedent predictor variables were entered into multiple regression analyses to determine the best-fit model of antecedent predictor variables for each target NOS aspect. The transaction predictor variables were entered into separate multiple regression analyses to determine the best-fit model of transaction predictor variables for each target NOS aspect. Variables from the best-fit antecedent and best-fit transaction models for each target aspect of NOS were then combined. A regression analysis for each of the combined models was conducted to determine the relative effect of these variables on the target aspects of NOS. Findings from the multiple regression analyses revealed that each of the fourteen predictor variables was present in the best-fit model for at least 1 of the 7 target aspects of NOS. However, not all of the predictor variables were statistically significant (p < .007) in the models and their effect (beta) varied. Participants in the teacher education program who had higher ACT Math scores, completed more high school science credits, and were enrolled either in the Middle Childhood with a science concentration program major or in the Adolescent/Young Adult Science Education program major were more likely to have an informed understanding on each of the 7 target aspects of NOS. Analyses of the planning guides and the course syllabi in each teacher education program major revealed differences between the program majors that may account for the results.
Distorted Perceptions of Competence and Incompetence Are More than Regression Effects
ERIC Educational Resources Information Center
Albanese, M.; Dottl, S.; Mejicano, G.; Zakowski, L.; Seibert, C.; Van Eyck, S.; Prucha, C.
2006-01-01
Students inaccurately assess their own skills, especially high- or low-performers on exams. This study assessed whether regression effects account for this observation. After completing the Infection and Immunity course final exam (IIF), second year medical students (N = 143) estimated their performance on the IIF in terms of percent correct and…
Grades, Gender, and Encouragement: A Regression Discontinuity Analysis
ERIC Educational Resources Information Center
Owen, Ann L.
2010-01-01
The author employs a regression discontinuity design to provide direct evidence on the effects of grades earned in economics principles classes on the decision to major in economics and finds a differential effect for male and female students. Specifically, for female students, receiving an A for a final grade in the first economics class is…
Automating approximate Bayesian computation by local linear regression.
Thornton, Kevin R
2009-07-07
In several biological contexts, parameter inference often relies on computationally-intensive techniques. "Approximate Bayesian Computation", or ABC, methods based on summary statistics have become increasingly popular. A particular flavor of ABC based on using a linear regression to approximate the posterior distribution of the parameters, conditional on the summary statistics, is computationally appealing, yet no standalone tool exists to automate the procedure. Here, I describe a program to implement the method. The software package ABCreg implements the local linear-regression approach to ABC. The advantages are: 1. The code is standalone, and fully-documented. 2. The program will automatically process multiple data sets, and create unique output files for each (which may be processed immediately in R), facilitating the testing of inference procedures on simulated data, or the analysis of multiple data sets. 3. The program implements two different transformation methods for the regression step. 4. Analysis options are controlled on the command line by the user, and the program is designed to output warnings for cases where the regression fails. 5. The program does not depend on any particular simulation machinery (coalescent, forward-time, etc.), and therefore is a general tool for processing the results from any simulation. 6. The code is open-source, and modular.Examples of applying the software to empirical data from Drosophila melanogaster, and testing the procedure on simulated data, are shown. In practice, the ABCreg simplifies implementing ABC based on local-linear regression.
Mainou, Maria; Madenidou, Anastasia-Vasiliki; Liakos, Aris; Paschos, Paschalis; Karagiannis, Thomas; Bekiari, Eleni; Vlachaki, Efthymia; Wang, Zhen; Murad, Mohammad Hassan; Kumar, Shaji; Tsapas, Apostolos
2017-06-01
We performed a systematic review and meta-regression analysis of randomized control trials to investigate the association between response to initial treatment and survival outcomes in patients with newly diagnosed multiple myeloma (MM). Response outcomes included complete response (CR) and the combined outcome of CR or very good partial response (VGPR), while survival outcomes were overall survival (OS) and progression-free survival (PFS). We used random-effect meta-regression models and conducted sensitivity analyses based on definition of CR and study quality. Seventy-two trials were included in the systematic review, 63 of which contributed data in meta-regression analyses. There was no association between OS and CR in patients without autologous stem cell transplant (ASCT) (regression coefficient: .02, 95% confidence interval [CI] -0.06, 0.10), in patients undergoing ASCT (-.11, 95% CI -0.44, 0.22) and in trials comparing ASCT with non-ASCT patients (.04, 95% CI -0.29, 0.38). Similarly, OS did not correlate with the combined metric of CR or VGPR, and no association was evident between response outcomes and PFS. Sensitivity analyses yielded similar results. This meta-regression analysis suggests that there is no association between conventional response outcomes and survival in patients with newly diagnosed MM. © 2017 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.
Brown, Angus M
2006-04-01
The objective of this present study was to demonstrate a method for fitting complex electrophysiological data with multiple functions using the SOLVER add-in of the ubiquitous spreadsheet Microsoft Excel. SOLVER minimizes the difference between the sum of the squares of the data to be fit and the function(s) describing the data using an iterative generalized reduced gradient method. While it is a straightforward procedure to fit data with linear functions, and we have previously demonstrated a method of non-linear regression analysis of experimental data based upon a single function, it is more complex to fit data with multiple functions, usually requiring specialized expensive computer software. In this paper we describe an easily understood program for fitting experimentally acquired data, in this case the stimulus-evoked compound action potential from the mouse optic nerve, with multiple Gaussian functions. The program is flexible and can be applied to describe data with a wide variety of user-input functions.
Smerbeck, A M; Parrish, J; Yeh, E A; Hoogs, M; Krupp, Lauren B; Weinstock-Guttman, B; Benedict, R H B
2011-04-01
The Brief Visuospatial Memory Test - Revised (BVMTR) and the Symbol Digit Modalities Test (SDMT) oral-only administration are known to be sensitive to cerebral disease in adult samples, but pediatric norms are not available. A demographically balanced sample of healthy control children (N = 92) ages 6-17 was tested with the BVMTR and SDMT. Multiple regression analysis (MRA) was used to develop demographically controlled normative equations. This analysis provided equations that were then used to construct demographically adjusted z-scores for the BVMTR Trial 1, Trial 2, Trial 3, Total Learning, and Delayed Recall indices, as well as the SDMT total correct score. To demonstrate the utility of this approach, a comparison group of children with acute disseminated encephalomyelitis (ADEM) or multiple sclerosis (MS) were also assessed. We find that these visual processing tests discriminate neurological patients from controls. As the tests are validated in adult multiple sclerosis, they are likely to be useful in monitoring pediatric onset multiple sclerosis patients as they transition into adulthood.
Confidence Intervals for Squared Semipartial Correlation Coefficients: The Effect of Nonnormality
ERIC Educational Resources Information Center
Algina, James; Keselman, H. J.; Penfield, Randall D.
2010-01-01
The increase in the squared multiple correlation coefficient ([delta]R[superscript 2]) associated with a variable in a regression equation is a commonly used measure of importance in regression analysis. Algina, Keselman, and Penfield found that intervals based on asymptotic principles were typically very inaccurate, even though the sample size…
Generalized and synthetic regression estimators for randomized branch sampling
David L. R. Affleck; Timothy G. Gregoire
2015-01-01
In felled-tree studies, ratio and regression estimators are commonly used to convert more readily measured branch characteristics to dry crown mass estimates. In some cases, data from multiple trees are pooled to form these estimates. This research evaluates the utility of both tactics in the estimation of crown biomass following randomized branch sampling (...
ERIC Educational Resources Information Center
Fan, Xitao
This paper empirically and systematically assessed the performance of bootstrap resampling procedure as it was applied to a regression model. Parameter estimates from Monte Carlo experiments (repeated sampling from population) and bootstrap experiments (repeated resampling from one original bootstrap sample) were generated and compared. Sample…
Progressive and Regressive Aspects of Information Technology in Society: A Third Sector Perspective
ERIC Educational Resources Information Center
Miller, Kandace R.
2009-01-01
This dissertation explores the impact of information technology on progressive and regressive values in society from the perspective of one international foundation and four of its technology-related programs. Through a critical interpretive approach employing an instrumental multiple-case method, a framework to help explain the influence of…
Correlation Weights in Multiple Regression
ERIC Educational Resources Information Center
Waller, Niels G.; Jones, Jeff A.
2010-01-01
A general theory on the use of correlation weights in linear prediction has yet to be proposed. In this paper we take initial steps in developing such a theory by describing the conditions under which correlation weights perform well in population regression models. Using OLS weights as a comparison, we define cases in which the two weighting…
No Evidence of Reaction Time Slowing in Autism Spectrum Disorder
ERIC Educational Resources Information Center
Ferraro, F. Richard
2016-01-01
A total of 32 studies comprising 238 simple reaction time and choice reaction time conditions were examined in individuals with autism spectrum disorder (n?=?964) and controls (n?=?1032). A Brinley plot/multiple regression analysis was performed on mean reaction times, regressing autism spectrum disorder performance onto the control performance as…
Criteria for the use of regression analysis for remote sensing of sediment and pollutants
NASA Technical Reports Server (NTRS)
Whitlock, C. H.; Kuo, C. Y.; Lecroy, S. R. (Principal Investigator)
1982-01-01
Data analysis procedures for quantification of water quality parameters that are already identified and are known to exist within the water body are considered. The liner multiple-regression technique was examined as a procedure for defining and calibrating data analysis algorithms for such instruments as spectrometers and multispectral scanners.
An Empirical Study of Eight Nonparametric Tests in Hierarchical Regression.
ERIC Educational Resources Information Center
Harwell, Michael; Serlin, Ronald C.
When normality does not hold, nonparametric tests represent an important data-analytic alternative to parametric tests. However, the use of nonparametric tests in educational research has been limited by the absence of easily performed tests for complex experimental designs and analyses, such as factorial designs and multiple regression analyses,…
Multiple Logistic Regression Analysis of Cigarette Use among High School Students
ERIC Educational Resources Information Center
Adwere-Boamah, Joseph
2011-01-01
A binary logistic regression analysis was performed to predict high school students' cigarette smoking behavior from selected predictors from 2009 CDC Youth Risk Behavior Surveillance Survey. The specific target student behavior of interest was frequent cigarette use. Five predictor variables included in the model were: a) race, b) frequency of…
The Development and Demonstration of Multiple Regression Models for Operant Conditioning Questions.
ERIC Educational Resources Information Center
Fanning, Fred; Newman, Isadore
Based on the assumption that inferential statistics can make the operant conditioner more sensitive to possible significant relationships, regressions models were developed to test the statistical significance between slopes and Y intercepts of the experimental and control group subjects. These results were then compared to the traditional operant…
How Many Subjects Does It Take to Do a Regression Analysis?
ERIC Educational Resources Information Center
Green, Samuel B.
1991-01-01
An evaluation of the rules-of-thumb used to determine the minimum number of subjects required to conduct multiple regression analyses suggests that researchers who use a rule of thumb rather than power analyses trade simplicity of use for accuracy and specificity of response. Insufficient power is likely to result. (SLD)
Hierarchical Multiple Regression in Counseling Research: Common Problems and Possible Remedies.
ERIC Educational Resources Information Center
Petrocelli, John V.
2003-01-01
A brief content analysis was conducted on the use of hierarchical regression in counseling research published in the "Journal of Counseling Psychology" and the "Journal of Counseling & Development" during the years 1997-2001. Common problems are cited and possible remedies are described. (Contains 43 references and 3 tables.) (Author)
Assistive Technologies for Second-Year Statistics Students Who Are Blind
ERIC Educational Resources Information Center
Erhardt, Robert J.; Shuman, Michael P.
2015-01-01
At Wake Forest University, a student who is blind enrolled in a second course in statistics. The course covered simple and multiple regression, model diagnostics, model selection, data visualization, and elementary logistic regression. These topics required that the student both interpret and produce three sets of materials: mathematical writing,…